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NOVEL NUCLEIC ACIDS AND POLYPEPTIDES 

1. TECHNICAL FIELD 

The present invention provides novel polynucleotides and proteins encoded by such 
polynucleotides, along with uses for these polynucleotides and proteins, for example in 
therapeutic, diagnostic and research methods. 

2. BACKGROUND 

Technology aimed at the discovery of protein factors (including e.g., cytokines, such as 
lymphokines, interferons, CSFs, chemokines, and interleukins) has matured rapidly over the past 
decade. The now routine hybridization cloning and expression cloning techniques clone novel 
polynucleotides "directly" in the sense that they rely on information directly related to the 
discovered protein (i.e., partial DNA/amino acid sequence of the protein in the case of 
hybridization cloning; activity of the protein in the case of expression cloning). More recent 
"indirect" cloning techniques such as signal sequence cloning, which isolates DNA sequences 
based on the presence of a now well-recognized secretory leader sequence motif, as well as 
various PCR-based or low stringency hybridization-based cloning techniques, have advanced the 
state of the art by making available large numbers of DNA/amino acid sequences for proteins 
that are known to have biological activity, for example, by virtue of their secreted nature in the 
case of leader sequence cloning, by virtue of their cell or tissue source in the case of PCR-based 
techniques, or by virtue of structural similarity to other genes of known biological activity. 

Identified polynucleotide and polypeptide sequences have numerous applications in, for 
example, diagnostics, forensics, gene mapping; identification of mutations responsible for 
genetic disorders or other traits, to assess biodiversity, and to produce many other types of data 
and products dependent on DNA and amino acid sequences. 

3. SUMMARY OF THE INVENTION 

The compositions of the present invention include novel isolated polypeptides, novel 
isolated polynucleotides encoding such polypeptides, including recombinant DNA molecules, 
cloned genes or degenerate variants thereof, especially naturally occurring variants such as allelic 
variants, antisense polynucleotide molecules, and antibodies that specifically recognize one or more 
epitopes present on such polypeptides, as well as hybridomas producing such antibodies. 

The compositions of the present invention additionally include vectors, including expression 
vectors, containing the polynucleotides of the invention, cells genetically engineered to contain such 
polynucleotides and cells genetically engineered to express such polynucleotides. 
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The present invention relates to a collection or library of at least one novel nucleic acid 
sequence assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by 
hybridization (SBH), and in some cases, sequences obtained from one or more public databases. 
4 The invention relates also to the proteins encoded by such polynucleotides, along with therapeutic, 
diagnostic and research utilities for these polynucleotides and proteins. These nucleic acid 
sequences are designated as SEQ ID NOS: 1-13901 . The polypeptides sequences are designated 
SEQ ID NOS: 13902-27802. The nucleic acids and polypeptides are provided in the Sequence 
8 Listing. In the nucleic acids provided in the Sequence Listing, A is adenosine; C is cytosine; G is 
guanine; T is thymine; and N is any of the four bases. In the amino acids provided in the Sequence 
Listing, * corresponds to the stop codon. 

The nucleic acid sequences of the present invention also include, nucleic acid sequences that 
12 hybridize to the complement of SEQ ID NO: 1-13901 under stringent hybridization conditions; 
nucleic acid sequences which are allelic variants or species homologues of any of the nucleic acid 
sequences recited above, or nucleic acid sequences that encode a peptide comprising a specific 
domain or truncation of the peptides encoded by SEQ ID NOS: 1 -1 3901 . A polynucleotide 
1 6 comprising a nucleotide sequence having at least 90% identity to an identifying sequence of SEQ 
ID NO: 1 - 1 390 1 or a degenerate variant or fragment thereof. The identifying sequence can be 1 00 
base pairs in length. 

The nucleic acid sequences of the present invention also include the sequence information 

20 from the nucleic acid sequences of SEQ ID NO: 1 - 1 3901 . The sequence information can be a 
segment of any one of SEQ ID NO: 1 - 1 3 90 1 that uniquely identifies or represents the sequence 
information of SEQ ID NOS: 1-13901. 

A collection as used in this application can be a collection of only one polynucleotide. The 

24 collection of sequence information or identifying information of each sequence can be provided on 
a nucleic acid array. In one embodiment, segments of sequence information is provided on a 
nucleic acid array to detect the polynucleotide that contains the segment. The array can be designed 
to detect full -match or mismatch to the polynucleotide that contains the segment. The collection 

28 can also be provided in a computer-readable format. 

This invention also includes the reverse or direct complement of any of the nucleic acid 
sequences recited above; cloning or expression vectors containing the nucleic acid sequences; and 
host cells or organisms transformed with these expression vectors. Nucleic acid sequences (or their 

32 reverse or direct complements) according to the invention have numerous applications in a variety 
of techniques known to those skilled in the art of molecular biology, such as use as hybridization 
probes, use as primers for PCR, use in an array, use in computer-readable media, use in sequencing 
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full-length genes, use for chromosome and gene mapping, use in the recombinant production of 
protein, and use in the generation of anti-sense DNA or RNA, their chemical analogs and the like. 

In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1 - 1 390 1 or novel 
segments or parts of the nucleic acids of the invention are used as primers in expression assays that 
are well known in the art. In a particularly preferred embodiment, the nucleic acid sequences of 
SEQ ID NO: 1-13901 or novel segments or parts of the nucleic acids provided herein are used in 
diagnostics for identifying expressed genes or, as well known in the art and exemplified by Vollrath 
et ai., Science 258:52-59 (1992), as expressed sequence tags for physical mapping of the human 
genome. 

The isolated polynucleotides of the invention include, but are not limited to, a 
polynucleotide comprising any one of the nucleotide sequences set forth in SEQ ID NO: 1 -13901; a 
polynucleotide comprising any of the full length protein coding sequences of SEQ ID NOS: 1 - 
1 3901 ; and a polynucleotide comprising any of the nucleotide sequences of the mature protein 
coding sequences of SEQ ID NOS: 1-13901. The polynucleotides of the present invention also 
include, but are not limited to, a polynucleotide that hybridizes under stringent hybridization 
conditions to (a) the complement of any one of the nucleotide sequences set forth in SEQ ID NO : 1 - 
1 3901 ; (b) a nucleotide sequence encoding any one of the amino acid sequences set forth in the 
Sequence Listing (e.g., SEQ ID NOS: 1 3902-27802); (c) a polynucleotide which is an allelic variant 
of any polynucleotides recited above; (d) a polynucleotide which encodes a species homolog (e.g. 
orthologs) of any of the proteins recited above; or (e) a polynucleotide that encodes a polypeptide 
comprising a specific domain or truncation of any of the polypeptides comprising an amino acid 
sequence set forth in the Sequence Listing. 

The isolated polypeptides of the invention include, but are not limited to, a polypeptide 
comprising any of the amino acid sequences set forth in the Sequence Listing; or the corresponding 
full length or mature protein. Polypeptides of the invention also include polypeptides with 
biological activity that are encoded by (a) any of the polynucleotides having a nucleotide sequence 
set forth in SEQ ID NO: 1-13901 ; or (b) polynucleotides that hybridize to the complement of the 
polynucleotides of (a) under stringent hybridization conditions. Biologically or immunologically 
active variants of any of the polypeptide sequences in the Sequence Listing, and "substantial 
equivalents" thereof (e.g. , with at least about 65%, 70%, 75%, 8Q%, 85%, 90%, 95%, 98% or 99% 
amino acid sequence identity) that preferably retain biological activity are also contemplated. The 
polypeptides of the invention may be wholly or partially chemically synthesized but are preferably 
produced by recombinant means using the genetically engineered cells (e.g. host cells) of the 
invention. 
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The invention also provides compositions comprising a polypeptide of the invention. 
Polypeptide compositions of the invention may further comprise an acceptable carrier, such as a 
hydrophilic, e.g., pharmaceutically acceptable, carrier. 
4 The invention also provides host cells transformed or transfected with a polynucleotide of 

the invention. 

The invention also relates to methods for producing a polypeptide of the invention 
comprising growing a culture of the host cells of the invention in a suitable culture medium 
8 under conditions permitting expression of the desired polypeptide, and purifying the polypeptide 
from the culture or from the host cells. Preferred embodiments include those in which the 
protein produced by such process is a mature form of the protein. 

Polynucleotides according to the invention have numerous applications in a variety of 
12 techniques known to those skilled in the art of molecular biology. These techniques include use 
as hybridization probes, use as oligomers, or primers, for PCR, use for chromosome and gene 
mapping, use in the recombinant production of protein, and use in generation of anti-sense DNA 
or RNA, their chemical analogs and the like. For example, when the expression of an mRNA is 
16 largely restricted to a particular cell or tissue type, polynucleotides of the invention can be used 
as hybridization probes to detect the presence of the particular cell or tissue mRNA in a sample 
using, e.g., in situ hybridization. 

In other exemplary embodiments, the polynucleotides are used in diagnostics as 
20 expressed sequence tags for identifying expressed genes or, as well known in the art and 
exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed sequence tags for 
physical mapping of the human genome. 

The polypeptides according to the invention can be used in a variety of conventional 
24 procedures and methods that are currently applied to other proteins. For example, a polypeptide 
of the invention can be used to generate an antibody that specifically binds the polypeptide. Such 
antibodies, particularly monoclonal antibodies, are useful for detecting or quantitating the 
polypeptide in tissue. The polypeptides of the invention can also be used as molecular weight 
28 markers, and as a food supplement. 

Methods are also provided for preventing, treating, or ameliorating a medical condition 
which comprises the step of administering to a mammalian subject a therapeutically effective 
amount of a composition comprising a polypeptide of the present invention and a 
32 pharmaceutically acceptable carrier. 

In particular, the polypeptides and polynucleotides of the invention can be utilized, for 
example, in methods for the prevention and/or treatment of disorders involving aberrant protein 
expression or biological activity. 
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The present invention further relates to methods for detecting the presence of the 
polynucleotides or polypeptides of the invention in a sample. Such methods can, for example, be 
utilized as part of prognostic and diagnostic evaluation of disorders as recited herein and for the 

4 identification of subjects exhibiting a predisposition to such conditions. The invention provides 
a method for detecting the polynucleotides of the invention in a sample, comprising contacting 
the sample with a compound that binds to and forms a complex with the polynucleotide of 
interest for a period sufficient to form the complex and under conditions sufficient to form a 

8 complex and detecting the complex such that if a complex is detected, the polynucleotide of 
interest is detected. The invention also provides a method for detecting the polypeptides of the 
invention in a sample comprising contacting the sample with a compound that binds to and forms 
a complex with the polypeptide under conditions and for a period sufficient to form the complex 
12 and detecting the formation of the complex such that if a complex is formed, the polypeptide is 
detected. 

The invention also provides kits comprising polynucleotide probes and/or monoclonal 
antibodies, and optionally quantitative standards, for carrying out methods of the invention. 
16 Furthermore, the invention provides methods for evaluating the efficacy of drugs, and 

monitoring the progress of patients, involved in clinical trials for the treatment of disorders as 
recited above. 

The invention also provides methods for the identification of compounds that modulate 

20 (/'. e., increase or decrease) the expression or activity of the polynucleotides and/or polypeptides 
of the invention. Such methods can be utilized, for example, for the identification of compounds 
that can ameliorate symptoms of disorders as recited herein. Such methods can include, but are 
not limited to, assays for identifying compounds and other substances that interact with (e.g., 

24 bind to) the polypeptides of the invention. The invention provides a method for identifying a 
compound that binds to the polypeptides of the invention comprising contacting the compound 
with a polypeptide of the invention in a cell for a time sufficient to form a 
polypeptide/compound complex, wherein the complex drives expression of a reporter gene 

28 sequence in the cell; and detecting the complex by detecting the reporter gene sequence 

expression such that if expression of the reporter gene is detected the compound the binds to a 
polypeptide of the invention is identified. 

The methods of the invention also provides methods for treatment which involve the 

32 administration of the polynucleotides or polypeptides of the invention to individuals exhibiting 

symptoms or tendencies. In addition, the invention encompasses methods for treating diseases or 
disorders as recited herein comprising administering compounds and other substances that 
modulate the overall activity of the target gene products. Compounds and other substances can 
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effect such modulation either on the level of target gene/protein expression or target protein 
activity. 

The polypeptides of the present invention and the polynucleotides encoding them are also 
4 useful for the same functions known to one of skill in the art as the polypeptides and 

polynucleotides to which they have homology (set forth in Table 2). If no homology is set forth 
for a sequence, then the polypeptides and polynucleotides of the present invention are useful for 
a variety of applications, as described herein, including use in arrays for detection. 



4. DETAILED DESCRIPTION OF THE INVENTION 
4.1 DEFINITIONS 

It must be noted that as used herein and in the appended claims, the singular forms "a", 
"an" and "the" include plural references unless the context clearly dictates otherwise. 

The term "active" refers to those forms of the polypeptide which retain the biologic 
and/or immunologic activities of any naturally occurring polypeptide. According to the 
invention, the terms "biologically active" or "biological activity" refer to a protein or peptide 
having structural, regulatory or biochemical functions of a naturally occurring molecule. 
Likewise "immunologically active" or "immunological activity" refers to the capability of the 
natural, recombinant or synthetic polypeptide to induce a specific immune response in 
appropriate animals or cells and to bind with specific antibodies. 

The term "activated cells" as used in this application are those cells which are engaged in 
extracellular or intracellular membrane trafficking, including the export of secretory or 
enzymatic molecules as part of a normal or disease process. 

The terms "complementary" or "complementarity" refer to the natural binding of 
polynucleotides by base pairing. For example, the sequence 5'-AGT-3' binds to the 
complementary sequence 3'-TCA-5'. Complementarity between two single-stranded molecules 
may be "partial" such that only some of the nucleic acids bind or it may be "complete" such that 
total complementarity exists between the single stranded molecules. The degree of 
complementarity between the nucleic acid strands has significant effects on the efficiency and 
strength of the hybridization between the nucleic acid strands. 

The term "embryonic stem cells (ES)" refers to a cell that can give rise to many 
differentiated cell types in an embryo or an adult, including the germ cells. The term "germ line 
stem cells (GSCs)" refers to stem cells derived from primordial stem cells that provide a steady 
and continuous source of germ cells for the production of gametes. The term "primordial germ 
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cells (PGCs)" refers to a small population of cells set aside from other cell lineages particularly 
from the yolk sac, mesenteries, or gonadal ridges during embryogenesis that have the potential to 
differentiate into gemi cells and other cells. PGCs are the source from which GSCs and ES cells 
are derived The PGCs, the GSCs and the ES cells are capable of self-renewal. Thus these cells 
not only populate the germ line and give rise to a plurality of terminally differentiated cells that 
comprise the adult specialized organs, but are able to regenerate themselves. 

The term "expression modulating fragment," EMF, means a series of nucleotides which 
modulates the expression of an operably linked ORE or another EMF. 

As used herein, a sequence is said to "modulate the expression of an operably linked 
sequence" when the expression of the sequence is altered by the presence of the EMF. EMFs 
include, but are not limited to, promoters, and promoter modulating sequences (inducible 
elements). One class of EMFs are nucleic acid fragments which induce the expression of an 
operably linked ORF in response to a specific regulatory factor or physiological event. 

The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or 
"oligonculeotide" are used interchangeably and refer to a heteropolymer of nucleotides or the 
sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic or synthetic 
origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA) or to any DNA-like or RNA-like material. In the 
sequences herein A is adenine, C is cytosine, T is thymine, G is guanine and N is A, C, G or T 
(U). It is contemplated that where the polynucleotide is RNA, the T (thymine) in the sequences 
provided herein is substituted with U (uracil). Generally, nucleic acid segments provided by this 
invention may be assembled from fragments of the genome and short oligonucleotide linkers, or 
from a series of oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic 
acid which is capable of being expressed in a recombinant transcriptional unit comprising 
regulatory elements derived from a microbial or viral operon, or a eukaryotic gene. 

The terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," or 
"segment" or "probe" or "primer" are used interchangeably and refer to a sequence of nucleotide 
residues which are at least about 5 nucleotides, more preferably at least about 7 nucleotides, 
more preferably at least about 9 nucleotides, more preferably at least about 1 1 nucleotides and 
most preferably at least about 17 nucleotides. The fragment is preferably less than about 500 
nucleotides, preferably less than about 200 nucleotides, more preferably less than about 100 
nucleotides, more preferably less than about 50 nucleotides and most preferably less than 30 
nucleotides. Preferably the probe is from about 6 nucleotides to about 200 nucleotides, 
preferably from about 1 5 to about 50 nucleotides, more preferably from about 1 7 to 30 
nucleotides and most preferably from about 20 to 25 nucleotides. Preferably the fragments can 
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be used in polymerase chain reaction (PCR), various hybridization procedures or microarray 
procedures to identify or amplify identical or related parts of mRNA or DNA molecules. A 
fragment or segment may uniquely identify each polynucleotide sequence of the present 
4 invention. Preferably the fragment comprises a sequence substantially similar to any one of SEQ 
ID NOs: 1-13901 . 

Probes may, for example, be used to determine whether specific mRNA molecules are 
present in a cell or tissue or to isolate similar nucleic acid sequences from chromosomal DNA as 
8 described by Walsh et al. (Walsh, P.S. et al., 1992, PCR Methods AppI 1 :241-250). They may 
be labeled by nick translation, Klenow fill-in reaction, PCR, or other methods well known in the 
art. Probes of the present invention, their preparation and/or labeling are elaborated in 
Sambrook, J. et al., 1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
12 Laboratory, NY; or Ausubel, F.M. et al., 1989, Current Protocols in Molecular Biology, John 
Wiley & Sons, New York NY, both of which are incorporated herein by reference in their 
entirety. 

The nucleic acid sequences of the present invention also include the sequence 

1 6 information from the nucleic acid sequences of SEQ ID NOS : 1 - 1 3 90 1 . The sequence 

information can be a segment of any one of SEQ ID NO: 1-13901 that uniquely identifies or 
represents the sequence information of that sequence of SEQ ID NOS : 1 - 1 390 1 . One such 
segment can be a twenty-mer nucleic acid sequence because the probability that a twenty -mer is 

20 fully matched in the human genome is 1 in 300. In the human genome, there are three billion 
base pairs in one set of chromosomes. Because 4 20 possible twenty-mers exist, there are 300 
times more twenty-mers than there are base pairs in a set of human chromosomes. Using the 
same analysis, the probability for a seventeen-mer to be fully matched in the human genome is 

24 approximately 1 in 5. When these segments are used in arrays for expression studies, fifteen- 

mer segments can be used. The probability that the fifteen-mer is fully matched in the expressed 
sequences is also approximately one in five because expressed sequences comprise less than 
approximately 5% of the entire genome sequence. 

28 Similarly, when using sequence information for detecting a single mismatch, a segment can 

be a twenty-five mer. The probability that the twenty-five mer would appear in a human genome 
with a single mismatch is calculated by multiplying the probability for a full match (U4 25 ) times the 
increased probability for mismatch at each nucleotide position (3 x 25), The probability that an 

32 eighteen mer with a single mismatch can be detected in an array for expression studies is 

approximately one in five. The probability that a twenty-mer with a single mismatch can be 
detected in a human genome is approximately one in five. 
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The term "open reading frame," ORF, means a series of nucleotide triplets coding for 
amino acids, without any termination codons and is a sequence translatable into protein. 

The terms "operably linked" or "operably associated" refer to functionally related nucleic 
4 acid sequences. For example, a promoter is operably associated or operably linked with a coding 
sequence if the promoter controls the transcription of the coding sequence. While operably 
linked nucleic acid sequences can be contiguous and in the same reading frame, certain genetic 
elements e.g. repressor genes are not contiguously linked to the coding sequence but still control 
8 transcription/translation of the coding sequence. 

The term "pluripotent" refers to the capability of a cell to differentiate into a number of 
differentiated cell types that are present in an adult organism. A pluripotent cell is restricted in its 
differentiation capability in comparison to a totipotent cell. 
12 The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an oligopeptide, 

peptide, polypeptide or protein sequence or fragment thereof and to naturally occurring or 
synthetic molecules. A polypeptide "fragment," "portion," or "segment" is a stretch of amino 
acid residues of at least about 5 amino acids, preferably at least about 7 amino acids, more 
1 6 preferably at least about 9 amino acids and most preferably at least about 1 7 or more amino 
acids. The peptide preferably is not greater than about 200 amino acids, more preferably less 
than 150 amino acids and most preferably less than 100 amino acids. Preferably the peptide is 
from about 5 to about 200 amino acids. To be active, any polypeptide must have sufficient 
20 length to display biological and/or immunological activity. 

The term "naturally occurring polypeptide" refers to polypeptides produced by cells that 
have not been genetically engineered and specifically contemplates various polypeptides arising 
from post-translational modifications of the polypeptide including, but not limited to, 
24 acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. 

The term "translated protein coding portion" means a sequence which encodes for the full 
length protein which may include any leader sequence or any processing sequence. 

The term "mature protein coding sequence" means a sequence which encodes a peptide 
28 or protein without a signal or leader sequence. The "mature protein portion" means that portion 
of the protein which does not include a signal or leader sequence. The peptide may have been 
produced by processing in the cell which removes any leader/signal sequence. The mature 
protein portion may or may not include the initial methionine residue. The methionine residue 
32 may be removed from the protein during processing in the cell. The peptide may be produced 
synthetically or the protein may have been produced using a polynucleotide only encoding for 
the mature protein coding sequence. 
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The term "derivative" refers to polypeptides chemically modified by such techniques as 
ubiquitination, labeling (e.g., with radionuclides or various enzymes), covalent polymer 
attachment such as pegylation (derivatization with polyethylene glycol) and insertion or 
4 substitution by chemical synthesis of amino acids such as ornithine, which do not normally occur 
in human proteins. 

• The term "variant" (or "analog") refers to any polypeptide differing from naturally 
occurring polypeptides by amino acid insertions, deletions, and substitutions, created using, e g., 
8 recombinant DNA techniques. Guidance in determining which amino acid residues may be 
replaced, added or deleted without abolishing activities of interest, may be found by comparing 
the sequence of the particular polypeptide with that of homologous peptides and minimizing the 
number of amino acid sequence changes made in regions of high homology (conserved regions) 
12 or by replacing amino acids with consensus sequence. 

Alternatively, recombinant variants encoding these same or similar polypeptides may be 
synthesized or selected by making use of the "redundancy" in the genetic code. Various codon 
substitutions, such as the silent changes which produce various restriction sites, may be 

1 6 introduced to optimize cloning into a plasmid or viral vector or expression in a particular 

prokaryotic or eukaryotic system. Mutations in the polynucleotide sequence may be reflected in 
the polypeptide or domains of other peptides added to the polypeptide to modify the properties of 
any part of the polypeptide, to change characteristics such as ligand-binding affinities, interchain 

20 affinities, or degradation/turnover rate. 

Preferably, amino acid "substitutions" are the result of replacing one amino acid with 
another amino acid having similar structural and/or chemical properties, i.e., conservative amino 
acid replacements. "Conservative" amino acid substitutions may be made on the basis of 

24 similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues involved. For example, nonpolar (hydrophobic) amino acids include 
alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; polar 
neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and 

28 glutamine; positively charged (basic) amino acids include arginine, lysine, and histidine; and 

negatively charged (acidic) amino acids include aspartic acid and glutamic acid. "Insertions" or 
"deletions" are preferably in the range of about 1 to 20 amino acids, more preferably 1 to 10 
amino acids. The variation allowed may be experimentally determined by systematically making 

32 insertions, deletions, or substitutions of amino acids in a polypeptide molecule using 

recombinant DNA techniques and assaying the resulting recombinant variants for activity. 

Alternatively, where alteration of function is desired, insertions, deletions or 
non-conservative alterations can be engineered to produce altered polypeptides. Such alterations 
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can, for example, alter one or more of the biological functions or biochemical characteristics of 
the polypeptides of the invention. For example, such alterations may change polypeptide 
characteristics such as ligand-binding affinities, interchain affinities, or degradation/turnover 
rate. Further, such alterations can be selected so as to generate polypeptides that are better suited 
for expression, scale up and the like in the host cells chosen for expression. For example, 
cysteine residues can be deleted or substituted with another amino acid residue in order to 
eliminate disulfide bridges. 

The terms "purified" or "substantially purified" as used herein denotes that the indicated 
nucleic acid or polypeptide is present in the substantial absence of other biological 
macromolecules, e.g., polynucleotides, proteins, and the like. In one embodiment, the 
polynucleotide or polypeptide is purified such that it constitutes at least 95% by weight, more 
preferably at least 99% by weight, of the indicated biological macromolecules present (but water, 
buffers, and other small molecules, especially molecules having a molecular weight of less than 
1000 daltons, can be present). 

The term "isolated" as used herein refers to a nucleic acid or polypeptide separated from 
at least one other component {e.g., nucleic acid or polypeptide) present with the nucleic acid or 
polypeptide in its natural source. In one embodiment, the nucleic acid or polypeptide is found in 
the presence of (if anything) only a solvent, buffer, ion, or other component normally present in a 
solution of the same. The terms "isolated" and "purified" do not encompass nucleic acids or 
polypeptides present in their natural source. 

The term "recombinant," when used herein to refer to a polypeptide or protein, means 
that a polypeptide or protein is derived from recombinant {e.g., microbial, insect, or mammalian) 
expression systems. "Microbial" refers to recombinant polypeptides or proteins made in 
bacterial or fungal {e.g., yeast) expression systems. As a product, "recombinant microbial" 
defines a polypeptide or protein essentially free of native endogenous substances and 
unaccompanied by associated native glycosylation. Polypeptides or proteins expressed in most 
bacterial cultures, e.g., E. coli, will be free of glycosylation modifications; polypeptides or 
proteins expressed in yeast will have a glycosylation pattern in general different from those 
expressed in mammalian cells. 

The term "recombinant expression vehicle or vector" refers to a plasmid or phage or virus 
or vector, for expressing a polypeptide from a DNA (RNA) sequence. An expression vehicle can 
comprise a transcriptional unit comprising an assembly of (1) a genetic element or elements 
having a regulatory role in gene expression, for example, promoters or enhancers, (2) a structural 
or coding sequence which is transcribed into mRNA and translated into protein, and (3) 
appropriate transcription initiation and termination sequences. Structural units intended for use 
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in yeast or eukaryotic expression systems preferably include a leader sequence enabling 
extracellular secretion of translated protein by a host cell. Alternatively, where recombinant 
protein is expressed without a leader or transport sequence, it may include an amino terminal 
4 methionine residue. This residue may or may not be subsequently cleaved from the expressed 
recombinant protein to provide a final product. 

The term "recombinant expression system" means host cells which have stably integrated 
a recombinant transcriptional unit into chromosomal DNA or carry the recombinant 
8 transcriptional unit extrachromosomally. Recombinant expression systems as defined herein will 
express heterologous polypeptides or proteins upon induction of the regulatory elements linked 
to the DNA segment or synthetic gene to be expressed. This term also means host cells which 
have stably integrated a recombinant genetic element or elements having a regulatory role in 

12 gene expression, for example, promoters or enhancers. Recombinant expression systems as 

defined herein will express polypeptides or proteins endogenous to the cell upon induction of the 
regulatory elements linked to the endogenous DNA segment or gene to be expressed. The cells 
can be prokaryotic or eukaryotic. 

1 6 The term "secreted" includes a protein that is transported across or through a membrane, 

including transport as a result of signal sequences in its amino acid sequence when it is 
expressed in a suitable host cell. "Secreted" proteins include without limitation proteins secreted 
wholly (e.g., soluble proteins) or partially (e.g., receptors) from the cell in which they are 

20 expressed. "Secreted" proteins also include without limitation proteins that are transported 
across the membrane of the endoplasmic reticulum. "Secreted" proteins are also intended to 
include proteins containing non-typical signal sequences (e.g. Interleukin-1 Beta, see Krasney, 
P.A. and Young, P.R. (1992) Cytokine 4(2): 134 -143) and factors released from damaged cells 

24 (e.g. Interleukin-1 Receptor Antagonist, see Arend, W.P. et. al. (1998) Annu. Rev. Immunol. 
16:27-55) 

Where desired, an expression vector may be designed to contain a "signal or leader 
sequence" which will direct the polypeptide through the membrane of a cell. Such a sequence 

28 may be naturally present on the polypeptides of the present invention or provided from 
heterologous protein sources by recombinant DNA techniques. 

The term "stringent" is used to refer to conditions that are commonly understood in the 
art as stringent. Stringent conditions can include highly stringent conditions (i.e., hybridization 

32 to filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 

65°C, and washing in 0.1 X SSC/0.1% SDS at 68°C), and moderately stringent conditions (i.e., 
washing in 0.2X SSC/0.1% SDS at 42°C). Other exemplary hybridization conditions are 
described herein in the examples. 
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In instances of hybridization of deoxyoligonucleotides, additional exemplary stringent 
hybridization conditions include washing in 6X SSC/0.05% sodium pyrophosphate at 37°C (for 
]4-base oligonucleotides), 48°C (for 17-base oligos), 55°C (for 20-base oligonucleotides), and 
60°C (for 23-base oligonucleotides). 

As used herein, "substantially equivalent" can refer both to nucleotide and amino acid 
sequences, for example a mutant sequence, that varies from a reference sequence by one or more 
substitutions, deletions, or additions, the net effect of which does not result in an adverse 
functional dissimilarity between the reference and subject sequences. Typically, such a 
substantially equivalent sequence varies from one of those listed herein by no more than about 
35% (i.e., the number of individual residue substitutions, additions, and/or deletions in a 
substantially equivalent sequence, as compared to the corresponding reference sequence, divided 
by the total number of residues in the substantially equivalent sequence is about 0.35 or less). 
Such a sequence is said to have 65% sequence identity to the listed sequence. In one 
embodiment, a substantially equivalent, e.g., mutant, sequence of the invention varies from a 
listed sequence by no more than 30% (70% sequence identity); in a variation of this 
embodiment, by no more than 25% (75% sequence identity); and in a further variation of this 
embodiment, by no more than 20% (80% sequence identity) and in a further variation of this 
embodiment, by no more than 10% (90% sequence identity) and in a further variation of this 
embodiment, by no more that 5% (95% sequence identity). Substantially equivalent, e.g., 
mutant, amino acid sequences according to the invention preferably have at least 80% sequence 
identity with a listed amino acid sequence, more preferably at least 85% sequence identity, more 
preferably at least 90% sequence identity, more preferably at least 95% identity, more preferably 
at least 98% identity, and most preferably at least 99% identity. Substantially equivalent 
nucleotide sequences of the invention can have lower percent sequence identities, taking into 
account, for example, the redundancy or degeneracy of the genetic code. Preferably, nucleotide 
sequence has at least about 65% identity, more preferably at least about 75% identity, more 
preferably at least about 80% sequence identity, more preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, and most preferably at least about 
95% identity, more preferably at least about 98% sequence identity, and most preferably at least 
about 99% sequence identity. For the purposes of the present invention, sequences having 
substantially equivalent biological activity and substantially equivalent expression characteristics 
are considered substantially equivalent. For the purposes of determining equivalence, truncation 
of the mature sequence (e.g., via a mutation which creates a spurious stop codon) should be 
disregarded. Sequence identity may be determined, e.g., using the Jotun Hein method (Hein, J. 
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(1990) Methods Enzymol. 1 83:626-645). Identity between sequences can also be determined by 
other methods known in the art, e.g. by varying hybridization conditions. 

The term "totipotent" refers to the capability of a cell to differentiate into all of the cell 
types of an adult organism. 

The term "transformation" means introducing DNA into a suitable host cell so that the 
DNA is replicable, either as an extrachromosomal element, or by chromosomal integration. The 
term "transfection" refers to the taking up of an expression vector by a suitable host cell, whether 
or not any coding sequences are in fact expressed. The term "infection" refers to the introduction 
of nucleic acids into a suitable host cell by use of a virus or viral vector. 

As used herein, an "uptake modulating fragment," UMF, means a series of nucleotides 
which mediate the uptake of a linked DNA fragment into a cell. UMFs can be readily identified 
using known UMFs as a target sequence or target motif with the computer-based systems 
described below. The presence and activity of a UMF can be confirmed by attaching the 
suspected UMF to a marker sequence. The resulting nucleic acid molecule is then incubated 
with an appropriate host under appropriate conditions and the uptake of the marker sequence is 
determined. As described above, a UMF will increase the frequency of uptake of a linked 
marker sequence. 

Each of the above terms is meant to encompass all that is described for each, unless the 
context dictates otherwise. 

4.2 NUCLEIC ACIDS OF THE INVENTION 

Nucleotide sequences of the invention are set forth in the Sequence Listing. 

The isolated polynucleotides of the invention include a polynucleotide comprising the 
nucleotide sequences of SEQ ID NO: 1-13901 ; a polynucleotide encoding any one of the 
peptide sequences of SEQ ID NOS: 13902-27802; and a polynucleotide comprising the 
nucleotide sequence encoding the mature protein coding sequence of the polypeptides of any one 
of SEQ ID NOS: 13902-27802. The polynucleotides of the present invention also include, but 
are not limited to, a polynucleotide that hybridizes under stringent conditions to (a) the 
complement of any of the nucleotides sequences of SEQ ID NO: 1-13901; (b) nucleotide 
sequences encoding any one of the amino acid sequences set forth in the Sequence Listing; (c) a 
polynucleotide which is an allelic variant of any polynucleotide recited above; (d) a 
polynucleotide which encodes a species homolog of any of the proteins recited above; or (e) a 
polynucleotide that encodes a polypeptide comprising a specific domain or truncation of the 
polypeptides of SEQ ID NOS: 13902-27802. Domains of interest may depend on the nature of 
the encoded polypeptide; e.g., domains in receptor-like polypeptides include ligand-binding, 
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extracellular, transmembrane, or cytoplasmic domains, or combinations thereof; domains in 
immunoglobulin-like proteins include the variable immunoglobulin-Jike domains; domains in 
enzyme-like polypeptides include catalytic and substrate binding domains; and domains in ligand 
polypeptides include receptor-binding domains. 

The polynucleotides of the invention include naturally occurring or wholly or partially 
synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA. The polynucleotides 
may include all of the coding region of the cDNA or may represent a portion of the coding 
region of the cDNA. 

The present invention also provides genes corresponding to the cDNA sequences disclosed 
herein. The corresponding genes can be isolated in accordance with known methods using the 
sequence information disclosed herein. Such methods include the preparation of probes or primers 
from the disclosed sequence information for identification and/or amplification of genes in 
appropriate genomic libraries or other sources of genomic materials. Further 5' and 3' sequence can 
be obtained using methods known in the art. For example, full length cDNA or genomic DNA that 
corresponds to any of the polynucleotides of SEQ ID NO: 1-13901 can be obtained by screening 
appropriate cDNA or genomic DNA libraries under suitable hybridization conditions using any of 
the polynucleotides of SEQ ID NO: 1 -13901 or a portion thereof as a probe. Alternatively, the 
polynucleotides of SEQ ID NO: 1-13901 may be used as the basis for suitable primer(s) that allow 
identification and/or amplification of genes in appropriate genomic DNA or cDNA libraries. 

The nucleic acid sequences of the invention can be assembled from ESTs and sequences 
(including cDNA and genomic sequences) obtained from one or more public databases, such as 
dbEST, gbpri, and UniGene. The EST sequences can provide identifying sequence information, 
representative fragment or segment information, or novel segment information for the full-length 
gene. 

The polynucleotides of the invention also provide polynucleotides including nucleotide 
sequences that are substantially equivalent to the polynucleotides recited above. Polynucleotides 
according to the invention can have, e.g., at least about 65%, at least about 70%, at least about 
75%, at least about 80%, 81%, 82%, 83%, 84%, more typically at least about 85%, 86%, 87%, 
88%, 89%, more typically at least about 90%, 91%, 92%, 93%, 94%, and even more typically at 
least about 95%, 96%, 97%, 98%, 99%, sequence identity to a polynucleotide recited above. 

Included within the scope of the nucleic acid sequences of the invention are nucleic acid 
sequence fragments that hybridize under stringent conditions to any of the nucleotide sequences 
of SEQ ID NO: 1-13901, or complements thereof, which fragment is greater than about 5 
nucleotides, preferably 7 nucleotides, more preferably greater than 9 nucleotides and most 
preferably greater than 17 nucleotides. Fragments of, e.g. 15, 17, or 20 nucleotides or more that 

15 



_0164B35A2_I_> 



WO 01/64835 



PCT/USO 1/04927 



are selective for {i.e. specifically hybridize to any one of the polynucleotides of the invention) 
are contemplated. Probes capable of specifically hybridizing to a polynucleotide can 
differentiate polynucleotide sequences of the invention from other polynucleotide sequences in 
4 the same family of genes or can differentiate human genes from genes of other species, and are 
preferably based on unique nucleotide sequences. 

The sequences falling within the scope of the present invention are not limited to these 
specific sequences, but also include allelic and species variations thereof. Allelic and species 
8 variations can be routinely determined by comparing the sequence provided SEQ ID NO: 1-1 3901, 
a representative fragment thereof, or a nucleotide sequence at least 90% identical, preferably 95% 
identical, to SEQ ID NO: 1-1 3901 with a sequence from another isolate of the same species. 
Furthermore, to accommodate codon variability, the invention includes nucleic acid molecules 

1 2 coding for the same amino acid sequences as do the specific ORPs disclosed herein. In other words, 
in the coding region of an ORF, substitution of one codon for another codon that encodes the same 
amino acid is expressly contemplated. 

The nearest neighbor or homology result for the nucleic acids of the present invention, 

1 6 including SEQ ID NO: 1 - 1 390 1 can be obtained by searching a database using an algorithm or a 
program. Preferably, a BLAST which stands for Basic Local Alignment Search Tool is used to 
search for local sequence alignments (Altshul, S.F. J Mol. Evol. 36 290-300 (1 993) and Altschul 
S.F. et al. J. Mol. Biol. 2 1 :403-4 1 0 (1 990)). Alternatively a FASTA version 3 search against 

20 Genpept, using Fastxy algorithm. 

Species homologs (or orthologs) of the disclosed polynucleotides and proteins are also 
provided by the present invention. Species homologs may be isolated and identified by making 
suitable probes or primers from the sequences provided herein and screening a suitable nucleic 

24 acid source from the desired species. 

The invention also encompasses allelic variants of the disclosed polynucleotides or 
proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide which also 
encode proteins which are identical, homologous or related to that encoded by the 

28 polynucleotides. 

The nucleic acid sequences of the invention are further directed to sequences which 
encode variants of the described nucleic acids. These amino acid sequence variants may be 
prepared by methods known in the art by introducing appropriate nucleotide changes into a 

32 native or variant polynucleotide. There are two variables in the construction of amino acid 

sequence variants: the location of the mutation and the nature of the mutation. Nucleic acids 
encoding the amino acid sequence variants are preferably constructed by mutating the 
polynucleotide to encode an amino acid sequence that does not occur in nature. These nucleic 
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acid alterations can be made at sites that differ in the nucleic acids from different species 
(variable positions) or in highly conserved regions (constant regions). Sites at such locations 
will typically be modified in series, e.g., by substituting first with conservative choices (e.g., 
hydrophobic amino acid to a different hydrophobic amino acid) and then with more distant 
choices {e.g., hydrophobic amino acid to a charged amino acid), and then deletions or insertions 
may be made at the target site. Amino acid sequence deletions generally range from about 1 to 
30 residues, preferably about 1 to 10 residues, and are typically contiguous. Amino acid 
insertions include amino- and/or carboxyl-terminal fusions ranging in length from one to one 
hundred or more residues, as well as intrasequence insertions of single or multiple amino acid 
residues. Intrasequence insertions may range generally from about 1 to 10 amino residues, 
preferably from 1 to 5 residues. Examples of terminal insertions include the heterologous signal 
sequences necessary for secretion or for intracellular targeting in different host cells and 
sequences such as FLAG or poly-hisfidine sequences useful for purifying the expressed protein. 

In a preferred method, polynucleotides encoding the novel amino acid sequences are 
changed via site-directed mutagenesis. This method uses' oligonucleotide sequences to alter a 
polynucleotide to encode the desired amino acid variant, as well as sufficient adjacent 
nucleotides on both sides of the changed amino acid to form a stable duplex on either side of the 
site of being changed. In general, the techniques of site-directed mutagenesis are well known to 
those of skill in the art and this technique is exemplified by publications such as, Edelman et al., 
DNA 2:183 (1983). A versatile and efficient method for producing site-specific changes in a 
polynucleotide sequence was published by Zoller and Smith, Nucleic Acids Res. 10:6487-6500 
(1982). PCRmay also be used to create amino acid sequence variants of the novel nucleic acids. 
When small amounts of template DNA are used as starting material, primer(s) that differs 
slightly in sequence from the corresponding region in the template DNA can generate the desired 
amino acid variant. PCR amplification results in a population of product DNA fragments that 
differ from the polynucleotide template encoding the polypeptide at the position specified by the 
primer. The product DNA fragments replace the corresponding region in the plasmid and this 
gives a polynucleotide encoding the desired amino acid variant. 

A further technique for generating amino acid variants is the cassette mutagenesis 
technique described in Wells et al., Gene 34:315 (1985); and other mutagenesis techniques well 
known in the art, such as, for example, the techniques in Sambrook et al., supra, and Current 
Protocols in Molecular Biology, Ausubel et al. Due to the inherent degeneracy of the genetic 
code, other DNA sequences which encode substantially the same or a functionally equivalent 
amino acid sequence may be used in the practice of the invention for the cloning and expression 
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of these novel nucleic acids. Such DNA sequences include those which are capable of 
hybridizing to the appropriate novel nucleic acid sequence under stringent conditions. 

Polynucleotides encoding preferred polypeptide truncations of the invention can be used 
to generate polynucleotides encoding chimeric or fusion proteins comprising one or more 
domains of the invention and heterologous protein sequences. 

The polynucleotides of the invention additionally include the complement of any of the 
polynucleotides recited above. The polynucleotide can be DNA (genomic, cDNA, amplified, or 
synthetic) or RNA. Methods and algorithms for obtaining such polynucleotides are well known 
to those of skill in the art and can include, for example, methods for determining hybridization 
conditions that can routinely isolate polynucleotides of the desired sequence identities. 

In accordance with the invention, polynucleotide sequences comprising the mature 
protein coding sequences corresponding to any one of SEQ ID NO: 1-13901 , or functional 
equivalents thereof, may be used to generate recombinant DNA molecules that direct the 
expression of that nucleic acid, or a functional equivalent thereof, in appropriate host cells. Also 
included are the cDNA inserts of any of the clones identified herein. 

A polynucleotide according to the invention can be joined to any of a variety of other 
nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et al. 
(1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory. NY). Useful 
nucleotide sequences for joining to polynucleotides include an assortment of vectors, e.g., 
plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well known in the 
art. Accordingly, the invention also provides a vector including a polynucleotide of the 
invention and a host cell containing the polynucleotide. In general, the vector contains an origin 
of replication functional in at least one organism, convenient restriction endonuclease sites, and a 
selectable marker for the host cell. Vectors according to the invention include expression 
vectors, replication vectors, probe generation vectors, and sequencing vectors. A host cell 
according to the invention can be a prokaryotic or eukaryotic cell and can be a unicellular 
organism or part of a multicellular organism. 

The present invention further provides recombinant constructs comprising a nucleic acid 
having any of the nucleotide sequences of SEQ ID NO: 1-13901 or a fragment thereof or any 
other polynucleotides of the invention. In one embodiment, the recombinant constructs of (he 
present invention comprise a vector, such as a plasmid or viral vector, into which a nucleic acid 
having any of the nucleotide sequences of SEQ ID NO: 1-13901 or a fragment thereof is 
inserted, in a forward or reverse orientation. In the case of a vector comprising one of the ORFs 
of the present invention, the vector may further comprise regulatory sequences, including for 
example, a promoter, operably linked to the ORF. Large numbers of suitable vectors and 
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promoters are known to those of skill in the an and are commercially available for generating the 
recombinant constructs of the present invention. The following vectors are provided by way of 
example. Bacterial: pBs, phagescript, PsiXl 74, pBluescript SK, pBs KS, pNHSa, pNH16a, 
pNHl 8a, pNH46a (Stratagene); P Trc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia). 
Eukaryotic: pWLneo, pSV2cat, pOG44, PXTI, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL 
(Pharmacia). 

The isolated polynucleotide of the invention may be operably linked to an expression 
control sequence such as the pMT2 or pED expression vectors disclosed in Kaufman et al., 
Nucleic Acids Res. 1 9, 4485-4490 (1991), in order to produce the protein recombinantly. Many 
suitable expression control sequences are known in the art. General methods of expressing 
recombinant proteins are also known and are exemplified in R. Kaufman, Methods in 
Enzymology 1 85, 537-566 (1 990). As defined herein "operably linked" means that the isolated 
polynucleotide of the invention and an expression control sequence are situated within a vector 
or cell in such a way that the protein is expressed by a host cell which has been transformed 
(transfected) with the ligated polynucleotide/expression control sequence. 

Promoter regions can be selected from any desired gene using CAT (chloramphenicol 
transferase) vectors or other vectors with selectable markers. Two appropriate vectors are 
pKK232-8 and pCM7. Particular named bacterial promoters include lad, lacZ, T3, T7, gpt, 
lambda PR, and trc. Eukaryotic promoters include CMV immediate early, HSV thymidine 
kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of 
the appropriate vector and promoter is well within the level of ordinary skill in the art. 
Generally, recombinant expression vectors will include origins of replication and selectable 
markers permitting transformation of the host cell, e.g., the ampicillin resistance gene of E. coli 
and S. cerevisiae TRP1 gene, and a promoter derived from a highly-expressed gene to direct 
transcription of a downstream structural sequence. Such promoters can be derived from operons 
encoding glycolytic enzymes such as 3 -phosphogly cerate kinase (PGK), a-factor, acid 
phosphatase, or heat shock proteins, among others. The heterologous structural sequence is 
assembled in appropriate phase with translation initiation and termination sequences, and 
preferably, a leader sequence capable of directing secretion of translated protein into the 
periplasmic space or extracellular medium. Optionally, the heterologous sequence can encode a 
fusion protein including an amino terminal identification peptide imparting desired 
characteristics, e.g., stabilization or simplified purification of expressed recombinant product. 
Useful expression vectors for bacterial use are constructed by inserting a structural DNA 
sequence encoding a desired protein together with suitable translation initiation and termination 
signals in operable reading phase with a functional promoter. The vector will comprise one or 

19 

_0164835A2_L> 



WO 01/64835 



PCT/USO 1/04927 



more phenotypic selectable markers and an origin of replication to ensure maintenance of the 
vector and to, if desirable, provide amplification within the host. Suitable prokaryotic hosts for 
transformation include E. coli, Bacillus subtilis, Salmonella typhimnrium and various species 
within the genera Pseudomonas, Streptomyces, and Staphylococcus, although others may also be 
employed as a matter of choice. 

As a representative but non-limiting example, useful expression vectors for bacterial use 
can comprise a selectable marker and bacterial origin of replication derived from commercially 
available plasmids comprising genetic elements of the well known cloning vector pBR322 
(ATCC 37017). Such commercial vectors include, for example, pKK223-3 (Pharmacia Fine 
Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, WI, USA). These 
pBR322 "backbone" sections axe combined with an appropriate promoter and the structural 
sequence to be expressed. Following transformation of a suitable host strain and growth of the 
host strain to an appropriate cell density, the selected promoter is induced or derepressed by 
appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an 
additional period. Cells are typically harvested by centrifugation, disrupted by physical or 
chemical means, and the resulting crude extract retained for further purification. 

Polynucleotides of the invention can also be used to induce immune responses. For 
example, as described in Fan et al., Nat. Biotech. 17:870-872 (1999), incorporated herein by 
reference, nucleic acid sequences encoding a polypeptide may be used to generate antibodies 
against the encoded polypeptide following topical administration of -naked plasmid DNA or 
following injection, and preferably intramuscular injection of the DNA. The nucleic acid 
sequences are preferably inserted in a recombinant expression vector and may be in the form of 
naked DNA. 

4.3 ANTISENSE 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules that 
are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide 
sequence of SEQ ID NO: 1-13901 , or fragments, analogs or derivatives thereof. An "antisense" 
nucleic acid comprises a nucleotide sequence that is complementary to a "sense" nucleic acid 
encoding a protein, e.g., complementary to the coding strand of a double-stranded cDNA 
molecule or complementary to an mRNA sequence. In specific aspects, antisense nucleic acid 
molecules are provided that comprise a sequence complementary to at least about 10, 25, 50, 
100, 250 or 500 nucleotides or an entire coding strand, or to only a portion thereof. Nucleic acid 
molecules encoding fragments, homologs, derivatives and analogs of a protein of any of SEQ ID 
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NOS: 13902-27802 or antisense nucleic acids complementary to a nucleic acid sequence of SEQ 
ID NO: 1-13901 are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" 
of the coding strand of a nucleotide sequence of the invention. The term "coding region" refers 
to the region of the nucleotide sequence comprising codons which are translated into amino acid 
residues. In another embodiment, the antisense nucleic acid molecule is antisense to a 
"noncoding region" of the coding strand of a nucleotide sequence of the invention. The term 
"noncoding region" refers to 5' and 3' sequences which flank the coding region that are not 
translated into amino acids (i.e., also referred to as 5' and 3' untranslated regions). 

Given the coding strand sequences encoding a nucleic acid disclosed herein (e.g., SEQ 
ID NO: 1-13901 ), antisense nucleic acids of the invention can be designed according to the rules 
of Watson and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be 
complementary to the entire coding region of a mRNA, but more preferably is an oligonucleotide 
that is antisense to only a portion of the coding or noncoding region of a mRNA. For example, 
the antisense oligonucleotide can be complementary to the region surrounding the translation 
start site of a mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 
30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be 
constructed using chemical synthesis or enzymatic ligation reactions using procedures known in 
the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be 
chemically synthesized using naturally occurring nucleotides or variously modified nucleotides 
designed to increase the biological stability of the molecules or to increase the physical stability 
of the duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate 
derivatives and acridine substituted nucleotides can be used. 

Examples of modified nucleotides that can be used to generate the antisense nucleic acid 
include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 
4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 
2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 
inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 

2- methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 
uracil- 5 -oxy acetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 

3- (3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the 
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antisense nucleic acid can be produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the 
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, 
described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding a protein according to the invention to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or, for example, in the case of 
an antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in 
the major groove of the double helix. An example of a route of administration of antisense 
nucleic acid molecules of the invention includes direct injection at a tissue site. Alternatively, 
antisense nucleic acid molecules can be modified to target selected cells and then administered 
systemically. For example, for systemic administration, antisense molecules can be modified 
such that they specifically bind to receptors or antigens expressed on a selected cell surface, e.g., 
by Unking the antisense nucleic acid molecules to peptides or antibodies that bind to cell surface 
receptors or antigens. The antisense nucleic acid molecules can also be delivered to cells using 
the vectors described herein. To achieve sufficient intracellular concentrations of antisense 
molecules, vector constructs in which the antisense nucleic acid molecule is placed under the 
control of a strong pol II or pol III promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 
-a n omeric nucleic acid molecule. An -a nomeric nucleic acid molecule forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual -uni ts, the 
strands run parallel to each other (Gaultier et al (1987) Nucleic Acids Res 15: 6625-6641). The 
antisense nucleic acid molecule can also comprise a 2-o-methylribonucleotide (Inoue et al. 
(1987) Nucleic Acids Res 15: 6131-6148) or achimeric RNA -DNA analogue (Inoue et al. (1987) 
FEBSLett 215: 327-330). 

4.4 R1BOZYMES AND PNA MOIETIES 

In still another embodiment, an antisense nucleic acid of the invention is a ribozyme. 
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of cleaving a 
single-stranded nucleic acid, such as a mRNA, to which they have a complementary region. 
Thus, ribozymes {e.g., hammerhead ribozymes (described in Haselhoff and Gerlach (1988) 
Nature 334:585-591)) can be used to catalytically cleave a mRNA transcripts to thereby inhibit 
translation of a mRNA. A ribozyme having specificity for a nucleic acid of the invention can be 
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designed based upon the nucleotide sequence of a DNA disclosed herein (i.e., SEQ ID NO: 1- 
1 3901 ). For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in 
which the nucleotide sequence of the active site is complementary to the nucleotide sequence to 
be cleaved in a SECX-encoding mRNA. See, e.g., Cech et al. U.S. Pat. No. 4,987,07] ; and 
Cech et al. U.S. Pat. No. 5,1 16,742. Alternatively, SECX mRNA can be used to select a 
catalytic RNA having a specific ribonuclease activity from a pool of RNA molecules. See, e g , 
Bartel et al., (1993) Science 261:141 1-1418. 

Alternatively, gene expression can be inhibited by targeting nucleotide sequences 
complementary to the regulatory region (e.g., promoter and/or enhancers) to form triple helical 
structures that prevent transcription of the gene in target cells. See generally, Helene. (1991) 
Anticancer Drug Des. 6: 569-84; Helene. et al. (1992) Ann. N. Y. Acad. Set 660:27-36; and 
Maher (1992) Bioassays 14: 807-15. 

In various embodiments, the nucleic acids of the invention can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or 
solubility of the molecule. For example, the deoxyribose phosphate backbone of the nucleic 
acids can be modified to generate peptide nucleic acids (see Hyrup et al. (1996) Bioorg Med 
Chem 4: 5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid 
mimics, e.g. , DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA oligomers can be performed using 
standard solid phase peptide synthesis protocols as described in Hyrup et al. (1996) above; 
Perry-O'Keefe et al. (1996) PNAS 93: 14670-675. 

PNAs of the invention can be used in therapeutic and diagnostic applications. For 
example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of 
gene expression by, e.g. , inducing transcription or translation arrest or inhibiting replication. 
PNAs of the invention can also be used, e.g., in the analysis of single base pair mutations in a 
gene by, e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in 
combination with other enzymes, e.g., SI nucleases (Hyrup B. (1996) above); or as probes or 
primers for DNA sequence and hybridization (Hyrup et al. (1996), above; Perry-O'Keefe (1996), 
above). 

In another embodiment, PNAs of the invention can be modified, e.g., to enhance then- 
stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 
delivery known in the art. For example, PNA-DNA chimeras can be generated that may 
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combine the advantageous properties of PNA and DNA. Such chimeras allow DNA recognition 
enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA portion while the PNA 
portion would provide high binding affinity and specificity. PNA-DNA chimeras can be linked 
4 using linkers of appropriate lengths selected in terms of base stacking, number of bonds between 
the nucleobases, and orientation (Hyrup (1996) above). The synthesis of PNA-DNA chimeras 
can be performed as described in Hyrup (1996) above and Finn et al. (1996) Nucl Acids Res 24: 
3357-63. For example, a DNA chain can be synthesized on a solid support using standard 
8 phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g., 

5'-(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite, can be used between the PNA 
and the 5' end of DNA (Mag et al (1989) Nucl Acid Res 17: 5973-88). PNA monomers are then 
coupled in a stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 3' 
12 DNA segment (Finn et al. (1996) above). Alternatively, chimeric molecules can be synthesized 
with a 5' DNA segment and a 3' PNA segment. See, Petersen et al. (1975) Bioorg Med Chem 
LettS: 1119-11124. . - • 

In other embodiments, the oligonucleotide may include other appended groups such as 
16 peptides (e.g. , for targeting host cell receptors in vivo), or agents facilitating transport across the 
, cell membrane (see, e.g. , Letsinger etal., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556; 

Lemaitre etal, 1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. W088/09810) or 
. the blood-.brain barrier (see, e.g. , PCT Publication No, W089/1 0 1 34). In addition, 
20 oligonucleotides can be modified with hybridization triggered cleavage agents (See, e.g. , Krol et 
al., 1988, BioTechniques 6:958-976) or intercalating agents. (See, e.g., Zon, 1988, Pharm. Res. 
5: 539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a 
peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered 
24 cleavage agent, etc. 

4.5 HOSTS 

The present invention further provides host cells genetically engineered to contain the 
28 polynucleotides of the invention. For example, such host cells may contain nucleic acids of the 
invention introduced into the host cell using known transformation, transfection or infection 
methods. The present invention still further provides host cells genetically engineered to express 
the polynucleotides of the invention, wherein such polynucleotides are in operative association 
32 with a regulatory sequence heterologous to the host cell which drives expression of the 
polynucleotides in the cell. 

Knowledge of nucleic acid sequences allows for modification of cells to permit, or 
increase, expression of endogenous polypeptide. Cells can be modified (e.g., by homologous 
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recombination) to provide increased polypeptide expression by replacing, in whole or in part, the 
naturally occurring promoter with all or part of a heterologous promoter so that the cells express 
the polypeptide at higher levels. The heterologous promoter is inserted in such a manner that it 
is operatively linked to the encoding sequences. See, for example, PCT International Publication 
No. WO94/12650, PCT International Publication No. WO92/20808, and PCT International 
Publication No. W09 1/0995 5. It is also contemplated that, in addition to heterologous promoter 
DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which 
encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or 
intron DNA may be inserted along with the heterologous promoter DNA. If linked to the coding 
sequence, amplification of the marker DNA by standard selection methods results in co- 
amplification of the desired protein coding sequences in the cells. 

The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower 
eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a 
bacterial cell. Introduction of the recombinant construct into the host cell can be effected by 
calcium phosphate transfection, DEAE, dextran mediated transfection, or electroporation (Davis, 
L. et al., Basic Methods in Molecular Biology (1986)). The host cells containing one of the 
polynucleotides of the invention, can be used in conventional manners to produce the gene 
product encoded by the isolated fragment (in the case of an ORF) or can be used to produce a 
heterologous protein under the control of the EMF. 

Any host/vector system can be used to express one or more of the ORFs of the present 
invention. These include, but are not limited to, eukaryotic hosts such as HeLa cells, Cv-1 cell, 
COS cells, 293 cells, and Sf9 cells, as well as prokaryotic host such as E. coli and B. subtilis. 
The most preferred cells are those which do not normally express the particular polypeptide or 
protein or which expresses the polypeptide or protein at low natural level. Mature proteins can 
be expressed in mammalian cells, yeast, bacteria, or other cells under the control of appropriate 
promoters. Cell-free translation systems can also be employed to produce such proteins using 
RNAs derived from the DNA constructs of the present invention. Appropriate cloning and 
expression vectors for use with prokaryotic and eukaryotic hosts are described by Sambrook, et 
al., in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, New 
York (1989), the disclosure of which is hereby incorporated by reference. 

Various mammalian cell culture systems can also be employed to express recombinant 
protein. Examples of mammalian expression systems include the COS-7 lines of monkey kidney 
fibroblasts, described by Gluzman, Cell 23:175 (1981). Other cell lines capable of expressing a 
compatible vector are, for example, the C127, monkey COS cells, Chinese Hamster Ovary 
(CHO) cells, human kidney 293 cells, human epidermal A43 1 cells, human Colo205 cells, 3T3 
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cells, CV-1 cells, other transformed primate cell lines, normal diploid cells, cell strains derived 
from in vitro culture of primary tissue, primary explants, HeLa cells, mouse L cells, BHK, 
HL-60, U937, HaK or Jurkat cells. Mammalian expression vectors will comprise an origin of 

4 replication, a suitable promoter and also any necessary ribosome binding sites, polyadenylation 
site, splice donor and acceptor sites, transcriptional termination sequences, and 5 ' flanking 
nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for example, 
SV40 origin, early promoter, enhancer, splice, and polyadenylation sites may be used to provide 

8 the required nontranscribed genetic elements. Recombinant polypeptides and proteins produced 
in bacterial culture are usually isolated by initial extraction from cell pellets, followed by one or 
more salting-out, aqueous ion exchange or size exclusion chromatography steps. Protein 
refolding steps can be used, as necessary, in completing configuration of the mature protein. 
12 Finally, high performance liquid chromatography (HPLC) can be employed for final purification 
steps. Microbial cells employed in expression of proteins can be disrupted by any convenient 
method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing 
agents. 

* 16 Alternatively, it may be possible to produce the protein in lower eukaryotes such as yeast 

or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or 
any yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 

20 strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial 

strain capable of expressing heterologous proteins. If the protein is made in yeast or bacteria, it 
may be necessary to modify the protein produced therein, for example by phosphorylation or 
glycosylation of the appropriate sites, in order to obtain the functional protein. Such covalent 

24 attachments may be accomplished using known chemical or enzymatic methods. 

In another embodiment of the present invention, cells and tissues may be engineered to 
express an endogenous gene comprising the polynucleotides of the invention under the control of 
inducible regulatory elements, in which case the regulatory sequences of the endogenous gene 

28 may be replaced by homologous recombination. As described herein, gene targeting can be used 
to replace a gene's existing regulatory region with a regulatory sequence isolated from a different 
gene or a novel regulatory sequence synthesized by genetic engineering methods. Such 
regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment regions, 

32 negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or 
combinations of said sequences. Alternatively, sequences which affect the structure or stability 
of the RNA or protein produced may be replaced, removed, added, or otherwise modified by 
targeting. These sequence include polyadenylation signals, rnRNA stability elements, splice 
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sites, leader sequences for enhancing or modifying transport or secretion properties of the 
protein, or other sequences which alter or improve the function or stability of protein or RNA 
molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the 
gene under the control of the new regulatory sequence, e.g., inserting a new promoter or 
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple deletion 
of a regulatory element, such as the deletion of a tissue-specific negative regulatory element. 
Alternatively, the targeting event may replace an existing element; for example, a tissue-specific 
enhancer can be replaced by an enhancer that has broader or different cell-type specificity than 
the naturally occurring elements. Here, the naturally occurring sequences are deleted and new 
sequences are added. In all cases, the identification of the targeting event may be facilitated by 
the use of one or more selectable marker genes that are contiguous with the targeting DNA, 
allowing for the selection of cells in which the exogenous DNA has integrated into the host cell 
genome. The identification of the targeting event may also be facilitated by the use of one or 
more marker genes exhibiting the property of negative selection, such that the negatively 
selectable marker is linked to the exogenous DNA, but configured such that the negatively 
selectable marker flanks the targeting sequence, and such that a correct homologous 
recombination event with sequences in the host cell genome does not result in the stable 
integration of the negatively selectable marker. Markers useful for this purpose include the 
Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine 
phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance with 
this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to 
Chappel; U.S. Patent No. 5,578,461 to Sherwin et al.; International Application No. 
PCT/US92/09627 (WO93/09222) by Selden et al.; and International Application No. 
PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by reference 
herein in its entirety. 

4.6 POLYPEPTIDES OF THE INVENTION 

The isolated polypeptides of the invention include, but are not limited to, a polypeptide 
comprising: the amino acid sequences set forth as any one of SEQ ID NOS: 13902-278Q2 or an 
amino acid sequence encoded by any one of the nucleotide sequences SEQ ID NO: 1-13901 or 
the corresponding full length or mature protein. Polypeptides of the invention also include 
polypeptides preferably with biological or immunological activity that are encoded by: (a) a 
polynucleotide having any one of the nucleotide sequences set forth in SEQ ID NO: 1-13901 or 
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(b) polynucleotides encoding any one of the amino acid sequences set forth as SEQ ID NOS: 
13902-27802 or (c) polynucleotides that hybridize to the complement of the polynucleotides of 
either (a) or (b) under stringent hybridization conditions. The invention also provides 
biologically active or immunologically active variants of any of the amino acid sequences set 
forth as SEQ ID NOS: 13902-27802 or the corresponding full length or mature protein; and 
"sxibstantial equivalents" thereof (e.g., with at least about 65%, at least about 70%, at least about 
75%, at least about 80%, at least about 85%, 86%, 87%, 88%, 89%, at least about 90%, 91%, 
92%, 93%, 94%, typically at least about 95%, 96%, 97%, more typically at least about 98%, or 
most typically at least about 99% amino acid identity) that retain biological activity. 
Polypeptides encoded by allelic variants may have a similar, increased, or decreased activity 
compared to polypeptides comprising SEQ ID NOS: 13902-27802. 

Fragments of the proteins of the present invention which are capable of exhibiting 
biological activity are also encompassed by the present invention. Fragments of the protein may 
be in linear form or they may be cyclized using known methods, for example, as described in H. 
U. Saragovi, et al., Bio/Technology 10, 773-778 (1992) and in R. S. McDowell, et al., J. Amer. 
Chem. Soc. 1 14, 9245-9253 (1992), both of which are incorporated herein by reference. Such 
fragments may be fused to carrier molecules such as immunoglobulins for many purposes, 
including increasing the valency of protein binding sites. 

The present invention also provides both full-length and mature forms (for example, 
without a signal sequence or precursor sequence) of the disclosed proteins. The protein coding 
sequence is identified in the sequence listing by translation of the disclosed nucleotide 
sequences. The mature form of such protein may be obtained by expression of a full-length 
polynucleotide in a suitable mammalian cell or other host cell. The sequence of the mature form 
of the protein is also determinable from the amino acid sequence of the full-length form. Where 
proteins of the present invention are membrane bound, soluble forms of the proteins are also 
provided. In such forms, part or all of the regions causing the proteins to be membrane bound 
are deleted so that the proteins are fully secreted from the cell in which they are expressed. 

Protein compositions of the present invention may further comprise an acceptable carrier, 
such as a hydrophilic, e.g., pharmaceutically acceptable, carrier. • 

The present invention further provides isolated polypeptides encoded by the nucleic acid 
fragments of the present invention or by degenerate variants of the nucleic acid fragments of the 
present invention. By "degenerate variant" is intended nucleotide fragments which differ from a 
nucleic acid fragment of the present invention (e.g., an ORF) by nucleotide sequence but, due to 
the degeneracy of the genetic code, encode an identical polypeptide sequence. Preferred nucleic 
acid fragments of the present invention are the ORFs that encode proteins. 
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A variety of methodologies known in the art can be utilized to obtain any one of the 
isolated polypeptides or proteins of the present invention. At the simplest level, the amino acid 
sequence can be synthesized using commercially available peptide synthesizers. The 
synthetically-constructed protein sequences, by virtue of sharing primary, secondary or tertiary 
structural and/or conformational characteristics with proteins may possess biological properties 
in common therewith, including protein activity. This technique is particularly useful in 
producing small peptides and fragments of larger polypeptides. Fragments are useful, for 
example, in generating antibodies against the native polypeptide. Thus, they may be employed 
as biologically active or immunological substitutes for natural, purified proteins in screening of 
therapeutic compounds and in immunological processes for the development of antibodies. 

The polypeptides and proteins of the present invention can alternatively be purified from 
cells which have been altered to express the desired polypeptide or protein. As used herein, a 
cell is said to be altered to express a desired polypeptide or protein when the cell, through genetic 
manipulation, is made to produce a polypeptide or protein which it normally does not produce or 
which the cell normally produces at a lower level. One skilled in the art can readily adapt 
procedures for introducing and expressing either recombinant or synthetic sequences into 
eukaryotic or prokaryotic cells in order to generate a cell which produces one of the polypeptides 
or proteins of the present invention. 

The invention also relates to methods for producing a polypeptide comprising growing a 
culture of host cells of the invention in a suitable culture medium, and purifying the protein from 
the cells or the culture in which the cells are grown. For example, the methods of the invention 
include a process for producing a polypeptide in which a host cell containing a suitable 
expression vector that includes a polynucleotide of the invention is cultured under conditions that 
allow expression of the encoded polypeptide. The polypeptide can be recovered from the 
culture, conveniently from the culture medium, or from a lysate prepared from the host cells and 
further purified. Preferred embodiments include those in which the protein produced by such 
process is a full length or mature form of the protein. 

In an alternative method, the polypeptide or protein is purified from bacterial cells which 
naturally produce the polypeptide or protein. One skilled in the art can readily follow known 
methods for isolating polypeptides and proteins in order to obtain one of the isolated 
polypeptides or proteins of the present invention. These include, but are not limited to, 
tmmunochromatography, HPLC, size-exclusion chromatography, ion-exchange chromatography, 
and immuno-affinity chromatography. See, e.g., Scopes, Protein Purification: Principles and 
Practice, Springer- Verlag (1994); Sambrook, et al., in Molecular Cloning: A Laboratory 
Manual; Ausubel et al., Current Protocols in Molecular Biology. Polypeptide fragments that 
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retain biological/immunological activity include fragments comprising greater than about 100 
amino acids, or greater than about 200 amino acids, and fragments that encode specific protein 
domains. 

4 The purified polypeptides can be used in in vitro binding assays which are well known in 

the art to identify molecules which bind to the polypeptides. These molecules include but are not 
limited to, for e.g., small molecules, molecules from combinatorial libraries, antibodies or other 
proteins. The molecules identified in the binding assay are then tested for antagonist or agonist 
8 activity in in vivo tissue culture or animal models that are well known in the art. In brief, the 
molecules are titrated into a plurality of cell cultures or animals and then tested for either 
cell/animal death or prolonged survival of the animal/cells. 

In addition, the peptides of the invention or molecules capable of binding to the peptides 

12 may be complexed with toxins, e.g., ricin or cholera, or with other compounds that are toxic to 
cells. The toxin-binding molecule complex is then targeted to a tumor or other cell by the 
specificity of the binding molecule for SEQ ID NOS: 13902-27802. 

The protein of the invention may also be expressed as a product of transgenic animals, 

16 e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which are characterized 
by somatic or germ cells containing a nucleotide sequence encoding the protein. 

The proteins provided herein also include proteins characterized by amino acid sequences 
similar to those of purified proteins but into which modification are naturally provided or 

20 deliberately engineered. For example, modifications, in the peptide or DNA sequence, can be 

made by those skilled in the art using known techniques. Modifications of interest in the protein 
sequences may include the alteration, substitution, replacement, insertion or deletion of a 
selected amino acid residue in the coding sequence. For example, one or more of the cysteine 

24 residues may be deleted or replaced with another amino acid to alter the conformation of the 
molecule. Techniques for such alteration, substitution, replacement, insertion or deletion are 
well known to those skilled in the art (see, e.g., U.S. Pat. No. 4,518,584). Preferably, such 
alteration, substitution, replacement, insertion or deletion retains the desired activity of the 

28 protein. Regions of the protein that are important for the protein function can be determined by 
various methods known in the art including the alanine-scanning method which involved 
systematic substitution of single or strings of amino acids with alanine, followed by testing the 
resulting alanine-containing variant for biological activity. This type of analysis determines the 

32 importance of the substituted amino acid(s) in biological activity. Regions of the protein that are 
important for protein function may be determined by the eMATRIX program. 

Other fragments and derivatives of the sequences of proteins which would be expected to 
retain protein activity in whole or in part and are useful for screening or other immunological 
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methodologies may also be easily made by those skilled in the art given the disclosures herein. 
Such modifications are encompassed by the present invention. 

The protein may also be produced by operably linking the isolated polynucleotide of the 
4 invention to suitable control sequences in one or more insect expression vectors, and employing 
an insect expression system. Materials and methods for baculovirus/insect cell expression 
systems are commercially available in kit form from, e.g., Invitrogen, San Diego, Calif., U.S.A. 
(the MaxBat™ kit), and such methods are well known in the art, as described in Summers and 
8 Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987), incorporated herein by 
reference. As used herein, an insect cell capable of expressing a polynucleotide of the present 
invention is "transformed." 

The protein of the invention may be prepared by culturing transformed host cells under 
12 culture conditions suitable to express the recombinant protein. The resulting expressed protein 
may then be purified from such culture (i.e., from culture medium or cell extracts) using known 
purification processes, such as gel filtration and ion exchange chromatography. The purification 
of the protein may also include an affinity column containing agents which will bind to the 

16 protein; one or more column steps over such affinity resins as concanavalin A-agarose, 
heparin-toyopearl™ or Cibacrom blue 3GA Sepharose™; one or more steps involving 
hydrophobic interaction chromatography using such resins as phenyl ether, butyl ether, or propyl 
ether; or immunoaffmity chromatography. 

20 Alternatively, the protein of the invention may also be expressed in a form which will 

facilitate purification. For example, it may be expressed as a fusion protein, such as those of 
maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin (TRX), or as a 
His tag. Kits for expression and purification of such fusion proteins are commercially available 

24 from New England BioLab (Beverly, Mass.), Pharmacia (Piscataway, N.J.) and Invitrogen, 

respectively. The protein can also be tagged with an epitope and subsequently purified by using 
a specific antibody directed to such epitope. One such epitope ("FLAG®") is commercially 
available from Kodak (New Haven, Conn.). 

28 Finally, one or more reverse-phase high performance liquid chromatography (RP- HPLC) 

steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant methyl or other 
aliphatic groups, can be employed to further purify the protein. Some or all of the foregoing 
purification steps, in various combinations, can also be employed to provide a substantially 

32 homogeneous isolated recombinant protein. The protein thus purified is substantially free of 

other mammalian proteins and is defined in accordance with the present invention as an "isolated 
protein." 
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The polypeptides of the invention include analogs (variants). This embraces fragments, 
as well as peptides in which one or more amino acids has been deleted, inserted, or substituted. 
Also, analogs of the polypeptides of the invention embrace fusions of the polypeptides or 

4 modifications of the polypeptides of the invention, wherein the polypeptide or analog is fused to 
another moiety or moieties, e.g., targeting moiety or another therapeutic agent. Such analogs 
may exhibit improved properties such as activity and/or stability. Examples of moieties which 
may be fused to the polypeptide or an analog include, for example, targeting moieties which 

8 provide for the delivery of polypeptide to pancreatic cells, e.g. , antibodies to pancreatic cells, 
antibodies to immune cells such as T-cells, monocytes, dendritic cells, granulocytes, etc., as well 
as receptor and ligands expressed on pancreatic or immune cells. Other moieties which may be 
fused to the polypeptide include therapeutic agents which are used for treatment, for example, 
12 immunosuppressive drugs such as cyclosporin, SK506, azathioprine, CD3 antibodies and 

steroids. Also, polypeptides may be fused to immune modulators, and other cytokines such as 
alpha or beta interferon. 

16 4.6.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE IDENTITY 

AND SIMILARITY 

Preferred identity and/or similarity are designed to give the largest match between the 
sequences tested. Methods to determine identity and similarity are codified in computer 

20 programs including, but are not limited to, the GCG program package, including GAP 

(Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984); Genetics Computer Group, 
University of Wisconsin, Madison, WI), BLASTP, BLASTN, BLASTX, FASTA (Altschul, S.F. 
et al., J. Molec. Biol. 215:403-410 (1990), PSI-BLAST (Altschul S.F. et al., Nucleic Acids Res. 

24 vol. 25, pp. 3389-3402, herein incorporated by reference), eMatrix software (Wu et al., J. Comp. 
Biol., Vol. 6, pp. 219-235 (1999), herein incorporated by reference), eMotif software (Nevill- 
Manning et al, ISMB-97, Vol. 4, pp. 202-209, herein incorporated by reference), pFam software 
(Sonnhammer et al., Nucleic Acids Res., Vol. 26(1), pp. 320-322 (1998), herein incorporated by 

28 reference) and the Kyte-Doolittle hydrophobocity prediction algorithm (J. Mol Biol, 157, pp. 

105-31 (1982), incorporated herein by reference). The BLAST programs are publicly available 
from the National Center for Biotechnology Information (NCBI) and other sources (BLAST 
Manual, Altschul, S., et al. NCB NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. Mol. 

32 Biol. 215:403-410 (1990). 

4.7 CHIMERIC AND FUSION PROTEINS 

The invention also provides chimeric or fusion proteins. As used herein, a ''chimeric 
protein" or "fusion protein" comprises a polypeptide of the invention operatively linked to 

32 



JSDOCID: <WO 0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



another polypeptide. Within a fusion protein the polypeptide according to the invention can 
correspond to all or a portion of a protein according to the invention. In one embodiment, a 
fusion protein comprises at least one biologically active portion of a protein according to the 
invention. In another embodiment, a fusion protein comprises at least two biologically active 
portions of a protein according to the invention. Within the fusion protein, the term "operatively 
linked" is intended to indicate that the polypeptide according to the invention and the other 
polypeptide are fused in-frame to each other. The polypeptide can be fused to the N-terminus or 
C-terminus. 

For example, in one embodiment a fusion protein comprises a polypeptide according to 
the invention operably linked to the extracellular domain of a second protein. 

In another embodiment, the fusion protein is a GST-fusion protein in which the 
polypeptide sequences of the invention are fused to the C-terminus of the GST (i.e., glutathione 
S-transferase) sequences. 

In another embodiment, the fusion protein is an immunoglobulin fusion protein in which 
the polypeptide sequences according to the invention comprises one or more domains are fused 
to sequences derived from a member of the immunoglobulin protein family. The 
immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical 
compositions and administered to a subject to inhibit an interaction between a ligand and a 
protein of the invention on the surface of a cell, to thereby suppress signal transduction in vivo. 
The immunoglobulin fusion proteins can be used to affect the bioavailability of a cognate ligand. 
Inhibition of the ligand/protein interaction may be useful therapeutically for both the treatment of 
proliferative and differentiative disorders, e.g., cancer as well as modulating (e.g., promoting or 
inhibiting) cell survival. Moreover, the immunoglobulin fusion proteins of the invention can be 
used as immunogens to produce antibodies in a subject, to purify ligands, and in screening assays 
to identify molecules that inhibit the interaction of a polypeptide of the invention with a ligand. 

A chimeric or fusion protein of the invention can be produced by standard recombinant 
DNA techniques. For example, DNA fragments coding for the different polypeptide sequences 
are ligated together in-frame in accordance with conventional techniques, e.g., by employing 
blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for 
appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to 
avoid undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene can 
be synthesized by conventional techniques including automated DNA synthesizers. 
Alternatively, PCR amplification of gene fragments can be carried out using anchor primers that 
give rise to complementary overhangs between two consecutive gene fragments that can 
subsequently be annealed and reamplified to generate a chimeric gene sequence (see, for 
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example, Ausubel et al. (eds.) Current'Protocols in Molecular Biology, John Wiley & 
Sons, 1 992). Moreover, many expression vectors are commercially available that already encode 
a fusion moiety (e.g., a GST polypeptide). A nucleic acid encoding a polypeptide of the 
4 invention can be cloned into such an expression vector such that the fusion moiety is linked 
in-frame to the protein of the invention. 

4.8 GENE THERAPY 
8 Mutations in the polynucleotides of the invention gene may result in loss of normal 

function of the encoded protein. The invention thus provides gene therapy to restore normal 
activity of the polypeptides of the invention; or to treat disease states involving polypeptides of 
the invention. Delivery of a functional gene encoding polypeptides of the invention to 

12 appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, and more particularly 
viral vectors {e.g., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo by use of 
physical DNA transfer methods (e.g., liposomes or chemical treatments). See, for example, 
Anderson, Nature, supplement to vol. 392, no. 6679, pp.25-20 (1998). For additional reviews of 

16 gene therapy technology see Friedmann, Science, 244: 1275-1281 (1989); Verma, Scientific 
American: 68-84 (1990); and Miller, Nature, 357: 455-460 (1992). Introduction of any one of 
the nucleotides of the present invention or a gene encoding the polypeptides of the present 
invention can also be accomplished with extrachromosomal substrates (transient expression) or 

20 artificial chromosomes (stable expression). Cells may also be cultured ex vivo in the presence of 
proteins of the present invention in order to proliferate or to produce a desired effect on or 
activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes. 
Alternatively, it is contemplated that in other human disease states, preventing the expression of 

24 or inhibiting the activity of polypeptides of the invention will be useful in treating the disease 
states. It is contemplated that antisense therapy or gene therapy could be applied to negatively 
regulate the expression of polypeptides of the invention. 

Other methods inhibiting expression of a protein include the introduction of antisense 

28 molecules to the nucleic acids of the present invention, their complements, or their translated RNA 
sequences, by methods known in the art. Further, the polypeptides of the present invention can be 
inhibited by using targeted deletion methods, or the insertion of a negative regulatory element such 
as a silencer, which is tissue specific. 

32 The present invention still further provides cells genetically engineered in vivo to express the 

polynucleotides of the invention, wherein such polynucleotides are in operative association with a 
regulatory sequence heterologous to the host cell which drives expression of the polynucleotides in 
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the cell. These methods can be used to increase or decrease the expression of the polynucleotides of 
the present invention. 

Knowledge of DNA sequences provided by the invention allows for modification of cells to 
4 permit, increase, or decrease, expression of endogenous polypeptide. Cells can be modified (e.g. , by 
homologous recombination) to provide increased polypeptide expression by replacing, in whole or 
in part, the naturally occurring promoter with all or part of a heterologous promoter so that the cells 
express the protein at higher levels. The heterologous promoter is inserted in such a manner that it is 
8 operatively linked to the desired protein encoding sequences. See, for example, PCT International 
Publication No. WO 94/12650, PCT International Publication No. WO 92/20808, and PCT 
International Publication No. WO 91/09955. It is also contemplated that, in addition to heterologous 
promoter DNA, amplifiable marker DNA (e.g. , ada, dhfr, and the multifunctional CAD gene which 

12 encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or 
intron DNA may be inserted along with the heterologous promoter DNA. If linked to the desired 
protein coding sequence, amplification of the marker DNA by standard selection methods results in 
co-amplification of the desired protein coding sequences in the cells. 

16 In another embodiment of the present invention, cells and tissues may be engineered to 

express an endogenous gene comprising the polynucleotides of the invention under the control of 
inducible regulatory elements, in which case the regulatory sequences of the endogenous gene may 
be replaced by homologous recombination. As described herein, gene targeting can be used to 

20 replace a gene's existing regulatory region with a regulatory sequence isolated from a different gene 
or a novel regulatory sequence synthesized by genetic engineering methods. Such regulatory 
sequences may be comprised of promoters, enhancers, scaffold-attachment regions, negative 
regulatory elements, transcriptional initiation sites, regulatory protein binding sites or combinations 

24 of said sequences. Alternatively, sequences which affect the structure or stability of the RNA or 
protein produced may be replaced, removed, added, or otherwise modified by targeting. These 
sequences include polyadenylation signals, mRNA stability elements, splice sites, leader sequences 
for enhancing or modifying transport or secretion properties of the protein, or other sequences 

28 which alter or improve the function or stability of protein or RNA molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the gene 
under the control of the new regulatory sequence, e.g., inserting a new promoter or enhancer or both 
upstream of a gene. Alternatively, the targeting event may be a simple deletion of a regulatory 

32 element, such as the deletion of a tissue-specific negative regulatory element. Alternatively, the 
targeting event may replace an existing element; for example, a tissue-specific enhancer can be 
replaced by an enhancer that has broader or different cell -type specificity than the naturally 
occurring elements. Here, the naturally occurring sequences are deleted and new sequences are 
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added. In all cases, the identification of the targeting event may be facilitated by the use of one or 
more selectable marker genes that are contiguous with the targeting DNA, allowing for the selection 
of cells in which the exogenous DNA has integrated into the cell genome. The identification of the 

4 targeting event may also be facilitated by the use of one or more marker genes exhibiting the " 

property of negative selection, such that the negatively selectable marker is linked to the exogenous 
DNA, but configured such that the negatively selectable marker flanks the targeting sequence, and 
such that a correct homologous recombination event with sequences in the host cell genome does 

8 not result in the stable integration of the negatively selectable marker. Markers useful for this 
purpose include the Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial 
xanthine-guanine phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance with this 
1 2 aspect of the invention are more particularly described in U.S. Patent No. 5,272,07 1 to Chappel; 
U.S. Patent No. 5^578,461 to Sherwin et al.; International ApplicationNo. PCT/US92/09627 
(WO93/09222)by Seldenet al.; and International ApplicationNo. PCT/US90/0643 6 
(W09 1/06667) by Skoultchi et al., each of which is incorporated by reference herein in its entirety. 

16 

4.9 TRANSGENIC ANIMALS 

In preferred methods to determine biological functions of the polypeptides of the 
invention in vivo, one or more genes provided by the invention are either over expressed or 

20 inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244: 1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 
control of exogenous or endogenous promoter elements, are known as transgenic animals. 
Animals in which an endogenous gene has been inactivated by homologous recombination are 

24 referred to as "knockout" animals. Knockout animals, preferably non-human mammals, can be 

prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. Transgenic 
animals are useful to determine the roles polypeptides of the invention play in biological 
processes, and preferably in disease states. Transgenic animals are useful as model systems to 

28 identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human 
mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT 
Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of a promoter of the 

32 polynucleotides of the invention is either activated or inactivated to alter the level of expression 
of the polypeptides of the invention. Inactivation can be carried out using homologous 
recombination methods described above. Activation can be achieved by supplementing or even 
replacing the homologous promoter to provide for increased protein expression. The homologous 
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promoter can be supplemented by insertion of one or more heterologous enhancer elements 
known to confer promoter activation in a particular tissue. 

The polynucleotides of the present invention also make possible the development, 
4 through, e.g. , homologous recombination or knock out strategies, of animals that fail to express 
polypeptides of the invention or that express a variant polypeptide. Such animals are useful as 
models for studying the in vivo activities of polypeptide as well as for studying modulators of the 
polypeptides of the invention. 
8 In preferred methods to determine biological functions of the polypeptides of the 

invention in vivo, one or more genes provided by the invention are either over expressed or 
inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 

12 control of exogenous or endogenous promoter elements, are known as transgenic animals. 

Animals in which an endogenous gene has been inactivated by homologous recombination are 
referred to as "knockout" animals. Knockout animals, preferably non-human mammals, can be 
prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. Transgenic 

16 animals are useful to determine the roles polypeptides of the invention play in biological 

processes, and preferably in disease states. Transgenic animals are useful as model systems to 
identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human 
mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT 

20 Publication No. W094/28 1 22, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of the polynucleotides of the 
invention promoter is either activated or inactivated to alter the level of expression of the 
polypeptides of the invention. Inactivation can be carried out using homologous recombination 

24 methods described above. Activation can be achieved by supplementing or even replacing the 
homologous promoter to provide for increased protein expression. The homologous promoter 
can be supplemented by insertion of one or more heterologous enhancer elements known to 
confer promoter activation in a particular tissue. 

28 

4.10 USES AND BIOLOGICAL ACTIVITY 

The polynucleotides and proteins of the present invention are expected to exhibit one or 
more of the uses or biological activities (including those associated with assays cited herein) 
32 identified herein. Uses or activities described for proteins of the present invention may be 

provided by administration or use of such proteins or of polynucleotides encoding such proteins 
(such as, for example, in gene therapies or vectors suitable for introduction of DNA). The 
mechanism underlying the particular condition or pathology will dictate whether the 
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polypeptides of the invention, the polynucleotides of the invention or modulators (activators or 
inhibitors) thereof would be beneficial to the subject in need of treatment. Thus, "therapeutic 
compositions of the invention" include compositions comprising isolated polynucleotides 
4 (including recombinant DNA molecules, cloned genes and degenerate variants thereof) or 

polypeptides of the invention (including full length protein, mature protein and truncations or 
domains thereof), or compounds and other substances that modulate the overall activity of the 
target gene products, either at the level of target gene/protein expression or target protein 
8 activity. Such modulators include polypeptides, analogs, (variants), including fragments and 
fusion proteins, antibodies and other binding proteins; chemical compounds that directly or 
indirectly activate or inhibit the polypeptides of the invention (identified, e.g., via drug screening 
assays as described herein); antisense polynucleotides and polynucleotides suitable for triple 
12 helix formation; and in particular antibodies or other binding partners that specifically recognize 
one or more epitopes of the polypeptides of the invention. 

The polypeptides of the present invention may likewise be involved in cellular activation 
or in one of the other physiological pathways described herein. 

16 

4.10.1 RESEARCH USES AND UTILITIES 

The polynucleotides provided by the present invention can be used by the research 
community for various purposes. The polynucleotides can be used to express recombinant 

20 protein for analysis, characterization or therapeutic use; as markers for tissues in which the 

corresponding protein is preferentially expressed (either constitutively or at a particular stage of 
tissue differentiation or development or in disease states); as molecular weight markers on gels; 
as chromosome markers or tags (when labeled) to identify chromosomes or to map related gene 

24 positions; to compare with endogenous DNA sequences in patients to identify potential genetic 
disorders; as probes to hybridize and thus discover novel, related DNA sequences; as a source of 
information to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" known 
sequences in the process of discovering other novel polynucleotides; for selecting and making 

28 oligomers for attachment to a "gene chip" or other support, including for examination of 

expression patterns; to raise anti-protein antibodies using DNA immunization techniques; and as 
an antigen to raise anti-DNA antibodies or elicit another immune response. Where the 
polynucleotide encodes a protein which binds or potentially binds to another protein (such as, for 

32 example, in a receptor-ligand interaction), the polynucleotide can also be used in interaction trap 
assays (such as, for example, that described in Gyuris et al., Cell 75:791-803 (1993)) to identify 
polynucleotides encoding the other protein with which binding occurs or to identify inhibitors of 
the binding interaction. 
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The polypeptides provided by the present invention can similarly be used in assays to 
determine biological activity, including in a panel of multiple proteins for high-throughput 
screening; to raise antibodies or to elicit another immune response; as a reagent (including the 
labeled reagent) in assays designed to quantitatively determine levels of the protein (or its 
receptor) in biologfcal fluids; as markers for tissues in which the corresponding polypeptide is 
preferentially expressed (either constitutive! y or at a particular stage of tissue differentiation or 
development or in a disease state); and, of course, to isolate correlative receptors or ligands. 
Proteins involved in these binding interactions can also be used to screen for peptide or small 
molecule inhibitors or agonists of the binding interaction. 

Any or all of these research utilities are capable of being developed into reagent grade or 
kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in the art. 
References disclosing such methods include without limitation "Molecular Cloning: A 
Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. Fritsch 
and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to Molecular Cloning 
Techniques", Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987. 

4.10.2 NUTRITIONAL USES 

Polynucleotides and polypeptides of the present invention can also be used as nutritional 
sources or supplements. Such uses include without limitation use as a protein or amino acid 
supplement, use as a carbon source, use as a nitrogen source and use as a source of carbohydrate. In 
such cases the polypeptide or polynucleotide of the invention can be added to the feed of a 
particular organism or can be administered as a separate solid or liquid preparation, such as in the 
form of powder, pills, solutions, suspensions or capsules. In the case of microorganisms, the 
polypeptide or polynucleotide of the invention can be added to the medium in or on which the 
microorganism is cultured. 

4.10.3 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION 
ACTIVITY 

A polypeptide of the present invention may exhibit activity relating to cytokine, cell 
proliferation (either inducing or inhibiting) or cell differentiation (either inducing or inhibiting) 
activity or may induce production of other cytokines in certain cell populations. A 
polynucleotide of the invention can encode a polypeptide exhibiting such attributes. Many 
protein factors discovered to date, including all known cytokines, have exhibited activity in one 
or more factor-dependent cell proliferation assays, and hence the assays serve as a convenient 
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confirmation of cytokine activity. The activity of therapeutic compositions of the present 
invention is evidenced by any one of a number of routine factor dependent cell proliferation 
assays for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/1 1, BaF3, 
4 MC9/G, M+(preB M+), 2E8, RB5, DAI, 123, Tl 165, HT2, CTLL2, TF-1, Mo7e, CMK, 
HUVEC, and Caco. Therapeutic compositions of the invention can be used in the following: 

Assays for T-cell or thymocyte proliferation include without limitation those described 
in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. 
8 M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, 
In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in 
Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Bertagnolli et al., J. Immunol. 
145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology 133:327-341, 1991; Bertagnolli, 

12 et al., I. Immunol. 149:3778-3783, 1992; Bowman et al., I. Immunol. 152:1756-1761, 1994. 

Assays for cytokine production and/or proliferation of spleen cells, lymph node cells or 
thymocytes include, without limitation, those described in: Polyclonal T cell stimulation, 
Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. J. E. e.a. Coligan 

16 eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and Measurement of mouse 
and human interleukin-y, Schreiber, R. D. In Current Protocols in Immunology. J. E. e.a. Coligan 
eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells 

20 include, without limitation, those described in: Measurement of Human and Murine Interleukin 2 
and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current Protocols in 
Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons^ Toronto. 1991; 
deVries et al., J. Exp. Med. 173:1205-1211, 1991; Moreau et al., Nature 336:690-692, 1988; 

24 Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:293 1-2938, 1 983; Measurement of mouse 

and human interleukin 6~Nordan, R. In Current Protocols in Immunology. J. E. Coligan eds. Vol 
I pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; Smith et al., Proc. Natl. Aced. Sci. 
U.S.A. 83:1857-1861, 1986; Measurement of human Interleukin 1 1 --Bennett, F., Giannotti, J., 

28 Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 
6.15.1 John Wiley and Sons, Toronto. 1991; Measurement of mouse and human Interleukin 
9— Ciarletta, A., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. 
J. E. Coligan eds. Vol 1 pp. 6.13.1, John Wiley and Sons, Toronto. 1991. 

32 Assays for T-cell clone responses to antigens (which will identify, among others, proteins 

that affect APC-T cell interactions as well as direct T-cell effects by measuring proliferation and 
cytokine production) include, without limitation, those described in: Current Protocols in 
Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W Strober, 
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Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse 
Lymphocyte Function; Chapter 6, Cytokines and their cellular receptors; Chapter 7, 
Immunologic studies in Humans); Weinberger et al., Proc. Natl. Acad. Sci. USA 77:6091-6095, 
1980; Weinberger et al., Eur. J. Immun. 11 :405-41 1, 1981; Talcai et al., J. Immunol. 
137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988. 

4.10.4 STEM CELL GROWTH FACTOR ACTIVITY 

A polypeptide of the present invention may exhibit stem cell growth factor activity and 
be involved in the proliferation, differentiation and survival of pluripotent and totipotent stem 
cells including primordial germ cells, embryonic stem cells, hematopoietic stem cells and/or 
germ line stem cells. Administration of the polypeptide of the invention to stem cells in vivo or 
ex vivo is expected to maintain and expand cell populations in a totipotential or pluripotential 
state which would be useful for re-engineering damaged or diseased tissues, transplantation, 
manufacture of bio-pharmaceuticals and the development of bio-sensors. The ability to produce 
large quantities of human cells has important working applications for the production of human 
proteins which currently must be obtained from non-human sources or donors, implantation of 
cells to treat diseases such as Parkinson's, Alzheimer's and other neurodegenerative diseases; 
tissues for grafting such as bone marrow, skin, cartilage, tendons, bone, muscle (including 
cardiac muscle), blood vessels, cornea, neural cells, gastrointestinal cells and others; and organs 
for transplantation such as kidney, liver, pancreas (including islet cells), heart and lung. 

It is contemplated that multiple different exogenous growth factors and/or cytokines may 
be administered in combination with the polypeptide of the invention to achieve the desired 
effect, including any of the growth factors listed herein, other stem cell maintenance factors, and 
specifically including stem cell factor (SCF), leukemia inhibitory factor (LIF), Flt-3 ligand (Flt- 
3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL-6, macrophage 
inflammatory protein 1 -alpha (MIP-1 -alpha), G-CSF, GM-CSF, thrombopoietin (TPO), platelet 
factor 4 (PF-4), platelet-derived growth factor (PDGF), neural growth factors and basic 
fibroblast growth factor (bFGF). 

Since totipotent stem cells can give rise to virtually any mature cell type, expansion of 
these cells in culture will facilitate the production of large quantities of mature cells. Techniques 
for culturing stem cells are known in the art and administration of polypeptides of the invention, 
optionally with other growth factors and/or cytokines, is expected to enhance the survival and 
proliferation of the stem cell populations. This can be accomplished by direct administration of 
the polypeptide of the invention to the culture medium. Alternatively, stroma cells transfected 
with a polynucleotide that encodes for the polypeptide of the invention can be used as a feeder 
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layer for the stem cell populations in culture or in vivo. Stromal support cells for feeder layers 
may include embryonic bone marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or 
cultured embryonic fibroblasts (see U.S. Patent No. 5,690,926). 
4 Stem cells themselves can be transfected with a polynucleotide of the invention to induce 

autocrine expression of the polypeptide of the invention. This will allow for generation of 
undifferentiated totipotential/pluripotential stem cell lines that are useful as is or that can then be 
differentiated into the desired mature cell types. These stable cell lines can also serve as a source 
8 of undifferentiated totipotential/pluripotential mRNA to create cDNA libraries and templates for 
polymerase chain reaction experiments. These studies would allow for the isolation and 
identification of differentially expressed genes in stem cell populations that regulate stem cell 
proliferation and/or maintenance. 

12 Expansion and maintenance of totipotent stem cell populations will be useful in the 

treatment of many pathological conditions. For example, polypeptides of the present invention 
may be used to manipulate stem cells in culture to give rise to neuroepithelial cells that can be 
used to augment or replace cells damaged by illness, autoimmune disease, accidental damage or 

16- genetic disorders. The polypeptide of the invention may be useful for inducing the proliferation 
of neural cells and for the regeneration of nerve and brain tissue, i.e. for the treatment of central 
and peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic 
disorders which involve degeneration, death or trauma to neural cells or nerve tissue. In addition, 

20 the expanded stem cell populations can also be genetically altered for gene therapy purposes and 
to decrease host rejection of replacement tissues after grafting or implantation. 

Expression of the polypeptide of the invention and its effect on stem cells can also be 
manipulated to achieve controlled differentiation of the stem cells into more differentiated cell 

24 types. A broadly applicable method of obtaining pure populations of a specific differentiated 
cell type from undifferentiated stem cell populations involves the use of a cell-type specific 
promoter driving a selectable marker. The selectable marker allows only cells of the desired type 
to survive. For example, stem cells can be induced to differentiate into cardiomyocytes (Wobus 

28 et al., Differentiation, 48: 173-182, (1991); Klug et al., J. Clin. Invest., 98(1): 216-224, (1998)) 
or skeletal muscle cells (Browder, L. W. In: Principles of Tissue Engineering eds. Lanza et al., 
Academic Press (1997)). Alternatively, directed differentiation of stem cells can be 
accomplished by culturing the stem cells in the presence of a differentiation factor such as 

32 retinoic acid and an antagonist of the polypeptide of the invention which would inhibit the 
effects of endogenous stem cell factor activity and allow differentiation to proceed. 

In vitro cultures of stem cells can be used to determine if the polypeptide of the invention 
exhibits stem cell growth factor activity. Stem cells are isolated from any one of various cell 
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sources (including hematopoietic stem cells and embryonic stem cells) and cultured on a feeder 
layer, as described by Thompson et al! Proc. Natl. Acad. Sci, U.S.A., 92: 7844-7848 (1 995), in 
the presence of the polypeptide of the invention alone or in combination with other growth 
4 factors or cytokines. The ability of the polypeptide of the invention to induce stem cells 
proliferation is determined by colony formation on semi-solid support e.g. as described by 
Bernstein et al., Blood, 77: 2316-2321 (1991). 

8 4.10.5 HEMATOPOIESIS REGULATING ACTIVITY 

A polypeptide of the present invention may be involved in regulation of hematopoiesis 
and, consequently, in the treatment of myeloid or lymphoid cell disorders. Even marginal 
biological activity in support of colony forming cells or of factor-dependent cell lines indicates 

12 involvement in regulating hematopoiesis, e.g. in supporting the growth and proliferation of 
erythroid progenitor cells alone or in combination with other cytokines, thereby indicating 
utility, for example, in treating various anemias or for use in conjunction with 
irradiation/chemotherapy to stimulate the production of erythroid precursors and/or erythroid 

16 cells; in supporting the growth and proliferation of myeloid cells such as granulocytes and 

monocytes/macrophages (i.e., traditional CSF activity) useful, for example, in conjunction with 
chemotherapy to prevent or treat consequent myelo-suppression; in supporting the growth and 
proliferation of megakaryocytes and consequently of platelets thereby allowing prevention or 

20 treatment of various platelet disorders such as thrombocytopenia, and generally for use in place 
of or complimentary to platelet transfusions; and/or in supporting the growth and proliferation of 
hematopoietic stem cells which are capable of maturing to any and all of the above-mentioned 
hematopoietic cells and therefore find therapeutic utility in various stem cell disorders (such as 

24 those usually treated with transplantation, including, without limitation, aplastic anemia and 
paroxysmal nocturnal hemoglobinuria), as well as in repopulating the stem cell compartment 
post irradiation/chemotherapy, either in-vivo or ex-vivo (i.e., in conjunction with bone marrow 
transplantation or with peripheral progenitor cell transplantation (homologous or heterologous)) 

28 as normal cells or genetically manipulated for gene therapy. 

Therapeutic compositions of the invention can be used in the following: 
Suitable assays for proliferation and differentiation of various hematopoietic lines are 
cited above. 

32 Assays for embryonic stem cell differentiation (which will identify, among others, 

proteins that influence embryonic differentiation hematopoiesis) include, without limitation, 
those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et al., Molecular 
and Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 81:2903-2915, 1993. 
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Assays for stem cell survival and differentiation (which will identify, among others, 
proteins that regulate lympho-hematopoiesis) include, without limitation, those described in: 
Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. R. I. 

4 Freshney, et al. eds. Vol pp. 265-268, Wiley -Liss, Inc., New York, N.Y. 1994; Hirayama et al., 
Proc. Natl. Acad. Sci. USA 89:5907-591 1, 1992; Primitive hematopoietic colony forming cells 
with high proliferative potential, McNiece, I. K. and Briddell, R. A. In Culture of Hematopoietic 
Cells. R. I. Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., New York, N. Y. 1994; Neben et 

8 al., Experimental Hematology 22:353-359, 1994; Cobblestone area forming cell assay, 

Ploemacher, R. E. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 1-21, 
Wiley-Liss, Inc., New York, N.Y. 1994; Long term bone marrow cultures in the presence of 
stromal cells, Spooncer, E., Dexter, M. and Allen, T. In Culture of Hematopoietic Cells. R. I. 
12 Freshney, et al. eds. Vol pp. 163-179, Wiley-Liss, Inc., New York, N.Y. 1994; Long term culture 
initiating cell assay, Sutherland, H. J. In Culture of Hematopoietic Cells. R. I. Freshney, et al. 
eds. Vol pp. 139-162, Wiley-Liss, Inc., New York, N.Y. 1994. 

16 4.10.6 TISSUE GROWTH ACTIVITY 

A polypeptide of the present invention also may be involved in bone, cartilage, tendon, 
ligament and/or nerve tissue growth or regeneration, as well as in wound healing and tissue 
- repair and replacement, and in healing of burns, incisions and ulcers. 

20 A polypeptide of the present invention which induces cartilage and/or bone growth in 

circumstances where bone is not normally formed, has application in the healing of bone 
fractures and cartilage damage or defects in humans and other animals. Compositions of a 
polypeptide, antibody, binding partner, or other modulator of the invention may have 

24 prophylactic use in closed as well as open fracture reduction and also in the improved fixation of 
artificial joints. De novo bone formation induced by an osteogenic agent contributes to the 
repair of congenital, trauma induced, or oncologic resection induced craniofacial defects, and 
also is useful in cosmetic plastic surgery. 

28 A polypeptide of this invention may also be involved in attracting bone-forming cells, 

stimulating growth of bone-forming cells, or inducing differentiation of progenitors of 
bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or 
periodontal disease, such as through stimulation of bone and/or cartilage repair or by blocking 

32 inflammation or processes of tissue destruction (collagenase activity, osteoclast activity, etc.) 
mediated by inflammatory processes may also be possible using the composition of the 
invention. 
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Another categoiy of tissue regeneration activity that may involve the polypeptide of the 
present invention is tendon/ligament formation. Induction of tendon/ligament-like tissue or 
other tissue formation in circumstances where such tissue is not nonnally formed, has 
application in the healing of tendon or ligament tears, deformities and other tendon or l igament 
defects in humans and other animals. Such a preparation employing a tendon/ligament-like 
tissue inducing protein may have prophylactic use in preventing damage to tendon or ligament 
tissue, as well as use in the improved fixation of tendon or ligament to bone or other tissues, and 
in repairing defects to tendon or ligament tissue. De novo tendon/ligament-like tissue formation 
induced by a composition of the present invention contributes to the repair of congenital, trauma 
induced, or other tendon or ligament defects of other origin, and is also useful in cosmetic plastic 
surgery for attachment or repair of tendons or ligaments. The compositions of the present 
invention may provide environment to attract tendon- or ligament-forming cells, stimulate 
growth of tendon- or ligament- forming cells, induce differentiation of progenitors of tendon- or 
ligament-fonning cells, or induce growth of tendon/ligament cells or progenitors ex vivo for 
return in vivo to effect tissue repair. The compositions of the invention may also be useful in the 
treatment of tendinitis, carpal tunnel syndrome and other tendon or ligament defects. The 
compositions may also include an appropriate matrix and/or sequestering agent as a carrier as is 
well known in the art. 

The compositions of the present invention may also be useful for proliferation of neural 
cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central and peripheral 
nervous system diseases and neuropathies, as well as mechanical and traumatic disorders, which 
involve degeneration, death or trauma to neural cells or nerve tissue. More specifically, a 
composition may be used in the treatment of diseases of the peripheral nervous system, such as 
peripheral nerve injuries, peripheral neuropathy and localized neuropathies, and central nervous 
system diseases, such as Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 
lateral sclerosis, and Shy-Drager syndrome. Further conditions which may be treated in 
accordance with the present invention include mechanical and traumatic disorders, such as spinal 
cord disorders, head trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies 
resulting from chemotherapy or other medical therapies may also be treatable using a 
composition of the invention. 

Compositions of the invention may also be useful to promote better or faster closure of 
non-healing wounds, including without limitation pressure ulcers, ulcers associated with vascular 
insufficiency, surgical and traumatic wounds, and the like. 

Compositions of the present invention may also be involved in the generation or 
regeneration of other tissues, such as organs (including, for example, pancreas, liver, intestine, 
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kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular (including vascular 
endothelium) tissue, or for promoting the growth of cells comprising such tissues. Part of the 
desired effects may be by inhibition or modulation of fibrotic scarring may allow normal tissue 
to regenerate. A polypeptide of the present invention may also exhibit angiogenic activity. 

A composition of the present invention may also be useful for gut protection or 
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and 
conditions resulting from systemic cytokine damage. 

A composition of the present invention may also be useful for promoting or inhibiting 
differentiation of tissues described above from precursor tissues or cells; or for inhibiting the 
growth of tissues described above. 

Therapeutic compositions of the invention can be used in the following: 
Assays for tissue generation activity include, without limitation, those described in: 
International Patent Publication No. WO95/16035 (bone, cartilage, tendon); International Patent 
Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No. 
W09 1/07491 (skin, endothelium). 

Assays for wound healing activity include, without limitation, those described in: Winter, 
Epidermal Wound Healing, pps. 71-112 (Maibach, H. I. and Rovee, D. T., eds.), Year Book 
Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. Dermatol 
71:382-84(1978). 

4.10.7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 

A polypeptide of the present invention may also exhibit immune stimulating or immune 
suppressing activity, including without limitation the activities for which assays are described 
herein. A polynucleotide of the invention can encode a polypeptide exhibiting such activities. A 
protein may be useful in the treatment of various immune deficiencies and disorders (including 
severe combined immunodeficiency (SCID)), e.g. , in regulating (up or down) growth and 
proliferation of T and/or B lymphocytes, as well as effecting the cytolytic activity of NK cells 
and other cell populations. These immune deficiencies may be genetic or be caused by viral (e.g. , 
HIV) as well as bacterial or fungal infections, or may result from autoimmune disorders. More 
specifically, infectious diseases causes by viral, bacterial, fungal or other infection may be 
treatable using a protein of the present invention, including infections by HIV, hepatitis viruses, 
herpes viruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections such 
as candidiasis. Of course, in this regard, proteins of the present invention may also be useful 
where a boost to the immune system generally may be desirable, i.e., in the treatment of cancer. 
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Autoimmune disorders which may be treated using a protein of the present invention 
include, for example, connective tissue disease, multiple sclerosis, systemic lupus erythematosus, 
rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre syndrome, 
autoimmune thyroiditis, insulin dependent diabetes mellitis, myasthenia gravis, graft-versus-host 
disease and autoirrimune inflammatory eye disease. Such a protein (or antagonists. thereof, 
including antibodies) of the present invention may also to be useful in the treatment of allergic 
reactions and conditions (e.g., anaphylaxis, serum sickness, drug reactions, food allergies, insect 
venom allergies, mastocytosis, allergic rhinitis, hypersensitivity pneumonitis, urticaria, 
angioedema, eczema, atopic dermatitis, allergic contact dermatitis, erythema multiforme, 
Stevens- Johnson syndrome, allergic conjunctivitis, atopic keratoconjunctivitis, venereal 
keratoconjunctivitis, giant papillary conjunctivitis and contact allergies), such as asthma 
(particularly allergic asthma) or other respiratory problems. Other conditions, in which immune 
suppression is desired (including, for example, organ transplantation), may also be treatable 
using a protein (or antagonists thereof) of the present invention. The therapeutic effects of the 
polypeptides or antagonists thereof on allergic reactions can be evaluated by in vivo animals 
models such as the cumulative contact enhancement test (Lastbom et al., Toxicology 125: 59-66, 
1998), skin prick test (Hoffmann et al., Allergy 54: 446-54, 1999), guinea pig skin sensitization 
test (Vohr et al., Arch. Toxocol. 73: 501-9), and murine local lymph node assay (Kimber et al., 
J. Toxicol. Environ. Health 53: 563-79). 

Using the proteins of the invention it may also be possible to modulate immune 
responses, in a number of ways. Down regulation may be in the form of inhibiting or blocking an 
immune response already in progress or may involve preventing the induction of an immune 
response. The functions of activated T cells may be inhibited by suppressing T cell responses or 
by inducing specific tolerance in T cells, or both. Immunosuppression of T cell responses is 
generally an active, non-antigen-specific, process which requires continuous exposure of the T 
cells to the suppressive agent. Tolerance, which involves inducing non-responsiveness or anergy 
in T cells, is distinguishable from immunosuppression in that it is generally antigen-specific and 
persists after exposure to the tolerizing agent has ceased. Operationally, tolerance can be 
demonstrated by the lack of a T cell response upon reexposure to specific antigen in the absence 
of the tolerizing agent. 

Down regulating or preventing one or more antigen functions (including without 
limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing high 
level lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin and 
organ transplantation and in graft-versus-host disease (GVHD). For example, blockage of T cell 
function should result in reduced tissue destruction in tissue transplantation. Typically, in tissue 
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transplants, rejection of the transplant is initiated through its recognition as foreign by T cells, 
followed by an imniune reaction that destroys the transplant. The administration of a therapeutic 
composition of the invention may prevent cytokine synthesis by immune cells, such as T cells, 
and thus acts as an immunosuppressant. Moreover, a lack of costimulation may also be sufficient 
to anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term tolerance 
by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated administration 
of these blocking reagents. To achieve sufficient immunosuppression or tolerance in a subject, it 
may also be necessary to block the function of a combination of B lymphocyte antigens. 

The efficacy of particular therapeutic compositions in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
humans. Examples of appropriate systems which can be used include allogeneic cardiac grafts in 
rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been used to examine 
the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as described in Lenschow et 
al., Science 257:789-792 (1992) and Turka et al., Proc. Natl. Acad. Sci USA, 89:1 1 102-1 1 105 
(1992). In addition, murine models of GVHD (see Paul ed., Fundamental Immunology, Raven 
Press, New York, 1989, pp. 846-847) can be used to determine the effect of therapeutic 
compositions of the invention on the development of that disease. 

Blocking antigen function may also be therapeutically useful for treating autoimmune 
diseases. Many autoimmune disorders are the result of inappropriate activation of T cells that are 
reactive against self tissue and which promote the production of cytokines and autoantibodies 
involved in the pathology of the diseases. Preventing the activation of autoreactive T cells may 
reduce or eliminate disease symptoms. Administration of reagents which block stimulation of T 
cells can be used to inhibit T cell activation and prevent production of autoantibodies or T 
cell-derived cytokines which may be involved in the disease process. Additionally, blocking 
reagents may induce antigen-specific tolerance of autoreactive T cells which could lead to 
long-term relief from the disease. The efficacy of blocking reagents in preventing or alleviating 
autoimmune disorders can be determined using a number of well-characterized animal models of 
human autoimmune diseases. Examples include murine experimental autoimmune encephalitis, 
systemic lupus erythmatosis in MRL/lpr/lpr mice or NZB hybrid mice, murine autoimmune 
collagen arthritis, diabetes mellitus in NOD mice and BB rats, and murine experimental 
myasthenia gravis (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 
840-856). 

Upregulation of an antigen function (e.g., a B lymphocyte antigen function), as a means 
of up regulating immune responses, may also be useful in therapy. Upregulation of immune 
responses may be in the form of enhancing an existing immune response or eliciting an initial 
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immune response. For example, enhancing an immune response may be useful in cases of viral 
infection, including systemic viral diseases such as influenza, the common cold, and 
encephalitis. 

Alternatively, anti-viral immune responses may be enhanced in an infected patient by 
removing T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed 
APCs either expressing a peptide of the present invention or together with a stimulatory form of 
a soluble peptide of the present invention and reintroducing the in vitro activated T cells into the 
patient. Another method of enhancing anti-viral immune responses would be to isolate infected 
cells from a patient, transfect them with a nucleic acid encoding a protein of the present 
invention as described herein such that the cells express all or a portion of the protein on their 
surface, and reintroduce the transfected cells into the patient. The infected cells would now be 
capable of delivering a costimulatory signal to, and thereby activate, T cells in vivo. 

A polypeptide of the present invention may provide the necessary stimulation signal to T 
cells to induce a T cell mediated immune response against the transfected tumor cells. In 
addition, tumor cells which lack MHC class I or MHC class II molecules, or which fail to 
reexpress sufficient mounts of MHC class I or MHC class II molecules, can be transfected with 
nucleic acid encoding all or a portion of (e.g., a cytoplasmic-domain truncated portion) of an 
MHC class I alpha chain protein and {3 2 microglobulin protein or an MHC class II alpha chain 
protein and an MHC class II beta chain protein to thereby express MHC class I or MHC class II 
proteins on the cell surface. Expression of the appropriate class I or class II MHC in conjunction 
with a peptide having the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a T 
cell mediated immune response against the transfected tumor cell. Optionally, a gene encoding 
an antisense construct which blocks expression of an MHC class II associated protein, such as 
the invariant chain, can also be cotransfected with a DNA encoding a peptide having the activity 
of a B lymphocyte antigen to promote presentation of tumor associated antigens and induce 
tumor specific immunity. Thus, the induction of a T cell mediated immune response in a human 
subject may be sufficient to overcome tumor-specific tolerance in the subject. 

The activity of a protein of the invention may, among other means, be measured by the 
following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation, 
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kxuisbeek, D. 
H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 
Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA 
78:2488-2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J. 
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Immunol. 135:1564-1572, 1985; Takai et al., I. Immunol. 137:3494-3500, 1986;Takai et al., J. 
Immunol. 140:508-512, 1988; Bowman et al., J. Virology 61 :1992-1998; Bertagnolli et al., 
Cellular Immunology 133:327-341, 1991; Brown etal., J. Immunol. 153:3079-3092, 1994. 

Assays for T-cell-dependent immunoglobulin responses and isotype switching (which 
will identify, among others, proteins that modulate T-cell dependent antibody responses and that 
affect Thl/Th2 profiles) include, without limitation, those described in: Maliszewski, J. 
Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro antibody production, 
Mond, J. J: and Brunswick, M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 
pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, proteins 
that generate predominantly Thl and CTL responses) include, without limitation, those described 
in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. 
M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, 
In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in 
Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Takai etal., J. Immunol. 140:508-512, 
1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins expressed 
by dendritic cells that activate naive T-cells) include, without limitation, those described in: 
Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of Experimental Medicine 
173:549-559, 1 991 ; Macatonia et al., Journal of Immunology 154:5071-5079, 1995; Porgador et 
al.. Journal of Experimental Medicine 182:255-260, 1995; Nair et al., Journal of Virology 
67:4062-4069, 1993; Huang et al., Science 264:961-965, 1 994; Macatonia et al., Journal of 
Experimental Medicine 169:1255-1264, 1989; Bhardwaj et al., Journal of Clinical Investigation 
94:797-807, 1994; and Inaba et al., Journal of Experimental Medicine 172:631-640, 1990. 

Assays for lymphocyte survival/apoptosis (which will identify, among others, proteins 
that prevent apoptosis after superantigen induction and proteins that regulate lymphocyte 
homeostasis) include, without limitation, those described in: Darzynkiewicz et al,, Cytometry 
13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et al., Cancer Research 
53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, Journal of Immunology 
145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; Gorczyca et al., International 
Journal of Oncology 1:639-648, 1992. 

Assays for proteins that influence early steps of T-cell commitment and development 
include, without limitation, those described in: Antica et al., Blood 84:1 1 1-1 17, 1994; Fine et al., 
Cellular Immunology 155:111-122, 1994; Galy etal., Blood 85:2770-2778, 1995; Toki et al., 
Proc. Nat. Acad Sci. USA 88:7548-7551, 1991. 
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4.10.8 ACTIVIN/INHIBIN ACTIVITY 

A polypeptide of the present invention may also exhibit activin- or inhibin-related 
activities. A polynucleotide of the invention may encode a polypeptide exhibiting such 
characteristics. Inhibins are characterized by their ability to inhibit the release of follicle 
stimulating hormone (FSH), while activins and are characterized by their ability to stimulate the 
release of follicle stimulating hormone (FSH). Thus, a polypeptide of the present invention, 
alone or in heterodimers with a member of the inhibin family, may be useful as a contraceptive 
based on the ability of inhibins to decrease fertility in female mammals and decrease 
spermatogenesis in male mammals. Administration of sufficient amounts of other inhibins can 
induce infertility in these mammals. Alternatively, the polypeptide of the invention, as a 
homodimer or as a heterodimer with other protein subunits of the inhibin group, may be useful as 
a fertility inducing therapeutic, based upon the ability of activin molecules in stimulating FSH 
release from cells of the anterior pituitary. See, for example, U.S. Pat. No. 4,798,885. A 
polypeptide of the invention may also be useful for advancement of the onset of fertility in 
sexually immature mammals, so as to increase the lifetime reproductive performance of domestic 
animals such as, but not limited to, cows, sheep and pigs. 

The activity of a polypeptide of the invention may, among other means, be measured by 
the following methods. 

Assays for activin/inhibin activity include, without limitation, those described in: Vale et 
al., Endocrinology 91:562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale et ah, Nature 
321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., Proc. Natl. Acad. Sci. 
USA 83:3091-3095, 1986. 

4.10.9 CHEMOTACTIC/CHEMOKINETIC ACTIVITY 

A polypeptide of the present invention may be involved in chemotactic or chemokinetic 
activity for mammalian cells, including, for example, monocytes, fibroblasts, neutrophils, 
T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. A polynucleotide of the 
invention can encode a polypeptide exhibiting such attributes. Chemotactic and chemokinetic 
receptor activation can be used to mobilize or attract a desired cell population to a desired site of 
action. Chemotactic or chemokinetic compositions (e.g. proteins, antibodies, binding partners, or 
modulators of the invention) provide particular advantages in treatment of wounds and other 
trauma to tissues, as well as in treatment of localized infections. For example, attraction of 
lymphocytes, monocytes or neutrophils to tumors or sites of infection may result in improved 
immune responses against the tumor or infecting agent. 
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A protein or peptide has chemotactic activity for a particular cell population if it can 
stimulate, directly or indirectly, the directed orientation or movement of such cell population. 
Preferably, the protein or peptide has the ability to directly stimulate directed movement of cells. 
4 Whether a particular protein has chemotactic activity for a population of cells can be readily 
determined by employing such protein or peptide in any known assay for cell chemotaxis. 
Therapeutic compositions of the invention can be used in the following: 
Assays for chemotactic activity (which will identify proteins that induce or prevent 
8 chemotaxis) consist of assays that measure the ability of a protein to induce the migration of 
cells across a membrane as well as the ability of a protein to induce the adhesion of one cell 
population to another cell population. Suitable assays for movement and adhesion include, 
without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. 
12 M. Kruisbeek, D. H. Marguiles, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates 
and Wiley-Interscience (Chapter 6.12, Measurement of alpha and beta Chemokines 
6.12.1-6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. APMIS 103:140-146, 
1995; Muller et al Eur. J. Immunol. 25:1744-1748; Gruber et al. J. of Immunol. 152:5860-5867, 
16 1994; Johnston et al. J. of Immunol. 153 :1762-1768, 1994. 

4.10.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY 

A polypeptide of the invention may also be involved in hemostatis or thrombolysis or 
20 thrombosis. A polynucleotide of the invention can encode a polypeptide exhibiting such 

attributes. Compositions may be useful in treatment of various coagulation disorders (including 

hereditary disorders, such as hemophilias) or to enhance coagulation and other hemostatic events 

in treating wounds resulting from trauma, surgery or other causes. A composition of the 
24 invention may also be useful for dissolving or inhibiting formation of thromboses and for 

treatment and prevention of conditions resulting therefrom (such as, for example, infarction of 

cardiac and central nervous system vessels (e.g., stroke). 

• Therapeutic compositions of the invention can be used in the following: 
28 Assay for hemostatic and thrombolytic activity include, without limitation, those 

described in: Linet et al, J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al, Thrombosis Res. 

45:413-419, 1987; Humphrey etal., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 

35:467-474, 1988. 

32 

4.10.11 CANCER DIAGNOSIS AND THERAPY 

Polypeptides of the invention may be involved in cancer cell generation, proliferation or 
metastasis. Detection of the presence or amount of polynucleotides or polypeptides of the 
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invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. For 
example, the presence or increased expression of a polynucleotide/polypeptide of the invention 
may indicate a hereditary risk of cancer, a precancerous condition, or an ongoing malignancy. 
4 Conversely, a defect in the gene or absence of the polypeptide may be associated with a cancer 
condition. Identification of single nucleotide polymorphisms associated with cancer or a 
predisposition to cancer may also be useful for diagnosis or prognosis. 

Cancer treatments promote tumor regression by inhibiting tumor cell proliferation, 
8 inhibiting angiogenesis (growth of new blood vessels that is necessary to support tumor growth) 
and/or prohibiting metastasis by reducing tumor cell motility or invasiveness. Therapeutic 
compositions of the invention may be effective in adult and pediatric oncology including in solid 
phase tumors/malignancies, locally advanced tumors, human soft tissue sarcomas, metastatic 

12 cancer, including lymphatic metastases, blood cell malignancies including multiple myeloma, 
acute and chronic leukemias, and lymphomas, head and neck cancers including mouth cancer, 
larynx cancer and thyroid cancer, lung cancers including small cell carcinoma and non-small cell 
cancers, breast cancers including small cell carcinoma and ductal carcinoma, gastrointestinal 

16 cancers including esophageal cancer, stomach cancer, colon cancer, colorectal cancer and polyps 
associated with colorectal neoplasia, pancreatic cancers, liver cancer, urologic cancers including 
bladder cancer and prostate cancer, malignancies of the female genital tract including ovarian 
carcinoma, uterine (including endometrial) cancers, and solid tumor in the ovarian follicle, 

20 kidney cancers including renal cell carcinoma, brain cancers including intrinsic brain tumors, 
neuroblastoma, astrocytic brain tumors, gliomas, metastatic tumor cell invasion in the central 
nervous system, bone cancers including osteomas, skin cancers including malignant melanoma, 
tumor progression of human skin keratinocytes, squamous cell carcinoma, basal cell carcinoma, 

24 hemangiopericytoma and Karposi's sarcoma. 

Polypeptides, polynucleotides, or modulators of polypeptides of the invention (including 
inhibitors and stimulators of the biological activity of the polypeptide of the invention) may be 
administered to treat cancer. Therapeutic compositions can be administered in therapeutically 

28 effective dosages alone or in combination with adjuvant cancer therapy such as surgery, 

chemotherapy, radiotherapy, thermotherapy, and laser therapy, and may provide a beneficial 
effect, e.g. reducing tumor size, slowing rate of tumor growth, inhibiting metastasis, or otherwise 
improving overall clinical condition, without necessarily eradicating the cancer. 

32 The composition can also be administered in therapeutically effective amounts as a 

portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the polypeptide or 
modulator of the invention with one or more anti-cancer drugs in addition to a pharmaceutically 
acceptable carrier for delivery. The use of anti-cancer cocktails as a cancer treatment is routine. 
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Anti-cancer drugs that are well known in the art and can be used as a treatment in combination 
with the polypeptide or modulator of the invention include: Actinomycin D, Aminoglutethimide, 
Asparaginase, Bleomycin, Busulfan, Carboplatin, Carmustine, Chlorambucil, Cisplatin (cis- 
DDP), Cyclophosphamide, Cytarabine HC1 (Cytosine arabinoside), Dacarbazine, Dactinomycin, 
Daunorubicin HC1, Doxorubicin HC1, Estramustine phosphate sodium, Etoposide (VI 6-2 13), 
Floxuridine, 5-Fluorouracil (5-Fu), Flutamide, Hydroxyurea (hydroxycarbamide), Ifosfamide, 
Interferon AIpha-2a, Interferon Alpha-2b, Leuprolide acetate (LHRH-releasing factor analog), 
Lpmusrine, Mechlorethamine HC1 (nitrogen mustard), Melphalan, Mercaptopurine, Mesna, 
Methotrexate (MTX), Mitomycin, Mitoxantrone HC1, Octreotide, Plicamycin, Procarbazine HC1, 
Streptozocin, Tamoxifen citrate, Thioguanine, Thiotepa, Vinblastine sulfate. Vincristine sulfate, 
Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2, Mitoguazone, Pentostatin, 
Semustine, Teniposide, and Vindesine sulfate. 

In addition, therapeutic compositions of the invention may be used for prophylactic 
treatment of cancer. There are hereditary, conditions and/or environmental situations (e.g. 
exposure to carcinogens) known in the art that predispose an individual to developing cancers. 
Under these circumstances, it may be beneficial to treat these individuals with therapeutically 
effective doses of the polypeptide of the invention to reduce the risk of developing cancers. 

In vitro models can be used to determine the effective doses of the polypeptide of the 
invention as a potential cancer treatment. These in vitro models include proliferation assays of 
cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, (1987) Culture of 
Animal Cells: A Manual of Basic Technique, Wily-Liss, New York, NY Ch 1 8 and Ch 2 1), 
tumor systems in nude mice as described in Giovanella et al., J. Natl. Can. Inst., 52: 921-30 
(1974), mobility and invasive potential of tumor cells in Boyden Chamber assays as described in 
Pilkington et al., Anticancer Res., 17: 4107-9 (1997), and angiogenesis assays such as induction 
of vascularization of the chick chorioallantoic membrane or induction of vascular endothelial 
cell migration as described in Ribatta et al., Intl. J. Dev. Biol., 40: 1 189-97 (1999) and Li et al., 
Clin. Exp. Metastasis, 17:423-9 (1999), respectively. Suitable tumor cells lines are available, 
e.g. from American Type Tissue Culture Collection catalogs. 

4.10-12 RECEPTOR/LIGAND ACTIVITY 

A polypeptide of the present invention may also demonstrate activity as receptor, 
receptor Iigand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide of the 
invention can encode a polypeptide exhibiting such characteristics: Examples of such receptors 
and ligands include, without limitation, cytokine receptors and their ligands, receptor kinases and 
their ligands, receptor phosphatases and their ligands, receptors involved in cell-cell interactions 
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and their ligands (including without limitation, cellular adhesion molecules (such as selectins, 
integrins and their ligands) and receptor/ligand pairs involved in antigen presentation, antigen 
recognition and development of cellular and humoral immune responses. Receptors and ligands 
are also useful for screening of potential peptide or small molecule inhibitors of the relevant 
receptor/ligand interaction. A protein of the present invention (including, without limitation, 
fragments of receptors and ligands) may themselves be useful as inhibitors of receptor/ligand 
interactions. 

The activity of a polypeptide of the invention may, among other means, be measured by 
the following methods: 

Suitable assays for receptor-ligand activity include without limitation those described in: 
Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. 
Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley- Interscience (Chapter 7.28, 
Measurement of Cellular Adhesion under static conditions 7.28.1- 7.28.22), Takai et al., Proc. 
Natl. Acad. Sci. USA 84:6864-6868, 1987; Bierer et al., J. Exp. Med. 168:1145-1156, 1988; 
Rosenstein et al., J. Exp. Med. 169:149-160 1989; Stoltenborg et al., J. Immunol. Methods 
175:59-68, 1994; Stittet al., Cell 80:661-670, 1995. 

By way of example, the polypeptides of the invention may be used as a receptor for a 
ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may be identified 
through binding assays, affinity chromatography, dihybrid screening assays, BIAcore assays, gel 
overlay assays, or other methods known in the art. 

Studies characterizing drugs or proteins as agonist or antagonist or partial agonists or a 
partial antagonist require the use of other proteins as competing ligands. The polypeptides of the 
present invention or ligand(s) thereof may be labeled by being coupled to radioisotopes, 
colorimetric molecules or a toxin molecules by conventional methods. ("Guide to Protein 
Purification" Murray P. Deutscher (ed) Methods in Enzymology Vol. 182 (1990) Academic 
Press, Inc. San Diego). Examples of radioisotopes include, but are not limited to, tritium and 
carbon-14 . Examples of colorimetric molecules include, but are not limited to, fluorescent 
molecules such as fluorescamine, or rhodamine or other colorimetric molecules. Examples of 
toxins include, but are not limited, to ricin. 

4.10.13 DRUG SCREENING 

This invention is particularly useful for screening chemical compounds by using the 
novel polypeptides or binding fragments thereof in any of a variety of drug screening techniques. 
The polypeptides or fragments employed in such a test may either be free in solution, affixed to a 
solid support, borne on a cell surface or located intracellularly. One method of drug screening 
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utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant 
nucleic acids expressing the polypeptide or a fragment thereof Drugs are screened against such 
transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can 
be used for standard binding assays. One may measure, for example, the formation of 
complexes between polypeptides of the invention or fragments and the agent being tested or 
examine the diminution in complex formation between the novel polypeptides and an 
appropriate cell line, which are well known in the art. 

Sources for test compounds that may be screened for ability to bind to or modulate (i.e., 
increase or decrease) the activity of polypeptides of the invention include (1) inorganic and 
organic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of either random or mimetic peptides, oligonucleotides or organic molecules. 

Chemical libraries may be readily synthesized or purchased from a number of 
commercial sources, and may include structural analogs of known compounds or compounds 
that are identified as "hits" or "leads" via natural product screening. 

The sources of natural product libraries are microorganisms (including bacteria and 
fungi), animals, plants or other vegetation, or marine organisms, and libraries of mixtures for 
screening may be created by: (1) fermentation and extraction of broths from soil, plant or marine 
microorganisms or (2) extraction of the organisms themselves. Natural product libraries include 
polyketides, non-ribosomal peptides, and (non-naturally occurring) variants thereof. For a 
review, see Science 252:63-68 (1998). 

Combinatorial libraries are composed of large numbers of peptides, oligonucleotides or 
organic compounds and can be readily prepared by traditional automated synthesis methods, 
PCR, cloning or proprietary synthetic methods. Of particular interest are peptide and 
oligonucleotide combinatorial libraries. Still other libraries of interest include peptide, protein, 
peptidomimetic, multiparallel synthetic collection, recombinatorial, and polypeptide libraries. 
For a review of combinatorial chemistry and libraries created therefrom, see Myers, Curr. Opin. 
Biotechnol. 8:701-707 (1997). For reviews and examples of peptidomimetic libraries, see 
Al-Obeidi et ah, Mol. Biotechnol, 9(3):205-23 (1998); Hruby et al., Curr Opin Chem Biol, 
1(1):114-19 (1997); Dorner et al., Bioorg Med Chem, 4(5):709-15 (1996) (alkylated dipeptides). 

Identification of modulators through use of the various libraries described herein permits 
modification of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to bind a 
polypeptide of the invention. The molecules identified in the binding assay are then tested for 
antagonist or agonist activity in in vivo tissue culture or animal models that are well known in the 
art. In brief, the. molecules are titrated into a plurality of cell cultures or animals and then tested 
for either cell/animal death or prolonged survival of the animal/cells. 
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The binding molecules thus identified may be complexed with toxins, e.g., ricin or 
cholera, or with other compounds that are toxic to cells such as radioisotopes. The toxin-binding 
molecule complex is then targeted to a tumor or other cell by the specificity of the binding 
4 molecule for a polypeptide of the invention. Alternatively, the binding molecules may be 
complexed with imaging agents for targeting and imaging purposes. 

4.10.14 ASSAY FOR RECEPTOR ACTIVITY 

8 The invention also provides methods to detect specific binding of a polypeptide e.g. a 

ligand or a receptor. The art provides numerous assays particularly useful for identifying 
previously unknown binding partners for receptor polypeptides of the invention. For example, 
expression cloning using mammalian or bacterial cells, or dihybrid screening assays can be used 

12 to identify polynucleotides encoding binding partners. As another example, affinity 

chromatography with the appropriate immobilized polypeptide of the invention can be used to 
isolate polypeptides that recognize and bind polypeptides of the invention. There are a number 
of different libraries used for the identification of compounds, and in particular small molecules, 

1 6 that modulate (i. e., increase or decrease) biological activity of a polypeptide of the invention. 
Ligands for receptor polypeptides of the invention can also be identified by adding exogenous 
ligands, or cocktails of ligands to two cells populations that are genetically identical except for 
the expression of the receptor of the invention: one cell population expresses the receptor of the 

20 invention whereas the other does not. The response of the two cell populations to the addition of 
ligands(s) are then compared. Alternatively, an expression library can be co-expressed with the 
polypeptide of the invention in cells and assayed for an autocrine response to identify potential 
ligand(s). As still another example, BIAcore assays, gel overlay assays, or other methods known 

24 in the art can be used to identify binding partner polypeptides, including, (1 ) organic and 
inorganic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of random peptides, oligonucleotides or organic molecules. 

The role of downstream intracellular signaling molecules in the signaling cascade of the 

28 polypeptide of the invention can be determined. For example, a chimeric protein in which the 
cytoplasmic domain of the polypeptide of the invention is fused to the extracellular portion of a 
protein, whose ligand has been identified, is produced in a host cell. The cell is then incubated 
with the ligand specific for the extracellular portion of the chimeric protein, thereby activating 

32 the chimeric receptor. Known downstream proteins involved in intracellular signaling can then 
be assayed for expected modifications i.e. phosphorylation. Other methods known to those in the 
art can also be used to identify signaling molecules involved in receptor activity. 
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4.10.15 ANTI-INFLAMMATORY ACTIVITY 

Compositions of the present invention may also exhibit anti-inflammatory activity. The 
anti-inflammatory activity may be achieved by providing a stimulus to cells involved in the 
inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for example, 
cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the inflammatory 
process, inhibiting or promoting cell extravasation, or by stimulating or suppressing production 
of other factors which more directly inhibit or promote an inflammatory response. Compositions 
with such activities can be used to treat inflammatory conditions including chronic or acute 
conditions), including without limitation intimation associated with infection (such as septic 
shock, sepsis or systemic inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, 
endotoxin lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 
chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting from 
over production of cytokines such as TNF or IL-1 . Compositions of the invention may also be 
useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 
Compositions of this invention may be utilized to prevent or treat conditions such as, but not 
limited to, sepsis, acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid 
arthritis, chronic inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1 , 
graft versus host disease, inflammatory bowel disease, inflamation associated with pulmonary 
disease, other autoimmune disease or inflammatory disease, an antiproliferative agent such as for 
acute or chronic mylegenous leukemia or in the prevention of premature labor secondary to 
intrauterine infections. 

4.10.16 LEUKEMIAS 

24 Leukemias and related disorders may be treated or prevented by administration of a 

therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of the 
invention. Such leukemias and related disorders include but are not limited to acute leukemia, 
acute lymphocytic leukemia, acute myelocytic leukemia, myeloblastic, promyelocytic, 

28 myelomonocytic, monocytic, erythro leukemia, chronic leukemia, chronic myelocytic 

(granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such disorders, see 
Fishman et al., 1985, Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia). 

32 4.10.17 NERVOUS SYSTEM DISORDERS 

Nervous system disorders, involving cell types which can be tested for efficacy of 
intervention with compounds that modulate the activity of the polynucleotides and/or 
polypeptides of the invention, and which can be treated upon thus observing an indication of 
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therapeutic utility, include but are not limited to nervous system injuries, and diseases or 
disorders which result in either a disconnection of axons, a diminution or degeneration of 
neurons, or demyelination. Nervous system lesions which may be treated in a patient (including 
4 human and non-human mammalian patients) according to the invention include but are not 

limited to the following lesions of either the central (including spinal cord, brain) or peripheral 
nervous systems: 

(i) traumatic lesions, including lesions caused by physical injury or associated with 
8 surgery, for example, lesions which sever a portion of the nervous system, or compression 

injuries; 

(ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous system 
results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord 

12 infarction or ischemia; 

(iii) infectious lesions, in which a portion of the nervous system is destroyed or 
injured as a result of infection, for example, by an abscess or associated with infection by human 
immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme disease, 

1 6 tuberculosis, syphilis; 

(iv) degenerative lesions, in which a portion of the nervous system is destroyed or 
injured as a result of a degenerative process including but not limited to degeneration associated 
with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or amyotrophic lateral 

20 sclerosis; 

(v) lesions associated with nutritional diseases or disorders, in which a portion of the 
nervous system is destroyed or injured by a nutritional disorder or disorder of metabolism 
including but not limited to, vitamin B 1 2 deficiency, folic acid deficiency, Wernicke disease, 

24 tobacco-alcohol amblyopia, Marchiafava-Bignami disease (primary degeneration of the corpus 
callosum), and alcoholic cerebellar degeneration; 

(vi) neurological lesions associated with systemic diseases including but not limited to 
diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, carcinoma, or 

28 sarcoidosis; 

(vii) lesions caused by toxic substances including alcohol, lead, or particular 
neurotoxins; and 

(viii) demyelinated lesions in which a portion of the nervous system is destroyed or 
32 injured by a demyelinating disease including but not limited to multiple sclerosis, human 

immunodeficiency virus-associated myelopathy, transverse myelopathy or various etiologies, 
progressive multifocal leukoencephalopathy, and central pontine myelinolysis. 
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Therapeutics which are useful according to the invention for treatment of a nervous 
system disorder may be selected by testing for biological activity in promoting the survival or 
differentiation of neurons. For example, and not by way of limitation, therapeutics which elicit 
any of the following effects may be useful according to the invention: 

(i) increased survival time of neurons in culture; 

(ii) increased sprouting of neurons in culture or in vivo; 

(iii) increased production of a neuron-associated molecule in culture or in vivo, e.g. , 
choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or 

(iv) decreased symptoms of neuron dysfunction in vivo. 

• Such effects may be measured by any method known in the art. In preferred, 
non-limiting embodiments, increased survival of neurons may be measured by the method set 
forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons may 
be detected by methods set forth in Pestronk et al. (1 980, Exp. Neurol. 70:65-82) or Brown et al. 
(1981, Arm. Rev. Neurosci. 4:17-42); increased production of neuron-associated molecules may 
be measured by bioassay, enzymatic assay, antibody binding, Northern blot assay, etc., 
depending on the molecule to be measured; and motor neuron dysfunction may be measured by 
assessing the physical manifestation of motor neuron disorder, e.g., weakness, motor neuron 
conduction velocity, or functional disability. 

In specific embodiments, motor neuron disorders that may be treated according to the 
invention include but are not limited to disorders such as infarction, infection, exposure to toxin, 
trauma, surgical damage, degenerative disease or malignancy that may affect motor neurons as 
well as other components of the nervous system, as well as disorders that selectively affect 
neurons such as amyotrophic lateral sclerosis, and including but not limited to progressive spinal 
muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, infantile and juvenile 
muscular atrophy, progressive bulbar paralysis of childhood (Fazio-Londe syndrome), 
poliomyelitis and the post polio syndrome, and Hereditary Motorsensory Neuropathy 
(Charcot-Marie-Tooth Disease). 

4.10.18 OTHER ACTIVITIES 

A polypeptide of the invention may also exhibit one or more of the following additional 
activities or effects: inhibiting the growth, infection or function of, or killing, infectious agents, 
including, without limitation, bacteria, viruses, fungi and other parasites; effecting (suppressing 
or enhancing) bodily characteristics, including, without limitation, height, weight, hair color, eye 
color, skin, fat to lean ratio or other tissue pigmentation, or organ or body part size or shape 
(such as, for example, breast augmentation or diminution, change in bone form or shape); 
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effecting biorhythms or circadian cycles or rhythms; effecting the fertility of male or female 
subjects, effecting the metabolism, catabolism, anabolism, processing, utilization, storage or 
elimination of dietary fat, lipid, protein, carbohydrate, vitamins, minerals, co-factors or other 

4 nutritional factors or component(s); effecting behavioral characteristics, including, without 
limitation, appetite, libido, stress, cognition (including cognitive disorders), depression 
(including depressive disorders) and violent behaviors; providing analgesic effects or other pain 
reducing effects; promoting differentiation and growth of embryonic stem cells in lineages other 

8 than hematopoietic lineages; hormonal or endocrine activity; in the case of enzymes, correcting 
deficiencies of the enzyme and treating deficiency-related diseases; treatment of 
hyperproliferative disorders (such as, for example, psoriasis); immunoglobulin-like activity (such 
as, for example, the ability to bind antigens or complement); and the ability to act as an antigen 
12 in a vaccine composition to raise an immune response against such protein or another material or 
entity which is cross-reactive with such protein. 

4.10.19 IDENTIFICATION OF POLYMORPHISMS 

1 6 The demonstration of polymorphisms makes possible the identification of such 

polymorphisms in human subjects and the pharmacogenetic use of this information for diagnosis 
and treatment. Such polymorphisms may be associated with, e.g., differential predisposition or 
susceptibility to various disease states (such as disorders involving inflammation or immune 

20 response) or a differential response to drug administration, and this genetic information can be 
used to tailor preventive or therapeutic treatment appropriately. For example, the existence of a 
polymorphism associated with a predisposition to inflammation or autoimmune disease makes 
possible the diagnosis of this condition in humans by identifying the presence of the 

24 polymorphism. 

Polymorphisms can be identified in a variety of ways known in the art which all 
generally involve obtaining a sample from a patient, analyzing DNA from the sample, optionally 
involving isolation or amplification of the DNA, and identifying the presence of the 

28 polymorphism in the DNA. For example, PCR may be used to amplify an appropriate fragment 
of genomic DNA which may then be sequenced. Alternatively, the DNA may be subjected to 
allele-specific oligonucleotide hybridization (in which appropriate oligonucleotides are 
hybridized to the DNA under conditions permitting detection of a single base mismatch) or to a 

32 single nucleotide extension assay (in which an oligonucleotide that hybridizes immediately 

adjacent to the position of the polymorphism is extended with one or more labeled nucleotides). 
In addition, traditional restriction fragment length polymorphism analysis (using restriction 
enzymes that provide differential digestion of the genomic DNA depending on the presence or 
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absence of the polymorphism) may be performed. Arrays with nucleotide sequences of the 
present invention can be used to detect polymorphisms. The array can comprise modified 
nucleotide sequences of the present invention in order to detect the nucleotide sequences of the 

4 present invention. In the alternative, any one of the nucleotide sequences of the present 
invention can be placed on the array to detect changes from those sequences. 

Alternatively a polymorphism resulting in a change in the amino acid sequence could 
also be detected by detecting a corresponding change in amino acid sequence of the protein, e.g., 

8 by an antibody specific to the variant sequence. 

4.10.20 ARTHRITIS AND INFLAMMATION 

The immunosuppressive effects of the compositions of the invention against rheumatoid 

12 arthritis is determined in an experimental animal model system. The experimental model system 
is adjuvant induced arthritis in rats, and the protocol is described by J. Holoshitz, et at., ] 983, 
Science, 219:56, or by B. Waksman et al., 1963, Int. Arch. Allergy Appl. Immunol., 23:129. 
Induction of the disease can be caused by a single injection, generally intradermally, of a 

16 suspension of killed Mycobacterium tuberculosis in complete Freund's adjuvant (CFA). The 
route of injection can vary, but rats may be injected at the base of the tail with an adjuvant 
mixture. The polypeptide is administered in phosphate buffered solution (PBS) at a dose of about 
\ 1-5 mg/kg. The control consists of administering PBS only. 

20 The procedure for testing the effects of the test compound would consist of intradermally 

injecting killed Mycobacterium tuberculosis in CFA followed by immediately administering the 
test compound and subsequent treatment every other day until day 24. At 14, 15, 1 8, 20, 22, and 
24 days after injection of Mycobacterium CFA, an overall arthritis score may be obtained as 

24 described by J. Holoskitz above. An analysis of the data would reveal that the test compound 
would have a dramatic affect on the swelling of the joints as measured by a decrease of the 
arthritis score. 

28 4.11 THERAPEUTIC METHODS 

The compositions (including polypeptide fragments, analogs, variants and antibodies or 
other binding partners or modulators including antisense polynucleotides) of the invention have 
numerous applications in a variety of therapeutic methods. Examples of therapeutic applications 
32 include, but are not limited to, those exemplified herein. 

4.11.1 EXAMPLE 
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One embodiment of the invention is the administration of an effective amount of the 
polypeptides or other composition of the invention to individuals affected by a disease or 
disorder that can be modulated by regulating the peptides of the invention. While the mode of 

4 administration is not particularly important, parenteral administration is preferred. An 
exemplary mode of administration is to deliver an intravenous bolus. The dosage of the 
polypeptides or other composition of the invention will normally be determined by the 
prescribing physician. It is to be expected that the dosage will vary according to the age, weight, 

8 condition and response of the individual patient. Typically, the amount of polypeptide 

administered per dose will be in the range of about 0.01u.g/kg to 100 mg/kg of body weight, with 
the preferred dose being about 0. lug/kg to 10 mg/kg of patient body weight. For parenteral 
administration, polypeptides of the invention will be formulated in an injectable form combined 
12 with a pharmaceutically acceptable parenteral vehicle. Such vehicles are well known in the art 
and examples include water, saline. Ringer's solution, dextrose solution, and solutions consisting 
of small amounts of the human serum albumin. The vehicle may contain minor amounts of 
additives that maintain the isotonicity and stability of the polypeptide or other active ingredient. 
1 6 The preparation of such solutions is within the skill of the art. 

4.12 PHARMACEUTICAL FORMULATIONS AND ROUTES OF 
ADMINISTRATION 

20 A protein or other composition of the present invention (from whatever source derived, 

including without limitation from recombinant and non-recombinant sources and including 
antibodies and other binding partners of the polypeptides of the invention) may be administered 
to a patient in need, by itself, or in pharmaceutical compositions where it is mixed with suitable 

24 carriers or excipient(s) at doses to treat or ameliorate a variety of disorders. Such a composition 
may optionally contain (in addition to protein or other active ingredient and a carrier) diluents, 
fillers, salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 
"pharmaceutically acceptable" means a non-toxic material that does not interfere with the 

28 effectiveness of the biological activity of the active ingredient(s). The characteristics of the 
carrier will depend on the route of administration. The pharmaceutical composition of the 
invention may also contain cytokines, lymphokines, or other hematopoietic factors such as 
M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, 1L-7, IL-8, IL-9, IL-10, IL-1 1, IL-12, 

32 IL-1 3, IL-14, IL-15, IFN, TNF0, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem cell 
factor, and erythropoietin. In further compositions, proteins of the invention may be combined 
with other agents beneficial to the treatment of the disease or disorder in question. These agents 
include various growth factors such as epidermal growth factor (EGF), platelet-derived growth 
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factor (PDGF), transforming growth factors (TGF-a and TGF-P), insulin-like growth factor 
(IGF), as well as cytokines described herein. 

The pharmaceutical composition may further contain other agents which either enhance 
the activity of the protein or other active ingredient or complement its activity or use in 
treatment. Such additional factors and/or agents may be included in the pharmaceutical 
composition to produce a synergistic effect with protein or other active ingredient of the 
invention, or to minimize side effects. Conversely, protein or other active ingredient of the 
present invention may be included in formulations of the particular clotting factor, cytokine, 
lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti- 
inflammatory agent to minimize side effects of the clotting factor, cytokine, lymphokine, other 
hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti-inflammatory agent (such as 
IL-IRa, IL-1 Hyl, IL-1 Hy2, anti-TNF, corticosteroids, immunosuppressive agents). A protein 
of the present invention may be active in multimers (e.g. , heterodimers or homodimers) or 
complexes with itself or other proteins. As a result, pharmaceutical compositions of the 
invention may comprise a protein of the invention in such multimeric or complexed form. 

As an alternative to being included in a pharmaceutical composition of the invention 
including a first protein, a second protein or a therapeutic agent may be concurrently 
administered with the first protein (e.g. , at the same time, or at differing times provided that 
therapeutic concentrations of the combination of agents is achieved at the treatment site). 
Techniques for formulation and administration of the compounds of the instant application may 
be found in "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, latest 
edition. A therapeutically effective dose further refers to that amount of the compound sufficient 
to result in amelioration of symptoms, e.g., treatment, healing, prevention or amelioration of the 
relevant medical condition, or an increase in rate of treatment, healing, prevention or 
amelioration of such conditions. When applied to an individual active ingredient, administered 
alone, a therapeutically effective dose refers to that ingredient alone. When applied to a 
combination, a therapeutically effective dose refers to combined amounts of the active 
ingredients that result in the therapeutic effect, whether administered in combination, serially or 
simultaneously. 

In practicing the method of treatment or use of the present invention, a therapeutically 
effective amount of protein or other active ingredient of the present invention is administered to 
a mammal having a condition to be treated. Protein or other active ingredient of the present 
invention may be administered in accordance with the method of the invention either alone or in 
combination with other therapies such as treatments employing cytokines, lymphokines or other 
hematopoietic factors. When co- administered with one or more cytokines, lymphokines or other 
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hematopoietic factors, protein or other active ingredient of the present invention may be 
administered either simultaneously with the cytokine(s), lymphokine(s), other hematopoietic 
factor(s), thrombolytic or anti-thrombotic factors, or sequentially. If administered sequentially, 
the attending physician will decide on the appropriate sequence of administering protein or other 
active ingredient of the present invention in combination with cytokine(s), lymphokine(s), other 
hematopoietic factor(s), thrombolytic or anti-thrombotic factors. 

4.12.1 ROUTES OF ADMINISTRATION 

Suitable routes of administration may, for example, include oral, rectal, transmucosal, or 
intestinal administration; parenteral delivery, including intramuscular, subcutaneous, 
intramedullary injections, as well as intrathecal, direct intraventricular, intravenous, 
intraperitoneal, intranasal, or intraocular injections. Administration of protein or other active 
ingredient of the present invention used in the pharmaceutical composition or to practice the 
method of the present invention can be carried out in a variety of conventional ways, such as oral 
ingestion, inhalation, topical application or cutaneous, subcutaneous, intraperitoneal, parenteral 
or intravenous injection. Intravenous administration to the patient is preferred. 

Alternately, one may administer the compound in a local rather than systemic manner, for 
example, via injection of the compound directly into a arthritic joints or in fibrotic tissue, often 
in a depot or sustained release formulation. In order to prevent the scarring process frequently 
occurring as complication of glaucoma surgery, the compounds may be administered topically, 
for example, as eye drops. Furthermore, one may administer the drug in a targeted drug delivery 
system, for example, in a liposome coated with a specific antibody, targeting, for example, 
arthritic or fibrotic tissue. The liposomes will be targeted to and taken up selectively by the 
afflicted tissue. 

The polypeptides of the invention are administered by any route that delivers an effective 
dosage to the desired site of action. The determination of a suitable route of administration and 
an effective dosage for a particular indication is within the level of skill in the art. Preferably for 
wound treatment, one administers the therapeutic compound directly to the site. Suitable dosage 
ranges for the polypeptides of the invention can be extrapolated from these dosages or from 
similar studies in appropriate animal models. Dosages can then be adjusted as necessary by the 
clinician to provide maximal therapeutic benefit. 

4.12.2 COMPOSITIONS/FORMULATIONS 

Pharmaceutical compositions for use in accordance with the present invention thus may 
be formulated in a conventional manner using one or more physiologically acceptable carriers 
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comprising excipients and auxiliaries which facilitate processing of the active compounds into 
preparations which can be used pharmaceutical^. These pharmaceutical compositions may be 
manufactured in a manner that is itself known, e.g. , by means of conventional mixing, 
4 dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or 
lyophilizing processes. Proper formulation is dependent upon the route of administration 
chosen. When a therapeutically effective amount of protein or other active ingredient of the 
present invention is administered orally, protein or other active ingredient of the present 
8 invention will be in the form of a tablet, capsule, powder, solution or elixir. When administered 
in tablet form, the pharmaceutical composition of the invention may additionally contain a solid 
carrier such as a gelatin or an adjuvant. The tablet, capsule, and powder contain from about 5 to 
95% protein or other active ingredient of the present invention, and preferably from about 25 to 

12 90% protein or other active ingredient of the present invention. When administered in liquid 
form, a liquid carrier such as water, petroleum, oils of animal or plant origin such as peanut oil, 
mineral oil, soybean oil, or sesame oil, or synthetic oils may be added. The liquid form of the 
pharmaceutical composition may further contain physiological saline solution, dextrose or other 

16 saccharide solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol. 
.. When administered in liquid form, the pharmaceutical composition contains from about 0.5 to 
90% by weight of protein or other active ingredient of the present invention, and preferably from 
about 1 to 50% protein or other active ingredient of the present invention. 

20 When a therapeutically effective amount of protein or other active ingredient of the 

present invention is administered by intravenous, cutaneous or subcutaneous injection, protein or 
other active ingredient of the present invention will be i n the form of a pyrogen-free, parenterally 
acceptable aqueous solution. The preparation of such parenterally acceptable protein or other 

24 active ingredient solutions, having due regard to pH, isotonicity, stability, and the like, is within 
the skill in the art. A preferred pharmaceutical composition for intravenous, cutaneous, or 
subcutaneous injection should contain, in addition to protein or other active ingredient of the 
present invention, an isotonic vehicle such as Sodium Chloride Injection, Ringer's Injection, 

28 Dextrose Injection, Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or 

other vehicle as known in the art. The pharmaceutical composition of the present invention may 
also contain stabilizers, preservatives, buffers, antioxidants, or other additives known to those of 
skill in the art. For injection, the agents of the invention may be formulated in aqueous 

32 solutions, preferably in physiologically compatible buffers such as Hanks's solution, Ringer's 

solution, or physiological saline buffer. For transmucosal administration, penetrants appropriate 
to the barrier to be permeated are used in the formulation. Such penetrants are generally known 
in the art. 
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For oral administration, the compounds can be formulated readily by combining the 
active compounds with pharmaceutically acceptable carriers well known in the art. Such carriers 
enable the compounds of the invention to be formulated as tablets, pills, dragees, capsules, 
4 liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to be 
treated. Pharmaceutical preparations for oral use can be obtained from a solid excipient, 
optionally grinding a resulting mixture, and processing the mixture of granules, after adding 
suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable excipients are, in 
8 particular, fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose 

preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, 
gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium 
carboxymethyl cellulose, and/or polyvinylpyrrolidone (PVP). If desired, disintegrating agents 
12 may be added, such as the cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt 
thereof such as sodium alginate. Dragee cores are provided with suitable coatings. For this 
purpose, concentrated sugar solutions may be used, which may optionally contain gum arabic, 
talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer 
16 solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be 

added to the tablets or dragee coatings for identification or to characterize different combinations 
of active compound doses. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 
20 gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or 

sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler such as 
lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in 
24 suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, 

stabilizers may be added. All formulations for oral administration should be in dosages suitable 
for such administration. For buccal administration, the compositions may take the form of 
tablets or lozenges formulated in conventional manner. 
28 For administration by inhalation, the compounds for use according to the present 

invention are conveniently delivered in the form of an aerosol spray presentation from 
pressurized packs or a nebuliser, with the use of a suitable propellant, e.g. , 
dichlorodifiuoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or 
32 other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined by 
providing a valve to deliver a metered amount. Capsules and cartridges of, e.g. , gelatin for use 
in an inhaler or insufflator may be formulated containing a powder mix of the compound and a 
suitable powder base such as lactose or starch. The compounds may be formulated for parenteral 
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administration by injection, e.g., by bolus injection or continuous infusion. Formulations for 
injection may be presented in unit dosage form, e.g., in ampules or in multi-dose containers, with 
an added preservative. The compositions may take such forms as suspensions, solutions or 
4 emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending, 
stabilizing and/or dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous solutions of 
the active compounds in water-soluble form. Additionally, suspensions of the active compounds 
8 may be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or 

vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or 
triglycerides, or liposomes. Aqueous injection suspensions may contain substances which 
increase the viscosity of the suspension, such as sodmm carboxymethyl cellulose, sorbitol, or 

12 dextran. Optionally, the suspension may also contain suitable stabilizers or agents which 
increase the solubility of the compounds to allow for the preparation of highly concentrated 
solutions. Alternatively, the active ingredient may be in powder form for constitution with a 
suitable vehicle, e.g., sterile pyrogen-free water, before use. 

16 The compounds may also be formulated in rectal compositions such as suppositories or 

- retention enemas, e.g., containing conventional suppository bases such as cocoa butter or other 
glycerides. In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be administered by 

20 implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 

Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic ' 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as 
sparingly soluble derivatives, for example, as a sparingly soluble salt. 

24 A pharmaceutical carrier for the hydrophobic compounds of the invention is a co-solvent 

system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic polymer, and 
an aqueous phase. The co-solvent system may be the VPD co-solvent system. VPD is a solution 
of 3% w/v benzyl alcohol, 8% w/v of the nonpolar surfactant polysorbate 80, and 65% w/v 

28 polyethylene glycol 300, made up to volume in absolute ethanol. The VPD co-solvent system 
(VPD:5W) consists of VPD diluted 1:1 with a 5% dextrose in water solution. This co-solvent 
system dissolves hydrophobic compounds well, and itself produces low toxicity upon systemic 
administration. Naturally, the proportions of a co-solvent system may be varied considerably 

32 without destroying its solubility and toxicity characteristics. Furthermore, the identity of the 

co-solvent components may be varied: for example, other low-toxicity nonpolar surfactants may 
be used instead of polysorbate 80; the fraction size of polyethylene glycol may be varied; other 
biocompatible polymers may replace polyethylene glycol, e.g. polyvinyl pyrrolidone; and other 
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sugars or polysaccharides may substitute for dextrose. Alternatively, other delivery systems for 
hydrophobic pharmaceutical compounds may be employed. Liposomes and emulsions are well 
known examples of delivery vehicles or carriers for hydrophobic drugs. Certain organic solvents 
such as dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity. 
Additionally, the compounds may be delivered using a sustained-release system, such as 
semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent. 
Various types of sustained-release materials have been established and are well known by those 
skilled in the art. Sustained-release capsules may, depending on their chemical nature, release the 
compounds for a few weeks up to over 100 days. Depending on the chemical nature and the 
biological stability of the therapeutic reagent, additional strategies for protein or other active 
ingredient stabilization may be employed. 

The pharmaceutical compositions also may comprise suitable solid or gel phase carriers 
or excipients. Examples of such carriers or excipients include but are not limited to calcium 
carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and 
polymers such as polyethylene glycols. Many of the active ingredients of the invention may be 
provided as salts with pharmaceutically compatible counter ions. Such pharmaceutically 
acceptable base addition salts are those salts which retain the biological effectiveness and 
properties of the free acids and which are obtained by reaction with inorganic or organic bases 
such as sodium hydroxide, magnesium hydroxide, ammonia, trialkylamine, dialkylamine, 
monoalkylamine, dibasic amino acids, sodium acetate, potassium benzoate, triethanol amine and 
the like. 

The pharmaceutical composition of the invention may be in the form of a complex of the 
protein(s) or other active ingredient(s) of present invention along with protein or peptide 
antigens. The protein and/or peptide antigen will deliver a stimulatory signal to both B and T 
lymphocytes. B lymphocytes will respond to antigen through their surface immunoglobulin 
receptor. T lymphocytes will respond to antigen through the T cell receptor (TCR) following 
presentation of the antigen by MHC proteins. MHC and structurally related proteins including 
those encoded by class I and class II MHC genes on host cells will serve to present the peptide 
antigen(s) to T lymphocytes. The antigen components could also be supplied as purified 
MHC-peptide complexes alone or with co-stimulatory molecules that can directly signal T cells. 
Alternatively antibodies able to bind surface immunoglobulin and other molecules on B cells as 
well as antibodies able to bind the TCR and other molecules on T cells can be combined with the 
pharmaceutical composition of the invention. 

The pharmaceutical composition of the invention may be in the form of a liposome in 
which protein of the present invention is combined, in addition to other pharmaceutically 
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acceptable carriers, with amphipathic agents such as lipids which exist in aggregated form as 
micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. Suitable 
lipids for liposomal formulation include, without limitation, monoglycerides, diglycerides, 
sulfatides, lysolecithins, phospholipids, saponin, bile acids, and the like. Preparation of such 
liposomal formulations is within the level of skill in the art, as disclosed, for example, in U.S. 
Patent Nos. 4,235,871 ; 4,501,728; 4,837,028; and 4,737,323, all of which are incorporated 
herein by reference. 

The amount of protein or other active ingredient of the present invention in the 
pharmaceutical composition of the present invention will depend upon the nature and severity of 
the condition being treated, and on the nature of prior treatments which the patient has 
undergone. Ultimately, the attending physician will decide the amount of protein or other active 
ingredient of the present invention with which to treat each individual patient. Initially, the 
attending physician will administer low doses of protein or other active ingredient of the present 
invention and observe the patient's response. Larger doses of protein or other active ingredient 
of the present invention may be administered until the optimal therapeutic effect is obtained for 
the patient, and at that point the dosage is not increased further. It is contemplated that the 
various pharmaceutical compositions used to practice the method of the present invention should 
contain about 0.01 pg to about 100 mg (preferably about 0.1 pg to about 10 mg, more preferably 
about 0. 1 pg to about 1 mg) of protein or other active ingredient of the present invention per kg 
body weight. For compositions of the present invention which are useful for bone, cartilage, 
tendon or ligament regeneration, the therapeutic method includes administering the composition 
topically, systematically, or locally as an implant or device. When administered, the therapeutic 
composition for use in this invention is, of course, in a pyrogen-free, physiologically acceptable 
form. Further, the composition may desirably be encapsulated or injected in a viscous form for 
delivery to the site of bone, cartilage or tissue damage. Topical administration may be suitable 
for wound healing and tissue repair. Therapeutically useful agents other than a protein or other 
active ingredient of the invention which may also optionally be included in the composition as 
described above, may alternatively or additionally, be administered simultaneously or 
sequentially with the composition in the methods of the invention. Preferably for bone and/or 
cartilage formation, the composition would include a matrix capable of delivering the 
protein-containing or other active ingredient-containing composition to the site of bone and/or 
cartilage damage, providing a structure for the developing bone and cartilage and optimally 
capable of being resorbed into the body. Such matrices may be formed of materials presently in 
use for other implanted medical applications. 
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The choice of matrix material is based on biocompatibility, biodegradability, mechanical 
properties, cosmetic appearance and interface properties. The particular application of the 
compositions will define the appropriate formulation. Potential matrices for the compositions 
may be biodegradable and chemically defined calcium sulfate, tricalcium phosphate, 
hydroxyapatite, polylactic acid, polyglycolic acid and polyanhydrides. Other potential materials 
are biodegradable and biologically well-defined, such as bone or dermal collagen. Further 
matrices are comprised of pure proteins or extracellular matrix components. Other potential 
matrices are nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass, 
aluminates, or other ceramics. Matrices may be comprised of combinations of any of the above 
mentioned types of material, such as polylactic acid and hydroxyapatite or collagen and 
tricalcium phosphate. The bioceramics may be altered in composition, such as in 
calcium-aluminate-phosphate and processing to alter pore size, particle size, particle shape, and 
biodegradability. Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and 
glycolic acid in the form of porous particles having diameters ranging from 1 50 to 800 microns. 
In some applications, it will be useful to utilize a sequestering agent, such as carboxymethyl 
cellulose or autologous blood clot, to prevent the protein compositions from disassociating from 
the matrix. 

A preferred family of sequestering agents is cellulosic materials such as alkylcelluloses 
(including hydroxyalkylcelluloses), including methylcellulose, ethylcellulose, 
hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl-methylcellulose, and 
carboxymethylcellulose, the most preferred being cationic salts of carboxymethylcellulose 
(CMC). Other preferred sequestering agents include hyaluronic acid, sodium alginate, 
poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl polymer and polyvinyl alcohol). 
The amount of sequestering agent useful herein is 0.5-20 wt %, preferably 1-10 wt % based on 
total formulation weight, which represents the amount necessary to prevent desorption of the 
protein from the polymer matrix and to provide appropriate handling of the composition, yet not 
so much that the progenitor cells are prevented from infiltrating the matrix, thereby providing the 
protein the opportunity to assist the osteogenic activity of the progenitor cells. In further 
compositions, proteins or other active ingredients of the invention may be combined with other 
agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in 
question. These agents include various growth factors such as epidermal growth factor (EGF), 
platelet derived growth factor (PDGF), transforming growth factors (TGF-a and TGF-(3), and 
insulin-like growth factor (IGF). 

The therapeutic compositions are also presently valuable for veterinary applications. 
Particularly domestic animals and thoroughbred horses, in addition to humans, are desired 
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patients for such treatment with proteins or other active ingredients of the present invention. The 
dosage regimen of a protein-containing pharmaceutical composition to be used in tissue 
regeneration will be determined by the attending physician considering various factors which 
modify the action of the proteins, e.g., amount of tissue weight desired to be formed, the site of 
damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue (e.g., 
bone), the patient's age, sex, and diet, the severity of any infection, time of administration and 
other clinical factors. The dosage may vary with the type of matrix used in the reconstitution 
and with inclusion of other proteins in the pharmaceutical composition. For example, the 
addition of other known growth factors, such as IGF I (insulin like growth factor I), to the final 
composition, may also effect the dosage. Progress can be monitored by periodic assessment of 
tissue/bone growth and/or repair, for example, X-rays, histomorphometric determinations and 
tetracycline labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject. Polynucleotides of the invention may also be administered by other known 
methods for introduction of nucleic acid into a cell or organism (including, without limitation, in 
the form of viral vectors or naked DNA). Cells may also be cultured ex vivo in the presence of 
proteins of the present invention in order to proliferate or to produce a desired effect on or 
activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes. 

4.12.3 EFFECTIVE DOSAGE 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to achieve its 
intended purpose. More specifically, a therapeutically effective amount means an amount 
effective to prevent development of or to alleviate the existing symptoms of the subject being 
treated. Determination of the effective amount is well within the capability of those skilled in 
the art, especially in light of the detailed disclosure provided herein. For any compound used in 
the method of the invention, the therapeutically effective dose can be estimated initially from 
appropriate in vitro assays. For example, a dose can be formulated in animal models to achieve a 
circulating concentration range that can be used to more accurately determine useful doses in 
humans. For example, a dose can be formulated in animal models to achieve a circulating 
concentration range that includes the IC S0 as determined in cell culture (i.e., the concentration of 
the test compound which achieves a half-maximal inhibition of the protein's biological activity). 
Such information can be used to more accurately determine useful doses in humans. 
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A therapeutically effective dose refers to that amount of the compound that results in 
amelioration of symptoms or a prolongation of survival in a patient. Toxicity and therapeutic 
efficacy of such compounds can be determined by standard pharmaceutical procedures in cell 
cultures or experimental animals, e.g., for determining the LD 50 (the dose lethal to 50% of the 
population) and the ED 50 (the dose therapeutically effective in 50% of the population). The dose 
ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the 
ratio between LD 5 o and EDso- Compounds which exhibit high therapeutic indices are preferred. 
The data obtained from these cell culture assays and animal studies can be used in formulating a 
range of dosage for use in human. The dosage of such compounds lies preferably within a range 
of circulating concentrations that include the ED 5 o with little or no toxicity. The dosage may 
vary within this range depending upon the dosage form employed and the route of administration 
utilized. The exact formulation, route of administration and dosage can be chosen by the 
individual physician in view of the patient's condition. See, e.g., Fingl et al., 1975, in "The 
Pharmacological Basis of Therapeutics", Ch. 1 p.l. Dosage amount and interval may be adjusted 
individually to provide plasma levels of the active moiety which are sufficient to maintain the 
desired effects, or minimal effective concentration (MEC). The MEC will vary for each 
compound but can be estimated from in vitro data. Dosages necessary to achieve the MEC will 
depend on individual characteristics and route of administration. However, HPLC assays or 
bioassays can be used to determine plasma concentrations. 

Dosage intervals can also be determined using MEC value. Compounds should be 
administered using a regimen which maintains plasma levels above the MEC for 10-90% of the 
time, preferably between 30-90% and most preferably between 50-90%. In cases of local 
administration or selective uptake, the effective local concentration of the drug may not be 
related to plasma concentration. 

An exemplary dosage regimen for polypeptides or other compositions of the invention 
will be in the range of about 0.01 u.g/kg to 100 mg/kg of body weight daily, with the preferred 
dose being about 0. 1 ug/kg to 25 mg/kg of patient body weight daily, varying in adults and 
children. Dosing may be once daily, or equivalent doses may be delivered at longer or shorter 
intervals. 

The amount of composition administered will, of course, be dependent on the subject 
being treated, on the subject's age and weight, the severity of the affliction, the manner of 
administration and the judgment of the prescribing physician. 

4.12.4 PACKAGING 
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The compositions may, if desired, be presented in a pack or dispenser device which may 
contain one or more unit dosage forms containing the active ingredient. The pack may, for 
example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser device may 
4 be accompanied by instructions for administration. Compositions comprising a compound of the 
invention formulated in a compatible pharmaceutical carrier may also be prepared, placed in an 
appropriate container, and labeled for treatment of an indicated condition. 

8 4.13 ANTIBODIES 

Also included in the invention are antibodies to proteins, or fragments of proteins of the 
invention. The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that contain 
12 an antigen binding site that specifically binds (immunoreacts with) an antigen. Such antibodies 
include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, F a b, F a b- and F( a b-)2 
fragments, and an F a b expression library. In general, an antibody molecule obtained from 
humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ from one another 

16 by the nature of the heavy chain present in the molecule. Certain classes have subclasses as well, 
such as IgGi, IgG2, and others. Furthermore, in humans, the light chain may be a kappa chain or 
a lambda chain. Reference herein to antibodies includes a reference to all such classes, 
subclasses and types of human antibody species. 

20 An isolated related protein of the invention may be intended to serve as an antigen, or a 

portion or fragment thereof, and additionally can be used as an immunogen to generate 
antibodies that immunospecifically bind the antigen, using standard techniques for polyclonal 
and monoclonal antibody preparation. The full-length protein can be used or, alternatively, the 

24 invention provides antigenic peptide fragments of the antigen for use as immunogens. An 

antigenic peptide fragment comprises at least 6 amino acid residues of the amino acid sequence 
of the full length protein, (for example the amino acid sequence shown in SEQ ID NO: 1351), 
and encompasses an epitope thereof such that an antibody raised against the peptide forms a 

28 specific immune complex with the full length protein or with any fragment that contains the 

epitope. Preferably, the antigenic peptide comprises at least 10 amino acid residues, or at least 
1 5 amino acid residues, or at least 20 amino acid residues, or at least 30 amino acid residues. 
Preferred epitopes encompassed by the antigenic peptide are regions of the protein that are 

32 located on its surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a region of -related protein that is located on the surface of the protein, e.g., a 
hydrophilic region. A hydrophobicity analysis of the human related protein sequence will 
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indicate which regions of a related protein are particularly hydrophilic and, therefore, are likely 
to encode surface residues useful for targeting antibody production. As a means for targeting 
antibody production, hydropathy plots showing regions of hydrophilicity and hydrophobicity 
may be generated by any method well known in the art, including, for example, the Kyte 
Doolittle or the Hopp Woods methods, either with or without Fourier transformation. See, e.g., 
Hopp and Woods, 1981, Proc. Nat. Acad. Sci. USA 78: 3824-3828; Kyte and Doolittle 1982, J. 
Mol. Biol. 157: 105-142, each of which is incorporated herein by reference in its entirety. 
Antibodies that are specific for one or more domains within an antigenic protein, or derivatives, 
fragments, analogs or homologs thereof, are also provided herein. 

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 
thereof, may be utilized as an immunogen in the generation of antibodies that 
immunospecifically bind these protein components. 

Various procedures known within the art may be used for the production of polyclonal or 
monoclonal antibodies directed against a protein of the invention, or against derivatives, 
fragments, analogs homologs or orthologs thereof (see, for example, Antibodies: A Laboratory 
Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, incorporated herein by reference). Some of these antibodies are discussed below. 

5.13.1 Polyclonal Antibodies 

For the production of polyclonal antibodies, various suitable host animals {e.g., rabbit, 
goat, mouse or other mammal) may be immunized by one or more injections with the native 
protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, the naturally occurring immunogenic 
protein, a chemically synthesized polypeptide representing the immunogenic protein, or a 
recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated to 
a second protein known to be immunogenic in the mammal being immunized. Examples of such 
immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, 
bovine thyro globulin, and soybean trypsin inhibitor. The preparation can further include an 
adjuvant. Various adjuvants used to increase the immunological response include, but are not 
limited to, Freund's (complete and incomplete), mineral gels {e.g. , aluminum hydroxide), surface 
active substances {e.g., lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 
dinitrophenol, etc.), adjuvants usable in humans such as Bacille Calmette-Guerin and 
Corynebacterium parvum, or similar immunostimulatory agents. Additional examples of 
adjuvants which can be employed include MPL-TDM adjuvant (monophosphoryl Lipid A, 
synthetic trehalose dicorynomycolate). 

75 



_016483SA2J_> 



WO 01/64835 



PCT/USOl/04927 



The polyclonal antibody molecules directed against the immunogenic protein can be 
isolated from the mammal {e.g., from the blood) and further purified by well known techniques, 
such as affinity chromatography using protein A or protein G, which provide primarily the IgG 
fraction of immune serum. Subsequently, or alternatively, the specific antigen which is the 
target of the immunoglobulin sought, or an epitope thereof, may be immobilized on a column to 
purify the immune specific antibody by immunoaffinity chromatography. Purification of 
immunoglobulins is discussed, for example, by D. Wilkinson (The Scientist, published by The 
Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 (April 17, 2000), pp. 25-28). 

5.13.2 Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as used 
herein, refers to a population of antibody molecules that contain only one molecular species of 
antibody molecule consisting of a unique light chain gene product and a unique heavy chain 
gene product. In particular, the complementarity determining regions (CDRs) of the monoclonal 
antibody are identical in all the molecules of the population. MAbs thus contain an antigen 
binding site capable of irnmunoreacting with a particular epitope of the antigen characterized by 
a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature . 256 :495 (1975). In a hybridoma method, a mouse, 
hamster, or other appropriate host animal, is typically immunized with an immunizing agent to 
elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind 
to the immunizing agent. Alternatively, the lymphocytes can be immunized in vitro. 

The immunizing agent will typically include the protein antigen, a fragment thereof or a 
fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells of human 
origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources 
are desired. The lymphocytes are then fused with an immortalized cell line using a suitable 
fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal 
Antibodies: Principles and Practice . Academic Press, (1986) pp. 59-103). Immortalized cell 
lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and 
human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells can 
be cultured in a suitable culture medium that preferably contains one or more substances that 
inhibit the growth or survival of the unfused, immortalized cells. For example, if the parental 
cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the 
culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and 
thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient cells. 
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Preferred immortalized cell lines are those that fuse efficiently, support stable high level 
expression of antibody by the selected antibody-producing cells, and are sensitive to a medium 
such as HAT medium. More preferred immortalized cell lines are murine myeloma lines, which 

4 can be obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego, 

California and the American Type Culture Collection, Manassas, Virginia. Human myeloma and 
mouse-human heteromyeloma cell lines also have been described for the production of human 
monoclonal antibodies (Kozbor, J. Immunol.. 133 :3001 (1984); Brodeur et al, Monoclonal 

8 Antibody Production Techniques and Applications , Marcel Dekker, Inc., New York, (1987) pp. 
51-63). 

The culture medium in which the hybridoma cells are cultured can then be assayed for 
the presence of monoclonal antibodies directed against the antigen. Preferably, the binding 

12 specificity of monoclonal antibodies produced by the hybridoma cells is determined by 
immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or 
enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in the 
art. The binding affinity of the monoclonal antibody can, for example, be determined by the 

16 Scatchard analysis of Munson and Pollard, Anal. Biochem. . 107 :220 (1980). Preferably, 

antibodies having a high degree of specificity and a high binding affinity for the target antigen 
are isolated. 

After the desired hybridoma cells are identified, the clones can be subcloned by limiting 

20 dilution procedures and grown by standard methods. Suitable culture media for this purpose 
include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. 
Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified from the 

24 culture medium or ascites fluid by conventional immunoglobulin purification procedures such 
as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, 
dialysis, or affinity chromatography. 

The monoclonal antibodies can also be made by recombinant DNA methods, such as 

28 those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the 
invention can be readily isolated and sequenced using conventional procedures {e.g., by using 
oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and 
light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred 

32 source of such DNA. Once isolated, the DNA can be placed into expression vectors, which are 
then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or 
myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of 
monoclonal antibodies in the recombinant host cells. The DNA also can be modified, for 
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example, by substituting the coding sequence for human heavy and light chain constant domains 
in place of the homologous murine sequences (U.S. Patent No. 4,816,567; Morrison, Nature 368 , 
812-13 (1994)) or by covalently joining to the immunoglobulin coding sequence all or part of the 
coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin 
polypeptide can be substituted for the constant domains of an antibody of the invention, or can 
be substituted for the variable domains of one antigen-combining site of an antibody of the 
invention to create a chimeric bivalent antibody. 

5.13.2 Humanized Antibodies 

The antibodies directed against the protein antigens of the invention can further comprise 
humanized antibodies or human antibodies. These antibodies are suitable for administration to 
humans without engendering an immune response by the human against the administered 
immunoglobulin. Humanized forms of antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab') 2 or other antigen- 
binding subsequences of antibodies) that are principally comprised of the sequence of a human 
immunoglobulin, and contain minimal sequence derived from a non-human immunoglobulin. 
Humanization can be performed following the method of Winter and co-workers (Jones et al., 
Nature . 321:522-525 (1986); Riechmann et al., Nature . 332:323-327 (1988); Verhoeyen et al., 
Science . 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. (See also U.S. Patent No. 5,225.539.) In some 
instances, Fv framework residues of the human immunoglobulin are replaced by corresponding 
non-human residues. Humanized antibodies can also comprise residues which are found neither 
in the recipient antibody nor in the imported CDR or framework sequences. In general, the 
humanized antibody will comprise substantially all of at least one, and typically two, variable 
domains, in which all or substantially all of the CDR regions correspond to those of a non-human 
immunoglobulin and all or substantially all of the framework regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at 
least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin (Jones et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. Struct. Biol. . 
2:593-596 (1992)). 

5.13.3 Human Antibodies 

Fully human antibodies relate to antibody molecules in which essentially the entire 
sequences of both the light chain and the heavy chain, including the CDRs, arise from human 
genes. Such antibodies are termed "human antibodies", or "fully human antibodies" herein. 
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Human monoclonal antibodies can be prepared by the trioma technique; the human B-cell 
hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV hybridoma 
technique to produce human monoclonal antibodies (see Cole, et al., 1985 In: Monoclonal 
4 Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human monoclonal 

antibodies may be utilized in the practice of the present invention and may be produced by using 
human hybridomas (see Cote, et al., 1983. ProcNatl Acad Sci USA 80: 2026-2030) or by 
transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et al., 1985 In: 
8 Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

In addition, human antibodies can also be produced using additional techniques, 
including phage display libraries (Hoogenboom and Winter, J. Mol. Biol. , 227:381 (1991); 
Marks et al., J. Mol. Biol. . 222:581 (1991)). Similarly, human antibodies can be made by 

12 introducing human immunoglobulin loci into transgenic animals, e.g. , mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in humans 
in all respects, including gene rearrangement, assembly, and antibody repertoire. This approach 

16 is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 

5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10. 779-783 (1992)); Lonberg et al. 
(Nature 368 856-859 (1994)); Morrison ( Nature 368 . 812-13 (1994)); Fishwildet aU Nature 
Biotechnology 14, 845-51 (1996)); Neuberger (Nature Biotechnology 14, 826 (1996)); and 

20 Lonberg and Huszar (Intern. Rev. Immunol. 13 65-93 ( 1 995)). 

Human antibodies may additionally be produced using transgenic nonhuman animals 
which are modified so as to produce fully human antibodies rather than the animal's endogenous 
antibodies in response to challenge by an antigen. (See PCT publication WO94/02602). The 

24 endogenous genes encoding the heavy and light immunoglobulin chains in the nonhuman host 
have been incapacitated, and active loci encoding human heavy and light chain immunoglobulins 
are inserted into the host's genome. The human genes are incorporated, for example, using yeast 
artificial chromosomes containing the requisite human DNA segments. An animal which 

28 provides all the desired modifications is then obtained as progeny by crossbreeding intermediate 
transgenic animals containing fewer than the full complement of the modifications. The 
preferred embodiment of such a nonhuman animal is a mouse, and is termed the Xenomouse™ 
as disclosed in PCT publications WO 96/33735 and WO 96/34096. This animal produces B 

32 cells which secrete fully human immunoglobulins. The antibodies can be obtained directly from 
the animal after immunization with an irnmunogen of interest, as, for example, a preparation of a 
polyclonal antibody, or alternatively from immortalized B cells derived from the animal, such as 
hybridomas producing monoclonal antibodies. Additionally, the genes encoding the 
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immunoglobulins with human variable regions can be recovered and expressed to obtain the 

antibodies directly, or can be further modified to obtain analogs of antibodies such as, for 

example, single chain Fv molecules. 
4 An example of a method of producing a nonhuman host, exemplified as a mouse, lacking 

expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent No. 

5,939,598. It can be obtained by a method including deleting the J segment genes from at least 

one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of the 
8 locus and to prevent formation of a transcript of a rearranged immunoglobulin heavy'chain locus, 

the deletion being effected by a targeting vector containing a gene encoding a selectable marker; 

and producing from the embryonic stem cell a transgenic mouse whose somatic and germ cells 

contain the gene encoding the selectable marker. 
12 A method for producing an antibody of interest, such as a human antibody, is disclosed in 

U.S. Patent No. 5,916,771. It includes introducing an expression vector that contains a 

nucleotide sequence encoding a heavy chain into one mammalian host cell in culture, introducing 

an expression vector containing a nucleotide sequence encoding a light chain into another 
16 mammalian host cell, and fusing the two cells to form a hybrid cell. The hybrid cell expresses an 

antibody containing the heavy chain and the light chain. 

In a further improvement on this procedure, a method for identifying a clinically relevant 

epitope on an immunogen, and a correlative method for selecting an antibody that binds 
20 immunospecifically to the relevant epitope with high affinity, are disclosed in PCT publication 

WO 99/53049. 

5.13.4 F a b Fragments and Single Chain Antibodies 

24 According to the invention, techniques can be adapted for the production of single-chain 

antibodies specific to an antigenic protein of the invention (see e.g., U.S. Patent No. 4,946,778). 
In addition, methods can be adapted for the construction of F a b expression libraries (see e.g., 
Huse, et al., 1989 Science 246: 1275-1281) to allow rapid and effective identification of 

28 monoclonal F a b fragments with the desired specificity for a protein or derivatives, fragments, 

analogs or homologs thereof. Antibody fragments that contain the idiotypes to a protein antigen 
may be produced by techniques known in the art including, but not limited to: (i) an F (ab )2 
fragment produced by pepsin digestion of an antibody molecule; (ii) an F ab fragment generated 

32 by reducing the disulfide bridges of an F (ab ') 2 fragment; (iii) an F ab fragment generated by the 
treatment of the antibody molecule with papain and a reducing agent and (iv) F v fragments. 

5.13.5 Bispecific Antibodies 
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Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that 
have binding specificities for at least two different antigens. In the present case, one of the 
binding specificities is for an antigenic protein of the invention. The second binding target is any 
other antigen, and advantageously is a cell-surface protein or receptor or receptor subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the 
recombinant production of bispecific antibodies is based on the co-expression of two 
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
specificities (Milstein and Cuello, Nature . 305 :537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a 
potential mixture of ten different antibody molecules, of which only one has the correct 
bispecific structure. The purification of the correct molecule is usually accomplished by affinity 
chromatography steps. Similar procedures are disclosed in WO 93/08829, published 13 May 
1993, and in Traunecker eta!., 1991 EMBOJ., 10:3655-3659. 

Antibody variable domains with the desired binding specificities (antibody-antigen 
combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 
preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part of 
the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region 
(CHI) containing the site necessary for light-chain binding present in at least one of the fusions. 
DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the immunoglobulin 
light chain, are inserted into separate expression vectors, and are co-transfected into a suitable 
host organism. For further details of generating bispecific antibodies see, for example, Suresh et 
ah, Methods in Enzvmology. 121:210 (1986). 

According to another approach described in WO 96/2701 1, the interface between a pair 
of antibody molecules can be engineered to maximize the percentage of heterodimers which are 
recovered from recombinant cell culture. The preferred interface comprises at least a part of the 
CH3 region of an antibody constant domain. In this method, one or more small amino acid side 
chains from the interface of the first antibody molecule are replaced with larger side chains {e.g. 
tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the large side 
chain(s) are created on the interface of the second antibody molecule by replacing large amino 
acid side chains with smaller ones {e.g. alanine or threonine). This provides a mechanism for 
increasing the yield of the heterodimer over other unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments {e.g. 
F(ab') 2 bispecific antibodies). Techniques for generating bispecific antibodies from antibody 
fragments have been described in the literature. For example, bispecific antibodies can be 
prepared using chemical linkage. Brennan et al., Science 229:81 (1985) describe a procedure 
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wherein intact antibodies are proteolytically cleaved to generate F(ab') 2 fragments. These 
fragments are reduced in the presence of the dithiol complexing agent sodium arsenate to 
stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' fragments 
4 generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB 
derivatives is then reconverted to the Fab' -thiol by reduction with mercaptoethylamine and is 
mixed with an equimolar amount of the other Fab'-TNB derivative to form the bispecific 
antibody. The bispecific antibodies produced can be used as agents for the selective 
8 immobilization of enzymes. 

Additionally, Fab' fragments can be directly recovered from E. coli and chemically 
coupled to form bispecific antibodies. Shalaby et aL, J. Exp. Med. 175:217-225 (1992) describe 
the production of a fully humanized bispecific antibody F(ab') 2 molecule. Each Fab' fragment 

12 was separately secreted from E. coli and subjected to directed chemical coupling in vitro to form 
the bispecific antibody. The bispecific antibody thus formed was able to bind to cells 
overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic activity 
of human cytotoxic lymphocytes against human breast tumor targets. 

16 Various techniques for making and isolating bispecific antibody fragments directly from 

recombinant cell culture have also been described. For example, bispecific antibodies have been 
produced using leucine zippers. Kostelny et al., J. Immunol. 148(5):1547-1553 (1992). The 
leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' portions of two 

20 different antibodies by gene fusion. The antibody homodimers were reduced at the hinge region 
to form monomers and then re-oxidized to form the antibody heterodimers. This method can 
also be utilized for the production of antibody homodimers. The "diabody" technology 
described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an 

24 alternative mechanism for making bispecific antibody fragments. The fragments comprise a 
heavy-chain variable domain (V H ) connected to a light-chain variable domain (V L ) by a linker 
which is too short to allow pairing between the two domains on the same chain. Accordingly, 
the Vh and V L domains of one fragment are forced to pair with the complementary V L and V H 

28 domains of another fragment, thereby forming two antigen-binding sites. Another strategy for 
making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been 
reported. See, Gruber et al., J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific 

32 antibodies can be prepared. Tutt et al., J. Immunol. 147:60 (1 991). 

Exemplary bispecific antibodies can bind to two different epitopes, at least one of which 
originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm of an 
immunoglobulin molecule can be combined with an arm which binds to a triggering molecule on 
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a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), or Fc receptors for 
IgG (Fc R), such as Fc R I (CD64), Fc R1I (CD32) and Fc RIII (CD1 6) so as to focus 
cellular defense mechanisms to the cell expressing the particular antigen. Bispecific antibodies 
4 can also be used to direct cytotoxic agents to cells which express a particular antigen. These 
antibodies possess an antigen-binding arm and an arm which binds a cytotoxic agent or a 
radionuclide chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another bispecific antibody 
of interest binds the protein antigen described herein and further binds tissue factor (TF). 

8 

5.13.6 Heterocon jugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. 
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies 

12 have, for example, been proposed to target immune system cells to unwanted cells (U.S. Patent 
No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 92/200373; EP 03089). 
It is contemplated that the antibodies can be prepared in.vitro using known methods in synthetic 
protein chemistry, including those involving crosslinking agents. For example, inimunotoxins 

16 can be constructed using a disulfide exchange reaction or by forming a thioether bond. 
Examples of suitable reagents for this purpose include iminothiolate and methyl-4- 
mercaptobutyrimidate and those disclosed," for example, in U.S. Patent No. 4,676,980. 

20 5.13.7 Effector Function Engineering 

It can be desirable to modify the antibody of the invention with respect to effector function, so as 
to enhance, e.g., the effectiveness of the antibody in treating cancer. For example, cysteine 
residue(s) can be introduced into the Fc region, thereby allowing interchain disulfide bond 

24 formation in this region. The homodimeric antibody thus generated can have improved 

internalization capability and/or increased complement-mediated cell killing and antibody- 
dependent cellular cytotoxicity (ADCC). See Caron et al., J. Exp Med., 176: 1 191-1 195 (1992) 
and Shopes, J. Immunol., 148: 2918-2922 (1992). Homodimeric antibodies with enhanced anti- 

28 tumor activity can also be prepared using heterobifunctional cross-linkers as described in Wolff 
et al. Cancer Research, 53: 2560-2565 (1993). Alternatively, an antibody can be engineered that 
has dual Fc regions and can thereby have enhanced complement lysis and ADCC capabilities. 
See Stevenson et al., Anti-Cancer Drug Design, 3: 219-230 (1989). 

32 

5.13.8 Immunoconjugates 

The invention also pertains to immunoconjugates comprising an antibody conjugated to a 
cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin of 
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bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope (i.e., a 
radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been 
described above. Enzymatically active toxins and fragments thereof that can be used include 
diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain (from 
Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, 
Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins (PAPI, PAPII, and 
PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, 
mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of 
radionuclides are available for the production of radioconjugated antibodies. Examples include 
2,2 Bi, 131 I, 13l In, 9 °Y,and I86 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional 
protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), 
iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCL), 
active esters (such as disuccinimidyl suberate), aldehydes (such as glutareldehyde), bis-azido 
compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as 
bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), 
and bis-active fluorine compounds (such as l,5-difluoro-2,4-dinitrobenzene). For example, a 
ricin irnmunotoxin can be prepared as described in Vitetta et al., Science, 238: 1098 (1987). 
Carbon- 1 4-labeled l-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX- 
DTPA) is an exemplary chelating agent for conjugation of radionucleotide to the antibody. See 
WO94/11026. 

In another embodiment, the antibody can be conjugated to a "receptor" (such 
streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is 
administered to the patient, followed by removal of unbound conjugate from the circulation 
using a clearing agent and then administration of a "ligand" (e.g., avidin) that is in turn 
conjugated to a cytotoxic agent. 

4.14 COMPUTER READABLE SEQUENCES 

In one application of this embodiment, a nucleotide sequence of the present invention can 
be recorded on computer readable media. As used herein, "computer readable media" refers to 
any medium which can be read and accessed directly by a computer. Such media include, but 
are not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, and 
magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM 
and ROM; and hybrids of these categories such as magnetic/optical storage media. A skilled 



84 



WO 01/64835 



PCT/US01/04927 



artisan can readily appreciate how any of the presently known computer readable mediums can 
be used to create a manufacture comprising computer readable medium having recorded thereon 
a nucleotide sequence of the present invention. As used herein, "recorded" refers to a process for 
storing information on computer readable medium. A skilled artisan can readily adopt any of the 
presently known methods for recording information on computer readable medium to generate 
manufactures comprising the nucleotide sequence information of the present invention. 

A variety of data storage structures are available to a skilled artisan for creating a 
computer readable medium having recorded thereon a nucleotide sequence of the present 
invention. The choice of the data storage structure will generally be based on the means chosen 
to access the stored information. In addition, a variety of data processor programs and formats 
can be used to store the nucleotide sequence information of the present invention on computer 
readable medium. The sequence information can be represented in a word processing text file, 
formatted in commercially-available software such as WordPerfect and Microsoft Word, or 
represented in the form of an ASCII file, stored in a database application, such as DB2, Sybase, 
Oracle, or the like. A skilled artisan can readily adapt any number of data processor structuring 
formats (e.g. text file or database) in order to obtain computer readable medium having recorded 
thereon the nucleotide sequence information of the present invention. 

By providing any of the nucleotide sequences SEQ ID NO: 1-13901 or a representative 
fragment thereof; or a nucleotide sequence at least 95% identical to any of the nucleotide 
sequences of SEQ ID NO: 1-13901 in computer readable form, a skilled artisan can routinely 
access the sequence information for a variety of purposes. Computer software is publicly 
available which allows a skilled artisan to access sequence information provided in a computer 
readable medium. The examples which follow demonstrate how software which implements the 
BLAST (Altschul et al., J. Mol. Biol. 215:403-410 (1990)) and BLAZE (Brutlag et al., Comp. 
Chem. 17:203-207 (1993)) search algorithms on a Sybase system is used to identify open reading 
frames (ORFs) within a nucleic acid sequence. Such ORPs may be protein encoding fragments 
and may be useful in producing commercially important proteins such as enzymes used in 
fermentation reactions and in the production of commercially useful metabolites. 

As used herein, "a computer-based system" refers to the hardware means, software 
means, and data storage means used to analyze the nucleotide sequence information of the 
present invention. The minimum hardware means of the computer-based systems of the present 
invention comprises a central processing unit (CPU), input means, output means, and data 
storage means. A skilled artisan can readily appreciate that any one of the currently available 
computer-based systems are suitable for use in the present invention. As stated above, the 
computer-based systems of the present invention comprise a data storage means having stored 
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therein a nucleotide sequence of the present invention and the necessary hardware means and 
software means for supporting and implementing a search means. As used herein, "data storage 
means" refers to memory which can store nucleotide sequence information of the present 
4 invention, or a memory access means which can access manufactures having recorded thereon 
the nucleotide sequence information of the present invention. 

As used herein, "search means" refers to one or more programs which are implemented 
on the computer-based system to compare a target sequence or target structural motif with the 
8 sequence information stored within the data storage means. Search means are used to identify 
fragments or regions of a known sequence which match a particular target sequence or target 
motif. A variety of known algorithms are disclosed publicly and a variety of commercially 
available software for conducting search means are and can be used in the computer-based 

12 systems of the present invention. Examples of such software includes, but is not limited to, 
Smith- Waterman, MacPattern (EMBL), BLASTN and BLASTA (NPOL YPEPTIDEI A) . A 
skilled artisan can readily recognize that any one of the available algorithms or implementing 
software packages for conducting homology searches can be adapted for use in the present 

1 6 computer-based systems. As used herein, a "target sequence" can be any nucleic acid or amino 
acid sequence of six or more nucleotides or two or more amino acids. A skilled artisan can 
readily recognize that the longer a target sequence is, the less likely a target sequence will be 
present as a random occurrence in the database. The most preferred sequence length of a target 

20 sequence is from about 10 to 300 amino acids, more preferably from about 30 to 100 nucleotide 
residues. However, it is well recognized that searches for commercially important fragments, 
such as sequence fragments involved in gene expression and protein processing, may be of 
shorter length. 

24 As used herein, "a target structural motif," or "target motif," refers to any rationally 

selected sequence or combination of sequences in which the sequence(s) are chosen based on a 
three-dimensional configuration which is formed upon the folding of the target motif. There are 
a variety of target motifs known in the art. Protein target motifs include, but are not limited to, 

28 enzyme active sites and signal sequences. Nucleic acid target motifs include, but are not limited 
to, promoter sequences, hairpin structures and inducible expression elements (protein binding 
sequences). 



32 4.15 TRIPLE HELIX FORMATION 

In addition, the fragments of the present invention, as broadly described, can be used to 
control gene expression through triple helix formation or antisense DNA or RNA, both of which 
methods are based on the binding of a polynucleotide sequence to DNA or RNA. 
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Polynucleotides suitable for use in these methods are preferably 20 to 40 bases in length and are 
designed to be complementary to a region of the gene involved in transcription (triple helix - see 
Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 15241:456 (1988); and Dervan 
et al., Science 25 1 : 1 360 (1 991)) or to the mRNA itself (antisense - Olmno, J. Neurochem. 
56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA 
transcription from DNA, while antisense RNA hybridization blocks translation of an mRNA 
molecule into polypeptide. Both techniques have been demonstrated to be effective in model 
systems. Information contained in the sequences of the present invention is necessary for the 
design of an antisense or triple helix oligonucleotide. 

4.16 DIAGNOSTIC ASSAYS AND KITS 

The present invention further provides methods to identify the presence or expression of 
one of the ORFs of the present invention, or homolog thereof, in a test sample, using a nucleic 
acid probe or antibodies of the present invention, optionally conjugated or otherwise associated 
with a suitable label. 

In general, methods for detecting a polynucleotide of the invention can comprise 
contacting a sample with a compound that binds to and forms a complex with the polynucleotide 
for a period sufficient to form the complex, and detecting the complex, so that if a complex is 
detected, a polynucleotide of the invention is detected in the sample. Such methods can also 
comprise contacting a sample under stringent hybridization conditions with nucleic acid primers 
that anneal to a polynucleotide of the invention under such conditions, and amplifying annealed 
polynucleotides, so that if a polynucleotide is amplified, a polynucleotide of the invention is 
detected in the sample. 

In general, methods for detecting a polypeptide of the invention can comprise contacting 
a sample with a compound that binds to and forms a complex with the polypeptide for a period 
sufficient to form the complex, and detecting the complex, so that if a complex is detected, a 
polypeptide of the invention is detected in the sample. 

In detail, such methods comprise incubating a test sample with one or more of the 
antibodies or one or more of the nucleic acid probes of the present invention and assaying for 
binding of the nucleic acid probes or antibodies to components within the test sample. 

Conditions for incubating a nucleic acid probe or antibody with a test sample vary. 
Incubation conditions depend on the format employed in the assay, the detection methods 
employed, and the type and nature of the nucleic acid probe or antibody used in the assay. One 
skilled in the art will recognize that any one of the commonly available hybridization, 
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amplification or immunological assay formats can readily be adapted to employ the nucleic acid 
probes or antibodies of the present invention. Examples of such assays can be found in Chard, 
T., An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science Publishers, 
4 Amsterdam, The Netherlands (1986); Bullock, G.R. et al., Techniques in Immunocytochemistry, 
Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice 
and Theory of immunoassays: Laboratory Techniques in Biochemistry and Molecular Biology, 
Elsevier Science Publishers, Amsterdam, The Netherlands (1985). The test samples of the 
8 present invention include cells, protein or membrane extracts of cells, or biological fluids such as 
sputum, blood, serum, plasma, or urine. The test sample used in the above-described method 
will vary based on the assay format, nature of the detection method and the tissues, cells or 
extracts used as the sample to be assayed. Methods for preparing protein extracts or membrane 

12 extracts of cells are well known in the art and can be readily be adapted in order to obtain a 
sample which is compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain the 
necessary reagents to carry out the assays of the present invention. Specifically, the invention 

1 6 provides a compartment kit to receive, in close confinement, one or more containers which 
comprises: (a) a first container comprising one of the probes or antibodies of the present 
invention; and (b) one or more other containers comprising one or more of the following: wash 
reagents, reagents capable of detecting presence of a bound probe or antibody. 

20 In detail, a compartment kit includes any kit in which reagents are contained in separate 

containers. Such containers include small glass containers, plastic containers or strips of plastic 
or paper. Such containers allows one to efficiently transfer reagents from one compartment to 
another compartment such that the samples and reagents are not cross-contaminated, and the 

24 agents or solutions of each container can be added in a quantitative fashion from one 

compartment to another. Such containers will include a container which will accept the test 
sample, a container which contains the antibodies used in the assay, containers which contain 
wash reagents (such as phosphate buffered saline, Tris-buffers, etc.), and containers which 

28 contain the reagents used to detect the bound antibody or probe. Tj'pes of detection reagents 
include labeled nucleic acid probes, labeled secondary antibodies, or in the alternative, if the 
primary antibody is labeled, the enzymatic, or antibody binding reagents which are capable of 
reacting with the labeled antibody. One skilled in the art will readily recognize that the disclosed 

32 probes and antibodies of the present invention can be readily incorporated into one of the 
established kit formats which are well known in the art. 

4.17 MEDICAL IMAGING 
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The novel polypeptides and binding partners of the invention are useful in medical 
imaging of sites expressing the molecules of the invention {e.g. , where the polypeptide of the 
invention is involved in the immune response, for imaging sites of inflammation or infection). 
See, e.g., Kunkel et al., U.S. Pat. NO. 5,413,778. Such methods involve chemical attachment of 
a labeling or imaging agent, administration of the labeled polypeptide to a subject in a 
pharmaceutically acceptable carrier, and imaging the labeled polypeptide in vivo at the target 
site. 

4.18 SCREENING ASSAYS 

Using the isolated proteins and polynucleotides of the invention, the present invention 
further provides methods of obtaining and identifying agents which bind to a polypeptide 
encoded by an ORF corresponding to any of the nucleotide sequences set forth in SEQ ID NO: 
1-13901 , or bind to a specific domain of the polypeptide encoded by the nucleic acid. In detail, 
said method comprises the steps of: 

(a) contacting an agent with an isolated protein encoded by an ORF of the present 
invention, or nucleic acid of the invention; and 

(b) determining whether the agent binds to said protein or said nucleic acid. 
In general, therefore, such methods for identifying compounds that bind to a 

polynucleotide of the invention can comprise contacting a compound with a polynucleotide of 
the invention for a time sufficient to form a polynucleotide/compound complex, and detecting 
the complex, so that if a polynucleotide/compound complex is detected, a compound that binds 
to a polynucleotide of the invention is identified. 

Likewise, in general, therefore, such methods for identifying compounds that bind to a 
polypeptide of the invention can comprise contacting a compound with a polypeptide of the 
invention for a time sufficient to form a polypeptide/compound complex, and detecting the 
complex, so that if a polypeptide/compound complex is detected, a compound that binds to a 
polynucleotide of the invention is identified. 

Methods for identifying compounds that bind to a polypeptide of the invention can also 
comprise contacting a compound with a polypeptide of the invention in a cell for a time 
sufficient to form a polypeptide/compound complex, wherein the complex drives expression of a 
receptor gene sequence in the cell, and detecting the complex by detecting reporter gene 
sequence expression, so that if a polypeptide/compound complex is detected, a compound that 
binds a polypeptide of the invention is identified. 

Compounds identified via such methods can include compounds which modulate the 
activity of a polypeptide of the invention (that is, increase or decrease its activity, relative to 
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activity observed in the absence of the compound). Alternatively, compounds identified via such 
methods can include compounds which modulate the expression of a polynucleotide of the 
invention (that is, increase or decrease expression relative to expression levels observed in the 
absence of the compound). Compounds, such as compounds identified via the methods of the 
invention, can be tested using standard assays well known to those of skill in the art for their 
ability to modulate activity/expression. 

The agents screened in the above assay can be, but are not limited to, peptides, 
carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be selected 
and screened at random or rationally selected or designed using protein modeling techniques. 

For random screening, agents such as peptides, carbohydrates, pharmaceutical agents and 
the like are selected at random and are assayed for their ability to bind to the protein encoded by 
the ORF of the present invention. Alternatively, agents may be rationally selected or designed. 
As used herein, an agent is said to be "rationally selected or designed" when the agent is chosen 
based on the configuration of the particular protein. For example, one skilled in the art can 
readily adapt currently available procedures to generate peptides, pharmaceutical agents and the 
like, capable of binding to a specific peptide sequence, in order to generate rationally designed 
antipeptide peptides, for example see Hurby et al., Application of Synthetic Peptides: Antisense 
Peptides," In Synthetic Peptides, A User's Guide, W.H. Freeman, NY (1992), pp. 289-307, and 
Kaspczak et al., Biochemistry 28:9230-8 (1989), or pharmaceutical agents, or the like. 

In addition to the foregoing, one class of agents of the present invention, as broadly ' 
described, can be used to control gene expression through binding to one of the ORFs or EMFs 
of the present invention. As described above, such agents can be randomly screened or 
rationally designed/selected. Targeting the ORF or EMF allows a skilled artisan to design 
sequence specific or element specific agents, modulating the expression of either a single ORF or 
multiple ORFs which rely on the same EMF for expression control. One class of DNA binding 
agents are agents which contain base residues which hybridize or form a triple helix formation 
by binding to DNA or RNA. Such agents can be based on the classic phosphodiester, 
ribonucleic acid backbone, or can be a variety of sulfhydryl or polymeric derivatives which have 
base attachment capacity. 

Agents suitable for use in these methods preferably contain 20 to 40 bases and are 
designed to be complementary to a region of the gene involved in transcription (triple helix - see 
Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan et 
al., Science 251:1360 (1991)) or to the mRNA itself (antisense - Okano, J. Neurochem. 56:560 
(1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca 
Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA transcription 
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from DNA, while antisense RNA hybridization blocks translation of an mRNA molecule into 
polypeptide. Both techniques have been demonstrated to be effective in model systems. 
Information contained in the sequences of the present invention is necessary for the design of an 

4 antisense or triple helix oligonucleotide and other DNA binding agents. 

Agents which bind to a protein encoded by one of the ORPs of the present invention can 
be used as a diagnostic agent. Agents which bind to a protein encoded by one of the ORFs of the 
present invention can be formulated using known techniques to generate a pharmaceutical 

8 composition. 

4.19 USE OF NUCLEIC ACIDS AS PROBES 

Another aspect of the subject invention is to provide for polypeptide-specific nucleic acid 
12 hybridization probes capable of hybridizing with naturally occurring nucleotide sequences. The 
hybridization probes of the subject invention may be derived from any of the nucleotide 
sequences SEQ ID NOS: 1-13901. Because the corresponding gene is only expressed in a 
limited number of tissues, a hybridization probe derived from of any of the nucleotide sequences 
16 SEQ ID NO: 1-13901 can be used as an indicator of the presence of RNA of cell type of such a 
tissue in a sample. 

Any suitable hybridization technique can be employed, such as, for example, in situ 
hybridization. PCR as described in US Patents Nos. 4,683,195 and 4,965,188 provides 

20 additional uses for oligonucleotides based upon the nucleotide sequences. Such probes used in 
PCR may be of recombinant origin, may be chemically synthesized, or a mixture of both. The 
probe will comprise a discrete nucleotide sequence for the detection of identical sequences or a 
degenerate pool of possible sequences for identification of closely related genomic sequences. 

24 Other means for producing specific hybridization probes for nucleic acids include the 

cloning of nucleic acid sequences into vectors for the production of mRNA probes. Such vectors 
are known in the art and are commercially available and may be used to synthesize RNA probes 
in viti o by means of the addition of the appropriate RNA polymerase as T7 or SP6 RNA 

28 polymerase and the appropriate radioactively labeled nucleotides. The nucleotide sequences may 
be used to construct hybridization probes for mapping their respective genomic sequences. The 
nucleotide sequence provided herein may be mapped to a chromosome or specific regions of a 
chromosome using well known genetic and/or chromosomal mapping techniques. These 

32 techniques include in situ hybridization, linkage analysis against known chromosomal markers, 
hybridization screening with libraries or flow-sorted chromosomal preparations specific to 
known chromosomes, and the like. The technique of fluorescent in situ hybridization of 
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chromosome spreads has been described, among other places, in Verma et al (1988) Human 
Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York NY. 

Fluorescent in situ hybridization of chromosomal preparations and other physical 

4 chromosome mapping techniques may be correlated with additional genetic map data. Examples 
of genetic map data can be found in the 1994 Genome Issue of Science (265: 198 If). Correlation 
between the location of a nucleic acid on a physical chromosomal map and a specific disease (or 
predisposition to a specific disease) may help delimit the region of DNA associated with that 

8 genetic disease. The nucleotide sequences of the subject invention may be used to detect 
differences in gene sequences between normal, carrier or affected individuals. 

4.20 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES 

Oligonucleotides,/.^, small nucleic acid segments, may be readily prepared by, for 
12 example, directly synthesizing the oligonucleotide by chemical means, as is commonly practiced 

using an automated oligonucleotide synthesizer. 

Support bound oligonucleotides may be prepared by any of the methods known to those of 

skill in the art using any suitable support such as glass, polystyrene or Teflon. One strategy is to 
1 6 precisely spot oligonucleotides synthesized by standard synthesizers. Immobilization can be 

achieved using passive adsorption (Inouye & Hondo, (1 990) J. Clin. Microbiol. 28(6) 1469-72); 

using UV light (Nagata et al, 1985; Dahlen et al, 1987; Morrissey & Collins, (1 989) Mol. Cell 

Probes 3(2) 1 89-207) or by covalent binding of base modified DNA (Keller et al, 1 988; 1 989); all 
20 references being specifically incoiporated herein. 

Another strategy that may be employed is the use of the strong biotin-streptavidin 

interaction as a linker. For example, Broude et al. (1 994) Proc. Natl. Acad. Sci. USA 91(8) 3072-6, . 

describe the use of biotinylated probes, although these are duplex probes, that are immobilized on 
24 streptavidin-coated magnetic beads. Streptavidin-coated beads may be purchased from Dynal, 

Oslo. Of course, this same linking chemistry is applicable to coating any surface with streptavidin. 

Biotinylated probes may be purchased from various sources, such as, e.g., Operon Technologies 

(Alameda, C A). 

28 Nunc Laboratories (Naperville, IL) is also selling suitable material that could be used. Nunc 

Laboratories have developed a method by which DNA can be covalently bound to the microwell 
surface termed Covalink NH. CovaLink NH is a polystyrene surface grafted with secondary amino 
groups (>NH) that serve as bridge-heads for further covalent coupling. CovaLink Modules may be 

32 purchased from Nunc Laboratories. DNA molecules may be bound to CovaLink exclusively at the 
5'-end by a phosphoramidatebond, allowing immobilization of more than 1 pmol of DNA 
(Rasmussenef a/., (1991) Anal. Biochem. 198(1) 138-42). 
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The use of CovaLink NH strips for covalent binding of DNA molecules at the 5'-end has 
been described (Rasmussen et al., (1991). In this technology, a phosphoramidatebond is employed 
(Chu et al., (1983) Nucleic Acids Res. 1 1(8) 6513-29). This is beneficial as immobilization using 
4 only a single covalent bond is preferred. The phosphoramidate bond joins the DNA to the 

CovaLink NH secondary amino groups that are positioned at the end of spacer arms covalently 
grafted onto the polystyrene surface through a 2 nm long spacer arm. To link an oligonucleotide to 
CovaLink NH via an phosphoramidate bond, the oligonucleotide terminus must have a 5'-end 
8 phosphate group. It is, perhaps, even possible for biotin to be covalently bound to CovaLink and 
then streptavidin used to bind the probes. 

More specifically, the linkage method includes dissolving DNA in water (7.5 ng/ul) and 
denaturing for 1 0 min. at 95°C and cooling on ice for 1 0 min. Ice-cold 0.1M1 -memylimidazole, 
12 pH 7.0 (1-Melm 7 ), is then added to a final concentration of 10 mM 1-Melm 7 . A ss DNA solution is 
then dispensed into CovaLink NH strips (75 ul/well) standing on ice. 

Carbodiimide 0.2 M l-emyI-3-(3-dimethylaminopropyI)-carbodiimide(EDC), dissolved in 
10 mM 1-Melm 7 , is made fresh and 25 ul added per well. The strips are incubated for 5 hours at 
1 6 50°C. After incubation the strips are washed using, e.g. , Nunc-Immuno Wash; first the wells are 
washed 3 times, then they are soaked with washing solution for 5 min., and finally they are washed 
3 times (where in the washing solution is 0.4 N NaOH, 0.25% SDS heated to 50°C). 

It is contemplated that a further suitable method for use with the present invention is that 
20 described in PCT Patent Application WO 90/03 3 82 (Southern & Maskos), incorporated herein by 
reference. This method of preparing an oligonucleotide bound to a support involves attaching a 
nucleoside 3'-reagent through the phosphate group by a covalent phosphodiester link to aliphatic 
hydroxyl groups carried by the support. The oligonucleotide is then synthesized on the supported 
24 nucleoside and protecting groups removed from the synthetic oligonucleotide chain under standard 
conditions that do not cleave the oligonucleotide from the support. Suitable reagents include 
nucleoside phosphoramidite and nucleoside hydrogen phosphorate. 

An on-chip strategy for the preparation of DNA probe for the preparation of DNA probe 
28 arrays may be employed. For example, addressable laser-activated photodeprotection may be 

employed in the chemical synthesis of oligonucleotides directly on a glass surface, as described by 
Fodor et al. (1 99 1 ) Science 25 1 (4995) 767-73, incorporated herein by reference. Probes may also 
be immobilized on nylon supports as described by Van Ness et al. ( 1 99 1 ) Nucleic Acids Res. 
32 19(12) 3345-50; or linked to Teflon using the method of Duncan & Cavalier (1 988) Anal. Biochem. 
169(1) 104-8; all references being specifically incorporated herein. 
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To link an oligonucleotide to a nylon support, as described by Van Ness et al. (199 1), 
requires activation of the nylon surface via alkylation and selective activation of the S'-amine of 
oligonucleotides with cy anuric chloride. 

One particular way to prepare support bound oligonucleotides is to utilize the 
light-generated synthesis described by Pease et al, (1 994) PNAS USA 91 (1 1) 5022-6, incorporated 
herein by reference). These authors used current photolithographictechniques to generate arrays of 
immobilized oligonucleotide probes (DNA chips). These methods, in which light is used to direct 
the synthesis of oligonucleotide probes in high-density, miniaturized arrays, utilize photolabile 
5-protected A^-acyl-deoxynucleosidephosphoramidites, surface linker chemistry and versatile 
combinatorial synthesis strategies. A matrix of 256 spatially defined oligonucleotide probes may be 
generated in this manner. 

4.21 PREPARATION OF NUCLEIC ACID FRAGMENTS 

The nucleic acids may be obtained from any appropriate source, such as cDNAs, genomic 
DNA, chromosomal DNA, microdissected chromosome bands, cosmid or YAC inserts, and RNA, 
including mRNA without any amplification steps. For example, Sambrook et al. (1 989) describes 
three protocols for the isolation of high molecular weight DNA from mammalian cells (p. 
9.14-9.23). 

DNA fragments may be prepared as clones in Ml 3, plasmid or lambda vectors and/or 
prepared directly from genomic DNA or cDNA by PCR or other amplification methods. Samples 
may be prepared or dispensed in multiwell plates. About 1 00-1 000 ng of DNA samples may be 
prepared in 2-500 ml of final volume. 

The nucleic acids would then be fragmented by any of the methods known to those of skill 
in the art including, for example, using restriction enzymes as described at 9.24-9.28 of Sambrook et 
al. ( 1 989), shearing by ultrasound and NaOH treatment. 

Low pressure shearing is also appropriate, as described by Schriefer et al. (1 990) Nucleic 
Acids Res. 1 8(24) 7455-6, incorporated herein by reference). In this method, DNA samples are 
passed through a small French pressure cell at a variety of low to intermediate pressures. A lever 
device allows controlled application of low to intermediate pressures to the cell. The results of 
these studies indicate that low-pressure shearing is a useful alternative to sonic and enzymatic DNA 
fragmentation methods. 

One particularly suitable way for fragmenting DNA is contemplated to be that using the two 
base recognition endonuclease, Cv/JI, described by Fitzgerald et al. (1 992) Nucleic Acids Res. 
20(1 4) 3753-62. These authors described an approach for the rapid fragmentation and fractionation 
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of DNA into particular sizes that they contemplated to be suitable for shotgun cloning and 
sequencing. 

The restriction endonuclease Cv/'JI normally cleaves the recognition sequence PuGCPy 
between the G and C to leave blunt ends. Atypical reaction conditions, which alter the specificity of 
this enzyme (Cv/JI**), yield a quasi-random distribution of DNA fragments form the small 
molecule pUC 1 9 (2688 base pairs). Fitzgerald et at. ( 1 992) quantitatively evaluated the 
randomness of this fragmentation strategy, using a CviJl** digest of pUC19 that was size 
fractionated by a rapid gel filtration method and directly ligated, without end repair, to a lac Z minus 
M 1 3 cloning vector. Sequence analysis of 76 clones showed that Cvz'JI* * restricts pyGCPy and 
PuGCPu, in addition to PuGCPy sites, and that new sequence data is accumulated at a rate 
consistent with random fragmentation. 

As reported in the literature, advantages of this approach compared to sonication and 
agarose gel fractionation include: smaller amounts of DNA are required (0.2-0.5 ug instead of 2-5 
ug); and fewer steps are involved (no preligation, end repair, chemical extraction, or agarose gel 
electrophoresis and elution are needed 

Irrespective of the manner in which the nucleic acid fragments are obtained or prepared, it is 
important to denature the DNA to give single stranded pieces available for hybridization. This is 
achieved by incubating the DNA solution for 2-5 minutes at 80-90°C. The solution is then cooled 
quickly to 2°C to prevent renaturation of the DNA fragments before they are contacted with the 
chip. Phosphate groups must also be removed from genomic DNA by methods known in the art. 

4.22 PREPARATION OF DNA ARRAYS 

Arrays may be prepared by spotting DNA samples on a support such as a nylon membrane. 
Spotting may be performed by using arrays of metal pins (the positions of which correspond to an 
array of wells in a microtiter plate) to repeated by transfer of about 20 nl of a DNA solution to a 
nylon membrane. By offset printing, a density of dots higher than the density of the wells is 
achieved. One to 25 dots may be accommodated in 1 mm 2 , depending on the type of label used. By 
avoiding spotting in some preselected number of rows and columns, separate subsets (subarrays) 
may be formed. Samples in one subarray may be the same genomic segment of DNA (or the same 
gene) from different individuals, or may be different, overlapped genomic clones. Each of the 
subarrays may represent replica spotting of the same samples. In one example, a selected gene 
segment may be amplified from 64 patients. For each patient, the amplified gene segment may be in 
one 96- well plate (ail 96 wells containing the same sample). A plate for each of the 64 patients is 
prepared. By using a 96-pin device, all samples may be spotted on one 8x12 cm membrane. 
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Subarrays may contain 64 samples, one from each patient. Where the 96 subarrays are identical, the 
dot span may be 1 mm 2 and there may be a 1 mm space between subarrays. 

Another approach is to use membranes or plates (available from NUNC, Naperville, Illinois) 
which may be partitioned by physical spacers e.g. a plastic grid molded over the membrane, the grid 
being similar to the sort of membrane applied to the bottom of multiwell plates, or hydrophobic 
strips. A fixed physical spacer is not preferred for imaging by exposure to flat phosphor-storage 
screens or x-ray films. 

The present invention is illustrated in the following examples. Upon consideration of the 
present disclosure, one of skill in the art will appreciate that many other embodiments and variations 
may be made in the scope of the present invention. Accordingly, it is intended that the broader 
aspects of the present invention not be limited to the disclosure of the following examples. The 
present invention is not to be limited in scope by the exemplified embodiments which are intended 
as illustrations of single aspects of the invention, and compositions and methods which are 
functionally equivalent are within the scope of the invention. Indeed, numerous modifications and 
variations in the practice of the invention are expected to occur to those skilled in the art upon 
consideration of the present preferred embodiments. Consequently, the only limitations which 
should be placed upon the scope of the invention are those which appear in the appended claims. 

All references cited within the body of the instant specification are hereby incorporated by 
reference in their entirety. 

5.0 EXAMPLES 

5.1 EXAMPLE 1 

Novel Nucleic Acid Sequences Obtained From Various Libraries 

A plurality of novel nucleic acids were obtained from cDNA libraries prepared from various 
human tissues and in some cases isolated from a genomic library derived from human chromosome 
using standard PCR, SBH sequence signature analysis and Sanger sequencing techniques. The 
inserts of the library were amplified with PCR using primers specific for the vector sequences 
which flank the inserts. Clones from cDNA libraries were spotted on nylon membrane filters and 
screened with oligonucleotide probes {e.g., 7-mers) to obtain signature sequences. The clones were 
clustered into groups of similar or identical sequences. Representative clones were selected for 
sequencing. 

In some cases, the 5' sequence of the amplified inserts was then deduced using a typical 
Sanger sequencing protocol. PCR products were purified and subjected to fluorescent dye 
terminator cycle sequencing. Single pass gel sequencing was done using a 377 Applied Biosy stems 
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(ABI) sequencer to obtain the novel nucleic acid sequences. In some cases RACE (Random 
Amplification of cDNA Ends) was performed to further extend the sequence in the 5' direction. 

4 5.2 EXAMPLE 2 

Novel Contigs 

The novel contigs of the invention were assembled from sequences that were obtained from 
a cDNA library by methods described in Example 1 above, and in some cases sequences obtained 
8 from one or more public databases. The sequences for the resulting nucleic acid contigs are 

designated as SEQ ID NO: 1 -1 3901 and are provided in the attached Sequence Listing. The contigs 
were assembled using an EST sequence as a seed. Then a recursive algorithm was used to extend 
the seed EST into an extended assemblage, by pulling additional sequences from different databases 

12 (i.e., Hyseq's database containing EST sequences, dbEST version 1 14, gb pri 114, and UniGene 
version 101) that belong to this assemblage. The algorithm terminated when there was no 
additional sequences from the above databases that would extend the assemblage. Inclusion of 
component sequences into the assemblage was based on a BLASTN hit to the extending 

1 6 assemblage with BLAST score greater than 300 and percent identity greater than 95%. 

Table 3 sets forth the novel predicted polypeptides (including proteins) encoded by the 
no vel polynucleotides (SEQ ID NO: l-13901)of the present invention, and their corresponding 
nucleotide locations to each of SEQ ID NO: : 1-13901. Table 3 also indicates the method by which 

20 the polypeptide was predicted. Method A refers to a polypeptide obtained by using a software 

program called FASTY (available from http://fasta.bioch.virginia.edu ) which selects a polypeptide 
based on a comparison of the translated novel polynucleotide to known polynucleotides (W.R. 
Pearson, Methods in Enzymology, 1 83:63-98 (1 990), herein incorporated by reference). Method B 

24 refers to a polypeptide obtained by using a software program called GenScan for human/vertebrate 
sequences (available from Stanford University, Office of Technology Licensing) that predicts the 
polypeptide based on a probabilistic model of gene structure/compositionalproperties (C. Burge 
and S. Karlin, J. Mol. Biol., 268:78-94 (1997), incorporated herein by reference). Method C refers 

28 to a polypeptide obtained by using a Hyseq proprietary software program that translates the novel 
polynucleotide and its complementary strand into six possible amino acid sequences (forward and 
reverse frames) and chooses the polypeptide with the longest open reading frame. 

The nearest neighbor results for SEQ ID NO: 1-13901 were obtained by a BLASTP 

32 version 2.0al 19MP-WashU search against Genpept release 120 and Geneseq database release 
200101 (Derwent), using BLAST algorithm. The nearest neighbor result showed the closest 
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homologue for SEQ ID NO'S: 1-13901. The nearest neighbor results for SEQ ID NO: 1-13901 
are shown in Table 2 below. 

Tables 1 , 2 and 3 follow. Table 1 shows the various tissue sources of SEQ ID NOS: 1 - 
13901. Table 2 shows the nearest neighbor result for the assembled contig. The nearest neighbor 
result shows the closest homolog with an identifiable function for each assemblage. Table 3 
contains the start and stop nucleotides for the translated amino acid sequence for which each 
assemblage encodes. Table 3 also provides a con-elation between the amino acid sequences set forth 
in the Sequence Listing, the nucleotide sequences set forth in the Sequence Listing and the SEQ ID 
NO. in USSN 09/51 5,126 
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TABLE 1 



Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 


adult brain 


GIBCO 


AB3001 


83 544 597-598 600-607 616 841 1004 1148 1346 1493 1974 
2138 2141 2143 2161 2266 2345 2363 2511 2569 2876 2880 
3001 3099-3101 3 1 05-3 1 06 3 1 1 0-3 111 311 5-3 117 3199 3272 
3282 3284 3356 3425 3537 3634 3689 3709 3797 3810 3839 
3899 4006 4021-4022 4025 4043 4194 4201 4253 4277 4297 
4388 4399 4410 4667 4671 4722 4747-4748 4750 4755 4767 
4845 4865 4940 5037 5075 5093 51 18 5163 5171-5172 5268 
5481 5523 5553 5656 5724 5894 5902 5938 6052 6170-6 173 
6176 6214 6307 6336 6369 6374 6793 6894-6897 6979 7058 
7169 7455 7492-7493 7495-7499 7501 7504 7577 7586 7761 
7792 7864 7870 8035 8065 8085 8110 8120 8140 8224 8226 
8298 8372 8427 8452 S456 8535 8648 8672 8674-S679 8681- 
8684 8816 8838-8839 8870 8898 9012 9041 9079 9128 9257 
9264 9304 9317 9460 9503 9517 9567 9623 9734 9781 9792- 
9798 9929 9964 9999 10296 10330 10469-10470 10578 10679 
10778 10786 10895 10984-10986 11032 11052 11069 11130 
11145 11239 11289 1 1402 1 1818 1 1862 1 1870-1 1 876 1 1878- 
11881 12017 12037 12127 12160 12294 12363 12375 12405 
12424 12438 12467 12539 12570 12590 12615-12616 12618 
12685 12688 12712 12739 12748 12830 12913 12916 12948- 
12950 13002 13064 13073 13083 13141 13150 13153 13164- 
13166 13257 13391 13456 13479 13489 13492 13494 13499 
13501 13503 13560 13595-13596 13627 13645 13679 13782 
13795 13861 13866 13869 13882 


adult brain 


GIBCO 


ABD003 


67 83 142 443 587 598 608-609 6 1 1 613-624 633 731 734 737- 
742 760 799-800 809 1148 1152 1 167-1 184 1 193 1346 1433 
1516 1552 1575 1671 1756 1774 1833 1974 21382145 2176- 
2178 2237 2266 2299-2301 2303-2306 2343 2363 2412 2444 
2449 251 1 2516 2555 2569 2576 2614 2716 2809 2876 291 1 
2926 300 1 3093 3 1 1 4 3 1 1 9 3 1 2 1 -3 1 24 3 1 26 3 128-3 1 30 3234 
3254-3256 3258-3263 3265-3267 3270-3274 3276-3277 3280- 
3281 3284 3286 3348 3356 3378 3435 3459 3484 3537 3548 
3595 3605 3625 3627 3634 3686-3697 3700 3702 3709 371 1 
3720 3722 3737 3757 3797 3804 3810 3839 3856 4006 401 9 
4025 4040 4055 4057-4058 4060 4078 4194 4201 4246 4253 
4277 4282 43 90 4405 44 12 4431 4620 4622 464 1 4689 475 1 - 
4764 4791 4808 4837 4845 4847-4849 4852-4858 4860-4862 
4864-4869 4940 4957 4962 4972 4998 5021 5031 5037-5038 
5040 5076 5093 5108 5118 5167 5169 5171-5172 5251-5261 
5263-5265 5270 5364 5401 5481 5492 5521 5523 5535 5656 
5674 5693 5766 5788 581 7 5906-5909 5938 6005 6027 6057 
6064 6147 6178 6180-6182 6189 6214 6229-6233 6254 6272 
6369 6371 6421-6426 6555 6595 6598 6601 6799 6803 6825 
6836 6886 6894 6913 6972 6995 7058 7104 7130 7133 7148 
7164 7169 7339 7347 7386 7426 7455 7494 7502 7507 7509 
751 1-7512 7516 7520 7584-7587 7590-7596 7598-7601 7603- 
7604 7608 7632 7677 7743 7748 7761 7768 7792 7797 7807 
7815 7839 7849-7861 7864 7870 7930 7937 8035 8065 8067 
S080 8087 8095 81 10 8120 8139-8140 8209 8224 8226 8235 
8246 8262 8285 8298 8320 8323 8336 8354 8361 8365 8370 
8375 8387 8452 8456 8535 8556 8576-8577 8603 8630 8648 
8674 8685-8686 8688-8690 8693 8695 8702 8712 8742 8760- 
8761 8763-8764 8766-8769 8813 8815-8816 8830 8834 8838- 
8839 8848 8863 8870 8898 8921 8943-8944 8951 8989 9010 
9041 9050-9056 9058-9064 9076 9079 9092 9097 9128 9144- 
9145 9257 9264 9271 9278-9279 9304 9315 9317 9455 9466 
9472 9475 9480 9503 951 1 9517 9525 9539 9689 9734 9773 
9781 9791 9799-9802 9847 9852 9873 9928-9929 9964 9999- 
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Tissue 
Origin 


RNA 


Library 
Name 


SEQ ID NOS: 








10001 10003-10004 10053 10175 10267 10276 10330 10349 
10434 10449 10466 10471-10474 10492 10496 10509 10566 
10578 10597-10599 10645 10679 10788 10891 10948 10988- 
10990 11032 11039 11041-11043 11052 11065 11069 11105 
11108 11130 11145 11167 11196 1 1203-1 1207 1 1209 11239 
11399 11401-11402 11406 11459 1 1470 1 1 604 1 1606 11642 
11761 11818 11862 11877 11882-11884 1 1 886 1 1 889-1 1 893 
11944-11946 11981 11988 12016 12019 12022 12037 12083 
12127 12143 12164-12165 12168-12171 12178 12195 12236 
12265 12305 12327 12363 12375 12405 12423-12424 12430 
12438 12546 12570 12590 12594 12612 12615-12618 12630 
12670 12674 126S5-12688 12693 12704 12706-12707 12748 
12772 12830 12885 12904 12913 12916 12923 12933 12951 
12956 12993 13001 13020-13021 13038 13047 13064 13072- 
13073 13084-13085 13092 13117 13142 13167-13171 13191 
13254 13257 13260 13295 13390-13391 13394 13456 13479 
13483 13489 13497 13501 13503 13505-13507 13512 13516 
13546 13551 13555 13575 13590 13592 13597 13613-13614 
13645 13649 13659 1371 1 13782 13795 13838 13861 13869 
13875 13882 13884-13885 13888 13892 13896 


adult brain 


Clontech 


ABR001 


142 858 1542 2174 2407 2483 2652 3272 3287 3460 3492 3535 
3595 3737 3839 4005 4060 4282 4434 4791 4972 5040 5293 
5523 5530 5535 5788 5906 6082 6601 6799 6980 7373 7577 
7587 7759 7788 7851 8081-8082 81 10 9167 9455 9466 9781 
9928 10422 10774 10791 11069 1 1401 11406 11459 11604 
11607 11791 11818 11865 11961 11979 12022 12122 12160 
12327 12442 12594 12615 12640 12670 12705 12935 12957 
12985 13047 13197 13257 13456 13511-13512 13546 13554 
13646 13793 13885 13889 13893 


adult brain 


Clontech 


ABR006 


6 67 1004 1908 3272 3286 3548 401 1 4282 4998 5923 5928 
6374 6730 6815 6867 6890 7067 8365 9264 9729 9780 10776 
11587 11618 12596 12601 12605 12704 12749 12754 12951 
13047 13051 13090 13479 13488 13498-13499 13503 13512 
13575 138S2 


adult brain 


Clontech 


ABR008 


6 11 21 41 51 88 142 364 376 579 598 651 736 800 1050 1148 
1184 1251-1265 1291 1346 1404 1479 1529 1543 1671-1674 
1697 1699-1710 1820 1830 1832-1838 1840 1848-1849 1908 
1914 1919 1927 1957 1964 1974 1976 1978-1979 2005-2006 
2050 2081 2090 2110-2111 2129 2150 2174 2200 2310 2327 
2342 2408-2410 2420 2444 2449 2461-2467 2484 2490 2499- 
2506 251 1 2553 2574 2576 261 J 2652 2809 2827 2866 2894 
3032 3207 3535 3591 3610 3634 3715 3722 3737 3766-3770 
3819 4006-4007 4011 4025 4032 4060 4078 4095 4109 4128 
4143-4155 4182 4194 4247-4257 4277 4282 4294 4296 43 10 
4330 4348 4355 4360 4381 4395 4399 441 1 4431 4543 4641 
4662 4694 4698 4767 4781 4791 4808 4833 4837 4985 5001 
5022 5040 5075 5094 5108 5163 5303-5306 5308-5314 5320 
5380 5523 5553 5615 5625-5626 5634 5638-5644 5701 5706 
5711 5727-5742 5766 5772 5775 5783 5801 5814 5817 5820- 
5821 5829 5837 5851 5855 5858 5864 5867 5874 5885 5890 
5897 5901 5906 5923 6057 6125 6214 6223 6288 6302 6456- 
6464 6545 6598 6601-6602 6624 6676-6685 6699 6726 6728 
6746-6749 6765 6799 6805 6854 6860 6893-6894 7004-7007 
7049 7076 7078 7081-7083 7105 7117 71 19 7133 7153 7166 
7431 7579 7708 7768 7849 7900-7905 8018 8083-8084 8095 
8110 8196-8208 8262 8288 8312 8320 8331 8336 8356 8375 
8452 8482 8633 8681 8710 8739 8777 8815 8817 8830 8839 
8963 8965 8983 9010 9097 9100 9102-9108 91 1 1 9128 9142 
9257 9264 9313 9364 9378-9384 9401 9454-9455 9458 9460- 
9464 9503 9509 951 1 9515-9516 9522 9528-9529 9533 9539 
9542 9544 9573 9577 9646 9773 9780 9924 10000 10025- 
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Tissue 
Origin 


RNA 
Source 


Libra ry 
Name 


SEQ 1DNOS: 








10029 10097 10148 10209 10218-10223 10225-10229 10267 
10275-10276 10284 10292 10296 10303 10313 10326 10329- 
10331 10333-10334 10343 10346 10393 10430 10494 10496 
10513 10542 10557 10613-10616 10679 10688 10691 10719- 
10721 10729 10743 10758 10760 10776 10782 10786 10795 
10895 11100 11114 11132 11195 1 1240 1 1242-1 1254 1 1401 
1 1406 1 1437 1 1454 3 1459 J 1462 1 1466-1 1473 1 1475-1 1476 
11519 11532-11535 11550 11553 11555-1 1556 1 1559 11588- 
11589 11606 11615 11618 11621 11627-11628 11633 11761 
11791 11S07 11818 11932 11955 12006 12039 12041 12092 
12212-12224 12231 12236 12305 12363 12368 12396-12399 
12405 12424 12439 12442 12465-12466 12468 12477 12479- 
12482 12490 12521 12546 12552-12553 12576-12577 12579 
12582 12585 12590 12601 12609 12617-12618 12636 12658 
12707 12725 12735-12736 12749 12754 12776-12779 12859 
12868 12894 12905 12909 12940 12955 12959 12977 12980 
12990 13002 13004-13005 13020 13035 13038 13042-13044 
13047 13051-13052 13056 13062 13073 13082-13083 13196 
13249 13280 13311-13312 13336 13387 13417 13421-13426 
13436 13445 13456 13458 13479-13482 33488 13490 13494- 
13495 13497-13500 13503 13507 13512 13516 13533 13546 
13554-13555 13590 13613 13630 13649 13659 13670 13678 
13713 13724 13769 13793-13794 13808 13827-13828 13838 
13861 13867-13868 13875 13882 13884-13885 13888-13889 
13893 13896 13898 


adult brain 


Clontech 


ABR011 


1006 1257 3797 4006 4025 5535 6057 7169 7870 8262 8937 
8966 9257 10778 12736 13394 13679 13793 13861 


adult brain 


BioChain 


ABR012 


88 598 1007 1134 2597 3557 3590 3627 3797 4006 4192 4246 
4282 439) 4940 5523 5535 6288 6338 7138 8110 8898 9076 
9401 9455 9476 10772 1 1061 11114 12989 13394 1351 1 13866 


adult brain 


Invitrogen 


ABR013 


598 2614 3191 4355 4391 5523 5788 8085 8486 11513 12521 
12989 13861 


adult brain 


Invitrogen 


ABT004 


4051 598 1050-1057 1148 1777-1778 1947 1976 2270-2272 
2327 2490 261 7 3050 3600-3602 3722 3987 4390-4391 4434 
4543 4689 5031 5157-5159 5167 5169 5466 5505 5682-5683 
5701 5766 5778 5794 5902 6147 6367-6371 6459 6545 6709 
6728 6783 6801 6971 7104 7175 7815 7839 7864 8139 8342 
8345 8355 8363 8372 8452 8633 8963 8975-8976 9012 9133 
9423-9424 9511 9515 9517 9528 9556 9827 9949 10260 10267 
10275 10570-10571 10733 10767 11132 11159 11406 13459 
11932 12009 12092 12109-12111 12127 12283 12428 12511 
12579 12605 12725 12747 12830 12885-12886 12910 12913 
12954 12987-12989 13051 13054 13062 13073 13090 13249- 
13253 13438 13445 13456 13489 13500 13512 13516 13533 
13546 13590 13622 13649 13683-13684 13713 13803 13838 
13861 13866 13896 


cultured 
preadipocytes 


Strategene 


ADP001 


1 134 1346 2343 2614 3272 3426 3610 3 720 3 83 9 3 885 401 1 
4277 4282 4297 4346 4388 4391 4405 4434 4641 4833 4940 
4985 5018 5030 5040 5163 5167 5523 5581 5778 5788 5794 
5895 5951 6082 6147 6272 6607 7067 7141 8093 8235 8285 
8312 8363 8629 8648 8830 8839 9290 9401 9466 9503 9781 
10346 30470 10776 10795 30973 3 1 108 3 1 170 1 1513 11818 
12034 12037 12046 12093 12375 12387 12405 12424 12570 
12636 12670 12674 12688 12735 12749 12913 12940 13126 
13163 13295 13489 13494 13497 13499 13511 13516 13575 
13652-13653 13866 13888-13889 


adrenal gland 


Clontech 


ADR002 


8 83 142 225 351 443 55 3 569 731 864 1 134 1266-1271 1273- 
1274 1276-1292 1294-1295 1381 1391 1544-1545 1658 1671 
1908 1959 1983 2010 2023 2145 2175 2283 2310 2328-2334 
2343 2444 2449 2510 2522 2576 3032 3069 3153 3166 3272 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 








3378 3416 3548 3625 3709 3711 3771-3788 3790-3791 3797 
3839 3870 3899 3985 4019 4054 4060 4109 4173 4192 4194 
4201 4253 4277 4282 4389 4391 4395-4396 4431 4667 4687 
4694 4783 4957 4966 4985 4998 5037 5 1 08 5293 5316-5323 
5325-5336 5481 5505 5527 5531-5533 5560 5628 5656 5701 
5766 5865 5874 5902 5939 5979 6020 6052 6057 6227 6288 
6354 6405 6449 6465-6482 6521 6603 6726 6894 6979 7008- 
7011 7155-7156 7204 7604 7634 7845 7871 7906-7912 7915- 
79 1 8 7920-7930 8022 8067 8085-8086 8095 8110 8116 8224 
8262 8363 8365 8412 8520 8535 8554 8699 8742 8831 8870 
8950-8951 9002 9010 9012 9097 9109-91 19 9121-9127 9190 
9264 9280 9304 9317 9455 9457 9495 9503 9553 9556 9586 
9709 9780-9781 9895 9927 10007 10030-10032 10034-10035 
10037-10045 10119 10277 10284 10346 10595 10617-10620 
10623-10627 10645 10675 10679 10760-10761 10766 10774 
10782 11108 11194 11255 1 1258-1 1270 1 1289 1 1400 11406 
11457 11519 11588 11600 11621 11626 11818 11952 12080 
12159 12225 12227-12230 12232-12237 12239-12240 12242- 

19940 199^1 19Tfi^ 193A"3 T~>Af\A nine lO/fO/f T>,t?ft 

12577 12599 12601 12630 12636 12657-12658 12663 12688 
12693 12706 12713 12715 12735 12749 12754 12780-12787 
12848 12863 12909 12913 12923 12957 12990 13006-13007 
13020-13021 13064 13095 13104 13295 13313 13315 13388- 
13389 13391 13456 13473 13494 13507 13515 13546 13613 
33631 13679 13725-13733 13858 13866 13872 13SS3-13884 
13888-13889 


adult heart 


G1BCO 


AHR001 


51 83 88 94 221 239 360 366-367 404 410-411 413 415 458- 
459 461 465-468 471 473-478 486 545-546 559 567 616 625- 
630 743-744 799 802-806 808 810-835 837-842 959 1004 1066- 
1085 1134 1178 11S4-1193 1346 1512 1516 1546-3547 1556 
1575 1671 1727 1774 1829 1959 1976-1977 2090 2096 2108 
2110 2128 2138 2145-2147 2161 2179 2195-2198 2257 2276 
2278-2281 2302 2307-2309 2363 2398-2399 2409 2411-2412 
2444 2449 2497 25 1 6 2529 2563 2569 2575-2576 2597 2605 
2614 2617 2762 2809 2816 2879-2880 291 1-2924 2926 2961 
2978-2980 2985-2986 2993 2995-3002 3032 3042 3051 3058 
3069 3081 3091-3094 3109 3114 3132-3133 3135 3137-3138 
3141 3191 31963199 32153263 3272 3282-3286 3317 3349- 
3350 3353-3359 3361-3362 3364-3367 3370-3393 3396-3399 
3403 3406 3425 3451 3465-3466 3479 3500 3503 3537-3538 
3544 3548 3550 3555 3557 3590-3591 3595 3604 3606-3612 
3614-3621 3623-3627 3634 3689 3697-3698 3701-3709 371 1- 
3713 3720 3722 3737 3757 3797 3839 3885 3898 3988-3989 
3996 4005-4006 4008 4010-401 1 4019 4021-4022 4025 4040 
4043 4054-4055 4058-4060 4078 4183 4192 4194 4201 4246 
4253 4269 4277 4282 4341 4351 4391 4403 4405 4434 4517 
4543 4553 4590 4615 4622 4631 4633-46344641 4654 4664 
4666-4667 4672-4675 4688-4689 4693-4694 4738 4740 4755 
4783 4828 4870-4871 4905-4906 4909-4915 4917-4919 4921- 
4924 4926-49274940 4957 4972 4985 4991 4998 5001 5030 
5037-5038 5040 5076 5118 5163 5171-5175 5177-5178 5180- 
5182 5266-5268 5380 5422 5481 5490 5492 5506-5507 5523 
5534-5535 5581 5656 5682 571 1 5740 5766 5788 5895 5908 
5923-5924 5938 6047 6057 6083 6085 6104 6110-6111 6117 
6147 6184-6185 6189 6195 6215 6255-6257 6259-6266 6268 
6272 6288 6307 6336 6375-6379 6382 6427-6429 6458 6555 
6588 6595 6607 6629 6665 6689 6765 6767 6799 6815 6871- 
6873 6879 6881-6882 6898 6926-6928 6975-6978 6987 7058 
7090 7148 73 13-7314 7341 7343-7344 7346-7347 7350 7354 
7359 7362 7373 7380 7394 7402 7407-7408 7410 7413 741 5- 
7416 7418-7419 7426 7431 7468 7473 7480 7494 7505 7513- 
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Origin 


R1NA 
Source 


Library 

Name 


SEQ ID NOS: 








7516 7520 7544 7561 7584 7587 7599 7601-7604 7635 7638- 
7643 7645-7649 7651-7655 7657 7659-7662 7733 7735 7743 
7748 7783-7796 7815 7852 7857 7862-7863 7865-7870 7930 
7933 7983 8062-8065 8067 8087-8088 8093 8095 8110 8] 16 
8120 8139-8140 8224 8226 8235 8262 8298 8336 8344-8345 
8354 8356 8363 8368 8372 8378 8387 8410 8427 8452 8456 
8531-8532 8534-8535 8563-8569 8572 8576-8577 8592-8593 
8597 8603 8606 8610 8613-8614 8616-8617 8646 8648 8670 
8681 8691 8698-8699 8702 8712 8742 8756 8760 8763 8795 
8807-8809 8811-8814 8816-8819 8821-8833 8835-8839 8858 
8863 8870 8898 8921 8927 8936 8939 8943 8946 8950-8951 
8966 8988-8992 8994-8995 9010 9049 9059 9065-9067 9070- 
9072 9076 9097-9098 9167 9190 9257-9260 9262 9269 9281 
9287 9301 9304 9315 9317 9401 9454-9455 9466 9476 9480 
9484 9556 9577 9612 9689 9698 9720-9721 9734 9741-9743 
9747-9750 9758 9781 9791 9804 9871-9882 9884-9885 9928 
9939 9942 9954-9960 9999-10000 10005 10175 10179 10275 
10284 10292 10296 10329-10331 10346 10400 10422 10430- 
10431 10437 10442 10444-10447 10475 10511-10512 10514- 
10521 10557 10576-10577 10616 10645 10679 10691 10729 
10742 10744 10772 10774 10777-10778 10782 10788 10839 
10891 10894-10895 10902 10917-10920 10937 10942 10946 
10948 10969-10970 10992 11032 11044-11045 11061 11066- 
11074 11108 11114 11132 11145 11153 1 1 165-1 1 170 1 1 173 
11205 11208 11210-11215 11283 11289 11386 11388 11401- 
11402 11406 11462 11559 11565 11576 11596 11606 11615 
11620 11744 11788-11789 11793 1 1 818 1 1823-1 1 825 11831- 
11832 11869 11894 11947 11961 11982-1198912000 12006 
12009 12011 12019-12020 12028 12037 12044 12078 12081 
12093 12119-12122 12143 12160 12166 12172-12175 12177- 
12179 12197 12305 12335 12363 12375-12376 12383 12387 
12400 12402 12405 12424 12428 12438 12479 12521 12523 
12546 12560 12564 12570 12590 12599 12601 12605 12609 
12611 12616-12618 12653 12662-12663 12670 12674 12688 
12717-12720 12734 12752 12754 12772 12905 12907 12934 
12916-12917 12923 12925 12940 12961 12963-12965 12989 
12991 12993 13020-13021 13033 13035 13072-13073 13082- 
13083 13104 13117 13126 13132 13136 13141-13142 13148 
13169 13203-13210 13212 13260-13261 13293 13295 13326 
13377 13394 13413 13442 13456 13477 13480 13488 13490 
13494-13496 13501-13503 13506-13507 13511 13516 13533 
13568 13570-13571 13575 13582 13592 13613 13624-13630 
13632 13644 13646 13659-13660 13678-13679 13689 13701 
13711-13713 13775 13782 13795 13797 13866-13869 13872 
13882 13884-13885 13893 


adult kidney 


GIBCO 


AKD001 


49 67 83-84 142 354 405-407 415-429 431-432 445 460 462 
479-484 486 488 492-493 524 548-549 59S 616 63 1-638 744 
787 809 841 1004 1068 1086 1160 1163-1166 1171 1184 1193 
1346 1359 1449 1479 1516 1552 1556 1671 1724 1727 1774 
1826 1858 1914 1974 1976 1978-1979 2081 2097-2099 2111- 
2113 2117-21182129-213021382145 2148-21502161 2186 
2218 2266 2268 2302 2310 2327 2343 2363 2409 2412 2444 
2449 2468 2483 2523 2569 2576 2614 26 1 7 2827 2845 2876 
2910 2915 2926-2931 2933-2934 2938-2943 2945-2947 2955- 
2956 2976 2981 2983-2984 3001 -3016 301 8 3053 3109 31 14 
3 140 3 142-3 146 3 149-3 1 50 3 1 99 3254 3265 3272 3283-3284 
3286 3333 3350 3356 3378 3406 3435 3445 3460 3492 3503 
3535 3537 3544 3548 3590-3591 3598 3625 3627 3634 3676- 
3677 3679-3685 3689 3697 3709 371 1 3720 3722 3737 3757 
3797 3808 3810 3839 3885 3989 4005-40064011 4019 4022 I 
4025 4040 4043 4054-4055 4060 4078 4109 4192 4194 4201 j 
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Origin 


RNA 
Source 


Library 
Name 


SEQ1DNOS: 








4246 4253 4269 4277 4341 4354 4387-4388 4390 4398 4402 
4405 4410 4434 4473 4540 4543 4583 4591-4592 4607 4637 
4640-4641 4649 4654 4665 4668 4676-4677 4680-4685 4688- 
4689 47 1 1-47 12 475 1 4758 4766-4768 4783 4808 4833 4836- 
4837 4845 4874 4904 4940 4957 4962-4963 4972 4983 4991 
4998 5022 5037 5040 5076 5093 51 18 5143 5163 5171-5172 
5246-5249 5281 5362 5364 5380 5422 5481 5521 5523 5526- 
5527 5535 5656 5693 5726 5740 5766 5769 5778 5788 5794 
5894 5902 5908 591 1 5916-5917 5923 5928 5938 5990 6005 
6049-6051 6057 6078 6082 6086 6088 6105-6107 6112-6117 
6120-6122 6147 6159 618^-6189 6215 6257 6272 6288 6369 
6418 6423 6430 6555 6595 6598 6738 6743 6765 6767 6793 
6799 6805 68 1 5 6836 6857 6867-6870 6872-6873 6878 6884 
6886 6893 6899-6900 6931 6947 6976 6979 6988 7024 7045 
7058 7104 71 18 7155 7275 7281 7306 7312 7338 7347-7348 
7351-7352 7355 7357 7359-7363 7366 7369 7371-7373 7381 
7383 7386 7400 7402 7404-7406 7408 7420 7422 7424 7426- 
7428 7430-7431 7433-7435 7439 7455 7465-7468 7494 7502 
7506 75 1 7-752 1 7579 7587 7604 7634 7639 7642 7733 7735 
7743 7748 7792 7797 7839 7842-7846 7848 7852 7857 7862 
7864 7870 7930 7933 8065 8077 8084 8087 8093 8095 8105 
8110 8116 8120 8139-8140 8156 8224 8226 8235 8262 8320 
8336 8345 8351 8354 8359 8364 8368 8370 8372 8375 8377 
8387 8427 8452 8456 8461 8486 8518 8520 8527 8535 8563- 
8564 8566 8572-8573 8575-8577 8580-8583 8585 8588-8589 
8597 8603 8618-8621 8623 8625-8626 8628 8630 8634 8647- 
8650 8664 8674 8677-8678 8691 8694 8701-8704 8706 871 1 
8722 8740 8742 8763 8792 8798 8806 8812-8813 8816 8830 
8838-8839 8848 8853 8863 8S76 8898 8921 8935 8943-8944 
8948 8951 8966 8989 9012 9041 9047-9049 9076 9092 9128 
9 1 37 9 1 67 92 1 8 9244 9249 9257 9264 9287 9304 93 1 5 93 17 
9424 9455-9457 9466 9472 9475 9484 9503 9511 9517 9529 
9536 9542 9544 9558 9612 9683 9699 9703 9722-9723 9725 
9734 9744 9752-9755 9758 9772-9773 9780-9781 9805-9807 
9841 9843 9927-9929 9939 9942 9969 9998-10000 10007 
10175 10275 10284 10287 10292 I'03I9 10346 10376 10423- 
10424 10431 10442 10446 10448 10450 10475 10480 10496 
10542 10557 10645 10679 10774 10778 10782 10788 10895 
10921-10928 10947 10949-10953 10986 10994-10996 1 1032 
11052 11061 11069 11114 11130 11132 11143 11145 11197- 
11202 11205 11208 11212 11239 1 1289 1 1401-1 1402 11406 
11459 11513 11586 11596 11604 1 1607 1 161 8 1 1620 11695 
11711 11759 11763 11790-11791 1 1793-1 1794 1 ISIS 11820- 
1 1 822 1 1 834-1 1 836 1 3 857 1 1 865 1 1 869 1 1 895- 1 1 899 1 1 950 
11970 12006 12022 12041 12078 12143.12159-12160 12178 
12195 12197 12256 12265 12305 12327 12359 12363 12375 
12387 12405 12424 12428 12438 12442 12467 12511 12521 
12523 12535 12539 12546 12564 12570 12590 12599 12601 
12609-1261 1 12616 12618 12630 12653 12656 12664 12666- 
12667 12670 12674 12688-12689 12691 12739 12754 12830 
12834 12904 12913-12914 12916-12917 12923 12940-12941 
12951-12952 12956 12972 12976-12977 12993 12999 13002 
13052 13062 13064 13066 13072-13073 13082-13083 13095 
13104 13127 13131 13133 13136 13141 13343-13146 13148 
13354-33155 13169 13172 13184 13195-1319613249 13254 
13260 13263 13280 13295 13307 13319 13394 13442 13456 
13477 13479-13480 134S8-13490 13492 13494-13495 13497- 
13503 13506-13507 13512 13515-13516 13546 13551 13554- 
13556 13572-1357413590 13592 13613 13627 13631 13644- 
13645 13656 13659-13660 13665 13670 13710 13713 13776 
13782 13795 13859-13860 13864 13866-13868 13872 13882- 
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Tissue 


RNA 


Library 


SEQ ID NOS: 


Origin 


Source 


Name 










13885 13888 13891-13892 


adult kidney 


Invitrogen 


AKT002 


67 142 550-552 598 639-641 1004 1015 1493 1516 1947 2215 
2299 2343 2363 2449 26 1 8 3054-3055 3 1 52-3 153 3 1 55-3 1 57 
3434 3535 3590 3709 3797 3808 3839 3885 401 1 4022 4025 
4040 4253 4277 4391 4405 4671 4759 4767 4769 4837 4949 
4972 5001 5035 5037 5052 5108 5526 5581 5615 5726 5788 
5895 6062 6139 6190-6191 6803 6900 6947 6975 7468-7469 
7473 7733 8095 81 10 8139 8262 8323 8361 8363 8375 8520 
8539 8648 871 1 8798 8912 8950 8966 8983 9076 9264 9368 
9510 9517 9665 9703 9734 10175 10476 10791 10989 10997 
11132 11618 11745 11900 12006 12039 12160 12363 12375 
12405 12424 12685 12702 12707 12904 13035 13060 13104 
13136 13295 13394 13456 13488 13495 13501 13512 13533 
13554 13583 13644 13670 13679 13696 13713 13795 13866 
13888 13891 


adult lung 


GIBCO 


ALG001 


83-84 553 598 642-644 650 747 975 1004 1009 1015-1022 
1449 1516 18162161 2186 2215 2343 2444 29683056 3158 
3160-3162 3345 3466 3503 3519 3566 3568-3573 3595 3709 
3797 3810 3839 3885 4025 4039 4059 4194 4405 4622 4755 
4767 4770 4797 4865 4940 4949 4963 4972 4987 4991 4998 
5001 5117-512? 5171-5172 5233 5401 5481 5581 5724 5788 
593 8 5960 6123 612 5 6140 6215 6322 6336 6343-6348 6371 
6886 6966-6967 7024 7143 7275 7441 7444 7502 7522-7523 
7749-7752 8093 81 10 8140 8336 8345 8375 8378 8449 8535 
8612 8622 8635 8648 8651 8674 8702 8707-8708 8838-8839 
8898 8941-8942 8944 8948 8951 9076 9128 9457 9466 9475 
9497 9503 9544 9567 9703 9756 9758 9808-9810 9843 9873 
9926 9999 10161 10175 10275 10329 10344 10400 10458 
10554-10555 10796 10998 11032 11046 11069 11132 11144- 
11145 11483 11627 11818 11837 11901 12006 12028 12039 
12041 12087-12092 12146 12363 12375 12424 12438 12539 
12570 12601 12617 12661 12670 12674 12688 12738 12749 
12754 12904 12940 12974 13062 33173 13195 13232 13234 
13260 33295 13307 13456 13472 13477 13490 13494 13497- 
13498 13569 13584 13592 13660 13663-13666 13670 13743 
13776 13882 13885 13889 13891 


lymph node 


Clontech 


ALN001 


83 142 364 487 495-497 554 629 645-646 648-650 716 938-951 
953-962 1134 1516 1549 1671 1774 1976 2138 2225-2232 
2234-2235 2340 2879 3019-3020 3057 3103 3163-3165 3272 
3356 3498-3505 3507-3513 3515-3516 3538 3548 3628 3697 
4194 4201 4253 4405 4641 4687-4688 4771-4772 4783 4808 
4845 4963 4972 4987 4998 5042-5065 5076 5163 5504 5523 
5835 5895 5917 6027 6142 6192 6272 6288 6308-63 1 1 6313- 
6314 6765 6805 6871 6949-6951 7700-7705 7707-7709 8085 
8088 8091 8110 8235 8375 8387 8432 8629 8631 8633 8648 
8677 8709 8713-8715 8830 8863 8887-8889 8891-8896 8943- 
8944 8966 9010 9076 91 1 1 9128 9142 9222 9455 9472 9520 
9544 9734 9774 9780 981 1 9905-9907 9928 9939 9999 10027 
10129 10296 10439 10452 10501 10543-10545 10679 10777- 
10778 10788 10891 10999-1 1000 1 1 1 1 1-1 1 1 13 11115 11130 
11145 11344 11406 11513 11584 11885 12006 12028 12050- 
12056 12143 12256 12363 12405 12442 12570 12674 12690 
12913 12917 12940 12968-12970 13173 13220-13221 13394 
13400 13492 13503 13511 13533 13630 13642 13645 13713 
13868 13885 13889 13891 


young liver 


GIBCO 


ALV001 


211 498 598 651-653 1008-1014 1193 1264 1575 1976-1977 
2131 2161 2254-2255 2269 2363 2568 2617 2627 2633 2636 
2961 3021 3059 3093 3166 3215 3272 3356 3378 3426 3479 
3559-3565 3590 3597 3627 3634 3673 3709 3797 3810 3885 
3993 4006 401 1 4019 4025 4194 4246 4253 4277 4422 4426 
4431 4434 4437-4438 4634 4654 4687 4714 4722 4755 4773- 
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Tissue 1 RNA 
Origin Source 


Library 
Name 


SEQ JD NOS: 








4774 4808 4957 5001 5037 5052 5113-51 15 5163 5167 5171 
5508 5510 5526 5581 5616 5693 5726 5757 5772 5788 5928 
6125 6141-6142 6193 6288 6340-6342 6589 6765 6815 6886 
6901 7024 7118 7141 7442 7524 7735 7747-7748 7792 7852 
7870 7924 7930 8066 81 10 8120 8262 8375 8378 851 8 8520 
8648 8652 8691 8702 8716-8718 8816 883S-8839 8938-8940 
9097 91 19 9263-9264 9301 9317 9424 9454-9455 9544 9689 
9703 9734 9758 9775 9925 9947 10393 10477 10769 10774 
1 1 001 1 1 1 39 1 1 140-1 114? 1117"? I 1908 1 1 11 R 1 t 1 <;07 
11588 11725 11804 11902 1 2081-12086 12118 12160 12 1 78 
12375 12377 12405 12424 12479 12511 12570 12599 12601 
12618 12670 12674 12688 12737 12830 12913 12916 12933 
12953 12977 12993 13047 13062 13073 13126 13169 13174 
13230-13231 13295 13488-13489 13494 13498-13499 33502 
13506-13507 13575 13592 13646 13660-13662 13670 13866 
13868-13869 13882 13888 13901 


adult liver 


Invitrogen 


ALV002 


6 25 60 142 598 1004 1213 1296-1301 1381 1493 1513-1514 
1977 2139 2161 2269 2310 2335 2342 2400 2413 2449 2458 
2497 2568-2569 2576 2636 2809 2827 2880 2926 3471 3484 
3503 3597 3722 3792-3796 3885 3994-3995 4025 4060 4201 
4326 4422 4426 461 8 4689 4767 4786 4985 4998 5093 5163 
5337-5339 551 1 5526 5581 5693 5701 5724 5726 5757 5895 
5922-5923 5979 6020 6027 6083 6125 6192 6195 6253 6333 
6346 6483-6485 6716 6765 6797 6975 7169 7422 7468 7614 
7642 7807 7932 8089 8110 8140 8262 8323 8378 8633 8677 
871 1 8740 8759 8786 8834 8839 8944 9002 9128-9129 9264- 
9265 9282 9466 9484-9485 951 7 9677 9700 9773 10007 10046- 
10047 10135 10616 10669 10777 11032 1 1 1 14 1 1 132 1 1 142 

1 1 1 QA 1 197 1 1 1 IRQ 1 1 A£S7 1 1 <JfY7 1 1 ^An 1 T ^K7 1 l <cm I 1 oi c 
1 1 1 \ ijL l \ 1 1 Jo? 11 ^tOZ 1 1 jUZ 1 1 JUu 1 J Jo / 11 OUZ 1 1 o 1 0 

11980 11989 12086 12160 12195 12253-12254 12316 12327 
12363 12378-12379 12411 12424 12511 12570 12630 12693 
12706 12788-12789 12840 12863 12913-12914 13047 13062 
13072-13073 13090 13242 13256 13378-13379 13389 13479 
13487 13489 13497 13555 13734-13738 13859 13864 13889 


adult liver 


Clontech 


ALV003 


346 2529 3548 3797 3885 4438 4940 5103 5801 5902 9597 
10769 10778 1 1587 1 1927 12086 


adult ovary 


Invitrogen 


AOV001 


6 13 40 67 83 88 142 196 444 522 555-560 577 579 598 609 
654-657 659-665 667-668 708 740 745-749 751-754 756-761 
806 841 871 1004 1023 1028-1034 1036-1039 1041-1047 1055 
1087-1115 1119 1151-1162 1298 1346 1359 1456 1493 1507 
1516 1658 1697 1752 1774 1812 1826 1848 1914 1974 1976- 
1977 1979 1983 20992111 2138 2153-2155 2161 2175 2180- 
2 1 8 1 21 86 2258-2267 2284-2286 2288-2290 2298-2299 2342- 
2343 2449 2483 2523 2529 2555 2569 2576 2591 2597 2618 
2708 2750 2783 2818 2839 2926 2961-2962 2988 3007 3022 
3031 3060-3061 3063 3069 3166-3172 3174-3175 3177-3189 
3226 3272 3286 3288-3289 3291-3293 3295-3299 3435 3450 
3460 3479 3486 3503 3535 3577-3589 3612 3627 3629 3631- 
3643 3673 3675 3720 3722 3737 3797 3806 3810 3812-3813 
3839 3885 3985 4006 4012-4014 4023 4025 4040 4043 4060 
4078 4133 4192 4194 4201 4246 4269 4277 4282 4341 4362 
4381 4383 4385 4388 4390-4391 4396 4399 4402 4405 4410 
4434 4465 4473 4515 4543 4582 4610 4654 4689 4694 4715- 

d71 7 4777 47SQ A1GH All^ A1Q1 ARIA AWZH AQT2 A QHQ A OQn 

4940 4944 4957 4960 4962-4964 4972 4975 4981 4985 4998 
5001-5002 5018 5030 5037 5040 5045 5075-5076 5093 5108 
5135-5136 5138-5146 5148 5163 5191-5209 5213 5238-5245 
5262 5374 5380 5451 5455 5490 5503 5505 5521 5523 5527 
5560 5581 5674 5701 5724 5757 5778 5783 5788 5794 5817 
5874 5894-5895 5902 5906 5908-5909 5916-5917 5923 5932 
5938 5979 5989-5990 6005 6027 6048 6054 6057 6061 6078 
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Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 








6082 6086 6125 6140 6143-6144 6165 6171 6192 6194-6200 
6215 6235-6239 6241 6270 6288 6307 6333 6336 6354-6360 
6362-6364 6374 6385-6392 6410 6412 6415-6417 6458 6461 
6508 6555 6583 6595 6598 6604 6607 6624 6626 6629 6708 
6765 6767 6793 6797 6799 6801 6805 6825 6860 6870-6871 
6873 6886 6888 6900 6902-6904 6915-6917 6947 6969-6970 
6975 6980-6987 6998 7054 705871 04 71 09 711871337137 
7141 7155 7166 7169 7171 7191 7204 7281 7386 7408 7431 
7443 7468 7471-7472 7494 7507 7525-7528 7604-7609 7614 
7642 7687 7716 7729 7762-7763 7765-7767 7769-7771 7773- 
7775 7788 7802-7818 7829-7835 7838-7841 7864 7905 7937 
7977 8007 8035 8077 8088 8093 8095 81 10 8120 8134 8139- 
8140 8143 8156 8235 8246 8262 8292 8320 S336 8345 8355 
8358 8363 8365 8368 8370 8372 8375 8387 8444 8452 8456 
8486 8520 8593 8610 8633-8634 8653 8719-8724 8773-8777 
8779 8781-8782 8792 8798 8813 8830 8839 8853 8863 8870 
8876 8898 8928 8944 8950 8956-8961 8963-8971 8989 9004- 
9007 9009-90 1 9 9042-9046 9055 9076 9097 91 28 9 1 34 9 1 85 
9264 9283-9284 9291 9313 9358 9424 9445 9454-9455 9457 
9460 9466 947 J -9472 9475 9480 951 1 9517 9533 9539 9542 
9544 9553 9565 9577 9586 9612 9626 9677 9694 9703 9722 
9734 9757-9758 9773 9776-9778 9791 9812-9819 9827 9838 
9843 9848-9853 9922 9927-9940 9942 9962-9970 9989-9992 
9994-9997 10007 10027 10097 10149-10152 10249 10252 
10275 10298 10333-10334 10346 10349 10351 10383 10386 
10417 10438-10439 10452 10459 10470 10478-10480 10493- 
10498 10557-10562 10564-10565 10580-10586 10594-10596 
10645 10658 10676 10679 10767 10772 10774 10778 10782 
10788 10791 10954 1097] -10972 11002-1 1004 11032 11047- 
11052 11061 11069 11083 1 1 108 1 1 130 1 1 132 11151-11153 
11155 11157-11158 11167 11170 1 1 172-1 1 176 1 1 193-1 1 196 
11274 11343 11369 11406 11411 11431 11588 11596 11600 
11604 11606-11607 1 1618 1 1620-1 162 1 1 1629 11668 11814 
11860 11865 11876 1 1903-1 1909 1 1932 1 1949-1 1954 11980 
11985 12000 12006 12017 12022 12033 12039 12080 120S3 
12092-12093 12095-12104 12127-12129 12131-12142 12146 
12159-12160 12162-12163 12178 12213 12216 12236 12256 
12260 12305 12327 12363 12368 12371 12375 12379 12387 
12401-12402 12405 12424 12430 12467 12520 12522 12546 
12570 12576 12590 12594 12599 12605 12609 12611 12615 " 
12617-12618 12630 12636 12643 12657-12658 12663 12670 
12674 12685 12688 12691 12693 12702 12705-12707 12713 
12724 12729 12735 12740-12745 12749 12754-12760 12765- 
12770 12777 12842 12848 12875 12904 12906 12910 12913 
12916-12917 12935 12940 12955 12957 12963 12972 12977- 
12980 12982-12984 12992-12994 12997-13000 13020 13034 
13047 13051 13054 13056 13060 13062 13066 13070 13072- 
13073 13075 13082 13090 13092-13093 13104 13126 13136 
13141 13175-13177 13179-13180 13193-13194 13196-13197 
13202 13236-13241 13243 13245 13249 13254 13263-13275 
13277 13286-13292 13295 13319 13351 13377 13389-13391 
13394 13420 13436 13456 13477 13479 13488 13492 13494- 
13495 13497-13499 13502-13503 13506 13512 13516 13533 
13546 13549 13554-13555 13575 13590 13597 13600-13601 
13613 13616-13620 13627 13631 13644-13645 13649 13659- 
13660 13670 13674-13679 13693-13699 13707-13709 13713 
13782 13803 13864 13866 13868-13869 13872 13875 13882- 
13885 13888-13889 13892 


adult placenta 


Clontech 


APL001 


669-671 1006 1 134 1 184 1551 2053 2090 2156-2158 2342 
2490 2716 3001 3064-3065 3190 3272 3625 3670 4019 4055 
4194 4201 4246 4641 4718 4720-4721 4783-4786 4957 5523 
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Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 








5536 5581 5788 5895 6145-6146 6201-6202 6358 6803 7049 
7204 7529 8367 8375 8609 861 1 8630 8725-8727 9097 9480 
9734 9820 10319 10460 10579 10795 10895 11519 11723 
11737 11776 11827-11828 11959 12403 12546 12570 12663 
12848 32875 12913 13051 13394 13782 13869 13889 


placenta 


Invitrogen 


APL002 


142 561 1023 1551 1976 2449 2569 2614 3191 3340 3592 3668 
4060 4346 4767 4787-4789 4892 4985 5536 5674 5693 5757 
5772 5794 5906 6147 6203 6215 6598 6788 6799 6979 7158 
7530-7531 8139 8361 8611 8863 9457 9484 9517 9563 9703 
9758 9821 9927 9939 9942 10481 10557 10778 10802 10973 
11006-1 3011 11723 31731 11737 11776 3 1807 3 3 866 1 1910- 
11916 11959 12379 12403 12590 12685 13020 13054 13392 
13489 13533 13554-13555 13867 13882 13888 13891 


adult spleen 


GIBCO 


AS POO 1 


463 499 562-563 598 672 990-993 995 1004 3020 1346 1515- 
1516 1556 1774 1877 1914 1977 2159 2161 2215 2248 2257 
2363 2529 2569 2614 2617 2812 2827 3093 31923195 3199 
3272 3286 3350 3356 3434 3470 3503 3537 3544-3546 3548 
3604 3634 3689 3709 3720 3722 3796-3798 3812 3839 3885 
4006 401 1 4022 4025 4043 4194 4201 4246 4253 4277 4388 
4391 4396 4405 4434 4641 4689-4690 47S1 4957 4987 4998 
5001 5017 5030 5037 5052 5076 51 18 5512-5513 5523 5526 
5701 5740 5778 5788 5796 5801 5895 5938 6005 6064 6125 
6140 6147 6192 6204 6272 6329 6458 6551 6590 6607 6873 
6886 6931 6958 7018 7058 7109 71 18 7133 7171 7233 7506 
7532-7533 7561 7586 7733 7736 7807 7842 7845 7933 8065 
8085 8093 8095 81 10 81 16 8139-8140 8226 8235 8262 8323 
8359 8363 8365 8368 8372 8456 8535 8648 8702 8792 8816 
8838-8839 8858 8863 8865 8876 8923-8928 8943 8950-8951 
9 128 9257 9264 9317 9455 9457 9466 9474 9544 9560 9773 
9781 9919-9920 9927 9939 9942 10000 10048-10049 10155 
10175 10267 10275 10296 10331 10549 10566 10774 10777 
10828 10944 11032 11061 11108 11113 11132 11173 11272 
11401 11406 11620-11621 11804 11818 11861 11917 12006 
12039 12041 12073-12074 12078 12138 12127 12236 12327 
12375 12387 12402 12405 12424 12426 12511 12546 12560 
12570 12601 12617-12618 12670 12674 12734-12735 12749 
12772 12790 12913 12916-12917 12972 12977 13002 13021 
13066 13082 13084-13085 13169 13228-13229 13256 13260 
13295 13316-33317 13394 13456 13488-13489 13494 13497 
33532 13516 13546 13645 13649 13651 13659 13679 13739 
33776 13864 13866-13867 13888 


testis 


GIBCO 


ATS001 


50 142 500-502 564-565 598 673-678 963-968 1193 1346 1556 
1671 1826 1968 1977 2360-2163 2236-2237 2555 2590 2597 
3024 3093 3196-3202 3284 3378 3517-3522 3524 3720 3797 
3839 3885 4006 4025 4055 4109 4253 4354 4377 4388 4405 
4434 4473 4480 4543 4688 4792 4940 4957 4962 4964 4975 
5001 5018 5030 5037 5067-5072 5076 5118 5171 5481 5525 
5535 5740 5757 5788 5907 5924 6125 6147 6257 63 16-6322 
6324 6607 6886 6952 7058 7141 7288 7445 7484 7534-7540 
7677 7710-771 1 7713-7717 7735 7792 7815 8087 8110 8120 
8262 8368 8370 8375 8535 8577 8648 8655 8702 8729 8838- 
8839 8848 8898-8906 8936 8939 8966 9076 9087 9244 9264 
9304 9455 9466 9485 9542 9567 9703 9758 9822-9825 9908- 
9910 9928 10462 10475 10478 10482 10551 10583 10679 
10772 10778 10795 10955 10974 1 1 108 1 1 1 13 11116 11118- 
31120 11132 11173 11239 11425 13606 1 1620 1 1629 1 1 695 
13 807 11862 11918-11919 12000 12006 12017 12033 12057- 
12061 12375 12405 12424 12570 12577 12599 12601 12605 
12609 12668 12692 12730-12731 12748 12904 12917 12923 
12929 12935 12940 12956 12971 13047 13051 13073 13090 
13169 13222-13225 13249 13394 13442 13456 13479 13494 
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Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 








13497 13502-13503 13506 13512 13515 13533 13549 13555 
13575 13613 13616 13627 13644-13646 13660 13866 13875 
13882 13889 


Genomic 
DNA from 
BAC 63118 


Research 

Genetics 

(CITB 

BAC 

Library) 


BAC001 


8711 


adult bladder 


Invitrogen 


BLD001 


731 1710 1779-17912076 2367 2479-2481 4067 4208-4214 
4681 4767 4775 5163 5169 5553 5560 5581 5674 5684-5688 
5794 6082 6244 6716 6718-6721 6799 6931 7045 7094-7097 
7967 8110 8226 8249-8258 8364 8648 8737 9012 9097 9425- 
9428 9430 9626 9703 9928 10007 10261 10557 10734-10735 
11459 11504-11506 12000 12212 12512-12513 12515-12517 
12670 12735 12914 13085 13439 13512 13687 13838-13841 
13866 


bone marrow 


Clontech 


BMD001 


11 70 83 85 142 150 162-184 186-198 200-210 230-243 245- 
277 279-281 306 370 373-387 389 433-435 438-440457 483 
503-510 566 568-569 595-596 598 609 616 679-683 740 770 
843-857 859 861 864 867-876 880-884 886-890 892-893 896 
900 902-920 1000 1 004 1 1 16-1 120 1 122-1 126 1 128-1 134 1 1 84 
1211 1346 1359 1516 1552-1554 1575 1583 1671 1724 1774 
1877 1894 1927 1974 1976 1983 2012-2024 2031 2033-2038 
2040 2043-2044 2084-2088 2111 2120-2121 2132-2133 2137- 
21382161 2163-2 1 64 2 1 86 2 1 89 2 1 99-2200 2202-2203 2205- 
2210 2213-2218 2266 2291-2295 2342-2343 2414-2416 2444 
2529 2555 2566 2569 2575-2576 2591 2597 2652 2681-2709 
271 1-2716 2735-2738 2740-2744 2746-2748 2750-2756 2758- 
2759 2761-2764 2766-2768 2770 2772-2781 2783-2787 2806 
2812-2813 28162873 2875 2880-2881 2883-2885 2887-2897 
2899 2901 2926 2948 2954 2958 2977 2984 3026-3029 3032 
3068-3069 3071 3081 3093 3096-3097 3 139 3203-3205 3207 
3238 3257 3272 3282-3283 3286 3317 3345 3356 3404-341 8 
3422-3434 3437-3438 3440 3442 3447 3449-3450 3456-3459 
3461-3464 3466-3473 3483 3497 3535 3538 3548 3557 3572 
3588 3593 3600 3604-3605 3610 3612 3625-3627 3634 3644- 
3647 3649-3651 3653-3657 3709 371 1 3722 3725 3727 3737 
3797 3804 3808 3810 3839 3899 4005-4006 4011 4015-4019 
4023 4025 4040 4043 4058 4060 4129 4132 4192 4194 4201 
4246 4253 4277 4282 4367 4403 4466-4472 4474-4477 4479- 
4484 4486-4490 4492-4497 4509 45 1 2-4540 4582 4595 4597- 
4606 4608-4615 4622 4642 4648 4650 4654 4667 4691-4695 
4723 4741-4742 4759 4767 4783 4794-4796 4808 4836 4928- 
4929 493 1-4934 4938 4940 4942 4944-4948 4950-4953 4955- 
4960 4962-4964 4967-4971 4973 4976-4979 4985-4987 4992- 
4998 5000 5004-5016 5030 5037 5052 5075-5076 5093 5143 
5149-5151 5163 5169 5210-5215 5217-5226 5262 5317 5357 
5503 5523 5535 5537-5540 5560 5604 5695 5740 5748 5766 
5788 5796 5801 5862 5874 5895 5906 5908 5938 5965-5967 
5969-5979 5991-5994 5996-6005 6007-6009 6027 6047 6057 
6065 6067-6068 6071-6072 6082 6086 6089-6091 6112 6125- 
6127 6148-6152 6195 6214 6233 6257 6269-6276 6278-6280 
6282-6283 6285-6289 6292-6300 6321 6374 6393-6403 6508 
6555 6605 6607 6722 6730 6788 6815-6822 6826-6832 6836 
6859-6861 6867 6870-6871 6873-6874 6905 6915 6929 6933- 
6937 6940-6942 6944-6945 6988-6992 7051 7126 7155 7166 
7169 7199-7210 7212-7219 7226 7233-7253 7256-7264 7275 
7317-7318 7320 7322-7326 7347 7349 7373-7374 7379 7382 
7446-7448 7473-7474 7491 7502 7542 7544 7569 7577 7579 
7663-7671 7674 7677-7678 7680-7684 7686 7689-76967742 
7768 7791 7815 7819-7824 7847 7864 7930 7995 8065 8085 
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Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 








8088 8090-8093 8095 81 10 8 J 20 8134 8139 8224 8226 8235 
8241 8246 8262 8336 8354 8359 8363 8365 8368 8375 8421- 
8427 8429-8440 8444 8452 8456-8460 8463-8465 8467-8476 
8478-8489 8538-8539 8542-8551 8563 8566 8577 8593-8595 
8597 8603 8633 8635-8637 8648 8656-8657 8671 8677 8679 
8701 8792 8796 8817 8S30 8S40-8845 8847-8849 8852 8855- 
8860 8863-8868 8870 8873 8875 8879-S881 8898 8935 8944 
8963 8965 8995 9010 9012 9020-9031 9049 9051 9066 9076 
9092 9097-9098 9128 9257 9264 9285-9288 9304 9317 9339 
9381 9399 9401 9454-9455 9466 9471-9472 9474-9476 9516- 
9517 9519 9529 9544 9622 9630-9643 9646 9650-9665 9689 
9705-9712 9730 9734 9740 9758 9779-9781 9826 9885-9889 
9892-9902 9928-9929 9939 9942 9972-9976 9978-9982 9999- 
10000 10068 10153 101 75 10249 10275 10277 10284 10287 
10296 10319 10321 10343-10344 10346 10364-10374 10379- 
10382 10385-10394 10414 10416 10426 10429 10439 10447 
10452 10467-10468 10475 10483 10520 10522-10526 10528- 
10529 10531-10535 10587-10590 10595 10616 10677-10679 
10691 10750 10760 10767 10772 10778 10788 10794-10795 
10828 10831-10837 10846-10847 10849-10851 10853-10858 
10891 10895 10897 10899-10903 10905-10907 10931-10933 
10956-10959 10975-10976 1 1012 11032 1 1046 11057 1 1059 
11061 11077-11082 11085-11086 11088-11098 11108 11113 
11132 11145 11177-11181 11194 11208 1 1289 11344 11401- 
11402 11513 11618 11620 11631 11673-11675 11677-11690 
11695 11697-11704 11706 11708-11715 1 1736 1 1742 1 1 762- 
11772 11774 11795-11796 11802-11804 11817-11818 11829 
11839-11842 11863 11920 11946 11992-12002 12004-12008 
12010 12012-12013 12015-12019 12022-12029 12033 12041 
12081 12142-12149 12160 12178 12195 12200 12231 12236 
12283 12305 12308 12316 12327 12363 12368 12404-12405 
12424 12426 12430 12439 12527 12546 12570 12576 12590 
12608 12610 12616 12634-12636 12641-12645 12654-12655 
12657-12659 12669-12670 12674 12721-12724 12754 12761 
12834 12840 12842 12904-12905 12910-12911 12913 12916- 
12917 12922-12923 12925-12928 12933 12938 12956 12966 
12977 12989-1299012993 12995 13020 13033 13047 13060 
13062 13064 13066 13095 13098 13103-13105 13107 13111- 
13113 13129 13135 13137 13141 13150 13163 13173 13181 
13213-13217 13234 13276-13278 13280 13391 13394 13400 
13456 13472 13477 13490 13492 13494 13498 13501-13502 
13506-13507 1351 1 13516 13528 13530-13534 13538 13540 
13546 13551-13552 13554-13555 13560 13585-13586 13594 
13613 13630 13633-13639 13644-13646 13649 13659-13660 
13670 13673 13679 13713 13775 13795 13866 13872 13875 
13882-13885 13889 13891 13893 


bone marrow 


Clontech 


BMD002 


51 242 442 654 1004 1134 1841-1904 1908 1927 2023 2107 
221 5 2342 2408 2507-2529 2576 2597 2806 2866 3286 3434 
3722 3736-3737 3817 3823 3839 4060 4246 4258-4290 4389 
4396 4411 4618 4641 4828 4836 4957 4987 5030 5037 5052 
5108 5163 5526 571 1 5743-5772 5774-5804 5813 5895 5939 
6131 6236 6266 62S7 6306-6307 6333 6655 6675 6728 6730 
6750-6762 6764-6769 6788 7049 7106-71 17 7137 7156 7738 
8068 8085 8134 8156 8290-8297 8299-8305 8307-8312 8323 
8368 8482 8499 8858 8863 8865 8928 8935 9076 91 1 1 9128 
9465-9502 9626 9700 9703 9923 9928-9929 9942 10277-10280 
10282-10284 10286-10297 10396 10434 10515 10551 10645 
30675 10691 10744-10753 10772 10778 11057 11098 11108 
11132 11232 11252 1 1519 1 1536-1 1553 11606 11620 12033 
12039 12146 12260 12305 12387 12402 12405 12500 12554- 
12578 12594 12599 12608 12674 12754 12777 12839 12895- | 
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12904 12916 12923 13020-13021 13043 13057-13064 13169 
13249 13446-13454 13479 13492 13494 13513 13533 13555 
13659 13713 13775 13803 13863-13875 13877-13887 13889 
13891 


bone marrow 


Clontech 


BMD004 


2249 2529 3286 3494 3548 355 1 3797 3839 4025 4058 4201 
4277 4282 5052 5108 6545 6961 8262 8898 9474 10000 1 1098 
11818 13021 13893 


bone marrow 


Clontech 


BMD007 


8539 9780 9927 13021 


adult colon 


Invitrogen 


CLN001 


319346 487 731 799 1792 1848 20502161 2449 2482-2483 
3431 390 1 42 1 5-42 1 7 4940 4957 4987 5 163 5239 5560 5689- 
5695 5865 591 1 5923 6722 6765 7098 7815 7864 7880 8110 
8259-8262 8486 8597 8951 9484 9529 9542 9556 10376 1 1507- 
11508 11617 11869 12127 12236 12424 12518-12523 12601 
12610 12777 12976 13062 13073 13367 13440 13507 13512 
13630 13713 13843-13844 13864 13868-13869 13888 


Mixture of 16 
tissues — 
mRNAs* 


Various 
Vendors* 


CTL016 


6815 10776 12977 13064 13512 


Mixmre of 16 
tissues — 
mRNAs* 


Various 
Vendors* 


CTL021 


1671 6738 8432 8648 8863 8944 9511 10769 13021 13062 
13064 


adult cervix 


BioChain 


CVX001 


50 67 142 158 308 332 346 475 59S 654 895 1004 1086 1286 
1449 1516 1671 1698 1701 1711-1756 1758-1776 3 828 1848 
1959 2134 2186 2257 2267 2343 2408 2414 2468-2474 2476- 
2478 2608 27 1 6 3002 3 1 36 3 1 66 3 1 9 1 3 1 99 3529 353 5 3554 
3572 3627 3722 3737 3777 3797 3839 3985 4158-41764178- 
4195 4197 4199-4207 4246 4277 4391 4396 4434 4641 4667 
4759 4783 4828 4885 4940 4957 4963 4987 4998 5001 5038 
5075 5108 5163 5293-5294 5455 5481 5523 5552 5581 5646- 
5652 5654-5659 5661-5671 5673-5681 5687 5701 5711 5723 
5740 5788 5794 5848 5902 5908 5923-5924 5964 6020 6052 
6057 6062 6091 6106 6112 6125 6129 6181 6350 6371 6374 
6410 6446 6458 6504 6508 6512 6551 6598 6686-6687 6689- 
6705 6707-6715 6788 6873 6893 6917 6998 7008 7045 7078 
7084-7093 7095 7130 7141 7148 7169 7204 7507 7579 7608 
7675 7733 7768 7815 7871 7880 7893 8078 8138 8209-8215 
8217-8236 8238-8242 8244-8248 8298 8345 8370 8444 8456 
8486 8499 8535 8558 8592 8633 8635 8648 8669 8679 8742 
8853 8863 8870 8898 8921 8939 8948 9012 9061 9098 9107 
9128 9137 9153 9304 9308 93 17-93 18 9355 9385-9391 9393- 
9403 9405-9406 9408-9418 9420-9422 9457 9466 9475 9510 
9539 9612 9734 9773 9927-9928 9939 9947 9960 101 10 10175 
10230-10256 10258-10259 10267 10274 10319 10329 10344 
10491 10496 10540 10616 10660 10691 10722-10732 10778 
10782 11055 11145 11217 11376 1 1462 1 1477-3 1489 1 1491- 
11503 11519 11584 11604 11695 11853 11869 11891 11980 
12006 12066 12081 12127 12160 12195 12216 12240 12266 
12308 12363 12379 12402 12405 12424 12438 12483-12494 
12496-12510 12579 12605 12610-12611 12617-12618 12643 
12653 12670 12674 12688 12691 12703 12707 12735 12740 
12754 12830 12840 1 2866 1 2870-12881 12883-12884 12905 



* The 16 tissue-mRNAs and their vendor source, are as follows: 1) Normal adult brain mRNA (Invitrogen), 2) 
normal adult kidney mRNA (Invitrogen), 3) normal adult liver mRNA (Invitrogen), 4) normal fetal brain mRNA 
(Invitrogen), 5) normal fetal kidney mRNA (Invitrogen), 6) normal fetal liver mRNA (Invitrogen), 7) normal fetal 
skin mRNA (Invitrogen), 8) human adrenal gland mRNA (Clontech), 9) human bone marrow mRNA (Clontech), 
10) human leukemia lymphoblastic mRNA (Clontech), 11) human thymus mRNA (Clontech), 12) human lymph> 
node mRNA (Clontech), 13) human spinal cord mRNA (Clontech), 14) human thyroid mRNA (Clontech), 15) 
human esophagus mRNA (BioChain), 16) human conceptional umbilical cord mRNA (BioChain). 
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12913-12914 12917 12923 12951 12955 12957 12965 12989- 
12990 12993 13002 13020 13035 13045-13050 13062 13064 
13072 13092 13136 13141 13174 13197 13254 13283 13307 
13391 13428 13430-13437 13442 13473 13479 13492 13494- 
13495 13497-13498 13502 13532 13554-13555 13575 13590 
13597 13613 13616 13627 13644 13679 13713 13775 13S29- 
13837 13866 13868-13869 13872 13884 13888 13891 


diaphragm 


BioChain 


DIA002 


731 1346 3548 371 1 3885 4282 4654 5895 6873 8120 8931 
8936 9455 1] 132 ] 1818 12405 12609 


endothelial 
cells 


Strategene 


EDT001 


21 51 67 83 332 569 598 609 762 796 1004 1024-1026 1086 
1561 1848 1928 1959 1976-1977 1983 2138 2161 2366 2257 
2282-2283 2417 2483 2490 2555 2569 2614 2926 3042 3 1 89 
3191 3272 3300-3303 3426 3494 3503 3548 3574-3576 3605 
3627-3628 3673 3709 3720 3722 3737 3797 3839 3885 4005 
40114019 4055 4 1 33 4 1 92 4246 4269 4282 4340 43 54 4365 
4384 4388 4399 4405 4410-441 1 4434 4543 4641 4654 4767 
4797-4799 4802 488 1 -4882 4885-4886 4888 4940 4957 4964 
4972 4985 4998 5002 5017 5030 5076 51O3-5104 5132-5133 
5163 5167 5183-5185 5187-5190 5380 5523 5527 5535 5541- 
5542 5544 5674 5684 5693 5724 5766 5778 5788 5794 5796 
5874 5895 5916 5923 5928 5938 6005 6048 6057 6068 6082 
6165 6205 6215 6240-6241 6307 6321-6322 6349-6353 6383 
6458 6595 6598 6606-6607 6765 6799 6805 6815 6860 6871 
6873 6890 6918 6968 6972 6976 6979-6980 6998 7058 7067 
7104 71 13 7116 7137 7139 7169 7275 7468 7613 7716 7755- 
7757 7759-7760 7797 7799-7801 7930 8077 8084 8093-8095 
8120 8139 8235 8262 8320 8323 8335-8336 8345 8354 8358 
8363-8364 8370 8372 8375 8387 8452 8592 8648 8786 8788 
8792 8813 8863 8898 8944-8955 8965 8996-9001 9051 9076 
9097 9128 9264 9289-9291 9304 9315 9414 9455-9456 9466 
9472-9473 9475 9484 9504 9517 9529 9542 9563 9570 9626 
9703 9780-9781 9843 9927 9939 9961 10000 10027 10154 
10267 10285 10321 10330-10331 10342 10344 10349 10496 
10500 10550 10556-10557 10579 10679 10772 10776 10778 
10788 10795 10802 11013 11132 1 1 136 1 1 146-1 1 149 11406 
11483 11565 11588 11600 11606-1 1607 3 1615 11626 11807 
11818 11932 11955 12006 12034 12037 12041 12044 12078 
12092-12094 12123-12126 12150 12213 12375 12381 12387 
12405 12411 12424 12426 12522 12570 12576 12590 12601 
12610 12612 12615 12617-12618 12663 12670 12674 12707 
12729 12739 12749 12753-12754 12777 12830 12842 12913- 
12914 12916-12918 12929 12940 12972 12975-12977 13002 
13024 13047 13051 13054 13062 13064 13082-13084 13090 
13092 13094 13123 13126 13136 13195 13235 13263 13380 
13389 13392 13394 13400 13456 13479 13488-13489 13492 
13494-13499 13502 13506-13507 13514 13516 13546 13555 
13568 13575 13590 13592 13613 13616 13621 13630 13649 
13659-13660 13667-13668 13670-13673 13678-13679 13690- 
13692 13713 13796-13797 13838 13866-13869 13872 13882 
13884 13888 13893 


Genomic 
clones from 
the short arm 
of 

chromosome 
8 


Genomic 

DNA 

from 

Genetic 

Research 


EPM001 


150 2023 2327 2490 4109 4783 5503 5560 10267 10760 12017 
12160 12557 12582 12923 13020 33514 


Genomic 
clones from 
the short arm 
Of 

chromosome 


Genomic 

DNA 

from 

Genetic 

Research 


EPM003 


5560 12017 12146 
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8 








Genomic 
clones from 
the short arm 
of 

chromosome 
8 


Genomic 

DNA 

from 

Genetic 

Research 


EPM004 


4783 4798 5560 10817 11926 12017 12160 


esophagus 


BioChain 


ESO002 


999-1000 2449 3272 33 15 3548 3550 3634 3697 3796 4011 
4025 4058 420 1 4282 5106 5163 5553 6082 6873 7739 9304 
10296 11133 11818 12033 12570 13869 


fetal brain 


Clontech 


FBR001 


51 142 1184 3664 4060 4109 4940 5021 5270 5523 5553 6112 
6805 6908 7294 8558 9457 10376 1 1059 1 1985 12006 12122 
12160 12754 13438 13507 13888 


fetal brain 


Clontech 


FBR004 


60 2704 3711 4025 4109 4783 5001 6082 7597 9010 9504 
9949 11837 12033 12039 12363 12705 12905 13020 13503 
13512 13891 


fetal brain 


Clontech 


FBR006 


6 60 67 598 800 932 1004 1 170 1793-1794 1796-1797 1799- 
1805 1905-1914 1916-1958 1974 1976 1979 1983 2057 2129 
2 1 74 222 1 2407 2444 2449 2484-2492 2530-2554 2556-256 1 
2563 2576 2857 3064 3207 3479 3556 3673 3709 3722 4060 
4078 4357 4218-4221 4223-4224 4277 4291-4334 4338 4355 
4364 4369 4431 4957 5001 5109 5270 5380 5553 5634 5696- 
5706 571 1 5724 5766 5788 5794 5801 5805-5832 5834-5879 
5882-5901 5936 5990 6057 6723-6732 6765 6770-6791 6797 
6805 6894 7049-7050 7100-7102 7105 71 18-7123 7125 7127 
7169 7905 8263-8265 8267-8273 8294 8312-8333 8359 836! 
8375 8452 8633 8664 8740 8757 8884 9010 9111 9432-9436 
9503-9516 9518-9545 9547-9551 9556 9570 9577 9780 9895 
9923-9924 9928 9942 10007 10027 10202 10263-10268 10276 
10284 10298-10310 10329 10331 10496 10542 10595 10621 
10736-10737 10755-10761 10772 10774 10795 1110S 11132 
11406 11483 11509-11523 11555-11582 11589-11590 11600 
3 1606 11621 11713 11729 11807 11S37 12006 12039 12044 
12092 12113 12218 12231 12236 12327 12363 12398 12405 
12465 12511 12524-12530 12576-12577 12579-12601 12729 
12735 12754 12863 12869 12889 12906-12910 12914 12954 
12973 13020-13021 13051-13052 13054 13065 13082-13083 
13427 13445 13455-13470 13488 13490 13496 13498-13501 
13507 13516 13560 13613 13630 13649 13708 13713 13769 
13831 13845-13855 13868 13872 13882 13884 I3888-I3894 
13896-13900 


fetal brain 


Clontech 


FBRs03 


1005 4405 5111 6337 6964 7742 13084 13864 13891 


fetal brain 


Invitrogen 


FBT002 


51 83 142 321 430 746 932 1054 1058-1065 1493 1833 1947 
2273-2275 2299 2444 24492926 3479 3492 3885 4347 4354 
4391 4405 4410 4434 4530 4804 4985 4998 5075 5160-5169 
5380 5428 5466 5750 5788 5801 5895 6132 621 5 6371-6374 
6458 6598 6973-6974 7067 7096 7776-777S 7780-7782 7937 
8143 8323 8361 8364 8372 8377 8452 8633 8977-8984 8986 
9010 9142 9264 9332 9457 9474 9503 951 1 9517 9539 9582 
9827 9848 9927 9950-9953 10027 10161 10329 10430 10492 
10573-10575 11014 11160-1 1 164 1 1406 11628 11742 11814 
11830 11985 12092 12112-12114 12116-12117 12127 12424 
12511 12521 12570 12576 12643 12696 12735 12748-12751 
12754 12830 12835 12913 12957 12977 12990 13002 13020 
13062 13072 13083 13117 13254-13259 13377 13486 13489 
13496 13499 13507 13590 13649 13685-13688 13713 13867 
13888 13891 13893 


fetal heart 


Invitrogen 


FHR001 


1001 1004 2250 4025 6334 6765 7740 8933 8935 9457 9544 
10000 11132 12599 12609 13021 13568 13656 13866 


fetal kidney 


Clontech 


FKD001 


1 42 346 364 5 1 1 -5 1 7 570-572 574 598 685-690 969-970 972- 
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981 983-987 1134 13462123 2167-2169 2238-22462342 2444 
2483 25 1 6 2555 2617 2728 2843 2876 3032 3049 3072 3206 
3208 3282-3283 3525-353 1 3533-3543 3548 3591 3709 3722 
3797 3839 3878 401 5 401 9 4043 4246 4277 4367 4405 4696- 
4698 4725 4767 4805-4810 4940 4947 4957 4986 4998 5037 
5056 5073-5080 5082-5091 5099-5100 5108 5258 5504 5523 
5560 5923 6005 6207-6208 6225 6272 6288 6325-6332 6478 
6603 6702 6793 6815 6906 6953-6959 7045 7058 7204 7355 
7426 7449-7450 7520 7543-7546 7561 7587 7718-7732 7930 
8077 8097 8262 8375 8387 8452 8520 8638 8658 8736-8737 
8834 8863 8898 8907-8918 8922 8950 9010 9134 9257 9401 
9457 9544 9597 9760 9781 9791 9828-9830 9912 9914-9918 
10296 10440 10484 10546-10548 10772 1 1 108 1 1 121-1 1 129 
11131-11132 11170 11513 11638 11695 11923-11924 12006 
12033 12062-12070 12072 12160 12405 12522 12570 12594 
12599 12605 12626 12663 12670 12732-12733 12749 12848 
12904 129J4 12940-12941 12990 13020 13083 13 3 88 13226- 
13227 13234 13263 13277 13280 13351 13391 13394 13491 
13501 13512 13590 13644 13647-13650 13713 13782 13867- 
13868 13872 13875 


fetal kidney • 


Clontech 


FKD002 


3286 5030 5037 5105 1 1 108 12033 12490 12570 13494 13866 


fetal kidney 


Invitrogen 


FKD007 


3272 3806 4025 4253 4277 4654 51 12 5535 5788 5801 8863 
8935 9401 9466 10553 11628 11818 13494 13646 13866 


fetal lung 


Clontech 


FLG001 


79 2367 2395 3010 3460 3885 4828 4948 4962 5001 5723 5748 
5902 5908 6186 6738 7051 7067 7677 7759 9264 9553 9700 
10007 10478 11098 12017 123S3 12417 12424 12749 12917 
13020 13169 13472 13554 13644 13782 13835 


fetal lung 


Invitrogen 


FLG003 


142 319 364 629 1671 1806-1814 1816-1819 1877 2129 2161 
2169 2367 2449 2493 2529 3191 3503 3610 4109 4225-4234 
4367 4434 4957 5108 5380 5421 5581 5707-5710 5712 5714 
5788 5801 6057 6733-6741 7034 7103 8274-8278 8365 8597 
8948 9264 9327 9437-9442 9444 9466 9510 9525 9530 9539 
9677 9773 9841 10007 10190 10198 10269-10271 10329 11519 
11524-11527 11927 12531-12539 12848 12890 12904 13021 
13072 13249 13445 13472 13489 13551 13575 13649 13670 
13679 13856-13857 


fetal lung 


Clontech 


FLG004 


1003-1004 2597 5110 6963 9924 10552 1 1 138-1 1139 12080 
12990 13659 


fetal liver- 
spleen 


Columbia 
University 


FLS001 


.2 4-14 16-22 24 26 28-31 33-46 48-49 51-61 63-68 71-91 93- 
102 104-110 112-124 126-156 158-162 282-283 285-290 292- 
299 301-304 307-312 314-326 328-338 340-344 346-353 355- 
365 369 390-400 402 436 441 483 557 567 575-585 595 598 
629 673 678 691-699 701-702 708 731 736 763-767 769-776 
778-786 788-791 793-794 796 925 975 1004 1015 1023 103S 
1068 1104 1134 1144 1184 1192 1216 1264 1298 1346 14S2 
1493 1516 1518-1521 3551 3556 1575 1583 1594 1636 1641 
1707 1724 1774 1826-1829 1841 1858 1927 3959 1962-1965 
1967-1972 1974-1979 1981-1998 2000-2009 2011 2045-2051 
2053-2055 2057-2058 2060-2063 2065-2083 2089-2094 2100- 
2101 2161 21702174 2184-2194 2215 2222 2269 2290 2310 
2342 2409 24 1 1 24 1 4 2444 2449 2458 2483 2490 2497-2498 
2510 2516 2523 2529 2555 2562 2566-25762578-2586 2588- 
2591 2593-2601 2604 2607-2608 2611-2612 2614-2618 2620 
2622-2642 2644 2646-2653 2655-2664 2666 2668 2670-2680 
2696 2750 2788-2793 2795-281 1 2814-2826 2828-2835 2837- 
2842 2844 2846-2848 2851-2858 2860 2862-2871 2876 2878 
2893 2900-2905 2907-2909 2926 2929 2949 2952-2953 2959- 
2960 2984 2992 3032 3058 3069 3073-3076 3078-3080 3082 
3093 3166 3194 3196 3207 3210-3211 3213-3214 3217-3225 
3249 3257 3272 3282 3286-3287 3304-3307 3310-331 1 3314- 
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Name 


SEQ ID NOS: 








3317 33 1 9 3321-3332 3334-3346 3356 3419 3426 3434 3446 
3450 3455 3468 3470 3479 3484 3494 3503 3535 3537 3548 
3552 3557 3572 3590 3594 3597 3604-3605 3610 3612 3625 
3627 3634 3668 3670 3709 371 1 3720 3722 3729 3737 3777 
3797 3806 3808 3810 3813 3839 3885 3926 3990 3996-3997 
4006 4009 401 1 4019-4022 4025 4040 4043 4060 4078 4095 
4109 4129 4192 4194 4201 4245-4246 4253 4261 4277 4282 
4297 4335-4338 4340-4357 4359-4361 4364 4366-4367 4370- 
4372 4374-4377 4379-4382 4384-4386 4388-4392 4395-4396 
4398-4414 441 7-4424 4426-4455 4457-4458 4460-4465 4542- 
4543 4545 4547-4553 4555-4562 4565-4575 4577 4579 4581- 
4582 4585-458 8 4593 4596 4607 4616-4629 4644^1645 4647 
4654 4671 4676 4687 4689 4694 4721 4726-4729 4759 4767 
4775 4783-4784 4788 4790-4791 481 1-4819 4830 4837 4845 
4862 4874 4889-4902 4930 4940 4948-4949 4957 4962-4965 
4972 4985 4998 5022 5029-5030 5037 5040 5075-5076 5092 
5108 51325152 5163 5167 5171-5172 5335 5380 5398 5473 
5503 5514 5516 5523 5526-5527 5535-5536 5553 5581 5598 
5604 5616 5674 5684 5691 5693 5711 5715 5724 5726 5748 
5750 5757 5778 5788 5794 5801 5817 5865 5874 5894-5895 
5902-59 14 5 91 6-59 1 9 592 1 -592 8 5930-5949 595 1 -5954 5956- 
5964 5966 5979 6005 6010 6013-6015 6017-6018 6020-6022 
6024-6025 6028-6032 6034-6048 6052-6062 6064 6068 6073- 
608 1 6086 6092-6094 61 04 61 12 6125 6135 6140 6147 6149 
6153-6157616561866195 6209-621 1 6240-6241 6243-6248 
6250-6253 6272 6276 6287-6288 6307 6313 6338 6371 6374 
6419 6430 6446 6451 6458 6478 6496 6508 6545 6579 6595 
6598 6607 661 1 6624 6629 6642 6658 6695 6726 6728 6730 
6738 6745 6754 6765 6767 6788 6793-6797 6799-6808 6810- 
6815 6833-6843 6845-6847 6850-6856 6858 6860 6862-6865 
6870-6871 6875 6877 6891 6893-6894 6900 6907-6908 6917 
6919-6925 6950 6979 7049 7058 7067 7077 7096 7109 71 16 
7118 7128-7131 7133-7135 7137-7141 7144-7147 7149-7150 
7152-7153 7155-7179 7181-7192 7194-7198 7265-7271 7273- 
7280 7282-7305 7307-7312 73 1 6 7327-7336 7375-7376 7383 
7386 7451 7455 7468 7473 7475-7477 7479 7484 7548-7553 
7555-7558 7561 7608 7617-7618 7620-7631 7642 7675 7687 
7695 7716 7768 7809 781 1 7839 7842 7864 7883 7933 7977 
8069 8093 8095 8105 8110 8116 8139 8224 8226 8235 8241 
8262 8323 8334-8340 8342-8350 8352-8355 8357-8359 8361- 
8408 8410-841 1 8414-8419 8452 8456 8490-8498 8500-8502 
8504-8526 8529-8530 8537 8540-8541 8553-8560 8563 8566 
8568 8577 8592 861 1 8639 8648 8659-8661 8664 8668-8669 
8710 8738-8739 8741 8777 8792-8804 8830 8834 8839 8853 
8858 8863 8865 8876 8898 8926 8935 8939 8950-8951 8957 
8963 9010 9012 9076 9092 9097 91 12 91 19 9128 9257 9264 
9266 9284 9291 9301 9304 9313 9318 9352 9375 9399 9414 
9424 9445 9455 9457 9466 9473-9474 9476 9480 9484-9485 
9511 9517 9528-9529 9536 9539 9542 9544 9552-9557 9559 
9561-9564 9566-9572 9574-9577 9579-9630 9646 9655 9666- 
9667 9669-9671 9673 9675-9697 9700-9701 9703-9704 9713- 
9715 9718-9719 9722 9726-9729 9734 9752 9758 9761 9777 
9780-9784 9813 9831 9854-9867 9869 9906 9927 9929 9939 
9942-9943 9969 9983 10000 10007 10205 10267 10284 10296 
10305 10311 10313-10316 10318-10325 10327 10329-10331 
10333-10354 10356-10363 10395-10397 10400-10403 10405- 
10410 10413 10417-10422 10427 10446-10447 10463 10470 
10485 10496 10502-10507 10515 10542 10557 10565-10566 
10645 10679 10688 10691 10729 10750 10760 10762 10764- 
10769 10772-10774 10776-10782 10784 10786-10795 10797- 
10815 10817-10830 10859 10861-10869 10871-10879 10881- | 
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Name 
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10893 1 0908- 1 09 1 6 10929 10934 10943 10948 10968 10971 
10975 10977-10983 11005 11009 11011 11015-11018 11 028 
11031 11054-11063 ] 1094 11 108 1 1 1 14 1 1 132 11136 11142 
11194 11208 11213 11246 11390-11391 11401 11406 11411 
11414 11459 11462 11483 11508 11513 11521 11534 11559- 
11560 11565 11573 11585-11603 1 1606-1 1610 1 1612-1 1616 
1 1618-1 1638 1 1640 1 1642-1 1657 1 1659-1 1667 1 1669-1 1670 
11672 11699 11716-11718 11720-11722 11724-11741 11743- 
11758 11760-11761 1 1775-1 1787 1 1797 11799-11801 11818 
11830 11862 11864-11866 11869 1 1876 11 889 1191 1 11925- 
11928 11958-11961 1 1963-11976 1 1979-1 1980 12000 12006 
12017 12019 12028 12033-12034 12041 12078 12086 12092 
12122 12127 12159-12160 12178 12216 12218 12236 12245 
12256 12260 12289 12303 12305 12308 12327 12330 12363 
12375 12382 12387 12402-12405 12424 12426 12439 12442 
12465 12470 12476 1251 1 12522-12523 12546 12551 12560 
12570 12576 12590 12601-12602 12604-12619 12621-12633 
12636 12639 12646-12653 12670-12671 12674 12682-12683 
12688 12693 12696 12706 12709-12715 12735 12749-12750 
12772 12790 12823 12830-12832 12835 12842 12849 12855 
12863 12904-12905 12910-12911 12913-12916 12918-12924 
12929 12931-12932 12934-12937 12939-12940 12954-12955 
12965 12973 12976-12977 12989-12990 12993 12999 13020- 
13021 13035 13051 13054-13055 13060 13062 13066-13067 
13069-13087 13089-13095 13097-13102 13114-13117 13119- 
13120 13122-33126 13334-13136 13141 13147 13156 13174 
13193 13395-13196 1319S-I3203 13254 13260 13264 13277 
13280 13295 13351 13368 13377 13391 13394 13400 13409 
13412 33420 33456 13472-13477 13479-134S4 13486-13492 
13494-13496 13498-13499 13501-13510 13512-13516 13518- 
13527 13533 13541-13544 13546-13551 13553-13555 13560 
13575 13587-13589 13597 13603-13604 13613 13616 13622- 
13623 13630-13631 13644-13646 13649 13659 13670 13679 
13713 13743 13748 13769 13775 13782 13793 13803 13808 
13818-13819 13858-13860 13864 13866-13869 13872 13882 
13884 13888-13889 13891 13893 13901 


fetal liver- 
spleen 


Columbia 
University 


FLS002 


6 16 24 30-31 63 67 81 83 89 95 103 115 117 126 140 142 147 
150-151 158 162211 225 287 308 332 356 358 390-391 438- 
439 483 551 556 641 654 694 701 708 731 788 997 1006 1012 
1047 1082 1151 1154 1178 1184 1208 1212 1356 1480 1507 
1551 1556 1623-1647 1649-1667 1669-1671 3675 1677-1698 
1828-1829 1858 1877 1885 1889 1914 1927 1947 1961 1997 
1999 2028 2057 2070 2092 2098 2138-2139 2154 2161 2174- 
2 1 75 2 1 89 2 1 9 1 2257 2282 2327 2342 23 63 2399 2403 2409 
2440-2441 2443-2450 2452-2459 2490 2498 2522-2523 2544 
2555 2568 2575-2576 2592 2597 2605 2619 2623 2659 2806 
2809 2812 2860 2869 2879 2903 2926 2932 2988 303 1 3066 
3075 3089 3188-3189 3286-3287 3319 3343-3344 3356 3426 
3535 3548 3552 3554-3555 3583 3591 3610 3625 3634 3673 
3709 371 1 3720 3722 3839 3899 3926 3985 4005-4006 401 1 
4020 4025 4055 4058 4060 4078 4091-41 16 41 18-4126 4128- 
4142 4172 4194 4201 4253 4277 4282 4347 4350 4353-4355 
4362 4367 4374 4379 4386 4391 4394 4396 4402 4426 443 1 
4435 4437-4439 4512 4553 4578 4586 4607 4622 4644 4654 
4671 4758 4767 4783 4798 4836 4845 4899 4940 494S 4962- 
4963 4985 49914998 5001 5037 5108 5167 5373 5377 5198 
5237 5293-5294 5380 5400 5523 5535-5536 5581-5591 5593- 
5599 5601-5613 5615-5624 5627-5637 5653 5674 5691 5693 
571 1 5724 5726 5733 5748 5757 5772 5778 5794 5817 5874 
5894 5902 5904 5906-5907 5909 591 1 5916-5919 5923-5924 
5927 5929 5932 5938 5941 5948 5957 5959-5960 5962 5964 
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5979 6005 6020 6027 6037-6038 6052 6054 6057 6060-6062 
6064 6068 61 12 6140 6195 6225 6246 6287-6288 6306-6307 
.6336 6354 6371 6374 6423 6430 6451 6458 6461 6469 6478 
6496 6508 6538 6545 6563 6580 6583 6595 6598 6607 6629 
6638 664 1-6643 6645-6675 6709 6726 6730 6765-6766 6788 
6792-6793 6795 6797-6798 6800-6801 6808 6812 6836 6843 
6850-6851 6855 6860 6864 6870 6875 6888 6908 6924 6950 
6958 6968 6998 7003 7018 7045 7049 7056 7071-7079 7095- 
7096 7109 7116 7118 7126 7133 7135 7137-7139 7155-7156 
7166-7167 7169 7175 7178 7275 7291 7294 7329 7422 7426 
7478 7608 7627 7687 7695 7716 7815 7839 7871 7893 7934 
7937 7977 8022 8095 8120 8133-8134 8148-8149 8151-8 167 
8169-8182 8184 8186-8192 8194-8196 8241 8246 8276 8289 
8298 8336 8339 8343 8345 8349 8355-8356 8361 8363 8365 
8367-8368 8370 8373 8375-8378 8385 8388 8417 8496 8518 
8520 8543 8558 8561 8563 8646 8667 8710 8738-8740 8786 
8803 8813 8865 8926 8946 8948 8963 8970 9010 9049 91 19 
9128 9142 9164 9222 9264 9289 9296 9301 93 17-93 1 8 9320 
9322 9335-9341 9343-9362 9365-9368 9370 9372-9373 9375- 
9377 9399 9455 9457 9466 9472 9475 9480 9483 9495 9526 
9533 9536 9553 9556 9558 9560 9563 9567 9570 9582 9597 
9601 9630 9646 9655 9671 9683 9695 9700 9703 9715 9722 
9729 9733 9752 9758 9783 9843 9848 9855 9880 9936 9942 
9983 10007 10027 10103 10142 10186-10217 10274 10284 
10287 10296 10313-10314 10329 10331 10349 10352 10354 
10400 10405 10430 10439 10496 10500 10507 10542 10621 
10709-10710 10712-10717 10750 10760 10776-10779 10782 
10794 10809 10816 10837 10885 10891 10895 10913 10960 
10971 10975 11057 11071 11098 11142-11143 11148 11194 
11246 11401 11406 11430-11453 11455-11464 11483 11504 
11508 11556 11560 11589 11596 1 1615 1 161 8 1 1626 1 1631 
11714 11729 11830 11833 11865 11988 12000 12006 12017 
12019-12020 12041 12044 12081 12092 12178 12195 12245 
12256 12277 12297 12327 12363 12383 12402-12403 12405 
12426-12427 12444-12455 12457-12465 12467-12473 12475- 
12476 12520 12546 12576 12599 12607 12611 12614-12615 
12617-12618 12622 12627 12630-12632 12636 12647-12648 
12650 12685 12688 12691 12693 12696 12703 12739-12740 
12743 12763 12808 12830 12834 12842 12849-12858 12860- 
12866 12885 12895 12906 12910 12913 12915 12917 12920 
12922 12929 12933 12940-12941 12954-12955 12957 12965 
12977 12990 12993 12999 13021 13039-13040 13047 13051 
13054-13055 13060 13062 13072-13073 13077 13082-13083 
13085 13092-13095 13098 13117 13123 13136 13173 13195- 
13197 13202 13249 13254 13362 13377 13391 13394 13406- 
13409 13411-13415 13417-13420 13456 13473-13474 13477 
13480 13484 13486 13488-13489 13496-13498 13500 13502 
13504 13506-13507 13511-13512 13515 13521 13546 13551 
13554-13555 13560 13575 13590 13597 13613 13616-13617 
13622 13630 13659-13660 13670 13678-13679 13695-13696 
13701 13708 13713 13726 13748 13775 13795-13796 13815- 
13826 13835 13838 13859-13860 13864 13867 13869 13872 
13882-13885 13888-13889 13891 13893 


fetal liver- 
spleen 


Columbia 
University 


FLS003 


525 2269 2529 2627 2636 3552 3554-3555 3597 4201 4246 
4253 4422 4426 4434 4438 5536 5801 8935 9536 1 1 136 1 1 142 
11519 11626 11927 12028 12077-12079 12086 12305 12403 
12424 13021 13472 


fetal liver 


Invirrogen 


FLV001 


40 51 60 598 731 1023 1048-1049 1820 1976 2111 2268-2269 
2449 2483 2529 2568 2576 2614 2809 3207 3220 3468 3595- 
3599 3885 4020 4346 4390 4422 4434 4543 4615 4767 4833 
5092 5152-5155 5270 5505 5526 5553 5581 5715 5724 5726 
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5750 5766 5788 5794 5801 5902 6125 6147 6458 6598 66?4 
6645 6765 6799 6805 6862 7104 7856 8070 8156 8359 8361 
8432 8740 8928 8935 8972-8974 9012 9041 9128 9264 9445 
9474 9485 95] ] 9536 9593 9597 9703 9780 9927 9943 9945- 
9946 9948 10007 10137 10275 10334 10529 10557 10566- 
10569 10776 10828 1 1 142 1 1560 1 1587 1 1618 1 1927 12070 
12086 12105-12108 12127 12218 12236 12327 12424 1251 1 
12541-12542 12605 12746 12916 12977 12985-32986 13020- 
13021 13060 13064 13135 13246-13247 13472 13479 13488- 
13489 13497 13533 13554 13590 13659 13680-13682 13859- 
13860 13882 13888-13889 


fetal liver 


Clontech 


FLV002 


360 996 1134 3226 5102 9401 


fetal liver 


Clontech 


FLV004 


998 1927 2449 2627 4025 4043 4426 4438 4834 5030 5726 
6730 9474 9942 10769 11142 13587 12590 12608 13889 


fetal muscle 


Invitrogen 


FMS001 


40 150 731 894 1264 1555 16J3 1821-1822 1883 1959 2161 
2343 ?4Q4-?4Qrt ?555 i">fn d?^S-d9^7 ddid a.f*n\ aaqa aiv* 

£-~3*lJ -^7t - JJJ J-UJ *fZJJ-4iJ / HHJH HO 1 I H07H H /6.3 

4885 4940 4985 5037 5045 5481 5716-5718 5724 5788 5902 
5932 6624 6742-6743 6825 7054 7104 714 1 7632 7800 8279- 
8281 8863 8935 9204 9264 9446-9449 9451 9474 951 1 9536 
9556 9722 9780-9781 10097 10273-10274 10330 10738-10739 
10750 11098 11406 11528-11530 11560 12305 12544-12546 
12590 12609 12618 12663 12670 12702 12735 I2891-I2892 
12916 13051 13053 13195 13441-13442 13479 13613 13782 


fetal muscle 


Invitrogen 


FMS002 


3378 4348 4434 8932 1 1 132 12033 12570 13782 


fetal skin 


Invitrogen 


FSK001 


60 142 235 319 641 683 800 1015 1050 1346 1774 1823-1825 
2044 2099 2 1 1 1 2 1 6 1 22 1 5 2223 2280 2367 240 1 2408 25 1 3 
2516 2614 2812 2871 2926 3207 3356 3468 3490 3503 3548 
3599 3720 3722 3885 4020 4032 4060 4 109 4238 4240-4244 
4246 4253 4277 4340 4353 4355 4388-4389 4391 4405 4434 
4543 4562 4568 4654 4667 4671 4767 4940 4944 4947 5030 
5037 5075 5 1 63 5 1 98 5296 53S0 548 1 55 1 4 55S 1 5656 5674 
5691 5719-5724 5748 5757 5801 5894 5902 5923 6057 6076 
6078 6125 6147 6215 6336 6374 6409 6437-6438 6551 6563 
6744 6783 6803 6805 6815 6873 6900 6908 6917 6985 6987 
7037 7054 7058 7067 7075 7275 7355 7431 7484 7507 7587 
7627 7642 765 3 7687 7871 7977 81 10 8134 8186 8226 82 3 5 
8251 8262 8282-8287 8298 8356 8365 8368 8452 8455-8456 
8539 8597 8648 8679 8813 8863 8898 8921 8935 8939 8948 
895 1 8957 8983 9002 9041 9061 9122 9128 9257 9285 9414 
9452-9453 9466 9474 9484 9503 9517 9529 9553 9563 9671 
9677 9703 9714 9781 9841 9927 10007 10175 10296 30343 
10491 10741-10742 10778 10795 10907 10948 11239 11513 
11531 11573 11606 11626 31807 11869 11952 12000 12006 
12160 12190 12218 12256 12327 12363 12371 12375 12378 
12426 12438 12465 32470 12488 12549-12550 12570 12604 
12615 12617 12663 12670 12735 12749 12754 12830 12893 
12904 12910 12916-12917 12976-32977 13038 13084 13090 
13116 13249 13254 13367 13389 13391 13443-13445 13456 
13472 13479-13480 13494 13496 13499 13505 I35I2 13516 
13551 13554-13555 13575 13590 13613 13630-13631 13644 
13670 13713 13782 13784 13793 13803 13858 13866 13869 
13882 13891 


fetal skin 


Invitrogen 


FSK002 


1004 3544 4834 5523 9922 9942 11134-11135 12570 13495 
13499 13793 13884 13889 


fetal spleen 


BioChain 


FSP001 


997 5030 9466 1 1 108 12033 12749 13590 


umbilical 
cord 


BioChain 


FUC001 


60 83 89 142 166 567 609 760 997 1302-1304 1306-1307 1309 
1671 1697 1724 1848 1917 1978 21 1 1 2154 2161 2207 2215 
23 15 2343 2444 2569 2576 2591 2597 2652 2866 2926 3468 
3526 3599 3602 3625 3697 3722 3799-3805 3813 3839 3885 
4025 4060 4173 4197 4246 4277 4340 4364 4387 4391 4395- 
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4396 4405 44 10 4434 4543 4589 4667 4694 4767 4S34 4865 
4940 4957 4963 4985 5030 5093 50965108 51635171 5213 
5313 5340 5342-5344 5380 5481 5521 5526 5627 5656 5674 
5714 5743 5750 5757 5801 5895 5902 5912 5938 6027 6061 
6082 6125 6186 6195 62186241 6313 6321 6410 6458 6486- 
6495 6595 6607-6608 6668 6745 6825 6860 6S70 6979 7054 
7141 7260-7261 7275 7329 7355 7373 7383 7444 7579 7586 
7677 7716 7807 793 0 7935-793 6 7995 8077 8093 8226 8246 
8262 8298 8343 8345 8358 8387 8452 8535 8563 8635 8669 
871 1 8819 8858 8863 8921 8935 8943-8944 8948 8963 9001- 
9002 9012 9097 9130 9132 9134-9138 9264 9303-9304 9313 
9332 9401 9424 9466 9472 9474 9544 9597 9665 9677 9700 
9722 9734 9758 9834 9841 9S48 10050 10175 10183 10267 
10277 10311 10337 10446 10470 10508 10529 10566 10691 
10772 10828 10961 11108 11132 11145 11239 11241 11273- 
11276 11293 11408 11484 11513 11596 11604 11607 11620 
11695 11823 11909 11927 11942 12000 12077 12160 12245 
12255-12256 12292 12305 12308 12363 12383 12404-12405 
12426 12438 1251 1 12570 12601 12605 12630 12700 12706 
12754 12875 12913 12916 12935 12957 12993 13002 13009 
13020-13021 13062 13135-1313613141 13254 13283 13318- 
13319 13336 13394 13400 13472 13479 13481 13488-13489 
13495-13496 13499 13507 13511-13512 13516 13551 13555 
13568 13575 13590 13613 13630 13644 13713 13740-13742 
13866-13867 13882 13884 13888 


fetal brain 


G1BCO 


HFB001 


51 70 21 1-213 215-222 224-229 445 586-588 598 608 703-710 
712-716 1068 1148 1178 1184 1193 1308 1516 1556 1774 1778 
1 957 1974 1 976- 1 977 20 1 0 2025-2029 2 1 34-2 135 21452161 
2171 2174 2220 2223 2257 2266 2283 2444 2449 2510 2555 
2563 2569 2590 2597 2614 2617 2717 2719-2723 2726 2729- 
2734 2809 2997 3002 3081 3083 3086 3109 3226-3237 3239 
3257 3272 3278 3282-3283 3286 3356 3419 3460 3479 3492 
3495 3544 3548 3557 3590 3604-3605 3625 3627 3634 3673 
3689 3697 3709 3722 3797 3810 3839 3885 4006 401 1 4019 
4022 4025 4040 4054 4059 4095 4194 4201 4253 4277 4282 
4355 4364 4383 4399 4405 4410 4412 4434 4473 4480 4498- 
4504 4507-4508 4517 4543 4654 46S9 4694 4730-4733 4735 
4755 4783 4820-4823 4825-4830 4845 4885 4940 4949 4957 
4962 4972 4985 5017 5019 5022 5031 5037 5108 5132 5167 
5171-5172 5262 5380 5481 5521 5523 5527 5553 5616 5656 
571 1 5740 5788 5801 5894 5902 5906 5908 5923 5938 5979- 
5989 6057 6082 6096 6125 6147 6159 6161-6163 6165 6207 
6212-6213 6215-6222 6241 6257 6288 6336 6369 6374 6419 
6422 6512 6551 6555 6595 6598 6607 671 1 6767 6788 6823- 
6825 6860 6871 6879 6892-6894 6900 6909-6910 7058 7118- 
71 19 7169 7220 7222 7224-7225 7228-7229 7231-7233 7275 
7426 7431 7444 7481-7483 7485-7488 7560-7567 7569 7608 
7743 7768 7792 7814 7845 7864 7930 7977 7995 8093 8095 
8110 8140 8226 8235 8262 8345 8358 8361 8370 8372 8387 
8441-8443 8445 8447 8450-8455 8520 8535 8558 8597 8603 
8648 8663-8666 8677 8702 8742-8749 881 1 8838-8839 8943 
8951 9010 9092 9134 9137 9257 9287 9304 9317 9455 9457 
9466 9472 9475 9484 95 1 0 95 1 5 9533 9553 9567 9644-9645 
9647-9649 9734 9758 9781 9785-9786 9791 9832-9837 9927- 
9928 9939 9942 9970 10053 10175 10275 10277 10296 10329 
10375-10378 10434 10464 10486 10496 10645 10679 10691 
10778 10782 10791 10838-10844 10928 1 1019-1 1025 1 1027 
11032 11055 11061 11108 11132 11145 11153 11208 11239 
11343 11483 11513 11588 11596 11604 11606 11620-11621 
11668 11691-11695 11818 11867 1 1869 1 1929-1 1930 12006 
12033 12039 12041 12044 12047 12066 12078 12197 12218 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 








12327 12375 12405 12411 12424 12438 12521-12522 12564 
12570 12576 12582 12590 12599 12601 12604 12611 12626 
12630 12637-12640 12670 12674 12688 12694-12695 12703 
12734 12748 12754 12785 12835 12840 32913 12916-12917 
12929-12930 12935 12940 12946 12959 12989 12993 13020 
13051-13052 13054 13073 13083 13090 13108-13110 13] 17 
13131 13136 13148.13158 13160 13169 13184-13186 13254 
13263 13277 13295 13389 13394 13438 13456 13477 13479- 
13481 13489-13490 13494-13500 13502-13503 13506 13512 
13516 13535-13536 13555 13575 13590 13592 13605-13606 
13613 13616 13627 13630 13644-13645 13660 13670 13679 
13687 13713 13793 13803 13838 13861 13866 13868-13869 
13875 13888 13891 13896 


macrophage 


lnvitrogen 


HMP001 


1002 2253 3548 401 1 4058 4201 4246 4282 5526 8093 8262 
11137 12039 12426 12511 12521 I38S8 


infant brain 


Columbia 
University 


IB2002 


6 89 142 21 1 276 307 518 589-590 598 644 717-718 720-721 
773 841 921-924 926-932 937 1023 1091 1147-1150 1289 1493 

1522-1525 1557-1564 1724 1778 1928 1947 1976 1978 2097 
2111 21382172 2218-2223 2257 2283 2401 2418-2419 2444 
2449 25 1 6 2555 2563 2569 2575 2597 2627 2809 2818 2880 
2932 3087 3166 3226 3241-3245 3272 3435 3474 3476-3477 
3481 3483-3486 3488-3489 3492 3494-3495 3497 3538 3544 
3548 3572 3595 3597 3669 3671-3672 3709 3720 3722 3737 
3797 3817 3885 3985 3997-3999 4006 401 1 4022-4031 4040 
4095 4201 4246 4253 4277 4297 4355 4405 4410 4434 4676 
4689 47674775 4831-4834 4837 4885 4962 4983 4998 5017- 
5019 5021-5031 5035-5037 5040 5095 5143 5164 5167 5169 
5233-5236 5380 5505 5517-55 19 5523 5536 5546-5548 5691 
5695 5724 5750 5766 57SS 5801 5S95 6027 6048 6082 6095 
6111 611961326135 6147 6223 6301-6303 6305-6307 6374 
641 1 6459 6555 6563 6591-6595 6601 6606-6607 6609-661 1 
6613-6614 6624 671 1 6767 6799 6805 6909 6946 701 8 7052 
7067 7118 7133 7166 7168-7169 7386 7464 7469 7473 7571- 
7573 7697-7698 7716 7814 7827-7828 7905 7934 7977 8071- 
8073 8084 8098-8104 8235 8320 8331 8351 8354-8356 8364- 
8365 8370 8372 8376 8452 8520 8572 8667-8668 8750-8752 
8813 8830 8863 8883-8885 8950 8958 8963 9040-9041 9251 
9257 9267 9293-9298 9313 9424 9454 9456-9457 9466 9510 
9514-9515 9533 9542 9556 9576-9577 9597 9626 9646 9722 
9758 9787 9903-9904 9988 10027 10138-10139 10155-10160 
10284 10296 10319 10346 10496 10536-10542 10557 10670- 
10672 10680 10682 10778 10791 10897 10971 11028-11030 
11052 11 100-1 1109 11122 11132 1 1 192 1 1392 1 1403-1 1404 
11447 11533 11568 11595 1 1606 1 1626 1 1638 1 1818 11927 
11931-11933 12028 12030-12031 12033-12034 12036-12039 
12041 12047 12049 12146 12155-12157 12160 12327 12344 
12363 12383-12385 12402 12405-12408 12424 12439 12521- 
12522 12570 12590 12601 12611 12615-12616 12626 12639 
12684-12685 12688 12696-12697 12707 12725-12727 12729 
12748-12749 12754 12763-12764 12830 12904 12913 12916- 

' l^y^j izy/ / izyyu \2.yyt> 13UUU 1jU4/ 
13051 13054 13062 13073 13082-13084 13317 13141 13187- 
13188 13196 I32I8-132I9 13249 13257 13277 13284-13285 
13351 13389 13394-13395 13456 13458 13479-13481 13488 
13494-13495 13499 13503 13512 13516 13530 13535 13575 
13607-13608 13613 13616 13641 13649 13659 13679 13705- 
13706 13708 13713 13793 13798-13801 13803 13861 13869 
13872 13875 13883-13884 13888 13891-13893 13896 


infant brain 


Columbia 
University 


IB2003 


646 746 1914 1947 1959 19942111 2220 2257 25162962 
3166 3226 3272 3435 3572 3885 4277 4377 4410 4833 4837 
5029-5030 5040 5147 5259 5851 6147 6423 6595 6598 661 1 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 








7145 7169 7716 8262 8354-8355 8364 8452 8863 9264 9457 
95 1 5 9556 9577 9626 9646 97 1 5 978 1 9928 9970 1 0027 1 0329 
10376 10557 10791 11083 11289 11406 11459 11513 11663 
1 1933 12039 12405 12546 12570 12601 12616 12685 12754 
12913 12916 12954 12977 12990 13000 13056 13062 13195 
13257 13392 13456 13458 13480-13481 13488 13497 13499 
13506-13507 1351 1-13512 13514 13516 13549 13616-13617 
13649 13793 13861 13884 13888 13891 13896 


infant brain 


Columbia 
University 


IBM002 


1564-1565 1976 2218 2420 3166 4391 5788 6147 6302 10329 
11108 11513 12039 12729 12830 13062 13257 13512 13617 
13803 13859 13861 


infant brain 


Columbia 
University 


IBS001 


927 931 1564 2915 3166 3737 3885 5029 5044 5095 5143 5701 
5894-5895 6135 6307 7018 8452 8963 9424 9556 9626 10296 
10542 11083 11100 11108 11167 12383 12611 12785 13257 
13280 13479 13495 13506 13617 13679 13713 13896 


lung, 
fibroblast 


Strategene 


LFB001 


67 235 552 64 1 746 1 346 2090 2 1 86 2343 2770 2S 1 2 287 1 
3203 3537 3548 3634 3812 3839 4073 4253 4434 4654 4783 
4786 4821 4940 4957 4963 4985 5163 5172 5492 5523 5788 
591 1 5938 6027 6048 6057 6223 6272 6322 6599 6873 6886 
6997 7204 7759 7815 7933 7977 8226 8235 8262 8354 8452 
8648 8742 8791 8830 8863 8S98 8944 8948 8951 9000 9076 
9097 9128 9290 9304 9414 9455 9466 9472 9476 9503 9544 
9563 9671 10000 10050 10342 10376 10434 10447 10679 
10760 10857 11055 11094 11289 1 1402 1 1607 1 1818 11837 
12000 12006 12178 12363 12405 12462 12582 12617 12636 
12670 12754 12913 12916-12917 12925 12940 13126 13377 
13394 13446 13456 13494 13497 13499 13502 13575 13613 
13670 13679 13691 13795 13797 13867-13868 13882 


lung tumor 


Invitrogen 


LGT002 


51 69 88 142 158 211 444 598 608 722-723 733 795-797 799 
1004 1023 1135-1145 1308 1338 1346 1493 1526-1533 1535 
J566-1569 1626 1654 1816 1841 1848 1860 1976 1983 2090 
2 1 50 2 1 6 1 2 1 73 2223 2257 2266 2296-2297 2342 2402 2449 
2483 2555 2569 2576 2591 261 1 2623 2724 2809 2868 2880 
2926 3088 3166 3272 3347 357 1 3658-3666 3673 3722 3737 
3797 3839 3885 4000-4002 4006 401 1 4025 4032-4034 4060 
4133 4201 4246 4253 4277 4282 4340-4341 4384 4388 4391 
4399 4402 4405 4434 4543 4622 4671 4686 4767 4783 4791 
4833-4834 4836 4885 4903 4940 4957 4987 4998 5001 5017- 
5018 5030 5037 5052 5108 5163 5210 5227-5232 5422 5451 
5520-5528 5581 5627 5637 5674 5693 5713 5724 5748 5766 
5772 5788 5894-5395 5902 5907 591 7 5938 5990 6005 6020 
6094 6129 6147 6171 6224-6225 6239 6241 6253 6322 6336 
6404-6410 6461 6508 6595-6598 6624 6765 6793 6798 6851 
6870 6873 6994 7003 700S 7045-7047 7053-7055 7067 7085 
7109 7116 71397141 71587169 71877468 75177524 7579 
7608 7676 7687 7716 7761 7825-7826 8074-S078 8105-8106 
8139 8235 8298 8323 8335 8345 8354 8359 8363 8365 8370 
8372 8375 8456 8563 8633 8648 8678 8741 8792 8798 8805 
8831 8863 8870 8989 9002 9010 9012 9032-9037 9039 9051 
9076 9080 9092 9128 9257 9269-9271 9273-9274 9276 9299- 
9301 9399 9414 9424 9457 9466 9472 9480 9484-9485 9517 
9533 9536 9539 9544 9567 9612 9626 9665 9677 9689 9700 
9703 9758 9773 9784 9788 9838-984 1 9870 9928-9929 9942 
9970 9983-9986 10140-10143 10149 10161 10175 10275 10296 
10319 10330 10346 10349 10386 10496 10508 10561 10591- 
10593 10673-10674 10742 10774 10778 10782 10788 10802 
10836 10971 11031-11033 1 1064 1 1 108 1 1 132 1 1 182-1 1 191 
11393-11395 11406 11459 11462 1 1560 1 1565 1 1588 1 1596 
11604 11606-11607 11615 11618 11629 11821 11823 11845 
1 1869 1 1932 1 1934 1 1961 1 1978-1 1980 12000 12006 12017 
12022 12033 12078 12080 12127 12150-12154 12178 12236 | 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 








12327 12363 12371 12379 12386-12393 12409 12424 12521 
12570 12590 12616-12618 12630 12637 12674 12691 12693 
12698-12699 12716 12749 12754 12760 12762 12830 12832- 
12833 12842 12904 12907 12913 12917 12923 12993 13020 
13031-13032 13035 13060 13062 13064 13072 13083 13090 
13136 13150 13193 13197 13202 13234 13277 13279 13281- 
13282 13295-13296 13381-13383 13385-13386 13394 13400 
13456 13477 13479-13480 13484 13488-13489 13492 13494 
13496-13497 13499 13503 13506-13507 13511 13554 13575 
13597 13616 13630 13646 13649 13659-13660 13670 13701- 
13704 13713 13743 13782 13788-13789 13791-13792 13802- 
13803 13864 13867-13868 13872 13882-13884 13888 13892 


lymphocytes 


ATCC 


LPC001 


142 316 1034 1977 2161 2342 2367 2483 2513 25292932 2962 
3272 371 1 3722 3 839 4006 4277 4282 4405 443 4 483 6 4970 
5037 5239 5451 5706 5796 5924 5941 6048 6147 6301 6765 
6793 6805 6815 7133 7171 7275 7320 7687 783 5 8033 8093 
8095 8105 8110 8246 8262 8345 8365 8368 8370 8535 8664 
8fi74 8799 RRSR RRQR 8Q1G CQ<1 CtfUO Q'>«:'7 cmc£ oac-i 

9484 9539 9544 9560 9758 9928 9939 10175 10201 10284 
10333 10496 10520 10529 10679 10744 10766 J 1032 1 1 108 
11508 11513 11519 11853 11862 12033 12160 12363 12387 
12402 12424 12438 12520 12560 12570 12663 12830 12913 
12923 12968 13021 13066 13072 13085 13140 13147-13148 
13394 13479 13488 13495 13499 13507 13575 13867 13869 
13888 


leukocyte 


GIBCO 


LUC001 


21 49 51 67 83 88 94 142 211 316 326 340 368 371 403 438 
443-455 464 485 518-523 525-530 532-535 591-592 616 724- 
726 744 800 841 889 943 1004 1346 1556 1570-1572 1826 
1914 1916 1927 1959 1976-1977 2095 2103-2107 2124-9 197 
2134 2138 2145 2161 2163 2175 2215 2223 2237 2266 2343 
2363 2412 2421 2449 2483 2529 2555 2569 2575 2597 2617- 
2618 2695-2696 2827 2876-2877 2899 2903 2961 2965-2975 
3002 3035-3038 3040-3046 3 109 3 132 3 1 66 3 199 3203 3207 
3246-3251 3265 3272 3286-3287 3312 3344 33563361 3378 
3400 3434 3451 3468 3479 3484 3494 3503 3552 3590 3610 
3627 3634 3704 3709 3720 3722 3725 3757 3797 3804 3810 
3839 3885 3985 3996 4006 4009-401 1 4019 4022-4023 4025 
4035-4037 4058 4060 4194 4253 4269 4277 4297 434 1 4362 
4388 4390-4391 4396 4399-4400 4402 4405 4410 44 12 443 1 
4434 4534 4543 4594 4615 4630 4641 4651-4663 4670 4688- 
4689 4694 4699-4706 4736 4755 4758-4759 4767 4775 4783 
4791 4798 4828 4835-4840 4845 4865 4930 4940 4947-4948 
4957 4962 4964 4972 4976 4985 4991 4998 5001 5003 5017 
5022 5030 5037 5040 5075-5076 5108 51 18 5143 5163 5171- 
5172 5313 5481 5503 5505 5521 5523 5526 5535 5549 5552 
5656 569} 5724 5726 5740 5750 5766 5772 5788 5794 5796 
5801 5865 5874 5894-5895 5906 590S 5923-5924 5928 5938 
5989 6027 6042 6057 6063-6064 6082 6094-6103 6125 6130- 
6137 6142 6147 6166-6167 6171 6181 6214 6226 6239 6253 
6301 6307 6371 6374 6418 6512 6662 6716 6730 6788 6792 
6799 6815 6836 6860 6873 6876-6877 6886-6887 6945 6975 
6979 7018 7037 7056 7058 7067 7116 7118 7137 7155 7158 
7171 7275 7315 7384-7385 7387-7389 7391-7393 7395-7398 
7452-7463 7468 7494 7608 7671 7676 7687 7714 7733 7792 
7815 7845 7864 7870 7905 7930 8093 8107 81 10 8120 8139- 
8140 8224 8226 8262 8276 8320 8363-8365 8368 8375 8387 
8432 8452 8456 8520 8535-8536 8539 8562-8563 8577 8579 
8597 8599-8601 8603-8605 8608 8640-8646 8648 8664 8669 
8674 8677 8691 8702 8722 8753 8755 8798 8815-8816 8830 
8838-8839 8858 8863 8870 8876 8898 8943-8944 8948 8951 
9001 9010 9012 9061 9076 9092 9097 9128 9257 9304 9454- 
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Name 


SEQ ID NOS: 








9456 9466 9472-9475 9484 9497 9511 9515 9517 9529 9536 
9542 9556 9567 9612 9626 9702-9703 9723 9731-9738 9745 
9752 9758 9762-9770 9780-9781 9789 9791 9808 9832 9842 
9927-9928 9939 9942 9970 10000 10175 10275 10319 10329- 
10331 10333 10345-10346 10412 10428 10430 10433 10435- 
10436 10453-10454 10456 10465 10470 10475 104S8 10496 
10513 10566 10645 10679 10683 10691 10774 10776 10778 
10788 10791 10795 10802 10891 1092S 10930 10935-10941 
10948 10961-10962 10964-10967 11034 11055 11108 11132 
11136 11145 11239 11289 1 1405-1 1406 1 1459 11483 11519 
11588 11604 11606 11618 11620 11693 11695 11805 11809- 
11816 11818 11830 11833 11848 11850-1185611868 11927 
11932 11934-11935 11937-11939 11954 11988 12000 12006 
12009 12028 12033 12037 12039 12047 12053 12092 12097 
121 18 12127 12143 12159-12160 12178 12186 12327 12336 
12363 12375 12387 12402 12404-12405 12410 12424 12438 
12442 12490 12511 12521-12522 12556 12560 12564 12570 
12590 12599 12601 12605 12615-12617 12636 12643 12653 
12660 12670 12672-12680 12693 12700-12701 12706-12707 
12715 12735 12754 12823 12895 12904 12910 12913 12916- 
12918 12923 12925 12935 12945 12947 12955-12956 12972 
12977 12993 13002 13020-13021 13051 13062 13066 13072- 
13073 13082 13085 13117 13126 13130-13131 13135 13138- 
13140 13147-13151 13161 13169 13189 13254 13295 13391 
13394 13425 13436 13442 13456 13477 13479 13488-13490 
13494-13496 13498-13503 13505-13507 13512 13530 13546 
13550-13551 13554 13557-13564 13575-13581 13590 13592- 
13593 13609 13613 13616 13627 13630 13645-13646 13660 
13679 13713 13782 13803 13864 13866-13869 13872 13883 
13885 13888 13893 


leukocyte 


Clontech 


LUC003 


536 539 541-543 593 728 1552 1927 1974 3089 3252-3253 
3434 3548 3709 371 1 3722 3797 401 1 4019 4040 4060 4194 
4201 4277 4282 4622 4707 479 1 4841-4842 4949 4998 5001 
5030 5345 5895 6052 6138 6227 6419 6595 6754 6765 6767 
6788 69 1 1 -69 1 2 7049 7139 7171 7464 7575-7577 8093 8110 
81 16 8365 8370 8375 8592 8648 8830 8863 8944 9466 9544 
9612 9758 9771 9773 9790 9929 10326 10346 10679 10779 
I096I 11035-11036 11132 11401 11513 11853 11940 12160 
12256 12405 12670 12674 12681 12840 12904 12906 13020 
13051 13162 13280 13400 13488 13511 13516 13554 13670 
13679 13860 13869 13883 13889 


melanoma 
from cell line 
ATCC UCRL 
1424 


Clontech 


MEL004 


83 142 360 447 841 1061 1346 1516 1573 1724 1959 1963 
2159 2175 2343 2367 2513 2652 2812 2876 2961 3001 3272 
3345 3604 3627 3 722 3 754 3 839-3840 3985 3 988 4040 4194 
4246 4341 4515 4568 4641 4667 4862 4940 5075 5132 5163 
548 1 5492 5523 5740 5788 6008 6 1 12 6 11 9 6147 6 1 89 61 95 
6272 6287 6418 6423 6508 6563 6726 6860 6870 7134 7166 
7169 7275 7294 7579 7677 7748 7831 7852 79808110 8116 
8226 8235 8262 8320 8345 8363 8370 8456 8563-8564 8633 
8838 9001-9002 9128 9296 9304 9313 9484 9542 9544 9646 
9703 9758 9780-9781 9808 9999 10007 10027 10296 10346 
10470 11145 11239 11289 11401 11406 11508 11596 11606 
11620 11823 11899 11950 12019 12107 12160 12292 12329 
12363 12405 12436 12523 12599 12658 12670 12749 12754 
12774 12842 12930 13020 13296 13394 13458 13489 13498 
13501 13507 13551 13554 13575 13613 13616 13649 13660 
13743 13775 13868 


mammary 
gland | 


Invitrogen 


MMG001 


51 67 142 449 594 598 616 708 729-733 1004 1060 1194-1196 
1310-1315 1329 1536-1541 1826 1848 1858 1914 1947 1974 
1976-1978 2174-2175 2283 2299 2310 2336 2343 2403-2406 
2449 2483 2555 2563 2576 2597 261 1 2614 261 7 2812 2827 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 








2926 3191 3207 3490 3503 3673 3714-3715 3722 3737 3777 
3796 3807-38 19 3885 4003-4004 4060 4330 4346 4355 4391 
4399 4434 4543 4671 4767 4791 4836-4S37 4843 4940 4964 
4975 4985 4998 5001 5017 5019 5022 5052 5108 5155 5163 
5168-5169 5269 5345-5347 5466 5526-5527 5529 5581 5615 
5724 5772 5778 5788 5794 5894-5895 5902 5907 591 1 5917 
5923-5924 6048 6057 6064 6094 6109 6125 6147 6156-6157 
6215 6228 6321 6430 6458 6496-6499 6545 6579 6598-6599 
6607 671 1 6765 6780 6793 6797 6799 6805 6880 6931 6979 
6987 6996 7012-7013 7048 7054 7057 7104 7141 7158 7169 
7517 7578-7581 7583 7872 7937-7940 8110 8139 8156 8224 
8262 8292 8298 8336 8359 8363 8365 8372 8452 8619 8628 
8646 8648 8722 8758-8759 8782 8813 8815 8863 8882 8939 
8963 8983 9012 9097 91 1 1 9139-9141 9143 9157 9164 9264 
93 1 3 9327 9347 9466 9484 95 1 1 95 1 7 9544 9553 9563 9577 
9646 9700 9703 9734 9746 9780 9791 9844-9846 9927-9928 
9942 10006-10007 10051-10055 10144 10146-10147 10162 
10285 10330 10346 10439 10490-10491 10496 10542 10557 
10600 10628 10645 10691 10729 10772 10796 10983 11038 
11108 11132 11216-11217 11277-11279 11396-11397 11406 
11459 11513 11596 11606-11607 11626 11628 11823 11830 
11833 11902 11941-11943 11979 12000 12009 12037 12047 
12078 12093 12146 12160 12181 12258-12263 12281 12327 
12371 12383 12387 12395 12424 12426 12438 32442 12484 
12511 12564 12570 12590 12594 12601 12605 12609 12615- 
12616 12618 12621 12653 12670 12691 12699 12702-12703 
12749 12771 12791-12794 12834-12835 128S5 12898 12905 

19Q1^ 1 OQ 1 *7 lOQOn lOQ^^ noiA T>fi*7*7 nnni 
i&y\o uyio-iz^i / L^yzu i£yj.j> Izy/o-izy// 13002 

13020 13035 13047 13062 13064 13066 13072-13073 13084 

13090 13092 13127 13135 13169 13190 13193 13195 13249 

13254 13296 13320-13322 13392 13456 13477 13479-13480 

13488-13489 13494 13497-13499 13505-13507 13512 13514 

13516 13546 13551 13554-13555 13590 13611-13612 13631 

13649 13670 13713 13743-13745 13838 13864 13866-13867 

13869 13872 13875 13882-13885 13888 13891 


induced 
neuron cells 


Strategene 


NTD001 


88 1493 1552 15612034 2090 2510 2522 2570 2576 2623 3226 
3272 3839 4006 4058 4282 4384 4694 4783 4888 5103-5104 

6418 6449 6607 6960 6972 7058 7494 7815 7937 8370 8929- 
8930 8996 9000 9003 9128 9157 9289-9291 9472 9928 10275 
10550 10729 10760 1 1 607 1 1 736 11818 12039 12075 1^094 
12256 12363 12381 12423 12522 12707 12735 12863 12906 
12958-12959 12963 13020 13052 13054 13196 13202 13310 
13368 13378 13438 13479 13486 13506-13507 13546 13616 
13654 13659 13691 13795 13891 


retinoid acid 
induced 
neuronal cells 


Strategene 


NTR001 


1552 3839 3885 4282 4434 5030 9308 9466 11108 11132 
12034 12039 12405 12424 12570 12905 13047 


neuronal cells 


Strategene 


NTU001 


88 708 1552 1561 1947 2223 2490 2614 3272 3610 3625 3627 

d9f)l AO^I AOQ1 A11*7 AA'XA Ac.A'X A/Z^>^> AAA i ci r\A ci^i 

5523 5541 5684 5788 6094 6147 6307 6601 6862 7755 8648 

10963 11032 11108 11132 12034 12039 12075 12405 12424 
12570 12590 12706 12749 12830 12905 12917 12963 13035 
13073 13280 13490 13507 13613 13616 


pituitary 
gland 


Clontech 


PIT004 


1 148 5674 6068 6371 7759 8093 10430 12605 12636 12658 
12916 13021 13082 13875 


placenta 


Clontech 


PLA003 


3885 4025 5030 5536 571 1 5923 8349 8367 9921 12033 12076 
12403 12511 13655 13885 


prostate 


Clontech 


PRT001 


67 83 142 225 235 379 486 572 616 1286 1316-1329 1538 
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RNA 
Source 


Library 
Name 


SEQ ID NOS: 








2023 2139 2147 2161 2266 2290 2337-2344 2346 2367 2422 
2444 2511 2631 2809 3001-3002 3084 3419 3563 3625 3627 
3797 3S10 3821-3827 3829-3835 4019 4022 4173 4282 4332 
4434 4667 4755 4791 4862 4865 4998 5001 5066 5171 5349 
5352-5356 5358-5361 5363-5364 5481 5581 5656 5674 5723 
5740 5902 6027 6047 6125 6321 6371 6374 6427 6458 6463 
6500-6507 6509 6563 6598 6643 6793 6803 6871 7014-7015 
7058 7104 7275 7320 7608 7635 7733 7842 7852 7864 7941- 
7943 7946-7951 7953-7954 8077 8087 S093 81 10 8224 8226 
8452 8487 8520 8558 8635 8677 8863 8950 8963 8983 8997 
9012 9145-9148 9150-9154 9269 9302 93 17 9466 9503 9528 
9646 9703 9780 10000 10027 10056-10060 10491 10629 10691 
10777 10891 11145 11200 11239 1)281 11283-11295 11344 
11406 11761 11837 11862 12006 12166 12264-12278 12305 
12363 12368 12411-12412 12438 12467 12685 12691 12729 
12734 12795-12798 12800 12830 12863 12904 13010-1301 1 
13104 13146 13295 13323-13326 13377 13394 13456 13473 
13477 13489 13516 13533 13550 1361 1 13678-13679 13746 
13866-13867 13884 13889 


rectum 


Invitrogen 


REC001 


6 67 142 683 731 997 1 178 1909 1959 2005 2023 2596 261 1 
2614 2809 2926 3314 3333 3455 3722 3780 3870 4173 4355 
4837 4949 4987 5526 5580 5615 5674 5691 5724 5788 5895 
5909 5924 6057 6112 6195 6765 6805 7815 7833 8095 8664 
8863 9517 9539 9544 9884 9927 10400 10666 11401 11513 
11606 11985 12160 12327 12428 12693 12848 12910 12977 
13051 13064 13072 13420 13494 13497 13507 13512 13515- 
13516 13866 13869 13884 13888 


salivary gland 


Clontech 


SAL001 


67 731 800 997 1054 19142267 2395 252931363595 3627 
401 1 4192 4246 4330 4434 4641 4957 4987 5040 5052 5163 
5451 5481 5706 5723 5788 5895 6219 6621 6801 6900 6975 
7045 7733 8110 8372 8535 8563 8635 8830 S95I 9000 9010 
9051 9313 9472 9475 9671 9724 9758 9927 10027 11145 
11695 11725 12017 12284 12363 12424 12427 12570 12609 
12670 12674 12693 12977 13035 13307 13554 13617 13867 
13872 13889 13891 


salivary gland 


Clontech 


SALs03 


1516 1724 1858 5030 6186 13657 13864 


skin fibroblast 


ATCC 


SFB001 


2251-2252 5788 6068 12511 


skin fibroblast 


ATCC 


SFB002 


6068 8951 12511 


skin fibroblast 


ATCC 


SFB003 


4025 5895 7741 


small 
intestine 


Clontech 


SIN001 


142 319 627 654 1034 1063 1197-1198 1330-1338 1340-1359 
1575 1646 1774 1814 19782161 2347-2354 2409 2876 3046 
3419 3460 3605 3716-3718 3737 3797 3837-3839 3841-3843 
3845-3857 3885 3986 4060 4201 4301 4351 4385 4568 4689 
4694 5076 5163 5270-5273 5304 5326 5365 5367-5372 5374 
5503 5550 5701 5772 6064 6094 6171 6288 6427 6430-6432 
6438 6510-6522 6598 6615 6793 6815 6997-6998 7016-701 8 
7054 7058 7072 7309 7450 7604 7769 781 1 7873-7876 7955 
7957 7959-7962 7964 8120 8298 8350 8452 8830 8863 8950- 
8951 8966 9010 9073-9075 9119 9126 9128 9155-9166 9303 
9544 9560 9780 9884 9928-9929 10008-10010 10061-10068 
10097 10262 10330 10351 10601 10630-10634 10760 10983 
11061 11219 11296-11308 11310-11313 11513 11620 11693 
12182-12183 12280-12287 12327 12363 12488 12707 12799- 
12801 12922 12991 13012-13014 13035 13051 13064 13297 
13307 13328-13332 13335 13382 13499 13506 13554 13560 
13575 13631 13695 13714 13747-13749 13751 13882 13884 


skeletal 
muscle 


Clontech 


SKM001 


1 104 1346 2363 2367 2495 2555 2876 2880 3555 3634 3722 
4011 4022 4194 4201 4253 4277 4282 4434 4641 4940 4972 
4998 5343 5481 5523 5801 6005 6336 6873 7408 7995 81 10 
8120 8235 8262 8292 8345 8372 8576 8740 8830 8936 8951 
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Tissue 


RNA 


Library 


SEQ ID NOS: 








9303 9689 10616 10679 11132 11 145 11825 11869 11927 
12387 12442 12467 12570 12663 12735 12916 13280 13503 
13872 13885 


skeletal 
muscle 


Clontech 


SKM002 


8535 


skeletal 
muscle 


Clontech 


SKMsOS 


6336 6962 8936 


skeletal 
muscle 


Clontech 


SKMs04 


770 1724 3797 4277 6336 12405 13658 


spinal cord 


Clontech 


SPC001 


83 142 390 415 598 668 708 731 1184 1199-1207 1360-1375 
1377-1396 1516 1574-1576 1595-1596 1849 1927 20702129 
2161 23 1 1-23 14 2345 2355-2368 2423-2424 2430 24S4 2529 
2569 2576 2876 3215 3249 3272 3283 3532 3584 3627 3634 
3711 3719-3722 3737 3839 3860-3884 4011 4025 4038-4039 
4043 4055-4056 4173 4246 4282 4354 4375 4391 4434 4681 
4767 4781 4808 4964 4985 4998 5037 5163 5233 5274-5277 
5375-5392 5394-5402 5523 5569-5570 5581 5615 5723 5788 
5835 5902 5928 5936 6047 6078 6082 62 1 1 6288 6374 6433- 
6435 6512 6523-6531 6534 6595 6616 6625-6626 6788 6894 
6979 6999 7018-7026 7126 7166 7359 7473 7642 7653 7807 
7814 7877-7879 7965-7968 7970 7972-7980 8105 8108-81 10 
8139 8246 8298 8345 8363 8368 8482 8603 8646 8884 8898 
8981 9010 9012 9076-9078 9098 9167-9179 9184-9189 9264 
9302 9304 9319-9320 945 5 9466 9520 9530 9544 9556 9567 
9781 9895 9901 9928 9942 9947 9969 9999 10007 10069- 
10077 10079-10085 10177 10296 10326 10346 10376 10422 
10566 10602 10635-10638 10679 10685-10686 10729 10776 
11132 11220 11246 11314-11323 11325-11330 11417-11418 
11459 11513 11818 12000 12011 12037 12033 12039 12160 
12184-12185 12288-12292 12295-12299 12301-12305 12363 
12375 12383 12387 12402 12413 12442 12468 12527 12605 
12617 12636 12657-12658 12739-12740 12754 12772 12802- 
12809 12830 12835 12841-12842 12905 12923 12940 12976 
13003 13015 13017-13021 13051-13052 13117 13126 13136 
13260 13277 13283 13295 13336-13343 13367 13442 13456 
13473 13477 13481 13495 13497 13499-13500 13507 13516 
13659 13670 13713 13715-13716 13748 13752-13759 13803 
13869 13872 13884-13885 13888 13893 13896 


adult spleen 


Clontech 


SPLcOl 


800 1927 4032 4834 6064 6135 6195 6446 6788 6873 7166 
7455 8966 9929 10744 12402 12564 12590 12691 12904 12933 
13082 13500 13506-13507 13516 13575 13864 13869 13883 
13889 


stomach 


Clontech 


STO001 


21 83 142 1004 1208-1215 1217-1219 1397 1399-1405 1671 
23 1 5-23 1 6 2345 2369-2373 2375 2575-2576 2809 2846 2984 
3 136 3166 3537 3610 3698 3723-3725 3839 3885-3897 4057- 
4059 4173 4277 4410 4480 4667 4791 4808 4940 4987 5262 
5278-5281 5283-5284 5403-5405 5407-5424 5481 5656 5674 
5796 5904 6418 6436-6440 6535-6540 6563 6627-6629 6765 
6940 7000-7001 7027-7030 7064 7135 7509 7604 7880-7885 
7981-7990 8087 81 10 8120 8143 8226 8452 8535 9010 9079- 
9081 9191 9193-9196 9304-9306 9313 9317 9321 9715 10007 
10011-10013 10086-10093 10178-10179 10603-10605 10640- 

1 CifyAO 1 1 OfiQ 111 fn 1 1 '79 1 11 ')'>'> i n^i ip^t iiT>n iii/ii 
iVUHZ, 1 1UD7 HID/ I IZZZ 1 J DO \ —\ \ DJ f \ ]j>jy-l 1343 

11419 11513 11818 12186-12190 12307-12314 12327 12363 
12425-12427 12438 12617 12773-12774 12810-12811 12834 
13082 13103 13298-13299 13344-13349 13592 13630 13670 
13717 13760-13764 13782 13888 


thalamus 


Clontech 


THA002 


579 598 616 1065 1148 1220-1221 1223-1226 1407-1432 1597 
2266 2317-2319 2340 2342 2376-2378 2380 2431 2444 2555 
3093 3230 3286 3537 3722 3726-3732 3737 3898-3902 3904- 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 








3918 3920-3922 4060-4062 4173 4201 4282 43604391 5270 
5285-5288 5425-5438 5440-5449 5701 5902 5938 6137 6437 
6458 6461 6541-6544 6546-6550 6630 6755 7031 7095 7119 
7166 7484 7579 7815 7849 7905 7977 7991-8000 8002-8005 
8126-8128 8134 8363 8558 8664 8786 8890 8930 8963 9082- 
9085 9128 9197-9199 9201-9216 9251 9264 9308 9503 9515 
9556 9646 9703 9928 10014-10017 10027 10094-10100 10102 
10^67 10496 10606-10608 10643-10650 10696 10891 11145 
11223-11225 11344-11355 11406-11407 11420 11513 11604 
11695 11791 11950 12022 12107 12191-12199 12315-12331 
12363 12375 12405 12428-12430 12688 12706 12735 12748 
12754 12812-12813 12815 12835 12914 12959 13020 13023- 
13025 13060 13073 13300 13351-13358 13402-13403 13489 
13496 13506 13512 13719-13720 13765-13768 13770 13872 
13883 


thymus 


Clontech 


THM001 


51 142 150 332 346 360 438 546 731 760 895 1004 1104 1227- 
1234 1264 1391 1516 1577-1585 1812 1860 187721292174 
2215 2263 2321-2322 2408 2414 2425 2449 2490 2555 2569 
2575 261 1 2847 2880 3435 3530 3722 3727 3733 3735-3742 
3839 3870 4006 4041 4043-4046 4060 4109 4375 4396 4399 
4434 4667 467 1 4759 4791 4885 4976 4987 5 108 5289-5290 
5466 5481 5526 5553-5554 5796 5956 5979 6020 6186 6253 
6336 6371 6411 6438 6441-6447 6617-6621 6765 6788 6797 
6870 6886 6908 6972 6994 7059-7060 7126 7141 7166 7168 
7310 7383 7450 7494 7632 7716 7779 7887 7889-7891 8087 
81 1 1-81 14 81 17-8120 8139 8226 8343 8368 8370 8452 8456 
8633 8830 8898 8921 8963 9010 9076 9086-9087 9128 9134 
9264 9304 9307-9313 9401 9466 9497 9544 9563 9582 9612 
9626 9646 9758 9781 9927 9937 9969 10018 10065 10163- 
10166 10175 10284 10329 10414 10557 10609-10610 10617 
10679 10687-10691 10742 10760 10772 10776 10778 10881 
10891 11187 11194 11226-11229 11274 3 1406 11408-11410 
1 1412 I 14oy 1 1M j llo/o 1 loyj 1 ls^U 1 IbOJ 1 ly4z izlkjo 
12006 12016 12022 12092 12160 12186 12200-12204 12327 
12363 12414-12417 12427 12462 12470 12490 12564 12693 
12706 12735 12740 12835 12840 12898 12905 12910 12913- 
12914 12916 12977 13020 13036 13051 13062 13111 13126 
13141 13295 13301-13305 13326 13351 13391 13396-13397 
13456 13484 13498 13505 13507 13512 13516 13546 13713 
13803 13805 13808 13866 13869 13885 13888 


thymus 


Clontech 


THMc02 


16 27 51 67 142 390 598 1233 1493 1508 1586-1588 1598-1603 
1724 1841 1918 1927 1959 1976 1979 1989 2057 2107 2161 
2223 2290 2407 2426-2428 2432-2433 2444 2487 2569 2617 
3537 3664 3711 3720 3722 3737 3780 3817 4021 4025 4047- 
4049 4060 4063-4068 4246 4277 4377 4405 4688 4694 4759 
4791 4837 4949 4957 4992 4998 5037 5052 5108 5505 5553 
5555-5564 5571-5572 5683 5766 5772 5788 5796 5894 591 1 
6048 6186 6225 6288 6306 6333 6621 6728 6730 6765 6767 
6780 6788 6815 6867 6873 6979 7049 7061-7062 7166 7169 
7676 7687 7809 8084 8121 8129-8130 8179 8262 8354 8363 
8365 8375 8482 8597 8654 8740 8786 8791 8963 9076 9157 
9257 9264 9314 9323-9324 9401 9454 9466 9519 9529 9536 
9637 9700 9703 9927-9929 9940 10004 10007 10070 10167 
iaiaq 1^17*) imon lniRo inoo/t imo£ mm iniC2 
10439 10452 10542 10605 10691 10693-10694 10697 10744 
10775 10778 10795 11046 11098 1 1274 1 1413-1 1415 1 1462 
11519 11576 11592 11606 11618 11621 1 1627-1 1628 1 1693 
11807 11814 12022 12034 12044 12080-12081 12086 12160 
12236 12256 12327 12335 12363 12368 12387 12418-12419 
12424 12433 12556 12560 12564 12570 12577 12594 12599 
12612 12663 12735 12754 12836-12839 12844 12905 12913 
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RNA 
Source 


Name 


JsL,l^ ID iV%Jz>Z 








12955 12977 13020 13035 13037 13051 13062 13072 13087 
13193 13280 13295 13398 13404 13456 13488 13499-13500 
13503 13507 13516 13533 13535 13546 1361 1 13613 13630 
13646 13649 13659 13670 13678-13679 13713 13769 13775 
13803 13806-13808 13810-1381 3 13869 13872 13882-13883 
13885 13888-13889 


thyroid gland 


Clontech 


THR001 


49 115 142 360 641 698 800 1004 1134 1193 1233 1235 1237- 
1241 1329 1433-1471 1473-1476 1478-1479 1481 1589-1591 
1593-1594 1604 1606-1612 1614-1615 1639 1671 1906 1927 
1959 1976 1994 2090 2099 2134 2150 2161 2174 2186 2215 
2343 2367 2381-2387 2429 2434-2437 2449 2490 2510 2529 
2555 2569 2662 2827 29012961 2997 3001 3109 3265 3286 
3425 3490 3503 3560 3643 3698 3722 3737 3743-3747 3797 
3839 3885 3923-3941 3943-3954 3956-3961 4009 4050-4053 
4060 4069-4082 4 1 09 4 1 73 4 1 94 4253 4277 4282 4348 4354- 
4355 4376 4391 4396 4412 4434 46414689 4694 4755 4759 
4783 4791 4834 4957 4972 4985 4987 501 8 5037 5052 5075- 
5076 5108 5147 5163 5171 5270 5292-5294 5317 5380 5450- 
5461 5464-5481 5503 5523 5565-5568 5573-5574 5693 571 1 
5724 5757 5772 5788 5894-5895 5917 5923 5938 5959 5966 
6005 6013 6027 6047-6048 6061 6064 6125 6135 6139 6189 
6215 6240 6336 6371 6374 6448-6449 6458 6508 6538 6551- 
6554 6556-6561 6563-6572 6595 6598 6607 6622-6623 663 1- 
6636 6793 6803 6815 6873 6899 6955 6958 7032-7038 7040 
7043 7063 7066-7067 7079 71 16 7155 7233 7275 7455 7669 
7743 7792 7839 7845 7857 7892-7893 7910 7930 7989 8006- 
8008 8010-8033 8053 8067 8087 8092-8093 81 10 8116 8120 
8122-8124 8131-8143 8196 8226 8235 8262 8345 8365 8368 
8370 8372 8375 8452 8520 8535 8543 8597 8646 871 1 8760 
8816 8839 8863 8898 8930 8948 8953 8963 8966 9010 9055 
9088-909 1 9 1 42 9 1 5 7 92 1 7-9220 9222-9227 9229-9242 9244 
9257 9264 9304 9315-9316 9325-9330 9368 9401 9455 9466 
9484 9525 9542 9544 9563 9646 9695 9703 9780-9781 9855 
9928 9942 9947 10000 10019 10027 10103-30117 10119-10120 
10173-10175 10183-10185 10346 10376 10440 10470 10496 
10611 10651-10657 10666 10679 10695 10698-10704 10760 
10772 10778 10782 10788 10891 11033 11051-11052 11061 
11063 1111211231 11256 11356-11357 11359-11360 11362- 
11367 11369-11372 1 1376 11385 11406 11416 11421-11425 
11462 11513 11588 11605-11607 11620 11833 11869 11932 
11980 11985 12006 12017 12033 12143 12160 12193 12195 
12205 12207-12208 12292 12332-12336 12338-12345 12347 
12349-12356 12358-12360 12363 12368 12405 12420 12422 
12424 12428 12434-12439 12465 12523 12601 12605 12609- 
12611 12617 12636 12645 12670 12691 12707 12740 12749 
12754 12768 12775 12777 12817-12824 12830 12840 12845- 
12848 12863 12911 12940 12955 12977 13020 13026-13028 
13038 13047 13051 13062 13072-13073 13090 13104 13117 
13126 13195 13202 13254 13295-13296 13360-13371 13389 
13391 13399 13412 13479 13488-13489 13492 13494 13498 
13500 13503-13504 13506-13507 13511-13512 13516 13533 
13551 13554 13590 13613 13627 13644 13646 13656 13659- 
13660 13670 13713 13721 13743 13771-13774 13782 13803 

I^JIHO 1 ^5? 1 1 "351 1 A. I^R/^fi I^R^A nwn 1 1 oto i-»o©o 
ijour 1JOU-1J014 JJODU IJOOO-ljoO / IJOOy 135/Z JJ>o6z 

13884-13885 13888 13891 13893 


trachea 


Clontech 


TRC001 


83 438 483 858 1006 1034 1242-1245 1359 1483-1494 1616- 
1619 1621 1671 2266 2324 2388-2391 2409 2438-24392444 
2575 3001 3136 3272 3425 3460 3535 3548 3748-3756 3810 
3962-3967 4083-4086 4530 4755 4758 4949 4957 4987 5075 
5213 5295-5299 5481-5482 5484 5486-5488 5490-5491 5535 
5576-5580 5656 5941 6091 6450-6453 6574-6578 6637 6797 
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Name 


SEQ ID NOS: 








6955 7041-7042 7049 7068 7204 7347 7733 7S64 7894 8034- 
8043 8045 8120 8139 8144 8563 8635 8646 9012 9092-9099 
9245-925] 9317 9331 9333 9484 9503 9517 9734 10020-10022 
10027 10097 10121-10123 10125-10127 10284 10344 10478 
10612 10658-10659 10705 10812 10948 11145 11233-11239 
11373-11376 11428-11429 11942 12022 12209-12211 12292 
12361-12363 12365-12366 12438 12440-12441 12611 12617 
12734-12735 12825-12826 12900 12904-12905 12916 12923 
12965 12972 12976 12989 13029-13030 13306-13308 13372- 
13373 13492 13670 13713 13722 13775-13781 13803 138S4 


uterus 


Clontech 


UTR001 


1036 1134 1246-1250 1324 1493 1495-151 1 1622 1671 1928 
2145 2266 2310 2326 2343 2367 2392-2397 2555 3281 3479 
3720 3757-3760 3762 3969-3986 4088-4090 4192 4201 4246 
4277 4390-4391 4434 4515 4998 5002 5184 5300-5301 5481 
5492-5504 5656 5695 5712 5794 5936 6116 6475 6579-6587 
6595 6S70 7002 7043-7044 7311 7484 7895-7899 8046-8055 
8057-8059 8061 81 10 8146-8147 8152 8226 8262 8622 8677 
9008 9092 9137 9252-9257 9334 9370 9414 9466 9539 9703 
10023-10024 10128-10134 10346 10595 10660-10668 10706- 
10708 11145 11239 11289 11377 1 1 379-1 1385 1 1695 11830 
12367-12369 12371-12374 12442-12443 12570 12670 12693 
12827-12829 12914 13002 13047 13073 13083-13084 13131 
13260 13277 13309-13310 13374-13376 13394 13489 13512 
13713 13782 13784-13787 13866 13869 
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TABLE 2 



SEQ ID 

NO: 


Accession No. 


"Species 


Description 


Smith- 
Waterman 
Score 


% 

Idcntit 

y 




MiOJUl 


Homo sapiens 


alpha-2-macroglobulin 


118 


69 


2 


AF 11 8090 


Homo sapiens 


PRO2044 


247 


59 


3 


X01683 


Homo sapiens 


alpha 1 -antitrypsin 


544 


78 


4 


L27428 


Homo sapiens 


reverse transcriptase 


79 


27 


5 


M22332 


Homo sapiens 


unknown protein 


89 


40 


6 


AF015539 


Mytilus edulis 


precollagen P 


113 


33 


7 


X03325 


Homo sapiens 


apolipoprotein B fragment 


540 


83 


8 


ABO 19280 


Mus musculus 


sprouty-4 


91 


35 


9 


D88152 


Homo sapiens 


acetyl-coenzyme A transporter 


625 


87 


10 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


124 


58 


11 


AL049569 


Homo sapiens 


dJ37CI0.5 (KIAA0445) 


182 


82 


12 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-ricb glycoprotein DZ- 
HRGP 


85 


37 


13 


L27428 


Homo sapiens 


reverse transcriptase 


135 


61 


14 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


266 


72 


15 


U93569 


Homo sapiens 


putative pi 50 


135 


37 


16 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


132 


67 


17 


X53581 


Rattus 
norvegicus 


ORF4 


124 


34 


18 


AF1 83961 


Homo sapiens 


carbon catabolite repression 4 protein 
homolog 


431 


75 


19 


AJ002190 


Homo sapiens 


dihydroxyacetone phosphate 
acyltransferase 


551 


88 


20 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


127 


45 


21 


AK001269 


Homo sapiens 


unnamed protein product 


1643 


99 


22 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


275 


58 


23 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


123 


75 


24 


AF 156550 


Mus musculus 


putative E1-E2 ATPase 


168 


58 


25 


AF 11 9856 


Homo sapiens 


PRO 1851 


585 


83 


26 


U49974 


Homo sapiens 


mariner transposase 


187 


46 


27 


G00901 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4982. 


86 


30 


28 


AF295773 


Homo sapiens 


ral guanine nucleotide dissociation 
stimulator 


126 


74 


29 


AF1 13685 


Homo sapiens 


PRO0974 


92 


73 


30 


U83303 


Homo sapiens 


line- 1 reverse transcriptase 


102 


50 


31 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


279 


75 


32 


AF003535 


Homo sapiens 


ORF2-like protein 


114 


47 


33 


Ml 5386 


Homo sapiens 


gamma-globin 


370 


84 


34 


M19419 


Mus musculus 


proline-rich salivary protein 


no 


35 


35 


AF2 11943 


Homo sapiens 


WW domain-containing protein 
WWOX 


586 


83 


36 


X13885 


Nicotiana 
tabacum 


extensin(AA 1-620) 


103 


35 


37 


U93563 


Homo sapiens 


putative pi 50 


127 


58 


38 


U93564 


Homo sapiens 


putative pi 50 


103 


77 


39 


AF069732 


Homo sapiens 


ADA2-like protein 


524 


88 


40 


X61046 


Hydra sp. 


mini-collagen 


101 


34 . 


41 


AK000322 


Homo sapiens 


unnamed protein product 


566 


80 


42 


G03646 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


57 



130 



ISDOCID: <WO 0164835A2„I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Spaces 


Description 

: 


Waterman 
Score 


% 

Identit 

y 


— 






-r^ — — : 






J 


AP071 OR1 


Mycobacterium 
tuberculosis 


proline-nch mucin homolog 


— 

104 


41 


44 


AF0921 35 




PTD014 







45 


Y73353 


Homo salens 

omo sapiens 


HTRM clone 1870914 protein 


~§9l 


-~ 

56 


46 


AF1 18082 


Homosa iens 


PRO 1902 


~7T9 


-— 


47 


X78926 


H mo sa iens 

Homo salens 




~447 


— 


48 


X54326 




"lutarniriyl tRNAsynthetase 


542 


~ 


49 


D50645 


Homo salens 


SDF2 


321 


-— 


50 


M92439 


Homo sapiens 






"so — " 


51 


U28963 


Homo sapiens 


Gps2 


593 


82 




U41806 










53 


AF181490 


Homo sapiens 


prenylcysteine lyase 


461 


78 


"4^ — 


y 93 570 — 


Homo sapiens 


putative pi 50 


147 


36 


55 . 


W73499 


Homo sapiens 


Von Wjllebrand factor. 


529 


76 




AF1 19851 


Homo sapiens 


PRO 1722 


126 


57 


57 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


115 


61 


58 


AL021939 


Homo sapiens 


dJ352A20.2 (aldehyde dehydrogenase 
family protein) 


422 


90 




T 


Homo sapiens 


integrin alpha 9 protein 


117 


71 




Y32157 


Homo sapiens 


Human SH3D1A protein. 




91 


61 


X61296 


Rattus 
norvegicus 


open reading frame 2 


117 


31 


62 


AK002064 


Homo sapiens 


unnamed protein product 


330 


80 




A PHI 


Canis familiaris 




80 


56 


64 


U93570 


Homo sapiens 


putative pi 50 


113 


37 




U15647 


Mus musculus 


reverse transcriptase 


152 


55 


66 

— 


G03411 
_ 


Homo sapiens 
i — . .. 


Human secreted protein, SEQ ID NO: 
74 92. 


126 


54 


67 


Y48359 


Homo sapiens 


Human prostate cancer-associated 
protein 56. 


590 


99 


68 


W74879 


Homo sapiens 


Human secreted protein encoded by 
gene i ji cione ri 1 LDroz. 


368 


98 


69 


AF 104921 


Homo sapiens 


succinyl-CoA synthetase alpha subunit 


604 


93 




AF 175265 


Homo sapiens 


vacuolar sorting protein 35 


632 


88 




U93571 


Homo sapiens 


p40 


106 


33 




_X 15324 


Homo sapiens 


angiotensinogen 


330 


84 






Hordeum 
vulgare 


extensin 


1 1 1 


38 


74 

— 


Y30713 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


232 


61 




/\r 1 1 auyz 


— — : 

Homo sapiens 


rKUZUDl 


"45* 


79 


76 


M63175 


Homo sapiens 


autocrine motility factor receptor 


190 


85 




IViZO jO j 


Mus musculus 


LINE/Ig H-chain fusion protein 


153 


3S 


~7S 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 




58 




X^789^6 


Homo sapiens 


zinc finger protein 


199 


37 


80 


M77381 


Homo sapiens 


acrosin 


98 


54 


81 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


129 


44 


82 


U93569 


Homo sapiens 


putative pi 50 


94 


38 


83 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


132 


100 


84 


AF255446 


Crypthecodinium 
cohnii 


Dip 1 -associated protein C 


129 


34 


85 


R59837 


Homo sapiens 


Sequence of human microtubule- 


82 


48 



131 



.1SDOCID: <WO 01 6483SA2_I_> 



WO 01/64835 



PCT/USO 1/0492 7 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


[den tit 

y 








associated protein tau. 






86 


Y00749 


Homo sapiens 


Human secreted protein encoded by 

opnp inn i*Iimio urKTtrn to a 
gene iuu cione riiNriuyo. 


157 


76 


87 


AF116712 


Homo sapiens 


PR02738 


91 


58 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 




44 


89 


Y82742 


Homo sapiens 


DNA replication and repair associated 
protein (DRASP). 


315 


79 


90 


ivi i uyoi 


— : 

nomo sapiens 


alpha-2-HS-glycoprotein 


138 


74 


91 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


250 


56 


92 


AF220656 


Homo sapiens 


apoptosis-associated nuclear protein 
PHLDA 1 


62 


69 


93 


U65928 


Homo sapiens 


Jun activation domain binding protein 


188 


75 


94 


U93568 


Homo sapiens 


putative pi 50 




48 


95 


S801 19 


Rattus sp 


reverse transcriptase homolog 


130 


53 


96 




Homo sapiens 


putative pi 50 




50 


97 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
iragment, sty iu inu.jo/. 


105 


46 


98 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


62 


99 


X74045 


Equus caballus 


preproalbumin 


289 


65 


100 


AF1 18090 


Homo sapiens 


PRO2044 


269 


90 


101 

— — 


U49973 
_ 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


198 


51 




G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214 - . 


88 


53 


103 


Yg6573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


225 


82 






Homo sapiens 


ORF2-like protein 


1 14 


47 


105 


AF130079 


Homo sapiens 




133 


56 


106 


AF 130089 


Homo sapiens 


PRO2550 


107 


71 


T§£ 


M63473 


Homo sapiens 


alpha-5 type IV collagen 


131 


45 




AF1 16661 


Homo sapiens 


rKU 1 HJ & 


1 12 


54 


— 


X92485 


r lasmodium 


pval 


101 


41 


110 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


113 


80 


111 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


199 


69 


1 12 


AF 19453 7 


omo sapiens 


NAG1:> : 




-1^1 


44 


1 13 


L27428 


riomo sapiens 


reverse transcriptase 




34 


114 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


137 


56 




U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


191 


67 






Homo sapiens 


PRO0956 


163 


47 


117 


L27428 


Homo sapiens 


reverse transcriptase 


117 


36 






Homo sapiens 


putative pi 50 


104 


66 


119 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


96 


66 


120 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


81 


57 


121 


G03438 


Homo sapiens 


Human secreted protein-, SEQ ID NO: 


78 


51 


122 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


95 


80 


123 


X61296 


Rattus 
norvegicus 


open reading frame 2 


94 


36 


124 


GO3801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


,3, 


60 



132 



ISDOCID: <WO 016483SA2J_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 




Description 


Smith- 
Score 


% 
y 


125 


AF 11 9900 


Homo sapiens 


PR02822 


168 


68 


126 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


209 


58 


127 


L2742S 


Homo sapiens 


reverse transcriptase 


102 


35 


128 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6688. 


99 


63 


129 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


113 


73 


130 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


170 


36 


131 


U93572 


Homo sapiens 


putative pi 50 


168 


38 


132 


LJ37263 


Homo sapiens 


KRAB zinc finger protein; Method: 
conceptual translation supplied by 
author 


155 


57 


133 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


137 


92 


134 


B08918 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 28 SEQ ID NO:75. 


58 


61 


135 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


128 


66 


136 


G03 172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


102 


38 


137 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


230 


55 


138 


AK000496 


Homo sapiens 


unnamed protein product 


127 


46 


139 


X53581 


Rattus 
norvegicus 


ORF4 


136 


38 


140 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


158 


48 


14 1 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


162 


65 


142 


AF090930 


Homo sapiens 


PRO0478 


127 


65 


143 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


141 


58 


144 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ IDNO:387. 


98 


65 


145 


AJ238588 


Sciurus vulgaris 


cytochrome c oxidase subunit III 


417 


72 


146 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


139 


76 


147 


Y36156 


Homo sapiens 


Human secreted protein #28. 


91 


40 


148 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


121 


70 


149 


Y76184 


Homo sapiens 


Human secreted protein encoded by 


214 


85 


150 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


95 


57 


151 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


126 


66 


152 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


105 


51 


153 


AF1 19900 


Homo sapiens 


PR02822 


116 


62 


154 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


149 


66 


155 


AB009993 


Mus musculus 


collagen al(V) 


105 


36 


156 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


155 


69 


157 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


348 


71 


158 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


146 


54 


159 


AF247705 


Oryctolagus 
cuniculus 


alpha 1 type X collagen 


102 


42 



133 



WO 01/64835 PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

ldentit 

y 


160 


R95913 


Homo sapiens 


Neural thread protein. 


99 


56 


161 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


1 12 


52 


162 


X71442 


Rattiis 
norvegicus 


ORF 1 ; putative 


96 


47 






Homo sapiens 


putative pi 50 


1 18 


38 


164 


U23515 


Caenorhabditis 
elegans 


weak similarity to adenylyl cyclase- 
associated protein (CAP) and to P. 
chabaudi adami major merozoite surfae 
antigen protein (PIR:A32555). Final 
exon overlaps gene predicted on other 
strand - 


93 


37 


"165 


GQO>?6 ^ 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
7344. 


143 


66 


~tH — 




Homo sapiens 


PR02852 


143 


90 






Homo sapiens 


reverse transcriptase 


200 


53 


168 


G04072 


Homo sapiens 

_ . 


Human secreted protein, SEQ ID NO: 

f. 153 ' r 3 : 


92 


51 


169 


R95 


Homo sapiens 


Neural thread protein. . 


1 16 


54 


170 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


324 


85 


171 




Homo sapiens 


line-1 reverse transcriptase 


1 1 1 


50 


172 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


69 


173 


AF130089 


~u : 

Homo sapiens 


PRO^SO 


126 


59 


174 


S801 19 


Rattus sp. 


reverse transcriptase homolog 


151 


46 


175 


G0245 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6532. 


1 13 


53 


176 


U15647 


Mus musculus 


reverse transcriptase 


— — 


— 

46 


177 


M24732 


Homo sapiens 


lamm-like protein 


TTI 




178 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


137 


38 


179 


AF0 1 0 1 44 




neuronal thread protein AD7c-NTP 




58 


180 


AF194537 


Homo sapiens 


NAG 13 


92 


90 


181 




Homo sapiens 


putative pi 50 


131 


53 


182 


U93574 


Homo sapiens 


putative pi 50 


86 


46 


183 


Y]4166 — 


Gallus gallus 


attachment region binding protein 


91 


40 


184 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


64 


185 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element . 


183 


73 


186 


U93572 


Horn : 

omo sapiens 


putative pi 50 


139 


64 


187 


M22332 


Homo sapiens 


unknown protein 


79 


41 


188 


Y87202 


Homo sapiens 


Human secreted protein sequence SEQ 
uj rNU.zii. 


75 


71 


189 


U70935 


Peromyscus 
maniculatus 


reverse transcriptase 


132 


37 


190 


S80119 


Rattus sp. 


reverse transcriptase homolog 


172 


43 




A P 1 OA <T3 7 


Homo sapiens 


NAG 1 3 


81 


75 


192 


Y45^82 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


1 12 


64 


193 


X92485 


Plasmodium 
vivax . 


pval 


96 


40 


194 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 




45 


195 


L27428 


Homo sapiens 


reverse transcriptase 


141 


37 


196 


U93570 


Homo sapiens 


putative p 1 50 


201 


41 


197 


X92485 


Plasmodium 
vivax 


pval 


120 


48 


198 


AF130089 


Homo sapiens | PRO2550 


137 


60 



134 



WO 01/64835 



PCT/USO 1/0492 7 







Species 


Description 


Smith- 


% 


NO: 




Waterman 


Idcnlit 








Score 


y 


199 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


377 


84 


200 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


119 


52 


201 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 


151 


68 








fragment, SEQ ID NO.490. 






202 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 


247 


78 








similar to pogo element 






203 


W48351 


Homo sapiens 


Human breast cancer related protein 


126 


56 






BCRB2. 






204 


Y91577 


Homo sapiens 


Human secreted protein sequence 


362 


80 








encoded by gene 2 SEQ ID NO:250. 






205 


R59842 


Homo sapiens 


ApoE4Ll protease. 


100 


82 


206 


AF161356 


Homo sapiens 


HSPC093 


78 


62 


207 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


48 








4581. 






208 


U83280 


Leishmania 


39 kDa antigen 


121 


53 






donovatii 








209 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


80 








7795. 






210 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 


146 


72 








similar to pogo element 






211 


M69197 


Homo sapiens 


haptoglobin-related protein 


344 


92 


212 


AF034611 


Homo sapiens 


intrinsic factor-B12 receptor precursor; 


123 


37 








cubilin 






21.3 


Y86248 


Homo sapiens 


Human secreted protein HCHPF6S, 


129 


65 








SEQ ID NO: 163. 






214 


V00662 


Homo sapiens 


cytochrome oxidase I 


485 


87 


215 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


80 








6595. 






216 


U35312 


Mus musculus 


nuclear receptor co-repressor 


115 


47 


217 


L26953 


Homo sapiens 


chromosomal protein 


143 


77 


218 


U12690 


Homo sapiens 


cytochrome oxidase subunit 11 


224 


70 


219 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


75 








7534. 






220 


AB018114 


Arabidopsis 


RING finger protein-like 


111 


38 






thaliana 








221 


D38112 


Homo sapiens 


ATPase subunit 6 


475 


84 


222 


V00662 


Homo sapiens 


cytochrome B 


466 


77 


223 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


175 


85 








SEQ ID NO: 163. 






224 


AF010144 . 


Homo sapiens 


neuronal thread protein AD7c-NTP 


125 


38 


225 


Y02999 


Homo sapiens 


Fragment of human secreted protein 


86 


65 








encoded by gene 121. 






226 


X77816 


Rattus 


PR-Vbetal 


130 


54 






norvegicus 








227 


U09500 


Homo sapiens 


cytochrome b 


274 


62 


228 


AF081104 


Mus musculus 


ORF2 


111 


36 






domesticus 








229 


AF090942 


Homo sapiens 


PRO0657 


88 


57 


230 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


76 


57 








8172. 






231 


G03472 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


66 








7553. 






232 


B12310 


Homo sapiens 


Human secreted protein encoded by 


116 


54 








gene 10 clone HDPGP94. 






233 


AFO 10400 


Homo sapiens 


transaldolase-related protein 


253 


77 


234 


AF197913 


Helicoverpa 


basic DNA-binding protein BDBP 


137 


50 






armigera nuclear 












polyhedrosis 












virus 









135 



iNSDOCID: <WO 0164835AS_I_> 



WO 01/64835 



PCT/US01/04927 



NO: 


Accession No. 


Spec.cs 


Description 


Smith- 
Waterman 


% 

Identit 

y 


235 


AF090931 


Homo sapiens 


PRO0483 


• 123 


75 


236 


U49973 


Homo sapiens 


ORF1 : MER37; putative transposase 
similar to pogo element 


185 


86 


237 


MI 9503 


Homo sapiens 


ORF1; putative 


99 


40 


238 


G0Q368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


93 


58 


239 


AFO 14883 


Homo sapiens 


NADH dehydrogenase subunit 2 


305 


65 


240 


G03102 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7183. 


60 


44 


241 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:387. 


79 


50 


242 


U 15647 


Mus musculus 


reverse transcriptase 


117 


47 


243 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


104 


56 


244 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


94 




245 


AK023542 


Homo sapiens 


unnamed protein product 


82 


38 


246 


X55702 


Drosophila 
melanogaster 


polycomb protein 


84 


31 


247 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


129 


65 


248 


G02532 


Homo sapiens 


Human secreted protein SEQ ID NO* 
6613. 


154 


70 


249 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
ID NO: 1051. 


— — — 


— 


250 


Y17832 


Human 
endogenous 
retrovirus K 


env protein 


297 


71 


251 


U93S68 


Homo sapiens 


p40 


103 


46 


252 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


201 




253 


AF090895 


Homo sapiens 


PRO0117 


139 


60 


254 


Y02886 


Homo sapiens 


Fragment of* human secreted protein 
encoded by gene 90. 


185 


72 


255 


L27428 


Homo sapiens 


reverse transcriptase 


156 


40 


256 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


157 


59 


257 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


118 


70 


258 


AF194537 


Homo sapiens 


NAG 13 


141 


38 


259 


B01372 


Homo sapiens 


Neuron-associated protein. 


115 


71 


260 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


1 13 




261 


M22332 


Homo sapiens 


unknown protein 


78 




262 


AFO 10 144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


70 


78 


263 


X61296 


Rattus 
norvegicus 


open reading frame 2 


108 


36 


264 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


178 




265 


G03303 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7384. 


81 





266 


Y01154 


Homo sapiens 


Protein sequence Seq Id 54 from 
WO9901020. 


116 


84 


267 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


118 


52 


268 


AF1 19855 


Homo sapiens 


PRO 1 847 


— 


70 


269 


AF109907 


Homo sapiens 


S164 


85 


61 


270 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


137 


63 


271 


X92485 


Plasmodium 
vivax 


pval 


107 


72 


272 


AF 194537 


Homo sapiens 


NAG 13 


167 


51 



136 



JSDOCID: <WO 0164835A2_L> 



WO 01/64835 PCT/US01/04927 





Accession No. 


Species 


Description 






NO: 








Waterman 


Identit 












y 


273 


U93564 


Homo sapiens 


p40 


104 


40 


274 


L27428 


Homo sapiens 


reverse transcriptase 


142 


56 


275 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


60 








4435. 






276 


X61296 


Rattus 


open reading frame 2 


96 


48 






norvegicus 








277 


AF090931 


Homo sapiens 


PRO0483 


140 


65 


278 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


231 


66 








similar to pogo element 






279 


AF1300S9 


Homo sapiens 


PRO2550 


164 


60 


280 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


98 


58 








protein 






281 


L22548 


Homo sapiens 


collagen type XVII I alpha 1 


92 


38 


282 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


130 


68 








RHAla(AF078683) 






283 


AF116715 


Homo sapiens 


PR02829 


160 


75 


284 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


111 


53 


285 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


53 








7871. 






286 


W48351 


Homo sapiens 


Human breast cancer related protein 


120 


53 








BCRB2. 






287 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


147 


66 








similar to pogo element 






288 


U93572 


Homo sapiens 


putative pi 50 


125 


32 


289 


AL050399 


Arabidopsis 


putative proline-rich protein 


142 


44 






thaliana 








290 


X92485 


Plasmodium 


pval 


147 


43 






vivax 








291 


AB047600 


Macaca 


hypothetical protein 


172 


66 






fascicularis 








292 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


108 


55 








7870. 






293 


D38115 


Pongo pygmaeus 


NADH dehydrogenase subunit 5 


342 


71 


294 


AF090942 


Homo sapiens 


PRO0657 


99 


66 


295 


M61185 


Bos taurus 


glutamic acid-rich protein 


114 


44 


296 


M13100 


Rattus 


unknown protein 


107 


43 






norvegicus 








297 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


114 


50 


298 


X92485 


Plasmodium 


pval 


93 


78 






vivax 








299 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


75 








7253. 






300 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


115 


42 


301 


L24521 


Homo sapiens 


transformation-related protein 


117 


60 


302 


X83413 


Human 


US8 


219 


49 






herpesvirus 6 








303 


U93567 


Homo sapiens 


putative pi 50 


130 


48 


304 


B08918 


Homo sapiens 


Human secreted protein sequence 


72 


61 








encoded by gene 28 SEQ ID NO:75. 






305 


AJ242540 


Volvox carteri f. 


hydroxy pro line-rich glycoprotein DZ- 


153 


68 






nagariensis 


HRGP 






306 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


532 


79 


307 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


53 








7795. 






308 


L27428 


Homo sapiens 


reverse transcriptase 


151 


72 


309 


M69297 


Homo sapiens 


ORF3 


145 


43 


310 


X92485 


Plasmodium 


pval 


81 


60 






vivax 








311 


L27428 


Homo sapiens 


reverse transcriptase 


103 


41 



137 



[iMSDOCID: <WO 



C1164B3SA2 I > 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 






Description 


Smith- 
Waterman 


% 

ltlentit 


312 


AF 130079 


Homo sapiens 


PR02852 


135 


49 


313 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


110 


58 


314 


AF090928 


Homo sapiens 


PRO0470 


88 


48 


315 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


175 


64 


316 


U93568 


Homo sapiens 


putative p 1 50 


148 


46 


317 


AF1 19855 


Homo sapiens 


PRO 1847 


108 


84 


318 


P60839 


Homo sapiens 


Sequence of human serum albumin 
(HSA) on plasm id pXL53. 


175 


50 


339 


W46424 


Homo sapiens 


Human macrophage stimulating protein 
(MSP). 


257 


69 


320 


AL049547 


Homo sapiens 


dJ34F7.2 (CREB-RP (G13)) 


247 


64 


321 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
6613. 


no 


66 


322 


AF016099 


Mus musculus 


endonuclease/reverse transcriptase 


102 


4S 


323 


AF090930 


Homo sapiens 


PRO0478 


141 


72 


324 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


126 


44 


325 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


146 


59 


326 


R59842 


Homo sapiens 


ApoE4Ll protease. 


95 


60 


327 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


84 


61 


328 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


123 


66 


329 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


125 


65 


330 


AF1 19855 


Homo sapiens 


PRO 1847 


121 


80 


331 


L78671 


Homo sapiens 


CoxII/D-ioop DNA fusion protein 


364 


71 


332 


AK000496 


Homo sapiens 


unnamed protein product 


145 


41 


333 


D00570 


Mus musculus 


open reading frame (1 96 AA) 


153 


53 


334 


AF1 19855 


Homo sapiens 


PRO 1847 


116 


74 


335 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


50 


336 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


129 


56 


337 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


107 


59 


338 


Y02158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


115 


72 


339 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 1 02, variant a dna, 
complete virion genome. 


117 


50 


340 


AK022217 


Homo sapiens 


unnamed protein product 


127 


70 


341 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


115 


75 


342 


AF 11 8086 


Homo sapiens 


PRO 1992 


141 


73 


343 


X92485 


Plasmodium 
vivax 


pval 


96 


59 


344 


AF 106677 


Drosophila 
melanogaster 


dissatisfaction 


90 


48 


345 


U I 2693 


Homo sapiens 


cytochrome oxidase subunit II 


239 


91 


346 


L27428 


Homo sapiens 


reverse transcriptase 


95 


56 


347 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


133 


69 


348 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


81 


51 


349 


X51616 


Volvox carteri 


SULFATED SURFACE 
GLYCOPROTEIN 185 


110 


41 



138 



JSOOCID: <WO 0164635A2J_= 



WO 01/64835 



PCT/US01/04927 



NO: 


Accession No. 




Description 


Waterman 


Iclentit 


350 


G037S9 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


158 


55 


351 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


183 


60 


352 


AL390U4 


Leishmania 
major 


extremely cysteine/valine rich protein 


151 


51 


353 


R95913 


Homo sapiens 


Neural thread protein. 


95 


56 


354 


AF061340 


Artibeus 
jamaicensis 


cytochrome c oxidase subunit III 


346 


70 


355 


AF090895 


Homo sapiens 


PRO0117 


126 


60 


356 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


121 


48 


357 


AF1 18086 


Homo sapiens 


PR01992 


159 


73 


358 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


85 


89 


359 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


98 


50 


360 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


110 


57 


361 


M13100 


Rattus 
norvegicus 


unknown protein 


122 


34 


362 


Y36203 


Homo sapiens 


Human secreted protein #75. 


108 


63 


363 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


83 


75 


364 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


212 


65 


365 


AF130051 


Homo sapiens 


PRO0898 


136 


71 


366 


AF068294 


Homo sapiens 


HDCMB45P 


188 


65 


367 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


225 


70 


368 


S80119 


Rattus sp. 


reverse transcriptase homolog 


188 


45 


369 


U70935 


Peromyscus 
maniculatus 


reverse transcriptase 


75 


48 


370 


AF 118082 


Homo sapiens 


PRO1902 


98 


79 


371 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


106 


40 


372 


AFO 14903 


Pan troglodytes 


NADH dehydrogenase subunit 2 


169 


41 


373 


AF010144 


Homo sapiens 


neuronal thread protein AD7C-NTP 


93 


48 


374 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


94 


66 


375 


G03107 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7188. 


90 


80 


376 


U93568 


Homo sapiens 


putative pi 50 


140 


56 


377 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


140 


50 


378 


AF090942 


Homo sapiens 


PRO0657 


154 


66 


379 


U93568 


Homo sapiens 


putative pi 50 


149 


36 


380 


U93570 


Homo sapiens 


p40 


184 


57 


381 


L27428 


Homo sapiens 


reverse transcriptase 


128 


60 


382 


AF194537 


Homo sapiens 


NAG 13 


114 


35 


383 


AF1 16712 


Homo sapiens 


PR02738 


109 


56 


384 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


405 


81 


385 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


116 


63 


386 


M22334 


Homo sapiens 


unknown protein 


124 


39 


387 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


97 


52 


388 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


327 


94 


389 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


157 


70 


390 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


129 


62 


391 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


259 


66 



139 



WSDCCID: <WO 0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ IU 


Accession No. 


Species 


Description 


Smith- 
VV a term an 


% 

Identic 

y 


392 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


137 




393 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


129 


54 


394 


AK023582 


Homo sapiens 


unnamed protein product 


148 


46 


395 


M22332 


Homo sapiens 


unknown protein 


128 


41 


396 


AF 118086 


Homo sapiens 


PRO! 992 


160 


71 


397 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


101 


52 


398 


D38 112 


Homo sapiens 


cytochrome c oxidase subunit 3 


199 


66 


399 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


234 


78 


400 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rha la. 


107 


40 


401 


AF000996 


Homo sapiens 


ubiquitous TPR motif, Y isoform 


116 


61 


402 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


145 




403 


AF1 18082 


Homo sapiens 


PRO1902 


97 


55 


404 


AF202635 


Homo sapiens 


PP1200 






405 


V00662 


Homo sapiens 


cytochrome oxidase 1 


352 


68 


406 


AF229067 


Homo sapiens 


PADI-H protein 






407 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


197 


38 


408 


L26251 


Trypanosoma 
brucei 


CR5 


95 


46 


409 


U49973 


Homo sapiens 


ORF 1 j MER37; putative transposase 
similar to pogo element 


~^7o 


— 


410 


X92485 


Plasmodium 
vivax 


pval 


96 


68 


411 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


-pjg 


-— 


412 


Y19192 


Talpa europaea 


cytochrome oxidase subunit I 


431 


83 


413 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


299 




414 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


220 


~sT 


415 


AF205385 


Pan troglodytes 


NADH dehydrogenase subunit 5 


296 


89 


416 


X58438 


Mus musculus 


proline rich protein 


112 


50 


417 


U70932 


Peromyscus 
leucopus 


reverse transcriptase 


89 




418 


V00662 


Homo sapiens 


cytochrome oxidase III 


200 


~84 


419 


AFO 17789 


Homo sapiens 


putative transcription factor CA 150 


120 


— 


420 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


183 


69 


421 


AL3 59782 


Trypanosoma 
brucei 


possible (hhv-6) ul 3 02, variant a dna, 
complete virion genome. 


166 


44 


422 


AF130051 


Homo sapiens 


PRO0898 


158 


59 


423 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


86 




424 


D3S116 


Pan paniscus 


cytochrome c oxidase subunit 3 


342 





425 


U93570 


Homo sapiens 


jutative pi 50 


133 


"41 


426 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 1 63. 


165 


67 


427 


D13951 


Nicotiana 
tabacum 


extensin precursor 


140 


— 


428 


L2742S 


Homo sapiens 


reverse transcriptase 


304 


34 


429 


R95913 


Homo sapiens 


Neural thread protein. 


118 


49 


430 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


206 


97 


431 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


88 


55 


!~432 


YI2713 


Mus musculus 


Pro-PoI-dUTPase polyprotein 


98 


54 


433 


J05042 


Oryctolagus 


alpha-] (Vni) collagen precursor 


91 


48 



140 



ISDOCID: <WO 0164835A2_L> 



WO 01/64835 



PCT/US01/04927 



NO: 


Accession No. 


Species 


Description 


Smith- 


% 

Idenm 






cuniculus 








434 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7348. 


105 


56 


435 


U93572 


Homo sapiens 


putative pi 50 


118 


38 


436 


U93569 


Homo sapiens 


putative pi 50 


100 


30 


437 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


126 


81 


438 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


166 


71 


439 


U52077 


Homo sapiens 


mariner transposase 


187 


52 


440 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


80 


45 


441 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


104 


71 


442 


AE003727 


Drosophila 
melanogaster 


CGI 67 1 8 gene product 


301 


48 


443 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


221 


74 


444 


U35730 


Mus musculus 


jerky 


159 


26 


445 


X53581 


Rattus 
norvegicus 


ORF3 


192 


46 


446 


G004I6 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


142 


52 


447 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


156 


38 


448 


AF194537 


Homo sapiens 


NAG 13 


315 


70 


449 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


93 


66 


450 


X92099 


Brugia pahangi 


collagen 


126 


44 


451 


AF090930 


Homo sapiens 


PRO0478 


88 


60 


452 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


93 


40 


453 


AF081114 


Mus musculus 
domesticus 


ORF2 


108 


32 


454 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


173 


65 


455 


S801 19 


Rattus sp. 


reverse transcriptase homolog 


197 


54 


456 


G02535 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6616. 


89 


68 


457 


R95913 


Homo sapiens 


Neural thread protein. 


114 


48 


458 


U09116 


Homo sapiens 


ORF1, encodes a 40 kDa product 


160 


39 


459 


X92485 


Plasmodium 
vivax 


pval 


99 


52 


460 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


100 


52 


461 


S80119 


Rattus sp. 


reverse transcriptase homolog 


138 


48 


462 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


211 


67 


463 


X97707 


Pongo pygmaeus 
abelii 


stopcodon created by 
posttranscriptional polyadenylation 


229 


76 


464 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


119 


67 


465 


L27428 


Homo sapiens 


reverse transcriptase 


154 


40 


466 


AK000496 


Homo sapiens 


unnamed protein product 


140 


69 


467 


GO0637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


114 


61 


468 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4670. 


146 


69 


469 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


286 


79 


470 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


448 


86 


471 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


296 


79 



141 



kfSISDOCiD: <WO 01 6483SA2_I_> 



WO 01/64835 PCT/USO 1/04927 



SEQ1D 

NO: 


Accession No. 


Species 


Description 


Waterman 


Idcntit 

y 


472 



Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


102 


48 


473 


. AL080253 


Arabidopsis 
thaliana 


putative snRNP protein 


103 


42 


■474 


X99452 


Lycopersicon 
esculentum 


extensin-like protein Dif54 


108 


25 


75 

— - 


G0j1j3 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


68 


34 







Canis familiaris 


ORF2 


78 


66 




A PI 'iflDSQ 

/\r i JUU07 


Homo sapiens 


PRO2550 


113 


71 


"478 


G 03 652 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


390 


97 


479 


AF2 10651 


Homo sapiens 


NAG 1 8 ' " 


146 


80 


480 


AB029309 


Homo sapiens 


Npvv38-bindmg protein NpwBP 


103 


40 




A T* 1 GA^^"? 
r\r I y*tDj / 


Homo sapiens 


NAG 13 


118 


31 


482 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


115 


45 






Homo sapiens 


reverse transcriptase 


184 


47 


484 


U93570 


Homo sapiens 


putative pi 50 


101 


50 


485 


AF194537 


Homo sapiens 


NAG 13 


213 


52 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


162 


82 






Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


204 


86 


488 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


140 


53 


489 


U93574 


Homo sapiens 


putative pi 50 


86 


54 


490 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


83 


56 


491 


AJ27I872 


Nicotiana 
sylvestris 


extensin 


220 


47 


492 


Ul 1288 


Drosophila 
melanogaster 


diaphanous protein 


113 


33 


493 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


184 


70 


494 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


131 


68 


495 


AF1 19900 


Homo sapiens 


PR02822 


148 


65 




AB026542 


Homo sapiens 


WASP-family protein 


96 


38 


497 


D86853 


Catharanthus 
roseus 


extensin 


104 


34 




AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


109 


47 


499 


G00492 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4573. 


109 


67 


"501) 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


112 


62 


501 


AF1 19901 


Homo sapiens 


PR0283 1 


116 


82 




AF238235 


Entamoeba 
histolytica 


diaphanous protein 


120 


41 




M22332 


Homo sapiens 


unknown protein 


123 


49 


504 


AF1 19851 


Homo sapiens 


PRO 1722 


204 


52 


505 


X61296 


Rattus 


open reading frame 2 


107 


45 


506 


AF1 18082 


Homo sapiens 


PRO 1902 


145 


52 


507 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


103 


68 


508 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


175 


39 


509 


X92485 


Plasmodium 


pval 


117 


52 



142 



_0164835A2J_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 














-5TO 


AF090942 


Homo sapiens 




-— 




51 1 


U93569 






~7?0 


"34" 


512 


U93574 


Homo sapiens 


putative pi 50 


140 


49 




AJ242540 


Voivox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 




63 


_L!^ — 





Homo sapiens 


reverse transcriptase 


132 


37 


~frl — 


U9^574 


Homo sapiens 


putative pi 50 


101 


45 






Homo sapiens 


putative pi 50 


178 


35 


517 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


81 


27 


518 


AP053538 


A 1 vine! la 
pompejana 


fibrillar collagen chain FApl alpha 


112 


36 






Homo sapiens 


ORFII 


148 


35 


520 


AF130051 


Homo sapiens 


PRO0898 


98 


61 


521 


L02 1 06 


Drosophila 
nielanogaster 


ribonucleoprotein 


143 


40 


522 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


202 


70 


523 


U93570 


Homo sapiens 


putative pi 50 


159 


43 


524 



Y86248 



Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


230 


100 




7428 


Homo sapiens 


reverse transcriptase 


128 


38 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

5327. 


96 


65 






norvegicus 


ORF4 


130 


42 


528 


U93570 


Homo sapiens 


putative pi 50 


195 


35 


529 


AF130089 


Homo sapiens 


PRO2550 


132 


43 


530 


AK024455 


Homo sapiens 


FLJ00047 protein 


126 


54 


531 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


123 


61 


532 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


210 


44 


533 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


109 


42 


534 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


120 


62 


535 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


124 


37 


536 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


122 


50 


5j7 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


43 


538 


A23786 


Beta vulgaris 


chitinase 1 


91 


33 




AFO 1 0 1 44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


116 


39 


540 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


102 


67 


541 


S80119 


Rattus sp. 


reverse transcriptase homolog 


191 


50 


542 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


155 


77 


543 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


125 


91 


544 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB*^ 


146 


62 


545 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


74 


45 


546 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


94 


75 


547 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


510 


86 


548 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


146 


68 



143 



NJSDCCID: <WO 0164835A2J_> 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 


Accession No. 




Description 


Smith- 


y 








RHAla (AF078683) 






549 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


128 


82 


550 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


153 




551 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7077. 


12] 


— T~ 


.552 


X92485 


Plasmodium 
vivax 


pval 


103 





553 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


129 




554 


L27428 


Homo sapiens 


reverse transcriptase 


149 




555 


AF194537 


Homo sapiens 


NAG13 




--^ 

-_z£ 


556 


Y13247 


Homo sapiens 


FBI 9 protein 


~T7fc 


. 42 


557 


G00637 


Homo sapiens 


Human secreted protein SEQ ID NO* 
4718. 


"Too" 




558 


G01984 


Homo sapiens 


6065. 






559 


S80119 


Rattus sp. 


reverse transcriptase homo log 






560 


AY008270 


Homo sapiens 




~17?f 


~ 


561 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


— — — 


~ 


562 


U49973 


Homo sapiens 


similar to pogo element 


-— 


_ 

68 


563 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


— 


— 


564 


D38114 


Gorilla gorilla 


cytochrome c oxidase subunit 3 (COI1I) 






565 


Y36156 


Homo sapiens 


Human secreted protein #28. 


~f§3~ — : — 


-~ 


566 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


131 


48 


567 


D38122 


Homo sapiens 


cytochrome c oxidase subunit 3 




— ^ 


568 


AF130079 


Homo sapiens 


PR02852 


"TT-rr 




569 


AF081114 


Mus musculus 
domesticus 


ORF2 


— — 


_ 

40 


570 


L22030 


Glycine max 


hydroxyproline-rich glycoprotein 


65 


— 


571 


D86853 


Catharanthus 
roseus 


extensin 




— 


572 


AF1 04415 


Mus musculus 


gene trap locus- 1 3 






573 


AF1 30089 


Homo sapiens 


PRO2550 


114 


56 


574 


X67863 


Mus musculus 


T2 






575 


S80119 


Rattus sp. 


reverse transcriptase homolog 


101 


28 


576 


S80119 


Rattus sp. 






57 


577 


U49973 


Homo sapiens 


ORfT^^R^p^ 

similar to pogo element 





74 


578 


G00637 


Homo sapiens 


Human secreted protein SEQ ID NO* 
4718. 


"Toe 


~57 


579 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO" 
7871. 






580 


L24521 


Homo sapiens 


trsnsfbrnifltion-related protein 






581 


D38112 


Homo sapiens 


cytochrome c oxidsse subunit 3 


"5T7" 


~P 


582 


AF090895 


Homo sapiens 


PRO0117 


127 


SO 


583 


G03203 


Homo sapiens 


72S4. 






584 


X55681 


Lycopersicon 
esculentum 


extensin (class I) 


1 12 


38 


585 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


473 


60 


586 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


144 


68 


587 


U47855 


Araneus 
diadematus 


fibroin-3 


124 


30 


588 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


150 


75 


589 


U93567 


Homo sapiens 


putative pi 50 


?9 5 


47 



144 



WO 01/64835 PCT/US01/04927 



SEQ1D 
NO. 


Accession No. 


Species 


Description 


Sraiih- 

Walermarc 

Score 


Idcntit 

y 


590 


X71602 


Nicotiana 
tabacum 


extensin 


147 


33 


591 


X57527 


Homo sapiens 


alpha 1(VI11) collagen 


103 


42 


592 


G031 12 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7193. 


75 


47 


593 


R28916 


Homo sapiens 


Type 111 procollagen (prior art). 


J 16 . 


48 


594 


R95913 


Homo sapiens 


Neural thread protein. 


1 16 


37 


595 


U11880 


Petronryzon 
marinus 


cytochrome oxidase subunit I 


127 


52 


596 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


84 


62 


597 


L27428 


Homo sapiens 


reverse transcriptase 


158 


40 


598 


M55251 


Canis familiaris 


glycoprotein 80 


559 


86 


599 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


132 


100 


600 


Ml 8094 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


143 


33 


601 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


154 


54 


602 


X73481 


Drosophila hydei 


mstl01(2) 


107 


42 


603 


M81321 


Macaca 
fascicularis 


proline-rich protein 


114 


39 


604 


X05561 


Homo sapiens 


alpha-1 chain precursor (A A -27 to 
917) (2953 is 2nd base in codon) 


109 


42 


605 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


82 


62 


606 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


83 


64 


607 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


107 


44 


608 


L27428 


Homo sapiens 


reverse transcriptase 


147 


43 


609 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


113 


61 


610 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


151 


82 


611 


U93568 


Homo sapiens 


putative pi 50 


144 


32 


612 


AB022223 


Arabidopsis 
thaliana 


extensin protein-like 


186 


58 


613 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


108 


49 


614 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


133 


36 


615 


X92485 


Plasmodium 
vivax 


pval 


120 


42 


616 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


175 


89 


617 


U83280 


Leishmania 
donovani 


39 kDa antigen 


111 


51 


618 


AL 160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


137 


67 


619 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


120 


53 


620 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. j 


92 


45 


621 


AF 13 0089 


Homo sapiens 


PRO2550 


123 


34 


622 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


133 


59 


623 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


127 


45 


624 


U49973 


Homo sapiens 


ORF1; MER37; putative rransposase 
similar to pogo element 


205 


66 



145 



0164835A2 I 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Waterman 
Score 


Identit 

y 




X07882 


Homo sapiens 


Po protein 


1 19 


39 


626 
— — 


Y86248 

..... 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


70 


100 


6 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


51 


628 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


141 


54 


~629 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


1 15 


46 






Homo sapiens 


HSJlb 


151 


52 


631 


AF130089 


Homo sapiens 


PRO2550 


155 


47 


632 


X92485 


Plasmodium 

...y ivax 


pval 


102 


61 


~633 


^ 0 -, 905 


Homo sapiens 


salivary proline-rich protein precursor 


102 


39 


634 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


171 


74 


635 


X92485 


Plasmodium 
vivax 


pval 


95 


73 




S801 19 


Rattus sp. 


reverse transcriptase homolog 


114 


58 


637 


U 15647 


Mus musculus 


reverse transcriptase 


170 


42 


638 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


130 


76 


639 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


83 


36 


640 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


96 


34 


641 


X61296 


Rattus 
norvegicus 


open reading frame 2 


166 


35 


642 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


127 


35 


643 


U23 1 72 



Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 
(SP:CS24_HUMAN, P23246) and D. 
melanogaster No-on-transient A protein 
(PIR: JH0 1 62) 


115 


35 


-—r 


AF081 111 


Mus musculus 
domesticus 


ORF2 


168 


33 


~645 




Homo sapiens 


unnamed protein product 


90 


51 


646 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


101 


59 


647 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


149 


76 


648 


AF273441 


Pongo pygmaeus 


NADH dehydrogenase subunit 3 


121 


58 


649 


L27428 


Homo sapiens 


reverse transcriptase 


173 


58 




AF1 19851 


Homo sapiens 


PRO 1722 


176 


53 


651 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO: 125. 


168 


68 


652 


AF1 30089 


Homo sapiens 


PRO2550 


130 


36 


653 
— ~ 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1 . 


126 


41 


— — 




Homo sapiens 


unnamed protein product 


195 


63 




A OA /I IQQi 


Rattus 
norvegicus 


cytoplasmic dynein heavy chain 


158 


100 


656 


X61047 


Hydra sp. 


mini-collagen 


60 


38 




M22332 


Homo sapiens 


unknown protein 


100 


50 


658 


AF010144 




neuronal thread protein AD7c-NTP 


143 


62 


659 


AF 194537 


Homo sapiens 


NAG 13 


95 


48 


660 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


120 


50 


661 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


86 


32 


662 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


169 


53 


663 


W48351 


Homo sapiens 


Human breast cancer related protein 


120 | 72 



146 



ISDOCID: <WO 0164835A2J_> 



WO 01/64835 



PCT/US01/O4927 



SEQ ID 

NO: 


Accession No. 




Description 


Smith- 
Wau ;m an 


% 

1 dentil 

y 








BCRB2 




-— 







ftg^gp 


Mus musculus 


reverse transcriptase 


148 




665 




Homo sapiens 


Neural thread protein. 


"T6l ~ 


~59 


666 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


80 


667 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


44 


668 


- 

AB01 8705 


.. . 

Mus musculus 


~bRf2 


~J\5 


~32 


669 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


280 


75 


670 


X53581 


Rattus 
norvegicus 


ORF4 


71 


j9 


671 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


201 


66 


672 


R95913 


Homo sapiens 


Neural thread protein. 






673 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


"164" 


"~46 


674 


AF1 1 8082 


Homo sapiens 


PRO 1902 






675 


G03 133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


-qT~ 

91 





— 




D00570 


Mus musculus 


open reading frame (251 AAj 


— — - 


~72 

-— 


677 


AF194537 


Homo sapiens 




— ^ 




678 


M ] 3 1 00 


norvegicus 


unknown protein 


~~R6 


-jj 


679 


U 15647 


Mus musculus 


reverse transcriptase 


123 




680 


R959 1 3 


Homo sapiens 


Neural thread protein. 




~J? 


681 


R59842 


Homo sapiens 


ApoE4Ll protease. 


107 


63 


682 


AF130089 


Homo sapiens 


PRO2550 


94 




683 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


129 


69 


684 


AF093748 


Homo sapiens 


KH type splicing regulatory protein; 
KSRP 


93 


50 


685 


U93569 


Homo sapiens 


putative pi 50 


133 


58 


686 


G03095 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 17 


64 


687 


Y36243 


Homo sapiens 


Human secreted protein encoded by 


69 


73 

-— 


688 


AF1 16712 


Homo sapiens 


PKUi / Jo 


— — — 




"41 — 


689 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6953 ; -— 


J 


— — 








— : 

Homo sapiens 


putative pi 50 


"732 




691 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


349 


70 


_— _ 


ArUyCJoiO 


— — : 

Homo sapiens 


DP /"vi i 1 n 
rKUU 11/ 


~TR 


-^r 

— 


_222 




Homo sapiens 


r l\\JSD JU 


-p2 

-~ 




694 


S801 1 9 


Rattus sp. 


reverse transcriptase homolog 




~43 


695 


U 15647 


Mus musculus 


reverse transcriptase 


120 


64 


696 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1j2 


59 


697 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


128 


72 


698 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


182 


47 ... . 


699 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


459 


83 


700 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


73 


701 


AF003535 


Homo sapiens 


ORF2-Iike protein 


125 


49 


702 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


89 


41 


703 


L27428 j 


Homo sapiens 


reverse transcriptase 


255 


50 


704 


AF130089 


Homo sapiens 


PRO2550 


87 


55 


705 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


52 



147 



NSDOCID: <WO 0164B35A2 I 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description - +; , ' ■; 


Smith- 
Waterman 
Score 


% 

Identit 

y 








7133. 










Homo sapiens 


Human haemopoietic stem cell 
regulatory protein SCM3. 


211 


88 


707 




Mus musculus 


reverse transcriptase 


94 


47 


708 


Y02693 


Homo sapiens 
. 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


153 


66 


"709 


GO j 807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


107 


50 


710 


U49973 
— — „ 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


163 


64 


~m — 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


146 


65 


712 


U93565 


Homo sapiens 


putative pi 50 


108 


33 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


108 


37 






Homo sapiens 


lamin-Iike protein 


92 


35 


"TO 


D38U2 


Homo sapiens 


cytochrome c oxidase subunit 3 


306 


79 


716 


U97674 


Mesocricetus 
auratus 


cytoclirome c oxidase chain I 


462 


85 


"717 




Homo sapiens 


jerky gene product homolog 


100 


42 


718 


X92485 


Plasmodium 
vivax 


pval 


84 


48 


719 


AF 130089 


Homo sapiens 


PRO2550 


132 


74 




AL45 1017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


125 


43 




AK024455 


Homo sapiens 


FLJ00047 protein 


108 


68 


722 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


146 


46 


723 


X83413 


Human 
herpesvirus 6 


U8S 


269 


41 


724 

... 


X92485 


Plasmodium 
vivax 


pval 


117 


43 




X92485 


Plasmodium 
vivax 


pval 


97 


41 




G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


102 


73 




G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


118 


43 






Beta vulgaris 


ch kinase 1 


91 




729 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


107 


49 


730 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


131 


59 


"731 


W4y / 1 7 


Homo sapiens 


Protein polymer adhesive substrate 
PPAS1-C. 


148 


29 




W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


95 


45 




X9673 1 


Ostertagia 
circumcincta 


cuticular collagen 


104 


37 


734 


AF130089 


Homo sapiens 


PRO2550 


118 


40 




G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


166 


100 


736 


Y91577 




Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


494 


86 


737 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


188 


69 


738 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


593 ! 


91 


739 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


102 


62 



148 



_016483SA2„L> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

IVO: 


Accession No. 


Species 


Description 


Waterman 


y 


740 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


I 1 1 


54 


741 


A23786 


Beta vulgaris 


chitinase 1 


106 


36 


742 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


71 


72 


743 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


184 


6} 


744 


AE003629 


Drosophila 
melanogaster 


CG17108 gene product 


76 


36 


745 


U93563 


Homo sapiens 


putative pi 50 


145 


50 


746 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


68 


66 


747 


AF2 17973 


Homo sapiens 


unknown 


113 


79 


748 


AC006161 


Arabidopsis 
thaliana 


putative CENP-B/ARS-binding 
protein-like protein 


112 


31 


749 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7077. 


106 


47 


750 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla (AF078683) 


115 


54 


751 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


99 


72 


752 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


85 


48 


753 


X92485 


Plasmodium 
vivax 


pval 


89 


73 


754 


U93563 


Homo sapiens 


putative pi 50 


186 


68 


755 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


116 


50 


756 


AF 194537 


Homo sapiens 


NAG 13 


138 


40 


757 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


112 


79 


758 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


119 


61 


759 


AF 130079 


Homo sapiens 


PR02852 


138 


40 


760 


X92485 


Plasmodium 
vivax 


pval 


88 


77 


761 


U93050 


Mus musculus 


poly(A) binding protein U 


95 


35 


762 


V01 555 


Human 
herpesvirus 4 


BYRF1, encodes EBNA-2 (Dambaugh 
etal, 1984;DillneretaI, 1984) 


80 


41 


763 


V00662 


Homo sapiens 


ATPase 6 


337 


83 


764 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


117 


41 


765 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


423 


81 


766 


R95913 


Homo sapiens 


Neural thread protein. 


114 


66 


767 


V00662 


Homo sapiens 


cytochrome oxidase I 


223 


83 


768 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


268 


83 


769 


V00662 


Homo sapiens 


cytochrome oxidase I 


357 


81 


770 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


296 


71 


771 


AF026211 


Caenorhabditis 
elegans 


Similar to cuticular collagen 


95 


39 


772 


AK024455 


Homo sapiens 


FLJ00047 protein 


108 


53 


773 


X92485 


Plasmodium 
vivax 


pval 


95 


39 


774 


AF130051 


Homo sapiens 


PRO0898 


123 


38 


775 


ABO 12223 


Canis familiaris 


ORF2 


174 


51 


776 


AB028664 


Paralichthys 
olivaceus 


cytochrome oxidase subunit-3 


268 


57 


777 


V00662 


Homo sapiens 


cytochrome oxidase I 


436 


85 


778 


X52235 


Homo sapiens 


ORFII 


125 


47 


779 


U93569 


Homo sapiens 


putative pi 50 


235 


49 



149 



WO 01/64835 



PCT/US01/04927 



3 



SEQ ID 

NO: 


Accession No. 






Waterman 


Identit 

y 


780 


U49973 


Homo sapiens 


ORF 1; MER37; putative transposase 
similar to pogo element 


262 


76 


781 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


239 


70 


7S2 


AF090942 


Homo sapiens 


PRO0657 


146 


69 


783 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


85 


66 


784 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


209 


62 


785 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


184 


60 


786 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


158 


60 


787 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


293 


86 


788 


AF 130089 


Homo sapiens 


PRO2550 


153 


64 


789 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


131 


57 


790 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


100 


51 


791 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


116 


80 


792 


Y36203 


Homo sapiens 


Human secreted protein #75. 


104 


77 


793 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


117 


58 


794 


AF090930 


Homo sapiens 


PRO0478 


130 


68 


795 


AK024455 


Homo sapiens 


FLJ00047 protein 


114 


61 


796 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


141 


70 


797 


AF 130089 


Homo sapiens 


PRO2550 


273 


85 


798 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


134 


66 


799 j 


AF 130051 


Homo sapiens 


PRO0898 


162 


80 


800 


G03560 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7641. 


179 


70 


80] 


Y 14479 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


75 


51 


802 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


103 


34 


803 


D38484 


Hylobates 
syndactylus 


Cytochrome C oxidase subunit 1 
(COXI) 


263 


70 


804 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


112 


41 


805 


X92485 


Plasmodium 
vivax 


pval 


103 


38 


806 


AF194537 


Homo sapiens 


NAG 13 


285 


51 


807 


AF121360 


Drosophila 
melanogaster 


DNZDHHC/NEW1 zinc finger protein 
11 


179 


47 


808 


X92485 


Plasmodium 
vivax 


pval 


131 


46 


809 


D13951 


Nicotiana 
tabacum 


extensin precursor 


88 


41 


810 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


174 


73 


811 


U93565 


Homo sapiens 


putative pi 50 


118 


38 




AF1 1 8082 


Homo sapiens 


PRO1902 


88 


54 


813 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


132 


42 


814 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


114 


50 


815 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


113 


43 


816 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


148 


40 



150 



JSDOCID: <WO_ 



_0164835A2_L> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


I Species 

■ 


Description 


Smith- 
Waterman 


% 

Identit 

y 


817 


AF2 1 7449 


Schistosoma 
mekongi 


NADH dehydrogenase subunit 6 


102 


37 


818 


ABO 12223 


Canis familiaris 


ORF2 


101 


54 






Nicotiana 
tabacum 


extensin 


1 62 


45 




yjUWy 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 




36 


821 


R95913 


Homo sapiens 


Neural thread protein. 


1 53 


61 




G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


125 


61 




U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


364 


57 




U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


121 


48 


825 


AF194537 


Homo sapiens 


NAG13 


224 


58 




U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


437 


72 


J*zZ 


A ~cr\ c 1 too 
Ar05 1 /5Z 


Homo sapiens 


diaphanous 1 


108 


38 


828 


AFI 94537 


Homo sapiens 


NAG 13 


92 


45 


829 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


492 


75 


830 


M64791 


Rattus 
norvegicus 


salivary proline-rich protein 


1 10 


46 


831 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


108 


31 


832 




Mus musculus 


Pro-Pol-dUTPase polyprotein 


117 


34 


833 


U93564 


Homo sapiens 


putative pi 50 


84 


40 


834 


U93563 


Homo sapiens 


putative pi 50 


262 


54 


835 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


214 


80 


836 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 Finger 
protein rhala. 


107 


48 


837 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


158 


76 


838 


AFI 94537 


Homo sapiens 


NAG 1 3 


153 


60 


839 


U52077 


Homo sapiens 


mariner transposase 


344 


67 


840 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


104 


46 


841 


W90847 


Homo sapiens 


Human lymphocyte targeted peptide 
#15. 


130 


92 


842 


M64791 


Rattus 
norvegicus 


salivary proline-rich protein 


114 


36 


843 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


196 


43 


844 


X92485 


Plasmodium 
vivax 


pval 


102 


73 


845 


X61048 


Hydra sp. 


mini-collagen 


106 


41 


846 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


92 


48 


847 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


1 12 


54 


848 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


128 


68 




AF194537 


Homo sapiens 




141 


46 


850 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


121 


45 


851 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


153 


58 


852 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


220 


60 



]51 



, MSDOCID: <WO 0 1 64835A2_l_s 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 
— — 


Description 


Smilh- 

Waterman 

Score 


% 

Identit 

y 


~853 


X92485 


Plasmodium 
vivax 


pval 


127 


45 


~m — 




Homo sapiens 


Scarj 


99 


38 






Homo sapieDS 


NADH dehydrogenase subunit 2 


343 


68 


856 


S80119 


Rattus sp. 


reverse transcriptase homolog 


159 


56 


857 


AF] 30089 


Homo sapiens 


PRO2550 


1 12 


40 


858 


AK024372 


Homo sapiens 


unnamed protein product 


129 


50 


859 


G03790 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
7871. 


103 


76 




D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


279 


80 


861 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


96 


44 


862 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


83 


40 


863 


AF2 10651 


Homo sapiens 


NAG 18 


122 


63 


864 


AFO 16099 


Mus musculus 


en don uc lease/reverse tran scriptase 


109 


51 


865 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


115 


34 


866 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


107 


80 


867 


X92485 


Plasmodium 
vivax 


pval 


105 


41 


868 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


127 


62 


869 


X92485 


Plasmodium 
vivax 


pval 


105 


38 


870 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


90 


56 


871 


Y27571 


Homo sapiens 


Human secreted protein encoded by 
gene No. 5. 


128 


72 




G03 133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


123 


36 




AF130089 


Homo sapiens 


PRO2550 


160 


82 


874 


AF1 18082 


Homo sapiens 


PRO 1902 


143 


65 


875 


U93564 


Homo sapiens 


putative pi 50 


180 


44 


876 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


248 


75 


877 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


127 


52 


878 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


106 


35 


879 


UI 1288 


Drosophila 
melanogaster 


diaphanous protein 


93 


46 


880 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rha 1 a. 


71 


47 


881 


B08942 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 18 SEQ ID NO:99. 


95 


40 


882 


AF 130089 


Homo sapiens 


PRO2550 


137 


44 


883 


AF090942 


Homo sapiens 


PRO0657 


142 


73 


884 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


97 


60 


885 


X61296 


Rattus 
norvegicus 


open reading frame 2 


106 


43 


886 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


91 


57 






Homo sapiens 


collagen-like protein (447 AA) 


130 


51 


888 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


117 


55 


889 


L25616 


Homo sapiens 


CGI protein 


150 


62 


890 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


171 


66 


891 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 


107 


40 



152 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


1 % 
Identit 








fragment, SEQ ID NO:387. 




— 


892 


X523 1 8 


Bos taurus 


histone H2A.Z (AA 1-127) 


356 


79 


893 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


102 


46 


894 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


173 


75 


895 


X92485 


Plasmodium 


pval 


109 


47 


896 


L76159 


Homo sapiens 


FRGI gene product 


100 


35 


897 


D50926 


Homo sapiens 


The KIAA0I36 gene product is novel. 


280 


89 


898 


X0401 1 


Homo sapiens 


precursor polypeptide 


1 14 


95 


899 


M90656 


Homo salens 




101 


90 


900 






Ne^rafthread^ synthetase — _ 






901 


L27428 


Homo sapiens 


reverse transcriptase 


81 


47 


~§o~I 






Pro-Pol-dUTPase polyprotein 









G00412 


Hom^sapiens 3 


4493. P ' Q 


~7o3 




~fo3 


a c-ni ni Ad 


Homo sapiens 












Homo sapiens 




~i&r 





906 


U49973 


Homo sapiens 


ORF1* MER37* putative trail sposase 
similar to pogo element 


~230 


— 


907 


L27428 


Homo sapiens 


reverse transcriptase 


1 14 


64 


908 


G00377 




Human secreted protein SEQ ID NO- 
4458. 


142 






U93570 










910 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


91 


63 






Homo sapiens 


reverse transcriptase 






912 


X92485 


Plasmodium 


pval 


91 


60 


913 



G03789 


Homo sapiens 
— — : . 


Human secreted protein, SEQ ID NO: 
J** 70 " : — 


103 

— 


64 


914 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
JJ^O. — — j — j — - 


96 


— 

64 


~9\5 


Y64890 


Homo sapiens 


Human 5 EST related polypeptide SEQ 
IDNO:1051. 


~T2i 


— 




G02501 




6582. 










M[us musculus 








918 


M15530 


Homo sapiens 


B-cell growth factor 


88 


51 




ATI mma 

r\r LOW ly 


Homo sapiens 








920 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


77 


921 


X53581 


Rattus 


ORF4 


124 


32 


922 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


110 


50 


923 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


99 


36 


924 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


80 


47 


925 


AC006127 


Homo sapiens 


BRG- 1 -HUMAN ; nuclear protein 
GRB1 ; homeotic gene regulator; SNF2- 
BETA; MITOTIC GROWTH AND 
TRANSCRIPTION ACTIVATOR; 

TRANSCRIPTION ACTIVATOR 
SNF2L4 


442 


92 


926 


L17318 


Rattus 
norvegicus 


proline-rich proteoglycan 


126 


39 


927 


U49973 


Homo sapiens 


ORFI; MER37; putative transposase 
similar to pogo element 


321 


76 



153 



WO 01/64835 



PCT/US01/04927 



SEQID 

NO: 


Accession No. 


Species 


Description 


Smith- 

Wnterman 

Score 


% 

Identit 

y 






Arabidopsis 
thaliana 


putative CENP-B/ARS-bmding 
protein-like protein 


97 


35 






Daucus carota 


put. precursor 


1 12 


3S 


930 


U93563 


Homo sapiens 


putative pi 50 


125 


70 




nDU 1 ZZZ J> 


Canis familiaris 


ORF2 


202 


50 
37 


932 


AF053538 


Alvinella 
pompejana 


fibrillar collagen chain FApl alpha 


114 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


107 


57 




U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


283 


54 


935 


U41017 


Caenorhabditis 
elegans 


repetitive region; weakly similar to E. 
gracilis major membrane skeletal 
protein (PIR: A434 1 7) 


107 


33 


936 


U47855 


Araneus 
diadematus 


fibroin-3 


109 


33 


937 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


95 


43 


938 


W48351 




Homo sapiens 


Human breast cancer related protein 
BCRB2. 


108 


50 


~939 


M13101 


norvegicus 


unknown protein 


121 


40 


940 


G00689 

— 


Homo sapiens 
— . 


Human secreted protein, SEQ ID NO: 
4770. 


117 


46 


"941 


AFO 101 44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


88 


62 


942 


L27428 


Homo sapiens 


reverse transcriptase 


86 


45 






Homo sapiens 


putative pi 50 


279 


40 


944 


Y64890 


Homo sapiens 

,., 


Human 5' EST related polypeptide SEQ 
ID NO: 1051. 


1 10 


48 


~~945 


L27428 


Homo sapiens 


reverse transcriptase 


238 


66 


946 


AF1 94537 


Homo sapiens 


NAG 13 


146 


47 




G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO- 
6619. 


81 


55 


948 


AC006161 


Arabidopsis 
thaliana 


putative CENP-B/ARS-binding ' 
protein-like protein 


113 


37 




AF 194537 


Homo sapiens 


NAG 1 3 


106 


66 


950 


Y14437 


Homo sapiens 


Human secreted protein encoded by 
gene 27 clone HSAWA27. 


74 


75 


951 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


78 


66 


952 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


85 


72 




U44838 


Glycine max 


extensm 


145 


39 


~9"54" 




Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


131 


39 


955 


G03801 


Homo sapiens 
• 


Human secreted protein, SEQ ID NO: 
7882. 


128 


53 


"956 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


103 


74 






Homo sapiens 


PRO2550 


120 


37 


~93I 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


115 


71 




AF090942 


Homo sapiens 


PRO0657 


83 


63 


960 


L27428 




reverse transcriptase 


121 


30 


961 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


157 


68 


962 


AF090942 


Homo sapiens 


PRO0657 


138 


61 


963 


U83280 


Leishmania 
donovani 


39 kDa antigen 


101 


53 


964 


U93566 


Homo sapiens 


p40 


102 


31 



154 



WO 01/64835 



PCT/U SO 1/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
VVatcrman 


% 

Jdentit 

y 


965 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID "NO; ■ 
7867. 


104 




966 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


136 




967 


AF090930 


Homo sapiens 


PRO0478 


158 


80 


968 


ABO 12223 


Canis familiaris 


ORF2 


94 


36 


969 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


127 




970 


X53581 


Rattus 


ORF4 


163 


43 


971 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7534. 


109 


56 


972 


G031 14 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


82 


47 


973 


Y69166 




transferase protein. 






974 


G03095 




7176. 









^ 3574 


Homo sapiens 


putative pi 50 


_112 









Homo sapiens 


Human breast cancer related protein 


79 


65 


~977 


Y08062 


Homo sapiens 


HumaifpR0245 protein fragment 

derived from DNA3 5638. 


— — 


~55 


978 


G0341 1 




Human secreted protein SEQ ID NO* 
7492. 






979 


AF1301 14 


Homo salens 


PR02459 


~9^ 





980 


K03202 




un'knowrf rc 1 " 16 "" 0 ** Pr ° te ' n precursor — 






981 


AF1 16909 


Homo sapiens 




~Ti5 


~42" 


982 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


106 


75 


983 


Y 14674 


Plasmodium 
f ale iparum 


gJutamate-cysteine ligase 


106 


66 


984 


AF229067 


Homo sapiens 


PADI-H protein 


152 


60 


985 


AFI 19900 


Homo salens 


PR02822 


— 




986 


G03714 


omo sapiens 


Human secreted protein SEQ ID NO* 
7795. 







987 


R13556 


Homo sapiens 


oncoprotein. 


106 


— 


988 


G03 1 1 9 


Homo sapiens 


Human secreted protein SEQ ID NO" 
7200. 


81 


51 


990 


U35730 


Mus musculus 


jerky 


~~ 133 


~29 


991 


AFI 13685 


Homo sapiens 


PRO0974 


136 


63 


992 


U52077 










993 


Z97211 


Schizosaccharom 


kinesin-like protein 


197 


47 


994 


AF010144 


H^mo^apiens 


neuronal thread protein AD7c-NTP 


124 


~68 


995 


U93563 


Homo sapiens 


putative pi 50 


157 


50 


996 


AB015802 


xylinus 


protein of A. xylinum ATCC23869 


140 


54 


997 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


144 


60 


998 


D381J6 


Pan paniscus 


cytochrome c oxidase subunit 1 


352 


75 


999 


303148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


524 


78 


1000 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


377 


70 


1001 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


109 


50 


1002 


AL3901 14 


Leishmania 
major 


extremely cysteine/valine rich protein 


249 


61 


1003 


M14702 


Murine leukemia 


pol polyprotein 


206 


48 



155 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


-s wi(n . 

Waterman 


% 

Identit 

y 




______ 


virus 








~T004 — 




Homo sapiens 


human elongation factor-1 -delta 


51 1 


85 


1005 


L27428 


Homo sapiens 


reverse transcriptase 


176 


63 


1006 


Dj8112 


Homo sapiens 


NADH dehydrogenase subunit 5 


332 


77 


i^2Z — 


AJF090895 


Homo sapiens 


PROO 1 1 7 


162 


66 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7183. 


140 


65 


"4^r^ — 


U44838 


Glycine max 


extensin 


166 


33 




ArZj lzyu 


Plasmodium 
falciparum 


glutamic acid-rich protein 


1 14 


52 


1011 


L27428 


Homo sapiens 


reverse transcriptase 


114 


52 




AFl 30089 


Homo sapiens 


PRO2550 


114 


77 


1013 


G04092 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8173. 


81 


44 


1014 


AF090895 


Homo sapiens 


PRO0117 


97 


65 


1015 ... 


AF1300S9 


Homo sapiens 


PRO2550 


168 


83 




AF079367 


Mesocricetus 
auratus 


cytochrome c oxidase subunit III 


276 


52 


1017 


AF055985 


Onchocerca 
volvulus 


pyrrolidone-rich antigen 


94 


44 


1018 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


114 


46 


1019 


AF090944 


Homo sapiens 


PRO0663 


137 


50 


1020 


Y2 1 1 66 


Homo sapiens 


Human bc!2 proto-oncogene mutant 
protein fragment 14. 


84 


36 


1021 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


140 


37 


1022 


AL049608 


Arabidopsis 
thaliana 


extensin-like protein 


105 


34 


1023 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 1 00 clone HNFIU96. 


152 


52 


1024 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


125 


66 


1025 


M12140 


Homo sapiens 


envelope protein 


143 


62 




U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


200 


58 




U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


266 


67 




AF04I330 


Bodo saltans 


NADH dehydrogenase subunit 5 


97 


31 


1029 


G02950 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7031. 


302 


56 


1030 


G02482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6563. 


76 


63 


1031 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


74 


56 


1032 


X90568 


Homo sapiens 


Protein sequence and annotation 
available soon via Swiss-Prat; available 
at present via e-mail from 
LABEIT@EMBL-Heidelberg.DE 


389 


100 


1033 
■ 


G02654 
— 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6735. 


96 


40 


_I2£Z 


AF0 1 0 1 44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


316 


60 


1035 


AF194537 


Homo sapiens 


NAG 13 


208 


52 


1036 


L27428 




reverse transcriptase 


166 


51 


1037 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


107 


44 


1038 


AFl 16638 


Homo sapiens 


PRO 1546 


56 


61 


1039 


U93570 


Homo sapiens 


putative pi 50 


138 


40 


1040 


AF130089 


Homo sapiens 


PRO2550 "~ 


150 


91 


1041 


AK024455 


Homo sapiens 


FLJ00047 protein 


151 


68 



156 



WO 01/64835 PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smi.h- 

Watcrman 

Score 


•>/» 

Idcnlit 

y 


1042 


Y36156 


Homo sapiens 


Human secreted protein #28. 


97 


41 


1043 


U93568 


Homo sapiens 


putative p 1 50 


124 


34 


1044 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


208 


71 


1045 


U93563 


Homo sapiens 


putative pi 50 


246 


54 


1046 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


131 


64 


1047 


U93563 


Homo sapiens 


putative p350 


127 


30 


1048 


AF130114 


Homo sapiens 


PR02459 


117 


67 


1049 


U12919 


Mus musculus 


adenylyl cyclase type VI 1 


170 


68 


1050 


AC008054 


Leishmania 
major 


L8453.1 


129 


30 


1051 


X99467 


Medicago 
truncatula 


ENOD20 


1 10 


38 


1052 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


150 


76 


1053 


AF1 16712 


Homo sapiens 


PR02738 


109 


44 


1054 


M96256 


Homo sapiens 


rapamycin binding protein 


168 


56 


1055 


U15647 


Mus musculus 


reverse transcriptase 


86 


37 


1056 


AL024498 


Homo sapiens 


dJ417M14.2 (novel serine/threonine- 
protein kinase (ortholog of mouse and 
rat MAK (male germ cell-associated 
kinase)) 


190 


72 


1057 


AF090942 


Homo sapiens 


PRO0657 


103 


63 


1058 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


156 


68 


1059 


AF081114 


Mus musculus 
domesticus 


ORF2 


134 


47 


1060 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


180 


83 


1061 


U70935 


Peromyscus 
maniculatus 


reverse transcriptase 


126 


45 


1062 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


84 


55 


1063 


U15647 


Mus musculus 


reverse transcriptase 


95 


38 


1064 


U93567 


Homo sapiens 


putative pi 50 


128 


58 


1065 


Y14479 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


130 


66 


1066 


X92485 


Plasmodium 
vivax 


pval 


1 19 


62 


1067 


U93567 


Homo sapiens 


p40 


161 


48 


1068 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


540 


84 


1069 


U93570 


Homo sapiens 


putative p 1 50 


107 


59 


1070 


AF321051 


Chalinolobus 
tuberculatus 


cytochrome c oxidase subunit III 


333 


71 


1071 


U93567 


Homo sapiens 


p40 


99 


28 


1072 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


105 


66 


1073 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


136 


59 


1074 


U93572 


Homo sapiens 


putative pi 50 


140 


53 


1075 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


66 


43 


1076 


AL049608 


Arabidopsis 
thaliana 


extensin-like protein 


105 


37 


1077 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


94 


66 


107S 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


81 


57 


1079 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


83 


35 



157 



NSDOCID: <WO 0164835Aa l_> 



WO 01/64835 



PCT/US01/04927 



SEQID 

NO: 


Accession No. 




Description 


Waterman 


Idcntit 

y 








8172. 






1080 


AF162149 


Mycoplasma 
bovis 


variable surface lipoprotein 


103 


41 


1081 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


148 


75 


J 082 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


121 


42 


1083 


U93564 


Homo sapiens 


p40 


97 


42 


1084 


AF229067 


Homo sapiens 


PADI-H protein 


145 


61 


1085 


U49973 


Homo sapiens 


ORF1 ; MER37; putative trahsposase 
similar to pogo element 


221 


60 


1086 


U88573 


Homo sapiens 


NBR2 


165 


67 


1087 


Y36156 


Homo sapiens 


Human secreted protein #28. 


93 


72 


1088 


AF1 94537 


Homo sapiens 


NAG13 


142 


62 


1089 


B08976 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 28 SEQ ID NO: 133. 


93 


62 


1090 


AF194537 


Homo sapiens 


NAG 13 ~ 


155 


40 


1091 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


103 


35 


1092 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


195 


40 


1093 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


91 


46 


1094 


X53581 


Rattus 
norvegicus 


ORF4 


106 


62 


1095 


AL 160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


129 


51 


1096 


Y86473 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:388. 


72 . 


33 


1097 


U40342 


Mus musculus 


ninein 


152 


44 


1098 


M24732 


Homo sapiens 


lamin-like protein 


92 


37 


1099 


X92485 


Plasmodium 
vivax 


pval 


111 


67 


1100 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


156 


86 


1101 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


116 


90 


1102 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
451S. 


158 


71 


1103 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


97 


36 


1104 


U93572 


Homo sapiens 


putative pi 50 


168 


56 


1105 


U93570 


Homo sapiens 


putative pi 50 


96 


40 


1106 


L27428 


Homo sapiens 


reverse transcriptase 


188 


43 


1107 


X53581 


Rattus 
norvegicus 


ORF4 


141 


43 


1108 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


344 


77 


1 109 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


126 


68 


1110 


U93569 


Homo sapiens 


putative pi 50 


156 


38 


1111 


AF1]8086 


Homo sapiens 


PRO 1992 


135 


54 


1112 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


71 


63. 


1113 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


124 


62 


1114 


L27428 


Homo sapiens 


reverse transcriptase 


200 


40 


1115 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7077. 


101 


50 


1116 


L27428 


Homo sapiens 


reverse transcriptase 


122 


70 


1117 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: j 


148 


65 



158 



JSDOCID: <WO 0164B3SA2 I > 



WO 01/64835 PCT/US01/04927 





Accession No. 


Species 


Descr.ption 


Smith- 




NO: 








Waterman 


Idcntit 










Score 


y 








4718. 






1118 


W48351 


Homo sapiens 


Human breast cancer related protein 


116 


65 








BCRB2. 






1119 


D38484 


Hylobates 


Cytochrome C oxidase subunit 1 


315 


89 






syndactylus 


(COXI) 






1120 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


115 


76 


1121 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


68 








7867. 






1122 


AFO 13990 


Homo sapiens 


ubiquitin C-terminal hydrolase 


163 


50 


1123 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


55 








7867. 






1124 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


65 








6571. 






1125 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


106 


53 


1126 


AF 130089 


Homo sapiens 


PRO2550 


76 


72 


1127 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


92 


59 








6913. 






1128 


Y 14479 


Homo sapiens 


Fragment of human secreted protein 


80 


59 








encoded by gene 14. 






1129 


AF1 19855 


Homo sapiens 


PRO 1847 


146 


70 


1130 


AF1 94537 


Homo sapiens 


NAG 13 


182 


66 


1131 


L27428 


Homo sapiens 


reverse transcriptase 


173 


38 


1132 


U93570 


Homo sapiens 


putative pi 50 


1 19 


34 


1133 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


68 








4488. 






1134 


AJ004810 


Zea mays 


cytochrome P450 monooxygenase 


79 


87 


1135 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP n 


138 


70 


1136 


GO 1502 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


73 








5583. 






1137 


ABO 18705 


Mus musculus 


ORF2 


138 


36 


1138 


L20321 


Homo sapiens 


protein serine/threonine kinase 


150 


63 


1139 


W48351 


Homo sapiens 


Human breast cancer related protein 


92 


60 








BCRB2. 






1140 


AF 19453 7 


Homo sapiens 


NAG 1 3 


115 


33 


1141 


U93564 


Homo sapiens 


putative pi 50 


135 


51 


1142 


D86853 


Catharanthus 


extensin 


142 


37 






roseus 








1143 


D00570 


Mus musculus 


open reading frame (251 AA) 


213 


50 


1144 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


62 


38 








7284. 






1145 


Z70684 


Caenorhabditis 


F28D1.8 


105 


32 






elegans 








1146 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


79 


54 








7214. 






1147 


Y36156 


Homo sapiens 


Human secreted protein #28. 


151 


62 


1148 


A23786 


Beta vulgaris 1 


chitinase 1 


98 


37 


1149 


AF129756 


Homo sapiens 


BAT2 


177 


52 


1150 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


147 


73 


1151 


L27428 


Homo sapiens 


reverse transcriptase 


77 


31 


1152 


U34044 


Homo sapiens 


selenium donor protein 


238 


48 


1153 


AK024455 


Homo sapiens 


FLJ00047 protein 


78 


78 


1154 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


197 


67 








similar to pogo element 






1155 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


120 


64 








clone HPMBQ32. 






1156 


U25281 


Rattus 


SH3 domain binding protein 


98 


34 






norvegicus 








1157 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


138 | 


54 



159 



; NSOOCID :<Wn ni64fl35A2 I > 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 




Description 


Smith- 


% 
y 


1158 


AF 194537 


Homo sapiens 


NAG 13 


106 


48 


1159 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


357 


69 


1160 


U93572 


Homo sapiens 


p40 


89 


47 


1161 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


184 


68 


1162 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


123 


57 


1163 


AF116712 


Homo sapiens 


PR02738 


129 


69 


1164 


V00662 


Homo sapiens 


cytochrome oxidase I 


465 


69 


1165 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


118 


57 


1166 


L26163 


Mus musculus 


histone Hie 


104 


37 


1167 


X70343 


Nicotiana 
sylvestris 


extensin 


95 


33 


1168 


AF130051 


Homo sapiens 


PRO0898 


117 


43 ■ 


1169 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


104 


48 


1170 


L27428 


Homo sapiens 


reverse transcriptase 


149 


33 


1171 


G00497 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4578. 


108 


62 


1172 


AF030277 


Tragelaphus 
spekii 


cytochrome oxidase subunit III 


266 


54 


1173 


L22030 


Glycine max 


hydroxyproline-rich glycoprotein 


87 


38 


1174 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


168 


45 


1175 


U43627 


Arabidopsis 
thaliana 


extensin 


111 


42 


1176 


U43627 


Arabidopsis 
thaliana 


extensin 


98 


29 


1177 


U93565 


Homo sapiens 


putative pi 50 


89 


58 


1178 


JO 1047 


Caenorfaabditis 
elegans 


collagen 


108 


39 


1179 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


170 


68 


1180 


AF0 16099 


Mus musculus 


endonuclease/reverse transcriptase 


113 


36 


1181 


AF0 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


134 


65 


1182 


AF010144 


Homo sapiens 


neuronal thread protein AD7C-NTP 


107 


62 


1183 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


124 


45 


1184 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


418 


86 


1185 


U 87607 


Rattus 
norvegicus 


putative RNA binding protein 1 


106 


41 


1186 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


381 


75 


1187 


D38II2 


Homo sapiens 


cytochrome c oxidase subunit 1 


434 


79 


1188 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


75 


35 


1189 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7870. 


159 


68 


1190 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


146 


68 


1191 


AF1 18086 


Homo sapiens 


PRO 1992 


146 


81 


1192 


W12842 


Homo sapiens 


Truncated pro-alpha 1(UI) chain. 


106 


35 


1193 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


67 


34 


1194 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7637. 


116 


74 


1195 


Y30681 


Homo sapiens 


Splice variant ZAP- 1 B protein of the 
human tumor suppressor gene ZAP-1 . 


132 | 82 



160 



-ISDOCID: <WO 0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


1 196 


G03 1 14 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7 ^ 5, ; 


75 


48 


~Tf97 


u^yy / j 


: 

Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 




73 






Homo sapiens 




103 


30 


1199 


AL3901 14 


Leishmania 


extremely cysteine/valine rich protein 


145 


39 


1200 


AK024455 


Homo sapiens 


FLJ00047 protein 


115 


56 


1201 


AF090942 


Homo sapiens 


PRO0657 


88 


64 


1202 


G 00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523 • 


124 


70 






Homo sapiens 


Human secreted protein encoded from 
gene 12. 


1 13 


46 


1204 


GO 1984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065 - _ 


81 


41 






Mus musculus 


jerky 


107 


27 


1206 


U 15647 


Mus musculus 


reverse transcriptase 


191 


45 


1207 


U15647 


Mus musculus 


reverse transcriptase 


124 


50 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 




58 


1209 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


91 


54 


1210 


AF1 19900 


Homo sapiens 


PR02822 


160 


81 


121 1 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


122 


68 


1212 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


45 


1213 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


125 


61 


1214 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


102 


61 


1215 


G03560 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7641. 


101 


46 


1216 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


270 


58 


1217 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:387. 


105 


46 


1218 


AC002483 


Homo sapiens 


putative product from mRNA sequence 
CG003 from BRCA2 region; match to 
U50534 (NID:gl685103) 


378 


97 


1219 


AF090895 


Homo sapiens 


PRO0117 


130 


58 


1220 


AF1 13685 


Homo sapiens 


PRO 09 74 


117 


60 


1221 


X61295 


Rattus 
norvegicus 


LI retroposon, a portion of its ORF2 
sequence 


126 


50 


1222 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


148 


70 


1223 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2 


99 


56 


t 

1224 




Homo sapiens 


putative pi 50 


93 


44 


1225 


AF130051 


Homo sapiens 


rKUUSyc. 


133 


69 


1226 


U93563 


Homo sapiens 


putative p 1 50 


J 25 . 


47 


1227 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


220 


47 


1228 


U93564 


Homo sapiens 


putative pi 50 


116 


47 






Homo sapiens 


JNir-i encoded oy clone Dy. 




63 


1230 


U 15647 


Mus musculus 


reverse transcriptase 


105 


42 


1231 


U93563 


Homo sapiens 


putative pi 50 


299 


54 


1232 


R959I3 


Homo sapiens 


Neural thread protein. 


138 


51 


1233 


AF130079 


Homo sapiens 


PR02852 


203 


70 


1234 


X53581 


Rattus 
norvegicus 


ORF3 


106 


60 



161 



JSDOCID: <WO 0164835A2_L> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description ' 


Smilh- 
Watermnn 


% 

Identit 

y 


"iHi — 


API 1 RftRfi 


Homo sapiens 


PRO 1992 


144 


81 






Plasmodium 
vivax 


pval 


125 


71 




rt r U i U I 


Homo sapiens 


neuronal thread protein AD7C-NTP 


.116 


75 


1238 


U93572 


Homo sapiens 


putative pi 50 


133 


40 




GO 1 249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5330. 


69 


56 




AF130089 


Homo sapiens 


PRO2550 


136 


41 


1241 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


109 


53 




G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


161 


75 


124 j 


G02752 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6833. 


87 


45 


1244 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


128 


58 


1245 


U93570 


Homo sapiens 


putative pi 50 


161 


50 




Z70684 


Caenorhabditis 
elegans 


F28D1.8 


121 


45 





AF257305 


Homo sapiens 


ASH1 


576 


89 




G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


98 


68 


1249 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


107 


43 




AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


130 


91 


1251 


U63542 


Homo sapiens 


FAP protein 


116 


61 


1252 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


109 


53 


1253 


AF068294 


Homo sapiens 


HDCMB45P 


251 


63 




AF090895 


Homo sapiens 


PRO0 1 1 7 


1 1 1 


60 


1255 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


201 


75 


1256 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


116 


51 




AB033032 


Homo sapiens 


KIAA1206 protein 


115 


80 


1258 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


122 


60 


1259 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


137 


68 


1260 


AB032906 


Hylobates 
pileatus 


dopamine receptor D4 


96 


35 


1261 


AF022985 


Caenorhabditis 
elegans 


Similar to collagen; coded for by C. 
elegans cDNA yk55f3.3; coded for by 
C. elegans cDNA yk66d5.3; coded for 
by C. elegans cDNA yk71e4.3; coded 
for by C. elegans cDNA yk55f3.5; 
coded for by C. elegans cDNA 
yk66d5.5; coded for by C. elegans 
cDNA yk71e4.5 


106 


38 


1262 


U93566 


Homo sapiens 


p40 


182 


39 


1263 


L20096 


Manduca sexta 


ribosomal protein s7 


227 


59 


1264 


AF1 19901 


Homo sapiens 


PR0283 1 


103 


71 


1265 


D87446 




Similar to a C elegans protein encoded 
in cosmid C27F2 (U40419) 


219 


97 


1266 


D90279 


Homo sapiens 


collagen alpha 1(V) chain precursor 


120 


42 


1267 


L27428 


Homo sapiens 


reverse transcriptase 


111 


41 


1268 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


126 


68 


1269 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


102 


34 



162 



WO 01/64835 PCT/US01/04927 



SEQID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 


1270 


AF130089 


Homo sapiens 


PRO2550 


96 


69 


1271 


U 12707 






121 




1272 


AF165310 


Homo sapiens 


ATP cassette binding transporter 1 


243 


100 




R95913 


Pi° m ° S H aP ' enS 


^vai™* thr6ad pr0te ' n ' 




— ^ 


^ 


X92485 


vivax° 


pva 


106 




1275 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


104 


56 


1276 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


53 


36 


1277 


X03725 


Mus musculus 


ORF 2 (466 aa) 


103 


41 


YP* — 


U93570 


Homo sapiens 


putative pi 50 




-4^ — 


1279 


(jUU4U / 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


83 


1280 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
443 —. : 


167 


71 


"1281 


A T*7*7 1 C7 1 

A / 1 o / 1 


— — : 

Nicotiana 
sylvestris 


putative extensm 


~7o5 


— 




K.03205 


Homo sapiens 


salivary proline-rich protein precursor 






1283 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


124 


66 


1284 


G03 1 14 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


50 


1285 


A3 1039 


Nicotiana alata 


PRP3 


112 


36 


1286 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 16 


72 


1287 




U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo clement 


289 


67 




S801 19 


Rattus sp. 


reverse transcriptase )iomo)og 


— — 

-1-1= 


— 

— 





U4997j 


Homo sapiens 


ORF1" IvlER.37j putative transposase 
similar to pogo element 




76 


1290 


AF010144 


Homo sa iens 




Tsi 


— 


1291 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


82 


1292 


AF130089 


Homo sapiens 


PRO2550 


127 


62 





iW \J\jjDjD 


Homo sapiens 


ORF2-like protein 


~I 









Homo sapiens 


cpn ir> MO 1?R frnm WO007?^4^ 
OLy lU rivj JZO LJ UiII W yjzfz? <L£jL*+D . 






1295 


L27428 


Homo sapiens 


reverse transcriptase 


126 


36 






Homo sapiens 


Human secreted protein, SEQ ID NOi 
4518. 






1297 


L24433 


Oncorhynchus 
mykiss 


complement component C3 


j59 


31 


1298 


AC004381 


Homo sapiens 


SA gene 


443 


55 






Homo sapiens 


Wnman et*rr**tf»r\ nrntpiri CCA TFV Kin- 

numdn secreieu protein, odv^ iu iskj. 
5327. 






1^00 


Y 14479 


Homo sapiens 


Fragment of" human secreted protein 
encoded by gene 14. 


135 


66 


n 


D84391 


~~m i 










13Q ^ 




Nicotiana^ ~ 

tabacum 


extensln precursor 56 


~T34 


— : 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 










Mus musculus 


Pro-Pol-dUTPase polyprotein 


9 <3 


^9 


"Bol — 


Y 13520 


Homo sapiens 








1306 


U93567 


Homo sapiens 


putative pi 50 


245 


63 


1307 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


130 


70 


1308 


AL355774 


Streptomyces 
coelicolor A3 (2) 


putative integral membrane protein 


136 


40 


1309 


W54966 


Homo sapiens 


Synthetic human type III collagen 


124 


41 



163 



WO 01/64835 



PC77US01/04927 



seq a>" 

NO: 






Description 


Waterman 


% 

I dentil 

y 








SYN-C3. 






1310 


M20670 


Plasmodium 
vivax 


circumsporozoite protein 


107 


34 


1311 


AF151366 


Arabidopsis 
thaliana 


aiginine/serine-rich protein 


114 


36 


1312 


G03099 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7180. 


76 


43 


1313 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


109 


85 


1314 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


111 


37 


1315 . 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


123 


62 


1316 


R95913 


Homo sapiens 


Neural thread protein. 


98 


58 


1317 


AF1 13685 


Homo sapiens 


PRO0974 


170 


47 


1318 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


122 


61 


1319 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


99 


68 


1320 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


270 


56 


1321 


U93569 


Homo sapiens 


putative pi 50 


124 


37 


1322 


AF090931 


Homo sapiens 


PRO0483 


111 


85 


1323 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


85 


44 


1324 


Ml 8094 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


131 


43 


1325 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


144 


54 


1326 


AF194537 


Homo sapiens 


NAG 13 


125 


49 


1327 


L27428 


Homo sapiens 


reverse transcriptase 


111 


45 


1328 


U93568 


Homo sapiens 


putative pi 50 


112 


30 


1329 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


110 


51 


1330 


L27428 


Homo sapiens 


reverse transcriptase 


142. 


53 


1331 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


98 


88 


1332 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


113 


55 


1333 


X71602 


Nicotiana 
tabacum 


extensin 


113 


35 


1334 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


129 


41 


1335 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


112 


50 


1336 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


102 


68 


1337 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


95 


51 


1338 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


114 


60 


1339 


G00379 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4460. 


142 


55 


1340 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


123 


31 


1341 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7001. 


118 


70 


1342 


X71629 


Mus musculus 


msgl 


106 


57 


1343 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


105 


79 



164 



_016483SA2J_5 



WO 01/64835 PCT/US01/04927 



SEQ ID 




Species 


Description 




% 


NO: 








Waterman 


Identit 










Score 


y 


1344 


G03435 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


85 








7516. 






1345 


AF161356 


Homo sapiens 


HSPC093 


88 


88 


1346 


JO 1435 


Rattus 


ATPase 


348 


66 






norvegicus 








1347 


U93563 


Homo sapiens 


putative pi 50 


142 


36 


1348 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


66 








7871. 






1349 


AF090942 


Homo sapiens 


PRO0657 


152 


54 


1350 


G0O354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


63 








4435. 






1351 


U93572 


Homo sapiens 


putative pi 50 


113 


84 


1352 


X92485 


Plasmodium 


pval 


130 


70 






vivax 








1353 


X61047 


Hydra sp. 


mini-collagen 


105 


36 


1354 


AF 115549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


128 


46 


1355 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


39 








7214. 






1356 


AF194537 


Homo sapiens 


NAG 13 


148 


63 


1357 


AF 130079 


Homo sapiens 


PR02852 


139 


73 


1358 


X53581 


Rattus 


ORF4 


208 


43 






norvegicus 








1359 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


195 


50 








SEQ ID NO: 163. 






1360 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


328 


69 








similar to pogo element 






1361 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


126 


65 


1362 


G02451 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


47 








6532. 






1363 


X53581 


Rattus 


ORF4 


110 


35 






norvegicus 








1364 


U93569 


Homo sapiens 


putative pi 50 


123 


41 


1365 


AF0I0144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


114 


63 


1366 


X61294 


Rattus 


L 1 retroposon, a portion of its ORF2 


153 


43 






norvegicus 


sequence 






1367 


Y02749 


Homo sapiens 


Human secreted protein encoded by 


69 


80 








gene 100 clone HNFIU96. 






1368 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


80 


46 








8172. 






1369 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


26 








8148. 






1370 


X924S5 


Plasmodium 


pval 


106 


46 






vivax 








1371 


U90946 


Dictyostelium 


myosin heavy chain kinase B 


114 


62 






discoideum 








1372 


L27428 


Homo sapiens 


reverse transcriptase 


98 


61 


1373 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


80 








7881. 






1374 


U49974 


Homo sapiens 


mariner transposase 


137 


57 


1375 


AC004891 


Homo sapiens 


contactin-like; similar to U87224 


234 


58 








(PID:gl857710) 






1376 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


132 


60 


1377 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


317 


51 








similar to pogo element 






1378 


AF 118082 


Homo sapiens 


PRO 1902 


3 02 


42 


1379 


AF 115549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


186 


45 


1380 


U93567 


Homo sapiens 


putative pi 50 


116 


38 


1381 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 


218 


47 



165 



NSDOCID: <WO 0164835A2 I > 



WO 01/64835 



PCT/USO 1/0492 7 



SEQ ID 

NO: 


Accession No. 




Description 


Smith- 
Waterman 


Idcn.it 








similar to pogo element 






1382 


B08918 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 28 SEQ ID NO:75. 


87 


66 


1383 


AF090895 


Homo sapiens 


PRO0117 


68 


82 


1384 


U93570 


Homo sapiens 


putative pi 50 


178 


39 


1385 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


132 


43 


1386 


AF130089 


Homo sapiens 


PRO2550 


142 


35 


1387 


L27428 


Homo sapiens 


reverse transcriptase 


163 


49 


1388 


X61296 


Rattus 
norvegicus 


open reading frame 2 


123 


44 


1389 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


138 


67 


1390 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


147 


57 


1391 


U93570 


Homo sapiens 


putative pi 50 


110 


28 


1392 


R95913 


Homo sapiens 


Neural thread protein. 


104 


35 


1393 


L27428 


Homo sapiens 


reverse transcriptase 


101 


39 


1394 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


147 


47 


1395 


AF2 16972 


Homo sapiens 


p8 protein 


118 


49 


1396 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


160 


58 


1397 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


312 


62 


1398 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


97 


62 


1399 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


125 


54 


1400 


X92485 


Plasmodium 
vivax 


pval 


124 


39 


1401 


U93563 


Homo sapiens 


putative p 1 50 


131 


36 


1402 


X67863 


Mus musculus 


T2 


160 


48 


1403 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


94 


39 


1404 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


84 


73 


1405 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


71 


71 


1406 


Ml 8094 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


138 


31 


1407 


AF 1 34304 


Homo sapiens 


Scar2 


118 


40 


1408 


Y08061 


Homo sapiens 


Human c-myb protein fragment- 


121 


82 


1409 


U93574 


Homo sapiens 


putative pi 50 


179 


43 


1410 


U93563 


Homo sapiens 


putative pi 50 


98 


43 


1411 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


124 


46 


1412 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


149 


56 


1413 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


148 


48 


1414 


GO 3 246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


155 


82 


1415 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7348. 


92 


52 


1416 


AF1 19855 


Homo sapiens 


PRO 1847 


82 


70 


1417 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


130 


34 


1418 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


99 


39 


1419 


AF130079 


Homo sapiens 


PR02852 


114 


69 


1420 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


133 


52 



166 



■JSDOCID: <WO 0164B35A2 I > 



WO 01/64835 



PCT/U SO 1/04927 



SEQ IT) 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Idenht 


1421 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


115 


44 


1422 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7S67.- 


1 1 1 


77 


1423 


R95913 


Homo sapiens 


Neural thread protein. 


128 


80 


1424 


L26953 


Homo sapiens 


chromosomal protein 


104 


34 


1425 


U83280 


Leishmania 
donovani 


39 kDa antigen 


105 


51 


1426 


Y 19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


98 


75 


1427 


U83303 






149 




1428 


AF090895 


Homo sapiens 


PRO0 1 1 7 


1 1 1 


75 


1429 


AF1 19855 




PRO 1847 


88 




1430 


AF229067 


Homo sapiens 


PADI-H protein 




~ 


1431 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


509 


79 




U49973 




similar to pogo element 








AF010144 










1434 


AF161356 


Homo sapiens 


HSPC093 


180 


46 





U93570 


H°mo ^iens 






"41 — 




G0j789 


omo sapiens 


Human secreted protein SEQ ID NO" 
7870. 


"TTT 




1437 


U83280 


Leishmania 


39 kDa antigen 


106 


80 


1438 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


103 


73 


~ 1439 


[JI5547 


Mus musculus 




~T r 3 


~44 


1440 


L27428 


Homo sapiens 


reverse transcriptase 


78 


40 




YU J*fUU 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 




68 




rmi p/17 
UUJoU / 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 




63 






Homo sapiens 


unknown protein 






1444 


Ml 1901 


Rattus 


proline-rich salivary protein 


102 


40 


1445 


Y02999 


Homo sapiens 


Fragment of human secreted protein 


101 


66 


1446 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


74 


1447 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


66 


1448 


X76208 


Drosophila 

melanogaster 


protein 3" specific exons 


~Y2r 


~48 




Y86248 




Human secreted protein HCHPF68 
SEQ ID NO: 163. 






1450 


G04091 


Homo sapiens 


Human secreted protein SEQ ID NO* 
8172. 


109 


45 


1451 


S801 19 


Rattus sp. 


reverse transcriptase homolog 


115 


54 


1452 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO.' 
7795. 


70 


63 


"T453 


U49471 


Mus musculus 


\Viscott-AIdrich Syndrome protein 
homolog 


To! 


— 








i ljuuu^/ protein 







-j— 


AC007258 


Arabidopsis" 5 

thaliana 




~To3 




1456 


AF194537 


Homo sapiens 


NAG 13 


208 


52 


1457 


U63542 


Homo sapiens 


FAP protein 


111 


84 


1458 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW 18. 


111 


65 


1459 


AF09093 1 


Homo sapiens 


PRO0483 


84 


43 



167 



\SDOCID: <WO 



0164835A2 I > 



WO 01/64835 PCT/USO 1/0492 7 



SEQ ID 

NO: 


Accession No. 


I Species 


Description '■".>''' 


~ "Smith- 

Waterman 


~~r°7 1 

Identit 

y 


1460 


R95913 


Homo sapiens 


Neural thread protein. 


106 


69 




-J^£|^i£ 


Homo sapiens 


Neural thread protein. 


109 


40 


~T4"6l 




Homo sapiens 


putative pi 50 


237 


"42 


1463 




Homo sapiens 


Npw38-binding protein NpwBP 


97 


37 


1464 


U44838 


Glycine max 


extensin 


97 


33 






Homo sapiens 


dJ39G22. 1 (rearranged L-myc fusion 
sequence (ZN-15 related zinc finger 
protein)) 


121 


45 


1466 


L27428 


Homo sapiens 


reverse transcriptase 


94 


34 


1467 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


117 


58 






Homo sapiens 


Human secreted protein (clone fk31 7- 
3). 


98 


48 




G03798 


Homo sapiens 


Human secreted protein, SEQ JD NO: 
7879. 


123 


55 


1470 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


147 


67 




AF1 09907 


Homo sapiens 


S164 


133 


35 


1472 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


142 


36 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


152 


90 


1474 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


157 


45 " 

~\ 


1475 



U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


398 




1476 


U49974 


Homo sapiens 


mariner transposase 


201 


59 




U49974 


Homo sapiens 


mariner transposase 


206 


60. 


1478 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


117 


72 


1479 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


146 


82 


1480 


U93567 


Homo sapiens 


putative p 1 50 


202 


42 


1481 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


101 


46 


1482 


U87607 


Rattus 
norvegicus 


putative RNA binding protein I 


100 


37 


"T483 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7873. 


124 


75 




G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


108 


62 


1485 


U 15647 


Mus musculus 


reverse transcriptase 


115 


73 


1486 


AF194537 


Homo sapiens 


NAG 13 


132 


42 


1487 


Ml 1902 


Mus musculus 


jroline-rich salivary protein 


118 


40 




G03 133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


102 


64 





AF009668 


multiple 
sclerosis 
associated 
retrovirus 


polyprotein 


no 


48 




A1S.UZJ04Z 


Homo sapiens 


unnamed protein product 


114 


37 


1491 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


133 


50 


1492 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 


184 


47 


1493 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


222 


53 


1494 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


94 


50 


1495 


AF109907 


Homo sapiens 


SI 64 


259 


45 


1496 


R95913 


Homo sapiens 


Neural thread protein. 


110 


51 


1497 


U49973 


Homo sapiens 


ORE I; MER37; putative transposase 
similar to pogo element 


299 


80 



168 



SDOCID: <WO 0164835A2_I_> 
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SEQ JD 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Idcntit 

y 


1498 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


158 


65 


1499 


L26953 


Homo sapiens 


chromosomal protein 


104 


67 


1500 


AF090895 


Homo sapiens 


PRO0117 


145 


68 


1501 


U93572 


Homo sapiens 


p40 


1 15 


42 


1502 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


133 


50 


1503 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588. 


143 


56 


1504 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


119 


69 


1505 


W00838 


Homo sapiens 


Tumour necrosis factor-related gene 
product. 


105 


52 


1506 


AF 109907 


Homo sapiens 


SI 64 


184 


43 


1507 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


288 


67 


1508 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


131 


69 


1509 


AK000241 


Homo sapiens 


unnamed protein product 


167 


72 


1510 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


153 


60 


1511 


S62928 


Homo sapiens 


PRB1M protein precursor 


157 


39 


1512 


AB012223 


Canis familiaris 


ORF2 


1 16 


40 


1513 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


109 


59 


1514 


AF220264 


Homo sapiens 


MOST-1 


108 


80 


1515 


X53581 


Rattus 
norvegicus 


ORF4 


96 


44 


1516 


V00662 


Homo sapiens 


cytochrome oxidase III 


433 


74 


1517 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


263 


70 


1518 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


266 


70 


1519 


M24732 


Homo sapiens 


lam in-like protein 


107 


44 


1520 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


131 


51 


1521 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


81 


68 


1522 


AF194537 


Homo sapiens 


NAG 13 


1 16 


42 


1523 


X92485 


Plasmodium 
vivax 


pval 


85 


42 


1524 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


125 


50 


1525 


AB041228 


Homo sapiens 


G protein-coupled receptor TGR- 1 


220 


100 


1526 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


138 


70 


1527 


U52077 


Homo sapiens 


mariner transposase 


237 


56 


1528 


L27428 


Homo sapiens 


reverse transcriptase 


189 


40 


1529 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


355 


78 


1530 


L13610 


Mus musculus 


IFN-response element binding factor 2 


90 


37 


1531 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


90 


64 


1532 


Ul 1288 


Drosophila 
melanogaster 


diaphanous protein 


138 


38 


1533 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


429 


63 


1534 


L27428 


Homo sapiens 


reverse transcriptase 


249 


55 


1535 


U93570 


Homo sapiens 


putative pi 50 


114 


31 


1536 


AF130089 


Homo sapiens 


PRO2550 


111 


47 



169 



>JSDOCID <WO 0164835A2J_> 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description ' : 


Smith- 
Waterman 


% 
y 


1537 


X55687 


Lycopcrsicon 
esculentum 


extcnsin (class II) 


63 


28 


1538 


U15647 


Mus musculus 


reverse transcriptase 


1 10 


42 


1539 


AK024455 


Homo sapiens 


FLJ00047 protein 


139 


SO 


1540 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


128 


48 


1541 


AC024788 


Gaenorhabditis 
elegans 


Hypothetical protein Y46E12A.d 


80 


46 


1542 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


132 


73 


1543 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1. 


90 


32 


1544 


G02971 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7052. 


71 


63 


1545 


R95913 


Homo sapiens 


Neural thread protein. 


118 


55 


1546 


D38116 


Pan paniscus 


cytochrome c oxidase subunit 1 


218 


78 


1547 


D38112 


Homo sapiens 


cytochrome c oxidase subunit I 


370 


71 


1548 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


119 


65 


1549 


AJ004810 


Zea mays 


cytochrome P450 monooxygenase 


140 


70 


1550 


AF 11 3685 


Homo sapiens 


PRO0974 


115 


47 


1551 


Y8624S 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


172 


75 


1552 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


366 


94 


1553 


W40113 


Homo sapiens 


Human alpha-2(IV) collagen protein. 


117 


59 


1554 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


138 


76. 


1555 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


1 13 


40 


1556 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


502 


89 


1557 


AF 130089 


Homo sapiens 


PRO2550 


98 


55 


1558 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


109 


43 


1559 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


183 


52 


1560 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


180 


52 


1561 


AF202893 


Mus musculus 


Kif21b 


254 


85 


1562 


M63421 


Drosophila 
melanogaster 


csp32 


104 


39 


1563 


G03652 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
7733. 


129 


69 


1564 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


98 


59 


1565 


AJ132106 


Bos taurus 


SCO-spondin 


114 


40 


1566 


AL390114 


Leish mania 
major 


extremely cysteine/valine rich protein 


119 


66 


1567 


AF161356 


Homo sapiens 


HSPC093 


100 


38 


1568 


AF119851 


Homo sapiens 


PRO 1722 


94 


72 


1569 


L27428 


Homo sapiens 


reverse transcriptase 


107 


48 


1570 


X99451 


Lycopersicon 
esculentum 


extensin-like protein DiflO 


104 


32 


1571 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


126 


59 


1572 


X73481 


Drosophila hydei 


mstl03(2) 


105 


41 


1573 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NlD:g8148) 


133 


50 


1574 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


51 



170 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


1 Smith- 
Waterman 

- SC ° re 


% 

ldentit 

y 








8144. 






1575 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


323 


— 


1576 


AF062008 


Caenorhabditis 
elegans 


unknown 


111 


54 


1577 


X92485 


Plasmodium 


pval 


81 


57 


1578 


U93570 


Homo sapiens 


p40 


102 


33 


1579 


AF090944 


Homo sapiens 






59 


1580 


AL 137798 


Homo sapiens 


dJ1182A14.5.1 (novel gene (isoform 
1)) 


182 


53 


1581 


X92485 


Plasmodium 


pval 


120 


44 


1582 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


160 


47 


1583 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


54 


1584 


G00637 


~H : 

omo sapiens 


Human secreted protein, SEQ ID NO: 


— — 

119 


52 


1585 


Y02749 


Homo sapiens 


Human secreted protein encoded by 

gene 100 clone HNFIU96. 


— 


— 


1586 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


152 


60 


1587 


D28482 


Homo sapiens 


SCR2 


390 


83 


1588 


AFO 1 0 1 44 


Homo salens 


neurons] thread protein AJD7c-NTTP 


~T^i 




1589 


AF010144 


omo sapiens 


neuronal thread protein AD7c-T*>J I P 




— 


1590 


AF1 1 8078 


riomo sapiens 


PPOl fi A Q 
i Ix L/ 1 o*f o 


— — — 

11° 


59 




G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
5066. 




78 


1592 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO! 


118 


80 


1593 


U93571 


Homo sapiens 


p40 


170 


77 


1594 


Y12713 










j 595 




Homosa iens 


re verse^ franrcr?ptese >yPr ° te ' ~ '' 


~T^S 





1596 


X97707 


Pongo pygmaeus 


stopcodon created by 


139 


61 


1597 


X98710 


Homosa iens 


COL1A1 and PDGFB fusion transcript 


~T07 


— 


1598 


G02480 


Homo sapiens 


Human secreted protein SEQ ID NO* 
6561. 


""97 


"38" 


1599 


AF2 10651 


Homo sapiens 


NAG 18 


86 


89 


1600 


G00637 




Human secreted protein, SEQ ID NOi 
4718. 






1601 


JO3770 










1602 


AF1 19901 


Homo sapiens 


PR0283 1 


119 


56 


1603 


AL03 1 673 


Homo sapiens 


a joy*a i<*. i [ru lAiivc novel is_K>vt> 
ftage^d^maais) 1 1 § C2H2 tyPC 


2j3 


44 


1604 


G03714 


Homo sapiens 


Human secreted protein SEQ ID NO* 
7795. 


130 


— 


1605 


Q03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


89 


44 


T606 


D88461 


p — 

Kattus rattus 


IN" Wnor 


— — 


— 


1607 


AF090942 




PRO0657 


07 


6 


1608 


U35730 


Mus musculus 




105 


34 


1609 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


108 


45 


1610 


B06334 


Homo sapiens 


Human subtil isin-kexin isoenzyme 1. 


474 


84 


1611 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


101 


90 


1612 


AK024455 


Homo sapiens 


FLJ00047 protein 


83 


55 


1613 


D86853 


Catharanthus 


extensin 


123 


39 



171 
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SEQID 
NO: 


Accession No. 


Species 


Description .. 


Smith- 
Waterman 


' % 
Identit 






roseus 








1614 


AF1 19851 


Homo sapiens 


PRO 1722 






1615 


Y01158 


Homo sapiens 


clone HCACJ81. 


--j4| 


-~ 

57 


1616 


AF194537 


Homo sapiens 


NAG 13 


154 


63 


1617 


AF1I9851 


Homo sapiens 


PRO 1 722 * 




62 


1618 


W48351 


Homo sapiens 


BCRB2. 


-— 




1619 


AK024455 


Homo sapiens 


FLJ00047 protein 




60 


1620 


AF2 17973 


Homo sapiens 


unknown 


116 


67 


162] 


AF000298 


Gaenorhabditis 
elegans 


weak similarity to collagens^ glycine- 
and proline-rich 




41 


1622 






salivary proline-rich protein i 


108 


40 


1623 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


44 


1624 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


322 


62 


1625 


U49973 


Homo sa iens 

omo sapiens 


ORF1; MER37; putative transposase 


356 


72 


1626 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 


113 


65 


1627 


X92485 


Plasmodium 


pval 


90 


45 


1628 


AF090895 


Homo sapiens 


PRO0 1 1 7 




61 


1629 


AF116661 


Homo sapiens 


PRO 143 8 


87 


54 


1630 


M13100 


. 

Ho'm'cfsa^ens 


unknown protein 


109 


76 


1631 


G03203 


omo sapiens 


Human secreted protein, SEQ ID NO; 


147 


60 


1632 


AF1 19851 


Homo sapiens 


PRO 1722 


107 


70 


1633 


M64792 


norvegicus 


salivary proline-rich protein 


1 09 


46 


1634 


L27428 


Homo sapiens 


reverse transcriptase 


109 


38 


1635 


AF1 18082 




riw-j i yoz 


80 


40 


1636 


R95913 


Homo sapiens 


Neural thread protein. 


118 


88 


1637 


G 02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


79 


60 


1638 


U93570 


Homo sapiens 


putative pi 50 


128 


54 


1639 


U49973 




ORF1 ; MER37; putative transposase 
similar to pogo element 


127 


69 


1640 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


171 


68 


1641 


AF194537 


Homo sapiens 


NAG 13 


140 


66 


1642 


G 02493 


Homo sapiens 


6574. 


101 


50 


1643 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


117 


33 


1644 


AF010144 




T7K A : a~F^5 — STF? 

neurorjal thread protein AD7c-NTP 


123 


50 


1645 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


96 


70 


1646 


L27428 


Homo sapiens 


reverse transcriptase 


86 


84 


1647 


X92485 


Plasmodium 


pval 


137 


40 


1648 


U 15647 


Mus musculus 


reverse transcriptase 


93 


68 


1649 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


131 


41 


1650 


AF1 16712 


Homo sapiens 


PR02738 


107 


57 


1651 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


139 


72 


1652 


X05472 


Rattus 


ORF3 


84 


51 



172 



WO01/64835 PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 






norvegicus 








1653 


GO0588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


122 


71 


1654 


U93566 


Homo sapiens 


P 40 


117 


52 


1655 


AF217536 


Homo sapiens 


truncated mevalonate kinase 


141 


70 


1656 


AF090895 


Homo sapiens 


PRO0117 


125 


60 


1657 


X92485 


Plasmodium 
vivax 




1 14 


45 


1658 


AFO 1 0 1 44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


192 


61 


1659 


Y86473 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:388. 


73 


30 


1660 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


331 


74 


1661 


G0j438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


127 


69 


1662 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


312 


67 


1663 


S801 19 


Rattus s 




~~99 


~~59 


1664 


U43360 


Peromyscus 


reverse transcriptase 


106 


45 


1665 


M76729 


Homo sapiens 


pro-alpha- 1 type V collagen 


172 


47 


1666 


W48351 


Homo sapiens 


BCRB2. 


97 




1667 


AF 1693 88 


Mus musculus 


alpha 4 collagen IV 


84 


38 




W4S351 




BCRB2. P 


84 


66 


1669 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


428 


80 


1670 


M29622 


IVIus musculus 


open reading frame 2 


74 


46 


1671 


W90838 


Homo sapiens 


Human lymphocyte targeted peptide 
#6. 


98 


100 


1672 


GO0328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


55 


58 


1673 


AF09093 1 


Homo sapiens 


PRO0483 


72 


39 


1674 


AF051782 




diaphanous 1 


1 16 


49 


1675 


U57361 


Rattus SaP ' enS 

norvegicus 


collagen XII alpha 1 


108 


48 


1676 


AF1 82844 


Homo sapiens 


VPS28 protein 


395 


95 


1677 


L27428 


Homo sapiens 


reverse transcriptase 


189 


47 


1678 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


142 


53 


1679 


LJ93565 


Homo sapiens 


putative pi 50 


214 


40 


1680 


AK002129 


Homo sapiens 


unnamed protein product 


128 


57 


1681 


X03145 


Homo sapiens 


pot. ORF V 


93 


48 


1682 


X63005 


Mus musculus 


proline-rich protein 


108 


38 


1683 


AF118082 


Homo sapiens 


PRO 3 902 


117 


42 


1684 


R13556 


Homo sapiens 


oncoprotein. 


157 


58 


1685 


R95913 


Homo sapiens 


Neural thread protein. 


92 


66 


1686 


Q02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


111 


61 


1687 


X61296 


Rattus 
norvegicus 


open reading frame 2 


104 


38 


1688 


ABO 12223 


Canis familiaris 


ORF2 


98 


39 


1689 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


278 


72 


1690 


U52077 


Homo sapiens 


mariner transposase 


175 


56 


1691 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


83 


46 



173 



WO 01/64835 



PCT/US01704927 



NO: 




Species 


Descnption : 


Waterman 
Score 


Identit 

y 


1692 


AF061128 


Plasmodium 
falciparum 


merozoite surface protein 1 


85 


44 


1693 


X77722 


Homo sapiens 


interferon alpha/beta receptor 


89 


60 


1694 


M13100 


Rattus 
norvegicus 


unknown protein 


94 


40 


1695 


AF202635 


Homo sapiens 


PP1200 


114 


60 


1696 


AK001116 


Homo sapiens 


unnamed protein product 


127 


53 


1697 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


162 


53 


1698 


AF 118078 


Homo sapiens 


PRO 1848 


93 


42 


1699 


X92485 


Plasmodium 
vivax 


pval 


147 


49 


1700 


M63819 


Plasmodium 
falciparum 


malaria antigen 


101 


64 


1701 


AF090930 


Homo sapiens 


PRO0478 


146 


76 


1702 


AB009993 


Mus musculus 


collagen al(V) 


94 


40 


1703 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


169 


66 


1704 


AF1 30089 


Homo sapiens 


PRO2550 


145 


38 


1705 


X83413 


Human 
herpesvirus 6 


U88 


113 


58 


1706 


S60088 


Homo sapiens 


putative adhesion molecule=ADMLX 


151 


86 


1707 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


176 


52 


1708 


AF1 30079 


Homo sapiens 


PR02852 


174 


41 


1709 


Y28682 


Homo sapiens 


Human pp392_3 secreted protein. 


557 


99 


1710 


Ml 4423 


Mus musculus 


pro-alpha-l type I collagen 


112 


34 


1711 


D13623 


Rattus sp. 


p34 protein 


128 


45 


1712 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


84 


41 


1713 


X97675 


Homo sapiens 


plakophilin 2b 


121 


60 


1714 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7534. 


102 


51 


1715 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


171 


48 


1716 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


99 


63 


1717 


S80119 


Rattus sp. 


reverse transcriptase homolog 


140 


57 


1718 


L27428 


Homo sapiens 


reverse transcriptase 


103 


46 


1719 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


154 


78 


1720 


AF130089 


Homo sapiens 


PRO2550 


106 


35 


1721 


G03652 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7733. 


147 


49 


1722 


G03703 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7784. 


111 


72 


1723 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


132 


60 


1724 


X92485 


Plasmodium 
vivax 


pval 


96 


62 


1725 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


249 


52 


1726 


M81321 


Macaca 
fascicularis 


proline-rich protein 


132 


45 


1727 • 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588. 


119 


69 


1728 


U93564 


Homo sapiens 


p40 


129 


58 


1729 


U93574 


Homo sapiens 


putative p 1 50 


113 


76 


1730 


AF130089 


Homo sapiens 


PRO2550 


136 


61 


1731 


L05608 


Cercopithecine 


glycoprotein gl 


100 


46 



174 



WO 01/64835 



PCT/USO 1/04927 



SEQID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Idcntit 

y 














1732 


U 15647 


Mus musculus 


reverse transcriptase 


138 


36 






Homo sapiens 


putative pi 50 




43 


1734 


G037I4 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
J221i . 


155 


64 


—yf^l 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


102 


64 


1736 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


1 18 


64 


1737 


G00490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


58 


1738 




— : 

Homo sapiens 




._ 

.} 


55 




Ul 1288 


Drosophila 
melanomas ter 


diaphanous protein 


149 


46 


1740 


L27428 


Homo sapiens 


reverse transcriptase 


108 


33 




G 024 85 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 










Mus musculus 


Pro-Pol-dUTPase polyprotein 


165 


59 


1743 


AC003682 


Homo sapiens 


R28830 1 


179 


63 






Volvox carteri 


extensin 


173 


49 


~T7"i3 — 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


1 14 


62 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


89 


41 


1747 
— - — 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


1 15 


42 






Homo sapiens 


putative pi 50 


125 


37 


1749 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 




45 




Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 




53 






Homo sapiens 


neuronal thread protein AD7c-NTP 




80 


1752 


L27428 


Homo sapiens 


reverse transcriptase 


133 


42 


-y^| 


U22^76 


Homo sapiens 


putative pi 50 


126 


57 






Homo sapiens 


alternatively spliced product using exon 
13A 


134 


50 






Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


175 


76 




ftruuuoio 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


1 13 


60 




U54788 


Mus musculus 


Wiskott-Aldrich Syndrome Protein 






1758 


Y86579 


Homo sapiens 


Human gene 92-encoded protein 
fragment, SEQ ID NO:496. 


109 


88 


T759 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7001. 


123 


73 


1760 


Y86579 


Homo sapiens 


Human gene 92-encoded protein 
fragment, SEQ ID NO:496. 


1 13 


92 


1761 


Y86579 


Homo sapiens 


Human gene 92-encoded protein 
fragment, SEQ IDNO:496. 


110 


81 


"T762 


UUoUZU 


— r 

Mus musculus 


collagen pro-alpha-1 type I chain 


105 


34 


1763 


G03790 


Homo sapiens - 


Human secreted protein, SEQ ID NO: 
7871. 


90 


45 


1764 


U93569 


Homo sapiens 






,. 


1765 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


116 


45 


1766 


X02873 


Daucus carota 


put. precursor 


112 


47 


1767 


X92485 


Plasmodium 
vivax 


pval 


100 


45 


1768 


R95913 


Homo sapiens 


Neural thread protein. 


96 


65 



175 



WO 01/64835 



PCT/U SO 1/04927 



SEQID 


Accession No. 




Description > -' j» 


Smi(h- 


% 


NO: 






Waterman 


Identit 










Score 


y 


1769 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 14 


51 








4718. 






1770 


X97675 


Homo sapiens 


plakophilin 2b 


115 


70 


1771 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


58 








4718. 






1772 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


85 








6613. 






1773 


AF 130089 


Homo sapiens 


PRO2550 


158 


69 


1774 


U23552 


Ailuropoda 


cytochrome b 


206 


78 






melanoleuca 








1775 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


98 


53 


1776 


AF1 36715 


Homo sapiens 


PR02829 


134 


67 


1777 


AC008054 


Leishmania 


L8453.1 


114 


28 






major 








1778 


AF037364 


Homo sapiens 


paraneoplastic neuronal antigen MAI 


397 


73 


1779 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


132 


60 


1780 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


168 


64 








6561. 






1781 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


80 


36 








similar to pogo element 






1782 


AF 1 744 82 


Homo sapiens 


polycomb 3 


133 


46 


1783 


U93563 


Homo sapiens 


putative pi 50 


196 


70 


1784 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


59 


1785 


AF 19453 7 


Homo sapiens 


NAG 13 


1 14 


38 


1786 


U21123 


Drosophila 


ena polypeptide 


120 


44 






melanogaster 








1787 


AF200187 


cercopithicine 


EBNA2-like protein 


108 


30 






herpesvirus 15 








1788 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


56 








6953. 






1789 


M64792 


Rattus 


salivary proline-ricb protein 


128 


40 






norvegicus 








1790 


X92485 


Plasmodium 


pval 


98 


51 






vivax 








1791 


AF0IO144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


114 


34 


1792 


U49973 


Homo sapiens 


OPvFl; MER37; putative transposase 


197 


81 








similar to pogo element 






1793 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


71 








4409. 






1794 


AF 104923 


Homo sapiens 


putative transcription factor 


142 


59 


1795 


W48351 


Homo sapiens 


Human breast cancer related protein 


133 


60 








BCRB2. 






1796 


AC003113 


Arabidopsis 


F2401.6 


57 


62 






thaliana 








1797 


M22332 


Homo sapiens 


unknown protein 


118 


29 


1798 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


100 


71 








gene 44 clone HTDAD22. 






1799 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


154 


37 






major 








1800 


U93570 


Homo sapiens 


P 40 


103 


56 


1801 


X99452 


Lycopersicon 


extensin-like protein Dif54 


101 


28 






esculentum 








1802 


L27428 


Homo sapiens 


reverse transcriptase 


102 


34 


1803 


L27428 


Homo sapiens 


reverse transcriptase 


141 


43 


1804 


M18933 


Mus musculus 


alpha- 1 type-Ill collagen precursor 


118 


30 


1805 


X92485 


Plasmodium 


pval 


106 


67 


1806 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


65 








7492. 







176 



01 64835A2J_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identic 

y 




G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


105 


33 






Homo sapiens 


plakophilin 2b 


154 


61 


1809 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


130 


53 


1810 


X92485 


Plasmodium 


pval 


133 


54 


1811 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


66 


1812 


Y17833 


Human 
endogenous 
retrovirus K 


env protein 


119 


81 


1813 


AF1 19851 


Homo sapiens 


PRO 1722 


130 


58 


1814 


X53581 


Rattus 
norvegicus 


ORF4 


158 


50 


1815 


G03473 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7554. 


111 


74 


1816 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


153 


68 


1817 


M19155 


Plasmodium 
falciparum 


S-antigen precursor 


164 


50 


1818 


AF1 18082 


Homo sapiens 


PRO 1 902 


90 


75 


1819 


W40353 


Homo sapiens 


Human unspecified protein from 
US5702907. 


110 


52 


1 820 


U9356j 


Homo sapiens 


putative pi 50 


1 14 


35 






Caenorhabditis 
elegans 


proline rich 


95 


52 


1822 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


154 


45 






Homo sapiens 


Human breast cancer related protein 
BCRB2. 


131 


53 


1824 


AF130089 


Homo sapiens 


PRO2550 


128 


40 


1825 


AF090944 


Homo sapiens 


PRO0663 


103 


45 


1826 


AC003113 


Arabidopsis 
thaliana 


F2401.18 


107 


40 


1827 


AF1 94537 


Homo sapiens 


NAG 13 


85 


28 


1828 


AF009668 


multiple 
sclerosis 
associated 
retrovirus 


polyprotein 


185 


41 


1829 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


155 


42 


18^0 


X69465 


Sus scrofa 


ryanodine receptor 1 


516 


86 


1831 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
45 ^5. — _ _ 


108 


40 


— 5^ 

llll 


uooyoo 


: 

Homo sapiens 


rapamycin associated protein FRAP2 


434 


89 




Ml 9 155 


Plasmodium 
falciparum 


S-antigen precursor 




32 


1834 




Mus musculus 


synapsin lb 


98 


33 




AJS-UjijUUj 


Homo sapiens 


unnamed protein product 


393 


81 


1836 


Y41740 


Homo sapiens 


Human PRO701 protein sequence. 


429 


78 


1837 


M36913 


Zea mays 


cell wall protein (put.); putative 


72 


35 


1838 


X63005 


Mus musculus 


proiine-rich protein 


98 


40 


1839 


X83413 


Human 
herpesvirus 6 


U88 


149 


45 




AF134304 


Homo sapiens 




87 


37 


1841 


AC024772 


Caenorhabditis 
elegans 


contains similarity to Mus musculus 
alpha-NAC, muscle-specific form 
(GB:U48363) 


131 


25 


1842 


AB002366 


Homo sapiens 


KIAA0368 


153 


75 


1843 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


140 


47 



177 



WO 01/64835 



PCT/US01/04927 



NO: 








Waterman 


fdentit 

y 


1844 


AL035526 


Arabidopsis 
thaliana 


extensin-like protein 


93 




1845 


D26156 


Homo sapiens 


hSNF2b 


91 


34 


1846 


Ml 4228 


Gallus gallus 


c-beta-3 beta-tubulin 


598 


83 


1847 


AK022217 


Homo sapiens 


unnamed protein product 


97 


56 


1848 


AJ250042 


Homo sapiens 


Rab5 GDP/GTP exchange factor 
homologue 


174 


83 


1849 


W23949 


Homo sapiens 


Human phosphoinositide 30H-kinase 
plOl subunit. 


143 


28 


1850 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


101 


63 


1851 


AB017114 


Homo sapiens 


AD 3 


113 


100 


1852 


D00570 


Mus musculus 


open reading frame (251 A A) 


174 


55 


1853 


A675 30 


Mus musculus 


MUS MUSCULUS GENOMIC DNA 
CONTAINING N ALLELE OF FV1 
GENE. 


133 


42 


1854 


TJ49974 


Homo sapiens 


manner transposase 


214 


82 


1855 


U93569 


Homo sapiens 


p40 


95 


31 


1856 


D89729 


Homo sapiens 


CRM1 protein 


475 


90 


1857 


AF090895 


Homo sapiens 


PRO0117 


89 


36 


1858 


AFO 15926 


Homo sapiens 


ezrin-radixin-moesin binding 
phosphoprotein-50 


117 


73 


1859 


D13721 


Gallus gallus 


NF-kB p65 subunit 


223 


56 


1860 


K03204 


Homo sapiens 


salivary proline-rich protein precursor 


111 


32 


1861 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


195 


52 


1862 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


331 


69 


1863 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


80 


35 


1864 


L27428 


Homo sapiens 


reverse transcriptase 


84 


51 


1865 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


106 


46 


1866 


U53585 


Mycobacterium 
avium 


fibronectin attachment protein 


86 


36. 


1867 


AF255446 


Crypthecodinium 
cohnii 


Dip 1 -associated protein C 


134 


45 


1868 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


81 


63 


1869 


AP000001 


Pyrococcus 
horikoshii 


235aa long hypothetical protein 


108 


40 


1870 


M13100 


Rattus 
norvegicus 


unknown protein 


121 


53 


1871 


X67863 


Mus musculus 


T2 


101 


35 


1872 


S80119 


Rattus sp. 


reverse transcriptase homolog 


151 


43 


1873 


W73633 


Homo sapiens 


Human secreted protein clone. 


140 


44 


1874 


U57053 


Homo sapiens 


myosin-ID 


203 


82 


1875 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


92 


54 


1876 


M21097 


Homo sapiens 


CD19 differentiation antigen 


432 


79 


1877 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


150 


52 


1878 


U25281 


Rattus 
norvegicus 


SH3 domain binding protein 


108 


36 


1879 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


117 


48 


3 880 


X73113 


Homo sapiens 


fastMyBP-C 


599 


77 


1881 


AX028128 


Homo sapiens 


unnamed protein product 


162 


43 


1882 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


62 



178 



vJSDOCID:<WO 0164835A2 I 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


"I Smiih-' 




NO: 








Watermnn 


Identit 










Score 


y 








7870. 






1883 


Y00664 


Homo sapiens 


open reading frame 1 (AA 1 - 86) 


74 


34 


1884 


U86587 


Mus musculus 


phosphatidylinositol 3-kinase catalytic 


204 


77 








subunit pi 10 delta 






1885 


X67337 


Homo sapiens 


Human pre-mRNA cleavage factor I 68 


148 


37 








kDa subunit 






1886 


Y13829 


Homo sapiens 


MBNL protein 


117 


63 


1887 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO:. 


119 


80 








6012. 






1888 


R95913 


Homo sapiens 


Neural thread protein. 


91 


64 


1889 


L39059 


Homo sapiens 


transcription factor SL 1 


92 


34 


1890 


M69297 


Homo sapiens 


ORF3 


169 


48 


1891 


R95913 


Homo sapiens 


"Neural thread protein. 


91 


62 


1892 


Z28201 


Saccharomyces 


ORF YKX202W 


95 


51 






cerevisiae 








1893 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


51 








4508. 






1894 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


183 


73 








similar to pogo element 






1895 


Y 10055 


Homo sapiens 


phosphoinositide 3-kinase 


564 


84 


1896 


AF093775 


Mus musculus 


alpha-actinin 3 


375 


85 


1897 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


82 


66 








7878. 






1898 


M21904 


Homo sapiens 


4F2 heavy chain antigen 


386 


80 


1899 


AB028997 


Homo sapiens 


KIAA 1074 protein 


145 


38 


1900 


U97553 


murid 


unknown 


98 


41 






herpesvirus 4 








1901 


AB011142 


Homo sapiens 


KIAA0570 protein 


209 


95 


1902 


AF 194537 


Homo sapiens 


NAG13 


161 


55 


1903 


U93564 


Homo sapiens 


putative pi 50 


142 


33 


1904 


M29399 


Homo sapiens 


erythrocyte membrane protein band 4.2 


413 


90 


1905 


Y27400 


Homo sapiens 


Human P450 reductase functional 


294 


67 








fragment sequence. 






1906 


Y28503 


Homo sapiens 


HGFH3 Human Growth Factor 


167 


100 






Homologue 3. 






1907 


AF 128625 


Homo sapiens 


CDC42-binding protein kinase beta 


509 


89 


1908 


AB029147 


Cucumis sativus 


expressed in cucumber hypocotyls 


98 


41 


1909 


X13783 


Homo sapiens 


alpha- 1 type 2 collagen (714 AA) 


87 


43 


1910 


AF240630 


Mus musculus 


IQ motif containing GTPase activating 


330 


48 








protein 1 






1911 


AF 129075 


Homo sapiens 


T-COMPLEX PROTEIN I, THETA 


511 


90 








SUBUNIT (TCP- 1 -THETA) 






1912 


AFO 11450 


Mus musculus 


type XV collagen 


87 


28 


1913 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


124 


75 








encoded by gene 17. 






1914 


AF171230 


Vigna 


phosphatidic acid phosphatase beta 


112 


53 






unguiculata 








1915 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


100 


47 


1916 


M94131 


Homo sapiens 


mucin 


97 


37 


1917 


X13885 


Nicotiana 


extensin (AA 1-620) 


120 


34 






tabacum 








1918 


AF 186605 


Homo sapiens 


MLL2 protein 


115 


29 


1919 


M12130 


Mus musculus 


RNA polymerase II 


498 


83 


1920 


AL049794 


Homo sapiens 


dJ777L9.1 (novel protein similar to 


514 


90 








mouse kinesin-like proteins KIF1A and 












KIF1B) 






1921 


D83703 


Homo sapiens 


peroxisome assembly factor-2 


233 


64 


1922 


Y11922 


Homo sapiens 


Human 5' EST secreted protein SEQ ID 


162 


77 








No: 522. 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 

y 


1923 


ACO02481 


Homo sapiens 


" ' similar to nitrogen permease regulator 
similar to P39923 (PID:g730170), 
match to AA233630 (NID:gl 856833) 
and AA399402 (NID:g2053 147) 


- 


79 


1924 


D82060 


Homo sapiens 


membrane protein with histidine rich 
charge clusters 


115 


40 


1925 


U49974 


Homo sapiens 








1926 


S80119 


Rattus sp. 


reverse transcriptase homolog 


104 


40 


1927 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


79 


68 


1928 


S79639 


Homo sa iens 

s piens 


EXT 1 — putati ve tumour 
exostoses candidate gene 


— — 

430 


■ 

88 


1929 


X58063 


Brugia pahangi 


major protein component of the 
micofilarial sheath 


104 


43 


1930 


AF1 19855 


Homo sapiens 


PRO 1847 


T32 


~7a 


1931 


AJ005577 


Homo sapiens 


6-phosphofructo-2-lcinase 


326 


— 


1932 


AF020261 


Santalum album 


proline rich protein 


93 


33 


1933 


AJ272204 


Homo sapiens 




~7T^ 


""^5 


1934 


AL451017 


Neurospora 


related to I Jl 91V1AT I MI IP! FAR 
RIBONUCLEOPROTEIN C 






1935 


A61971 


unidentified 


MCSP 


328 


79 


1936 


U93564 


Homo sapiens 








1937 


U97553 


murid 


unknown 


113 


38 


1938 


AF1 94537 


Homo sapiens 


NAG 13 


90 


37 


1939 


AF226044 


Homo sapiens 




403 


85. 


1940 




Human 
herpesvirus 6 




303 


50 


1941 


AJ007628 


■ 


ens. Cuannei i 


1 12 


38 


1942 


AF 104328 


Arabidopsis 
thai i an a 


cell wall plasma membrane linker 

protein homolog 


Tr 


35 


1943 


AF 130089 


Homo sapiens 


PRO2550 


93 


63 


1944 


G03133 


Homo sapiens 


Human secreted protein SEQ ID NO* 
7214. 


102 


34 


1945 


M19419 


Mus musculus 




— — 


— — 


1946 


U.573I6 


Homo sapiens 


histone acetyltransferase 


— — 


-~ 


1947 


AJL1 63302 


Homo sapiens 


human type XV1I1 collagen 


~79" 


— | 


1948 


AB020746 


Arabidopsis 
thaliana 


protein kinase-liice protein 


ui 


— 


1949 


AJ011738 


Homo sapiens 


Inilb 


~209 


— 


1950 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


155 


48 


1951 


AF1 19569 


Homo sapiens 


patched 2 




-J? 


1952 


AB002107 


Homo sapiens 


hPer 


"775 




1953 


X98834 




zinc finger protein HsaI2 


— — 

430 


— 

70 


1954 


Y19641 


Homo sapient 


A"-" 3 Jy irom WU77//i4j. 




64 


1955 


U93569 


Homo sapiens 


putative pi 50 


~TT7 




1956 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8 1 72. 


85 


53 


1957 


AB013729 


Mus musculus 


semaphorin Y 


106 


38 


1958 


AC005360 






338 


68 


1959 


AJ223075 


Homo sapiens 


TRIP protein 


598 


95 


1960 


AC004022 


Homo sapiens 


serum paraoxonasearylesterase 3 


147 


62 


1961 


AF076776 


Drosophila 
melanogaster 


hehcase DOMINO A 


157 


45 


1962 


AF265555 


Homo sapiens 


ubiquitin-conjugatingBIR-domain 
enzyme APOLLON 


587 


77 


1963 


AF229642 


Mus musculus 


DXImx46e protein 


127 


91 



180 



ISDOCID: <WO 01S4B35A2 I > 



WO 01/64835 



PCT/USO 1/0492 7 



NO: 


Accession No. 




Description 


Waterniiin 


Identit 

y 


1964 


U72520 


Mus musculus 


mena protein 


89 


31 


1965 


W40309 


Homo sapiens 


Human ITAK protein. 


179 


32 


1966 


AJ388557 


Canis familiaris 


zinc finger protein 


826 


56 


1967 


Y92515 


Homo sapiens 


Human OXRE-12. 


224 


53 


1968 


Y17832 


Human 
endogenous 
retrovirus K 


pol protein 


187 


49 


1969 


Y41245 


Homo sapiens 


Human Y218 protein. 


220 


78 


1970 


AB052738 


Sus scrofa 


Smad3 


366 


85 


1971 


AB007644 


Arabidopsis 
th a liana 


contains similarity to phytocyanin/early 
nodulin-like protein~gene_id:K19P17.3 


106 


32 


1972 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


122 


57 


1973 


AL3 57472 


Homo sapiens 


VPS33B 


112 


31 


1974 


AF151902 


Homo sapiens 


CGI- 144 protein 


112 


95 


1975 


AL 13 7260 


Homo sapiens 


hypothetical protein 


148 


92 


1976 


U89505 


Homo sapiens 


HIark 


408 


89 


1977 


U67328 


Mus muscutus 


NIPI-like protein 


168 


73 


1978 


AK026435 


Homo sapiens 


unnamed protein product 


601 


94 


1979 


Y14318 


Homo sapiens 


peroxisomal ABC-transporter 


507 


96 


1980 


U63630 


Homo sapiens 


MCM4 


570 


90 


1981 


AF 118090 


Homo sapiens 


PRO2044 


154 


84 


1982 


AF016370 


Homo sapiens 


U4/U6 small nuclear ribonucleoprotein 
hPrp3 


422 


63 


1983 


AB011154 


Homo sapiens 


KJAA05 82 protein 


420 


80 


1984 


ABO 11422 


Homo sapiens 


Trad 


201 


67 


1985 


AL390156 


Homo sapiens 


hypothetical protein 


230 


97 


1986 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


163 


44 


1987 


Ml 2523 


Homo sapiens 


alloalbumin Venezia 


411 


91 


1988 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7077. 


145 


63 


1989 


Y36156 


Homo sapiens 


Human secreted protein #28. 


107 


80 


1990 


AF161356 


Homo sapiens 


HSPC093 


156 


57 


1991 


G03443 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7524. 


132 


72 


1992 


AF1 19851 


Homo sapiens 


PRO 1722 


126 


52 


1993 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


228 


60 


1994 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


113 


40 


1995 


G02469 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6550. 


118 


58 


1996 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


137 


57 


1997 


X75068 


Bos taurus 


plasmalemmal porin 


85 


85 


1998 


Y64869 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
ID NO: 1030. 


92 


80 


1999 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


275 


79 


2000 


AF1 18078 


Homo sapiens 


PRO 1848 


111 


57 


2001 


S80119 


Rattus sp. 


reverse transcriptase homolog 


120 


61 


2002 


AB011110 


Homo sapiens 


KIAA0538 protein 


146 


73 


2003 


M15530 


Homo sapiens 


B-cell growth factor 


94 


64 


2004 


AF225918 


Mus musculus 


intestinal cell kinase 


216 


77 


2005 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


125 


85 


2006 


D84391 


Mus musculus 


reverse transcriptase 


135 


38 


2007 


AF090930 


Homo sapiens 


PRO0478 


101 


86 



181 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 


2008 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


156 


66 


2009 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


165 


49 


2010 


AF 113685 


Homo sapiens 


PRO0974 


124 


63 


2011 


AF130079 


Homo sapiens 


PR02852 


141 




2012 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


94 


51 


2013 


AF090944 


Homo sapiens 


PRO0663 


67 


--4- — 


2014 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA3 5 63 8. 




58 


2015 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 






2016 


G02485 


Homo sapiens 


Human secreted protein SEQ ID NO* 
6566. 




78 


2017 


L26953 


Homo sapiens 






74 


2018 


G03646 


Homo sapiens 


H^mTn secreted protein SEQ ID NO — 

7727. 


~6T~ 


57 


2019 


G03646 


Homo sapiens 


7727. 


64 


42 


2020 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 


334 


63 


2021 


L26953 


Homo sapiens 


chromosomal protein 


112 


64 


2022 


W48351 


Homo sapiens 


BCRB2. 




69 


2023 


U93569 


Homo sapiens 


putative pi 50 






2024 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


80 


72 


; 2025 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


142 


71 


2026 


Y76198 


Homo sapiens 


gene 75. 




66 


2027 


AJ242540 


Volvox carteri f. 
nagariensis 


HRGP 




73 


2028 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


159 


80 


2029 


AF 19453 7 


Homo sapiens 


NAG13 




57 


2030 


L26251 


Trypanosoma 


CR5 


110 


41 


2031 


AF197913 


Helicoverpa 
armigera nuclear 
polyhedrosis 


basic DNA-bindmg protein BDBP 


149 


55 


2032 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


115 


80 


2033 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


157 


82 


2034 


G02493 


Homo sapiens 


Human secreted protein SEQ ID NO' 
6574. 




75 


2035 


G03646 


Homo sapiens 


7727. 




67 


2036 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO* 
4409. 


137 


71 


2037 


AF 13 0089 


Homo sapiens 


PRO2550 


146 


55 


2038 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
5327. 


81 


73 


2039 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


66 


2040 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


159 


68 



182 
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SEQID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Score™*" 


% 

Identit 


2041 


B01372 


Homo sapiens 


Neuron-associated protein. 


95 


79 


2042 


V01555 


Human 
herpesvirus 4 


BYRF1, encodes EBNA-2 (Dambaugh 
et al, 1 984 ; Dillner et al, 1 984) 


108 


77 


2043 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6596. 


125 


65 


2044 


AF1 18086 


Homo sapiens 


PRO 1992 


103 


64 


2045 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


146 


64 


2046 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1. 


108 


51 


2047 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


133 


68 


2048 


AL390114 


Leishniania 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


155 


64 


2049 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


125 


53 


2050 


AL3901 14 


Leishmania 
major 


extremely cysteine/valine rich protein 


112 


56 


2051 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


70 


62 


2052 


X61045 


Hydra sp. 


mini-coDsgcn 


99 


46 


2053 


X83413 


Human 
herpesvirus 6 


U88 


148 


73 


2054 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


75 


63 


2055 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


104 


75 


2056 


XI 6524 


Dictyostelium 
discoideum 


coding region (AA 1 -437) 


104 


51 


2057 


R95913 


Homo sapiens 


Neural thread protein. 


83 


77 


2058 


AF1 18080 


Homo sapiens 


PRO1880 


126 


61 


2059 


AJ276003 


Homo sapiens 


GAR1 protein 


136 


59 


2060 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


120 


70 


2061 


X01918 


Drosophila 
melanogaster 


salivary gland glue protein 


179 


40 


2062 


U00029 


Saccharomyces 
cerevisiae 


Yhr217cp 


110 


46 


2063 


AF130051 


Homo sapiens 


PRO0898 


143 


80 


2064 


AF006061 


Homo sapiens 


placental growth hormone isoform 
hGH-V3 


121 


85 


2065 


G 02409 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6490. 


129 


66 


2066 


U82303 


Homo sapiens 


unknown 


92 


68 


2067 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


368 


79 


2068 


L27428 


Homo sapiens 


reverse transcriptase 


186 


5S 


2069 


AF130051 


Homo sapiens 


PRO0898 


164 


73 


2070 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


271 


73 


2071 


AF 130051 


Homo sapiens 


PRO0898 


110 


56 


2072 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


117 


67 


2073 


AF157706 


Human 

herpesvirus 6B 


B4 


104 


49 


2074 


AL049608 


Arabidopsis 
thaliana 


extens in-like protein 


120 


41 


2075 


AP002460 


Arabidopsis 
thaliana 


gene_id:FlD9.26~unknown protein 


150 


27 


2076 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


130 


85 



183 



NSDOCID: <WO_ 



_016483SA2J_> 



WO 01/64835 



PCT/US01/04927 



SEQID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2077 


Ml 1897 


— Mus musculus — 


r =-u — = = — 

proline-nch salivary protein 


125 


42 


2078 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene zz cione riM;>j w la. 


66 


75 


2079 


AF041330 


— Bodo saltans 


NADH dehydrogenase subunit 5 




47 


2080 


AJ006470 


Homo sapiens 


cartilage-associated protein (GASP) 


139 


84 


2081 


AK024509 




unnamed protein product 


132 


83 


2082 


U93564 


Homo sapiens 


putative pi 50 


137 


67 


2083 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


71 


2084 


B01372 


Homo sapiens 


Neuron-associated protein. 


148 


81 


2085 


AL 1603 71 


Eeish mania 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


1 14 


76 


2086 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


255 


63 


2087 


AF194537 




NAG 13 . ___ 


122 


78 


2088 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


108 


74 


2089 


AF 130079 


Homo sapiens 


PR02852 


149 


77 


2090 


U93574 


Homo sapiens 


p40 


260 


96 


2091 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


99 


76 


2092 


AF194537 


Homo sapiens 


NAG 13 


125 


51 


2093 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


319 


72 


2094 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


159 


56 


2095 


^g34l^ 


Homo sapiens 


putative pi 50 


195 


57 


2096 




Human 
herpesvirus 6 




132 


57 


2097 


AFO 10400 


komo salens 


transaldolase-related protein 


463 


89 


2098 


U93563 




putative pi 50 


128 


35 


2099 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


160 


43 


2100 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


133 


43 


2101 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


71 


2102 


K02401 


Homo sapiens 


chorionic somatomammotropin 


628 


90 


2103 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

QPA TT» XT/~\. 1 £*3 

oJH^ iJL) nc:1oj. 


141 


80 


2104 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


309 


63 


2105 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


192 


37 


2106 


Y59772 


Homo sapiens 


Human normal ovarian tissue derived 
protein 49. 


261 


89 


2107 


AF202051 


Homo sapiens 


INJV1Zj-1-Io 


680 


100 


2108 


R13556 




Protein encoded downstream of hhc M 
oncoprotein. 


127 


54 


2109 


L22029 


Hom^sa^iens 


hydroxyproline-rich glycoprotein 


121 


36 


2110 


D26135 




diacylglycerol kinase gamma 


172 


100 


2111 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


159 


56 


2112 


X65488 


Homo sapiens 


hnRNP U protein 


117 


70 


2113 


AL359782 


Trypan osom a 
brucei 


protein rhala. 






2114 


AF130051 


Homo sapiens 


PRO0898 


93 


62 


2115 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


260 


53 


2116 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


148 


70 



184 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 


2117 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


304 


71 


2118 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


70 


66 


2119 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


118 


84 


2120 


S79410 


ivius musculus 


nuclear localization signal binding 
protein 


94 


~94 


2121 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


304 




2122 


Ml 1901 


norvegicus 




83 


32 


2123 


AK025047 


Homo sapiens 


unnamed protein product 


1 16 


48 


2124 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


95 


72 


2125 


AF138883 


Bos taurus 


fragment CBIO 


103 


40 


2126 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


1 86 


100 


2127 


G031 14 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


117 


52 


2128 


AF 194537 


Homo sapiens 


NAG 13 


126 


66 


2129 


ABO 15802 


Acetobscter 
xylinus 


similar to c&llulosc complementing 
protein of A. xylinum ATCC23869 


] 09 


80 


2130 


AF187147 






-j^ 


~~zf — 


213 1 


W54966 


Homo sa iens 

omo sapiens 


Synthetic human type III collagen 

SYN-C3. 






2132 


L36341 


Aspergillus 


regulatory protein 


130 


49 


2133 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


1 19 


58 


2134 


AF] 16689 


Homo sapiens 


PR02168 


113 


81 


2135 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


191 


38 


2136 


M81321 


fascicularis 




125 




2137 


A18812 






106 


34 


2138 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 


104 


41 


2139 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


191 


92 


2140 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


196 


90 


2141 


Y57285 


Homo sapiens 


Human GPCR protein (HGPRP) 
sequence (clone ID 2214673). 


507 


78 


2142 


AJ242540 


nagariensis 


HRGP yP P 


201 


47 


2143 


AF090944 


Homo sapiens 


PRO0663 


164 


53 








fragment, SEQ ID NO:490. 






2145 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68 
SEQ ID NO: 163. 


198 


70 


2146 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF 


1 19 


— 


2147 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


194 


45 


2148 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


91 


60 


2149 


AF090942 


Homo sapiens 


PRO0657 


129 


66 


2150 


AF1 30089 


Homo sapiens 


PRO2550 


372 


82 


2151 


AC009991 


Arabidopsis 


unknown protein 


81 


59 



185 



MSDOCID: <WO_ 



_0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description ; 


Smith- 
Waterman 


Identit 

y 






thaliana 








2152 


AF090942 


Homo sapiens 


PRO0657 


93 


53 


2153 


R13319 


Homo sapiens 




215 


89 


2154 


AC008075 


Arabidopsis 
thaliana 


F24J5.4 


139 


36 


2155 


AL3901 14 


Leishniania 
major 


extremely cysteine/valine rich protein 


148 


50 


2156 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


112 


58 


2157 


AK024455 


- ^ omo sapiens 


ri^juuu'f/ protein 


152 


66 


2158 


G00376 


omo sapiens 


Human secreted protein, SEQ ID NO: 


85 


80 


2159 


G03258 


Homo sapiens 


"~H~~ d : 

7339. 




75 


2160 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


145 


45 


2161 


AF1 19900 


Homo sapiens 


PR02822 


J 38 


71 


2162 


AJ223953 


Homo sapiens 


hPTTG 


306 


62 


2163 


AK023542 


Homo sapiens 


unnamed protein product 


76 


52 


2164 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


84 


2165 


U41038 


Caenorhabditis 


Similar to cadherin-type repeat 


137 


65 " 


2166 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


348 


88 


2167 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


108 


84 


2168 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


1 14 


48 


2169 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


125 


67 


2170 
2171 


AF 130089 
Y 19609 


Homo sapiens 
Homo sapiens 


PRO2550 




65 


2172 


AF010144 




SEQ ID NO 327 from W09922243. 
neuronal thread protein AD7c-NTP 


103 
104 


65 
39 


2173 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


91 


50 


2174 


D38435 


Homo sapiens 


homologue of yeast PMS1 


314 


96 


2175 


AF119851 


Homo sapiens 


PRO 1722 


230 


69 


2176 


GO 1984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


63 


2177 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
luiAia (.Aru/sooj ) 


137 


44 


2178 


U23172 


Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 

fCiP-PQO^l T-IFl\ylAXT 1 t~\ 

-^az4_HUMAiN, P2j246) and D. 
melanogaster No-on-transient A protein 


139 


42 


2179 


AL049608 


Arabidopsis 
thaliana 




286 


57 


2180 


L17318 


norvegicus 


proline-rich proteoglycan 


148 


40 


2181 


AK024455 




rLjuuu^i / protein 


97 


63 


2182 


R95913 


Homo sapiens 


Neural thread protein. 


100 


69 


2183 


AF266479 


Homo sapiens 


rectachrome 1 


148 




rrrs4 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO- 
4658. 


64 


70 


2185 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO- 
6953. 


107 


40 


2186 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


110 


84 



186 



0 1 64835A2 J_3 



WO 01/64835 PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Itlentit 

y 




AF137273 


Gallus gallus 


alpha 1 (V) collagen 


1 03 




2188 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


115 


36 


— 


D90064 


Homo sapiens 


NCA-W272 


■ 


-i— 




r\ 1 J jUUo? 


Homo sapiens 








2191 


X65165 


Volvox carteri 


extensin 


113 


62 


2192 


U49973 


Homo sapiens 


ORFl^ MER37j putative transposase 


154 


59 


2193 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


148 


59 


2194 


U52077 


Homo sapiens 


mariner transposase 


257 


53 






Homo sapiens 


Human secreted protein encoded by 
gene 10 clone HDPGP94. 






2196 


Dj81 12 


Homo sapiens 


NADH dehydrogenase subunit 5 




77 


2197 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


121 


65 


2198 


.. .. . . 

Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


126 


64 


2199 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


67 


22qq 


f\JUV fVQJ, 


Homosa iens 


TRX5 protein 


~264 : 


— 


2201 


AF130051 


Homo sapiens 


PRO0898 


71 


61 







Homo sapiens 


FLJ00047 protein 





~fi ~ 






Homo sapiens 


line-1 reverse transcriptase 






2204 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


82 


2205 


AF090895 


Homo sapiens 


PRO0117 


163 


69 


2206 


G03806 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7887. 






2207 


R95913 


Homo sapiens 


Neural thread protein. 


103 


86 


2208 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 




78 


2209 




Homo sapiens 


Human breast cancer related protein 
BCRB2. 




65 


2210 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


111 


45 


2211 


AF09093 1 


Homo sapiens 


PRO0483 


63 


90 


2212 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 

„i„ n p i-iPN/f r*rv30 
clone rir JVLD^o^d. 


100 


74 


— tTj 

2-13 





— — : 

Homo sapiens 


Human secreted protein encoded by 
°ene 44 clone HTDAD22 


— 

98 


— 

41 


2214 


M76671 


Lycopersicon 


extensin (class II) 


137 


35 


"2215 


XQ3717 


Homo"a iens 

Homo salens 


— ot unidentified read in frame 


98 


54 





R95913 


H°m° salens 


Neural thread^rotein ~ 


— 






~ 


AF1 1 8086 




PRO 1992 








AF081484 


Homo salens 


alpha tubulin isofoniil — 


343 


95 


"1*219 




Homo sapiens 


.Human secreted protein, oea^ IIJ NO: 
4488. 










Homo sapiens 


alpha- tubulin 






2221 


AF071172 


Homo sapiens 


HERC2 


187 


84 






Homo sapiens 


glia-derived nexin precursor 






2223 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


588 


85 






Homo sapiens 


unnamed protein product 






2225 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


161 


61 


2226 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


126 


41 


2227 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


124 


63 



187 



0164835A2_I_> 



WO 01/64835 



PCT/USOi/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description , : : • ,. . v . 


Waterman 


% 

Identit 

y 


2228 


AF1 19855 


Homo sapiens 


PRO ! 847 


108 


74 


2229 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


122 


55 


2230 


AF090944 




rKUUOOj 


146 


61 


2231 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


152 


74 


2232 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


80 


59 


2233 


G03790 




Human secreted protein, SEQ ID NO: 
7871. 


129 


58 


2234 


G03790 




Human secreted protein, SEQ ID NO: 
7871. 


143 


60 


2235 


AK024455 




rLjuuu4/ protein 


133 


71 


2236 


Y87103 


Homo sapiens 
— 


Human secreted protein sequence SEQ 
ID NO: 142. 


99 


59 


2237 


U87607 


a s 
norvegicus 


putative RNA binding protein 1 


111 


38 


2238 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


119 


63 


2239 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


75 


65 


2240 


L27428 


Homo sapiens 


reverse transcriptase 


136 


40 


224] 




Homo sapiens 


Human histiocyte-secreted factor HSF. 


122 


82 


2242 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


95 


2243 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


79 


48 


2244 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


75 


2245 


U09116 


Homo sapiens 


ORF1, encodes a 40 kDa product 


346 


87 


2246 


U82303 


Homo sapiens 


unknown 


155 


79 


2247 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


45 


2248 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJ Wl 8. 


105 


47 


2249 


Y86248 


H 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


161 


63 


2250 


AF010144 




neuronal thread protein AD7c-NTP 


124 


62 


2251 


X81206 


Drosophila hydei 


histone H3.3 


101 


71 


2252 
2253 


AF010144 


Danio rerio 
Homo sapiens 


proteasome subunit beta 7 


92 


52 


2254 


AF084225 


Homo sapiens 


neuronal thread protein AD7c-NTP 
cytochrome P450 2E1 


172 
114 


60 
46 


2255 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


136 


64 


2256 


AL 132841 


Caenorhabditis 

elegans 




147 


90 


2257 


U49973 




OKi-1; MER37; putative transposase 
similar to pogo element 


297 


59 


2258 


U93563 




putative pi 50 


218 


66 


2259 


L22650 


Homo sapiens 


early lymphoid activation protein 


82 


55 




API O/I^^T 


Homo sapiens 


NAG 13 


117 


56 


2261 


U433 60 


Peromyscus 
maniculatus 


reverse transcriptase 


130 


82 


2262 


G00366 


Homo sapiens 


Human secreted protein, SEQ ID NO- 
4447. 


86 


66 


2263 


AL021918 


Homo sapiens 


b34I8. 1 (Kruppel related Zinc Finger 
protein 184) 


372 


61 


2264 


AL390U4 


Leishmania 
major 


extremely cysteine/valine rich protein 


122 


71 



188 



WO -01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smifh- 

Waierman 

Score 


Identit 

y 


2265 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


123 


79 


2266 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


204 


95 





L29029 


Chlamydomonas 
reinhardtii 


domain, aa266 .. 468; amino acid 
feature: globular protein domain, aa 32 
.. 265 


141 


58 


2268 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


499 


84 


2269 


A11693 


Homo sapiens 


start codon not included 


594 


87 


2270 


AB014554 


Homo sapiens 


KIAA0654 protein 


141 


63 


2271 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


112 


60 


2272 


U49973 


Homo sapiens 


ORF1; MER37; putative rransposase 
similar to pogo element 


118 


54 


2273 


Y20852 


Homo sapiens 


Human neurofilament-H mutant protein 
fragment 1 1. 


125 


35 


2274 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7534. 


92 


66 


2275 


AL3 59782 


Trypanosoma 


possible (hhv-6) u 1 1 02, variant a dna, 
complete virion genome. 


198 


64 


2276 


U15647 


Mus musculus 


reverse transcriptase 


100 


72 




AJ004810 




cytochrome P450 monooxygenase 


129 


43 


2278 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


111 


67 


7279 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4670. 


112 


68 


2280 


L27428 


Homo sapiens 


reverse transcriptase 


150 


60 


2281 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


142 


40 


2282 


U49973 


Homo sapiens 


ORF1 ; MER37; putative rransposase 
similar to pogo element 


321 


54 


2283 


D00570 


Mus musculus 


open reading frame (251 AA) 


169 


56 


2284 


Y02887 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


86 


54 


2285 


AL3 59782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


144 


50 


2286 


AF1 19901 


Homo sapiens 


PR02831 


116 


82 


2287 


Q03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


126 


73 


2288 


AF1 30089 


Homo sapiens 


PRO2550 


102 


75 


2289 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


113 


70 


2290 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


127 


64 


2291 


L26953 


Homo sapiens 


chromosomal protein 


137 


53 


2292 


AF161356 


Homo sapiens 


HSPC093 


100 


57 


2293 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


138 


68 


2294 


X03145 


Homo sapiens 


pot. ORF I 


120 


43 


2295 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


83 


52 


2296 


AK024455 


Homo sapiens 


FLJ00047 protein 


98 


66 


2297 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


123 


73 


2298 


Y91577 


Homo sapiens 


Tiuman secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


349 


73 


2299 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


109 


61 



189 



WO 01/64835 



PCT/USO 1/0492 7 



SEQ ID 

NO. 


Accession No. 


Species 


Description 


Smith- 
Waterman 


" %" 

IJentit 

y 


2300 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


100 


81 


2301 


AF251290 


Plasmodium 
falciparum 


glutamic acid-rich protein 


"TT2 




2302 


AF 155232 


Pisum sativum 


extensin 







2303 


AF 130089. 


Homo sapiens 


PRO2550 





50 


2304 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 


84 


51 


2305 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


86 


30 


2306 


AP00203 1 


Arabidopsis 
thaiiana 


gene id*K3D^0 3~ 




44 


2307 


AF 15732 1 




ju is_i_/d protein 


309 


64 


2308 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO^ 
6613. 




57 


2309 


M76546 


Helianthus 
annuus 








2310 


Y27607 


Homo sapiens 


Human secreted protein encoded by 


207 


100 


2311 


AF1 30089 


Homo sapiens 


PRO2550 


106 


66 


2312 


G00437 




Human secreted protein, SEQ ID NO. - 
4518. 


1 17 


60 


2313 


R10755 


Homo sapiens 


Non-A non-B hepatitis specific 
antigenic protein encoded by 
phageclone lambda HC2533. 


1 14 


80 


2314 


AB032910 


muelleri 


dopamine receptor D4 


108 


40 


2315 


Y27868 


Homo sapiens 


~u — " a : 

Human secreted protein encoded by 

gene No. 107. 


159 


59 


2316 


AK024455 




Pr mnn/17 r^m-t&ir. 
r j-.juuuf / protein 


134 


58 


2317 


AF238235 


Entamoeba 
histolytica 


diaphanous protein 


103 


51 


23 3 8 


U 15647 


Mus musculus 


reverse transcriptase 




37 


2319 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


117 


51 


2320 


Y86248 




Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


163 


78 


2321 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NOr 
7871. 


97 


48 


2322 


D00570 


Mus musculus 


open reading frame (196 AA) 


122 


39 


2323 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein. 
RHAla(AF078683) 




75 


2324 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NOi 
6720. 


118 


79 


2325 


U44838 


Glycine max 


extensin 


126 


36 


2326 


Y87103 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 142. 


99 


59 


2327 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment SEQIDNO'490 


408 


79 


2328 


AF194537 


Homo sapiens 




181 


66 


2329 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


405 


75 


2330 


X53581 


Rattus 


ORF4 


116 


45 


) 2331 


L26953 


Homo sapiens 


chromosomal protein 


108 


67 


2332 


L27428 


Homo sapiens 


reverse transcriptase 


117 


76 


.2333 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


84 


50 


2334 


AF191687 


Homo sapiens 


alanine-glyoxylate aminotransferase 
homolog 


100 


37 


2335 


AF020144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


84 


68 
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SEQ1D 

NO: 


Accession No. 


Species 


Description 


Smith- 
VVntcrman 


% 

Idcntit 

y 


2336 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


106 


53 


2337 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


168 


— 


2338 


R95913 






164 




2339 


AF090942 


Homo sapiens 


PRO0657 


122 


81 


2340 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID T^Oi 
8159. 


107 






a pnonod? 

rVT uyuy'fz 


Homo sapiens 








2342 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


83 


"2343" 


a on 1 1A ^/l 


— — 

norvegicus 


NaPi-2 beta 


-— 




2344 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


127 


73 


"2345 




vol vox carteri r. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


146 


— 






Homo sapiens 


Np70 protein carboxy terminal region. 






2347 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


55 


"2348 


Y3620-* 


Homo sapiens 


Human secreted protein #75. 


"138 


— 


2349 


L26953 


Homo sapiens 


chromosomal protein 


131 


57 


2350 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


120 


65 


"2351 


S587?2 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


"744 


~58 




a "p i -inn^ i 

r\r IJUUD 1 


Homo sapiens 








2353 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


76 


2354 


L27428 


Homo sapiens 


reverse transcriptase 


141 


71 






Homo sapiens 


chromosomal protein 






2356 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


148 


75 


2357 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


62 


2358 


B12310 


Homo sapiens 


Human secreted protein encoded by 
gene 10 clone HDPGP94. 


115 


50 


2359 


M76546 


Helianthus 


hydroxyproline-rich protein 


103 


43 


2360 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


82 


80 


2361 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


95 


72 


2362 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


117 


41 


2363 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


98 


80 


-— — 


AT /If lrtl7 


— 

Neurospora 
crassa 


related to Ul oMALL NUCLtAK 

KJeSUIn UC,LJbvJr^,Kw 1 Hi IN 


— — 

152 


— 

45 




365 
■■ 





Homo sapiens 


Human secreted protein encoded by 
gene jLj. cione rxivi^j w i o. 


— 


— 

55 


366 


G03790 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


— — 


— 

77 


2367 


AF093097 


Homo sapiens 


putative RNA-binding protein Q99 


248 


97 


2368 


U52077 


Homo sapiens 


mariner transposase 


227 


74 


2369 


X07882 


Homo sapiens 


Po protein 


102 


38 


2370 


U44838 


Glycine max 


extensin 


102 


32 


2371 


ABO 12223 


Canis familiaris 


ORF2 


158 


60 


2372 


AF025467 


Caenorhabditis 


contains similarity to drosophila DNA- 


104 


42 
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SEQ ID 

NO: 


Accession No. 




Description 


Waterman 

-_§£2^ 


Identit 

y 






clcgans 


binding protein K10 (NID:g8148) 








I~lf\A AO 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


107 


40 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


106 


67 




a pi i «riBi=; 


Homo sapiens 


PROl 992 


169 


65 


2yi£ 


WioJJ J 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


95 


55 


2377 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


121 


59 


~2378 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


141 


70 




R.959] 3 


Homo sapiens 


Neural thread protein. 


120 


50 


2380 


U93572 


Homo sapiens 


putative pi 50 


124 


53 




G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


195 


66 




U52077 


Homo sapiens 


mariner transposase 


282 


67 




G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


113 


43 


2384 


AJ242540 


Voivox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


112 


37 


2385 



G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


88 


75 




F 

AF230O89 


Homo sapiens 


PRO2550 


160 


62 


38 


W a 8627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


166 


61 




AF130089 


Homo sapiens 


PRO2550 


103 


71 


2389 


AB027890 


Schizosaccharom 
yces pombe 


Hypothetical protein 


116 


100 


2390 


U93570 


Homo sapiens 


putative pi 50 


153 


54 




Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


110 


71 


2392 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


109 


67 


2393 
■ 


AE001381 


Plasmodium 
falciparum 


hypothetical protein 


94 


34 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


156 


62 


2395 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


57 


56 


2396 


U42471 


Mus musculus 


Wiscott-AJdrich Syndrome protein 
homolog 


107 


43 


2397 


AF21065I 


Homo sapiens 


NAG 18 


162 


54 




MI 1901 


norvegicus 


proline-rich salivary protein 


116 


34 




S80864 


Homo sapiens 


cytochrome c-like polypeptide 


115 


68 


2400 
- 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


141 


54 




M13 100 


Rattus 
norvegicus 


unknown protein 


148 


46 




G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


133 


70 


2403 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


366 


94 


2404 


AF1 13685 


Homo sapiens 


PRO0974 


112 


56 


2405 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


98 


67 


2406 


R41001 


Homo sapiens 


Human myotonic dystrophy gene 
protein. 


207 


68 


2407 


AL390114 | Leishmania 


probable (hhv-6) u'l 102, variant a 


127 


63 
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SDOCID: <WO 016483SA2J_> 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Idcntit 

y 






major 


DNA, complete virion genome 






2408 


G03101 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7182. 


378 


95 


2409 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO: 125. 


250 


86 


2410 


U72520 


Mus musculus 


mena protein 


1 15 


40 


2411 


AP000616 


Oryza sativa 


similar to RING-H2 Finger protein 
RHAla(AF078683) 


132 


44 


2412 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


109 


85 


2413 


AF1 19901 


Homo sapiens 


PR02831 


1 13 


66 


2414 


R95913 


Homo sapiens 


Neural thread protein. 


93 


57 


2415 


B01372 


Homo sapiens 


Neuron-associated protein. 


104 


83 


2416 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


130 


50 


2417 


J02459 


bacteriophage 
lambda 


K (tail component; 1 99) 


720 


92 


2418 


J04694 


Mus musculus 


alpha- 1 type IV collagen 


103 


43 


2419 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


145 


61 


2420 


W26496 


Homo sapiens 


CD2 associated intracellular protein 
CAIPLS02-21. 


115 


80 


2421 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


145 


38 


2422 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


158 


73 


2423 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


156 


70 


2424 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


122 


61 


2425 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


95 


59 


2426 


U93565 


Homo sapiens 


putative pi 50 


147 


75 


2427 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


246 


53 


2428 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


138 


61 


2429 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


112 


75 


2430 


D13892 


Homo sapiens 


carboxyl methyltransferase 


181 


68 


2431 


X97675 


Homo sapiens 


plakophilin 2b 


131 


65 


2432 


G02896 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6977. 


108 


70 


2433 


X03145 


Homo sapiens 


pot. ORF III 


180 


82 


2434 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


77 


86 


2435 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


421 


67 


2436 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


111 


38 


2437 


AF210651 


Homo sapiens 


NAG 18 


128 


72 


2438 


AF0 16099 


Mus musculus 


endonuclease/reverse transcriptase 


232 


46 


2439 


AL 160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


122 


53 


2440 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


136 


80 


2441 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


139 


54 


2442 


AF090895 


Homo sapiens 


PRO0117 


115 


61 


2443 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


125 


70 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Watcrman 


% 

Identit 

y 


2444 


G00594 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4675. 




80 


2445 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


81 


37 


2446 


AF090895 


Homo sapiens 


PRO0 1 1 7 


86 


64 


2447 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


103 


86 


2448 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


94 


61 


2449 


L21936 


Homo sapiens 


succinate dehydrogenase flavoprotein 
subunit 






2450 


AF090942 


Homo sapiens 


PRO0657 


J j4 . .... 


.2} ..... 


2451 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 






2452 


K.03 1 79 


Homo sapiens 


pro-alpha- 1 type-I collagen 






2453 


AF1 18082 


Homo sapiens 


PRO 1902 


143 


46 


2454 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


1 14 


68 


2455 


AF130089 


Homo sapiens 


PRO2550 


125 


89 


2456 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


134 


71 


2457 


Y36243 


Homo sapiens 


Human secreted protein encoded by 
gene 20. 


93 


75 


2458 


Y09945 


Rattus 
norvegicus 


putative integral membrane transport 
protein 


166 


46 


2459 


AF 130052 


Homo sapiens 


PRO0956 


75 


48 


2460 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


162 


76 


2461 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


123 


69 


2462 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 3 02, variant a 
DNA, complete virion genome 


157 


48 


2463 


X83413 


Human 
herpesvirus 6 


U88 


236 


50 


2464 


G00517 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4598. 


181 


53 


2465 


X83413 


Human 
herpesvirus 6 


U88 


218 


50 


2466 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


147 


57 


2467 


X83413 


Human 
herpesvirus 6 


U88 


196 


53 


2468 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


165 


61 


2469 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


101 


46 


2470 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


221 


56 


2471 


G 00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


136 


63 


2472 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


122 


94 


2473 


AF161361 


Homo sapiens 


HSPC098 


120 


60 


2474 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


142 


40 


2475 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


130 


58 


2476 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6596. 


78 


48 


2477 


AL 160371 


Leishmania 


probable (hhv-6) ul 102, variant a 


86 


46 



194 



_0164835A2_I_> 



WO 01/64835 PCT/USO 1/04927 





Accession No. 


Species 


Description 


Smith- 


% 


NO: 




















Score 


y 






major 


DNA, complete virion genome 






2478 


G01478 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


96 


66 








5559. 






2479 


AL033545 


Arabidopsis 


extensin-like protein 


114 


40 






thaliana 








2480 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


142 


59 






brucei 


complete virion genome. 






2481 


AL390114 


Leishmania 


probable (hhv-6) ul 102, variant a 


146 


62 






major 


DNA, complete virion genome 






2482 


AL 132902 


Caenorhabditis 


Y71A12B.4 


160 


37 






elegans 








2483 


AF 154502 


Homo sapiens 


quiescent cell proline dipeptidase 


439 


88 


2484 


AF0 10326 


Drosophila 


short form of CHIP 


56 


42 






melanogaster 








2485 


W80406 


Homo sapiens 


A secreted protein encoded by clone 


119 


50 








dh40 3. 






2486 


W31186 


Homo sapiens 


Human pi 60 polypeptide 160.2. 


115 


60 


2487 


W48351 


Homo sapiens 


Human breast cancer related protein 


103 


74 








BCRB2. 






2488 


L26953 


Homo sapiens 


chromosomal protein 


129 


68 


2489 


Y21418 


Homo sapiens 


Human high mobility group protein 


96 


46 








HMGI-C mutant fragment 2. 






2490 


Y91577 


Homo sapiens 


Human secreted protein sequence 


366 


94 








encoded by gene 2 SEQ ID NO.250. 






2491 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


158 


76 








4508. 






2492 


U12707 


Homo sapiens 


Wiskott-Aldrich syndrome protein 


144 


47 


2493 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


117 


81 








encoded by gene 1 7. 






2494 


X92485 


Plasmodium 


pval 


116 


64 






vivax 








2495 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


92 


64 








7795. 






2497 


AF161483 


Homo sapiens 


HSPC134 


211 


78 


2498 


X05006 


Homo sapiens 


S-protein 


298 


92 


2499 


AB032911 


Hylobates agilis 


dopamine receptor D4 


89 


42 


2500 


M98502 


Mus musculus 


pMLZ-4 


325 


90 


2501 


Y94920 


Homo sapiens 


Human secreted protein clone 


569 


85 








pm4 12 12 protein sequence SEQ ID 












NO:46. 






2502 


AB049054 


Homo sapiens 


brain link protein- 1 


224 


90 


2503 


AF064604 


Homo sapiens 


KE03 protein 


224 


37 


2504 


Z69727 


Schizosaccharom 


putative dna-directed rna polymerase iii 


384 


61 






yces pombe 


130 kd polypeptide (ec 2.7.7.6) 






2505 


M11901 


Rattus 


proline-rich salivary protein 


147 


43 






norvegicus 








2506 


AJ0 10604 


Mus musculus 


L-Sox5 protein 


366 


87 


2507 


X83413 


Human 


U88 


203 


46 






herpesvirus 6 








2508 


AJ277425 


Globodera 
pallida 


putative cuticular collagen 


125 


40 


2509 


AB017919 


Homo sapiens 


peptidylarginine deiminase type V 


148 


81 


2510 


AF001947 


Homo sapiens 


U4/U6-associated RNA splicing factor 


583 


97 


2511 


AJ238520 


Homo sapiens 


putative transcription factor-like 


722 


100 








nuclear regulator 






2512 


AL390736 


Homo sapiens 


bA209J 1 9. 1 . 1 (G Wl 1 2 protein) 


557 


84 


2513 


D84223 


Homo sapiens 


leucyl tRNA synthetase 


1113 


100 


2514 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


174 


56 








clone HPMBQ32. 







195 
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SEQ1D 


Accession No. 


Species 


Description . 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2515 


U60803 




clathrin heavy chain 2 




92 


2516 


AJ388557 


Canis familiaris 




~~Tfk 




2517 


AB027251 


Homo sapiens 


zinc finger protein (ZFD25) 


~6^1 




2518 


AK023160 


Homo sapiens 


unnamed protein product 


368 


54 


2519 


D87326 


IVlus musculus 


GSG2 


--yll 


73 


2520 • 


G03371 


Homo sapiens 


Human secreted protein SEQ ID NO* 
7452. 




42 


2521 ■ 


X676&8 


Homo sapiens 


translcetolase 


"T20 




2522 


AF074086 


Homo sapiens 


protease 


390 


86 


2523 


AF220509 




transcription associated factor 
TAFII31L 


801 


99 


2524 


AL078463 




aJjODJiy.J (KJAAU456) 


374 


92 


2525 


AF038995 


IVlus musculus 


putative RNA helicase RCK 


160 


93 


2526 


M6061S 


Homo sapiens 


nuclear autoantigen 


116 


75 


2527 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


57 


2528 


X68790 


Homo sapiens 


Bactericidal /Permeability Increasing 
Protein 


136 


90 


2529 


A01592 


—j : 

.Homo sapiens 


haemoglobin A beta chain 


508 


94 


2530 


U89277 


Homo sapiens 


polyhomeotic 1 homolog 


404 


79 


253] 


X90845 


Rattus 
norvegicus 


alphall spectrin 


507 


87 


2532 


AL137081 


thaliana 


pnenylalanine-tRNA synthetase-like 
protein 


1 78 




2533 


Y 18046 


Homo sapiens 


FGFR1 oncogene partner (FOP) 


363 


100 


2534 


W54966 


riomo sapiens 


Synthetic human type III collagen 

SYN C3. ^ 


1 13 


42 


2535 


X61047 


„ . 


mini-collagen 


102 


38 


2536 


AB006330 


Mus^usculus 




559 


94 


2537 


AC006283 


Arabidopsis 


En/Spm-Jike transposon protein 


149 


33 


2538 


AF 196779 


Homosa iens 

Homo salens 


■ 

JIvl 1 1 protein 


139 


57 


2539 


AK000741 


Homo salens 


unnamed protein product 


233 


47 


2540 


Y] 8046 




FGFRI oncogene partner (FOP) 


315 


84 


2541 


U54996 


Homo sapiens 


HZW10 


371 


81 


2542 


G0263 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


45 


2543 


M95610 


Homo sapiens 


alpha-2 IX collagen 


93 


34 


2544 


AF07 1 1 73 






349 


82 


2545 


AB000516 


Homo sa ' 


JL"oir p i ou 


600 


93 


2546 


AL355178 


Homo sapiens 


HT047I R 1 Annuel r"*T TO o«H C.^Ui 

ujy<+ 1 i .o (jiovei v-utJ ana s>usni 
(SCR repeat) domain protein) 


395 


90 


2547 


U93574 




putative pi 50 


235 


76 


2548 


X80035 


On^tolagus" S 

cuniculus 


cysteine rich hair keratin associated 


99 


31 


2549 


U58088 


Homo sapiens 


Hs-CUL-2 


592 


86 


2550 


L26953 




chromosomal protein 


131 


63 


2551 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


99 


61 


2552 




Homo sapiens 


alpha SNAP 


159 


54 


2553 


rvr ^.HoOj 1 


Homo sapiens 


RNA-binding protein BRUNOL4 


569 


93 


2554 


AJ006519 


Rattus 
norvegicus 


ASIC-beta 


164 


73 


2555 


U93570 


Homo sapiens 


putative pi 50 


169 


52 


2556 


AF 104328 


Arabidopsis 
thaliana 


cell wall-plasma membrane linker 
protein homolog 


105 


29 


2557 


AB028975 


Homo sapiens 


KJAA1052 protein 


610 


83 


2558 


AJ245621 


Homo sapiens 


CTL2 protein 


286 


47 ! 
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SEQ1D 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

ldentit 

y 


^ttt 


AF090944 




PRO0663 


174 






Y 17832 


Human 5313 ' 6 " 5 

endogenous 
retrovirus K 


pol protein 


41 1 


— 


2561 


AF 109907 " 


Homo sapiens 


S164 


279 


44 


2562 


A09561 


synthetic 
construct 


human serum albumin 


680 


91 


2563 


U48359 


Gallus gallus 


kinesin light chain 


575 


81 


2564 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


435 


88 


2565 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 4 


623 


89 


2566 


AF154916 


Chlamydomonas 
reinhardtii 


variable flagellar number protein 


117 


36 


2567 


AF181657 


Orosophila 
melanogaster 


BcDNA.LD34475 


261 


42 




M12530 






693 


87 


2569 


U96915 


Homo sapiens 


sin3 associated polypeptide pi 8 


651 


93 






Homo sapiens 


2. 






— 


ALU / 5_>yj 


Homo sapiens 


HTOIfiRI 1 /VTA A H^ftn^ 


— 








Phaseolus 
coccineus 


seed-micropylar-endothelium -specific 
protein 




~3~9" 


2573 


G031 14 


Homo sapiens 


Human secreted protein SEQ ID NO* 
7195. 


110 


49 


2574 


AB024601 


Pseudomonas 


uridylyl transferase 


120 


37 


2 


AB050893 


A^iadara rfodifera 


cytochrome c oxidase subunit 1 


1 1 1 


79 


■2575 


G03172 


Homo sa iens 

omo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


130 


72 


2577 


MI 7697 


Homo sapiens 


giutamate dehydrogenase 


369 


61 


2578 


AF045640 


Caenorhabditis 


C11D2.4 gene product 


264 


43 


2579 


ij 7 13g2 


Hom'osa iens 


OB binding protein 1 


273 


86 


2580 


AF1 16661 


Homo sapiens 


PRO 143 8 


1 14 


61 




AK000496 


Homo salens 




153 


55 


2582 


A67510 


Mus musculus 


MUS MUSCULUS GENOMIC DNA 
CONTAINING N ALLELE OF FV 1 
GENE. 


190 


38 


2S8 ^ 


Y44g51 


Homo sa iens 


Human CD39-L66 protein. 


205 


97 


2584 


AB029151 


Homo sapiens 


D29 


253 


73 




AF039023 




Ran-GTP binding protein; RanBP6 


765 


94 


2586 


AF054180 


Homo sapiens 


hematopoietic cell derived zinc finger 


116 


34 


2587 


AF283645 


Homo sapiens 


folate transporter/carrier 


580 


82 




AF090930 




PRO0478 


132 


67 


2589 


AC002339 


Arabidopsis 


putative ABC transporter 


238 


34 


2590 


M34668 


Homo sapiens 


protein tyrosine phosphatase (EC 
3.1.3.48) 


627 


86 


— — 





— : 

omo sapiens 


Human secreted protein SEQ ID NO' 
6613. 


"HI 

■ 


~63 


~2592 


W80406 


Homo sapiens 


dh40 3. 


142 


63 


2593 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


146 


54 


2594 


AF047437 


Homo sapiens 


sperm acrosomal protein 


526 


74 


2595 


U28789 


Mus musculus 


PACT 


528 


71 


2596 


AB002366 


Homo sapiens 


KIAA0368 


615 


83 


2597 


AL049610 


Homo sapiens 


dJ1055C14.2 (KIAA0026 
(transcription factor-like protein 


704 


93 
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SEQ ID 

NO: 


Accession No. 


Spec-es 


Description 


Smith- 
Waterman 

--Score 


% 

Identit 

y 








MRGX)) 






2598 


G01279 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5360. 


385 


— 


2599 


AF090895 


Homo sapiens 


PRO0117 


142 


— 


2600 


D29763 


Mus musculus 


seizure-related gene product 6 
precursor 


119 


76 


2601 


U20158 


Homo sapiens 


SLP-76 


108 


— 


2602 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


242 





2603 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


233 


73 


2604 


L24804 


Homo sapiens 


p23 


259 




2605 


Y12102 


Homo sapiens 


Human 5' EST secreted protein SEQ ID 
NO: 415. 


132 


92 


2606 


U79284 


Homo sapiens 


SEC14L 







2607 


AF202635 


Homo sapiens 


PP1200 


105 




2608 


AF277374 


Homo sapiens 


enhancer of polycomb 


254 


— 


2609 


X03484 


Homo sapiens 


raf protein (aa 1-648) 


600 


82 


2610 


U15637 


Homo sapiens 


CD40 binding protien 


465 




2611 


U4026S 


Trypanosoma 
cruzi 


ATPase subunit 6 


98 


31 


2612 


AF090942 


Homo sapiens 


PRO0657 


125 


48 


2613 


V00662 


Homo sapiens 


cytochrome oxidase I 


605 


82 


2614 


U46920 


Homo sapiens 


metaxin 


748 


94 


2615 


Y19757 


Homo sapiens 


SEQ ID NO 475 from W09922243. 






2616 


AK025047 


Homo sapiens 


unnamed protein product 


173 


58 


2617 


D13866 


Homo sapiens 








2618 


U20536 


Homo sapiens 


cysteine protease Mch2 isoform alpha — 








2619 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


608 


86 


2620 


AF190168 


Homo sapiens 


afoumin bUm '" preCUrS ° r 






2621 


A06977 


Homo sapiens 




~isf 


-g! 


2622 


R14584 


Homo sapiens 


TGF beta 1 binding protein encoded by 
clone BPA 13. 


339 


81 


2623 


X5669S 


Xenopus laevis 


42Sp48 | 


117 


47 


2624 


W48351 


Homo sapiens 


BCRB2. P 


131 


57 


2625 


X07881 


Homo sapiens 


proiine-rich protein Gl 


—rj2 


— ~ 


2626 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


~89 


"38" 


2627 


A00279 


synthetic 
construct 


Human serum albumin 


564 


95 


2628 


AK021613 


Homo sapiens 


unnamed protein product 


214 


70 


2629 


M23613 


Homo sapiens 


nucleophosmin 


486 


82 


2630 


AF243424 


Homo sapiens 


SG2NA beta isoform 






2631 


G00506 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4587. 


— ~ 


~ 


2632 


A06977 


Homo sapiens 


albumin 


457 


74 


2633 


A06977 


Homo sapiens 


albumin 






2634 


M81088 


norvegicus 


EF-1 -alpha 


Tol 





2635 


AL359587 


Homo sapiens 


hypothetical protein 







2636 


A03758 


Homo sapiens 






"oT 


2637 


G02872 


Homo sapiens 


Human secreted protein SEQ ID NO" — 
6953. 




— 

39 


2638 


U93563 


Homo sapiens 


putative pi 50 


186 


38 


2639 


U68729 


Meloidogyne 
incognita 


cuticle preprocollagen 


113 


34 


2640 


AB033D56 


Homo sapiens 


KIAA 1 230 protein 


269 


94 


2641 


AF116712 


Homo sapiens 


PR02738 


115 


61 


2642 


W48353 


Homo sapiens 


Human breast cancer related protein 


124 


72 
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Accession No. 


Species 


Description 






NO: 








Waterman 


Idcntit 










Score 


y 








BCFLT2. 






2643 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


54 








6536. 






2644 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


64 








7519. 






2645 


AK025116 


Homo sapiens 


unnamed protein product 


104 


64 


2646 


X92485 


Plasmodium 


pval 


100 


47 






vivax 








2647 


AF130089 


Homo sapiens 


PRO2550 


122 


51 


2648 


AF194537 


Homo sapiens 


NAG 13 


279 


83 


2649 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


90 


66 








encoded by gene 17. 






2650 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


63 








7519. 






2651 


AF116661 


Homo sapiens 


PR01438 


112 


46 


2652 


U63542 


Homo sapiens 


FAP protein 


128 


79 


2653 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


133 


46 






brucei 


complete virion genome. 






2654 


Y85062 


Homo sapiens 


Interleukin I converting enzyme 


139 


72 








homologue (ICEL) protein sequence. 






2655 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


164 


72 








6613. 






2656 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 


145 


51 








(hhcm) gene preoduct 






2657 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


64 








4718. 






2658 


B25722 


Homo sapiens 


Human secreted protein sequence 


91 


88 








encoded by gene 9 SEQ ID NO: 111. 






2659 


X92485 


Plasmodium 


pval 


141 


60 






vivax 


0 






2660 


Y76198 


Homo sapiens 


Human secreted protein encoded by 


68 


68 








gene 75. 






2661 


Y30822 


Homo sapiens 


Human secreted protein encoded from 


96 


59 








gene 12. 






2662 


AF09093 1 


Homo sapiens 


PRO0483 


114 


74 


2663 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


57 








7888. 






2664 


AF090930 


Homo sapiens 


PRO0478 


170 


78 


2665 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


460 


77 


2666 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


368 


67 . 


2667 


AF090931 


Homo sapiens 


PRO0483 


90 


88 


2668 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


121 


66 


2669 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


169 


73 








4478. 






2670 


AF194537 


Homo sapiens 


NAG 13 


170 


42 


2671 


X86003 


Rattus 


neuron-derived orphan receptor 


104 


50 






norvegicus 








2672 


U63542 


Homo sapiens 


FAP protein 


139 


75 


2673 


AF090930 


Homo sapiens 


PRO0478 


130 


69 


2674 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


161 


75 








7645. 






2675 


AK024455 


Homo sapiens 


FLJ00047 protein 


105 


53 


2676 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


67 








7878. 






2677 


AK000496 


Homo sapiens 


unnamed protein product 


110 


67 


2678 


AF1 16715 


Homo sapiens 


PR02829 


151 


78 


2679 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


160 


72 








4718. 






2680 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


81 
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SEQID 

NO: 


Accession No. 




Description 


Waterman 


% 

Idcntit 

y 








6613. 






2681 , 


Y76198 


Homo sapiens 


gene 75. 


I j1 


75 


2682 


U76604 


Homo sapiens 


1 80 kDa bullous pemphigoid antigen 
2/type XVII collagen 


96 


37 


2683 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


85 


55 


2684 


G00407 


Homo sapiens 


4488. 


127 


80 


2685 


AF090895 


Homo sapiens 


PRO0 117 


-— 

81 


51 


2686 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


119 


52 


2687 


M64793 


Rattus 
norvegicus 


salivary proline-nch protein 


122 


41 


2688 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


145 


71 


2689 


AF130089 


Homo sapiens 


PRO2550 


149 


78 


2690 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


!02 


47 


2691 


G03043 




Human secreted protein, SEQ ID NO: 
7124. 


1 16 


74 


2692 


G03064 




Human secreted protein, SEQ ID NO: 
7145. 


130 


60 


2693 


AF041330 




NADH dehydrogenase subunit 5 


106 


26 


2694 


AE001 158 


Borrelia 
burgdorferi 


conserved hypothetical integral 
membrane protein 


81 


37 


2695 


G03786 


omo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


162 


76 


2696 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


114 


100 


2697 


AF 194537 






126 


90 


2698 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


140 


69 


2699 


U93563 


Homo sapiens 




138 


37 


2700 


AF090928 




PROf>47f> 
rt\XJ\J^t\J 


177 


57 


2701 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 


75 


2702 


X92485 


Plasmodium 


pval 


116 


50 


2703 


AK024455 


Homo sapiens 


FLJ00047 protein 


103 


57 


2704 


D38114 


Gorilla gorilla 




1j7 


81 


2705 


AF090S95 


Homo sapiens 


PRO0 117 


105 


51 


2706 


AF090944 


Homo sapiens 


PRO0663 


121 


45 


2707 


G03263 


Homo sapiens 


Human secreted protein SEQ ID NO* 
7344. 




57 


2708 


G03172 


Homo sapiens 


Human secreted protein SEQ ID NO* 
7253. 


157 


85 


2709 


D38122 


Homo sapiens 




~2lr 


_ 


2710 


AF130079 


Homo sapiens 


PR02852 


125 


48 


2711 


M22334 


Homo sapiens 


unknown proteui _ 


131 


50 


2712 


G00416 




Human secreted protein, SEQ ID NO: 
4497. 


123 


58 


2713 


S80119 




reverse transcriptase homolog ■ 


230 


57 


2714 


AF090931 


Homo sapiens 


PRO0483 


98 


85 


2715 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


104 


79 


2716 


L00016 


Homo sapiens 


urf5 


295 


93 


2717 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


135 


82 


2718 


M81321 


Macaca 
fascicularis 


proline-rich protein 


117 


38 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


SmiCh- 
Wnterman 

Score 


Identit 


2719 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


124 


57 


2720 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 Finger 
protein rhala. 


147 


56 


2721 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


132 


67 


2722 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFTU96. 


129 


56 


2723 


D3S112 


Homo sapiens 


cytochrome c oxidase subunit 1 


392 


68 


2724 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


182 


79 


2725 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


195 


92 


2726 


W97293 


Homo sapiens 


An annexin binding protein (NABP-1). 


193 


40 


2727 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


547 


85 


2728 


V00662 


Homo sapiens 


cytochrome oxidase I 


500 


92 


2729 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


110 


56 


2730 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


124 


75 


2731 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 Finger 
protein rhala. 


113 


57 


2732 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


149 


51 


2733 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


101 


44 


2734 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


288 


81 


2735 


Z74036 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF01391 (Collagen triple helix repeat 
(20 copies)), Score=58.9,E- 
value=3.5e-I4,N=3 


3 17 


37 


2736 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


154 


62 


2737 


AK024455 


Homo sapiens 


FLJ00047 protein 


136 j 


60 


2738 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


110 


63 


2739 


AP002460 


Arabidopsis 
thaliana 


gene_id:FlD9.26~unknown protein 


363 


90 


2740 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


358 


69 


2741 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


143 


64 


2742 


L2742S 


Homo sapiens 


reverse transcriptase 


1 19 


45 


2743 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


76 


63 


2744 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


155 


79 


2745 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


114 


51 


2746 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


159 


75 


2747 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


162 


79 


2748 


W4S351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


107 


49 


2749 


AK024455 


Homo sapiens 


FLJ00047 protein 


165 


69 


2750 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


167 


72 


2751 


M22332 


Homo sapiens 


unknown protein 


208 


46 j 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

I dentil 

y 


2752 


R13556 


Homo sapiens 


--p — — - — — 

rrotem encoded downstream or fine M 

oncoprotein. 


125 


56 


2753 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


150 


65 


2754 


AF220264 


Homo sapiens 


MOST-1 


-— 




2755 


AK024455 


Homo sapiens 


FLJ00047 protein 


— ]4g 


--— 


2756 


AFI3005I 


Homo sapiens 


PRO0898 


162 


73 


2757 


D38112 


Homo sapiens 




~~^r~ 


86 


2758 


G03714 


Homo sapiens 


Human secr^ted^ro^ei^^EQ 1 IE) NO- — 
7795. 


124 


58 


2759 


AJ243666 


Homo sapiens 


NICE-5 protein 


118 


84 


2760 


D3S112 


Homo sapiens 






90 


2761 


G02482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6563. 


110 


39 


2762 


U 12690 


Homo sapiens 


cytochrome oxidase subunit II 


257 


88 


2763 


Y73345 




HI KM clone 438283 protein sequence. 


624 


67 


2764 


R13556 


■ Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


156 


45 


2765 


G02832 




Human secreted protein, SEQ ID NO: 
6913. 


132 


58 


2766 


X92485 


Plasmodium 
vivax 




149 


53 


2767 


AF] 13685 






131 


63 


2768 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


56 


2769 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

- 


154 


71 


2770 


X97707 


-p — 

abel 1° PygmaeuS 


stopcodon created by 
posttranscriptiona) polyadenylation 


185 


87 


2771 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18 


110 


59 


2772 


G00637 


Homo sapiens 


Human secreted protein SEQ ID NO" 
4718. 


145 


60 


2773 


AF090944 


Homo sapiens 


PRO0663 






2774 


L26953 


Homo sapiens 


chromosomal protein 


112 


58 


2775 


AF090930 


Homo sapiens 


PRO 04 7 8 


127 


59 


2776 


AK024455 


Homo sapiens 


FLJ00047 protein 


154 


56 


2777 


R59843 




ApoE4Lx2 protease. 


103 


42 


2778 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRJB2. 


101 


55 


2779 


Y0267 1 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJAV18 


131 


64 


2780 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253 


89 


56 


2782 


D38113 


Pan troglodytes 


NADH dehydrogenase subunit 4 


194 


97 


2783 


Y02749 


Homo sapiens 


gene 100 clone HNFIU96 


124 


68 


2784 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


123 


49 


2785 


U43360 


Peromvscus 
maniculatus 


reverse transcriptase 


175 


50 


2786 


W58700 


Homo sapiens 


Human ST-1 partial sequence. 


219 


100 


2787 


R96800 




Human histiocyte-secreted factor HSF. 


137 


90 


2788 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


519 


76 


2789 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


104 


44 


2790 


AF130089 


Homo sapiens 


PRO2550 


121 


39 


2791 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


135 


63 


2792 


G03052 


Homo sapiens | Human secreted protein, SEQ ID NO' 


124 


50 



202 



SDOCID; <WO_ 



.01S4835A2J_5 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










SCOrC 


y 








~7T33 ~ — — 









IV1 J U J^IO 


— — : 

Homo sapiens 


* — u ^ i 

cytochrome oxidase I 


Tsl 


— 


2794 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


"Zfil 

— — — 


— 

-2z 


2795 


AF090931 


Homo sapiens 


PRO0483 


—L- 




2796 


G0^789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




~ 






xfJrlu a K vi 









D 8112 


Homo sapiens 


InADH dehydrogenase subunit 2 


~7T4 

-— 


— 


2798 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




— 














5 

_799 


— 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


— — 


— 








— j : a u To 






— — — 

2800 


VAl A f\r\ 
Y01400 


— — : 

Homo sapiens 


Secreted protein encoded by gene 1J> 


~Tvf 


— 








Clone rii\ ri r KJtCr? . 




— 




2801 




X92485 


— — — 

Plasmodium 


pval 


97 








y ivax : 


_ _ _ — _ 


— — 


— — - 


7^ 

2802 


— - — 

D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 




|jj — 


2803 


Ar I3ULO 1 


Homo sapiens 






— 


— 


2804 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


78 










7437 '• . 





— 


2805 


Y02749 


Homo sapiens 


Human secreted protein encoded by 


143 


69 








gene 100 clone HNFIU96. 






2806 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


62 








7496. 






2807 


K0240 1 


Homo sapiens 


chorionic somatomammotropin 


543 


88 


2808 


YI8522 


Corvus 


cytochrome oxidase subunit I 


382 


60 






frugilegus 









2809 


G01 194 


Homo sapiens 


_ : 

Human secreted protein, SEQ ID NO: 




— 










— — 




2810 


B03148 


. 

Homo sapiens 


Human neuronal differentiation factor- 




— 








1 ^JNLIr- 1 ). 


— 


— 




2811 


/vruu /oiO 


— — ; 

Homo sapiens 


bax epsilon 






- 812 ... 




Homo sapiens 


neuronal thread protein AD7c NTP 


Ipo 


67 


2813 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


— ^- j 


-|i 


2814 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


~TT5 ' 










: — 








2815 


— — — — 

G03258 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


"TFF5 


— 














a 

2816 


D3gl 1? 


Homo sapiens 


cytochrome c oxidase subunit 3 


— = 


— 


2817 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


TZ* 











7 *l^; — - — 




-— 




2818 


X55733 


Homo sapiens 


initation factor 4B 


~3Tr 

— — 


-— 


2819 


^A-J OA 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 












7 ** 82 , ^ j- r-x 








2820 




- 12 


Homo sapiens 


cytoc romecoxi ase su uni 


~^47 


68 


2821 


AF0 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


~7Ta 


-|| 


2822 


R13556 


Homo sapiens 


Protein encoded downstream of hhc NI 


~TT8 










oncoprotein. 






— — 

zt nA 


Aci in/wo 


— : 

Homo sapiens 


PR02852 


~143 


~54 


2824 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


~uui 











: — — 




— 


. 

2825 


Y76198 


— ; 

Homo sapiens 


Human secreted protein encoded by 


161 


67 














7g26 


AF1 19851 


Homo sapiens 


PRO 1722 


104 


70 


2827 


D49489 


Homo sapiens 


human P5 


523 


91 


2828 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


206 


81 


2829 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 


138 


42 








oncoprotein. 






2830 


AF090931 


Homo sapiens 


PRO0483 


97 


80 


2831 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


65 



203 



SNSDOCID: <WO C 



_016483SA2_I_> 



WO 01/64835 



PCT/USO 1/0492 7 



SEQ1D 


Accession No. 


Species 


Description 


Smith- 
Wntcrman 


__ % 

Idcntil 

y 








7871 






2832 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


112 


35 


2833 


X92485 


vivax° diUm 


pval 


103 


43 


2834 


R32010 


~H mo '' 


— — — . 

Rpl 5- TIA- 1. 


152 


54 


2835 


G00357 


"Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


153 


59 


2836 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRJB2 


148 


59 


2837 


AF09093 1 


- Homo sa ien 


PR 00/181 


149 


81 


2838 


AF130089 


" "Homo sapient 


ri\L/Z -J 


123 


49 


2839 


AF090942 


Homo sapiens 


PRO0657 


135 


54 j 


2840 


G03787 




Human secreted protein, SEQ ID NO - 
7868. 


124 


54 


2841 


AF090895 




ponni n 


146 


55 


2842 


AF 132200 


Homo sapiens 


PRO 1751 


98 


74 


2843 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


194 


75 


2844 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


61 


59 


2845 


AL3901 14 


Leishmania 
"? a -° r : 


extremely cysteine/valine rich protein 


171 


37 


2846 


G00328 


Homo sapiens 


. 

Human secreted protein, SEQ ID NO: 

4409 - 


155 


69 


2847 


Y0267 1 


— : 

Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


141 


58 


2848 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


108 


65 


2849 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


441 


77 


2850 




Homo sapiens 


cytochrome c oxidase subunit 1 


490 


88 


2851 


K02401 


Homo sapiens 


chorionic somatomammotropin 


488 


90 


2852 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


49 


2853 


U83280 


Leishmania 
Homo^sa ie 7 


39 kDa antigen 


131 


47 


2854 


AF090944 




rlvUIJOOj 


173 


72 


2855 


V00662 


Homo sapiens 


ATPase 6 


392 


71 


2856 


R13556 




Protein encoded downstream of hhc M 
oncoprotein. 


147 


50 


2857 


Y85062 


Homo sapiens 


Interleukin I converting enzyme 
homologue (ICEL) protein sequence. 


113 


76 


2858 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


178 


84 


2859 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


192 


92 


2860 


Z81068 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF00102 (Protein-tyrosine 
phosphatase), Score=232.1, E- 
va!ue=2.6e-66, N=l~cDNA EST 
yk299h6.3 comes from this 
gene-cDNA EST yk420b4.3 comes 
from this gene-cDNA EST yk439g6.3 
comes from this gene-cDNA EST 
yk299h6.5 comes from this 
gene-cDNA EST yk420b4.5 comes 
from this gene-cDNA EST yk439g6.5 
comes from this gene 


104 


36 


2861 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


444 


85 


2862 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


168 


61 


2863 


U93564 


Homo sapiens 


Dutativepl50 


132 


34 



204 



SDOCID: <WO_ 



_0164835A2J_> 



WO 01/64835 PCT/US01/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 












y 


2864 


W80406 


Homo sapiens 


A secreted protein encoded by clone 


1 52 


50 








dh40 3. 






2865 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


76 








7878. 






2866 


G03I72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


46 








7253. 






2867 


AF090930 


Homo sapiens 


PRO0478 


161 


61 


2868 


Y30822 


Homo sapiens 


Human secreted protein encoded from 


1 17 


48 








gene 12. 






2869 


U15647 


Mus musculus 


reverse transcriptase 


197 


40 


2870 


U09500 


Homo sapiens 


cytochrome b 


394 


68 


287] 


U09500 


Homo salens 




614 


92 


2872 


D381 12 


Homo sa^len"! 


NADHdeh dro enase subunit 4 







2873 


D3 8 1 1 2 


omo sapiens 


NADH deh dr enase ^bu^t 4 











Homo sapiens 


nauh dehydrogenase subunit 4 


~?Tfi 


— 


2875 


Y30822 


Homo sapiens 


"en 1 e ia i2 SeCreteCl Pr ° tC * n encoded ^ om 


— 


~ 








° " - . ' j- r-p 






~2876 


p38 ^ ]2 


— rj : 

Homo sapiens 


NADH dehydrogenase subunit 5 


~7oa. 


~~q2 


2877 


G02538 


Homo sapiens 


Human secreted protein SEQ ID NO' 


"760 


-~ 








6619. 






2878 


U32 1 74 


Dictyostelium 


non-receptor tyrosine kinase 


"720 


— 






discoideum 








2879 


D381 12 


Homo sapiens 


ATPase subunit 6 


141 


54 


2880 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




84 








7124. 






2881 


y 144 g 2 


~H ie 

omo sapiens 


Fragment of human secreted protein 


— 

■ 


~7Q 














2882 


V00662 


Homo sa iens 

omo sapiens 


ATPase 6 


"Po 


~~R7 


2883 


AL3901 14 


eis ania 


probable proteophosphoglycan 


"TT7 


— 






"^° r : 








"2884 


A PI 1 OR^I 


Homo sapiens 


PRO 1 722 


— 


— 

~zi 


~9S85 


X924S5 


Plasmodium 


pval 


—j— 
















2886 


V00662 


Homosa iens 

Ho" 1 " salens 


ATPase 6 


~F>7 


— 


2887 


D381 12 


omo sapiens 


NADH deh dro enase ub — it~4 

"TJ ey ogenase su ujti 


"rih 





2888 


G03789 


Homo sapiens 


Human secreted protein, ocQ ID NO: . 


— ^ — — 


— ^ 








7870. 






9g89 


X92485 


~Pi cTum 

asmo lum 


pval 


~Tcn 


-— 














2890 


Ml 0546 


Homo sa iens 


cytochrome oxidase 1 


163 


~~89~ 


2891 


AF090942 


Homo sa iens 


PRO0657 




T7 


2892 


G00416 


Homo salens 

omo sapiens 


Human secreted protein SEQ ID NO* 


~TJ6 


-— 








4497. 






2893 


G03790 


Homosa iens 

om p 


Human secreted protein, SEQ ID NO: 


139 


~66 








7871. 






2894 


AK024455 


Homo sa iens 

omo sapiens 


FLJ00047 protein 


— r^q 


— 


2895 


AL359782 




possible (hhv-6) ul 102, variant a dna, 


"TTi 


— 






brucei 








2896 


G02532 


Homosa iens 

P 


Human secreted protein SEQ ID NO- — 


182 


— 








6613. 






2897 


G037 14 


Homo sa iens 

om s p ns 


Human secreted protein SEQ ID NO* 


"143" 


— 








7795. 






2898 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


248 


53 


2899 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


366 


68 


2900 


X07882 


Homo sapiens 


Po protein 


146 


41 


2901 


V00662 


Homo sapiens 


cytochrome B 


517 


91 


2902 


D38115 


Pongo pygmaeus 


cytochrome c oxidase subunit 3 


378 


67 


2903 


AF09093 1 


Homo sapiens 


PRO0483 


117 


67 


2904 


AF003540 


Homo sapiens 


Krueppel family zinc finger protein 


99 1 


54 



205 



NSOOCID: <WO 016183SA2_I_> 



PCT/US01/04927 ' 



SEQ ID 
NO. 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2905 


AF 130079 


Homo sapiens 


PR02852 


111 


55 


2906 


D38112 


Homo sapiens 




--i^Z . 


79 


2907 


AF202635 


Homo sapiens 


PP1200 


140 


66 


2908 


AF041330 


Bodo saltans 




6 


4 1 


2909 


U63542 


Homo sapiens 


FAP protein 




75 


2910 


D3S114 


Gorilla gorilla 


cytochrome c oxidase subunit 1 (COI) 




72 


2911 


AP00203] 


Arabidopsis 
thaliana 


gene id:K3D20.3~ 


-— -| 

118 


35 


2912 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


- — — 

. 4:>5 


80 


2913 


D38116 


Pan paniscus 






83 


2914 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


480 


80 


2915 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


488 


86 


2916 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


519 


80 


2917 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


172 


80 


2918 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 




75 


2919 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


458 


79 


2920 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


65 


2921 


V00662 


Homo sapiens 


LJIvr u^Aun aenyarogenase subunit) 


324 


87 


2922 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


530 


84 


2923 


AF157321 


Homo sapiens 


30 kJDa protein 


370 


68 


2924 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


174 


71 


2925 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


503 


88 1 


2926 


U47624 


Xenopus laevis 


alpha(E)-catenin 


243 


94 


2927 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


135 


47 


2928 


D38112 




cytochrome c oxidase subunit 3 


476 


78 


2929 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


465 


81 


2930 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


58 


2931 


D164S0 


Homo sapiens 


enoyl-CoA hydratase/3-hydroxyacyl- 
CoA dehydrogenase alpha-subunit of 
trifunctional protein 


345 


67 


2932 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


212 


95 


2933 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 

/V 1 a (AfU /804J ) 


123 


69 


2934 


X83427 


Ornithorhynchus 
anatinus 


cytochrome c oxidase subunit 1 



460 


82 


2935 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


527 


83 


2936 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


448 


92 


2937 


Y86248 




Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


183 


77 


2938 


X15081 


Crithidia 
fasciculata 


MURF2 protein (AA 1-348) 


105 


42 


2939 


AF194537 


Homo sapiens 


NAG 13 


151 


52 


2940 


AJ271872 


Nicotiana 
sylvestris 


extensin 


165 


37 


2941 
2942 


AP000616 
D38112 


Oryza sativa 
Homo sapiens 


— Tj ■ 

Similar to KJNG-H2 finger protein 
RHAla(AF078683) 


134 


91 


2943 


D381 12 




cytochrome c oxidase subunit 3 
cytochrome c oxidase subunit 3 


387 
509 


69 
79 


2944 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


108 




2945 


J01415 


Homo sapiens 


MTND4 


429 


76 


2946 


ACO 10793 


Arabidopsis 
thaliana 


F20B17.16 


99' 


41 


2947 


AF116712 


Homo sapiens 


PR02738 


132 


60 


2948 


X89658 


Homo sapiens 


CAP- 18 protein 


136 


49 



206 



3DOCID: <WO. 0164835A2_I_ 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Watermnn 


% 

ldentit 

y 




V0066 7 






449 


86 


2950 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 4 


438 


82 






Homo sapiens 




363 


76 


2952 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


129 


92 


Q« 

295 





Crithidia 

oncopelti 


NADH dehydrogenase subunit 5 


102 


32 


2954 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


110 


68 


2955 


Y76287 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 20. 


345 


78 





. ^ 


Homo sa iens 


cytochrome c oxidase subunit 3 


340 


71 


2gyj 


D ogj p 


Homo sapiens 


cytochrome c oxidase subunit 1 


456 


83 


2958 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


102 


76 


~2959 


G0 o 5''2 


~Tj : 

Homo sapiens 


Human secreted protein SEQ ID NO: 
6613. 


96 


84 


2960 




Homo sapiens 




400 


70 


2961 


X69978 


Homo sapiens 


XP-G factor 


539 


88 


2962 


AF203687 


Homo sapiens 


prolactin regulatory element-binding 


453 




2963 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


561 


83 


2964 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 






2965 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 






2966 


U52077 


Homo sapiens 


mariner transposas© 




83 


2967 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone rio/\\— jo 1 . 


130 


53 


2968 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 


103 


42 


2969 


X92485 


Plasmodium 


pval 


144 


42 


2970 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


63 


2971 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
1?^ ' j : — r, 


129 


63 


2972 


Y17221 


— : 

Homo sapiens 


Human secreted protein Qclone ik3 1 /- 
3)- 


~Q1 


56 


2973 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


101 


72 


2974 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


68 




2975 




— : 

Homo sapiens 


PRO2550 


167 


46 




vuuooz 






539 


79 







Oryza sativa" 5 


similar to RJNG-H2 finger protein 
RHAla (AF078683) 


121 


44 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


126 


72 


2979 




Homo sapiens 


prohn^dci^protein 


122 


30 










1 19 


39 


2981 


Z38128 


Mus musculus 


histone HI 


117 


36 




AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophiia DNA- 
binding protein K10 (NID:g8148) 


172 


51 


2983 


AC002291 


Arabidopsis 
thaliana 


extensin 


110 


35 


2984 


M10546 


Homo sapiens 


cytochrome oxidase 1 


295 


92 


2985 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


171 


65 



207 



3NSDOCID: <WO 0164835A2J_: 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Spe cie S 

- — : 


Description 


Smith- 
Waterman 
Score 


Identit 

y 


2986 


U 18985 


Homo sapiens 




triad in , 


416 


83 


2987 


AF1 55232 


Pisum sativum 


ex tens in 


159 


41 


2988 


X03 145 


~ Horn" S jj P ? enS 


pot. ORF V 


133 


44 


2989 


U 12690 




cytochrome oxidase subunit II 


565 


84 


2990 


G03789 


" Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


127 


53 


2991 


D381 12 




cytochrome c oxidase subunit 1 


538 


83 


2992 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


471 


75 


2993 




Pan troglodytes 


cytochrome c oxidase subunit 1 


511 


72 


2994 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


301 


79 




- yg^74 


Homo sapiens 


cytochrome c oxidase subunit 1 


526 


78 


2996 




Mesocricetus 
auratus 


cytochrome c oxidase chain I 


634 


82 


2997 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68,- 
SEQ ID NO: 163. 


158 


76 


2998 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


103 


50 


2999 


V00662 


Homo sapiens 


cytochrome oxidase I 


535 


78 


3000 


L38908 


Nicotiana 
tabacum 


extensin 


146 


38 


3001 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


494 


87 


3002 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


111 


66 


3003 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


117 


46 




ATI ^Q7C> 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


93 


47 






Arabidopsis 
thaliana 


extensin 


118 


37 




Apl 1 CC/1Q 

/\r l l JJH7 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


150 


44 


3007 


G03556 


Homo sapiens 

— — . 


Human secreted protein, SEQ ID NO: 
7637. 


114 


73 


"3008" 




Homo sapiens 


Synthetic human type III collagen 
SYN-C3. 


111 


39 


3009 




Homo sapiens 
— __ 


Human secreted protein, SEQ ID NO: 
7124. 


109 


84 


~3010 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


212 


77 






Primus dulcis 


extensin 


129 


42 


3012 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


71 


76 


3013 


Z38128 


Mus musculus 


histone HI 


104 


32 




X56010 


Sorghum bicolor 


hydroxyproline-rich glycoprotein 


102 


34 


~3~o7I — 




Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


128 


80 


3016 


Y34068 


Homo sapiens 


Histone HI isoform HI. 5 pANCA- 
reactive fragment (residues 69-226). 


99 


34 


3017 


X5g685 


1 = 

Lycopersicon 

esculentum 


extensin (class I) 


142 


32 






Panulirus argus 


cytochrome c oxidase subunit I 


125 


71 


3019 


G03597 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7678. 


175 


68 


~3020 


U93564 


Homo sapiens 


putative pi 50 


97 


57 


3021 


D38116 


Pan paniscus 




555 


82 


3022 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


147 


76 


3023 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


632 


82 


3024 


AF155232 


Pisum sativum 


extensin 


181 


42 


3025 


V00662 


Homo sapiens 


cytochrome oxidase I 


567 1 85 



208 



WO 01/64835 



PCT/U SO 1/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 


„ . 


rfp7 


_ . 

Homo sapiens 


Human secreted protein SEQ ID NO- — 
7868. 


130 


~S8 


~3027 


WOO 83 8 


Homosa iens 

01710 V 


product 


141 


67 


~3028 


AF1 30089 


Homo sa iens 


PRO2550 


149 


55 


"3029 


AF090944 


Homo sapiens 


PRO0663 


168 


76 


3030 


D3S112 


Homo sapiens 


» ADH dehydrogenase subunit 2 


122 


92 





I2 






507 








Homo sapiens 


NAJ^H dehydrogenase subunit2 




~w — 


3033 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


99 


75 


30j4 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 




""If 






Homo sapiens 


putative pi 50 


"T7l 




3036 


X92485 


Plasmodium 


pval 


122 


41 


3037 


AF202635 


Homo sapiens 


PP1200 


128 


67 


3038 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene zz clone riivio J wis. 


151 


83 




. 


— : 

Homo sapiens 


cytochrome c oxidase subunit 1 


""415 


""74 


3040 


D86853 


Catharanthus 


extensin 


198 
— — 


40 


3041 


ivi /Oj40 


Helianthus 
annuus 


nydroxyproline-nch protein 




— 


3042 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


377 


95 


3043 


S80119 


Rattus sp. 


reverse transcriptase homolog 


106 


50 


3044 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


137 


55 


3045 


Y36156 


Homo sapiens 


Human secreted protein #28. 


91 


58 


3046 


U93574 


Homo sapiens 


putative pi 50 


177 


49 


3047 


AF063866 


Melanoplus 
sanguinipes 
entomopox virus 


ORF MSV233 hypothetical protein 


98 


37 


3048 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


568 


87 


3049 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


63 


3050 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


80 


3051 


AK027208 


Homo sapiens 


unnamed protein product 


145 


72 


3052 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 






3053 


AF1 18082 


Homo sapiens 


PRO 1 902 






3054 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 



118 


41 


3055 


X92485 




Plasmodium 
vivax 


pva 


~P2" 


"J 5 


3056 


G 00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 






3057 


AL3901 14 


Leishmania 
major 


extremely cysteine/valine rich protein 






3058 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


_^|2 





3059 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 

— — 


57 

— 


3060 


., - 
G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 






3061 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


159 


80 


3062 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


529 


80 


3063 


AF130051 


Homo sapiens 


PRO0898 


159 


71 


3064 


AF195418 


Mus musculus 


ODZ3 


386 


94 


3065 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 


109 


51 



209 



WO 01/64835 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 
Watermnn 
__Score 


% 

Identit 

y 








clone HCACJ81. 






3066 


D38112 


Homo sapiens 


NADH dehydrogenase subunit2 


338 


~~80 


3067 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


120 


50 


3068 


U 15647 


Mus musculus 


reverse transcriptase 


"T34 ' 


- — 


3069 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


397 





3070 


AF 130089 


Homo sapiens 


PRO2550 


149 


55 


3071 


U93565 


Homo sapiens 


putative p 1 50 




37 


3072 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


112 


45 


3073 


AF090930 


Homo sapiens 


PRO0478 





78 


3074 


AF130089 


Homo sapiens 


PRO2550 





58 


3075 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO - 387 


267 


65 


3076 


U83280 


Leishmania 
donovani 


39 kDa antigen 


— 


51 


3077 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


~626 


g 


3078 


AF1 16712 


Homo sapiens 


PR02738 


114 


59 


3079 


U42471 


Mus musculus 


homolog 






3080 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


-4^ 


. 80 


3081 


D38112 


Homo sapiens 








3082 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


~| 




3083 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 


150 


41 


3084 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO - 163 


191 


97 


-3085 


U 12690 


Homo sapiens 


cytochrome oxidase subunit 11 


537 


78 


3086 


D381 12 


^"scrofa' 61 " 


ATPase subunit 6 


300 


58 


3087 


X86031 




homologue of proline/arginirie-rich 
antibacterial protein 


104 


37 


3088 


K02247 


Rattus 


proline-nch protein 


108 


41 


3089 


W48351 


Homo sapiens 


Human breast csmcQr related protein 
BCRB2. 


— — 


48 


3090 


V00662 


Homo sapiens 


cytochrome oxidase I 


-— 




84 


3091 


Ml 8094 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


"221 




3092 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 






3093 


D38116 


Pan paruscus 


cytochrome c oxidase subunit 1 


593 


84 


3094 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 






3095 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


369 


98 


3096 


L26953 


Homo sapiens 






57 


3097 


G00591 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4672. 


124 


57 


3098 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


152 


91 


3099 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


96 


3100 


AC006233 


Arabidopsis 
thaliana 


hypothetical protein 


65 


44 


3101 


AF289098 


Cladrastis 
kentukea 


ENOD2 


107 


35 


3102 


AF155232 


Pisum sativum 


extensin 


142 


43 


3103 


AF130089 


Homo sapiens 


PRO2550 


122 


81 


3104 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


143 


80 


3105 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


104 


59 



210 



WO 01/64835 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Watti man 


7; ■ 

Identit 

y 


3106 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


1 12 


76 


3107 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


117 


72 


3108 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


334 


85 


3109 


U43627 


Arabidopsis 
thai iana 


extensin 


1 12 


30 


3110 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


118 


100 


3111 


AF130089 


Homo sapiens 


PRO2550 


137 


37 


31 12 


Y86248 




Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


220 


93 


-lyi^ — 








128 


32 




Y86248 


Homo~sap1e^ 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


176 


63 


3115 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


92 


69 




X92485 


Plasmodium 

vivax 




148 


45 




^23 


Homo sapiens 


si^^a^to^myo^a^ antigen 




80 


"frri — 




Caenorhabditis 
elegans 


(SP:CS24_HUMAN, P23246) and D. 
(PIR:JH0162) 


125 


41 


3119 


U43627 


Arabidopsis 


extensin 


129 


32 


3 120 


AF205385 


Pan troglodytes 


NADH dehydrogenase subunit 5 


171 


82 


^121 


Y12950 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


90 


59 


3122 


AF130051 


Homo sapiens 


PRO0898 


134 


82 


j 123 


M81321 


fescicuJaris 


proline-rich protein 


158 


48 


3124 


U93563 


Homo sapiens 


putative pi 50 


126 


32 








NADH dehydrogenase subunit 1 (ND1) 


337 


92 


3126 


AF130089 


Homo sapiens 


PRO2550 


155 


77 






Homo sapiens 


Human secreted, protein HCHPF68 
SEQ ID NO: 163. 


162 


76 




AF0Q0901 




PRO0195 


109 


53 


3129 


AF003736 


Murine leukemia 


reverse transcriptase 


164 


40 


3130 


U25281 


Rattus 


SH3 domain binding protein 


88 


39 


3131 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


116 


71 


3132 


D86853 


Catharanthus 
roseus 


extensin 


1 10 


33 


3133 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-I). 


506 


76 


~3T^ — 


V00662 




' Fragr^enrof h^m^i secreted protein 


583 


88 




Y36366 


Homo sapiens 

omo sapiens 


encoded by gene 3. 


103 


68 


3136 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


282 


60 


3137 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


119 


54 


3138 


AF1 18082 


Homo sapiens 


PRO 1 902 


118 


51 


3139 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


270 


87 


3140 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


103 


48 
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SEQ ID 

NO: 






Description 


Waterman 
Score 


Identit 

y 


3141 


D3 8 1 1 2 


Homo sapiens 


NADH dehydrogenase subunit 5 


353 


74 


3142 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


126 


77 


3143 


Y86248 


Homo sapiens 


Human secreted protein HCHPF6S, 
SEQ ID NO: 163. 


115 


72 


3144 


AF090930 


Homo sapiens 


PRO0478 


138 


73 


3145 


AF 155232 


Pisum sativum 


extensin 


110 


34 


3146 


M69008 


Homo sapiens 


alpha- 1 type XIII collagen 


107 


37 


3147 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-b.2 finger 
protein rhala. . 


103 


63 


3148 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


505 


84 


3149 


M77194 


Rat leukemia 
virus 


polymerase 


167 


35 


3150 


AP002543 


Arabidopsis 
thaliana 


gb|AAD23015.1~gene_id:F15M7.16~si 
milar to unknown protein 


105 


31 


315] 


AK024455 


Homo sapiens 


FLJ00047 protein 


109 


59 


3152 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


89 


39 


3153 


J04794 


Homo sapiens 


aldehyde reductase (EC 1 . 1 . 1 .2) 


137 


58 


3154 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


115 


70 


3155 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


160 


53 


3156 


X92485 


Plasmodium 

vivax 


pval 


104 


50 


3157 


AF 194537 


Homo sapiens 


NAG 13 


207 


60 


3158 


AF063693 


Mus musculus 


type XIII collagen 


104 


37 


3159 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


130 


91 


3160 


AF020191 


Mus musculus 


proline-rich protein 13 


107 


51 


3161 


AF194537 


Homo sapiens 


NAG 13 


170 


76 


3162 


G02535 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6616. 


103 


41 


3163 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. . 


161 


58 




K03205 


Homo sapiens 


salivary proline-rich protein precursor 


139 


46 


3165. 


U93570 


Homo sapiens 


putative pi 50 


151 


68 


3166 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


144 


71 


3167 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


147 


66 


3168 


L27428 


Homo sapiens 


reverse transcriptase 


94 


51 


3169 


X97675 


Homo sapiens 


plakophilin 2b 


103 


76 


3170 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


115 


54 


3171 


AK024455 


Homo sapiens 


FLJ00047 protein 


98 


61 


j 172 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


146 


38 


3173 


D38116 


Pan paniscus 


NADH dehydrogenase subunit 4 


467 


81 


3174 


X92485 


Plasmodium 
vivax 


pval 


88 


69 


3175 


L26953 


Homo sapiens 


chromosomal protein 


124 


78 


3176 


AF042169 


Homo sapiens 


putative ATP-dependent mitochondrial 
RNA helicase 


223 


95 


3177 


AF130079 


Homo sapiens 


PR02852 


96 


44 




AF2732 1 7 


Mus musculus 


cell proliferation related protein CAP 


97 


36 


3179 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7496. 


90 


69 


3180 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


146 


78 


3181 


AF 118082 


Homo sapiens 


PRO 1902 


150 


58 


3182 


U60315 


Molluscum 
contagiosum 


MC107L 


101 


44 



21-2 



■ISDOCID: <WO 016483SA2J_> 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 






virus subtype 1 










U49973 


Homo sapiens 


ORFlj M F.R3 7 1 putative transposase 
similar to pogo element 






J^T13 — 




Homo sapiens 


putative p 1 50 


265 






Y86248 


Homo sapiens 


Human secreted protein HCHPF68 
SEQ ID NO: 163. 




-~ 


3186 


U87607 


Rattus 


putative RNA binding protein 1 


167 


o9 


3187 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


160 


63 


3188 


S80119 


Rattus sp. 


reverse transcriptase homolog 


152 


49 






norvegicus 












Homo sapiens 


Neuron-associated protein. 






3191 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


83 


3192 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

. rj : rj j : 


135 


80 


~3193 


a 17AAAOOQ 


— . 

Caenorhabditis 

elegans 


weak similarity to collagens; glycme- 
and proline-rich 


~98 


~34 


3194 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 






3195 


AF022985 


Caenorhabditis 
elegans 


Similar to collagen 






3196 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 






3197 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

^ 4 92. , — : — — — 


110 
— — — 


58 
— 


3198 


X99452 


Lycopersicon 
escu lentum 


extensin-like protein Dif54 






3199 


D86853 


Catharanthus 


extens'in 


119 


36 


3200 


AF025424 


Rattus 


RNA polymerase I 127 kDa subunit 


215 


74 


3201 


U93563 


Homo sapiens 


putative p i 50 


359 


77 






Homo sapiens 








3203 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


84 
— 






Homo sapiens 


Human secreted protein SEQ ID NO' 
4770. 


T>8 








Homo sapiens 


Human secreted protein SEQ ID NO* 
4718. 










Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 






_3207 


^£Z£Zj 


Homo sapiens 


plakophiiin 2b 


~oT^ 


~H — 


3-08 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 






3209 


AB026512 


Ecnomiosa sp. 


cytochrome c oxidase subunit I 


254 


62 








omo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


163 


59 


■ , 


D3 8112 


"h : 


ATPase subunit 6 


447 


78 





V UUDOZ 


Homo sapiens 


ATPase 6 


482 




3213 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


151 


66 


3214 


D38112 


Homo sapiens 


ATPase subunit 6 


400 


72 


3215 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


569 


88 


3216 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


472 


82 


3217 


AF090930 


Homo sapiens 


PRO0478 


96 


64 


3218 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


46 



213 



NSDOCID: <WO 016483SA2_I_> 



SEQ ID 

NO: 


Accession No. 




Description 


Smith- 
Waterman 


% 

Identit 

y 








7253. 






3219 


AF006061 


Homo sapiens 


hGH-V3 ° 


290 


68 


3220 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


1 1 5 


63 


322] 


D381 12 










3222 


G00354 


Homo sapiens 


Human secreted protein. SEQ ID NO: 


111 


59 


3223 


p3S1 12 


Homo sa iens 


cytochrome c oxidase subunit 3 


~497 


— 


3224 


L26953 


Homo sapiens 


chromosomal protein 


95 


59 


o225 




Leishmania 


39 kDa antigen 


1 17 


90 


3226 


AF239615 


Manihot 

Homo^a iens 


CRANTZ hydroxyproline-rich 
glycoprotein 


124 


43 


3227 


D381 12 






~44? 


— 


3228 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


172 


67 


3229 


X92591 


Mus musculus 


transcription factor 


109 


43 


3230 


G 02480 


Homo sapiens 


Human secreted protein, SEQ JD NO: 


81 


54 


3231 


D38115 


Pongo pygmaeus 


cytochrome c oxidase subunit 1 


476 


72 






Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


100 


54 




U9641 6 


Dennyus 
distinctus 
timjonesi 


cytochrome b 




40 


[ 

3234 


i 


Homo sapiens 


Human secreted protein HCHPF68, 

qca yr-v \TA. 1 £1 
I U NU.IOJ. 


161 


86 


"32^5 


Ivl /OJHO 


r 

Helianthus 

annuus 


hydroxyproline-rich protein 






3236 


G03 172 




7253. 






3237 


D381 12 










3238 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


576 


89 






Homo sapiens 








3240 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


79 


3241 


W48351 


Homo sapiens 


Human breast cancer related protein 


145 


65 


3242 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


76 


62 


3243 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


69 


3244 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


39 


3245 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
f}^' n : — — 


138 


54 


"3246 


G0221 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


"Tsl 


_ 






Caenorhabditis 
elegans 


Similar to cuticular collagen 






3248 


LJS7607 


Rattus 


putative RNA binding protein 1 . 


148 


45 


3249 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


113 


85 


3250 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


166 


61 


3251 


U15183 


Mycobacterium 
leprae 


proline-rich antigen 


109 


40 


3252 


AF130089 


Homo sapiens 


PRO2550 


.151 


73 
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WO 01/64835 PCT/US01/04927 



NO: 


Accession No. 


Spec.cs 


Description 


Smith- 
Waterman 


Identit 


3253 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


131 


58 


3254 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO.163. 


162 


80 


3255 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


108 


57 


3256 


J0I415 


Homo sapiens 


cytochrome oxidase subunit 3 


156 


88 


3257 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


206 


93 


3258 


Y36203 


Homo sapiens 


Human secreted protein #75. 


111 


77 


3259 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


98 


57 


3260 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


345 


67 


3261 


Y86248 ( 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


97 


56 


3262 


L38908 


Nicotiana 
tabacum 


extensin 


155 


40 


3263 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


95 


70 


3264 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


198 


87 


3265 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


115 


69 


3266 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


158 


59 


3267 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


105 


56 


3268 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


159 


72 


3269 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


200 


95 


3270 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


138 


73 


3271 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


151 


90 


3272 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


193 


95 


3273 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


111 


52 


3274 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


148 


87 


3275 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


161 


86 


3276 


S62928 


Homo sapiens 


PRB 1 M protein precursor 


104 


34 


3277 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


154 


55 


3278 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


166 


100 


3279 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


212 


91 


3280 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


118 


50 


3281 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


130 


46 


3282 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


303 


95 


3283 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


455 


81 


3284 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


502 


83 


3285 


AF 1 18086 


Homo sapiens 


PRO 1992 


99 


88 


3286 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


489 


83 
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"JSDOCID: <WO 01 64835A2J_> 
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SEQID 
NO: 


Accession No. 




Description 


Smith- 
Waterman 


% 

Idcntit 

y 


3287 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


485 


88 


3288 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


124 


73 




AL3597S2 


Trypanosoma 
brucei 


possible (hhv-6) u 1 1 02, variant a dna, 
complete virion genome. 


158 


60 




X92485 


Plasmodium 
vivax 


pval 


99 


58 


3291 


AFl 16712 


Homo sapiens 


PR02738 


102 


51 


3292 


.API 19851 


Homo sapiens 


PRO 1722 


99 


86 


3293 


AF090930 


Homo sapiens 


PRO0478 


141 


72 


3294 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


151 


66 


3295 


M64793 


Ratrus 
norvegicus 


salivary proline-rich protein 


107 


41 


3296 


AK024455 


Homo sapiens 


FLJ00047 protein 


97 


53 


3297 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


156 


60 


3298 


AC007654 


Arabidopsis 
thaliana 


T19E23.7 


97 


49 


3299 


Y17221 


Homo sapiens 


Human secreted protein (clone fk3 17- 
3). 


96 


44 


3300 


AJ243905 


Caenorhabditis 
elegans 


SF1 protein 


104 


42 


3301 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


96 


41 


3302 


X89453 


Rattus 
norvegicus 


DRPLA 


83 


65 


3303 


U 10099 


Homo sapiens 


POM-2P3 


227 


52 


3304 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


553 


89 


3305 


L27428 


Homo sapiens 


reverse transcriptase 


144 


43 


3306 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


388 


71 


3307 


AF2 17374 


Acanthaster 
planci 


cytochrome oxidase subunit I 


439 


78 


3308 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


439 


82 


3309 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


492 


88 


3310 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


81 


65 


3311 


X53581 


Ratrus 
norvegicus 


ORF4 


140 


48 


JJ>12 


V00662 


Homo sapiens 


cytochrome B 


487 


88 


3313 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 1 


451 


80 


3314 


V00662 


Homo sapiens 


cytochrome oxidase I 


524 


88 


3315 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


518 


87 


3316 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


115 


50 


3317 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


356 


69 


3318 


U 12693 


Homo sapiens 


cytochrome oxidase subunit II 


477 


78 


3319 


Z67990 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF01484 (Nematode cuticle collagen 
N-terminal domain), Score=27.5, E- 
value=0.0001,N=l 


75 


40 


3320 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


126 


75 


3321 


AF090930 


Homo sapiens 


PRO0478 


166 


65 


3322 


X92485 


Plasmodium 
vivax 


pval 


104 


44 


3323 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


513 


80 


3324 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


483 


79 
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NSDOCID: <WO 0164835A2_I_5 



WO 01/64835 PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smifh- 
Waterman 


% 

Idcntit 

y 




— ■ 




Homo sapiens 




~wr 









Unknown 


Contains weak similarity to an 
unknown protein T"^3E18 5 


— — — 


66 


3327 


V00662 


Homo sa iens 




439 


^72 


3328 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


316 


~75 


3329 


AF09J711 


Homo sapiens 


splice variant AKAP350 


106 


52 




U93565 






125 




3331 


AF1 19851 


Homo sapiens 


PRO 1 722 


88 


72 






Homo sapiens 


cytochrome c oxidase subunit 3 






3333 


V00662 


Homo sapiens 


cytochrome oxidase I 


544 


90 






Homo sapiens 


cytochrome c oxidase subunit 1 






3335 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


168 


66 


3336 


AF146688 


Takifugu 
rubripes 


serine-threonine kinase 9 


101 


43 


3337 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


101 


50 


3338 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


1 10 


50 


3339 


V00662 


Homo sapiens 


cytochrome oxidase I 


458 


80 


3340 


K02401_ 


Homo sapiens 


chorionic somatomammotropin 




97 


3341 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493 • . 


1 12 


52 


"3349 


Xfisifis 


. : 

Vol vox carten 


extensin 


~T46 


4] 


3343 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


119 


51 


3344 ._j 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


559 


S4 




U 12690 


Homo sapiens 


cytochrome oxidase subunit II 






3346 


Ml 1897 


Mus musculus 


proline-rich salivary protein 


96 


41 






Mus musculus 


WW domain binding protein 7; WBP7 






3348 


U83280 


Leishmania 
donovani 


39 kDa antigen 


105 


40 


3j49 


K03205_ 


Homo sapiens 


salivary proline-rich protein precursor 


1 ,J 


38 






Homo sapiens 


cytochrome c oxidase subunit 1 






3351 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 

1 /XTTM? 1 \ 

1 (NDr-1 ). 


664 


88 


3352 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124 - , 


1 17 


85 


3353 




Brassica napus 


extensin 


— ?v» 


31 







Homo sapiens 


cytochrome oxidase I 


~7§f 








Macaca 
fascicularis 


proline-rich protein 




— j£ 




MlUJHO 


Homo sapiens 


cytochrome oxidase I 






3357 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


513 


80 


-ilil 


D3 nlj 2 


Homo sapiens 


cytochrome c oxidase subunit 1 





_Z£ — 




V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


_£z£ 




3360 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 




r 6 _ 




G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 






3362 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


125 


62 




Aruiouyy 


Mus musculus 


endonuclease/reverse transcriptase 






3364 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


569 


82 


3365 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


102 


40 


3366 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


143 


76 


3367 


G02409 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6490. 


145 


84 
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INSDOCID: <WO 0164835A2J„> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


, Description. ■ i,t'.,i . 


Smith- 
Waterman 


" % 1 

Identit 

y 


"3§H — 




Homo sapiens 


cytochrome c oxidase subunit 1 


587 


87 


— 


' D3S 1 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


549 


82 






Arabidopsis 
tbaliana 


extensin-like protein 


130 


39 


3371 


J04543 




synexin 




40 


3372 


Z29573 


Didelphis 


cytochrome c oxidase subunit 3 


154 


71 


3373 


U93568 


Homo sapiens 


putative pi 50 


176 


59 


3374 




Bos taurus 




146 


67 


3375 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


512 


75 






Homo sapiens 


cytochrome c oxidase subunit 1 


509 


78 


3377 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


135 


51 


3378 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain 1 


557 


82 


"3379 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


120 


35 


j380 


X92485 


Plasmodium 


pval 


124 


38- 


3381 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


78 


3382 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


108 


67 


3383 


AP000616 


Oryza sativa 


similar to RTNG-H2 finger protein 
RHAla(AF0786S3) 


115 


86 


3384 


X98296 


Homo sapiens 


ubiquitin hydrolase 


238 


83 






Mesocricetus 
auratus 


cytochrome c oxidase chain I 


494 


75 






Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:387. 


155 


66 


j j 87 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 
protein rhala. 


1 16 


65 


3388 


W73604 


Homo sapiens 


Human secreted protein clone aw92 1 . 


327 


62 


3389 


G03807 


Homo sapiens 

_ ; 


Human secreted protein, SEQ ID NO: 
J 888 " : 


141 


76 


3390 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


73 


61 


3391 


U35730 


Mus muscuius 


jerky 


152 


30 






Mus muscuius 


Wiskott-Aldrich Syndrome Protein 


100 


41 


3393 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


483 


77 






Pan troglodytes 


cytochrome c oxidase subunit 1 


539 


SI 


3395 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


608 


87 


3396 



X71602 


Nicotiana 
tabacum 


extensin 


104 


35 




L2695 ^ 


Homo sapiens 


chromosomal protein 


100 


60 


339g 




Cyanocitta 
cristata 


cytochrome oxidase I 


488 


76 






norvegicus 


ORF4 


140 


42 


"4lHT 


12 


Homo sapiens 


cytochrome c oxidase subunit 1 


597 


85 




D381 13 


Pan troglodytes 


cytochrome c oxidase subunit 1 


656 


89 


"5402^ 




Homo sapiens 


chromosomal protein 


117 


63 


3403 


AF052298 


silvestris 


Y box protein 




30 


3404 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


133 


53 


3405 


AF030144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


151 


65 


3406 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


277 


82 


3407 


U01849 


Trypanosoma 


ORF2 


94 


30 



218 



WO 01/64835 PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

ldentit 






brucei 








3408 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


1 15 


65 


3409 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


189 


86 


3410 


U93570 


Homo sapiens 


putative pi 50 


233 


52 


3411 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


104 


44 


3412 


AF090944 


Homo sapiens 


PRO0663 


97 


56 


3413 


X61048 


Hydra sp. 


mini-collagen 


128 


44 


3414 


AF1 1 8086 




PRO 1992 


128 


81 


3415 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


145 


61 


3416 


AF1 16695 


Homo sapiens 


PR02221 


169 


69 


3417 


U31778 


Human 
type 20 


putative 


107 


50 




X92485 


vivax 




97 












195 




3420 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No 1 07 


144 


69 


3421 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


71 


3422 


G02409 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6490. 


111 


63 


3423 


X15917 


Paramecium 
aurelia 


ORF4 protein (AA 1-156) 


93 


41 


~^P4 


qq25\4 


Homo sapiens 


— : — — 

Human secreted protein, SEQ ID NO: 
6595. 


~99 


~69 


j>425 




Homo sapiens 


Human secreted protein SEQ ID NO* 
4414. 




60 


3426 


Y14486 


Homo sa iens 

omo pi n 


hydroxymethyltransf erase 


196 


61 


3427 


U93569 


Homo sapiens 


putative pi 50 


110 


44 


3428 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7496. 


120 


67 


3429 


ABO 12223 


Canis familiaris 


ORF2 


169 


44 


3430 


X97675 


Homo sapiens 


plakophilin 2b 


118 


60 


3431 


AF220264 


Homo sapiens 


MOST-1 


153 


72 


3432 


X64 1 73 


Zea 

diploperennis 


hydroxyproline-rich glycoprotein 


104 


41 


3433 


AF1 13685 


Homo sapiens 


PRO0974 


104 


72 


3434 


AB002306 


Homo sapiens 


KIAA0308 


282 


83 


3435 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


113 


83 


3436 


AB012223 


Canis familiaris 


ORF2 


108 


43 


3437 


AK024455 




FLJ00047 protein 


100 


65 


3438 


U52077 


Homo sapiens 


mariner transposase 


381 


64 


3439 








425 


84 


3440 


AK024455 


Homo sapiens 


FLJ00047 protein 


123 


63 


3441 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
6705. 


90 


78 


3442 


Y36156 


Homo sapiens 


Human secreted protein #28. 


126 


71 


3443 


AF191032 


Myxine 
glutinosa 


RJNG3 


84 


63 


3444 


W48927 


Homo sapiens 


Schwannomin-binding protein C- 
terminal fragment. 


141 


57 


3445 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


88 


65 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Waterman 


% 

Identit 

y 


3446 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32 


157 


67 


3447 


U 12690 


Homo sapiens 


cytochrome oxidase subunit 11 


9 


86 


3448 


AF1 12481 


Homo sapiens 


RAD54B protein 


- 


87 


3449 


G00497 


Homo sapiens 


Human secreted protein. SEQ ID NO: 


98 


55 


3450 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
olil^ ID iNC. ioj. 


133 


90 


3451 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


418 


82 


3452 




Homo sapiens 


PR02852 


1 10 


80 


3453 


Y27868 


Homo sapiens 
— . 


Human secreted protein encoded by 
gene No. 107. 


143 


73 


"3454 




Homo sapiens 


bromodomain adjacent to zinc finger 
domain 2A 


447 


85 


3455 


G02480 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
ff 61 ' — 


134 


62 


3456 


G03787 


~U : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


79 


68 


3457 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


545 


84 


3458 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


83 


3459 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


277 


80 


3460 


AK000867 


Homo sapiens 


unnamed protein product 


474 


98 


3461 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


123 


52 


3462 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


89 


30 


3463 


U93568 


Homo sapiens 


putative pi 50 


100 


46 


3464 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


156 


43 


3465 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


340 


88 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


1 19 


100 




BO 1372 


Homo sapiens 


Neuron-associated protein. 


142 


75 


3468 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


103 


40 


3469 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


80 


3470 


W54282 


Homo sapiens 


Protein sequence of the di-alpha 
haemoglobin gene contained in pSSl. 


207 


89 


3471 


AF1 16661 


Homosa ie 

omo sapiens 




1 19 


47 


3472 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


129 


59 


3473 


AF130089 


Homo sapiens 


PRO2550 


108 


86 


3474 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 13 


80 


3475 


V00662 


Homo sapiens 


ui\j -+ (Lvf\uri aenyarogenase suounitj 


590 


79 


3476 


AF 118086 


Homo sapiens 


PRO 1992 


126 


69 


3477 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


147 


47 


3478 




Homo sapiens 


MTND4 


482 


78 


3479 


X03404 


Bos taurus 


alpha subunit (aa 1-394) 


583 


89 






Homo sapiens 


cytochrome c oxidase subunit 1 


514 


84 


3481 


G03628 


Homo sapiens 


Human secreted protein SEQ ID NO* 
7709. 






3482 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


121 


61 


3483 


X83427 


Omithorhynchus 
anatinus 


cytochrome c oxidase subunit 1 


508 


78 


3484 


Y07754 


Homo sapiens 


Human secreted protein fragment 


549 


93 



220 



ISDOCID: <WO 0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Idcntit 










_Score 


y 


■ 






j i r~ n 

encoded from gene 1 1 . 






* P 


y - 


— : 

Homo sapiens 


-P°r..| 


"T^r 




— 

69 


— 


X97567 


Homo sapiens 




~§5T — ' — 


"1^ 


-4 . R Q 


K02401 


Homo sapiens 


— : : : 

chorionic somatomammotropin 




—£ 





A ~VC\ 1 O 1 A A 


Homo sapiens 


neuronal thread protein AD7c-NTP 


— — 

"Til 


71 




S6294 1 


Homo sapiens 


Ps 2~basic proline-rich protein(PRB 1 L 












precursor protein— basic proline-rich 












P recursorHC P termmal)' ^ ^ 












precu rM -ermin * — — 






3490 


S74728 


Homosa iens 




549 


84 


3491 


L13329 


~H 

omo sapiens 


■ , — — = — ° lf ° — e — 

i urona e- -su a ase 


"H?! — 










Homo sapiens 


beta tubulin 






j49j 


A T?f\Q 1 AQA 

/vrUol4o4 


Homo sapiens 


alpha-tubulin isoform 1 


578 


~ 

87 


3494 


U09823 


Oryctolagus 


elongation factor 1 alpha 


631 


89 






cuniculus 








3495 


AF081484 


Homo sapiens 


alpba-tubulin isoform 1 


616 


90 


3496 


M12140 


Homo sapiens 


envelope protein 


430 


53 


3497 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


70 








7795. 






3498 


W48352 


Homo sapiens 


Human breast cancer related protein 


104 


51 




. 




BCF ^T?- ... _. .. 






3499 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 


134 


54 










oncoprotein. 






_f-£2 


— ££j£o 


Homo sapiens 


Human secreted protein $28. 


-— 

_il£ 


— 


3501 


AF1 13685 


Homo sapiens 


PRO0974 


77 


_ 

81 


3502 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


62 


















. 

Homo sapiens 




.. 

164 


81 




3504 


AF09093 1 


Homo sapiens 


PRO0483 


103 


70 


3505 


U93570 


Homo sapiens 


p40 


258 


47 


3506 


ABO 16601 


Drosophila 


cytochrome c oxidase subunit I 


108 


82 






alpina 








3507 


AB016601 


Drosophila 


cytochrome c oxidase subunit I 


108 


82 






alpina 








3508 


AF194537 


Homo sapiens 


NAG 13 


449 


78 


3509 


WS8627 


Homo sapiens 


Secreted protein encoded by gene 94 


153 


60 








clone HPMBQ32. 






3510 


Z70684 


Caenorhabditis 


F28D1.8 


102 


37 






elegans 








351 1 


AC0081 13 


Arabidopsis 


F12A21.10 


90 


48 






thaliana 








3512 


M64791 


Rattus 


salivary proline-rich protein 


104 




— — — 




norvegicus 




— — 




— — i 




Homo sapiens 




~T^5 


— =5 

52 


3514 


AF240630 


Mus musculus 


IQ motif containing GTPase activating 


1>2 


67 




... 




protein 1 






3515 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


123 


48 








encoded by gene 90. 










Homo sapiens 


rJMwIUO J / 





_zZ 


■- 

3517 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


286 


56 


3518 


AF255661 


Crypthccodinium 


Dinap 1 -interacting protein 5; Dip5 


108 


36 






cohnii 












Homo sapiens 


Human secreted protein HCHPF68, 


209 


93 








SEQ ID NO: 163. 






3520 


Ml 1901 


Rattus 


proline-rich salivary protein 


102 


38 






norvegicus 








3521 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


64 






6561. 






3522 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


105 


42 



221 



-ISDOCID: <WO 016483SA2J_> 



WO '01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description,- .•}.'-,< v 


Smith- 
Waterman 

Score 


"~% 

Identit 

y 








complete virion genome. 






3523 


R95913 


" "Homo sapiens 


Neural thread protein. 


1 18 


72 


3524 


Y 8624 8 


Homo sapiens 


Human secreted protein HCHPF68, 

aty ID IND.lDJ. 


115 


59 


3525 


L27428 


Homo '' 


reverse transcriptase 


162 


50 


3526 


Y02749 


- 'Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


1 1 8 


58 


3527 


AF090930 


Homo sapiens 




146 


66 


3528 


G00528 


.Homo sapiens 


Human secreted protein, SEQ ID NO: 


169 


50 


3529 


AF194537 


Homo sapiens 


NAG 13 


119 


88 


3530 


U93564 




putative pi 50 


234 


84 


3531 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


208 


63 


3532 


U 12690 


Homo sapiens 


cytochrome oxidase subunit II 


275 


92 


3533 


AF09093 1 


Homo sapiens 


PRO0483 


126 


66 


3534 


W05278 


Homo sapiens 


Tumour necrosis factor-related gene 
product CL6.5.40. 


1 17 


85 


3535 


G03076 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
7157. 


173 


77 


3536 




Drosophila 
melanogaster 


ena polypeptide 


1 17 


45 




nrzuyuo i 


Eubranchipus sp. 


cytochrome c oxidase I 


127 


80 


3538 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


174 


85 




AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


118 


45 


3540 


R13556 


Homo sapiens 
. 


Protein encoded downstream of hhc M 
oncoprotein. 


88 


50 


3541 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


128 


71 


3542 




Homo sapiens 


hepatocyte nuclear factor-3/fork head 
homolog 1 1A 


1 1 1 


77 


3543 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


69 


3544 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


470 


80 


3545 


G03172 


Homo sapiens 
— 


Human secreted protein, SEQ ID NO: 
12^^. 


120 


54 


3546 


AL35978'' 


1 rypanosoma 


probable similar to ring-h2 finger 
protein rha 1 a. 


106 


42 


3547 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


584 


83 


3548 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
i rKTrMr i \ 
1 ^INL/r- J ). 


635 


95 


3549 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


661 


91 


3550 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


607 


86 


3551 


AK024455 




FLJ00047 protein 


88 


51 


3552 


W88957 


Homo sapiens 


Polypeptide fragment encoded by gene 
. 


500 


86 


3553 


D381 13 


Pan troglodytes 


NADH dehydrogenase subunit 4 


228 


95 


3554 


AF 130092 


Homo sapiens 


PRO2620 


165 


93 


3555 


AL121845 


Homo sapiens 


dJ583P 15.5.1 (novel protein (isoform 
1)) 


659 


91 


3556 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


370 


72 


3557 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


599 


88 


3558 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


578 


84 


3559 


AK000385 


Homo sapiens 


unnamed protein product 


138 


47 


3560 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


112 


61 


3561 


X97675 


Homo sapiens 


plakophilin 2b 


123 


65 


3562 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


62 



222 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 




Description 


Smirh- 


% 
y 








6595. 






3563 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


198 


90 


3564 


X86029 


Vigna 
unguiculata 


extensin-like protein 


141 


36 


3565 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


77 


60 


3566 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


1 10 


94 


3567 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


203 


93 


3568 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


146 


72 


3569 


S62928 


Homo sapiens 


PRB1M protein precursor 


102 


44 


3570 


D86853 


Catharanthus 


extensm 


123 


40 


3571 


□93574 


Homo sapiens 


putative pi 50 


279 


56 


3572 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


94 


82 


3573 


U45964 


Herpesvirus 


LMP1 


127 


35 


3574 


Y08319 


Homo sapiens 


kinesin-2 


254 


55 


3575 


X97675 


Homo sapiens 


plakophilin 2b 


134 


54 


3576 


AF003151 


Caenorhabditis 


contains similarity to an RNA 


137 


43 


3577 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


126 


66 


3578 


US0447 


Caenorhabditis 
elegans 


coded for by C. elegans cDNA 
ykl87f6 5' coded for by C. elegans 
cDNA ykl87f6.3; coded for by C. 
elegans cDNA ykl46fl 1 .5 


117 


50 




3579 





Caenorhabditis — 
elegans 


similar to myoblast cell surface antigen 
(SP:CS24_HUMAN, P23246) and D. 
melanogaster No-on-transient A protein 
(PIR:JH0162) 


126 


31 


3580 


AJ249395 


Globodera 
pallida 


NADH-ubiquinone oxidoreductase 
subunit 4 


85 


33 


3581 


U091 16 


Homo sapiens 


ORFI, encodes a 40 kDa product 


139 


55 


3582 


X97675 


Homo sapiens 


plakophilin 2b 


125 


73 


3583 


L26953 


Homo sapiens 


chromosomal protein 


101 


55 


35S4 


SS0343 


Homo sapiens 


arginyl-tRNA synthetase, ArgRS 


110 


67 


3585 


X97675 


Homo sapiens 


plakophilin 2b 


116 


84 


3586 


X92485 


Plasmodium 


pval 


124 


50 


3587 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


126 


56 


3588 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
7253. 


136 


56 


3589 


U63542 


Homo sapiens 


FAP protein 


128 


75 


3590 


D38112- 


Homo sapiens 


cytochrome c oxidase subunit 3 


642 


88 


3591 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 1 


612 


87 


3592 


K02401 


Homo sapiens 


chorionic somatomammotropin 


576 


86 


3593 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


108 


40 


3594 


V00662 


Homo sapiens 


cytochrome oxidase I 


459 


76 


3595 


AC008262 


Arabidopsis 
thaliana 


F4N2.10 


136 


31 


3596 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


126 


52 



223 



SNSDOCID: <WO 016483SA2_I_> 



WO 01/64835- 



PCT/US01/04927 



SEQ ID 


Accession No. ' 


Species 


Description 


Smith- 
Waterman 


Identit 

y 


3597 


AF1 16645 


Homo sapiens 


PRO 1708 


623 


91 


3598 


M23028 


Human 
herpesvirus 4 


nuclear antigen precursor 


121 


35 


3599 


W88816 


Homo sapiens 


Polypeptide fragment encoded by gene 
58. 


100 




3600 


X92485 


Plasmodium 
vivax 


pval 


156 


36 


3601 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


122 


78 


3602 


AK000496 


Homo sapiens 


unnamed protein product 


150 


68 


3603 


AF083929 


Mus musculus 


ES18 


98 




3604 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


537 


83 


3605 


U 12690 


Homo sapiens 








3606 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


158 


57 


3607 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 





— 


3608 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


122 


~65 


3609 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


88 


3610 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


224 


80 


3611 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


481 


70 


3612 


V00662 


Homo sapiens 


UPvF 2 (NADH dehydrogenase subunit) 


166 


92 


3613 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


582 


83 


3614 


Y86248 


Homo sapiens 


SEQ ID NO: 1 63 


167 


91 


3615 


U 12690 


Homo sapiens 


cytochrome oxidase subunit II 


582 


77 


3616 


U23172 


Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 
(SP:CS24 HUMAN P23246) and D 

(PIR:JH0162) °° tranS)ent A protem 


101 


40 


3617 


D38112 


Homo sapiens 


cytochrome c oxidase subunit I 






3618 


AB037275 


Cynomolgus 
Epstein-Barr 
Virus TsB-B6 


EBNA-l 


119 


42 


3619 


AF061944 


Homo sapiens 


kinase deficient protein KDP 






3620 


Y34068 


Homo sapiens 


HistoneHl isoform HI. 5 pANCA- 
reactive fragment (residues 69-226). 


100 


33 


3621 


AL049608 


Arabidopsis 
thaliana . 


extensin-like protein 






3622 


AP000616 


Oryza sativa 


similar to PJNG-H2 Finger protein 
RHAla(AF0786S3) 


127 


69 


3623 


S71569 


patriciarum, 
Peptide, 860 aa 


YvlanaQp R WT R /CT *2 O 1 C\ 
yvyiuiidbt- lj, .A. 1 |.c^ j.Z.l.oj 


108 


43 


3624 


U23172 


Gaenorhabditis 
elegans 


—. — T7— — : r-\ — r r. 7 

similar to myoDmst cell suriace antigen 
(SP:CS24_HUMAN, P23246) and D. 
(PIR:JH0162) Protein 


"761 


— 


3625 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 






3626 


D38112 


Homo sapiens 




~5"S~2~ 


-|| 


3627 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


637 


90 


3628 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


169 


39 


3629 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


142 


77 


3630 


U93564 


Homo sapiens 


putative pi 50 


330 


90 


3631 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


148 


42 



224 



JSDOC1D: <WO 0164835A2J_> 



WO 01/64835 PCT/US01/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


lilentit 














3632 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID WO: 


100 


80 








6613. 






3633 


G02360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


65 








6441. 






3634 


U74071 


Phascolosoma 


cytochrome c oxidase sub-unit 1 ' 


248 


72 






sp. 'California' 








3635 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


87 








4442. 






3636 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 


128 


65 








oncoprotein. 






3637 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


124 


63 








gene 22 clone HMSJW IS. 






3638 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 


159 


51 








derived from DNA35638. 






3639 


X65582 


Mus musculus 


alpha-2 collagen 


102 


53 


3640 


AF090942 


Homo sapiens 


PRO0657 


97 


76 


3641 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


146 


69 








clone HPMBQ32. 






3642 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 18 


55 








6619. 






3643 


X14576 


Murine leukemia 


gag p 1 5 protein 


134 


44 






virus 








3644 


AF130051 


Homo sapiens 


PRO0898 


177 


48 


3646 


G03064 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


120 


59 








7145. 






3647 


AK000385 


Homo sapiens 


unnamed protein product 


105 


40 


3648 


V00662 


Homo sapiens 


cytochrome oxidase I 


533 


83 


3649 


AF090944 


Homo sapiens 


PRO0663 


129 


73 


3650 


AF090944 




PRO0663 


129 


73 


3651 


W4835 1 


Homo sapiens 


Human breast cancer related protein 


88 


60 








BCRB2. 






3652 


AF090930 


Homo sapiens 


PRO0478 


92 


57 


3653 


G03 1 72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


73 








7253. 






3654 


ABO 18440 


Echinococcus 


NADH dehydrogenase subunit 2 


100 


32 






multilocularis 








3655 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


165 


68 








5327. 






3656 


AF161356 


Homo sapiens 


HSPC093 


106 


48 


3657 


AF090930 


Homo sapiens 


PRO0478 


158 


85 


3658 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


82 








7124. 






3659 


U93574 


Homo sapiens 


putative pi 50 


117 


44 


3660 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


149 


51 








gene 22 clone HMSJW18. 






3661 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


67 


70 








gene 22 clone HMSJW1 8. 






3662 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


99 


62 








SEQ ID NO: 163. 






„ 7 f~ 




Neurospora 


related to Ul SMALL NUCLEAR 


~T54 


--4-5- 






crassa 


RIBONUCLEOPROTEIN C 






3664 


Y27571 


Homo sapiens 


Human secreted protein encoded by 


136 


77 








gene No. 5. 






3665 


W48351 


Homo sapiens 


Human breast cancer related protein 


128 


59 






BCRB2. 






3666 


U47855 


Araneus 


fibroin-3 


109 


39 






diadematus 








3667 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


604 


88 


3668 


K02401 


Homo sapiens 


chorionic somatomammotropin 


636 


93 



225 



WO 01/64835 



PCT/U SO 1/04927 



SEQID 

NO: 


Accession No. 




Description 


Smith- 










Waterman 


Idcntit 


3669 


AF083929 


Mus musculus 


ES18 


133 


46 


3670 


J01415 




lvi I IN L/h 


620 


87 


3671 


AK025047 


Homo sapiens 


unnamed protein product 


160 


60 


3672 


G03793 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
7874. 


106 


31 


3673 


AB01 9038 


Homo sapiens 


beta- 1 ,4 mannosyltransferase 


766 


94 


3674 


U74612 


Homo sapiens 


hepatocyte nuclear factor-3/forkhead 
homolog 1 1 A 


1 13 


69 


3675 


U80447 


Caenorhabditis 
elegans 


coded for by C. elegans cDNA 
ykl S7f6.5; coded for by C. elegans 
cDNA ykl87f6.3; coded for by C. 
elegans cDNA ykl46fl 1 .5 


113 


39 


3676 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


197 


68 


3677 




Homo sapiens 


mariner transposase 


500 


74 


3678 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


174 


97 


3679 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


191 


86 


3680 


AC003113 


Arabidopsis 
thaliana 


F2401.6 


102 


83 - 


3681 


AF005370 


Alcelaphine 
herpesvirus 1 


putative immediate early protein 


153 


42 


~36lP 


QQ^ 502 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5583. 


111 


62 






Arabidopsis 
thaliana 


gb|AAD23015.1~gene_id:F15M7.16~si 
milar to unknown protein 


131 


AO 


3684 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871 -. 


146 


64 


3685 


U93572 


-— : 

Homo sapiens 


putative pi 50 


158 


43 


3686 




Homo sapiens 


Histone Hi isoform HI. 5 pANCA- 
reactive fragment (residues 69-226). 


98 


43 


3687 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


106 


54 


mi — 




Homo sapiens 


chromosomal protein 


128 


76 




D38H2 


Homo sapiens 


cytochrome c oxidase subunit 1 


563 


87 


3690 


R96800 j 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


109 


67 


3691 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


122 


75 


3692 


„ 0 


—— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
^?^ J " ■ — - — 


165 


85 


3693 


M20789 


Homosa iens 


alpha- 1 type I collagen 


141 


43 


3694 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


162 


50 


3695 


AF2 10651 






124 


67 


"3696 


AF1 30079 


Homo sapiens 


PR02852 


94 


53 


3697 


Y 1 9 1 92 


Talpa europaea 


cytochrome oxidase subunit I 


618 


85 


369S 


D38132 


Homo sapiens 


NADH dehydrogenase subunit 2 


414 


86 


3699 




Homo sapiens 


cytochrome c oxidase subunit 1 


487 


76 


3700 


502455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


109 


42 


3701 


AF2 17374 


Acanthaster 


cytochrome oxidase subunit I 


563 


85 


3702 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA ' 
binding protein K10 (NID:g8148) 




48 


3703 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


522 


79 


3704 


D67066 


Bos taurus 


N-WASP 


219 


42 


3705 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


100 


51 


3706 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


44 ! 



226 



SDOCID: <WO 0164835A2_I_» 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

ldentit 

y 








7879. 






3707 


Z70684 


Caenorhabditis 


F^8D1 8 


105 




3708 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


74 


3709 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


558 


88 


3710 


AF2 17374 


Acanthaster. 


cytochrome oxidase subunit 1 


512 


85 


37J1 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 2 


186 


80 


3712 


At ly /oD4 


Mel ithreptus 
lunatus 


cytochrome oxidase I 


189 


83 


3713 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 

oua 1 a ( APfl7XfiRT^ 
£vo/\ 1 a ^Aru / oOO J ) 


129 


95 


3714 


AB029027 


Homo sapiens 


KIA A 11 04 protein 


141 


81 


3715 


L26251 


Trypanosoma 
brucei 








3716 


R959 1 3 


Homo sapiens 


Neural thread protein. 


117 


~ 


3717 


G003 97 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 






3718 


Y361 12 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 497. 






3719 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhem) gene preoduct. 


132 


— 




M64923 


Bos taurus 


C10 protein 


~T94 




3721 


Y02671 


Homo sapiens 


Human secreted protein encoded by 

nana OT ^Inno UN/fQTWl ft 

gene zz. clone ruviij wis. 


115 

— — — 


75 
-— 


3722 


G 03 043 


Homo sapiens 
— ■ 


Human secreted protein, SEQ ID NO: 
7 . 124 ' : — ? 


18 




3723 


AF302773 


Homo sapiens 


ninein-L.m isoiorm 


—rj= 


~57 




A I'M/I /ICC 


Homo sapiens 


FLJ00047 protein 


134 


59 


3725 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


161 


80 


3726 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


121 


52 


3727 


AF220264 


Homo sapiens 








3728 


L27428 


Homo sapiens 


reverse transcriptase 


316 


62 


3729 


U83303 


Homo sapiens 


line-1 reverse transcriptase 






3730 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


"143 




3731 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2 


118 


56 


3732 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 






3733 


AF090930 


Homo sapiens 


dp nn^vc 






3734 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18 


140 


72 


3735 


U93569 


Homo sapiens 


putative pi 50 


270 


59 


3736 


U93568 


Homo sapiens 


putative pi 50 






3737 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 






3738 


X92485 


Plasmodium 
vivax 


pval 


101 

— — 


44 

— 


3739 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 






3740 


AF090944 


Homo sapiens 


PRO0663 


124 


56 


3741 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


94 


47 


3742 


AF1 18086 


Homo sapiens 


PRO 1992 


124 


61 


3743 


AF083929 


Mus musculus 


ES18 


108 


35 


3744 


L34685 


Arabidopsis 


cell wall protein 


120 


34 . 



227 



MSDOCID: <WO 0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


■ -Description ' »■ •* 


Smilh- 

Waterman 

Score 


% 

Identit 

y 






thaliana 








3745 


AF090931 


Homo sapiens 


PRO0483 


117 


54 


3746 


AF090930 




r ISXJSJH / O 


136 


75 


3747 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


59 


3748 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


92 


44 


3749 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
j\xi/\ i a ^Aru/oooJ ) 


9S 


42 


3750 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMS J W 1 8 . 


165 


73 


375.1 


AP000616 


Oryza sativa 


similar to RTNG-H2 finger protein 
RHA 1 a (AF078683) 


1 13 


87 


3752 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


96 


58 


3753 


U93574 


Homo sapiens 




157 


60 


3754 


X56015 


Crithidia 
oncopelti 


NADH dehydrogenase subunit 5 


103 


30 


3755 


M80613 


Homo sapiens 


putative 


114 


26 


3756 


G00333 




Human secreted protein, SEQ ID NO: 
4414. 


119 


54 


3757 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 2 


210 


89 


3758 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


58 


3759 


AF025467 


Caenorhabditis 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


110 


46 


3760 


M13100 


Rattus 
norvegicus 


unknown protein 


119 


46 


3761 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NlD:g8148) 


135 


52 


3762 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
ivriy\ i a (Aru/oOoj ) 


132 


84 


3763 


Ml 1900 Musmusculus 


15-kDa proline-rich salivary protein 


125 


40 


3764 


U63542 


Homo sapiens 


FAP protein 


131 


72 


3765 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


76 


3766 


X92485 


Plasmodium 
vivax 


pval 


119 


54 


3767 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-^ 


81 


56 


3768 


G0341I 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


65 


3769 


AB005216 


Homo sapiens 


Nek, Ash and phospholipase C gamma- 
binding protein NAP4 


487 


90 


3770 


AF026211 


Caenorhabditis 
elegans 


Similar to cuticular collagen 


102 


37 


3771 


W48351 


Homo sapiens 


— 

Human breast cancer related protein 
BCRB2. 


91 


55 


3772 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO - 
6566. 


128 


76 


3773 


AK024455 


Homo sapiens 


rjujuuu^f/ protein 


138 


66 


3774 


AF090895 


Homo sapiens 


PRO0117 


152 


60 


3775 


AF 130051 


Homo sapiens 


PRO0898 


122 




3776 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5327. 


100 


63 


3777 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


187 


75 


3778 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO- 
4523. 


130 


56 



228 



SDOCID: <WO 0164835A2_I_» 



WO 01/64835 PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 


% 
y 


3779 


X55685 


Lycopersicon 
esculentum 


extertsin (class I) 


132 


38 


3780 


L27428 


Homo sapiens 


reverse transcriptase 


116 


53 


3781 


Y71066 


Homo sapiens 


Human membrane transport protein, 
MTRP-11. 


188 


86 


3782 


AL3901 14 


Leishmania 


extremely cysteine/valine rich protein 


122 


67 


3783 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


462 


75 


3784 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


127 


67 


3785 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


129 


36 


3786 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


152 


78 


3787 


S80119 


Rattus sp. 


reverse transcriptase homolog 


143 


40 


37S8 


AF090942 


Homo sapiens 


PRO0657 


134 


67 


3789 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


135 


63 


3790 


AF1 16661 


Homo sapiens 


PRO 143 8 


128 


45 


3791 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


130 


38 


3792 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


127 


74 


3793 


W501 92 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1. 


101 


48 


3794 


AF000298 


Caenorhabditis 
eiegans 


weak similarity to collagens; glycine- 
and proline-rich 


175 


46 


3795 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


117 


40 




Y86248 




Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


185 


88 








cytochrome c oxidase subunit 3 


610 


84 


~3~7§I — 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


156 


56 




AL3 59782 


brucei 


probable similar to ring-h2 finger 
protein rhala. 


71 


52 


3800 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


127 


77 




AF130079 




PR02852 


164 


54 


3802 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 
protein rhal a. 


118 


45 


3803 


U42580 


Paramecium 
Chiorella virus 1 


Pro-rich, PAPK (20X); similar to 
Arabidopsis anter-specific Pro-rich 
protein, corresponds to Swiss-Prot 
Accession Number P40602 


101 


36 


3804 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


173 


87 


3805 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


126 


72 


3806 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 4 


225 


85 


3807 


U88587 


Nicotiana alata 


120 kDa style glycoprotein 


118 


38 









cytochrome c oxidase subunit 1 


129 


82 




G03043 


Homo sapiens 

omo 


Human secreted protein, SEQ ID NO: 
7124. 


122 


67 


3810 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


213 


97 


3811 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


106 


63 


3812 


AF169974 


Homo sapiens 


serine racemase 


153 


56 


3813 


L27428 


Homo sapiens 


reverse transcriptase 


175 


43 


3814 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


109 


43 



229 



INSDOCID: <WO 0164835A2J_> 



WO 01/64835 



PCT/US01/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 

sr an 


- % 
Idontit 

y 


3815 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO' 
4484. 






3816 


G00352 


Homo sapiens 


Human secreted protein SEQ ID NO" 
4433. 






3817 


AF1 19901 


Homo sapiens 


PR0283 I 






3818 


W4835I 


Homo sapiens 


BCRB2. 


~~9T~ 





3819 


AL1 60371 


Leishmanial 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 




66 


3820 


X92485 


Plasmodium 
vivax 






46 


3821 


AK024455 




rLJuuu4/ protein 




59 


3822 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene Mo. 101. 


178 


S7 


3823 


YS6472 


Homo sapiens 


Human gene 52-encoded protein 
iragmeni, iu inlj..5o/. 


99 


50 . 


3824 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
irom piasmiu puLo i 1 Uj /. 


86 


94 


3825 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4 7*.*f' — - — ; 


138 


53 


3826 


AP000616 


Oryza sativa 


similar to R1NG-H2 ringer protein 
RHAla (AF0786S3) 


129 


88 


3827 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


77 


61 


3828 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


139 


51 


3829 


K03208 


Homo sapiens 


salivary proline-rich protein precursor 


161 


41 


3830 


U44838 




extensin 


108 


47 


3831 


G03438 


Hom'osa'piens 


Human secreted protein, SEQ ID NOi 
7519. 




71 


3832 


Y8624S 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


205 


84 


3833 


AL163912 


thaliana 


glycine-rich protein atGRP-7 


1 17 


37 


3834 


AF 130089 


Homo sapiens 


PRO2550 


... 
138 


45 


3835 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


92 


76 


3836 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


70 


3837 


X56010 


Sorghum bicolor 


hydroxyproline-rich glycoprotein 


98 


37 


3838 


X68249 


Xenopus laevis 




"Ho 




3839 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 




92 


3840 


R63235 


Homo sapiens 


CNS neural thread protein HB4 


~| 




3841 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


103 


64 


3842 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


116 


62 


3843 


AF164614 


Homo sapiens 


envelope protein 


508 


77 


3844 


AF 130051 


Homo sapiens 


PRO0898 


IjO 


73 


3845 


G03438 


riomo sapiens 


Human secreted protein, SEQ ID NO' 
7519. 


144 


74 


3846 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


1 19 


68 


3847 


M13100 


Rattus 
norvegicus 


unknown protein 


129 


46 


3848 


X97675 


Homo sapiens | 


plakophilin 2b 


145 


67 


3849 


Y14482 


Homo sapiens 1 


Fragment of human secreted protein 
encoded by gene 17. 


141 


65 


3850 


K03206 


Homo sapiens [ 


salivary proline-rich protein precursor 


114 


42 



230 



ISDOCID: <WO 0164835A2J_> 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


3851 


U93574 


Homo sapiens 


putative pi 50 


165 


82 


3852 


AFO90931 


Homo sapiens 


PRO0483 


97 


78 


3853 


U49973 




ORF1; MER37; putative transposase 
similar to pogo element 


279 


80 


3854 


U93570 


Homo sapiens 


putative p 1 50 


135 


38 


3855 


M13100 


Rattus 
norvegicus 


unknown protein 


100 


57 


3856 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID MO: 
7253. 


166 


63 


3857 


Y48576 


Homo sapiens 


. Human breast tumour-associated 
protein 37. 


117 


49 


3858 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


147 


45 


3859 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


150 


71 


"3860 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


165 


69 


3861. 




Homo sapiens 


Human secreted protein SEQ ID NO* 


Ij\ 


8j 


3862 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1S. 


171 


81 


3S63 


AK00024 1 


Homo sapiens 


unnamed protein product 


100 


48 


3864 


Y36112 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 497. 


121 


80 


3865 


AK024455 


Homo sapiens 


FLJ00047 protein 


124 


69 








Human secreted protein, SEQ ID NO: 
6913. 


87 


64 


3867 


AF1 16712 




PR02738 


92 


48 


3868 


R13556 


Homo sapiens — _ 


oncoprotein. 


165 


62 


3869 


G004 1 6 




Human secreted protein, SEQ ID NO: 
4497. 


128 


62 


3870 


Y27868 


Homo sapiens 


gene No. 107. 


94 


75 


3871 


AK024455 


Homo sapiens 


FLJ00047 protein 


83 


72 


3872 


AJ242540 




HRGP 


147 


44 


3873 


GO3790 


Homo^'ap'iens 


Human secreted protein, SEQ ID NO: 
7871. 


113 


51 


3874 


AF090930 




PRO0478 


120 


63 


3875 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


145 


70 


3876 


D88674 


Homo sapiens 


antizyme inhibitor 


147 


75 


3377 


A F2 63 744 




erbb2-interacting protein ERBIN 


212 


37 


3878 


Y361 12 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 497. 


140 


82 


3879 


G00354 




Human secreted protein, SEQ ID NO: 
4435. 


82 


35 








12. 


144 


60 


3881 


AFO 10144 




neuronal thread protein A_D7c-NTP 


122 


60 


3882 


G00577 


Homo ^lens 

omo sapiens 


Human secreted protein, SEQ ID NO' 
4658. 


134 


75 


3883 


M81321 


Macaca 
fascicularis 


proline-rich protein 


101 


41 


3884 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


109 


56 


3885 


AF013214 


Bos taurus 


acidic ribosomal phosphoprotein PO 


177 


94 


3886 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


77 
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MSDCCID: <WO 0164835A2J_> 
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NO 3 ' 0 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 








4484. 






3887 


Y59860 


Homo sapiens 


Human normal uterus tissue derived 
protein 23 


137 


72 


3888 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17 


147 


72 


3889 


M64792 


Rattus , 
norvegicus 


salivary proline-rich protein 


104 


40 


3890 


AF 156228 


Drosophila 
melanogaster 


salivary gland secretion protein 


~T04 


- — 


3891 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ8 1 . 


90 




3892 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


~FS 





3893 


D38112 


Homo sapiens 


NADH dehydrogen3se subunit 5 






3894 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


34 


3895 


X02873 


Daucus carota 


put. precursor 


104 


48 


3896 


AF090942 


Homo sapiens 


PRO0657 




50 


3897 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


195 


77 


3898 


X92485 


Plasmodium 


pval 


111 


62 


3899 


G02490 


Homo a ien 

omo sapiens 


~LJ J : ^ 

Human secreted protein, SEQ ID NO: 
6571. 


107 


70 


3900 


AL 160493 


Leishmania 


probable (hhv-6) u 1 1 02, variant a 


94 


93 


3901 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


163 


83 


3902 


AF090942 


Homo sapiens 


PRO0657 


134 


_ 


3903 


AF220264 


Homo sapiens 


MOST-1 


74 


59 


3904 


G04072 


Homo sapiens 


Human secreted protein SEQ ID NO' 
8153. 




53 


3905 


G03043 


Homo sapiens 


Human secreted protein SEQ ID NO* 
7124. 






3906 


AK025047 


Homo sapiens 


unnamed protein product 






3907 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


135 


86 


3908 


G00637 


Homo sapiens 


Human secreted protein SEQ ID NO* 
4718. 


— =; 


72 


3909 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


104 


62 


3910 


G02654 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6735. 


114 


70 


3911 


AK024455 


Homo sapiens 


FLJ00047 protein 


135 


61 


3912 


U93567 


Homo sapiens 


putative pi 50 




63 


3913 


W54966 


Homo sapiens 


Synthetic human type III collagen 
SYN-C3. 


96 


40 


3914 


X94976 


Brassica napus 


cell wall-plasma membrane linker 
protein 


104 


36 


3915 


P92219 


Homo sapiens 
(human) 


CR1 protein. 


125 


..... 
73 


3916 


X92485 


Plasmodium 
vivax 


pval 


94 


64 


3917 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


105 


52 


3918 


AF 118086 


Homo sapiens 


PRO! 992 


130 


87 


3919 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


125 


80 


3920 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


79 
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SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 
Water man 


% 

Identit 








4493. 






3921 


AF090930 


Homo sapiens 


PRO0478 


128 


64 


3922 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


114 


75 


3923 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


130 


65 








Human secreted protein sequence SEQ 
ID NO: 103. 


134 


71 


3925 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


102 


58 


3926 


D63487 


Homo sapiens 


The KIAA01 53 gene product is related 
to a putative C.elegans gene encoded in 
cosmid F42A8. 


141 


100 


3927 


U21 123 


Drosophila 
melanogaster 


ena polypeptide 


117 


37 


3928 


X58438 


Mus musculus 


proline rich protein 


136 


36 


3929 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


152 


71 


3930 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


132 


73 


3931 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


86 


48 


3932 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


1 10 


65 


3933 


X92485 


vivax 


pval 


116 


36 


3934 


X97675 


Homo sapiens 


plakophilin 2b 


129 


82 


3935 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


84 


58 


3937 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


107 


56 


3938 


AF090895 


Homo sapiens 


PRO0U7 


166 


68 


3939 


K02550 


Oncorhynchus 
mykiss 


70-kilodalton heat shock protein 


104 


38 


3940 


AK024455 


Homo sapiens 


FLJ00047 protein 


112 


74 


3941 


L26953 


Homo sapiens 


chromosomal protein 


92 


67 


3942 


U63542 


Homo sapiens 


FAP protein 


135 


75 


3943 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


139 


80 


3944 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW 18. 


141 


67 


3945 


AK024455 


Homo sapiens 


FLJ00047 protein 


123 


76 


3946 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


92 


52 


3947 


W48927 


Homo sapiens 


Schwannomin-binding protein C- 
terminal fragment. 


103 


56 


3948 


AF252293 


Homo sapiens 


PAR3 


161 


44 


3949 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


125 


47 


3950 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


128 


59 


3951 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


174 


87 


3952 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


152 


75 


3953 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


168 


64 


3954 


R95913 


Homo sapiens 


Neural thread protein. 


110 


54 


3955 


AF1 30089 


Homo sapiens 


PRO2550 


122 


63 


3956 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


119 


48 
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SEQ ID 

NO: 


Accession No. 




Description 


Smith- 
Waterman 


Identit 


3957 


Y36203 


Homo sapiens 


Human secreted protein #75. 


121 


75 


3958 


Y86248 


Homo sapiens 


Human secreted' protein HCHPF68, 
SEQ ID NO: 163. 


177 


61 


3959 


G00328 


Homo sapiens 


Human secreted protein. SEQ ID NO: 

4409. : 


131 


69 


3960 


U93563 


Homo sapiens 


putative pi 50 


227 


62 


3961 


D86853 


Catharanthus 
roseus 


extensin 


100 


40 


3962 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


101 


41 


3963 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


107 


80 


3964 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


121 


42 


3965 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


66 


3966 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


101 


34 


3967 


AF229126 


Homo sapiens 


acetylcholinesterase collagen-like tail 
subunit isofbrm VII 


108 


36 


3968 


AJ006770 


Cicer arietinum 


extensin 


86 


29 


3969 


API 30089 


Homo sapiens 


PRO2550 


128 


82 


3970 


U93563 


Homo sapiens 


putative pi 50 


99 


48 


3971 


AK000496 


Homo sapiens 


unnamed protein product 


134 


71 


3972 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


108 




3973 


AF0 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


137 




3974 


AK024455 


Homo sapiens 


FLJ00047 protein 


132 


~6l 


3975 


AK024455 


Homo sapiens 


FLJ00047 protein 


128 


85 


3976 


X01918 


Drosophila 
melanogaster 


salivary gland glue protein 




40 


3977 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


136 


46 


3978 


AF229067 


Homo sapiens 


PAD1-H protein. 


107 


87 


3979 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


189 


48 


3980 


S80119 


Rattus sp. 


reverse transcriptase homolog 


134 


49 


3981 


AF090903 


Homo sapiens 


PRO0195 


1 06 




3982 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


111 


-~= 


3983 


X92485 


Plasmodium 
vivax 


pval 


102 


39 


3984 


G02640 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6721. 


127 


64 


3985 


AB012162 


Homo sapiens 


APCL protein 1 


190 


42 


39S6 


AF0 16099 


Mus musculus 


endonuclease/reverse transcriptase 


139 


65 


3987 


U93570 


Homo sapiens 


putative pi 50 


170 


71 


3988 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


202 


73 


3989 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


244 


85 


3990 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


420 


92 


3991 


D3S112 


Homo sapiens 


NADH dehydrogenase subunit 4 


272 




3992 


L38908 


Nicotiana 
tabacum 


extensin 


114 


40 
64 


3993 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


116 


3994 


AF090944 


Homo sapiens 


PRO0663 


99 


39 


3995 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO 
7888. 


150 


75 


3996 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


574 


87 


3997 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


488 


76 
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Accession No. 


Species 


Description 






NO: 








Waterman 


Identit 










Score 


y 


3998 


U52077 


Homo sapiens 


mariner transposase 


475 


70 


3999 


U93570 


Homo sapiens 


p40 


111 


34 


4000 


U 15647 


Mus musculus 


reverse transcriptase 


137 


43 


4001 


AF1 16712 


Homo sapiens 


PR02738 


105 


52 


4002 


AF1136S5 


Homo sapiens 


PRO0974 


125 


53 


4003 


W48351 


Homo sapiens 


Human breast cancer related protein 


92 


78 








BCRB2. 






4004 


AF1 16715 


Homo sapiens 


PR02829 


1 15 


73 


4005 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


288 


80 


4006 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


614 


86 


4007 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


106 


45 








7868. 






4008 


U90268 


Homo sapiens 


Kritl 


152 


52 


4009 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


472 


96 


4010 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 


304 


100 








1 (NDF-1). 






4011 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 


472 


98 








1 (NDF-1). 






4012 


W50193 


Homo sapiens 


Amino acid sequence of salivary 


102 


50 








protein CON-2. 






4013 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


73 








4343. 






4014 


G0380I 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


47 








7882. 






4015 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


522 


81 


4016 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


147 


52 








6561. 






4017 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-N 1 F 


106 


57 


4018 


AB003476 


Homo sapiens 


gravin 


549 


91 


4019 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


391 


93 


4020 


U44949 


Xenopus laevis 


zona pellucida A glycoprotein homolog 


143 


32 


4021 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


571 


88 


4022 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


554 


87 


4023 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


165 


88 








8159. 






4024 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


131 


38 






major 








4025 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


583 


86 


4026 


AF090895 


Homo sapiens 


PRO0117 


104 


49 


4027 


AF130056 


Homo sapiens 


PRO 1367 


80 


60 


4028 


X97675 


Homo sapiens 


plakophilin 2b 


157 


83 


4029 


X67337 


Homo sapiens 


Human pre-mRNA cleavage factor I 68 


114 


35 








kDa subunit 






4030 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


no 


74 








4497. 






4031 


D38112 


Homo sapiens 


ATPase subunit 6 


395 


80 


4032 


S70718 


Hemicentrotus 


fibrillar collagen alpha 120 and 140 


104 


33 






pulcherrimus=se 


chains 










a urchins, tests. 












Peptide, 632 aa 








4033 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


355 


66 


4034 


AC004497 


Homo sapiens 


MX2 


464 


58 


4035 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


61 








4414. 






4036 


AF1 17888 


Homo sapiens 


myosin-D(a 


113 


34 


4038 


YS6248 


Homo sapiens 


Human secreted protein HCHPF68, 


100 


48 








SEQ ID NO: 163. 






4039 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


153 


73 








SEQ ID NO: 163. 
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SEQ ID 

NO: 


Accession No. 




Description 


Waterman 
Score 


Identit 

y 




D381 12 


Homo sapiens 


NADH dehydrogenase subunit 4 


338 


86 


4041 


AF1 16661 


Homo sapiens 


PRO 143 8 


136 


53 


4042 



4043 


G03 1 72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


120 


84 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


493 


85 


4044 


M905 1 6 


Homo sapiens 


glutamine:fructose-6-phosphate 
amidotransferase 


261 


72 


4045 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


102 


.64 


4046 


AF1 30089 


Homo sapiens 


PRO2550 


143 


78 


4047 


AF090942 


Homo sapiens 


PRO0657 


119 


41 


4048 


R95913 


Homo sapiens 


Neural thread protein. 


138 


47 


4049 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


111 


59 


4050 


X77962 


Tetrahymena 
thermophila 


fibrillarin 


105 


49 


4051 


G00338 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
4419. 


144 


71 


4052 


X02794 


Friend murine 
leukemia virus 


Pr65 


107 


37 


4053 


Y00994 


Homo sapiens 


Human CSR3 protein sequence. 


109 


38 


4054 


AL121585 


Homo sapiens 


DA504H3.1 (SNX5 (sorting nexin 5)) 


299 


74 


4055 


V00662 


Homo sapiens | URF 1 (NADH dehydrogenase subunit) 


314 


80 


4056 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


65 


54 


4057 


Y8624S 


Homo sapiens 
— — _ 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


187 


67 


4058 


AF205385 


Pan troglodytes 


NADH dehydrogenase subunit 5 


202 


89 


4059 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68 
SEQIDNO:163. 


187 


100 


4060 


U63542 


Homo sapiens 


FAP protein 


142 


73 


4061 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7S6S. 


158 


50 


4062 


Y95028 


Homo sapiens 


Human clone vp7 1 ORF2, SEQ ID 
NO:128. 


123 


44 


4063 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


76 


65 


4064 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


165 


46 


4065 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


147 


93 


4066 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


109 


76 


4067 


AB035523 


Galius gallus 


avenalll 


101 


48 


4068 


AC079829 


Arabidopsis 
thaliana 


Pto kinase interactor, putative 


105 


43 


4069 


AF2 1 065 1 


Homo sapiens 


NAG 18 


97 


77 


4070 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


111 


43 


4071 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


15S 


65 


4072 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


131 


43 


4073 


AF038007 


Homo sapiens 


FIC1 


153 


96 






Trypanosoma 
brucei 


ATPase 6 r 


103 


37 


4075 


U54788 


Mus musculus 


Wiskott-Aldrich Syndrome Protein 


119 


46 


4076 


M81321 


Macaca 
fascicularis 


proline-rich protein 


104 


36 


4077 


X52235 


Homo sapiens 


ORFII 


192 


39 


4078 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 [ 83 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

ldentit 

y 








4409. 










Homo sapiens 


WASP-family protein 






4080 


L17318 


Rattus 
norvegicus 


proline-rich proteoglycan 


105 


42 


4081 


AF130051 


Homo sapiens 


PRO0898 


117 


42 






Homo sapiens 








4083 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


48 


4084 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON- 1 . 


89 


40 


4085 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


163 


80 


0 6 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


144 


74 




AF085691 


Homo sapiens 


multidrug resistance-associated protein 
3A 






4088 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 






4089 


AF0 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


139 


51 


4090 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


103 


65 


4091 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


68 


4092 


X92485 


Plasmodium 


pval 


95 


38 


4093 


U93569 


Homo sapiens 


p40 


205 


40 


4094 


L 16864 


African swine 
fever virus 


cd2 homologue 


98 


45 


~4095 


x? l 413 


Homo sapiens 


~ELE1 


~~675 


— 


4096 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


60 


"4097 


Y144g2 


Homo sapiens 


Fragment of human secreted protein 

encoded by gene 17. 


Ti6 


— 






Homo sapiens 








4099 


AF090942 


Homo sapiens 


PRO0657 


124 


56 




G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 






4101 . 


k 26 ? 53 .. . 


Homo sapiens 


chromosomal protein 


—L 0 - 4 . j 


7^ 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


128 




4 103 


G 03 790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


58 


4104 


U73199 


Mus musculus 


Rho-guanine nucleotide exchange 
factor 


370 

■ 


56 




AF09093 1 


Homo sapiens 


PRO0483 






4106 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


72 


4107 


AF 11 8082 


Homo sapiens 


PRO 1902 


145 


49 


4108 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


1 16 


50 


4109 


U93564 


Homo sapiens 


p40 


539 


91 


4110 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


54 


4111 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


138 


65 


4112 


AF217536 


Homo sapiens 


truncated mevalonate kinase 


91 


73 


4113 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


120 


36 


4114 


Ml 7522 


Paracoccus 


cytochrome cl precursor (EC 1.10.2.2) 


101 


41 
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SEQ ID 
NO: 


Accession No. 




Description *; f _ 


Smith- 
Waterman 


% 

Identit 






den itrifi cans 








41 15 


G03714 




-— . 

Human secreted protein, SEQ ID NO: 
7795. 


121 


59 


4116 


G025j2 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


73 


4117 


GO 1931 


- — ■ — : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


80 


85 


41 18 


X04758 


Homo sapiens 


pro- alpha (V)collagen (A A 1099) 


106 


39 


4119 


G0O354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


47 


4120 


AF130089 


Homo sapiens 


PRO2550 


132 


68 


4121 




Homo sapiens 


PRO0657 


136 


48 


4122 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


115 


66 


4123 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


72 


48 


4124 


X92485 


Plasmodium 

vivax 




104 


54 


4125 


G03714 




Human secreted protein, SEQ ID NO: 
7795. 


131 


59 


4126 


AK024455 


Homo sapiens 


FLJ00047 protein 


86 


61 


4127 


AF1 19855 


Homo sapiens 


PRO 1847 


99 


68 




S801 19 


Rattus sp. 


reverse transcriptase homolog 


129 


39 


4129 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


156 


62 


4130 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


135 


72 


4131 


R86406 


Homo sapiens 


Human matrix metalloprotease 
MMPmla. 


108 


83 


4132 


L16461 


Chlam ydomonas 
reinhardtii 


structural wall protein 


87 


37 


~4133 


Y3Q713 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


232 


95 


4134 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101 . 


121 


69 


4135 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


135 


70 


4136 




Homo sapiens 


salivary proline-rich protein 1 


134 


42 


4137 


Y15173 


Human 

papillomavirus 


E4 protein 


101 


38 


4138 


AF130089 


Homo sapiens 


PRO2550 


130 


35 


4139 




Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


111 


38 


4140 


Y36203 


Homo sapiens 


Human secreted protein #75. 


120 


48 


4141 


AF130089 


Homo sapiens 


PRO2550 


139 


36 




S801 19 


Rattus sp. 


reverse transcriptase homolog 


170 


47 


4143 


U54636 


Staphylococcus 


protein A ^ 


135 


35 


4144 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


157 


55 


"4145 


UUUJJ'f 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


112 


54 


4146 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


117 


37 


4147 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhem) genepreoduct. 


98 


61 


4148 


AF130051 


Homo sapiens 


PRO0898 


124 


88 


4149 


G00673 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


67 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Waterman 


Identit 








4754. 






4150 


Z29481 


Homo sapiens 


3-hydroxyanthranilic acid dioxygenase 


244 


62 


4151 


M81321 


Macaca 
fascicularis 


proline-rich protein 


172 


50 


4152 


G00416 . 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


129 


66 


4153 


AF1 19851 


Homo sapiens 


PRO 1722 


128 


62 


4154 


AB037826 


Homo sapiens 


KIAA1405 protein 


194 


60 


4155 


M12099 


Mus musculus 


proline-rich protein 


124 


37 


4156 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


122 


46 


4157 


M11901 


Rattus 
norvegicus 


proline-rich salivary protein 


124 


41 


4158 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


134 


83 


4159 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


141 


74 


4160 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


135 


39 


4161 


U35730 


Mus musculus 


jerky 


116 


31 


4162 


Z81525 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF01391 (Collagen triple helix repeat 
(20 copies)), Score=35.5, E-value=4e- 
07, N=2 


130 


50 


4163 


AF1 13685 


Homo sapiens 


PRO0974 


129 


45 


4164 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


168 


71 


4165 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


137 


70 


4166 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


175 


72 


4167 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


86 


45 


4168 


AB000505 


Daucus car ota 


unnamed protein product 


101 


43 


4169 


U87607 


Rattus 
norvegicus 


putative RNA binding protein 1 


113 


30 


4170 


AF1 19900 


Homo sapiens 


PR02822 


154 


59 


4171 


AE001381 


Plasmodium 
falciparum 


hypothetical protein 


106 


33 


4172 


AE000034 


Mycoplasma 
pneumoniae 


Afunctional threonine dehydrogenase; 
galactosyltransferase 


100 


28 


4173 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


100 


47 


4174 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


142 


68 


4175 


AK024455 


Homo sapiens 


FLJ00047 protein 


116 


60 


4176 


Ml 1759 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


94 


44 


4177 


AK024455 


Homo sapiens 


FLJ00047 protein 


102 


59 


4178 


AY007557 


Mycobacterium 
avium subsp. 
pararuberculosis 


fibronectin-attachment protein FAP-P 


98 


42 


4179 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


115 


69 


4 ISO 


M12100 


Mus musculus 


proline-rich protein MP-3 


132 


44 


4181 


AF090930 


Homo sapiens 


PRO0478 


138 


63 


4182 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


152 


75 


4183 


YS6248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


84 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Idcntit 


4184 


K03205 


Homo sapiens 


salivary prolinc-rich protein precursor 


150 


43 


4185 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


128 


5] 


4186 


AF090944 


Homo sapiens 


PRO0663 


124 


49 


4187 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


93 


62 


4188 


D38112 


Homo sapiens 


ATPase subunit 6 


439 


78 


4189 


AF130051 


Homo sapiens 


PRO0898 


89 


57 


4190 


AP090930 


Homo sapiens 


PRO0478 


137 


83 


4191 


K03205 


Homo sapiens 




100 


35 


4192 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


212 


91 


4193 


Y13247 




FB 1 9 protein 






4194 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


268 


93 






Mus musculus 


acinusS' 






4196 


AJ277425 


Globodera 
pallida 


putative cuticular collagen 


156 


43 





Y86248 


Homo sapiens 


Human secreted protein HCHPF68 

SEQ ID NO: 163. 


—— 


— 


4198 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


97 


45 


4] 99 


M64793 


norvegicus 


sa) . ya roline rich rotein 

sa ivary pro ine-nc pro ein 


1 19 


— 


4200 


X62379 


Mus musculus 


form in, isoform IV 


115 


48 


4201 


D381 13 






224 




4202 


M14820 


Trypanosoma 
brucei 


ORF2 bases 1807-2850; first start 
codon at 2032; putative 


103 


29 


4203 


U93572 


Homo sapiens 


P 40 


182 


34 


~4"^3 




Silene latifolia 


glycine-rich protein 










Caenorh abd i tis 
elegans 


contains similarity to an RNA 
recognition motif 


~70~7 


~~4"0~ 


4206 


L07545 


Leishmania 


A 'c' was inserted after nt 369 (=nt 
10459 in genomic secjuence (M 10 126)) 
to correct -1 frameshift probably due to 


101 




4207 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


104 


38 




AF229126 




subunit isoform VII 






^2^o 


AF090895 


Plasmodmin" 5 


PRO0 117 





■41 — ■ 




X92485 


vivax 


pval 






4211 


G02828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6909. 


173 


73 


4212 


G02828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6909. 


151 


69 


.4213 


AF1 15549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


~122 


L ~4T 


4214 


AF090944 


Homo sapiens 


PRO0663 


~97 


~3l) 


4215 


XI 2544 


Homo sapiens 


3 HLA-DR B protein precursor (AA - 
29 to 267) 


111 


60 


4216 


M69065 


numan 

herpesvirus 2 


ORF1 


89 


— 


4217 


AF090895 


Homo sa iens 


PRO01 17 


"T33 


— 


4218 


B01372 


Homo sapiens 


Neuron-associated protein. 


135 


46 


4219 


Y86248 




Human secreted protein HCHPF68 
SEQIDNO:163. ' 






4220 


U93567 


Homo sapiens 


p40 


170 


35 


4221 


D00570 


Mus musculus 


open reading frame (251 AA) 


108 


28 


4222 


X97675 


Homo sapiens 


plakophilin 2b 


122 


47 


4223 


AF270937 


Plutella 
xylostella 


PxORF73 peptide 


99 


.54 
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NO: 






Description 


Waterman 


% 

Identit 






granulovirus 








4224 


AF130089 


Homo sapiens 


PRO2550 


144 


69 


4225 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


125 


82 


4226 


U4247I 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


99 


62 


4227 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct. 


172 


59 


4228 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


161 


74 


4229 


AL049537 


Homo sapiens 


dJ 11 641 10.1 (brefeldin A- inhibited 
guanine nucleotide-exchange protein 2) 


212 


90 


4230 


AF1 18086 


Homo sapiens 


PRO 1992 


150 


69 


4231 


M81321 


Macaca 
fascicularis 


proline-rich protein 


130 


41 


4232 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


138 


68 


4233 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


132 


56 


4234 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


110 


52 


4235 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


124 


32 


4236 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


119 


42 


4237 


X93498 


Homo sapiens 


21-Glutamic Acid-Rich Protein 


368 


56 


4238 


R95913 


Homo sapiens 


Neural thread protein. 


139 


55 


4239 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


146 


63 


4240 


X97675 


Homo sapiens 


plakophilin 2b 


108 


78 


4241 


Y36156 


Homo sapiens 


Human secreted protein #28. 


137 


65 


4242 


AK000385 


Homo sapiens 


unnamed protein product 


131 


33 


4243 


AJ252253 


human 
herpesvirus 2 


glycoprotein G-2 


107 


36 


4244 


Y99447 


Homo sapiens 


Human PRO 1556 (UNQ764) amino 
acid sequence SEQ ID NO:372. 


597 


100 


4245 


Y05398 


Homo sapiens 


Human TIE ligand NL8 protein 
sequence. 


424 


84 


4246 


L00016 


Homo sapiens 


urf4 


222 


90 


4247 


AF134579 


Zea mays 


arabinogalactan protein 


134 


39 


4248 


Z34465 


Zea mays 


extensin-like protein 


123 


33 


4249 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


128 


62 


4250 


J02695 


Plasmodium 
yoelii 


circumsporozoite protein 


110 




4251 


AK024455 


Homo sapiens 


FLJ00047 protein 


136 


65 


4252 


U02570 


Homo sapiens 


CDC42 GTPase-activating protein 


566 


93 


4253 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


449 


85 


4254 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


111 


63 


4255 


M61143 


Bovine 
herpesvirus 1 


latency-related open reading frame 2; 
putative 


101 


42 


4256 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


113 


43 


4257 


Y06294 


Homo sapiens 


Human transcription regulator MOP6 
partial sequence. 


410 


79 


4258 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


112 


61 


4259 


D90252 


Human 

papillomavirus 
type 5 b 


E4 protein 


110 


32 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 

Waferman 

Score 


% 

lcJenht 

y 


4260 


X90569 


Homo sapiens 




- 660 


95 


4261 


AF090930 




PRO0478 


- -iii 


71 


4262 


AF 132209 


Homo sapiens 


prepro-major basic protein homolog 




72 


4263 


AF043102 


Pneumocystis 
carinii 


surface glycoprotein A 


121 


29 


4264 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


132 


55 


4265 


AF090895 


Homo sapiens 


PRO0 1 1 7 


- 

159 


62 


4266 


AF090930 


Homo sapiens 


PRO0478 


. 1 . 


50 


4267 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


119 


34 


4268 


P70494 


Homo sapiens 


factor (BCGF). 




106 


- — 

61 


4269 


D23660 


Homo sapiens 


ribosomal protein 


493 


73 


4270 


AF1 30089 


Homo sapiens 


PRO2550 


1 19 


39 


4271 


U93564 


Homo sapiens 


putative pi 50 




37 


4272 


AB033615 


IVIus musculus 






79 


4273 


AF006082 


Homo sapiens 


Arp2 


488 


85 


4274 


Y 12293 








33 


4275 


X54289 


Bos taurus 


cGMP-dcpendent protein kinase 
(isoform I beta) 


561 


81 


4276 


AF1 19855 


Homo sapiens 


PRO 1847 


155 


71 






Homo sapiens 


MTND4 


372 


63 


4278 


AB02 1 078 


.riasmia i_.oiiu- 
P9 


ybbA 


101 


30 


4279 


G00328 




— ; . 

Human secreted protein, SEQ ID NO: 
4409. 


1 10 


73 


4280 


G00328 




Human secreted protein, SEQ IE) NOi 
4409. 


1 10 


73 


4281 


AF1 19851 




PRO 1722 


1 10 


60 


4282 


D381 16 


Pa^pan^scu" 5 


NADH dehydrogenase subunit 1 


346 


92 


4283 


Y1590S 


Homo sapiens 


DIA-12C protein 


109 


52 


4284 


Y30681 




Splice variant ZAP- IB protein of the 
human tumor suppressor gene ZAP-1. 


122 


76 


4285 


G04072 




Human secreted protein, SEQ ID NO: 
8153. 


121 


42 


4286 


AK024455 




FLJ00047 protein 


135 


57 


4287 


Y11525 


Homo sapiens 


CCAAT/enhancer binding protein 


100 


32 


4288 


AF033122 


Homo sapiens 


non-p53 regulated PA26-T1 nuclear 
protein 


128 


81 


4289 


W48351 


~Ho a~ien 

omo sapiens 


Human breast cancer related protein 
BCRB2. 


1 1 1 


39 


4290 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


140 


58 


4291 


AJ242540 


nasariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


132 


34 


4292 


X60432 


Zea mays 


prolin rich protein 


118 


42 


4293 


U09367 


Homo sapiens 






60 


4294 


AF038960 


Homo sapiens 


SKD1 homolog 


146 


82 


4295 


AL3901 14 


Leishmania 
major 


extremely cysteine/valine rich protein 


128 


54 


4296 


U66561 


Homo sapiens 


kruppel-related zinc finger protein 






4297 


AF043706 


Caenorhabditis 
elegans 


contains similarity to granulins 


104 


55 


4298 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


138 


65 


4299 


AF152510 


Homo sapiens 


protocadherin gamma A3 short form 
protein 


520 


87 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 














4300 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


72 








5327. 






4301 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


50 








7870. 






4302 


AL035440 


Arabidopsis 


putative protein 


103 


31 






thaliana 








4303 


L05500 


Homo sapiens 


adenylyl cyclase 


655 


96 


4304 


AF116712 


Homo sapiens 


PR02738 


140 


54 


4305 


D13757 


Homo sapiens 


amidophosphoribosy transferase 


182 


92 


4306 


L20450 


Mus musculus 


DNA-binding protein 


470 


64 


4307 


AF213386 


Mus musculus 


ATP-binding cassette protein 


175 


97 


4308 


X76850 


Mus musculus 


MAP kinase-activated protein kinase 2 


154 


69 


4309 


S801 19 


Rattus sp. 


reverse transcriptase homolog 


123 


55 


4310 


X67337 


Homo sapiens 


Human pre-mRNA cleavage factor 1 68 


110 


32 








kDa sub unit 






4311 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


96 


58 








7868. 






4312 


Z46236 


Ovis aries 


keratinocyte growth factor 


210 


75 


4313 


AB020700 


Homo sapiens 


KIAA0893 protein 


569 


87 


4314 


AF124727 


Homo sapiens 


acinusS 


498 


88 


4315 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


88 


32 








clone HPMBQ32. 






43 16 


AL 160493 


I>ershmania 


probable (hhv-6) u 1 1 02, variant a 


122 


53 








DNA, complete virion genome 






4317 


AF1 19900 


Homo sapiens 


PR02822 


135 


58 


4318 


U74612 


Homo sapiens 


hepatocyte nuclear factor-3/fork head 


104 


70 














4319 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 17 


51 








7124. 






4320 


AB026054 


Homo sapiens 


brain finger protein 


412 


90 




AF090944 




PRO0663 


133 


71 


4322 


AF 1655 1 3 


Homo sapiens 


vacuolar protein sorting 45 isoform 


712 


97 


4323 


Y994 1 8 


Homo sapiens 


Human PR01317 (UNQ783) amino 




97 








acid sequence SEQ ID NO:277. 






4324 


AF163772 


Leishmania 


7138.7 


142 


37 














4325 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


59 








7888. 






4326 


W40054 


Homo sapiens 


P300/CBP-associated transcriptional 


175 


94 








cofactor P/CAF C-terminus. 






4327 


AF21741 1 


Homo sapiens 


neuroligin 3 isoform HNL3 


549 


84 


4328 


U94836 


Homo sapiens 


ERPROT 213-21 


175 


87 


4329 


Y20763 


Homo sapiens 


Human neurofilament-M mutant 


501 


87 








protein fragment 45. 






4330 


AB037745 


Homo sapiens 


KIAA 1324 protein 


1014 


99 


4331 


M91563 




NMDA receptor subtype 2C 


116 


39 






norvegicus 













Homo sapiens 


Human secreted protein SEQ ID NO" 


286 


76 














- ■■ 




... : 

t\\ r 1 a PS,S 


Mutator like trans osase 

u a or- e ansposase 


130 


70 






a * a " : 










4 


AJrUUUJ f j 


Arabidopsis 


— r— 5 rr- : 

jasmonate inducible protein; 


~ no 


— 






thaliana 


myrosinase binding protein-like 






4335 


U22961 


Homo sapiens 


similar to human albumin, Swiss-Prot 


377 


90 








Accession Number P02768; Method: 












conceptual translation supplied by 












author 






4336 


AJ277426 


Globodera 


putative cuticular collagen 




37 






pallida 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


4337 


U 16802 


— Rattus — 

norvegicus 


Ca2+-dependent activator protein; 
calcium-dependent actin-binding 
protein 


560 


79 


4338 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No 119 


118 


56 


4339 


AB024520 


Homo sapiens 


notch4 


- 

93 


33 


4340 


AF128406 


Homo sapiens 


nuclear prelamin A recognition tactor 


241 


100 


4341 


W19771 


Homo sapiens 


Beta-1 integrin modulator Bl 71 . 


169 


100 


4342 


AL096828 


Homo sapiens 


dJ963E22. 1 (Novel protein similar to 
in i-ivcin-z Antigen) 


666 


88 


4343 


D50312 


- Homosa 'iens 

" Homo salens 




542 


80 


4344 


U84487 




CX3C chemokine precursor 


371 


78 


4345 


U93569 


Homo sapiens 


putative pl50 


111 


47 


4346 


AL161755 


coelicolor A3(2) 


putative serine/threonine protein kinase 


102 


30 


• 4347 


AK026162 


Homo salens 


unnamed protein product 


377 


98 


4348 


AF221759 






314 


47 


4349 


AF165926 


Homo sapiens 


NUP1 55 


147 


84 


4351 


U50185 


Rattus 

Rattuf 10 " 3 


PP-1M 


144 


52 


4352 


D50455 


norvegicus 


r— , 

phodpholipase C delta4 


196 


70 


4353 


X54131 


Homo sapiens 


protein-tyrosine phosphatase 


261 


87 


4354 


AF151850 


Homo sapiens 


CG1-92 protein 


255 


92 


4355 


G03996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


95 


4356 


U93564 


Homo sapiens 


putative pi 50 


171 


72 


4357 


AP001 507 


Bacillus 
halodurans 


unknown conserved protein 


159 


34 


4358 


G03787 




.Human secreted protein, SEQ ID NO: 
7868. 


153 


60 


4359 


AF072697 






177 


97 


4360 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
787 ^" 


129 


69 


4361 


M98776 


Homo sa iens 

~Homo sa^ie" 3 — ~ 


Keratin 1 


449 


77 
91 


4362 


AK024436 




t*.LJU0026 protein 


671 


4363 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


155 


49 


4364 


D13896 


norvegicus 


cytoplasmic dynein heavy chain 


327 


92 


4365 


X79389 




glutathione transferase Tl 


164 


96 


4366 


AB006458 


Mus musculus 


alpha-D-mannosidase 


177 


56 


4367 


X01060 




put. transferrin receptor (aa 1-760) 


160 


96 


4368 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2 


103 


54 


4369 


AL353715 


Homo sapiens 


bK3184A7.3.1 (helicase-like protein 


485 


100 


4370 


AF001631 


Oryctolagus 
cuniculus 


glucose-regulated protein GRP94 


118 


92 


4371 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


113 


80 


4372 


AT 1917/11 


Schizosaccharom 
yces pom be 


putative negative regulator of vesicle 
formation 


200 


41 


4373 


M96982 


Homo sapiens 


U2 snRNP auxiliary factor small 
subunit 


279 


70 


4374 


U47741 


Homo sapiens 


CREB-binding protein 


123 


96 


4375 


Y0267I 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


127 


75 


4376 


Y12781 


Homo sapiens 


transducin (beta) like 1 protein 


1056 


90 
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SEQ ID 


Accession No. 




Description 


Smith- 




NO: 








Watermiin 


Idcntit 








■■ 


Score 


y 


4377 


W69424 


Homo sapiens 


Human secreted protein bgl40_l . 


1S .-L . 


97 


4378 


AL096700 


Homo sapiens 


dJ499B10.2 (phosphorylase kinase, 


639 


86 








alpha 2 (liver) (PYK)) 








AF 177390 


Manduca sexta 


antennal specific membrane protein 














—— 


— 




— — 


— — — 

p^J^p 


S : 

Homo sapiens 


plakophihn 2b 


— — 




8 




Homo sapiens 






— — 

100 — 


4382 


AtSUl 54/j 


Arabidopsis 


gene_id:MCM23. l~unknown protein 


~TF 


61 






thaliana 








4383 


AF1 1671 5 


Homo sapiens 


PR02829 


133 


50 


4384 


AL357472 


Homo sapiens 


VPS33B 


676 


99 


4385 


AF09093 1 


Homo sapiens 


PRO0483 


155 


58 


4386 


U92645 


Gecarcinus 


alpha- 1 -tubulin 


51 1 


75 






lateralis 








4387 


Y36095 


Homo sapiens 


Extended human secreted protein 


202 


90 






sequence, SEQ ID NO. 480. 






4388 


M64923 


Bos taurus 


C10 protein 


218 


95 


4389 


X98264 


Homo sapiens 


M-phase phosphoprotein 4 


197 


100 


4390 


AK000264 


Homo sapiens 


unnamed protein product 


239 


73 


4391 


M12937 


Homo sapiens 


ferritin heavy subunit 


662 


84 


4392 


P92219 


Homo sapiens 


CR1 protein. 


1 16 


80 






(human) 








4393 


X16135 


Homo sapiens 


L protein (AA 1-558) 


759 


99 


4394 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


5S 








4478. 






4395 


AC005591 


Homo sapiens 


PkB-like 


170 


97 


4396 


AF161426 


Homo sapiens 


HSPC308 




77 


4397 


Y0267 1 


Homo sapiens 


Human secreted protein encoded by 


146 


61 








nana T) TJTVyf C r\\ J 1 o 

gene zz clone ruvicij w 1 8. 






4398 


AF 104921 


Homo sapiens 


succinyl-CoA synthetase alpha subunit 


679 


88 


4399 


AF257330 


Homo sapiens 


COBW-like protein 


586 


90 


4400 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


65 








7888. 






4401 


AF1 19851 


Homo sapiens 


PRO 1722 


101 


79 


4402 


M55542 


Homo sapiens 


guanylate binding protein isoform I 


230 


76 


4403 


Y07752 


Volvox carteri 


pherophorin-S 


459 


88 


4404 


W48351 


Homo sapiens 


Human breast cancer related protein 


113 


44 






BCRB2. 






4405 


U07786 


Sus scrofa 


beta actin 


— 




4406 


A06977 


Homo sapiens 


albumin 


586 


88 


4407 


L20755 


Cuscuta reflexa 


hybrid proline-rich protein;cytokinin- 


12 


41 








induced;haustoria 






4408 


AB002299 


Homo sapiens 


KIAA0301 


612 


98 ... 




AB001424 


Mus musculus 


KIF17 


_I2z 


_zZ 


4410 


M8S108 


Homo sapiens 


P 62 . .... 


574 


92 


441 1 


AL121673 


Homo sapiens 


bA305P22.2 (novel protein) 




71 


4412 


AF064553 


Mus musculus 


NSD1 protein 


224 


64 


4413 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


107 


36 








similar to pogo element 






4414 


A06977 


Homo sapiens 


albumin 


505 


82 


4415 


A06977 


Homo sapiens 


albumin 


596 


90 


4416 


Y871 16 


Homo sapiens 


Human secreted protein sequence SEQ 


123 


50 








IDNO:155. 






4417 


AL359782 


Trypanosoma 


possible (hhv-6) u 1 1 02, variant a dna, 


103 


85 






brucei 


complete virion genome. 






4418 


G03053 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


54 








7134. 






4419 


AF049606 


Mus musculus 


transcription factor NF-ATc isoform b 


114 


90 
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SEQ 10 

NO: 


Accession No. 


Species 


Description 


Waterman 

Score 


Identit 

y 


4420 ' 


Zl 1 190 cd] 


Homo sapiens 


-- . 

1 l-DEC-1998 Interleukin-3 coding 
sequence from b2HFLS20W cDNA 


354 


97 


4421 


W64469 


Homo sapiens 


Human secreted protein from clone 
CW795 2 


203 


100 


4422 


Ml 2523 


Homo sapiens 


alloalbumin Venezia 


350 


94 


4423 


AF1 30077 


Homo sapiens 




561 


89 


4424 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


161 


64 


4425 




Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


393 


91 


4426 




synthetic 
construct 


human serum albumin 


651 


96 


4427 
— — 


AL133215 




Homo sapiens 



bA108L7.5 (novel protein similar to 
Plasmodium POM1 and C. elegans 
F46G11.1 (Tr:Q20485)) 


392 


100 




Wj j663 


Homo sapiens 


Human puromycin-sensitive 
aminopeptidase (PSA)-68. 


172 


96 


4429 




— : 

Homo sapiens 


DD2/biJe acid-binding 
protein/AKRl C2/3alpha- 
hydroxysteroid dehydrogenase type 3 


184 


81 


4430 


W636S3 


Homo sapiens 


Human secreted protein 3. 


114 


42 


4431 


AY008763 


Homo sapiens 


sentrin/SUMO-specific protease 


447 


96 


4432 


U52965 






176 


94 


4433 


AF 180470 


Mus musculus 


Kiaa0575 


423 


77 


4434 


X 1 7206 


Homo sapiens 


put. LLRep3 protein (AA 1-221) 


581 


99 


4435 


Y70929 


Homo sapiens 


Human zilla4 splice variant protein. 


621 


100 


4436 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


135 


71 


4437 


A06977 


Homo sapiens 


albumin 


610 


97 


4438 


A00279 


synthetic 


Human serum albumin 


621 


94 


4439 


L29028 


Chlamydomonas 
eugametos 


amino acid feature: N-glycosylation 
sites, aa41 .. 43, 46 .. 48, 51 .. 53, 72 .. 
74, 107 .. 109, 128 .. 130, 132 .. 134, 
158 .. 160, 163 .. 165; amino acid 
feature: Rod protein domain, aa 169 .. 
340; amino acid feature: globular 
protein domain, aa 32 .. 168 


105 


36 


4440 


AK024455 


Homo sapiens 


FLJ00047 protein 


112 


65 




U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


133 


69 


4442 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


142 


60 


4443 




Homo sapiens 


PRO0974 


101 


54 


4444 


W03627 


Homo sapiens 


Human follicle stimulating hormone 
GPR N-terminal sequence. 


181 


42 


4445 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
: 


132 


76 


4446 


G004 12 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493 • ■■ 


141 


54 


4447 


L02867 


~G : 

Homo sapiens 


paraneoplastic antigen 


136 


78 


4448 


D38435 


Homo sapiens 


homologue of yeast PMS 1 






4449 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


122 


65 


4450 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


68 


4451 


AF220264 


Homo sapiens 


MOST-1 


101 


40 


4452 


AF 13 0079 


Homo sapiens 


PR02852 


97 


59 


4453 


AF116715 


Homo sapiens 


PR02829 ~" 


139 


69 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Walerman 


Identit 

y 


4454 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


53 


4455 


AF130087 


Homo sapiens 


PR02411 


108 


73 




nrZ / 707 J 


Homo sapiens 


dead box protein 1 5 






4457 


L0S258 


Strongylocentrot 
us purpuratus 


kinesin light chain isoform 4 


176 


94 


4458 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


78 





G03 1 72 


— : 

Homo sapiens 


Human cwriat^rl nrntpin CPO TP) 7\!fV 

numdn secreted protein, ocy lu r*\j. 
7253. 


T46 


— 


4461 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


156 


52 


4462 


UU2DU / 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


72 


~4463 


AF1 1 9900 


Homo sapiens 


PR02822 


~1~44 


— 


4464 


M15530 


Homo sapiens 


B-cell growth factor 


92 


76 


4465 


Y 1783 j 


Human 
endogenous 


env protein 


107 


62 


4466 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


132 


78 




AF090S95 


Homo sapiens 


PRO0 1 1 7 






4468 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


64 


4469 


X97675 


Homo sapiens 


plakophilin 2b 


119 


78 


4470 


AF1 1 8081 


Homo sapiens 


PRO 1 900 


1 19 


74 


4471 


G0j564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


60 


4472 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


63 


4473 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

&£LV^ ID ISKJ. iOj, 


205 


73 


4474 


AFI30114 


Homo sapiens 


PR02459 


121 


67 




AF178534 


Homo sapiens 








4476 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


122 


92 


4477 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 

stlnnA LI/" 1 A C TO 1 

clone hi^aljo i . 


99 


66 


4478 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


142 


62 


4479 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


55 


~4480 




r 

Acanth aster 

planci 


cytochrome oxidase subunit I 


~~Fo 


"Too — 


4481 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


57 


4482 


Y20983 


Homo sapiens 


Human glial fibrillary acidic protein 
GFAP wild type fragment 9. 


110 


63 


4483 


AF221552 


Oryza sativa 


proline-rich protein RiP-15 


119 


33 


4484 


L25941 


Homo sapiens 


integral nuclear envelope inner 
membrane protein 


110 


84 


4485 


AF 130089 


Homo sapiens 


PRO2550 


161 


81 


4486 


U39742 


Caenorhabditis 
elegans 


coded for by C. elegans cDNA 
yk25e5.3; coded for by C. elegans 
cDNA yk25e5.5; similar to repeat 
guanylate kinase domain of D. 
melanogaster Iethal(l) discs large- 1 
tumor suppressor protein 
(SP:DLG I_DROME, P31007) and R. 
norvegicus postsynaptic density protein 


98 


72 
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SEQID 

NO: 


Accession No. 


Species 


Description.. , x >' 


Smith- 
VVatermnn 


% 

idcntit 

y 








95 (PSD-95) (SP:PSD9 RAT, P3 10 16) 






4487 


Y86248 




Human secreted protein HCHPF68, 
SEQID NO: 163. 


89 


81 


4488 


X97675 


Homo sapiens 


plakophilin 2b 


127 


88 


4489 


G03714 


Homo sapiens 
- — : 


Human secreted protein, SEQ ID NO: 
7795 - . 


105 


71 


44 0 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


160 


64 


4491 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


77 


4492 


U83280 


donovanT' 3 


39 kDa antigen 


107 


76 


4493 


AF023142 


Homo sa iens 

Homo ^ P - CnS 


pre-mRNA splicing SR protein rA4 


141 


40 


4494 


K02576 




salivary proline-rich protein 1 


1 19 


40 


4495 


Y86248 


"Tfomo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


98 


45 


4496 


AF130089 








75 


4497 


AF090930 


Homo sapiens 


PRO0478 


160 


76 


4498 


AF1 19900 






158 


55 


4499 


AF 1 16661 


Homo sapiens 


PRO 143 8 


126 


44 


4500 


AF1 16661 


Homo sapiens 


PRO 143 8 


118 


42 


4501 


AL 160371 


Leishmania 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


113 


48 


4502 


M81321 


Macaca 
fascicularis 


proline-rich protein 


154 


44 


4503 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


154 


96 1 


450 




Saccharomyces 
cerevisiae 


Mdjlp heat shock protein 


93 


42 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


163 


76 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


207 


93 






Homo sapiens 


P 4Q 


520 


86 


~430~8" 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


135 


55 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


109 


77 


4510 


G 0063 7 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


146 


66 


451 1 




Homo sapiens 


reverse transcriptase 


143 


87 


4512 



Y86248 




Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


105 


80 






— — ; 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


75 


4514 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


128 


60 


~45l3 


t\r uuuo i o 


Oryza sativa 


similar to RING-H2 finger protein 

XV-Tl/V 1 a (Ax U / oOco ) 


120 


73 


4516 


AF130051 


Homo sapiens 


PRO0898 


141 


86 




U12690 


Homo sapiens 


cytochrome oxidase subunit II 


185 


94 


4518 


G02597 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6678. 


130 


100 


4519 


AK024455 


Homo sapiens 


FLJ00047 protein 


152 


68 


4520 


AF2I0651 


Homo sapiens 


NAG 18 


161 


88 


4521 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


140 


68 


4522 


AF2 17374 


Acanthaster 
planci 


cytochrome oxidase subunit I 


131 


76 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 


4523 


U72520 


Mus musculus 


mena protein 


102 


42 


4524 


JO 1 4 1 5 


Homo sapiens 


cytochrome oxidase subunit 3 


129 


75 


4525 


AL1 60493 


Leishmania 
major 


probable (hhv-6) ul 1 02, variant a 
DNA, complete virion genome 


108 


84 


4526 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


139 


76 


4527 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


138 


89 


4528 


AF090931 


Homo sapiens 


PRO0483 


91 


88 


4529 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


1 1 1 


51 


4530 


G02515 




Human secreted protein, SEQ ID NO: 
6596. 


130 


92 


4531 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


123 


64 


4532 


L26953 


Homo sapiens 


chromosomal protein 


102 


79 


4533 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


88 


54 


4534 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


122 


40 


4535 


AF090944 


Homo sapiens 


PRO0663 


. 113 


53 


4536 


U15647 


Mus musculus 


expressed in select embryonal 
carcinoma ceils and testicular ceils; 
similar to Swiss-Prot Accession 
Number PI 1260 


119 


35 


4537 


L13858 


Homo sapiens 


guanine nucleotide exchange factor 


150 


81 


4538 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


138 


68 


4539 


AF090930 


Homo sapiens 


PRO0478 


104 


53 


4540 


AF095770 


Homo sapiens 


PTH-rcsponsive osteosarcoma Dl 
protein 


1 12 


64 


4541 


K02401 


Homo sapiens 


chorionic somatomammotropin 


672 


96 


4542 


W88627 




clone HPMBQ32. 


171 


67 


4543 


J04208 


Homo sapiens 


inosine-5'-monophosphate 
dehydrogenase (EC 1.1.1 .205) 


455 


92 


4544 


K0240 1 


Homo sapiens 


chorionic somatomammotropin 


397 


88 


4545 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


155 


66 


4546 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


161 


68 


4547 


R24056 


Homo sapiens 


hGH variant #8 - I72Arg 174Thr 
176Tyr 178His. 


1 17 


62 


4548 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


1 1 1 


68 


4549 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


124 


65 


4550 


AF1 19855 


Homo sapiens 


PRO 1847 


152 


68 


4551 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


153 


70 


4552 


M59217 


Homo sapiens 


alpha-1 type XIII collagen 


98 


57 










136 


96 


4554 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


161 


72 


4555 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


91 


60 


4556 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


70 


4557 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 


101 


44 



249 
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SEQID 
NO 


Accession No. 


Species 


Description 


Smilh- 
W a term a n 


% 

Idenrit 

y 








isoform 






4558 


AF130089 


Homo sapiens 


PRO2550 


150 


85 


4559 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


161 


56 


4560 


W4835 1 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


81 


64 


4561 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


1 1 5 


55 


4562 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


106 


82 


4563 


AF090931 


Homo sapiens 


PRO0483 






4564 


AJ242540 


Volvox carteri f. 
HottuT" 5 ^ 


hydroxyproline-rich glycoprotein DZ- 


358 


77 


4565 


AF1 30089 






— — 

175 


85 


4566 


Z93244 


Homo sapiens 


DIS.1 1 Orj.Z ^rU 1A1 I vt KXlOvj/Y-r 

(CDC42 GTPAse Activating Protein) 
LIKE protein) 






4567 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


142 


56 


4568 


AF130089 


Homo sapiens 


PRO2550 


125 


75 


4569 


AF130089 




PRO2550 






4570 


AF225961 


Rattus 


RhoGEF glutamate transport modulator 
GTRAP48 


149 


78 


4571 


W48351 


Homo sapiens 


Human breast cancer related protein 


143 


64 


4572 


G02640 


Homo sapiens 


Human secreted protein SEQ ID NO" 
6721. 


"T57 


— 


4573 


G03786 




7867. 






4574 


X92485 


Plasmodium 
vivax 








4575 


U93567 










4576 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
cione XiiNJrtruzy. 


90 


57 


4577 


AC007508 


Arabidopsis 
Homo'sa iens 


F1K23.4 


143 


42 


4578 


G03807 


sapiens 


—j : 

Human secreted protein, SEQ ID NO: 
7888. 


. 

145 


53 


4579 


G03793 


Homo sapiens 


7874. 


—yfl 


_ 


4580 


G00338 


Homo sapiens 


Human secreted protein SEQ ID NO* 
4419. 


142 


76 


4581 


AF1 30079 


Homo sapiens 


PR02852 


117 


70 


4582 


AF090930 


Homo sapiens 


PRO0478 


109 


.57 


4583 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


134 


79 


4584 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


62 


4585 


AF1 18090 


Homo sapiens 


PRO2044 


175 


97 


4586 


AL080243 


Homo sapiens 


El A binding protein p300; match: 
proteins: Sw:Q09472 Sw:Q92793 
ow.r'fj'tol wp.^n.UUj/1 
vvp. L^JCZ I I 1 / LT.KJU IJoo Wp.L^.cl'ocOO 

Wp:CE00570 Wp:CE08453 Tr:O44076 


144 


93 


4587 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


140 


62 


4588 


AJ131245 


Homo sapiens 


Sec24B protein 


131 


81 


4589 


W02105 


Homo sapiens 


Human L-asparaginase. 


559 


100 


4590 


L26953 


Homo sapiens 


chromosomal protein 


146 


71 


459] 


Ml 6976 


Glycine max 


N-75 | 125 


36 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smirh- 
Wntermnn 


% 

Identit 


4592 


AF1 19851 


Homo sapiens 


PRO 1722 


118 


61 


4593 


AF090931 


Homo sapiens 


PRO04S3 


99 


85 


4594 


M20789 


Homo sapiens 


alpha- 1 type I collagen 


103 


41 


4595 


G00381 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4462. 


97 


72 


4596 


AF130089 


Homo sapiens 


PRO2550 


124 


71 


4597 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


143 


65 


4598 


AF11S082 


Homo sapiens 


PRO 1902 


122 


64 


4599 


AK023034 


Homo sapiens 


unnamed protein producl 


156 


100 


4600 


AF1 18082 


Homo sapiens 


PRO 1902 


148 


62 


4601 


AF1 19855 


Homo sapiens 


PRO 1847 


130 


50 


4602 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


99 


77 


4603 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


162 


65 


4604 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


135 


54 


4605 


AL 160371 


Leishmania 


probable (hhv-6) u 11 02, variant a 
DNA, complete virion genome 


123 


75 


4606 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


127 


74 


4607 


V00672 


Pan troglodytes 


reading frame protein 4 


194 


72 


4608 


X86791 


Sus scrofa 




156 




4609 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


130 


88 


4610 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


146 




4611 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


149 


68 


4612 


AB029309 


Homo sapiens 


Npw38-binding protein NpwBP 


120 


38 


4613 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


148 


53 


4614 


AK024372 


Homo sapiens 


unnamed protein product 


124 


63 


4615 


AC00536O 


Homo sapiens 


FAA 


175 


91 


4616 


D38113 


Pan troglodytes 


ATPase subunit 6 


204 


69 


4617 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


122 


45 


4618 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


118 


81 


4619 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


99 


— 


4620 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


125 


50 


4621 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


107 




4622 


AF 126 163 


Homo sapiens 


HHLA3 protein 


124 


75 


4623 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


109 


69 


4624 


U38184 


Trypanosoma 


ATPase subunit 6 


102 


45 


4625 


U01849 


Topanosoma 

brucei 


ORF1 


~97 


— 


4626 


G03058 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7139. 


95 


57 


4627 


M33228 


Trypanosoma 
brucei 


ATPase 6 


126 


37 


4628 


AF 118086 


Homo sapiens 


PRO 1992 


113 


60 


4629 


AF041330" 


Bodo saltans 


NADH dehydrogenase subunit 5 


149 


41 
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ISDOCID: <WO 01 64835A2_I_> 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 


Idcntit 

y 


4630. 


X12544 


Homo sapiens 
l 


3 HLA-DR B protein precursor (AA - 
29 to 267) 


108 


48 


"46^1 


Y 86^4 8 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


1 1 1 


46 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


124 


60 






Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


151 


60 






Homo sapiens 


cytochrome oxidase subunit 3 


138 


100 


4635 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


135 


56 


4636 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: ] 63 . 


189 


100 


4637 


X67320 


Rattus 
norvegicus 


Hld-Histone 


122 


40 


4638 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


181 


76 


4639 




Y36366 




Homo sapiens 

: 


Fragment of human secreted protein 
encoded by gene 3. 


96 


73 


640 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871 . 


136 


54 


4641 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHA 1 a (AF078683) 


123 


70 


4642 


AJ242540 


Vol vox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


190 


56 


~4643 


ivizy j5U 


Homo sapiens 


zinc finger protein 7 (ZFP7) 


103 


38 


4645 


U47741 


Homo sapiens 


CREB-binding protein 


207 


94 


4646 




Homo sapiens 


dJ967N2 1.5 (novel MCM2/3/5 family 
member) 


213 


80 


4647 


G03435 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7516. 


76 


60 


~4648 




Homo sapiens 


HBV pX associated protein-8; XAP-8 


402 


86 


4649 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613 ; 


143 


86 


4650 


U93574 


Homo sapiens 


putative pi 50 


121 


33 


4651 


A02739 


synthetic 


chloramphenicol acetyltransferase 


622 


99 


4652 


AF190449 


Mus musculus 


putative transcription factor ALF-4 


274 


83 


4653 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


108 


76 


4654 


X]5187 


Homo sapiens 


precursor polypeptide (AA -21 to 782) 


763 


98 


4655 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHA 1 a (AF078683) 


128 


91 


4656 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


169 


77 


4657 


BO 13 72 


Homo sapiens 


Neuron-associated protein. 


135 


83 






Homo sapiens 


FLJ00047 protein 


151 


60 


4659 


AC004850 


Homo sapiens 


vacuolar assembly protein VPS41 
homolog (S53) 


187 


92 


~4fiSn — 


Y86248 


— ; 

Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


181 


92 


4661 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


138 


58 


4662 


Y 14479 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


154 


78 


4663 


M69180 


Homo sapiens 


norunuscle myosin heavy chain-A 


106 


84 


4664 


AF 130089 


Homo sapiens 


PRO2550 


117 ' 


65 


4665 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


133 


91 
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WO 01/64835 PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 




















Scorc rmfln 


^dentit 


4666 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


62 








7795. 






4667 


AP000616 


Oryza sativa 


similar to PJNG-H2 finger protein 


111 


78 








RHAla(AF078683) 






4668 


AF113695 


Homo sapiens 


PRO 1365 


82 


71 


4669 


AF003540 


Homo sapiens 


Krueppel family zinc finger protein 


1 1 1 


64 


4670 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


134 


49 








RHAla(AF078683) 






4671 


AL109658 


Homo sapiens 


dJ776F14.1 (orthologofmouseP47) 


1 10 


100 


4672 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


62 








7870. 






4673 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


138 


60 








RHAla(AF078683) 






4674 


X51591 


Homo sapiens 


beta-myosin heavy chain (1151 AA) 


442 


97 


4675 


Y27S54 


Homo sapiens 


Human secreted protein encoded by 


148 


76 








gene No. 101. 






4676 


D86971 


Homo sapiens 


no similarities to reported gene 


133 


100 








products 






4677 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


142 


43 


4678 


AF044205 


Gossypium 


proline-rich protein precursor 


117 


39 






hirsutum 








4679 


D38116 


Pan paniscus 


NADH dehydrogenase subunit 4 


243 


98 


4680 


AF1 13685 


Homo sapiens 


PRO0974 


117 


74 


4681 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


193 


90 








SEQ ID NO:163. 






4682 


L26 1 63 


Mus musculus 


histone Hie 


1 1 1 


37 


4683 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


145 


78 








SEQIDNO:163. 






4684 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


1 12 


53 








SEQ ID NO: 163. 






4685 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 


110 


60 








clone HNHF029. 






4686 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


72 








4413. 






4687 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 6 


165 


100 


4688 


W50922 


Homo sapiens 


Amino acid sequence of a heterogenous 


139 


100 








ribonucleotide protein. 






4689 


Y00281 


Homo sapiens 


precursor 


789 


100 


4690 


AF2 10651 


Homo sapiens 


NAG 18 


142 


65 


4691 


L26953 


Homo sapiens 


chromosomal protein 


124 


72 


4692 


L26953 


Homo sapiens 


chromosomal protein 


124 


72 


4693 


U 12690 


Homo sapiens 


cytochrome oxidase subunit U 


166 


80 


4694 


L27428 


Homo sapiens 


reverse transcriptase 


144 


87 


4695 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


165 


74 








SEQ ID NO:163. 






4696 


W29474 


Homo sapiens 


Human histone HI isoform H1S-1. 


93 


41 


4697 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


126 


56 


4698 


S794I0 


Mus musculus 


nuclear localization signal binding 


138 


62 








protein 






4699 


G00413 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


54 








4494. 






4700 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


70 








7795. 






4701 


AF003540 


Homo sapiens 


Krueppel family zinc finger protein 


112 


88 


4702 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


59 








4414. 






4703 


AF241242 


Mus musculus 


T-cell-specific T-box transcription 


294 


75 








factor T-bet 






4704 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


152 


80 
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SEQ ID 

NO: 


Accession No. 


I Species 


Description 


Smith- 

Wnterman 

Score 


% 

Identit 

y- 








SEQ ID NO: 163. 











Homo sapiens 


RNA helicase 


1 02 


70 






Oryza sativa 


similar to R1NG-H2 finger protein 
RHAla(AF0786S3) 


148 


63 




GCP363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


145 


70 






Maackia 
amurensis 


ENOD2f 


113 


42 




AF1 30087 


Homo sapiens 


PR024 1 1 


124 


57 


4710 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


188 


86 


4711 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


138 


86 


4712 



Y86248 




Homo sapiens 



Human secreted protein HCHPF68, 
SEQ ID NO:163. 


143 


78 


4713 




Ochotona 
macrotis 


NADH dehydrogenase subunit 4 


154 


76 


~47\4 


q Q -, 453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7534. 


104 


48 


~5ttI — 




Ovis aries 


trichohyalin 


120 


33 




nf i Jul/07 


Homo sapiens 


PRO2550 


152 


75 


4717 


AF119851 


Homo sapiens 


PRO 1722 


143 


60 




K03205 


Homo sapiens 


salivary proline-ricb protein precursor 


145 


45 


4719 


K02401 


Homo sapiens 


chorionic somatomammotropin 


619 


96 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


1 14 


52 


4721 


J03756 


Homo sapiens 


hGH-V2 


515 


74 


4722 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


194 


93 




G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


128 


56 




D49490 


Homo sapiens 


protein disulfide isomerase-related 
protein (PDIR) 


213 


55 


4725 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


138 


66 


4726 


K02401 


Homo sapiens 


chorionic somatomammotropin 


657 


98 


4727 


D87459 


Homo sapiens 


Similar to Volbox carteri extensin 
(S22697) 


162 


80 


4728. 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) u 1 1 02, variant a dna, 
complete virion genome. . 


99 


34 


4729 


AF090944 


Homo sapiens 


PRO0663 


151 


63 




M32305 


Human 

papillomavirus 
type 47 


E1/E4 fusion protein 


106 


40 


4731 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


133 


57 


4732 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


135 


76 


4733 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


114 


43 


4734 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


133 


72 


4735 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


113 


43 


4736 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


99 


59 


4737 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


92 


47 


4738 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


133 


73 
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SEQ1D 


Accession No. 


Species 


Description 


Smith- 
Waterman 


y 








RHAla (AF078683) 






4739 


AF044205 


Gossypium 
hirsutum 


proline-rich protein precursor 


105 


42 


4740 


S78854 


Oryctolagus 
cuniculus 


a Iph a-tropom yos in 


159 


87 


4741 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


151 


79 


4742 


BO] 372 


Homo sapiens 


Neuron-associated protein. 


159 


85 


4743 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


122 


100 


4744 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


176 


82 


4745 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


154 


78 


4746 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


164 


80 


4747 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


120 


100 


4748 


AL3 59782 


Trypanosoma 


probable similar to ring-h2 finger 
protein rhala. 


117 


53 


4749 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


208 


93 


4750 


AF1 55232 


Pisum sativum 


extensin 


172 


40 


4751 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


117 


76 


4752 


AF 130079 


Homo sapiens 


PR02852 


104 


70 


4753 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


154 


50 


4754 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


167 


50 


4755 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


138 


100 


4756 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


163 


66 


4757 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


102 


45 


4758 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


242 


94 


4759 


AF046935 


Homo sapiens 


PCFllp homolog 


244 


94 


4760 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


165 


41 


4761 


Y30681 


Homo sapiens 


Splice variant ZAP- IB protein of the 
human tumor suppressor gene ZAP-1 . 


115 


60 


4762 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


178 


84 


4763 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


134 


84 


4764 


R95913 


Homo sapiens 


Neural thread protein. 


136 


70 


4765 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


139 


69 


4766 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


102 


56 


4767 


U63 159 


Mus musculus 


trcmsaldolase 


218 


100 


4768 


AF230279 


Caenorhabditis 
elegans 


SW13-like protein; PSA-1 


106 


38 


4769 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


116 


56 


4770 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


136 


67 


4771 


X92485 


Plasmodium 


pval 


85 


64 



255 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






""vivax 








4772 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


95 


80 


4773 


AC002398 


Homo sapiens 


F25965 3 


96 


39 


4774 


Y86248 




Human secreted protein HCHPF68 5 
SEQ ID NO: 163. 




75 


4775 


AF120151 


Homo sapiens 


cytokine receptor-like molecule 9 




100 


4776 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


141 


74 


4777 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


108 


75 


4778 


G03172 




Human secreted protein, SEQ ID NO." 
7253. 




55 


4779 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 


101 


55 


4780 


AB007860 


Homosa iens 






88 


4781 


S58722 


Homo sapient 


X-linked retinopathy protein {C- 
tenninal, clone XEH.8c} 




88 


4782 


G0182S 




Human secreted protein, SEQ ID NO: 
5909. 


98 


84 


4783 


M22334 


Homo salens 


unknown protein 


61 1 


93 


4784 


Y36203 




Human secreted protein #75. 


350 


' 76 


4785 


K02401 


Homo sapiens 


chorionic somatomammotropin 


188 


97 


4786 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


64 


4787 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


102 


67 


4788 


M38451 


Homo sapiens 


growth hormone 


264 


94 


4789 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


146 


77 


4790 


K02401 




chorionic somatomammotropin 


250 


96 


4791 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


88 


4792 


Y36112 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 497. 


125 


78 


4793 


AP000616 


— : 

Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


131 


51 


4794 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


124 


57 


4795 


Ml 8095 


vulgaris 


hydroxyproline-rich glycoprotein 


103 


34 


4796 


B03628 




Human phospholipase 2 HPPL2. 


13 1 


69 


4797 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


160 


77 


4798 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


120 


51 


4799 


AL009146 


melanogaster 


alternatively spliced form 


79 


94 


4800 


S74221 


Homo sapiens 


IK factor=cytokine down-regulating 
riL>f\ class ii 


108 


80 


480] 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


121 


100 


4802 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


94 


90 


4803 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


101 


90 



256 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


Identit 










"fsir 


y 








— -— : 

Homo sapiens 


— j— p : 

— CUra , pr ° 6m " 




71 


4 ~~Q 


y^inAs 


Hydra sp. 


" 1 '"'"^ Q : jr— r 


126 


68 


4806 




X 12544 


Homo sapiens 


3 HLA-DR B protein precursor (AA - 


114 


56 








OQ tn 9A7"\ 
jZy lO Zu / ) 






— o~n 

4807 


W 650_i / 


— : 

Homo sapiens 


Secreted protein encoded by gene 94 


T60 


"84 — : — 








Clone rljrlvit5^jZ. 








^ 




, J. 


Homo sapiens 


NADH dehydrogenase subunit 6 


111 


94 


4809 


L26953 


Homo sapiens 


chromosomal protein 






4810 


ArUUUo I o 


Oryza sativa 


similar to RJNG-H2 finger protein 


~7o~8 









KXlA 1 a (t\r V 1 aOOD) 








~ 





Homo sapiens 


chromosomal protein 


121 


62 


4812 


Y27907 


Homo sapiens 


Human secreted protein encoded by 


124 


71 








gene No. 1 19. 






4813 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


78 


46 








„ 868 ' r ; ~T : 


— — 


— 


4814 


W48351 


Homo sapiens 


Human breast cancer related protein 












BCRB2. _ . — . 


— — 


— - 


4815 


AF1 52961 


Homo sapiens 


chromatin-specific transcription 


152 


96 








elongation factor FACT 140 kDa 












subunit 


— — 




4816 


U93569 


Homo sapiens 


putative pi 50 


— 


— 

— 


4817 


AB014575 


Homo sapiens 


KIAA0675 protein 


1_5 


67 


4818 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 14 


90 








6613 - 






4819 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


45 








4435 " ; ; J — — 





— 


4820 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 


134 


48 






prucei : 


protein rhala. 


— — - 


— 


4821 


... .. 
W48351 


Homo sapiens 


Human breast cancer related protein 












BCRB2^ — -— — 






1 




Homo sapiens 


uman is locyte-secre e ac or . — 


137 


83 


4823 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


55 












— 


. 48 .-r. : 


ArJI JOiiJ 


Homo sapiens 


PRO0974 


Tfo 


— 


4825 


U63542 


Homo sapiens 


FAP protein 


—4^ 





4826 


X8j41j 


Human 


U88 










herpesvirus 6 




-— - 


— 


4827 




G03797 


Homo sapiens 


— — : 

Human secreted protein, Sc,Q ID NO: 




76 










— 


— 


4828 


U76609 


Homo sapiens 


ribosomal L5 protein 


— 


~H 


4829 


W48351 


Homo sapiens 


Human breast cancer related protein 


110 


57 








BCRB2. 






4830 


AC006161 


Arabidopsis 


putative CENP-B/ARS-binding 


141 


31 






thaliana 


protein-like protein 


— — 


— 


4831 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




71 








7637- 








4832 


U94189 


Rattus 


Duo 


158 


79 






norvegicus 


_ 







4S33 


J0403 1 


Homo sapiens 


MDMCSF (EC 1.5.1.5; EC 3.5.4.9; EC 


193 


97 










—— 


-— 


4834 


Y13397 


Homo sapiens 


Ammo acid sequence or protein 












PR0334 






4835 


GQ3438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


53 








7519. 






4836 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


464 


93 








5327. 






4837 


AF272833 


Homo sapiens 


misato 


413 


87 


4838 


AF130089 


Homo sapiens 


PRO2550 


132 


71 


4839 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


.54 



257 
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SEQ ID 

NO: 


Accession No. 




Description . 


Smith- 
Waterman 

--Score 


Idcntit 

y 








4414. 






4840 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


89 


80 


4841 


W77354 


Homo sapiens 


Human telomere repeat binding factor 
2 truncated protein 


106 


100 


4842 


AF090942 


Homo sapiens 


PRO0657 


160 


75 


4843 


AF090944 


Homo sapiens 


PRO0663 





80 


4844 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


1 67 


81 


4845 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


166 


74 


4846 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


228 


97 


4847 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


198 


95 


4848 


AB023485 


Mus musculus 


transcription factor CAI50b 






4849 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


98 


42 


4850 


AF025467 


Caenorhabditis 


contains similarity to drosophila DNA- 


106 


45 


4851 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


179 


92 


4852 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


~T45 


82 


4853 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 






4854 


G03062 




Human secreted protein, SEQ ID NO" 
7143. 




55 


4855 


U66464 




hematopoietic progenitor kinase 


1 10 


81 


4856 


ACO 11708 


Arabidopsis 
thaliana 


putative RING zinc finger protein 


121 


43 


4857 


M81321 


Macaca 
fascicularis 


proline rich protein 






4858 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 




40 


4859 


Y86248 


Homo sapiens 


SEQIDNO:163. 




75 


4860 


X65165 


Volvox carteri 








4861 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


106 


79 


4862 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


185 


92 


4863 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68 
SEQ ID NO: 163. 


— — 




75 


4864 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHA 1 a (AF078683) 


~T*i 


-— 

100 


4865 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


163 


63 


4866 


AB023229 


Homo sapiens 


KIAA 1012 protein 


208 


92 


4S67 


R96S00 


Homo sapiens 






80 


4868 


L26953 


Homo sapiens 


chromosomal protein 


126 


70 


4869 


U63542 


Homo sapiens 






40 


4870 


D86853 


Catharanthus 
roseus 


extensin 


102 


48 


4871 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163, ■ 


152 


69 


4872 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


136 


55 


4873 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


156 


84 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 












y 


4874 


Y178j2 




Human 


env protein 


"ttzt 


67 






endogenous 












retrovirus K 








— — — 

4875 





Homo sapiens 


u 7 — : — j-r —■ 

uncnaractenzea bone marrow protein 


~RI 


38 








BM041 






— — 


L^7428 


Homo sapiens 


reverse transcriptase _ 


~7o"5 


91 




4877 


M64792 


Rattus 


salivary pronne-nch protein 


Td6 


46 






norvegicus 




-— 


— 


4878 


AFO 1 0 1 44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


99 


■ 


4879 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


105 


77 








gene No. 107. 






4880 


R58816 


Homo sapiens 


Human c-myc far upstream element 


87 


39 








(FUSE) binding protein (FBP)variant 












from PBL clone 31-10. 


- 




4881 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


67 








7339. 






4882 


AF1 75223 


Drosophila 


SANT domain protein SMRTER 


84 


89 






melanogaster 









4883 


AF 174482 


Homo sapiens 


polycomb 3 


— ^= 





4884 


Y54324 


Homo sapiens 


Amino acid sequence of a human 


147 


66 








gastric cancer antigen protein. 







4885 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


71 








700 1 - 






4886 


U55376 


Caenorhabditis 


F 1 6H 1 1 .2 gene product 


95 


100 






elegans 




— — 


— 


4887 


AF261918 


Homo sapiens 


disintegnn metalloproteinase with 












throrobospondin repeats 


■ 


— — 


4888 


J02459 


: — 

bacteriophage 


K (tail component; 199) 


610 


100 






lambda 






— 


4889 


Y95435 


Homo sapiens 


— — — 

Human calcium channel SOC- 


5 

219 


89 












-—- 


4890 


B01372 


_ : 

Homo sapiens 


Neuron-associated protein. 


"Tm 


— 


4891 


X64712 


Gallus gallus 


collagen-alpha- 3 type IX 


IT? 

-— = 


-— 


489 ? 


K-02401 


Homo sapiens 


chorionic somatomammotropin 


— ^- 




4893 


AF01 0144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


"T^5 


~55 


4894 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




62 










— 


— 


4895 


L26953 


Homo sapiens 


chromosomal protein 








4896 


AJ277740 


Homo sapiens 


RPB1 lb 1 beta protein 






4897 


B0j>628 


Homo sapiens 


Human phospholipase 2 HPPL2. 




— 


4898 


AL3 59782 


Trypan osoma 


possible (hhv-6) ul 102, variant a dna, 


— ^-j 

1^1 


67 






brucei 


complete virion genome. 






4899 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


64 








7—^ . j r -j : : 





— 


4900 


U90304 


Homo sapiens 


iroquois-class homeodomam protein 


109 


41 










— — 





4901 


D29956 


: 

Homo sapiens 


This gene is similar to tre 




73 








oncogene(X63547). 




-— 


4902 


G00332 


- 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


— rz 

116 


70 












■ 


4903 


Y69166 


Homo sapiens 


A mature human N-acetylglycosaminyl 


102 


82 








transferase protein. 








AF1 19900 


Homo sapiens 


PR02822 


~L57 


~57 


4905 


U93565 


Homo sapiens 


putative pi 50 


146 


55 


4906 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


120 


67 








4484. 






4907 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


160 


76 








encoded by gene 17. 






4908 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


136 


55 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Waterman 

Score 


Identit 

y 








SEQ ID NO. 163. 






4909 


AF155232 


Pisuni sativum 


extensin 


98 


43 


4910 


U 15647 


Mus musculus 


reverse transcriptase 


196 


50 


49 1 1 


G 02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


144 


56 


4912 


AF2254I9 


Homo sapiens 


HSCARG 


248 


61 


4913 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


110 


47 


4914 


AL1 17382 


Homo sapiens 


dJ8SJL22.2 (novel protein) 


312 


100 


4915 


AB037814 


Homo sapiens 


KIAA1393 protein 


192 


68 


4916 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 4 


169 


89 


4917 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


105 


90 


4918 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


116 


86 


4919 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


132 


100 


4920 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


183 


65 


4921 


AF090944 


Homo sapiens 


PRO0663 ~ 


141 


57 


4922 


AF090931 


Homo sapiens 


PRO0483 


159 


55 


4923 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


128 


81 


4924 


U09500 


Homo sapiens 


cytochrome b 


276 


94 


4925 


AB002377 


Homo sapiens 


KIAA0379 protein 


304 


98 


4926 


M22334 


Homo sapiens 


unknown protein 


201 


62 


4927 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


65 


4928 


AF1 30089 


Homo sapiens 


PRO2550 


116 


82 


4929 


U21123 


Drosophila 
melanogaster 


ena polypeptide 


94 


41 


4930 


W75159 


Homo sapiens 


Human secreted protein encoded by 
gene 45 clone HTTDS54. 


294 


82 


4931 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


96 


74 . 


4932 


G02460 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6541. 


101 


57 


4933 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


142 


70 


4934 


AF130087 


Homo sapiens 


PR02411 


137 


80 


4935 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


132 


56. 


4936 


AB032604 


Mus musculus 


MIWI(piwi) 


313 


50 • 


4937 


X74987 


Homo sapiens 


2'-5' oligoadenyiate binding protein 


144 


96 


4938 


AF054658 


Homo sapiens 


kappa I immunoglobulin light chain 
variable region 


120 


87 


4939 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


134 


67 


4940 


U22230 


Felis catus 


ribosomal protein SI 7 


118 


100 


4941 


AB028956 


Homo sapiens 


KIAA1033 protein 


527 


88 


4942 


G00429 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4510. 


281 


86 


4943 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


71 


4944 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasm id pGCSl 1 037. 


108 


95 


4945 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


170 


76 


4946 


AF090895 


Homo sapiens 


PRO0117 


125 


65 


4947 


D13866 


Homo sapiens 


alpha-catenin 


575 


100 


4948 


L01664 


Homo sapiens 


lysophospholipase 


166 


96 



260 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 


4949 


AF1 1 S0S5 


Homo sapiens ■ 


PRO 1 975 




96 


4950 


AK024455 


Homo sapiens 


FLJ00047 protein 


95 


81 


4951 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
KrlA 1 a (Ar07oooj) 


1 Ij 


58 


— i^r 


A T3f\ 1 t 1 /lO 
AtJUl 1 J^tZ 


Homo sapiens 


maauj /u protein 


— — 


— ^ 


4953 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


— — 


39 


"4954 


Q02532 


~u '■ 

riomo sapiens 


Human secreted protein SEQ ID NO* 
6613. 


T75 


69 




AK024455 




FLJ00047 protein 




64 


4956 


AF116715 


Homo sapiens 


PR02829 


142 


75 


— 


\ai t son 

1V1J 1 JZU 


Homo sapiens 


ribosomai protein S24 




~6T^ — 






Pan troglodytes 


ATPase subunit 8 


~9ir 




4959 


AF130089 


Homo sapiens 


PRO2550 


156 


75 


4960 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 

KrlA la ^ArU IbOoj) 


136 


95 


4961 


S61950 


Oryctolagus 
cuniculus 


alpha 2(1) procollagen, alpha 2(1) {C- 
terminal} 


159 


93 


4962 


AF070540 


Homo sapiens 


putative nuclear protein 


513 


93 


4963 


S61950 


Oryctolagus 
cuniculus 


alpha 2(1) procollagen, alpha 2(1) {C- 
terminal} 


159 


93 


4964 


AF146191 


Homo sapiens 


FRG1 


343 


92 


4965 


Y 16624 


Homo sapiens 


Human phosphoprotein 32 (pp32) 
protein sequence. 


1 14 


100 


4966 


X67813 


Canis familiaris 


signal recognition particle,72 kDa 
subunit 


127 


100 


4967 


S79410 


Mus rnusculus 


nuclear localization signal binding 


127 


63 


4968 


U83303 


Homo sapiens 


•Iine-1 reverse transcriptase 


117 


69 


4969 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


101 


80 






Homo sapiens 


Human secreted protein, SEQ ID NO; 
7871. 






4971 


D38112 


Homo sapiens 


ATPase subunit 6 


. 1 12 


91 


4972 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


209 


97 


4973 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


162 


80 


4974 


AL050399 


Arabidopsis 
thaliana 


putative proline-rich protein 


75 


91 


4975 


LI 1668 


Bos taurus 


cyclophilin-40 


134 


100 


4976 


M55I69 


Homo sapiens 


tripeptidyl peptidase II 


385 


90 


4977 


AF090944 


Homo sapiens 


PRO0663 




88 


4978 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
466 ?- . ... .. — 


102 


62 


4979 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


70 

— 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
4802. 


~704 




4981 




R54656 

_ . 


Homo sapiens 


Prostaglandin 12 production stimulating 
protein B. 


101 

. 


100 


4982 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 




38 






Homo sapiens 




"Ill 




4984 


M15530 


Homo sapiens 


B-cell growth factor 


117 


58 


4985 


D13641 


Homo sapiens 


mitochondrial outer membrane protein 
19 


534 


94 


4986 


L11316 


Mus rnusculus 


ect2 


528 


92 


4987 


X51755 


Homo sapiens 


open reading frame (458 AA) 


108 


90 


4988 


M61877 


Homo sapiens 


erythroid alpha spectrin 


129 


87 



261 



NSDOC1D: <WO 0164835A2_I_> 



W O 01/64835 



PCT/uSOl/04927 





SEQ ID 

NO: 


Accession No. 
-— — 


Spec.es 
— 


Description; 


Smith- 
Waterman 
Score 


Identit 

y 


~49R9 — 




Saccharomyces 
cerevisiae 


carboxypeptidase s 

- 


134 


50 


~4990 — 


G00338 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4419. 


151 


78 


499 J 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


173 


61 


4992 


M34019 


Bos taurus 


beta-adrenergic receptor kinase 


172 


100 


4993 




Homo sapiens 


chromosomal protein 


135 


86 


4994 


X97675 


Homo sapiens 


plakophitm 2b 


124 


63 






Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 497. 


153 


77 






Homo sapiens 


FRO2550 


175 


48 


4997 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


95 


68 


4998 


Y86214 


Homo sapiens 


Nuclear transport protein clone hfb34I 
protein sequence. 


312 


98 


~4999 


G0368I 


Homo sapiens 
. 


Human secreted protein, SEQ ID NO: 
7762. 


100 


95 


~5000 




Homo sapiens 


chromatin-specific transcription 
elongation factor FACT 140 kDa 
subunit 


127 


96 


5001 


U70063 


Homo sapiens 


acid ceramidase 


315 


96 




AF041373 


Rattus 
norvegicus 


clathrin assembly protein short form 


163 


63 




AK001 841 


Homo sapiens 


unnamed protein product 


468 


83 


5004 




G00333 

....... 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


143 


61 




9410 


Mus musculus 


nuclear localization signal binding 
protein 


116 


47 


5006 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. , 


107 


55 


5007 


G00338 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4419. 


151 


76 


5008 


Z 15005 


Homo sapiens 


CENP-E 


281 


84 




AF130089 


Homo sapiens 


PRO2550 


122 


75 


5010 


AL049698 


Homo sapiens 


dJ470B24.1.1 (myeloid/lymphoid or 
mixed-lineage leukemia (trithorax 
(Drosophila) homolog); translocated to, 
4 (AF-6) (isoform 1)) 


77 


33 


5011 


G02382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6463. 


90 


62 




~50l2 


a t rtonon 

/\l_yUZ3 /o\j 


Schizosaccharom 
yces pombe 


DNA binding protein 


145 


36 




5013 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


93 


46 






G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


104 


43 




— 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


104 j 


55 




5016 


U15647 


Mus musculus 


expressed in select embryonal 
carcinoma cells and testicular cells; 
similar to Swiss-Prot Accession 
Number PI 1260 


119 


35 




5017 


B10545 


Homo sapiens 


protein. 




98 




5018 


AF089745 


Homo sapiens 


FK506-binding protein 


656 


98 




5019 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


634 


97 




5020 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


603 


92 




5021 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


623 


94 




5022 


AF081484 j Homo sapiens 


alpha-tubulin isoform 1 


671 


100 



262 



SDOCJD: <WO_ 



_0164635A2_I_> 



WO 01/64835 



PCT/U SO 1/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Idcntit 


5023 


AF 130089 


Homo sapiens 


PRO2550 


109 


71 


— 


AB013452 


Homo S ^P|^^ 


ATPasell 


208 


77 











1 83 


100 


~5026 




Homo sapiens 


Human secreted protein 


1 14 


64 


5027 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


74 


3028 


G00577 


_ , . 

Homo sapiens 


Human secreted protein, SEQ ID NO) 
4658. 


~\49 


~59 


5029 


S7472S 


Homo sapiens 


antiquitin = 26g turgor protein homolog 







5030 


L4 1 498 


Homo sapiens 


longation factor 1 -alpha ] 





"1F5 




5031 


AB040672 


Homo sapiens 


UDP-GalNAc: polypeptide N- 
acety Iga lactosaminy] trans feras e 


45 


70 


5032 


AJ22394S 


Homo sapiens 


RNA helicase 


574 


84 


5033 


AF021935 


norvegicus 


mytonic dystrophy kinase-related 
Cdc42-binding kinase 








AF1 50755 


Mus musculus 


microtubule-actin crosslinking factor 






5035 


AF1 50755 


Mus musculus 


microtubule-actin crosslinking factor 


602 


89 


5036 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


598 




5037 


U09823 


Oryctolagus 
cuniculus 


elongation factor 1 alpha 




96 


5038 


AE00362I 


Drosophila 
melanogaster 


CG 13384 gene product 


279 


45 
— 


5039 


(~ir\r\ -7 -to 
UUUj ly 


Homo sapiens 


Human secreted protein, SEQ ID .NO: 


~To7 




5040 


AF047469 


Homo sapiens 


arsenite translocating ATPase 


650 


94 


5041 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 






5042 


AJ007798 


Homo sapiens 


stromal antigen 3, (STAG3) 






5043 


L26953 


Homo sapiens 


chromosomal protein 


124 


67 


5044 


AB005047 


Homo sapiens 


SH3 binding protein 






5045 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


78 


5046 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7519 - : 


164 


83 
— 


5047 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


~121 




5048 


G 03 793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


15j 


64 


5049 


Y01I58 


Homo sapiens 


Secreted protein encoded by gene 1 8 
cione m^Ai_JB i. 


120 


59 




.... 


, : 

Homo sapiens 


nuclear pore complex protein hnupl 53 


TT3 


~88 


5051 


J03176 


Bradyrhizobium 
japonicum 


cytochrome b/cl precursor 


97 


35 


5052 


X85995 


Homo sapiens 


immunoglobulin light chain 


200 


100 


5053 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 






5054 


B0 1372 


Homo sapiens 


Neuron-associated protein. 


_L2£ 




5055 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


209 


~T7 


5056 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
745 ?:. . ..... .. ..... 


110 


51 

-— 


5057 


G03790 


Homo sapiens _ 


Human secreted protein, SEQ ID NO: 
7871. 


9s 




5058 


X92474 


Homo sapiens 


ch-TOG 


147 


100 


5059 


L27428 


Homo sapiens 


reverse transcriptase 


264 


75 


5060 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


119 


44 


5061 


L25616 


Homo sapiens 


CGI protein- 


119 


77 


5062 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


62 



263 



.NSDOCID: <WO 0164835A2_L> 



WO 01/64835 



PCT/USOi/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description , ? ' 


Smith- 
Waterman 
__S£ore 


% 

tdentit 

y- 








7871. 






5063 


Y 02671 


Hoitio sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


.164 


59 


5064 


AFI 18082 


- — : 

Homo sapiens 


PRO 1 902 


.175 


53 


5065 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


84 


51 


5066 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHA 1 a (AF078683) 


108 


100 


5067 


AF155232 


Pisum sativum 


extensin 


137 


45 


5068 


X63005 


Mus niusculus 


proline-rich protein 


98 


35 


5069 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


307 


56 


5070 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


173 


89 


5071 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7S82. 


153 


68 


5072 



Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


164 


73 




V A 1 A A A 
i U14UU 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


109 


73 


5074 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


121 


80 


5075 


G01984 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


113 


72 


5076 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


151 


88 


5077 


AF090944 


Homo sapiens 


PRO0663 


138 


78 




Y0267 1 


Homo sapiens 


Human secreted protein encoded by 
gene22cloneHMSJW18. 


150 


74 






Homo sapiens 


plakophihn 2b 


179 


93 


5080 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


123 


76 


5081 


AF240630 


Mus musculus 


IQ motif containing GTPase activating 
protein 1 


176 


59 


5082 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


100 


54 


5083 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


99 


59 


5084 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


101 


57 


5085 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


118 


77 


5086 


AB047600 


Macaca 
fascicularis 


hypothetical protein 


93 


69 


5087 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


88 


80- 


5088 


AF000298 
— — 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


112 


57 


~5089 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


87 


61 




G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


74 


61 


5091 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


83 


7S 


5092 


AF065484 


Homo sapiens 


sorting nexin 1A 


172 


81 


5093 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


139 


90 


5094 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO- 
6613. 


164 


78 


5095 


U02313 


Mus musculus 


protein kinase . 


147 


84 



264 



SDCCID: <WO C 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


Idenrit 


5096 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


129 


67 


5097 


U64849 


Caenorhabditis 
clegans 


Contains similarity to Pfam domain: 
PF00646 (F-box), Score=28.7, E- 
value=4.3e-05, N=I 


197 


39 


5098 


AF006010 


Homo sapiens 


progestin induced protein 






5099 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


119 


55 


5100 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


132 


82 


5101 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


77 


5102 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


99 


70 


5103 


J02459 


bacteriophage 
lambda 


J (taikhost specificity; 1 132) 


485 


96 


5104 


J02459 


bacteriophage 
lambda 


H (tail component;853) 


823 


98 


5105 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


106 


42 


5106 


AF130089 


Homo sapiens 


PRO2550 


121 


35 


5107 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


135 


81 


5108 


P92219 


Homo sapiens 
(human) 


CRI protein. 


123 


88 


5109 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


83 


78 


5110 


U41751 


Mus musculus 


EI24 


560 


83 


5111 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


69 


51 12 


AF130089 


Homo sapiens 


PRO2550 


164 


88 


5113 


G03084 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7165. 


160 


72 


5114 


AF052831 


Trypanosoma 
cruzi 


unknown 


107 


70 


5115 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna; 
complete virion genome. 


128 


68 


5116 


X55684 


Lycopersicon 
esculentum 


extensin (class I) 


60 


29 


5117 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


124 


64 


5118 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


115 


60 


51 19 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 






5120 


X65718 


Prunus dulcis 


extensin 






5121 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


132 


100 


5122 


AJ007714 


Homo sapiens 


lysine-ketoglutarate reductase 
/saccharopine dehydrogenase 


154 


96 


5123 


AJ007714 


Homo sapiens 


lysine-ketoglutarate reductase 
/saccharopine dehydrogenase 


158 


80 


5124 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 






5125 


AF130089 


Homo sapiens 


PRO2550 


112 


46 


5126 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


129 


60 


5127 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


99 


68 



265 



WO 01/64835 



PCT/USO 1/0492 7 



SEQID 

NO: 


Accession No. 


Species 


UeSC,iPt?0n ' ' 


Smith- 
Waterman 


Jdentit 

y 


5128 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


57 


512 g 


W88627 


— : 

rlomo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


~T40" 


— 






bacteriophage 
lambda 


H (tail component;853) 








G00328 




Human secreted protein, SEQ ID NOi 
4409. 






5132 


J04088 




HWA tnnoicnmprQCP IT rpf~- QQ 1 ^ 

LJi\s\ lopojsoijicrase ji j.yy. i.j) 






5133 


AL 1 32841 


Caenorhabditis 


Y15E3A.3 


147 


85 


5134 


D90827 


Escherichia coli 


Serine/Threonine protein phosphatase 

Vtlv^ 3 . 1 .J . lO). 


329 


46 


5135 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


62 


5136 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


161 


91 


5137 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


96 


5138 




G03043 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


67 






Homo sapiens 


Human secreted protein, SEQ ID NO: 




31 


"3140 — 


7 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




141 


46 


5141 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18 


166 


67 


5142 


AF003535 


Homo sapiens 


ORF2-like protein 


168 


50 


5143 


AF259792 




SWIlLbeta protein 






5144 


Y36203 


Homo sapiens 


Human secreted protein #75. 


130 


63 




AF130089 


Homo sapiens 


IT ISXJJ. JjU 






5146 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 

xV-TLM- 1 ci \ArU / oDoj ) 


128 


52 


5147 


GQ3172 


Homo sapiens — ~ 


Human secreted protein, SEQ ID NO; 
7253. 


T29 


— 


5148 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 


101 


67 


5149 


W48351 


HoriK) sa iens 

omo sapiens 


Human breast cancer related rotein 

BCRB2. 


— — 


-— 

56 


5150 


AF130051 


Homo sapiens 


PRO0898 


80 


72 


5151 


R07057 




Smaller hepatocellular oncoprotein 
(hhem) gene preoduct. 






5152 


X56832 


Homo salens 


muscle specific enolase 






5153 


X76930 












5154 


U12919 


Mus musculus 


adenylyl cyclase type VII 


156 


75 


5155 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


76 


5156 


G04072 


— : — 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


- 

133 


50 


5157 


Y08387 


Homo sapiens 


mu-adaptin-related protein 2 


252 


86 


5158 


R13556 


Homo sapiens 


Protein encoded downstream ofhhc M 
oncoprotein. 


126 


66 


5159 


A F0 9 0942 


Homo sapiens 


PRO0657 


139 


— 


5160 


AF090942 


Homo sapiens 


PRO0657 


120 


70 


5161 


D67066 


Bos taurus 


N-WASP 


135 


42 


5162 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


115 


73 


5163 


AF044496 


Canis familiaris 


acidic ribosomal phosphoprotein P0 


112 


78 


5164 


W48351 


Homo sapiens 


Human breast cancer related protein 


106 


58 



266 



WO 01/64835 PCT/US01/04927 





Accession No. 


Species 


Description 






NO: 








Waterman 


Identit 












y 






BCRB2. 






5)65 


AJ250095 


Bordetella 


pertactin (P.68) 


119 


64 






bronchiseptica 








5166 





fylus musculus 


otogelin; MLEMP 


609 


90 


5167 


AF1 16238 


Homo sapiens 


pseudouridine synthase 1 


237 


87 


5168 


(juus lb 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


78 








4457. 






5169 


AL050342 


Homo sapiens 


dJ655K7. 1 (novel protein) 


263 


100 


5170 


AK024455 


Homo sapiens 


FLJ00047 protein 


177 


72 


5171 


YS6248 


Homo sapiens 


Human secreted protein HCHPF68, 


94 


100 








SEQ ID NO: 163. 






5172 


L38908 


Nicotiana 


extensin 


101 


47 






tabacum 








5173 


AF055904 


Myxococcus 


unknown 


107 


36 






xanthus 








5174 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


143 


59 








gene 22 clone HMSJW18. 






5175 


Y00337 


Homo sapiens 


Human secreted protein encoded by 


123 


86 








gene 81. 






5176 


AJO 10957 


Hippopotamus 


NADH1 


212 


90 






amphibius 








5177 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


76 








7519. 






5178 


U83280 


Leishmania 


39 kDa antigen 


128 


90 






donovani 








5179 


AF025467 


Caenorhabditis 


contains similarity to drosophila DNA- 


183 


45 






elegans 


binding protein Kl 0 (NID:g8 148) 






5180 


L27428 


Homo sapiens 


reverse transcriptase 


106 


46 


5181 


AF025467 


Caenorhabditis 


contains similarity to drosophila DNA- 


144 


45 






elegans 


binding protein K10 (NID:g8148) 






5182 


R83 1 1 9 


Homo sapiens 


Human cisplatin resistance protein. 


142 


96 


5183 


R95330 


Homo sapiens 


Tumor necrosis factor receptor 1 death " 


475 


71 






- 


domain ligand (clone 1 TU). 






5184 




Homo sapiens 


PRO0478 


121 


57 


5185 


AF 1936 13 


Homo sapiens 


cell recognition molecule Caspr2 


338 


50 


5186 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


102 


36 








SEQ IDNO:163. 






5187 


. .... 
AL03 1118 


Homo sapiens 


dJ153G14.3 (novel C2H2 type Zinc 


465 


62 








Finger protein) 






51S8 


ACISTI Ad 

t\r l oj l'ft 


Mus musculus 


left-right dynein 


199 


83 


5189 


J02459 


bacteriophage 


J (tail:host specificity;! 132) 


748 


95 






lambda 








5190 


AF193613 


Homo sapiens 


cell recognition molecule Caspr2 


229 


50 


5191 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


65 








6595. 






5192 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


120 


84 








RHAla(AF078683) 






5193 


Y003 1 1 


Homo sapiens 


Human secreted protein encoded by 


112 


49 








gene 54. 






5194 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


56 








4414. 






5195 


AF 104261 


Mus musculus 


Pax transcription activation domain 


487 


72 








interacting protein PTIP 






5196 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


65 








4414. 






5197 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


80 








7124. 






5198 


X55777 


Homo sapiens 


52kD protein 


106 


30 


5199 


AF092170 


Homo sapiens 


polycystic kidney disease 2 related 


122 


91 



267 



-ISDOCID: <WO 0164835A2_L= 
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SEQID 


Accession No. 


Species 


Description ', ,. 


Smith- 


% 


NO: 








Waterman 


Identir 










Score 


y 








protein 






5200 


AF092 1 70 


Homo sapiens 


polycystic kidney disease 2 related 


138 


96 








protein 






5201 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




_ 
67 








: 






no 


AP000616 


— — : 

Oryza satjva 


similar to RING-H2 finger protein 


— — 

10 


— 








IS-TIA I a {t\r O / 505 5 ) 






5203 


R95913 


Homo sapiens 


Neural thread protein. 


— 

96 


— 

61 


5204 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


83 


55 








6588. 






5205 


AF3 1 9476 


Homo sapiens 


GKAP42 


705 


100 


5206 


G02832 


Homo sapiens 


Human secreted protein; SEQ ID NO: 


78 


62 








6913. 






5207 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 19 


75 








4446. 






5208 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


50 








7871. 






5209 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


39 








4770. 






5210 


AF09093 1 


Homo sapiens 


PRO0483 


94 


90 


521 1 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 14 


74 








4438. 






5212 . 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


70 








5327. 






5213 


U53209 


Homo sapiens 


transformer-2 alpha 


124 


95 


5214 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


58 








6566. 






5215 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


151 


49 








gene 22 clone HMSJ Wl 8. 






5216 


L26953 


Homo sapiens 


chromosomal protein 


95 


52 


5217 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


66 








6613. 






5218 


AF220264 


Homo sapiens 


MOST-1 


151 


65 


5219 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


53 








7214. 






5220 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


65 








471 8. 






5221 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


66 








7888. 






5222 


AF194537 


Homo sapiens 


NAG 13 


120 


60 


5223 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 15 


60 








4457. 






5224 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


144 


58 








encoded by gene 17. , 






5225 


AF000996 


Homo sapiens 


ubiquitous TPR motif, Y isoform 


164 


70 


5226 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


73 








4413. 






5227 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


164 


69 








7519. 






5228 


M34451 


Caenorhabditis 


collagen (rol-6) 


100 


41 






elegans 








5229 


X16396 


Homo sapiens 


precursor polypeptide (AA -29 to 315) 


306 


88 


5230 


AL3 59782 


Trypanosoma 


possible (hhv-6) u 1 1 02, variant a dna, 




69 






brucei 


complete virion genome. 






5231 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


75 








7879. 






5232 


D86982 


Homo sapiens 


similar to human ankyrin 1(S08275) 


201 


100 


5233 


AF1 16712 


Homo sapiens 


PR02738 


135 


49 


5234 


AL049758 


Homo sapiens 


dJ437M21.1 (novel protein) 


184 


70 



268 



NSDOC1D: <WO 0164835AS I ? 



WO 01/64835 



PCT/USO 1/0492 7 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 
y 


5235 


AB014527 




KIAA0627 protein 


164 


100 


5236 


AF194537 


Homo sapiens 


NAG 13 


183 


64 


5237 


R07057 


Homo sapiens 


(hhem) gene preoduct. 


170 


43 


5238 


AF135439 


Mus musculus 


formin binding protein 1 1 


230 


100 


5239 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


137 


69 


5240 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


168 


63 


5241 


AF116661 


Homo sapiens 


PRO 143 8 


152 


58 


5242 


L27428 


Homo sapiens 


reverse transcriptase 


193 


53 


5243 


AF130089 


Homo sapiens 


PRO2550 


141 


60 


5244 


AF1 18086 


Homo sapiens 


PRO 1992 


135 


71 


5245 


AJ228863 




CLIP-1 15 


195 


97 


5246 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


148 


81 


5247 


AC007887 


Arabidopsis 
thahana 


F1504.29 


76 


93 


5248 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


75 


5249 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


176 


66 


~5250 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68 
SEQ ID NO: 163. 


136 


~70 




Y86248 




Human secreted protein HCHPF68 
SEQ ID NO: 163. 


1 62 


80 


5252 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68 
SEQ ID NO: 163. 


1 12 


54 


5253 


AF090944 


Homo sapiens 


PRO0663 


177 


62 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


132 


57 


5255 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 1 00 clone HNFIU96. 


129 


74 


5256 


D38112 


Homo sapiens 


NADH dehydrogenase subunit2 


157 


91 


5257 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68 
SEQ ID NO: 163. 


160 


78 


5258 


U16296 


Homo sapiens 


T1AM1 protein 


L 133 


100 




AT 1^0371 


major 


DNA, complete virion genome 






5260 


U49973 


Homo sapiens 


ORF1* MER37' putative transposase 
similar to pogo element 


324 


78 


5261 


AK024455 


Homo sapiens 


FLJ00047 protein 


100 


50 


5262 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


217 


100 


5263 


AB047600 


Macaca 
fascicular is 


hypothetical protein 


125 


57 


5264 


U83246 


Homo sapiens 


copine I 


199 


86 


5265 


Y 8624 8 


Homo sapiens 


Human secreted protein HCHPF68 
SEQ ID NO: 163. 


150 


72 


5266 


U91329 


Homo sapiens 


kinesin-like motor protein KJF1C 


132 


93 


5267 


J00672 


Oryctolagus 


beta-myosin 


109 


100 


5268 


AL451017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


74 


43 


5269 


AF1 19851 


Homo sapiens 


PRO 1722 


96 


61 


5270 


AF090930 


Homo sapiens 


PRO0478 


113 


91 


5271 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


123 


62 


5272 


B01372 


Homo sapiens 


Neuron-associated protein. 


122 


60 



269 



MSDOCIO: <WO 0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQID 

NO: 


Accession No. 


Species 


Description • , ( , , . 


Waterman 
Score 


Identit 

y 




Y8647'' 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:387. 


116 


78 




AF09093 1 


Homo sapiens 


PRO0483 


144 


67 


5275 


AL 1 60493 


Leishniania 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


117 


68 


5276 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


111 


58 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


147 


67 


5278 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
KrlA J a yf\r \) I oOa.3 ) 


120 


67 


~5279 


"GO'' 7 14 


— — : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 




151 


69 


"iifr — 


a irn mi a a 


Homo sapiens 


neuronal thread protein AD7c-NTP 


196 


45 


5281 


G037I4 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795, ; : 


1 15 


73 


~5282 


A jggl ? 


Brassica napus 


extensm 






5283 


Z68747 


Homo sapiens 


imogen 38 


291 


75 




X77816 


norvegicus 


rX- voetal 




37 




AF130089 


Homo sapiens 


PRO2550 


181 


73 


5286 


AF130089 


Homo sapiens 


PRO2550 


147 


67 


5287 


AF090942 


Homo sapiens 


PRO0657 


90 


50 


5288 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


116 


74 


5289 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


143 


64 


5290 


D86972 


Homo sapiens 
-. 


similar to E.coli hypothetical 29.6 KD 
protein(P 1 : YIG WECOLI) 


1 1 1 


100 




W48j5 1 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


145 


57 




Y86^4S 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 




65 




vna/v; i 

I UoUO 1 


Homo sapiens 


Human c-myb protein fragment. 


213 


76 




Aruyusyj 


Homo sapiens 


PRO0 1 1 7 


134 


52 


5295 


AF017635 


Homo sapiens 


DCHT 


145 


100 




G0^558 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6639. 


136 


60 


5297 


AP0006 1 6 


Oryza sativa 


similar to RING-H2 finger protein 
RHA1 a (AF078683) 


1 18 


67 




AK024455 


Homo sapiens 


FLJ00047 protein 


67 


80 


5299 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


164 


75 


5 00 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


129 


55 






Homo sapiens 


Gamma interferon receptor. 


107 


82 


5302 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


238 


90 




Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


205 


68 


5304 


AF1 30089 


Homo sapiens 


PRO2550 


100 


77 


5305 


R13556 



Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


117 


44 






Plasmodium 
vivax 




128 


61 


5307 


G02597 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6678. 


77 


57 


5308 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


140 


42 


5309 


Y91577 


Homo sapiens 


Human secreted protein sequence 


489 


78 



270 



>ISDOCID: -;WO 0164835A2_L> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 
y 








encoded by gene 2 SEQ ID NO:250. 






5310 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


138 


65 


5311 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


129 


65 


5312 


AF1 19851 


Homo sapiens 


PRO 1722 


97 


65 


5313 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


160 


90 


5314 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


113 


81 


5315 


AF071310 


Mus musculus 


OPA-containing protein 1 


228 


53 


5316 


AF090942 


Homo sapiens 


PRO0657 


162 


70 


5317 


U83303 


Homo sapiens 


Iine-1 reverse transcriptase 


109 


83 


5318 


AF090894 


Homo sapiens 


PRO0113 


125 


63 


5319 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


129 


66 


5320 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


130 


75 . 


5321 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
g ene22cIoneHMSJW18. 


138 


81 


5322 


L27428 


Homo sapiens 


reverse transcriptase 


139 


52 


5323 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


109 


83 


5324 


AF1 18082 


Homo sapiens 


PRO 1 902 


107 


70 


5325 


M8I321 


Macaca 
fascicularis 


proline-rich protein 


114 


37 


5326 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


112 


79 


5327 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


99 


72 


5328 


AF090930 


Homo sapiens 


PRO0478 


152 


73 


5329 


L27428 


Homo sapiens 


reverse transcriptase 


267 


65 


5330 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


140 


82 


5331 


AL160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


164 


54 


5332 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


135 


52 


5333 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


124 


76 


5334 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


97 


56 


5335 


AP000060 


Aeropyrum 
pernix 


103aa long hypothetical protein 


71 


50 


5336 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


146 


75 


5337 


Z93784 


Homo sapiens 


dJ398C22.1 (novel protein, orthologof 
mouse brain protein E46) 


280 


68 


5338 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


132 


64 


5339 


AF2 10651 


Homo sapiens 


NAG 18 


160 


56 


5340 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


133 


80 


5341 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


151 


75 


5342 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


105 


50 


5343 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


103 


75 


5344 


G03021 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


65 



271 



JSDOCID: <WO 0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


, Description ^ - 


Smith- 
Waterman 
- Sc ° re 


Idcntit 

y 








~~7]Q2 ~ 






5345 


Y87103 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 142. 


92 


48 


5346 


AL121891 


Homo sapiens 


dJ 1 1 87M 1 7.2 (KIAA0552 protein) 


148 


43 


5347 


G00689 


Homo sapiens 


Human secreted proteirij SEQ ID NOi 
4770. 




33 


5348 


AF090944 




rivuvooj 


205 


72 


5349 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


57 


5350 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 
protein rha 1 a. 


101 


37 


5351 


AF265578 


Homo sapiens 


orphan neurotransmitter transporter 


121 


39 


5352 


AP000616 


Oryza sativa 


similar to RJNG-H2. finger protein 
RHAla (AF078683) 


105 


75 


5353 


G02532 


— ; 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


167 


69 


5354 




Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


97 


60 


5355 


G00590 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4671. 


1 13 


80 


5356 


R96800 




Human histiocyte-secreted factor HSF. 


123 


67 


5357 


W88609 


Homo sapiens 


Secreted protein encoded by gene 76 

virtue IIIf'IYl "7*3 
Clone rils.lA.L. /J. 


94 


79 


535S 


AC004450 


Arabidopsis 


putative extensin 


110 


37 


5359 


AP000616 


OryzTsativa 


: . 

Similar to RING-H2 finger protein 
RHAla (AF078683) 


120 


79 






Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


199 


74 






Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


174 


97 






Gallus gallus 




98 


44 


5363 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


155 


93 


5364 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


181 


70 


5365 


G02538 




Human secreted protein, SEQ ID NO: 
6619. 


159 


68 


5366 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #153. 


94 


60 


5367 


X52022 


"Horn : 

Homo sapiens 


collagen type VI, alpha 3 chain 


150 


33 


5368 


G00637 


omo sapiens 


Human secreted protein, SEQ ID NO: 


120 


77 


5369 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478." 


145 


75 


5370 


X97675 


Homo sapiens 


plakophilin 2b 


170 


85 


5371 


AF130079 




rK.U/OJi 


103 


95 


5372 


L26953 


Homo sapiens 


chromosomal protein 


117 


54 






Homo sapiens 


NAG 1 8 


137 


67 


5374 


Y306S1 


Homo sapiens 


Splice variant ZAP- IB protein of the 
human tumor suppressor gene ZAP-1. 


106 


76 


5375 


AP000616 


Oryza sativa 


RHAla (AF078683) 


109 


70 


5376 


AL109819 


Arabidopsis 
thaliana 


extensin-Iike protein 


123 


34 


5377 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


127 


79 


5378 


AF036170 


Dictyostelium 


homeobox-containing protein Wariai 


93 | 83 



272 



ISDOCID: <WO 0164835A2J_> 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Identi< 










Score rman 


-1 






discoideum 








~5379 


AF130089 


Homo salens 


PRO2550 


134 


75 


5380 


AF161384 




HSPC266 


663 


96 




AF130051 


Homo salens 


PRO0898 


147 


77 


5382 


AF090944 


Homo salens 


PRO0663 


118 


83 


5383 




Homo sa iens 

omo sapiens 


Human secreted protein, SEQ ID NO: 


129 


55 








7214. 









Y02671 


H mo sa iens 

omo sapiens 


Human secreted protein encoded by 


1 10 


68 








gene 22 clone HMSJW18. 






5385 


Y86248 


Homosa iens 

omo sapiens 


Human secreted protein HCHPF68, 


165 


53 








SEQ ID NO: 163. 






. 




__ . 


Human phospholipase 2 HPPL2. 


145 


96 





U54788 


MuTmviKulus 


Wiskott-Aldrich Syndrome Protein 


108 


51 




5358 


nn 1 cos 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO. 


104 


89 












-— 


5389 


- 

Y86248 


. 

Homo sapiens 


x " j : — 

Human secreted protein HCHPF68, 


~T?s 


0 








sen ir~i xiO' 1 AT. 


— — — 


— 


5390 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




72 








5909 " — — — : - 




— 


5391 


AL359782 


Trypanosoma 


possible (hnv-6) ul 102, variant a dna, 


"TTo 

119 


85 








complete virion genome. 








5392 




G00454 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


n?o 











„ 335 " —5 —. — ccr . 






— — — — 

5393 


G03 1 7^ 


— : 

Homo sapiens 


Human secreted protein, ofcO ID INC): 


-pq 


~78 














— — 


1 


H mosa iens 

omo sapiens 


Human secreted protein, SEQ ID NO: 


159 


90 








5633. 











— : 

omo sapiens 


Human secreted protein, SEQ ID NO: 


1 10 


65 














"5396 


Y14482 


Homo sa iens 

omo sapiens 


Fragment of* human secreted protein 


85 


59 








encoded by gene 17. 






5397 


X86681 


Homo sapiens 


HNP3 6 protein 


248 


71 


llll 


Itt?t1? 


Homo sapiens 




133 


62 






Homo sapiens 


Protein encoded downstream of" hhc Ivl 


132 


54 








oncoprotein 






— — — 


A I n»fl 1 OA 


Homo sapiens 


— 7 . . *T r — r— r 


164 


96 




5401 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 5 






5402 




Homo sapiens 


riVLJUOO j 


"W" 




— — 


5403 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


"61 

93 


58 








' 77 j r-t 






5404 





— — j 

Mus musculus 


W iscott-Alciricn Syndrome protein 


~T<r 


~^7 

















WUjZ / 5 


— : 

Homo sapiens 


T^mouTnecrosis factor related gene 


137 


80 








product CL6.5.40. 






5406 




G00403 


— — : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


~To2 


~fi9 








t^^' -5 : oTJt~> in xta 






5407 


— x — 7i 

G 00403 


— — : 

Homo sapiens 


Human secreted protein, S>fcQ ID NO: 


~UY7 


~fi9 








^_ 484 - ^ —. — ocr . Tr . .„ — 






ns 

5408 


— — 

G00328 


— : 

Homo sapiens 


Human secreted protein, S.C.Q ID NU: 


"T42 


~69 












. 


——5 





— : 

omo sapiens 


Human secreted protein, SEQ ID NO: 


139 


63 














"54l0 


yi2950 


Homo sapiens 


Amino acid sequence of a human 


65 


47 








secreted peptide. 






5411 


API 8 3 626 


Drosophila 


BcDNA.GH03694 


171 


53 






melanogaster 








5412 


U93565 


Homo sapiens 


putative pi 50 


195 


70 


5413 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


170 


64 








7253. 
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SEQ ID 
NO: 


1 Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 


5414 


Y86248 




Human secreted protein HCHPF68, 
SEQIDNO:163. 




61 


5415 


X83572 


Homo sapiens 


- ARSP 




1 00 


5416 


D86853 


roseii^ an " S 


extensin 


1 13 


41 


5417 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68 
SEQ ID NO: 163. 


~~V9 


-— 


5418 


G03472 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7553. 


175 


77 


5419 


R32010 


Homo sapiens 


Rpl5-TIA-1. 


~T4l) 




5420 


D67066 


Bos taurus 


N-WASP 


153 


41 


5421 


U42391 




myosin-IX.b 


203 


1 00 


5422 


Y86248 


Homo sapiens 


Hurnsn secreted protein HCHPP68, 
SEQ ID NO: 163. 






5423 


AF220264 


Homo sapiens 


MOST-1 


94 


78 


5424 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


128 


63 


5425 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


131 


57 




AF130089 


Homo sapiens 


PRO2550 


120 


58 


5427 


Z70204 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF00628 (PHD-finger), Score=63.1, E- 
value=1.9e-15, N=l~cDNA EST 
EMBL:T00556 comes from this 
gene—cDNA EST yk344f7.5 comes 
from this gene 


131 


55 


5428 


G03 1 72 




Human secreted protein, SEQ ID NO: 
7253. 


158 


5 1 


g^3Q ■ 


AFl TftrtSO 
r\r I JUU07 


Homo sapiens 


PR.0^550 


112 


73 






Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


161 


52 




G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


131 


72 


5432 


G03797 




Human secreted protein, SEQ ID NO: 
7878. 


152 


79 


5433 


AF090942 




rKUUOj / 


109 


67 


5434 


AE003588 


Drosophila 
melanogaster 


CG2839 gene product 


141 


50 


5435 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


79 


5436 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


41 


5437 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7 88 _ 8 - . 


143 


63 


5438 


qq2\12 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


1 17 


84 


5439 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 12 


76 


5440 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 

P U A la / A TmTQ/^.Q1\ 

rs_ri/\. i a ^r\r u / ouoj ) 


110 


42 


5441 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


124 


64 




AT 1 &(\AQ1 


Leishmania 
major 


probable (hhv-6) ul 1 02, variant a 
DNA, complete virion genome 


193 


56 


5443 


X07495 


Homo sapiens 






41 


5444 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


135 


62 


5445 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. . 


130 


41 


5446 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


136 


71 



274 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 














5447 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 


123 


62 








clone HNHF029. 






5448 


W50193 


Homo sapiens 

m p 


Amino acid sequence of salivary 


101 


45 








protein CON-2. 






"5449 




unidentified 


non-functional lacZ alpha peptide 


106 


57 






cloning vector 








5450 


X01469 


Plasmodium 


histidine-rich protein 


162 


56 






lophurae 








5451 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


88 








6561. 






5452 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


155 


68 








encoded by gene 17. 






5453 


AF1 19900 


Homo sapiens 


PR02822 


162 


63 


5454 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 


120 


51 








clone HNHF029. 






5455 


AC007204 


Homo sapiens 


BC273239 1 


1485 


69 


5456 


L27428 


Homo sapiens 


reverse transcriptase 


122 


41 


5457 


Ml 8094 




hydroxyproline-rich glycoprotein 


141 


39 






vulgaris 








5458 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


118 


68 








RHAla (AF078683) 






5459 


AF090930 


Homo sapiens 


PRO0478 


144 


64 


5460 


AF130089 


Homo sapiens 


PRO2550 


124 


60 


5461 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


69 








4458. 






5462 


X97675 


Homo sapiens 


plakophilin 2b 


103 


72 


5463 


G03 1 72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


69 








7253. 






5464 


AF229067 


Homo sapiens 


PADI-H protein 


146 


65 


5465 


AF1 16715 


Homo sapiens 


PR02829 


83 


71 


5466 


AF090930 


Homo sapiens 


PRO0478 


146 


52 


5467 


U93574 


Homo sapiens 


putative p 1 50 


203 


52 


5468 


W48351 


Homo sapiens 


Human breast cancer related protein 


89 


50 








BCRB2. 






5469 


AF191838 


Homo sapiens 


TANK binding kinase TBK1 


121 


95 


5470 


AF130079 


Homo sapiens 


PR02852 


146 


69 


5471 


AF090944 


Homo sapiens 


PRO0663 


90 


66 


5472 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


106 


63 








7882. 






5473 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


127 


60 






SEQ ID NO: 163. 






5474 


AF064553 


Mus musculus 


NSD1 protein 


269 


100 


5475 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


65 






4463. 






5476 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


86 








7253. 






5477 


Ul 1271 


Homo sapiens 


thromboxane A2 receptor 


118 


67 


5478 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


60 








6613. 






5479 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


75 








7253. 






5480 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


1 19 


74 


5481 


AP000616 


Oryza sativa 


similar to RING-H2 Finger protein 


103 


85 








RHAla (AF078683) 






5482 


AF238235 


Entamoeba 


diaphanous protein 


108 


35 






histolytica 








5483 


AF090930 


Homo sapiens 


PRO0478 


120 


55 


5484 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 




84 








SEQ ID NO: 163. 
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SEQ ID 
NO: 


Accession No. 




Description > " "• ; 


Smith- 
Waterman 


Itlentit 

y 




M61 883 


Sus scrofa 


apomucin 


252 


55 


5486 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


107 


77 


5487 


G'03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


110 


46 


5488 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


88 


62 


5489 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein KIO (NID:g8l 48) 


124 


47 


5490 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:l63. 


220 


71 


5491 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:l63. 


195 


65 


5492 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


116 


38 


5493 


AF1 18086 


Homo sapiens 


PRO 1 992 


135 


82 


5494 


AF090930 


Homo sapiens 


PRO0478 


105 


74 


5495 


Y944S I 


Homo sapiens 


Human inflammation associated 
protein #8. 


170 


61 


5496 


G03465 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7546. 


72 


65 


5497 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


152 


78 


5498 


X56123 


Mus musculus 


talin 


594 


88 


5499 


AL035652 


Homo sapiens 


dJU6.l (topoisomerase (DNA) I) 


105 


86 


5500 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


160 


81 


5501 


Y45389 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 31. 


161 


93 


5502 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


119 


60 


5503 


X03145 


Homo sapiens 


pot. ORF I 


215 


79 


5504 


AF240630 


Mus musculus 


IQ motif containing GTPase activating 
protein 1 


180 


61 


5505 


AL390212 


Homo sapiens 


dJ29M10B.1.2 (protein kinase C 
binding protein 1 (isoform 2)) 


1282 


93 


5506 


AF1 18086 


Homo sapiens 


PRO 1992 


146 


61 


5507 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


116 


74 


5508 


U42471 


Mus musculus 


Wiscott-AIdrich Syndrome protein 
homolog 


165 


50 




D381 12 


Homo sapiens 


NADH dehydrogenase subunit 5 


211 


95 


5510 


Y86248 



Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


114 


69 


~55T1 


AF090944 


Homo sapiens 


PRO0663 


125 


58 


5512 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO:- 
7492. 


125 


53 


5513 


Y36156 


Homo sapiens 


Human secreted protein #28. 


110 


68 


5514 


AF 11 8086 


Homo sapiens 


PRO 1992 


128 


76 


5515 


Z22866 


Mus musculus 


skelemin 


218 


95 


5516 


Z22866 


Mus musculus 


skelemin 


2IS 


95 


5517 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


127 


71 


5518 


AL133380 


Homo sapiens 


dJ862P8.3 (Similar to MAP3K 10 
(mitogen-activated protein kinase 
kinase kinase 10)) 


603 


77 


5519 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


187 


78 


5520 


AF090901 | Homo sapiens | PRO0195 


115 


60 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 


5521 


Y86^48 


Homo sapiens 


Human secreted protein HCHPF68, 

CCA m ~M/"V 1 £1 


180 


53 


— — 


W88627 


Homo sapiens 


clone HPMBQ32. 


.155 


— 


5523 


W90847 


Homo sapiens 


Human lymphocyte targeted peptide 
#15. 


149 


100 





X97675 


Homo sapiens 




~TP 


~70 


5525 


D37793 


Mus musculus 


synaptotagminIl/IP4BP 


651 


84 




M32319 


Homo sapiens 


HLA protein allele B51 


754 


94 


5527 


AF] 30051 


Homo sapiens 


PRO0898 


93 


64 


5528 


X92485 


Plasmodium 
vivax 


pval 


88 


66 


5529 


AJ13 1 890 


Homo sapiens 


DNA polymerase lambda 


205 


100 


5530 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


132 


68 


5531 


Y02926 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 101 . 


87 


59 


5532 


AF09093 1 


Homo sapiens 


PRO0483 


133 


75 


5533 


AF090942 


Homo sapiens 


PRO0657 


122 


66 


5534 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


145 


56 


5535 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 5 


569 


91 


5536 


K02401 


Homo sapiens 


chorionic somatomammotropin 


375 


98 


5537 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


136 


68 


5538 


G02480 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
6561. 


130 


44 


5539 


S62928 


Homo sapiens 


PRB IM protein precursor 


104 


40 


5540 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


127 


70 


5541 


AE000463 


Escherichia coli 
K12 


putative glycosidase 


254 


100 


5542 


J02459 


bacteriophage 
lambda 


H (tail component; 853) 


761 


100 


5543 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


1 19 


58 


5544 


U59185 


Homo sapiens 


putative monocarboxylate transporter 


233 


87 


5545 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


187 


65 


5546 


D87446 


Homo sapiens 


Similar to a C.elegans protein encoded 
in cosmid C27F2 (U4041 9) 


556 


84 


5547 


X92485 


Plasmodium 


pval 


1 10 


67 


5548 


AB023233 


Homo sapiens 


K1AA101 6 protein 


246 


97 


5549 


U 163 59 


norvegicus 


nitric oxide synthase 


95 


64 


5550 


AC002044 


Homo sapiens 


Alpha-fetoprotein enhancer binding 
protein (3' partial) 


1 1 1 


38 


5551 


AF026802 


Homo sapiens 


alpha-3 type DC collagen 


1 17 


47 


5552 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


85 


88 


5553 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


91 


88 


5554 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


104 


71 


5555 


AF 130089 


Homo sapiens 


PRO2550 


172 


86 


5556 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


105 


72 


5557 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


160 


61 
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SEQ ID 
NO: 


Accession No. 




Description f & ' 


Waterman 


Identit 

y 


5558 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


131 


68 


5559 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


1 12 


67 


5560 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


159 


73 


5561 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


141 


67 


5562 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


150 


71 


5563 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


146 


SO 


5564 


AF090942 


Homo sapiens 


PRO0657 


157 


73 


5565 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


169 


43 


5566 


AL035526 


Arabidopsis 
thaliana 


extensin-Iike protein 


103 


40 


5567 


AF130089 


Homo sapiens 


PRO2550 


132 


67 


5568 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


129 


83 


5569 


X67337 


Homo sapiens 


Human pre-mRNA cleavage factor I 68 
kDa subunit 


153 


42 


5570 


AF068266 


Homo sapiens 


EHT protein 


169 


91 


5571 


W48927 


Homo sapiens 


Schwannomin-binding protein C- 
terminal fragment. 


113 


42 


5572 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


112 


68 


5573 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


122 


71 


5574 


L20450 


Mus musculus 


DNA-binding protein 


283 


61 


5575 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


114 


95 


5576 


YOI400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


140 


71 


5577 


Y20648 


Homo sapiens 


Human neurofilament-L wild type 
protein fragment 38. 


66 


37 


5578 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
hornolog 


112 


41 


5579 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


142 


74 


5580 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


142 


77 


5581 


AF1 53606 


Homo sapiens 


angiopoietin-related protein 


737 


95 


5582 


AF090930 


Homo sapiens 


PRO0478 


149 


77 


5583 


X52493 


Glycine max 


DNA-directed RNA polymerase 


140 


46 


5584 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


101 


51 


5585 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


66 


47 


5586 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


130 


72 


5587 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


148 


56 


5588 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


148 


66 


5589 


AC006014 


Homo sapiens 


similar to KIAA0618 and nuclear 
envelope protein POM 121; 
alternatively spliced product; similar to 
PID:g3327050 and P52591 
(PID:gl709213) 


108 


84 


5590 | R13556 


Homo sapiens 


Protein encoded downstream of hhc M 


122 | 84 
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1 SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 

y 








oncoprotein. 






5591 


f\ L>UU 1 OOH 


LtiJoreiia 
vulgaris 


ORF74 


61 


47 


"5599 


AF090930 


Homo sapiens 




"T23 


~~70 


5593 


G02996 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
7077. 


120 


57 


5594 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


124 


59 


5595 


L26953 


Homo sapiens 


chromosomal protein 


116 


62 







Homo sapiens 


chromosomal protein 




JlL 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


"7T5 




5598 


Y71213 


Ho osa iens 

omo sapiens 


polypeptide IMX39. 


"TIT 


~44 


5599 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


125 


45 




AF130089 




PRO2550 






5601 


AY014404 


Homo sapiens 


kinesin-like protein RBKIN2 


297 


98 






Homo sapiens 










5603 




Homo sapiens 


rl\UU46j 








X01060 


Homo sapiens 


put. transferrin receptor (aa 1-760) 






5605 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


455 


75 


"5606 


ivitw 


— — 

norvegicus 


salivary proline-rich protein 




. ... , . 






Nicotiana alata 








5608 


AF090944 


Homo sapiens 


PRO0663 


147 


64 






Homo sapiens 


Human sscrcted protein, SEQ ID NO; 
4633. 








X0 1 065 


Cairina moschata 


histone H5 






5611 


U03413 


Dictyostelium 
discoideum 


calcium binding protein 


104 


40 


5612 
— — 


AL 160493 




Leishmania 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


144 
-— 


37 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

—j : — 




— 




56 


G0121S 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
5299. 


196 


"07 

97 


5615 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
^. 413 ' , . ,. . — — 


122 


68 


"5616 




— =j : 

rlomo sapiens 


Npwj o-bmding protein Npwbr 


~H? 


~~7R 


5617 




Homo salens 


pot. ORF III 


"Toi 


"80 





AF090930 




PRO0478 


~Ts9 


~7S 




D88461 


Rattus rattus" 8 


N-WASP 


~7o7 


~40 


5620 


W48927 


Homo sapiens 


Schwannomin-binding protein C- 


104 


38 


"5621 


y 36156 


H mosa iens 


Human secrete'd'protein #28 


"T34 


~5T) 


5622 


AF090894 


Homo salens 


PRO0 1 1 3 


"T19 


65 


5623 


AF130089 


Homo sapiens 


PRO2550 


130 


69 


"H§3 — 


ftr I jui/oy 


Homo sapiens 


rtSXJ^ DDK} 


~7o^ 


~4~f 




G03789 


Homo sapiens 


HurricLn secreted protein, SEQ ID NO; 
7870. 






5626 


A£ 1300S9 




PRO2550 


152 


80 


5627 


AF151837 


Homo sapiens 


CGI-79 protein 


127 


78 . 


5628 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


122 


84 


5629 


AJ237981 


Vitis vinifera 


putative proline-rich cell wall protein 


93 


50 


5630 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
6582. 


110 


52 



279 
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SEQ1D 
NO: 




Species 


Description '»„. 


Smith- 
Walerman 


Id en tit 

y 






Homo sapiens 


PRO2550 


151 


62 


5632 


U47661 


Lupinus luteus 


proline-rich protein PRP2 precursor 


67 


28 


^| 


I USUO J 


Homo sapiens 


Human c-myb protein fragment. 


93 


61 




AL133215 


Homo sapiens 


bA10SL7.6 (semaphorin 4G (sema 
domain, immunoglobulin domain (Ig), 
transmembrane domain (TM) and short 
cytoplasmic domain)) 


323 


100 


5635 


G0j438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


67 


5636 


AF090942 


Homo sapiens 


PRO0657 


184 


67 




G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


95 


80 


5638 


G0:>043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


139 


83 




AF1 19900 


Homo sapiens 


PR02822 


143 


62 


5640 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


151 


61 


5641 


Y36243 


Homo sapiens 


Human secreted protein encoded by 
gene 20. 


118 


72 




AF090895 


Homo sapiens 


PRO0117 


140 


56 


5643 


Y51611 


Homo sapiens 


Human HSGT1 protein. 


611 


96 


5644 


AF 162692 


Homo sapiens 


putative voltage-gated calcium channel 
gamma-4 subunit 


172 


73 


5645 


Z98883 


Homo sapiens 


c407AJ0.1 (GPI1 (N- 
acetylglucosaminyl transferase 
component)) 


229 


100 


5646 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


117 


80 


5647 


G03 172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


78 


5648 


AF217517 


Homo sapiens 


uncharacterized bone marrow protein 
o :V1 l 


97 


49 


5649 


AF2 10651 


Homo sapiens 


NAG 18 


149 


58 






Homo sapiens 


unnamed protein product 




36 


5651 


U02313 


Mus musculus 


protein kinase 


143 


96 




AF090944 


Homo sapiens 


PRO0663 


95 


43 


5653 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


131 


61 


5654 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


100 


58 


5655 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561 . 


157 


70 


5656 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


120 


100 


5657 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


123 


47 


5658 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


159 


46 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


163 


60 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


105 


46 




Y27868 


.Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


125 


71 


5662 


D87744 


Mus musculus 


DRPLA protein 


89 


40 


5663 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


151 


65 


5664 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


87 


65 



280 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 


5665 


AF055985 


Onchocerca 
volvulus 


pyrrolidone-rich antigen 


71 


44 


5666 


AI8812 


Brassica napus 


extensin 


98 


41 


5667 


AF130089 


Homo sapiens 


PRO2550 


12S 


74 


5668 


AF090930 


Homo sapiens 


PRO0478 


136 


54 


5669 


X52634 


Mus musculus 


tlm protein 


169 


59 


5670 


AB001684 


Chlorella 
vulgaris 


ORF49b 


70 


62 


5671 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


153 


66 


5672 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


123 


78 


5673 


AF090942 


Homo sapiens 


PRO0657 


150 


60 


5674 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


123 


78 


5675 


M22332 


Homo sapiens 


unknown protein 


121 




5676 


W40353 


Homo sapiens 


Human unspecified protein from 
US5702907. 


128 


72 


5677 


AL049868 


Homo sapiens 


dJ927M24.2 (KIAA1219) 


403 


95 


5678 


U31785 


Human 

papillomavirus 
type 36 


putative 


103 


44 


5679 


G02409 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
6490. 


101 


73 


5680 


AF068229 


Homo sapiens 


lysyl hydroxylase 3 


148 


92 


5681 


Y00358 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


209 


81 


5682 


AB008515 


Homo sapiens 


RanBPM 


142 


100 


5683 


ABO 10692 


Arabidopsis 
thaliana 


contains similarity to endo-beta-N- 
acety tglucosaminidase — gene id:ICl 812 
3.27 ~ 


148 


45 


5684 


J02459 


bacteriophage 
lambda 


J (tai!:host specificity 1 132) 


746 


100 


5685 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


194 


69 


5686 


AB049054 


Homo sapiens 


brain link protein-1 


160 


100 


5687 


AF229067 


Homo sapiens 


PADI-H protein 


144 


68 


5688 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


120 


70 


5689 


AP00I313 


Arabidopsis 
thaliana 


contains similarity to 
pherophorin—gene id:T5M7.14 


106 


47 


5690 


M76976 


Drosophila 
melanogaster 


dorsal-ventral patterning protein 


167 


37 


5691 


L24559 


Homo sapiens 


DNA polymerase alpha 


175 


3 00 


5692 


AF071777 


Mus musculus 


IRE1 


604 


85 


5693 


AF061741 


Homo sapiens 


retinal short-chain 
dehydrogenase/reductase retSDRl 


704 


99 


5694 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


178 


66 


5695 


AF092092 


Homo sapiens 


AP-3 adaptor complex niu3 A subunit 


653 




5696 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO" 
5066. 


106 


-~ 


5697 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


132 


75 


5698 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


116 


40 


5699 


AF161361 


Homo sapiens 


HSPC098 


128 


56 


5700 


U21123 


Drosophila 
melanogaster 


ena polypeptide 


106 


57 



281 



WO 01/64835 



PCT/US01/04927 



SEQ JD 
NO: 


Accession No. 


Species 


Description '•; ;i * 


Smith- 
Watermnn 


Identit 

y 


5701 


AF1 ] 9901 


Homo sapiens 


PR0283 1 


113 


76 


5702 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


101 


49 


5703 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


105 


78 


5704 


AF130089 


Homo sapiens 


PRO2550 


122 


75 


5705 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


136 


72 




G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


1 13 


75 


5707 


AF229067 


Homo sapiens 


PADI-H protein 


134 


81 


5708 


W40353 


Homo sapiens 


Human unspecified protein from 
US5702907. 


129 


65 


5709 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


137 


81 


5710 


AK024455 


Homo sapiens 


FLJ00047 protein 


108 


54 


5711 


Y76381 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 75. 


247 


93 


5712 


AF116715 


Homo sapiens 


PR02829 


118 


75 


5713 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


154 


71 


5714 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


141 


79 


5715 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


91 


80 


5716 


AF090944. 


Homo sapiens 


PRO0663 


136 


56 


5717 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


153 


65 


5718 


G03807 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
7888. 


109 


70 


5719 


AF130089 


Homo sapiens 


PRO2550 


140 


86 


5720 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


144 


67 


5721 


AF1300S9 


Homo sapiens 


PRO2550 


146 


58 


5722 


AF1 18086 


Homo sapiens 


PRO 1992 


130 


58 


5723 


D21230 


Homo sapiens 


alpha 1 (XV) collagen chain 


390 


78 


5724 


AL049569 


Homo sapiens 


dJ37C10.5 (KIAA0445) 


266 


93 


5725 


X04412 


Homo sapiens 


plasma gelsolin 


761 


95 


5726 


A21853 


synthetic 
construct 


serine protease inhibitor 


206 


100 


5727 


Y16610 


Homo sapiens 


paraplegin 


218 


95 


5728 


AF132021 


Homo sapiens 


myosin X 


613 


87 


5729 


AL1 63491 


Arabidopsis 
thaliana 


putative protein 


236 


46 


5730 


LI 9704 


Homo sapiens 


alternative first exon 


561 


86 


5731 


D83703 


Homo sapiens 


peroxisome assembly factor-2 


240 


84 


5732 


AB007936 


Homo sapiens 


KIAA0467 protein 


189 


77 


5733 


L38622 


Mus musculus 


mSin3B gene product 


280 


94 


5734 


AF051944 


Gallus gallus 


Xin 


460 


68 


5735 


B07857 


Homo sapiens 


Amino acid sequence of Smadl 
interactor protein clone S 1+12-5. 


172 


67 


5736 


Z 14020 


Nicotiana 
tabacum 


Pistil extensin like protein, partial CDS 
only 


105 


44 


5737 


LI 7308 


Gossypium 
hirsutum 


proline-rich cell wall protein 


145 


34 


5738 


Y76141 


Homo sapiens 


Human secreted protein encoded by 
gene 18. 


109 


72 


5739 


AF178534 


Homo sapiens 


talin 


583 


82 


5740 


D13435 


Homo sapiens 


PIG-F 


143 


100 


5741 


AC005578 


Homo sapiens 


F20887 1 , partial CDS 


584 


86 



282 



NSDOCID: <WO 016483SA?J_= 



WO 01/64835 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Idcntit 

y 


5742 


AC005578 


Homo sapiens 


F20887 1, partial CDS 


470 


-|| 


5743 


AJ0 10479 


Homo sapiens 








5744 


Ml 9419 


Mus musculus 


proline-rich salivary protein 


107 





5745 


U29377 


Caenorhabditis 
elegans 


lysine-rich 


97 


~2j= 


5746 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


90 


67 


5747 


D84223 


Homo sapiens 


leucyl tRNA synthetase 


165 


83 


5748 


J04990 


Homo sapiens 


cathepsin G 


165 


100 


5749 


Y36203 


Homo sapiens 


Human secreted protein #75. 


122 


77 


5750 


V00488 


Homo sapiens 


alpha globin 


181 


94 


5751 


M 1 8642 


Homo sapiens 


lactoferrin 


122 




5752 


W15286 


Homo sapiens 


Soluble type I insulin-like growth 
fsctor receptor 


481 


87 


5753 


Z98595 


Schizosaccharom 


coronin-like protein 


192 


38 


5754 


AL035683 


Homo sapiens 


dJ1063B2.1 (Beta- 1,4- 
galactosyltransferase) 


239 


100 


5755 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


170 


61 


~5756 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588. 


To! 


_ 






Homo sapiens 


protein disulfide isomerase-related 
protein (PDIR) 






5758 


W fioOZ / 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


175 


72 


5759 


X61451 


Mus musculus 


F41 


130 


"Too — 


5760 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


91 


61 


5761 


AL 160493 


Leishmania 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


130 


76 


5762 


AF 13 0079 


Homo sapiens 


PR02852 


126 


61 


5763 


AF143946 






288 


40 


5764 


M80627 


Homo sapiens 


helix-loop-helix protein 


127 


88 




AF038007 


Homo salens 


FIC1 







~~5766 


G01881 


omo sapiens 


Human secreted protein SEQ ID NO' 
5962. 




~ 


5767 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


172 


75 


"5768 


B12884 


Homo sapiens 


Human alpha bradeion protein 

sequence. 


~465 


~70 


5769 


Y66645 


Homo sapiens 


Membrane-bound protein PRO 13 10. 






5770 


AB033168 


Mus musculus 


nuclear protein ZAP 


115 


39 


5771 


X83575 


griseus 


CHOI antigen 






5772 


AF 13 5422 


Rattus SaP ' enS 


k DP o^e^n beta yr ° PhOSPh0rylaSe A 








5773 


L38644 


norvegicus 


op rm 6 3 






5774 


AB020746 


Arabidopsis 
Drosc^liila 


protein kinase-like protein 


107 


33 


5775 


Y 17920 


melanogaster 


CALO protein 


~J0R 


— 


5776 


AK000309 


Homo sapiens 


unnamed protein product 


988 


98 


5777 


AFO 10404 


Homo sapiens 


ALR 


469 


97 


5778 


AB045131 


Homo sapiens 


NADPH-dependent retinol 
dehydrogenase/reductase 


126 


92 


5779 


AF1 97927 


Homo sapiens 


AF5q31 protein 


530 


95 


5780 


X79198 


Homo sapiens 


host cell factor 


139 


74 


5781 


X69838 


Homo sapiens 


G9a 


157 


41 



283 
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SEQ ID 

NO: 


Accession No. 


Species 
- — . 


Description 


Smith- 

VVatcrman 

Score 


% 

Identic 

y 


~57JP 


Y94450 


Homo sapiens 


Human inflammation associated . 
protein #7. 


227 


44 


578j 




Rattus 
norvegicus 


nuclear matrix transcription factor 


1539 . 


98 


~5784 


R.06463 


Homo sapiens 


Derived protein of clone ICA13 
(ATCC 40553). 


638 


91 






Mus inusculus 


ATP-binding cassette transporter 


292 


41 


5786 


D89974 


Homo sapiens 


glycosylphosphatidyl inositol-anchored 
protein GPI-80 


490 


78 


5787 


AF090942 


Homo sapiens 


PRO0657 


132 


63 






Homo sapiens 


23 kD highly basic protein 


286 


100 


5789 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


114 


6 ? 


~5790 


2g^2f] 


Schizosaccharom 
yces pombe 


kinesin-like protein 


346 


60 






Homo sapiens 


disintegrin and metalloproteinase 
domain 19 


144 


95 


5792 


Z19550 


Homo sapiens 


N-acetyllactosaruinide beta-l,6-N- 
acetylglucosaminyltransferase 


547 


76 


■;■ 




a ithoao'} c\ 
A Jr VyVy o U 


— ; 

Homo sapiens 


PRO0478 


103 


80 


• 


Y59879 


Homo sapiens 


Human normal uterus tissue derived 
protein 42. 


407 


97 


5795 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


149 


72 


5796 


AF077208 


Homo sapiens 


HSPC022 


682 


100 


|Z^2 — 


D4213S 


Homo sapiens 


PIG-B 


597 


86 


— 


AF0 101 44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


151 


75 






Homo sapiens 


integrin alpha subunit precursor 


160 


80 


~5855 




Homo sapiens 


Similar to a C.elegans protein in 
cosmidC14HI0 


354 


85 




W54'?82 


Homo sapiens 


Protein sequence of the di-alpha 
haemoglobin gene contained in pSSl . 


195 


97 


5802 


AB030176 


Homo sapiens 


peptidylarginine deiminase type II 


107 


100 


5803 


D38441 


Homo sapiens 


acyiamino acid-releasing enzyme 


156 


71 




Y55 93 9 


Homo sapiens 


Human GEK2 protein. 


157 


96 


5805 
— — — 


X89626 


Bos taurus 


240K protein of rod photoreceptor cng- 
channel 


92 


42 


5 




Homo sapiens 


similar to Kelch proteins; similar to 
BAA77027 (PID:g4650844) 


177 


38 


"5807 


Y86275 


Homo sapiens 


Human secreted protein HSRGW16, 
SEQ ID NO: 190. 


436 


77 


5808 


D50577 


Mesocricerus 
auratus 


carboxy I esterase precursor 


328 


55 


5809 


AF272148 


Homo sapiens 


KJRAB zinc finger protein 


593 


68 


-4§TT 


W57260 


Homo sapiens 


Human semaphorin Y. 


651 


96 






Mus musculus 


Kif21b ; 


491 


82 


R 




Homo sapiens 


similar to Kelch proteins; similar to 
BAA77027 (PID:g4650844) 


219 


38 


5813 


AB0171 14 


Homo sapiens 






57 


5814 


AC007954 


Homo sapiens 


unknown 


678 


100 


-||y^ 


A "PI OAT)/; 
i\r JZ*f /ZD 


Homo sapiens 


acinusL 


790 


93 






Homo sapiens 


HRIHFB2007 


167 


36 


5817 


AB020684 


Homo sapiens 


KIAA0877 protein 






5818 


Z 15025 


Homo sapiens 


Bat2 


172 


49 


5819 


X95761 


Mus musculus 


new:Rhobin 


549 


79 


5820 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


166 


68 


5821 


X55126 


Mus musculus 


Zfp-29 


633 


93 


5822 


Y99366 


Homo sapiens 


Human PR01475 (UNQ746) amino _[ 


452 


98 
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SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 




















acid sequence SEQ ID NO:88. 




-— 


5823 


AB04080 1 


Homo sapiens 


SREB3 


61 1 


95 


5824 


M27878 


Homo sapiens 




462 


--^ 


5825 


Y 13620 


Homo sapiens 


BCL9 





~~o^ 


5826 


AF 1525 10 


omo sapiens 


protocadherin gamma A3 short form 




92 









P rotein 






„■ ■ 


IVIZj^tJ 1 




steroid receptor 


-— - — ■ 


7 

61 







am bisexual is 










-M.JU 1 IJUJ 


Homo sapiens 


guanine nucleotide exchange factor, 


— ^ 


100 










delta subunit 










— : 

omo sapiens 


Human homologue of UNC-53 (Hs- 




710 


96 




- 




UNC-53/2) sequence 






5830 


U03277 


~d~o — iTi 




~258 


— 






melano aster 
g ser 


when^xpresse<^ 












Schizosaccharomyces pombe 






5831 


AF 193342 


Mus musculus 


GCN2alpha 


469 


85 


5832 


AF279265 


Homo sapiens 


putative anion transporter 1 


240 


91 


5833 


AK023998 


Homo sapiens 




687 


97 


5834 


AK023998 


Homo sapiens 


unnamed protein product 


674 


96 


5835 


AF071059 


Mus musculus 


talin KNA b ' nding P rotein 


243 




5836 


AF 178534 


Homo sapiens 




463 


~6ir — 


5837 


J04569 


Homo sapiens 


— lial fibrilla — acidic rotein 


102 


~55 


5838 


AF068625 




' DNA cytosine 5 methyltransferase 3A — 


~4zz 


~XQ 


5839 


AB037739 


Homo a iens 

omo sapiens 


KIAA 1318 protein 




— 




5840 


R86865 


omo sapiens 


Human protocadherin pcj. 


TKn 

—52 


_96 


5841 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


84 
















* ~ 


^ ^ 


Gallus gallus 


cNrl-Al protein (AA 1-d22) 




86 


5843 


U65785 


Homo sapiens 


1 50 kDa oxygen-regulated protein 


200 


95 








ORP150 






5844 


16126 


Homo sapiens 


EAA4 


. 

679 


97 


5845 


AL035071 


Homo sapiens 


dJ1085Fl 7.1.3 (ortholog of mouse 


717 


100 








DNMT3B (DNA Cytosine-5 












Methyltransferase 3B3) (isoform 3)) 


, 




"5846 


yi3620 


Homo sapiens 


BCL9 


_505 


_ 81 


5847 


a rnn/ic i o 
ACUU'+o 1 J. 


Homo sapiens 


similar to human HsGCNl U77700 


316 


100 








(PID:g2282576); similar to yeast 












translation activator GCN1 












fPTTVo4/i91 fiSCt 

VTJLi-'.gHOZ lOO^ 









M77246 




attus 


beta-chain clathnn associated protein 


61 


— 

89 






norvegicus 


complex AP-2 






5849 


AF007833 


Homo sapiens 


hcl^>c related Zin ° fi " Ser pTOtsm 


"569 


— 

79 














5850 


AF056617 


H mo a ens 

omo sapiens 


BWSCR2 associated zinc-finger 


— — 


— 

32 








protein BAZ1 






"5851 




D38548 


— : 

Jriomo sapiens 


The ha0936 gene product is novel. 


— — 

— 5l 


— 

65 




G00528 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


66 


55 










4609. 










— ; 

Homo sapiens 


similar to C.elegans protein encoded in 




188 


76 


— ■- 






cosmiu i zvuj ^uoiZUj. 






-||- 




— ; 

Homo sapiens 


glial factor- 1 


557 


83 






Homo sapiens 


Human secreted protein, SEQ ID NO: 


400 


82 








5561. 






5856 


X06345 


Xenopus laevis 


lamin A (AA 1-265) 


109 


68 


5857 


ABO 17499 


Oryctolagus 


Lipoprotein Receptor Related Protein 5 


196 


40 






cuniculus 








5858 


D43633 


Oryzias latipes 


G protein-coupled seven- 


281 


50 








transmembrane receptor 






5859 


AF067165 


Homo sapiens 


zinc finger protein 3 


669 


87 



285 
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SEQ 10 


Accession No. 




Description ■ 


Smith- 




NO: 








Waterman 


Identit 










Score 


y 


5860 


AL031118 


Homo sapiens 


dJ153G14.3 (novel C2H2 type Zinc 


218 


38 








Finger protein) 






5861 


U13897 


Homo sapiens 


homolog of Drosophila discs large 


130 


35 








protein, isoform 1 






5862 


M 74 002 


Homo sapiens 


arginine-rich nuclear protein 


80 


23 


5863 


X67855 


Oryctolagus 


brain calcium channel BII-1 


147 


66 


.. ... 




cuniculus 








5864 


Lb5376 


Homo sapiens 


repressor transcriptional factor 


475 


67 


5865 


AF251038 


Homo sapiens 


GAP-Iike protein 


325 


58 


5866 


L46815 


Mus musculus 


DNA binding protein Rc 


409 


66 


5867 


S68944 


Rattus sp. 


Na+/Cl(-)-dependent neurotransmitter 


582 


95 








transporter 






5868 


X97675 


Homo sapiens 


plakophilin 2b 


159 


93 


5869 


AF 152478 


Homo sapiens 


protocadherin alpha 1 3 short form 


680 


100 








protein 






5870 


AF1 18082 


Homo sapiens 


PRO 1 902 


130 


48 


5871 


G0341 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


49 








7492. 






5872 


A61971 


unidentified 


MCSP 


621 


82 


5873 • 


Z54206 


Bovine 


UL51 


104 


33 






herpesvirus 1 








5874 


AF069765 


Homo sapiens 


signal recognition particle 72 


674 


97 


5875 


AF 179428 


Homo sapiens 


DNA polymerase zeta catalytic subunit 


692 


100 








variant 1 






5876 


Y27854 


Homo sapiens 


Human secreted protein encoded by 


119 


48 








gene No. 101. 






5877 


Y9499D 


Homo sapiens 


Human secreted protein vb21 J, SEQ 


446 


98 








IDNO:20. 






5878 


L08811 


Drosophila 


adherin 


264 


42 






melanogaster 








5879 


D86983 


Homo sapiens 


similar to D.rnelanogaster 


429 


100 








peroxidasin(U 11052) 






5880 


AF 152508 


Homo sapiens 


protocadherin gamma A2 short form 


664 


100 








protein 






5881 


AF1 52508 


Homo sapiens 


protocadherin gamma A2 short form 


660 


98 








protein 






5882 


AF190018 


Mus musculus 


K-Cl cotransporter 


199 


88 


5883 


AB023229 


Homo sapiens 


KIAA 101 2 protein 


107 


83 


5884 


U53420 


Rattus 


sodium-calcium exchanger form 3 


595 


98 






norvegicus 








5885 


AF039019 


Homo sapiens 


zinc finger DNA binding protein 89 


608 . 


92 








kDa 






5886 


AF207661 


Homo sapiens 


sodium bicarbonate cotransporter-like 


701 


100 








protein 






5887 


AF 130051 


Homo sapiens 


PRO0898 


115 


74 


5888 


AF152508 


Homo sapiens 


protocadherin gamma A2 short form 


720 


100 








protein 






5889 


AF251079 


Homo sapiens 


heptacellular carcinoma novel gene-3 


741 


98 








protein 






5890 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 


118 


51 








oncoprotein. 






5891 


AF157326 


Homo sapiens 


TIP 120 protein 


571 


85 


5892 


AF129170 


Homo sapiens 


apolipoprotein E receptor 2 


630 


99 


5893 


X05561 


Homo sapiens 


alpha- 1 chain precursor (AA -27 to 


197 


45 








9 1 7) (2953 is 2nd base in codon) 






5894 


AJ238374 


Homo sapiens 


putative protein TH1 


641 


96 


5895 


X63526 


Homo sapiens 


homologue to elongation factor 1- 


709 


97 








garruna from A.salina 






5896 


Y11651 


Homo sapiens 


phosphate cyclase 


521 


80 
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SEQ II) 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Idcntit 

y 




Y 18448 










5898 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


63 


5899 


AF 152473 


Homo sapiens 


protocadherin alpha CI short form 
P rotein — n SEO ID NO — 


733 


100 


9 q 


G03790 


„ — . 

omo sapiens 


787^ an SeCrC E e ' n ' 


— — 


-— 


5901 


R06463 


Homo sapiens 


Derived protein of clone ICA13 
(ATCC 40553). 


522 


99 


5902 


X12928 


Triticum 
aestivum 


HMW glutenin subunit 5 (A A 1-848) 


169 


25 


5903 


X61048 


Hydra sp. 


mini-collagen 


88 


33 




AF1 1 8082 


Homo sapiens 


PRO 1 902 







TSfiZ 


A00279 


construct 


Human serum albumin 





84 


5906 


T14722_cdl 


Homo sapiens 


01 -DEC- 1992 Human CD27 antigen 


431 


94 


5907 


Y73475 


Homo sapiens 


Human secreted protein clone ye7_l 
protein sequence SEQ IDNO:172. 


271 


94 


5908 


AF1 75265 


Homo sapiens 


vacuolar sorting protein 35 


645 


92 






Mus musculus 


Pro-Pol-dUTPase polyprotein 






5910 


G 003 60 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


58 


5911 
— — — 


W88762 


Homo sapiens 


Polypeptide fragment encoded by gene 


152 


50 


■ 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


— — 


68 




GCP827 


— ; 

omo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


— — 




71 


5914 


AB046048 


Macaea 

fascicu 1 aris 


unnamed portein product 


143 


~49 _ 


5915 


S 80644 


Sus scrofa 


glutaminase; GA 


338 


85 


5916 


AF2 13393 




ATP-binding cassette protein 






5917 


X52622 


Mus musculus 


R p - . lr 


425 


-~ 


-~~ 


S^MO" 


~Tj = 

Homo sapiens 


— • , 10n un — 




-p- 








m33-A isotorm 





— 


5920 


AF090931 


Homo sapiens 


PRO0483 


94 


89 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 









AJ006770 


Cicer arietinum 






"4^ — 




AJ250193 




musck'protein 637 


~T?n 




~592^ 


AF229067 


Homo^sapiens 


PADI-H protein 


~^78 


"63 


5925 


X92485 


Plasmodium 
^' V3X : 


pval 


99 


50 


■ 





omo sapiens 


— — — — — 

Human secreted protein, SEQ ID NO: 

7795. 


-— 


— 


"5927 


AF130089 


„ — — .- 


PRO2550 


~7T4 


hi — 


5928 


J00126 


Homo sapiens 


MHC factor B 


237 


"Too" — 


5929 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


251 


79 


5930 


AF090895 


Homo sapiens 


PRO0117 


63 


62 


593] 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


147 


37 




U83857 


Homo sapiens 








5933 


AF068294 


Homo sapiens 


HDCMB45P 


144 


50 


5934 


X92485 


Plasmodium 
vivax 


pval 


135 


69 


5935 


L07545 


Leishmania 
tarentolae 


A 'c' was inserted after nt 369 (=nt 
10459 in genomic sequence (M10126)) 
to correct -1 firameshift probably due to 


127 


36 
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SEQ ID 

NO. 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Idenrtr 

y 








gel compression 






5936 


R95913 


Homo sapiens 


Neural thread protein. 


276 


58 


5937 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


130 


65 


5938 


Y16781 


Homo sapiens 


Human secreted protein (clone 
bh!57_7). 


1258 


100 


5939 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


167 


68 


5940 


U93570 


Homo sapiens 


putative pi 50 


103 


41 


5941 


G03472 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7553. 


189 


74 


5942 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


110 


47 


5943 


AL390114 


Leishmania 
major 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


122 


90 


5944 


AF1 30089 


Homo sapiens 


PRO2550 


101 


48 


5945 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


162 


86 


5946 


X92485 


Plasmodium 
vivax 


pval 


104 


70 


5947 


M29622 


Mus musculus 


open reading frame 2 


132 


64 


5948 


AF130089 


Homo sapiens 


PRO2550 


13-1 


60 


5949 


AF1 30089 


Homo sapiens 


PRO2550 


151 


60 


5950 


AF130079 


Homo sapiens 


PR02852 


117 


52 


5951 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


185 


62 


5952 


U49973 


Homo sapiens 


ORF2: function unknown 


160 


62 


5953 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


145 


55 


5954 


M22332 


Homo sapiens 


unknown protein 


169 


41 


5955 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJ WIS. 


133 


72 


5956 


U70935 


Peromyscus 
maniculatus 


reverse transcriptase 


100 


36 


5957 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


105 


63 


5958 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


112 


48 


5959 


Z49944 


Canis familiaris 


CDC42 GTP-binding protein 


120 


40 


5960 


AF 118082 


Homo sapiens 


PRO 1902 


158 


61 


5961 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


91 


65 


5962 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


121 


62 


5963 


AF118082 


Homo sapiens 


PRO 1902 


113 


60 


5964 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


65 


5965 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


120 


57 


5966 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


306 


77 


5967 


G03224 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7305. 


80 


41 


5968 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


109 


75 


5969 


AF1 19855 


Homo sapiens 


PRO 1847 


162 


72 


5970 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


146 


46 


5971 


M19155 


Plasmodium 
falciparum 


S-antigen precursor 


122 


37 


59/2 Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. . 


121 


66 
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•1SDOCID: -=WO_ 



.016483 5A2_I_= 



PCT/USO 1/0492 7 



SEQID 

NO: 


Accession No. 


Species 


Description 


| Smith- 
Waterman 
-? C .°V C 


ldentit 

y 




A 1 7 KCi'Zn 1 

AL.1 OUJ / 1 


._ ... 

Leishmania 
major 


_ . 

probable (hhv-6) u 1 102, variant a 
DNA, complete virion genome 




60 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 




68 


5975 


G00497 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4578. 


1 1 1 


47 


5976 


U93572 


Homo sapiens 


p40 


1 12 


33 


5977 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


107 


58 


5978 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


120 


70 


5979 


AL 160493 


Leishmania 
major 


probable (hhv-6) ul 1 02, variant a 
DNA, complete virion genome 


106 


54 


5980 


M64792 


Rattus 
norvegicus 


salivary proline-rich protein 


105 


31 


5981 


X66285 


Mus musculus 


HCI ORF 


86 


36 


5982 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


110 


43 


5983 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


131 


60 


5984 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


110 


60 


5985 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


107 


47 


5986 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


160 


77 


5987 


AF130079 


Homo sapiens 


PR02852 


272 ■ 


77 


5988 


U93570 


Homo sapiens 


putative pi 50 


87 


46 


5989 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


202 


89 


5990 


AF130089 


Homo sapiens 


PRO2550 


238 


74 


5991 


AF1180S2 


Homo sapiens 


PRO 1902 


119 


41 


5992 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


131 


60 


5993 


L27428 


Homo sapiens 


reverse transcriptase 


110 


60 


5994 


S794 1 0 


Mus musculus 


nuclear localization signal binding 
protein 


98 


45 


5995 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


189 


90 


5996 


X58236 


Homo sapiens 


36/8-8 fusion protein with epitope for 
anti-lectin antibody 


78 


68 


5997 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


97 


63 


5998 


AP002460 


Arabidopsis 
thaliana 


gene_id:FlD9,26~unknown protein 


100 


38 


5999 


U93569 


Homo sapiens 


p40 


103 


52 


6000 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


100 


63 


6001 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


321 


51 


6002 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


115 


64 


6003 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


40 


6004 


AF 130089 


Homo sapiens 


PRO2550 


122 


63 


6005 


R95913 


Homo sapiens 


Neural thread protein. 


235 


56 


6006 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


173 


58 


6007 


X92485 


Plasmodium 
vivax 


pval 


96 


53 



289 



NSDOCID: <WO 0164835A2_I_» 
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SEQID 

NO: 


Accession No. 




• Description' '• - 


Waterman 


% 

Identit 

y 


600S 


C03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


236 


62 


6009 


Y17832 


Human 
endogenous 
retrovirus K 


env protein 


103 


66 


60] 0 


AF1 19900 


Homo sapiens 


PR02822 


91 


50 


601] 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


155 


55 


6012 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


201 


91 


6013 


Y36156 


Homo sapiens 


Human secreted protein #28. 


253 


70 


6014 


AF1 30079 


Homo sapiens 


PR02852 


119 


36 


6015 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


137 


63 


6016 


AF 130089 


Homo sapiens 


PRO2550 


1 10 


40 


6017 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


57 


6018 


AF1 16661 


Homo sapiens 


PRO 1438 


147 


69 


6019 


AF130089 


Homo sapiens 


PRO2550 


169 


71 


6020 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


208 


60 


6021 


AF 119851 


Homo sapiens 


PRO 1722 


147 


65 


6022 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588. 


139 


47 


6023 


AF130089 


Homo sapiens 


PRO2550 


142 


66 


6024 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


91 


76 


6025 


Ml 8093 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


103 


48 


6026 


D86853 


Catharanthus 
roseus 


extensin 


97 


46 


6027 


AF1 30089 


Homo sapiens 


PRO2550 


95 


47 


6028 


X92485 


Plasmodium 
vivax 


pval . 


102 


61 


6029. 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


149 


53 


6030 


G00643 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4724. 


96 


76 


6031 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


144 


44 


6032 


ABO 11483 


Arabidopsis 
thaliana 


contains similarity to root cap 
prote in~gene_id : MUF9 . 1 6 


117 


37 


6033 


X92485 


Plasmodium 
vivax 


pval 


85 


40 


6034 


AF1 18082 


Homo sapiens 


PRO! 902 


112 


58 


6035 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


121 


59 


6036 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


140 


65 


6037. 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


107 


71 


6038 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


140 


65 




X92485 


Plasmodium 
vivax 


pval 


84 


78 


6040 


AF130051 


Homo sapiens 


PRO0898 


107 


69 


6041 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


113 


58 


6042 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. . 


209 


75 



290 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 

y 


■ 


~ 


Homo sapiens 


~F fh ted — oP 

encoded by gene 27 £ ^ ^ 


^° rC 


56 


"6044 




H m sa ien 

omo sapiens 


Human secreted protein SEQ ID NO* 
7870. 


~126 


— 


6045 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


65 


6046 


Y36156 


Homo sapiens 


Human secreted protein #28. 


93 


62 


6047 


AF130079 


Homo sapiens 


PR02852 


205 


83 


6048 


R95913 


Homo sapiens 


Neural thread protein. 


148 


45 


6049 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


234 


86 






Rattus° e * 0dyteS — 


NADH dehydrogenase subunit 5 


403 


61 


— 




norvegicus 


proline-ncn salivary protein 








apt inn»Q 


Homo sapiens 


pp ms'ift 






6053 


AF1 19851 


Homo sapiens 


PRO 1722 


109 


68 


6054 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


168 


66 


6055 


AF161356 


Homo sapiens 


HSPC093 


83 


58 






Plasmodium 
vivax 


pval 






6057 


G04000 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


520 


100 


6Q58 


AF010144 


Homo sa iens 


neuronal thread protein AD7c NTP 


T4S 


— 


6059 


W03626 


Homo sapiens 


Human thyrotropin GPRN-terminal 
sequence. . — 


163 


53 




6060 


G 02 8 72 


— : 

Homo sapiens 


Human secreted protein, oEQ JD NO: 
6953. 


"Ton 


-— 


6061 


U93563 


Homo sapiens 


putative pi 50 


389 


37 


~§o"6i — 




Homo sapiens 


r JtvVJU 1 1 f 


66 









Plasmodium 
vivax 








— 


/\r i juuo7 


Homo sapiens 




Tl| 









Homo sapiens 


unknown protein 






6066 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


101 


52 


6067 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


107 


41 


6068 


G00528 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


75 


6069 


M22332 


Homo sapiens 


unknown protein 


153 


80 




Ar'UUUOl O 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 










Homo sapiens 


Human secreted, protein, SEQ ID NO; 
4484. 






-|5ZB 


r\r 1JUU fry 


Homo sapiens 


PR02852 










Homo sapiens 


Human secreted protein SEQ ID NO* 
4478. 


132 


~53 






Homo sapiens 


Humon secreted protein, SEQ ID NOi 
7795. 






6075 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


68 
— — 


~6076 


/vr 1 JO / 1 — 


— : 

Homo sapiens 


PR02738 


"TP 




6077 


W48351 


Homo sapiens 


Human breast cancer related protein 


102 


69 


6078 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


109 


60 


6079 


AF 118082 


Homo sapiens 


PRO 1902 


196 


61 


6080 


AF1 18082 


Homo sapiens 


PRO 1902 


165 


59 


6081 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


153 


66 



291 



MSDOCID: <WO 0164835A2_L5 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 
- 


Accession No. 




Description, 


Waterman 
Score 


Idcntit 

y 


6082 


AFO 1 0 1 44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


188 


71 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


109 


60 




V00672 


Pan troglodytes 


reading frame protein 4 


185 


85 


6085 


AF130051 


Homo sapiens 


PRO0S98 


307 


67 


6086 


AF130079 


Homo sapiens 


PR02852 


96 


72 


6087 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


171 


82 


608S 


AF090S95 


Homo sapiens 


PRO01 17 


110 


67 


6089 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


101 


46 


6090 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


114 


36 


6091 


Y02925 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 101. 


111 


46 


6092 


G00366 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4447. 


137 


54 


6093 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


176 


59 


6094 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


199 


50 


6095 


U70935 


Peromyscus 
maniculatus 


reverse transcriptase 


216 


44 


6096 


X71442 


Rattus 
norvegicus 


ORF 1 ; putative 


109 


53 


6097 


AF1 19855 


Homo sapiens 


PRO 1847 


88 


71 


6098 


L27428 


Homo sapiens 


reverse transcriptase 


104 


45 


6099 


AF090895 


Homo sapiens 


PRO0I17 


119 


61 


6100 


U52077 


Homo sapiens 


mariner transposase 


295 


67 


6101 


G03714 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
7795. 


156 


73 


6102 


M22332 


Homo sapiens 


unknown protein 


132 


44 


6103 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


140 


56 


6104 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


64 


47 


6105 


M64791 


Rattus 
norvegicus 


salivary proline-rich protein 


123 


41 


6106 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


139 


69 


6107 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


75 




6108 


AF068294 


Homo sapiens 


HDCMB45P 


141 


43 


6109 


AF1 18086 


Homo sapiens 


PRO 1992 


160 


77 


6110 


M63819 


Plasmodium 
falciparum 


malaria antigen 


62 


46 


6111 


G03093 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7174. 


130 


65 


6112 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


282 


70 


6113 


AF119851 


Homo sapiens 


PRO 1722 


156 


63 


61 14 


AF229067 


Homo sapiens 


PADI-H protein 


134 


48 


6115 


AFO 10 144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


143 


44 






Homo sapiens 


cytochrome c oxidase subunit 1 


201 


80 


6117 


AF285567 


Microcebus 
griseorufus 


cytochrome b 


185 


58 


6118 


AL138656 


Arabidopsis 
thaliana 


extensin precursor-like protein 


141 


28 


6119 


X92485 


Plasmodium 
vivax 


pval 


165 


61 



292 



SDOCID: <WO 0164835A2_I_> 



WO 01/64835 



PCT/USO 1/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 


6120 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


164 


84 


_ 6121 


lvl J 3 1 UU 


Raftus " 

norvegicus 


unknown protein 


"176 


"49" 


6122 


G03 1 :>3 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


177 


50 


6123 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


83 


71 


6124 


AF1300S9 


Homo sapiens 


PRO2550 


109 


85 


6125 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


129 


70 


6126 


AP144054 


Homo sapiens 


apoptosis related protein APR-4 


149 


54 






Homo sapiens 


Lj,. m n n cf»nr*»t*»H nrntpin C pf> I Pi MO* 

iiurrian bccrcicu pruicin, ocy ixj inv^f. 
7795. 




70 




T T/l 

VjHyy id 


Homo sapiens 


OR.F1 ] MER37j putative transposase 
similar to pogo element 










Homo sapiens 


putative pi 50 






6130 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


129 


57 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


"127 


~56 


6132 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


195 


67 


6133 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1. 


92 


42 


6134 


AK000496 


Homo sapiens 


unnamed protein product 


158 


59 


6135 


X92485 


Plasmodium 


pval 


146 


55 


6136 


Y36203 


Homo sapiens 


Human secreted protein #75. 


133 


57 


6137 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


68 


6138 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


106 


36 


~* 




Homo sapiens 


PROftRQR 
rS\\J\Joyo 


77 


65 


6140 


G0j79j 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


222 


73 


"6141 


ATI 1 QBnI 
AT 1 J 70J 1 


Homo sapiens 


PROl 722 


107 


63 


6142 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


117 


57 


— — 

■ -i— 


ftgggjg 


"H : 

omo sapiens 




1 14 


70 








HDCMB45P 


123 


56 


~~6\45 




Homo sapiens 


PRO 1 902 


1 13 


32 


6146 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


105 


42 






Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


938 


100 






Homo sapiens 


putative pi 50 






6149 


AF090894 


Homo sapiens 


PRO0113 


145 


55 




AF 130079 


Homo sapiens 








6151 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


48 


6152 


G03224 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


89 


60 
— 


6 5 


Y01398 


Homo sapiens 


Secreted protein encoded by gene 16 

clone HNGFR75. 


90 




6154 


X92485 


Plasmodium 
vivax 


pval 


110 


42 


6155 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


118 


48 


6156 


G01495 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


170 


89 



293 



WO 01/64835 



PCT/US01/04927 



SEQrD 
NO: 


Accession No. 




Description 


Waterman 
Score 


lilentit 

y 


- 






5576. 










Plasmodium 
vivax 


pval 


157 


52 




G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


108 


58 


6159 


U36794 


Coturnix 
japonica 


Cytochrome oxidase subunit I 


70 


73 


6160 


AF 1 18082 


Homo sapiens 


PRO 2 902 


1 19 


66 


6161 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


154 


64 


6162 


GO 1984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


107 


37 


6163 


M22332 


Homo sapiens 


unknown protein 


125 


51 


6164 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


163 


59 


6165 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


235 


64 




G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


108 


36 




AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


139 


59 


6168 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


562 


85 


6169 


D3S113 


Pan troglodytes 


NADH dehydrogenase subunit 4 


296 


51 


6170 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
6613. 


95 


85 


6171 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


329 


73 


6172 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
6953. 


136 


53 


6173 


K03206 


Homo sapiens 


salivary proline-rich protein precursor 


104 


36 


6174 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


136 


75 


6175 


U93569 


Homo sapiens 


P 40 


192 


40 


6176 


X70343 


Nicotiana 
sylvestris 


extensin 


94 


35 


6177 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


140 


81 


6178 


AF1 19855 


Homo sapiens 


PRO 1847 


114 


43 


6179 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


98 


68 


6180 


X03717 


Homo sapiens 


pot. unidentified reading frame 


112 


42 


6181 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


204 


76 


6182 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


110 


73 


6183 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 3 


240 


88 


6184 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


102 


47 


6185 


X8603 1 


Sus scrofa 


homologue of proline/arginine-rich 
antibacterial protein 


99 


49 


6186 


Y36156 


Homo sapiens 


Human secreted protein #28. 


196 


75 


6187 


AF 169387 


Mus musculus 


alpha 3 collagen IV 


111 


38 


6188 


AF194537 


Homo sapiens 


NAG 13 


138 


37 




X91836 


Vigna 
unguiculata 


extensin class 1 protein 


167 


33 


6190 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


175 


63 


6191 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


123 


46 


6192 


X92485 


Plasmodium 


pval 


115 


54 



294 



'SDOCID: <WO _0164B3SA2J_> 



WO 01/64835 PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 
Waterman 

Score 


ldentit 

y 






vivax 








6193 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


345 


65 


6194 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


82 


54 


6195 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


214 


56 


6196 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


87 


79 


6197 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


118 


41 


6198 


X62691 


Homo sapiens 


ribosomal protein homologous to yeast 
S24 


215 


73 


6199 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


100 


46 


6200 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


143 


73 


6201 


V00711 


Mus musculus 


cytochrome oxidase I 


435 


92 


6202 


U93572 


Homo sapiens 


putative pi 50 


99 


63 


6203 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


109 


45 


6204 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


163 


39 


6205 


M15530 


Homo sapiens 


B-cell growth factor 


1 13 


60 


6206 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


128 


50 


6207 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


181 


72 


6208 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


123 


75 


6209 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


120 


62 


6210 


Y36203 


Homo sapiens 


Human secreted protein #75. 


169 


81 


6211 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


130 


52 


6212 


G03295 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7376. 


123 


48 


6213 


D86854 


Catharanthus 


extensin 


120 


44 


6214 


G02754 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6835. 


117 


100 


6215 


W33698 


Homo sapiens 


AL-2-long (AL-21) protein. 


262 


100 


6216 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


102 


62 


6217 


U40342 


Mus musculus 


ninem 


77 


72 


6218 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


174 


49 


6219 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


234 


54 


6220 


X92485 


Plasmodium 


pval 


107 


46 


6221 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


104 


40 


6222 


X86029 


unguiculata 


extensin-like protein 


140 


35 


6223 


G03790 


Homo sapiens 


Human secreted protein SEQ IO WO" 
7871. 




59 


6224 


AF068294 


Homo sapiens 


HDCMB45P 


164 


60 


6225 


Y50797 


Homo sapiens 


Human calmodulin/NEF protein 
binding inhibitor 2. 


135 


50 


6226 


AE003859 


Xylella 
fastidiosa 


hypothetical protein 


139 


41 


6227 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 | 58 



295 



JSDOCID: <WO 016483SA2_I_:> 



WO 01/64835 



PCT/US01/04927 







Species 


Description-. 






NO: 








Waterman 


Identit 










Score 


y 








4718. 






6228 


AF130089 


Homo sapiens 


PRO2550 


126 


43 


6229 


Y8624S 


Homo sapiens 


Human secreted protein HCHPF68, 


190 


88 








SEQID NO: 163. 






6230 


AP000616 


Oryza sativa 


similar to RING-H2 Finger protein 


124 


77 








RHAla(AF0786S3) 






6231 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


174 


79 








SEQID NO: 163. 






6232 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


168 


78 








SEQID NO: 163. 






6233 


U49973 


Homo sapiens 


ORF1; MER37; putative rransposase 


164 


67 








similar to pogo element 






6234 


Y91577 


Homo sapiens 


Human secreted protein sequence 


288 


94 








encoded by gene 2 SEQ ID NO.250. 






6235 


AF1 13685 


Homo sapiens 


PRO0974 


114 


46 


6236 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


40 








7214. 






6237 


AL096844 


Streptomyces 


putative integral membrane protein 


88 


39 






coelicolor A3 (2) 








6238 


AF1 30079 


Homo sapiens 


PR02852 


167 


50 


6239 


AF1 30089 


Homo sapiens 


PRO2550 


365 


85 


6240 


L78671 


Homo sapiens 


CoxII/D-loop DNA fusion protein 


212 


69 


6241 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


102 


54 








gene No. 107. 






6242 


AF1 09907 


Homo sapiens 


S164 


146 


90 


6243 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


132 


52 








gene 22 clone HMSJW 18. 






6244 


AF010I44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


138 


52 


6245 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


32 








7871. 






6246 


AF130089 


Homo sapiens 


PRO2550 


154 


77 


6247 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


91 


40 


6248 


Y36156 


Homo sapiens 


Human secreted protein #28. 


105 


43 


6249 


AF1 16661 


Homo sapiens 


PRO 143 8 


131 


60 


6250 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


68 








7795. 






6251 


G00381 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


70 


73 








4462. 






6252 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 


106 


42 








IDNO:1051. 






6253 • 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


247 


68 


6254 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


117 


75 








SEQ ID NO: 163. 






6255 


V00662 


Homo sapiens 


cytochrome oxidase I 


355 


78 


6256 


Y02999 


Homo sapiens 


Fragment of human secreted protein 


131 


65 








encoded by gene 121. 






6257 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


176 


86 


6258 


AF061340 


Artibeus 


cytochrome c oxidase subunit III 


308 


63 






jamaicensis 








6259 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


114 


43 


6260 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


179 


78 








6908. 






6261 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


192 


75 








similar to pogo element 






6262 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


71 








7795. 






6263 


U091 16 


Homo sapiens 


ORFI, encodes a 40 kDa product 


] 14 




6264 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 










7344. 







296 



016483SA2 I > 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 

y 


6265 


U49974 


Homo sapiens 


mariner transposase 


206 


71 


6266 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


71 


72 


6267 


AF097473 


Mus muscuJus 


ORF1 


86 


40 


6268 


X70j>43 


Nicotiana 
sylvestris 


extensin 


95 


33 


6269 



W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


94 


84 


"T^TT 


L27428 


Homo sapiens 


reverse transcriptase 


132 


48 


-Hrii 


a T t i sns? 
nr 1 I 8Uoi 


Homo sapiens 




153 




6272 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


222 


91 


6273 




Homo sapiens 


pot. (JtsT V 


89 


35 






Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


90 


71 




U93572 


Homo sapiens 


p40 


83 


43 


6276 


AF058700 


Bos taurus 


ubiquitin-S27a fusion protein 


148 


70 


6277 


AF0 16099 


Mus musculus 


endonuc lease/reverse transcriptase 


96 


65 


6278 


AF 130079 


Homo sapiens 


PR02852 


179 


86 


6279 


AF161356 


Homo sapiens 


HSPC093 


108 


57 


6280 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


109 


51 


6281 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7324. 


130 


58 


6282 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7492. 


81 


69 


6283 


AF068294 


Homo sapiens 


HDCMB45P 


98 


37 


6284 


AF1 16715 


Homo sapiens 


PR02829 


112 


68 


6285 


X92485 


Plasmodium 


pval 


92 


64 


6286 


U93570 


Homo sapiens 


p40 


90 


38 


6287 


Y36203 


Homo sapiens 


Human secreted protein #75. 


225 


62 


6288 


Z 1 1 793 


Homo sapiens 


selenoprotein P 


220 


65 


6289 


X92485 


Plasmodium 
vivax 


pval 


125 


70 


6290 


X52634 


Mus musculus 


tlm protein 


114 


45 


6291 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


94 


36 




AF1 16661 


Homo sapiens 


PROl 438 




46 


6293 


L27428 


Homo sapiens 


reverse transcriptase 


81 


44 


6294 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


51 


6295 


AF130089 


Homo sapiens 


PRO2550 


137 


35 


6296 


L27428 


Homo sapiens 


reverse transcriptase 


96 


53 


6297 


B0 1372 


Homo sapiens 


Neuron-associated protein. 


134 


69 


6298 


G02S72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 10 


58 


6299 


AF130089 


Homo sapiens 


PRO2550 


110 


61 


6300 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


136 


63 


6301 


AF220264 


Homo sapiens 


MOST-1 


172 


90 


6302 


AF130079 


Homo sapiens 


PR02852 


311 


71 


6303 


AF068294 


Homo sapiens 


HDCMB45P 


136 


53 


6304 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


110 


62 


6305 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


113 


45 


6306 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


166 


58 
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SEQ ID 

NO: 


Accession No. 




Description 


Waterman 
Score 


Identit 

y 


6307 


S85655 


Homo sapiens 


prohibitin 


664 


91 


6j08 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


155 


64 


6309 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


151 


73 


6310 


AF1 19851 


Homo sapiens 


PRO 1722 


160 


67 


6311 


L27428 


Homo sapiens 


reverse transcriptase 


91 


50 


6312 


AF130089 


Homo sapiens 


PRO2550 


140 


55 


6313 


AF 130089 


Homo sapiens 


PRO2550 


211 


68 


6314 


X15917 


Paramecium 
aurelia 


ORF4 protein (A A 1-156) 


97 


38 


6315 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


94 


51 


6316 


Y36203 


Homo sapiens 


Human secreted protein #75. 


124 


65 


6317 


L27428 


Homo sapiens 


reverse transcriptase 


117 


56 


6318 


Ml 1900 


Mus muscuius 


15-kDa proline-rich salivary protein 


105 


41 


6319 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


175 


69 


6320 


U92673 


Eurythenes sp. 
'AUS-3F' 


cytochrome oxidase [ 


59 


37 


6321 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


283 


72 


6322 


AK000496 


Homo sapiens 


unnamed protein product 


206 


70 


6323 


AB047934 


Macaca 
fascicularis 


hypothetical protein 


173 


68 


6324 


AB023485 


Mus muscuius 


transcription factor CA150b 


131 


43 


6325 


AF020262 


Bos taurus 


general protein transport factor pl6 


76 


67 


6326 


U22376 


Homo sapiens 


alternatively spliced product using exon 
13A 


98 


57 


6327 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


115 


60 


6328 


L RS6800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


100 


57 


6329 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


115 


67 


6330 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


82 


63 


6331 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


138 


57 


6332 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


112 


37 


6333 


M69297 


Homo sapiens 


ORF 3 


70 


43 


6334 


U 1 5647 


Mus muscuius 


reverse transcriptase 


146 


53 


6335 


AF090894 


Homo sapiens 


PRO0113 


107 


48 


6336 


AF] 30089 


Homo sapiens 


PR.O2550 


359 


75 


6337 


AJ271872 


Nicotiana 
sylvestris 


extensin 


134 


32 


6338 


A J 132694 


Homo sapiens 


small GTPase raclb 


99 


100 


6339 


AF119851 


Homo sapiens 


PRO 1722 


123 


60 


6340 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


215 


56 


6341 


U83774 


Pernis apivorus 


cytochrome oxidase I 


193 


80 


6342 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


139 


62 


6343 


X924S5 


Plasmodium 
vivax 


pval 


104 


81 


6344 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


110 


64 


6345 


U93572 


Homo sapiens 


p40 


79 


48 


6346 


X92485 


Plasmodium 


pval 


156 


55 
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SEQ ID 

NO: 


Accession No. 




Description 


Smith- 
Waterman 
Score 


Idcntit 

y 






vivax 












Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 










Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 






6349 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 




64 


6350 


AF130089 


Homo sapiens 


PRO2550 


1 87 


69 


6351 


U56732 


Rattus 
norvegicus 


KRAB/zinc finger suppressor protein 1 


161 


64 


6352 


U70932 


Peromyscus 
leucopus 


reverse transcriptase 


126 


61 


6353 


X55687 


Lycopersicon 
esculentum 


extensin (class II) 


89 


44 


6354 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


107 


57 


6355 


G00357 


Homo sapiens 


Human secreted protein.. SEQ ID NO: 
4438. 


88 


76 


6356 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


98 


61 


6357 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


165 


51 


6358 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


214 


58 


6359 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


75 


88 


6360 


X07882 


Homo sapiens 


Po protein 


151 


47 


6361 


Zl 1592 


Phytophthora 
parasitica 


cytochrome c oxidase subunit 3 


78 


76 


_ , ._ 
6362 


... . 
AF090944 


Homo sapiens 




PRO066j 


- 

1 j>0 


46 


6363 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


117 


39 


6364 


AF229067 


Homo sapiens 


PADI-H protein 


155 


75 


6365 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


100 


55 


6366 


AK024501 


Homo sapiens 


FLJ001 10 protein 


282 


100 


6367 


AF1 18082 


Homo sapiens 


PRO 1902 


157 


57 


6368 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


151 


63 


6369 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


161 


96 


6370 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


63 


6371 


AF 13 0079 


Homo sapiens 


PR02852 


297 


77 


6372 


X92485 


Plasmodium 
vivax 


pval 


129 


42 


6373 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


122 


63 


6374 


AF1 19851 


Homo sapiens 


PRO 1 722 


251 


65 


6375 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


139 


73 


6376 


X70343 


Nicotiana 
sylvestris 


extensin 


104 


35 


6377 


G03224 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


70 


52 


6378 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


97 


69 


6379 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


109 


51 


6380 


Z38128 


Mus musculus 


histone HI 


113 


36 


6381 


U93573 


Homo sapiens 


p40 


212 


43 
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SEQ ID 

NO. 


Accession No. 


Species 


Description 


Smith- 

Wntermnn 

Score 


% 

Identit 

y 


6382 


K.02576 


Homo sapiens 


salivary proline-rich protein 1 


• 109 


40 


63 S3 


AL096744 


Homo sapiens 


hypothetical protein 


164 


49 


6384 


AF220264 


Homo sapiens 


MOST- 1 


119 


56 


6385 


X92485 


Plasmodium 
vivax. 


pval 


108 


65 


6386 


X53581 


Rattus 
norvegicus 


ORF4 


148 


36 


6387 


G 02639 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6720. 


116 


52 


6388 


M13100 


Rattus 
norvegicus 


unknown protein 


119 


48 


6389 


AB047948 


Macaca 
fascicularis 


hypothetical protein 


80 


33 


6390 


X53581 


Rattus 
norvegicus 


ORF4 


139 1 


55 


6391 


AK025947 


Homo sapiens 


unnamed protein product 


166 


52 


6392 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


244 


75 


6393 


M22332 


Homo sapiens 


unknown protein 


166 


39 


6394 


AF229067 


Homo sapiens 


PADI-H protein 


128 


59 


6395 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


80 


70 


6396 


G04063 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
8144. 


95 


44 


6397 


M26361 


Mus musculus 


LrNE/Ig H-chain fusion protein 


97 


61 


6398 


AF 118082 


Homo sapiens 


PRO 1902 


93 


48 


6399 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJ W 18. 


116 


89 


6400 


U93570 


Homo sapiens 


putative pi 50 


88 


43 


6401 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


147 


61 


6402 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


77 


56 


6403 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


156 


47 


6404 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


155 


56 


6405 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
IDNO:I051. 


88 


39 


6406 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


84 


68 


6407 


G00560 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4641. 


244 


90 


6408 


L27428 


Homo sapiens 


reverse transcriptase 


108 


42 


6409 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


136 


58 


6410 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


106 


44 


6411 


U70935 


Peromyscus 
manicularus 


reverse transcriptase 


154 


40 


6412 


AF090931 


Homo sapiens 


PRO0483 


99 


85 


6413 


AL031186 


Homo sapiens 


WC984G 1 . 1 (supported by FGENES) 


249 


78 


6414 


AF130089 


Homo sapiens 


PRO2550 


100 


55 


6415 


AB047963 


Macaca 
fascicularis 


hypothetical protein 


165 


57 


6416 


M22332 


Homo sapiens 


unknown protein 


111 


42 


6417 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


140 


50 


6418 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


147 


60 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Waterman 
- Sc ™ e 


% 

Idcntit 

y 


6419 


D381 12 


""Homo sapiens 


" NADH dehydrogenase subunit 1 






6420 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


"T41 


~~fl 


6421 


U49973 


Homo sapiens 


ORF2* function unknown 






6422 


G02754 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6835. 


135 


96 


6423 


AF072508 


Homo sapiens 


envelope protein 


128 


49 


6424 


AK000385 










6425 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
sinn ar to pogo e ement 


148 


54 


6426 


X92485 


P I asm odium 


P 


"Tol 


— 


6427 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


183 


80 


6428 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


97 


46 


6429 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


100 


51 






Homo sapiens 


Human secreted protein #75. 


139 


58 


6431 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


112 


47 


6432 


X92485 


Plasmodium 


pval 


83 


72 


6433 


Y35984 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 233, 


121 


59 


6434 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


66 


43 


6435 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


51 


6436 


D89501 


Homo sapiens 


similar to salivary proline-rich protein 


88 


38 


6437 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene ji cjone n u rvHJvo-j. 


113 


58 


6438 


X99467 


Medica°o 

truncatula 




~TTo ~ 




6439 


Y86574 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ IDNO"491 


129 


58 


6440 


X92485 


Plasmodium 


pval 


114 


46 


6441 


AK024455 


Homosa iens 

omo sapiens 


FLJ00047 protein 


— 


-— 

-£1 


6442 


G03790 


omo sapiens 


7871. ^ 


— - 

85 




6443 


Y36203 


Homo sapiens 


Human secreted protein #75. 


104 


56 


6444 


Y0267 1 




gene 22 clone HMSJW18. 






6445 


M24732 










6446 


AF119851 


Homo sapiens 


PRO 1722 


163 


69 




AF" 1 26 1 63 




rtrlLAj proicin 






6448 


GOO 5 77 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


58 


r 


rtXZUU 16/ 


cercopitnicine 


EBNA2-like protein 


125 


38 


6450 


X86031 


Sus scrofa 


homologue of proline/arginine-rich 
antibacterial protein 


107 


52 


6451 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


114 


54 


6452 


AF090942 


Homo sapiens 


PRO0657 


129 


65 


6453 


M19503 


Homo sapiens 


ORF1; putative 


86 


39 


6455 


X92485 


Plasmodium 
vivax 


pval 


99 


40 
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SEQID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


6456 


X86030 


Vigna 


extensin-like protein 


115 


36 


6457 


YO8061 


" Hmnc^sapiens 


Human c-myb protein fragment. 


1 13 


61 


6458 


AF055985 


Onchocerca 


pyrrolidone-rich antigen 


90 


3 7 


6459 


AF119851 


Homo sapiens 


PRO! 722 


243 


66 


6460 


J05497 


Rattus 


snRNP-associated polypeptide N 


101 


31 


6461 


AF130089 


Homo sapiens 


PRO2550 


183 


76 






Homo sapiens 


Amino acid sequence of salivary 
protein CON- 1 . 


93 


53 


6463 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


122 


72 


6464 




Homo sapiens 


Human secreted protein #28. 


85 


52 


6465 


AL 160493 


Leishniania 


probable (hhv-6)ul 102, variant a 
DNA, complete virion genome 


120 


40 


6466 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


36 


6467 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
J 874, : 


96 


58 


646g 


G02827 


Homo sapiens 
— — 


Human secreted protein, SEQ ID NO: 
6908 •_ 


121 


67 


6469 


X92485 


Plasmodium 
vivax 


pval 


89 


72 


6470 


G0j786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


56 


6471 


AF130089 


Homo sapiens 


PRO2550 


155 


83 


6472 




Homo sapiens 


putative pi 50 


131 


42 


6473 


AF130079 


Homo sapiens 




1 1 1 


54 


6474 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


51 


6475 


AF220264 


Homo sapiens 


MOST-1 


311 


58 


6476 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


45 


6477 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


106 


35 


6478 


X92485 


Plasmodium 


pval 


220 


55 


6480 


G03714 


Homo sapiens 


— : 

Human secreted protein, SEQ ID NO; 
7795. 


1 12 


61 


6481 


U49973 


Homo sapiens 


ORF2: function unknown 


96 


42 


6482 


U 15647 


Mus musculus 


reverse transcriptase 


113 


61 


6483 


X92485 


Plasmodium 


pval 


181 


49 


6484 


L27428 


Homo sapiens 


reverse transcriptase 


135 


50 


6485 


Y7j966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #1 53. 


144 


52 


6486 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


174 


63 


6487 


U93569 


Homo sapiens 


p40 


176 


53 




Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
IDNO.1051. 


72 


42 






Homo sapiens 


chromosomal protein 


93 


33 


6490 


G04072 


Homo sapiens 


8153. 






6491 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


155 


70 


6492 


U49973 


Homo sapiens 


ORFI; MER37; putative transposase 
similar to pogo element 


251 


75 


6493 


U 15647 


Mus musculus 


reverse transcriptase 


100 


40 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

ldentit 

y 


6494 


AK027208 


Homo sapiens 




unnamed protein product 




68 


6495 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


135 


52 


6496 


AF130089 


Homo sapiens 


PRO2550 


230 


69 


6497 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


237 


62 


6498 


AF090894 


Homo sapiens 


PRO01 13 


140 


68 


6499 


Y03000 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 12 1 . 


138 


84 


6500 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


123 


69 


6501 


M69297 


Homo sapiens 


ORF 3 


87 


58 


6502 


M22332 


Homo sapiens 


unknown protein 


81 


44 


6503 


X92485 


Plasmodium 
vivax 


pval 


111 


52 


6504 


AF090895 


Homo sapiens 


PRO0U7 


131 


60 


6505 


Y73932 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #119. 


97 


69 


6506 


U93568 


Homo sapiens 


putative pi 50 


122 


32 


6507 


X86029 


Vigna 
unguiculata 


extensin-like protein 


177 


32 


6508 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


282 


60 


6509 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRJB2. 


142 


61 


6510 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


93 


35 


6511 


U52077 


Homo sapiens 


mariner transposase 


372 


73 


6512 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


81 


50 


6513 


AF090928 


Homo sapiens 


PRO0470 


97 


39 


6514 


AF134827 


Monodelphis 
domestica 


small nuclear ribonucleoprotein B' 


108 


47 


6515 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


82 


44 


6516 


AF068754 


Homo sapiens 


heat shock factor binding protein 1 
HSBPI 


118 


69 


6517 


AF130089 


Homo sapiens 


PRO2550 


207 


69 


6518 


M22332 


Homo sapiens 


unknown protein 


154 


55 


6519 


G 02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


95 


39 


6520 


G031 19 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7200. 


65 


37 


6521 


AK023563 


Homo sapiens 


unnamed protein product 


268 


58 


6522 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


181 


61 


6523 


AF2 10651 


Homo sapiens 


NAG 18 


85 


85 


6524 


AF130079 


Homo sapiens 


PR02852 


127 


51 


6525 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


113 


51 


6526 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


85 


62 


6527 


U4247 1 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


1 15 


48 


6528 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


86 


60 


6529 


R59842 


Homo sapiens 


ApoE4Ll protease. 


92 


58 


6530 


R59842 


Homo sapiens 


ApoE4Ll protease. 


135 


77 


6531 


W48351 


Homo sapiens 


Human breast cancer related protein 


88 


60 
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SEQ ID 
NO: 


Accession No. 


Species 


Description' 1 *<'*H t 


Waterman 


ldentit 

y 








BCRB2. 






— ^ 


Ar i J VSo I 


Homo sapiens 


PRO 1722 


121 


61 




AF063S66 


Melanopkis 
sanguinipes 
entomopoxvirus 


ORF MSV233 hypothetical protein 


85 


35 


6534 


G02469 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


59 


6535 


R95913 


Homo sapiens 


Neural thread protein. 


132 


67 







Homo sapiens 


PR02852 


1 13 


58 


65j7 

._ 


AB047600 


Macaca 
fascicularis 


hypothetical protein 


159 


60 




G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


279 


68 


6539 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


1 12 


55 


6540 


AF130089 


Homo sapiens 


PRO2550 


124 


56 


6541 


U93574 


Homo sapiens 


putative pi 50 


159 


39 


6542 


AF1 18082 


Homo sapiens 


PRO 1902 


114 


45 


6543 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


78 


50 


6544 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


118 


44 


6545 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


138 


62 


6546 


AF1 16661 


Homo sapiens 


PR01438 


162 


59 


6547 


S62928 


Homo sapiens 


PRB1M protein precursor 


116 


48 


6548 


AB030192 


Mus musculus 


homologueof S70447 FIFo-ATP 
synthase complex Fo membrane 
domain f subunit 


174 


63 


6549 


AK025116 


Homo sapiens 


unnamed protein product 


136 


56 


6550 


X92485 


Plasmodium 
vivax 


pval 


105 


56 


6551 


AF130079 


Homo sapiens 


PR02852 


172 


91 


6552 


U70935 


Peromyscus 
maniculatus ■ 


reverse transcriptase 


84 


42 


6553 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


100 


55 


6554 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


110 


39 


6555 


AF130089 


Homo sapiens 


PRO2550 


205 


64 




AF 130079 


Homo sapiens 


PR02852 


165 


48 


6557 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhem) gene preoduct. 


160 


74 


6558 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


133 


49 


6559 


X92485 


Plasmodium 
vivax 


pval 


121 


78 


6560 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


76 


45 


6561 


AF2 17374 


Acanthaster 
planci 


cytochrome oxidase subunit I 


234 


85 


6562 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


61 


6563 


Y95028 


Homo sapiens 


Human clone vp7 1 ORF2, SEQ ID 
NO:128. 


134 


84 


6564 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


66 


47 


6565 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


180 


49 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


6566 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


96 


57 


6567 


X92485 


Plasmodium 
vivax 


pval 


96 


58 


6568 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


103 


73 


6569 


AF1 19851 


Homo sapiens 


PRO 1722 


81 


60 


6570 


AF289098 


Cladrastis 
kentukea 


ENOD2 


1 10 


36 


6571 


AF1 18086 


Homo sapiens 


PRO 1992 


115 


65 


6572 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


142 


57 


65 7 j 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


89 


41 


6574 


M22332 


Homo sapiens 


unknown protein 


95 


34 


6575 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


98 


64 


6576 


AK021848 


Homo sapiens 


unnamed protein product 


131 


41 


6577 


AF161356 


Homo sapiens 


HSPC093 


155 


57 


6578 


AF 116909 


Homo sapiens 


unknown 


75 


76 


6579 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


95 


53 


6580 


AF130089 


Homo sapiens 


PRO2550 


276 


74 


6581 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


119 


33 


6582 


U49973 


Homo sapiens 


ORF2: function unknown 


94 


62 


6583 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


90 


65 


6584 


AJ249395 


Globodera 
pallida 


NADH-ubiquinone oxidoreductase 
subunit 4 


99 


41 


6585 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


75 


62 


6586 


U49973 


Homo sapiens 


ORF2: function unknown 


134 


58 


6587 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


112 


47 


6588 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


217 


53 


6589 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


147 


41 


6590 


U43627 


Arabidopsis 
thaliana 


extensin 


103 


31 


6591 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


71 


34 


6592 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


177 


66 


6593 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


100 


72 


6594 


M69297 


Homo sapiens 


ORF 3 


137 


43 


6595 


X68101 


Rattus 
norvegicus 


fg 


372 


89 


6596 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


81 


62 


6597 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


99 


43 


6598 


Z997I4 


Homo sapiens 


bK1048E9.5 (novel protein) 


1642 


98 


6599 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 






6600 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


132 


65 


6601 


AK000385 


Homo sapiens 


unnamed protein product 


187 


74 


6602 


AF130051 


Homo sapiens 


PRO0898 


137 


54 


6603 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


88 


48 
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1 SEQID 
NO: 


Accession No. 


Species 


•Description ' -t 


Smith- 
Score" 1 "" 1 


% 

fcientu 








terminal, clone XEH.Sc} 






6604 


X61047 


Hydra sp. 


mini-collagen 


116 


45 


6605 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


109 


71 


6606 


G02S72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


212 


66 


6607 


AF232673 


Homo sapiens 


B17 long form 


1272 


99 


6608 


AF 118082 


Homo sapiens 


PRO 1902 


77 


52 


6609 


R13556 


Homo sapiens 


Protein encoded downstream of hhc__M 
oncoprotein. 


132 


74 


6610 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


70 


61 


6611 


Y36203 


Homo sapiens 


Human secreted protein #75. 


140 


62 


6612 


X92485 


Plasmodium 
vivax 


pval 


157 


72 


6613 


U52077 


Homo sapiens 


mariner transposase 


260 


67 


6614 


AF161392 


Homo sapiens 


HSPC274 


138 


45 


6615 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


93 


65 


6616 


AF 119851 


Homo sapiens 


PRO 1722 


147 


49 


6617 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


101 


63 


6618 


AL390U4 


Leishmania 
major 


extremely cysteine/valine rich protein 


117 


39 


6619 


Y64890 


Homo sapiens 


Human 5* EST related polypeptide SEQ 
IDNO:1051. 


105 


58 


6620 


U15647 


Mus musculus 


reverse transcriptase 


106 


38 


6621 


AF130089 


Homo sapiens 


PRO2550 


171 


52 


6622 


X92485 


Plasmodium 
vivax 


pval 


139 


44 


6623 


AF090895 


Homo sapiens 


PRO0117 


161 


58 


6624 


M63730 


Homo sapiens 


BPAG2 1 


107 


32 


6625 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


183 


61 


6626 


AF078860 


Homo sapiens 


PTD007 


69 


73 


6627 


AL080253 


Arabidopsis 
thaliana 


putative snRNP protein 


101 


37 


6628 


AK000385 


Homo sapiens 


unnamed protein product 


140 


57 


6629 


W60652 


Homo sapiens 


Human DVic-1 (alternative transcript). 


218 


51 


6630 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


152 


66 


6631 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


118 


65 


6632 


X92485 


Plasmodium 
vivax 


pval 


119 


88 


6633 


X03145 


Homo sapiens 


pot. ORF V 


98 


39 


6634 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


165 


41 


6635 


X59448 


Drosophila 
melanogaster 


dynamin 


111 


39 


6636 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


162 


50 


6637 


AF 130089 


Homo sapiens 


PRO2550 


14] 


75 


6638 


AF1 18082 


Homo sapiens 


PRO 1902 


108 


63 


6639 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


144 


60 


6640 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


104 


47 


6641 


AF 118082 


Homo sapiens 


PRO 1902 


121 


53 


6642 


AF1 18082 


Homo sapiens 


PRO 1902 


195 


68 


6643 


R95913 


Homo sapiens 


Neural thread protein. 


212 


65 
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SEQID 

NO: 


Accession No. 




Description 


Smith- 
Waterman 
Score 


% 

Identic 

y 


6644 


AF090928 


Homo sapiens 


rKUUI l\J 








Ar 1 JUL) /y 


Homo sapiens 




~rn 

— 


~75 

-|| 


6646 


AF003535 


Homo sapiens 


ORF2-like protein 






6647 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


169 


61 


6648 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


A 


42 


6649 


M15530 


Homo sapiens 


B-cell growth factor 






6650 


AF090942 


Homo sapiens 


PRO0657 


119 


53 


6651 


AF2 18082 


Homo sapiens 


PRO 1 902 


130 


52 


6652 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


143 


87 


6653 


AF130089 


Homo sapiens 


PRO2550 


143 


63 


6654 


AF119851 


Homo sapiens 


PRO 1722 


285 


70 


6655 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


103 


75 


6656 


AF1 19855 


Homo sapiens 


PRO 1847 


61 


60 


6657 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


107 


36 


6658 


G00577 • 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


84 


61 


6659 


AF1 19851 


Homo sapiens 


PRO 1722 


177 


54 


6660 


X92485 


Plasmodium 
vivax 


pval 


104 


55 


6661 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


116 


57 


6662 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


68 


6663 


L22029 


Glycine max 


hydroxyproline-rich glycoprotein 


115 


35 


6664 


D00570 


Mus musculus 


open reading frame (25 1 AA) 


126 


35 


6665 


AF130051 


Homo sapiens 


PRO0898 


129 


77 


6666 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


108 


42 


6667 


G02851 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6932. 


71 


41 . 


6668 


AF 13 0079 


Homo sapiens 


PR02852 


1 10 


59 


6669 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


113 


66 


6670 


G00527 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4608. 


98 


73 


6671 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


80 


51 


6672 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


113 


36 


6673 


V01201 


Simian sarcoma 
virus 


coding sequence of pol 


143 


47 


6674 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


159 


78 


6675 


G00648 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4729. 


64 


56 


6676 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


96 


48 


6677 


AF 130089 


Homo sapiens 


PRO2550 


100 


75 


6678 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 




68 


6679 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


105 


71 


6680 


G03812 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7893. 


218 


53 


668] 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


53 
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SEQ ID 

NO: 


Accession No. 


Species 


Description ■""<■' 


Smith- 
Waterman 
JScore 


% 

Identit 

y 








4433. 






6682 


AF161356 


Homo sapiens 


HSPC093 


163 


47 


6683 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


78 


72 


6684 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


87 


42 


6685 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


98 


43 


6686 


U93564 


Homo sapiens 


p40 


96 


41 


6687 


AF1 19855 


Homo sapiens 


PRO 1847 


137 


67 


6688 


U52077 


Homo sapiens 


manner transposase 


268 


81 


6689 


X92485 


Plasmodium 
vivax 


pval 


97 


58 


6690 


W48351 


Homo sapiens 


Human breast cancer related protein • 
BCRB2. 


133 


56 


6691 


M63274 


Plasmodium 
falciparum 


malaria antigen 


61 


46 


6692 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


147 


80 


6693 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


3 15 


40 


6694 


G02451 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6532. 


131 


50 


6695 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


174 


66 


6696 


AF068294 


Homo sapiens 


HDCMB45P 


154 


59 


6697 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


112 


41 


6698 


M83293 


Simian 

immunodeficienc 
y virus 


vpx protein 


85 


41 


6699 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5327. 


169 


88 


6700 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


121 


58 


6701 


AF1 18080 


Homo sapiens 


PRO 1880 


127 


59 


6702 


X92485 


Plasmodium 
vivax 


pval 


106 


58 


6703 


AF161356 


Homo sapiens 


HSPC093 


155 


53 


6704 


X92485 


Plasmodium 
vivax 


pval 


364 


54 


6705 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


172 


59 


6706 


AK025326 


Homo sapiens 


unnamed protein product 


146 


50 


6707 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


112 


43 


6708 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


119 


71 


6709 


AF176531 


Mus musculus 


F-box protein FBX16 


324 


60 


6710 


X92485 


Plasmodium 
vivax 


pval 


96 


64 


6711 


AF130089 


Homo sapiens 


PRO2550 


203 


67 


6712 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


150 


59 


6713 


AF090944 


Homo sapiens 


PRO0663 


134 


43 


6714 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


110 


47 


6715 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #153. 


133 


60 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


6716 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


103 


65 


6717 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


115 


46 


6718 


GO 1222 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5303. 


54 


100 


6719 


Yj>620.> 


Homo sapiens 


Human secreted protein #75. 


-122. 




6720 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


— 

64 


6721 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


130 


61 


6722 


Y36203 


Homo sapiens 


Human secreted protein #75. 


167 


72 


6723 


AF0]0144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


1 16 


46 


6724 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


133 


42 


6725 


U80761 


Homo sapiens 


CTG26 alternate open reading frame 


88 


76 


6726 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


218 


61 


6727 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


399 


81 


6728 


U01877 


Homo sapiens 


p3 00 protein 


1 19 


27 


6729 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


107 


51 


6730 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


130 


58 


6731 


X66285 


Mus musculus 


HC1 ORF 


79 


38 


6732 


K02576 


Homo sapiens 


salivary proltne-rich protein 1 


1 14 


38 


6733 


AF057352 


Homo sapiens 


hepatocellular carcinoma autoantigen 


148 


81 j 


6734 


AF090895 


Homo sapiens 


PRO0J 17 


152 


57 


6735 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


164 


61 


6736 


X92485 


Plasmodium 
vivax 


pval 


92 


89 


6737 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


1 12 


49 


6738 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


278 


72 


6739 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


191 


47 


6740 


AF1 18082 


Homo sapiens 


PRO 1902 


83 


49 


6741 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


83 


43 


6742 


Y36708 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 62. 


71 


64 


6743 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


177 


70 


6744 


Y91475 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 25 SEQ ID NO: 148. 


159 


50 


6745 


X92485 


Plasmodium 
vivax 


pval 


112 


39 


6746 


Z82268 


Caenorhabditis 
elegans 


predicted using Genefmder-contains 
similarity to Pfam domain: PF01391 
(Collagen triple helix repeat (20 
copies)), Score=73.8, E-value=1.2e-18, 
N=2; PF01484 (Nematode cuticle 
collagen N-terminal domain), 
Score=37.5, E-value=le-07, 
N=l~cDNA EST yk92fl.5 comes from 
this gene-cDNA EST yk55f6.5 comes 
from this gene-cDNA EST yk65b7.3 
comes from this gene-cDNA EST 


108 


36 
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SEQ ID 

ISO: 


Accession No. 


Species 


jDescription y„ 


Smith- 
Wflferman 


% 

Idcntit 

y 








yk58f3.5 comes from this gene-cDNA 
EST yk72h5.5 comes from this 
gene-cDNA EST yk55c4.5 comes from 
this gene-cDNA ESTyk58hl.5 comes 
from this gene-cDNA ESTyk65al2.5 
comes from this gene-cDNA EST 
yk94f4.5 comes from this gene 






6747 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


113 


38 


6748 


R95913 


Homo sapiens 


Neural thread protein. 


108 


56 


6749 


U57747 


Mus musculus 


platelet-activating factor 
acetylhydrolase isoform lb beta subunit 


209 


69 


6750 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


159 


65 


6751 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


96 


48 


6752 


AF090895 


Homo sapiens 


PRO0117 


115 


56 


6753 


AF2 17973 


Homo sapiens 


unknown 


105 


66 


6754 


AF1 19851 


Homo sapiens 


PRO 1722 


156 


55 


6755 


AF1 19855 


Homo sapiens 


PR01847 


189 


66 


6756 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


163 


55 


6757 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


123 


57 


6758 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7534. 


156 


70 


6759 


AF130051 


Homo sapiens 


PRO0898 


98 


68 


6760 


M36914 


Zea mays 


cell wall protein (put.); putative 


75 


28 


6761 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


98 


85 


6762 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


88 


62 


6763 


Ml 9756 


Homo sapiens 


MHC HLA-Bw47 precursor 


580 


92 


6764 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) u, 1 1 02, variant a dna, 
complete virion genome. 


115 


52 


6765 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


321 


51 


6766 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


124 


59 


6767 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


303 


64 


6768 


AF061340 


Artibeus 
jamaicensis 


cytochrome c oxidase subunit II 


213 


67 


6769 


T71265_cdl 


Homo sapiens 


19-JAN-1996 Neuronal apoptosis 
inhibitor protein (NAIP) cDNA. 


188 


59 


6770 


B03390 


Homo sapiens 


Neuron-associated protein. 


92 


55 


6771 


AFO 17777 


Drosophila 
melanogaster 


la costa 


114 


50 


6772 


Z34955 


Sus scrofa 


protein phosphatase 2A 65 kDa 
regulatory subunit, alpha isoform 


331 


93 


6773 


X61296 


Rattus 
norvegicus 


open reading frame 2 


101 


37 


6774 


AL1 17345 


Homo sapiens 


dJ416F21 .2 (novel protein) 


639 


91 


6775 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


110 


61 


6776 


AJ007798 


Homo sapiens 


stromal antigen 3, (STAG3) 


238 


76 


6777 


AB029065 


Mus musculus 


Ser/Thr kinase KKlAMRE-beta 


165 


66 


6778 


M37400 


Homo sapiens 


aspartate aminotransferase 


208 


68 


6779 


AF130089 


Homo sapiens 


PRO2550 


127 


52 


6780 


AF130079 


Homo sapiens 


PR02852 


332 


81 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 


6781 


AL049748 


Homo sapiens 


dJ4JP2.2 (RNA binding motif protein 9 
(isoform 1)) 


264 




6782 


LI 1007 


norvegicus 


cyclin-dependent kinase 4 


239 


75 


6783 


AF3 19938 


Thermus 
thermophilus 


competence factor ComEA 


~T\5 


- — 


67S4 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


119 


44 


6785 


L16864 


fever vinis 








6786 


Y36203 


Homo sapiens 


Human secreted protein $75. 


99 




6787 


AF130087 


Homo sapiens 


PR0241 1 


172 


65 


6788 


AF208846 




BM-004 






6789 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


90 


83 


6790 


AP000059 


Aeropyrum 


1 20aa long hypothetical protein 


81 


43 


6791 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


146 


53 


6792 


Y12961 


Homo sapiens 


secreted peptide. 


~T75 


— 


6793 


Y73442 


Homo sapiens 


Human secreted protein clone ya66_l 
protein sequence SEQ ID NO: 106. 


565 


93 


6794 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPIVTRQ32 


l->6 


50 


6795 


AC004022 


Homo sapiens 


serum paraoxonasearylesterase 3 


137 


93 




r\r UU7OO0 


multiple 
sclerosis 


polyprotein 




60 


6797 


R59842 


Homo sapiens 


ApoE4L l protease 


121 


— 


6798 


U88167 


Caenorhabditis 


contains similarity to C2 domains 


171 


73 


6799 


AF293024 


Homo sapiens 


steroid receptor RNA activator isoform 
1 


767 


97 


6800 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


123 


65 


6801 


BO 1203 










6802 


AL359782 


Trypanosoma 


possible (hhv-6) u 1 102, variant a dna, 


125 


47 


6803 


AF078036 


Homo sapiens 


RelA associated inhibitor; NF-kappaB 


1275 


95 


6804 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


131 


53 


6805 


Y57946 


Homo sapiens 


Human transmembrane protein 
HTMPN-70. 


1128 


100 


6806 


G03714 


Homo sa iens 

omo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


— 


— 

81 


6807 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


129 


52 


6808 


AB046048 


Macaca 


unnamed portein product 


273 


63 


6809 


K02401 


Homo sa iens 


chorionic somatomammotropin 


~446 


— 


6810 


AF 130089 


Homo sapiens 


PRO2550 


150 


56 


6811 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


123 




6812 


G03295 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7376. 


93 


38 


6813 


AF090930 


Homo sapiens 


PRO0478 


103 


65 


6814 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


138 


54 
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SEQ ID 

NO: 


Accession No. 




Description^' 1 '' ;< \ 


I Waterman 
Score 


Identit 

y 


6815 


B24423 


Homo sapiens 


Human PRO 1244 protein sequence 
SEQ ID NO. 193. 


484 


95 


6816 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


105 


43 


6817 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


96 


56 


6818 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


102 


48 


6819 


U82303 


Homo sapiens 


unknown 


105 


57 


6820 


G0409I 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
8172. 


113 


45 


6821 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


97 


39 


6822 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5327. 


146 


86 


6823 


U52077 


Homo sapiens 


mariner transposase 


192 


78 


6824 


AC004237 


Homo sapiens 


KIF3 


129 


86 


6825 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


138 


66 


6826 


G03295 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7376. 


112 


64 


6827 


G03246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


108 


68 


6828 


U49974 


Homo sapiens 


mariner transposase 


103 


86 


6829 


AF119851 


Homo sapiens 


PRO 1722 


115 


53 


6830 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


82 


62 


6831 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


94 


73 


6832 


Y19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


131 


53 


6833 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


74 


50 


6834 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


168 


61 


6835 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


140 


60 


6836 


X92485 


Plasmodium 
vivax 


pval 


99 


58 


6837 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


147 


42 


6838 


AB047600 


Macaca 
fascicularis 


hypothetical protein 


134 • 


52 


6839 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


141 


60 


6840 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


126 


73 


6841 


AF1 30089 


Homo sapiens 


PRO2550 


167 


63 


6842 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7324. 


140 


56 


6843 


AF090894 


Homo sapiens 


PRO0113 


148 


62 


6844 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


119 


51 


6845 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


75 


61 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7376. 


65 


44 


6847 


AF130089 


Homo sapiens 


PRO2550 


160 


76 


6848 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


132 


68 


6849 


AF1 16715 


Homo sapiens 


PR02829 


123 


67 


6850 


Y01437 


Homo sapiens 


Secreted protein encoded by gene 55 


142 


96 



312 



0164835A2 I > 



WO, 01/64835 



PCT/US01/04927 



SEQ ID 


Accession INo. 




Description 


Smith- 


% 


NO: 








Waterman 


Itlentit 










Score 


y 








clone HLMBP18. 






6851 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 10 


76 








7879. 






6852 


AF068294 


Homo sapiens 


HDCMB45P 


161 


56 


6853 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


57 








6613. 






6854 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


165 


72 








7874. 






6855 


AF13005I 


Homo sapiens 


PRO0898 


117 


61 


6856 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


72 








7879. 






6857 


X556S5 


Lycopersicon 


extensin (class I) 


190 


45 






esculentum 








6858 


AL359782 


Trypanosoma 


possible (hhv-6) u 1 1 02, variant a dna, 


164 


69 






brucei 


complete virion genome. 






6859 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


90 


52 


6860 


AF1 19851 


Homo sapiens 


PRO! 722 


331 


73 


6861 


G00418 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


67 








4499. 






6862 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


53 








7870. 






6863 


X92485 


Plasmodium 


pval 


105 


63 






vivax 








6864 


AF068294 


Homo sapiens 


HDCMB45P 


206 


61 


6865 


AF068294 


Homo sapiens 


HDCMB45P 


128 


46 


6866 


L00016 


Homo sapiens 


urf4 


157 


58 


6867 


X92485 


Plasmodium 


pval 


95 


89 






vivax 








6868 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


148 


77 








SEQ ID NO: 163. 






6869 


AF130089 


Homo sapiens 


PRO2550 


162 


67 


6870 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


257 


65 








encoded by gene 17. 






6871 


X71972 


Mus musculus 


ribosomal protein S24 


211 


85 


6872 


M280I6 


Homo sapiens 


cytochrome b 


72 


93 


6873 


M24069 


Homo sapiens 


DNA-binding protein A 


1662 


79 


6874 


AF130051 


Homo sapiens 


PRO0898 


163 


76 


6875 


AF006061 


Homo sapiens 


placental growth hormone isoform 


263 


90 








hGH-V3 






6876 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


80 








5066. 






6877- 


AF130079 


Homo sapiens 


PR02852 


130 


76 


6878 


S80905 


Homo sapiens 


Conl 


148 


38 


6879 


X55683 


Lycopersicon 


extensin (class I) 


70 


35 






esculentum 








6880 


U16359 


Rattus 


nitric oxide synthase 


106 


94 






norvegicus 








6881 


AK025047 


Homo sapiens 


unnamed protein product 


167 


50 


6882 


X92485 


Plasmodium 


pval 


110 


63 






vivax 








6883 


W48351 


Homo sapiens 


Human breast cancer related protein 


185 


46 








BCRB2. 






6884 


AK024455 


Homo sapiens 


FLJ00047 protein 


81 


64 


6885 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


47 








6574. 






6886 


AFO 14008 


Bos taurus 


myocardial vascular inhibition factor 


158 


96 


6887 


AF265575 


Homo sapiens 


. ubiquitous TPR-motif protein Y 


183 


83 








isoform 






6888 


Y45382 


Homo sapiens 


Human secreted protein fragment 


199 


49 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


" % 

y 








encoded from gene 28. 






6889 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


140 


67 


6890 


AF130051 


Homo sapiens 


PRO0898 


135 


65 


6891 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


147 


65 


6892 


X72963 


Nicotiana 
tabacum 


pAPS product 


80 


44 


6893 


AF073957 


Homo sapiens 


CXC chemokine BRAK 


399. 


82 


6894 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


160 


80 


6895 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO.I63. 


150 


70 


6896 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


178 


85 


6897 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


142 


62 


6898 


AF1 18082 


Homo sapiens 


PRO 1902 


144 


53 


6899 


AF 130051 


Homo sapiens 


PRO0898 


187 


72 


6900 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


121 


85 


6901 


AF130089 


Homo sapiens 


PRO2550 


100 


63 


6902 


Z29446 


Hepatitis C virus 


core region 


60 


36 


6903 


AF 107406 


Homo sapiens 


GW128 


71 


63 


6904 


W03627 


Homo sapiens 


Human follicle stimulating hormone 
GPR N-terminal sequence. 


173 


64 


6905 


AF130089 


Homo sapiens 


PRO2550 


88 


65 


6906 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


78 


84 


6907 


U93569 


Homo sapiens 


p40 


145 


48 


6908 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


186 


50 


6909 


AF1300S9 


Homo sapiens 


PRO2550 


244 


77 


6910 


Ml 8093 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


119 


38 


6911 


AF130089 


Homo sapiens 


PRO2550 


139 


47 


6912 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


146 


60 


6913 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


131 


70 


6914 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


106 


74 


6915 


AF161361 


Homo sapiens 


HSPC098 


79 


66 


6916 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


207 


60 


6917 


AF130051 


Homo sapiens 


PRO0898 


258 


61 


6918 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


108 


65 


6919 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


131 


58 


6920 


G02635 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6716. 


168 


80 


6921 


AF1 16661 


Homo sapiens 


PR01438 


71 


48 


6922 


AF090940 


Homo sapiens 


PRO0644 


72 


85 


6923 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


186 


70 


6924 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


89 


55 


6925 


AF229067 


Homo sapiens 


PADI-H protein 


122. 


72 
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SEQ1D 


Accession No. 




Description 


Smith- 


% 


NO: 








Wsuerman 


Identit 










Score 


y 


6926 


W50192 


Homo sapiens 


Amino acid sequence of salivary 


105 


37 








protein CON-1. 






6927 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID MO: 


173 


60 








7881. 






6928 


X56603 


Mus musculus 


mouse 57-KD Calcium-binding protein 


103 


52 








(MCaBP) 






6929 


AFl 19851 


Homo sapiens 


PRO 1722 


148 


76 


6930 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


96 


61 








7868. 






6931 


AF077208 


Homo sapiens 


HSPC022 


155 


72 


6932 


Y 19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


111 


62 


6933 


M22332 


Homo sapiens 


unknown protein 


154 


77 


6934 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


146 


60 








gene 22 clone HMSJW18. 






6935 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


168 


63 








6613. 






6936 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


50 








4457. 






6937 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


77 








6613. 






6938 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


68 








7795. 






6939 


AFl 18082 


Homo sapiens 


PRO 1902 


120 


54 


6940 


AF090942 


Homo sapiens 


PRO0657 


149 


72 


6941 


M22332 


Homo sapiens 


unknown protein 


96 


51 


6942 


AF130087 


Homo sapiens 


PR02411 


132 


58 


6943 


Ml 5530 


Homo sapiens 


B-cell growth factor 


97 


52 


6944 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


52 








4433. 






6945 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


73 


66 








7871. 






6946 


Z10561_cdl 


Homo sapiens 


12-MAR-1998 DNA encoding G 


330 


96 








protein-coupled receptor protein 












designated SREB2. 






6947 


W53869 


Homo sapiens 


Gravin polypeptide fragment, residues 


170 


89 








1526-1780. 






6948 


W48351 


Homo sapiens 


Human breast cancer related protein 


164 


86 








BCRB2. 






6949 


AFl 18086 


Homo sapiens 


PRO 1992 


117 


86 


6950 


AF090928 


Homo sapiens 


PRO0470 


1 15 


58 


6951 


AFl 30089 


Homo sapiens 


PRO2550 


124 


69 


6952 


Y38397 


Homo sapiens 


Human secreted protein encoded by 


94 


64 








gene No. 12. 






6953 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


55 








6613. 






6954 


X92485 


Plasmodium 


pval 


108 


64 






vivax 








6955 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


117 


85 








terminal, clone XEH.8c} 






6956 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


53 








6574. 






6957 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


106 


50 








6983. 






6958 


AF130089 


Homo sapiens 


PRO2550 


127 


67 


6959 


Y02749 


Homo sapiens 


Human secreted protein encoded by 


119 


49 








gene 100 clone HNFIU96. 






6960 


J02459 


bacteriophage 


U (tail component; 131) 


492 


92 






lambda 








6961 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


180 


68 



315 
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NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 








7214. 






6962 


Z66490 


Salmo trutta 


slow myotomal muscle tropomyosin 


182 


72 


6963 


AF 13 0089 


Homo sapiens 


PRO2550 


108 


73 


6964 


W2I733 


Homo sapiens 


NlP-1 encoded by clone 59. 


131 


69 


6965 


AF161356 


Homo sapiens 


HSPC093 


58 


48 


6966 


AB047934 


Macaca 
fascicularis 


hypothetical protein 


146 




6967 


X92485 


Plasmodium 
vivax 








6968 


AF1 18082 


Homo sapiens 


PRO 1 902 






6969 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


176 


63 


6970 


AF1 16712 


Homo sapiens 


PR02738 


101 


56 


6971 


X96667 


Homo sapiens 


sialyltransferase 






6972 


Y86574 


Homo sapiens 


fragment, SEQ IDNO:491. 


150 


82 


6973 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


176 


"58 


6974 


AF 162 149 


Mycoplasma 
bovis 


variable surface lipoprotein 


143 


40 


6975 


W17971 


Homo sapiens 


RAC protein kinase C-terminal binding ■ 
protein C-terminal region. 


135 


81 


6976 


AF1 30089 


Homo sapiens 


PRO2550 


345 


78 


6977 


AF130089 


Homo sapiens 


PRO2550 


195 




6978 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 


75 


6979 


M60510 


Mus muscuius 


beta-D-gafactosidase fusion protein 


96 


— 


6980 


AF1 18086 


Homo sapiens 


PRO 1992 


131 


— 


6981 


AF1 30089 


Homo sapiens 


PRO2550 


132 


64 


6982 


L27428 


Homo sapiens 


reverse transcriptase 


128 




6983 


U93566 


Homo sapiens 


p40 


98 


"17 


6984 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


138 


— 


6985 


L27428 


Homo sapiens 


reverse transcriptase 


133 




6986 


AF090895 


Homo sapiens 


PRO01 17 







6987 


G00528 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4609. 


212 


60 


6988 


Y36203 


Homo sapiens 


Human secreted protein #75. 


189 


73 


6989 


AF1 16712 


Homo sapiens 


PR02738 


87 




6990 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


128 


63 


6991 


AFO90930 


Homo sapiens 


PRO0478 


154 


52 


6992 


G03786 


Homo sapiens 


7867. 






6993 


G02872 


Homo sapiens 


6953. 






6994 


G02753 


Homo sapiens 


6834. 






6995 


J02974 


castellanii 


myosin IB heavy chain 


120 


43 


6996 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


151 


69 


6997 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


153 


60 


6998 


Y12961 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


114 


66 


6999 


AF1 13685 


Homo sapiens 


PRO0974 


88 


58 


7000 


AF130089 


Homo sapiens 


PRO2550 


146 


58 
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SEQ ID 


Accession No. 


Species 


Description 


Smitli- 


% 


NO: 






Identit 










Score rman 




7001 


U93565 


Homo sapiens 


putative pi 50 


96 


46 


7002 


Y02749 


Homo sapiens 


Human secreted protein encoded by 


146 


57 








gene 100 clone HNFIU96. 






7003 


Y91577 


Homo sapiens 


Human secreted protein sequence 


206 


92 








encoded by gene 2 SEQ ID NO.250. 






7004 


AF1 30089 


Homo sapiens 


PRO2550 


104 


50 


7005 


AL137472 


Homo sapiens 


hypothetical protein 


179 


55 


7006 


Y91577 


Homo sapiens 


Human secreted protein sequence 


379 


98 








encoded by gene 2 SEQ ID NO:250. • 






7007 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


38 








7871. 






7008 


AF1 19851 


Homo sapiens 


PRO 1722 


136 


63 


7009 


AF2 18028 


Homo sapiens 


unknown 


104 


52 


7010 


AF 130079 


Homo sapiens 


PR02852 


129 


60 


7011 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 


1 10 


54 








IDNO:1051. 






7012 


X03145 


Homo sapiens 


pot ORF VI 


130 


65 


7013 


AF090944 


Homo sapiens 


PRO0663 


171 


75 


7014 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


122 


65 








gene 22 clone HMSJW1 8. 






7015 


M26460 


Homo sapiens 


retinoblastoma 1 


113 


37 


7016 


X92485 




pval 


1 12 


56 






vivax 








7017 


X92485 


Plasmodium — 


1 


144 


57 






vivax 








7018 


AF130079 


Homo sapiens 


PR02852 


224 


74 


7019 


Y02785 


Homo sapiens 


Human secreted protein encoded by 


129 


43 








gene 51 clone HUKEX85. 






7020 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


58 








7795. 






7021 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


106 


69 








encoded by gene 90. 






7022 


AF090901 


Homo sapiens 


PRO0195 


103 


88 


7023 


AF1 16661 


Homo sapiens 


PRO 143 8 


120 


47 


7024 


AP002460 


Arabidopsis 


gene id:FlD9.26~unknown protein 


325 


60 






thaliana 








7025 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 


126 


63 








derived protein #153. 






7026 


AF 11 9900 


Homo sapiens 


PR02822 


111 


70 


7027 


AF130051 


Homo sapiens 


PRO0898 


133 


44 


7028 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 


179 


68 








derived protein #153. 






7029 


Y19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


72 


60 


7030 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


50 








6613. 






7031 


J03941 


Mus musculus 


ferritin heavy chain 


124 


57 


7032 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


59 








7795. 






7033 


P70494 


Homo sapiens 


Sequence of human B-cell growth 


120 


65 








factor (BCGF). 






7034 


Y36156 


Homo sapiens 


Human secreted protein #28. 


164 


48 


7035 


U80761 


Homo sapiens 


CTG26 alternate open reading frame 


116 


48 


7036 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


130 


52 








clone HPMBQ32. 






7037 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


55 








6953. 






7038 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


78 


68 








6619. 






7039 


AF130089 


Homo sapiens 


PRO2550 


121 


65 
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SEQ ID 
NO: 


Accession No. 


Spec,es 


Description 


Smith- 
Waterman 


% 

Identit 


7040 


AF1 19855 


Homo sapiens 


PRO 1847 


158 


65 


7041 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


116 


62 


7042 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


117 


54 


7043 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


123 


80 


7044 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


144 


63 


7045 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


86 


44 


7046 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


80 


72 


7047 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7563. 


100 


57 


704S 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


152 


66 


7049 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


94 


72 


7050 


AF1 76323 


Macropus 
eugenii 


small nuclear ribonucleoprotein B' 


103 


41 


7051 


AF255446 


Crypthecodin ium 
cohnii 


Dip 1 -associated protein C 


127 


38 


7052 


M64792 


Rattus 
norvegicus 


salivary proline-rich protein 


149 


42 


7053 


G037I4 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


147 


69 


7054 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


109 


54 


7055 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


185 


86 


7056 


U60269 


Homo sapiens 


putative envelope protein; orf similar to 
env of Type A and Type B retroviruses 
and to class II HERVs 


125 


65 


7057 


AF068294 


Homo sapiens 


HDCMB45P 


264 


73 


7058 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


190 


95 


7059 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


140 


62 


7060 


AF090894 


Homo sapiens 


PRO0113 


185 


62 


7061 


AF 107406 


Homo sapiens 


GW128 


66 


80 


7062 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


86 


46 


7063 


AF1 19851 


Homo sapiens 


PRO 1722 


117 


74 


7064 


X92485 


Plasmodium 
vivax 


pval 


123 


66 


7065 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


79 


59 


7066 


L23545 


Homo sapiens 


putative 


135 


44 


7067 


AF1 19851 


Homo sapiens 


PRO 1722 


301 


69 


7068 


AF130079 


Homo sapiens 


PR02852 


118 


71 


7069 


AF119851 


Homo sapiens 


PRO 1722 


150 


64 


7070 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


138 


48 


7071 


Ml 1759 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


94 


39 


7072 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


145 


62 


7073 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


44 
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SE0 ID 

NO: 


Accession No. 




Description 


Waterman 


Identit 

y 








4478. 






7074 


AF 130089 


Homo sapiens 


PRO2550 


146 


60 


7075 


U49974 


Homo sapiens 


mariner transposase 


198 


83 


7076 


AF236085 


Homo sapiens 


CYP4F1 1 


427 


79 


7077 


AF 13 0089 


Homo sapiens 


PRO2550 


159 


52 


7078 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


206 


85 


7079 


D32202 


Homo sapiens 


alpha 1 C adrenergic receptor isoform 2 


152 


71 


7080 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


88 


62 


7081 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


135 


46 


7082 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


142 


75 


7083 


X53581 


Rattus 
norvegicus 


ORF4 


119 


64 


7084 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


111 


62 


7085 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


131 


64 


7086 


G03295 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7376. 


71 


66 


7087 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


81 


54 


7088 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


148 


50 


7089 


U08021 


Homo sapiens 


nicotinamide N-methylrransferase 


116 


100 


7090 


U93569 


Homo sapiens 


p40 


199 


82 


7091 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


124 


56 


7092 


X92485 


Plasmodium 
vivax 


pval 


125 


60 


7093 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


109 


62 


7094 


AF1 18082 


Homo sapiens 


PRO 1902 


191 


50 


7095 


X92485 


Plasmodium 
vivax 


pval 


129 


67 


7096 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


80 


66 


7097 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


162 


67 


7098 


AF116715 


Homo sapiens 


PR02829 


123 


63 


7099 


L27428 


Homo sapiens 


reverse transcriptase 


189 


50 


7100 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


84 


48 


7301 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


104 


65 


7102 


D85030 


Schizosaccharom 
yces pombe 


ribosomal protein S33 homolog 


139 


85 


7103 


G03021 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7102. 


177 


77 


7104 


U72543 


Sus scrofa 


ubiquitin-]ike/S30 ribosomal fusion 
protein 


129 


100 


7105 


AC007787 


AA 187-502 


NFI-X3=transcription factor 


190 


84 


7106 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


110 


38 


7107 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


157 


56 


7108 


Z69381 


Saccharomyces 
cerevisiae 


N1114 


450 


91 



319 



WO 01/64835 
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Accession No. 


Species 


Description 


Smith- 


% 


NO: 










Id en tit 










Score 




7109 


V49572_cdl 


Homo sapiens 


13-NOV-1996 Human stomach cancer 


353 


70 








clone HP10122 cDNA#l. 






7110 


W50192 


Homo sapiens 


Amino acid sequence of salivary 


99 


45 








protein CON- 1 . 






7111 


AF068294 


Homo sapiens 


HDCMB45P 


132 


55 


7112 


AB033020 


Homo sapiens 


KIAAII94 protein 


850 


100 


7113 


AB019692 


Homo sapiens 


protein kinase PKNbeta 


792 


87 


7114 


M12140 


Homo sapiens 


envelope protein 


154 


79 


7115 


Y27854 


Homo sapiens 


Human secreted protein encoded by 


88. 


62 








gene No. 10 1. 






7116 


AF124491 


Homo sapiens 


ARF GTPase-activating protein GIT2 


1216 


100 


7117 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


153 


54 








gene 22 clone HMSJW 18. 






7118 


AF130089 


Homo sapiens 


PRO2550 


184 


64 


7119 


AF130079 


Homo sapiens 


PR02852 


144 


68 


7120 


AL3 59782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


112 


51 






brucei 


complete virion genome. 






7121 


AF032384 


Xenopus laevis 


metalloprotease-disintegrin 


304 


76 


7122 


KOI 664 


Drosophila 


Bkm-like protein 


99 


71 






melanogaster 








7123 


AJ238706 


Drosophila 


monocarboxylate transporter 1 


161 


38 






melanogaster 


homologue 






7124 


X92485 


Plasmodium 


pval 


127 


54 






vivax 








7125 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


80 


65 








7795. 






7126 


AF1 16712 


Homo sapiens 


PR02738 


118 


53 


7127 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


64 








7871. 






7128 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


62 








6566. 






7129 


AF1 18090 


Homo sapiens 


PRO2044 


280 


80 


7130 


AF.I29085 


Homo sapiens 


carboxy terminus of Hsp70-interacting 


593 


91 








protein 






7131 


AF054840 


Homo sapiens 


tetraspan TM4SF; Tspan-3 


513 


83 


7132 


L27428 


Homo sapiens 


reverse transcriptase 


156 


51 


7133 


AK001647 


Homo sapiens 


unnamed protein product 


2334 


99 


7134 


AF 130051 


Homo sapiens 


PRO0898 


142 


84 


7135 


X53581 


Rattus 


ORF4 


213 


46 






norvegicus 








7136 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


86 








7124. 






7137 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


194 


68 








6953. 






7138 


U21634 


Onychomys 


cytochrome c oxidase subunit III 


111 


84 






torridus 








7139 


X60592 


Homo sapiens 


CDw40 


201 


95 


7140 


G0371.4 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


63 








7795. 






7141 


M76979 


Homo sapiens 


pigment epithelial-differentiating factor 


458 


93 


7142 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


54 








7795. 






7143 


Y27677 


Homo sapiens 


Human secreted protein encoded by 


323 


90 








gene No. 111. 






7144 


AF116712 


Homo sapiens 


PR02738 


132 


60 


7145 


AB024057 


Homo sapiens 


vascular Rab-GAP/TBC-containing 


121 


45 








protein 






7146 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


65 








7253. 
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SEQ ID 


Accession No. 


Species 


Description 




% 


NO: 








Waterman 


Identit 










Score 


y 


7147 


AL3 59782 


Trypanosoma 


possible (hhv-6) u 1 1 02, variant a dna, 


141 


76 






brucei 


complete virion genome. 






7148 


R06400 


Homo sapiens 


Lambda gtl0ch2 encoded C-terminal 


337 


85 






alpha amidating enzyme. 






7149 


S87759 


Homo sapiens 


protein phosphatase 2C alpha, 


427 


78 








PP2Calpha 






7150 


MI2140 


Homo sapiens 


envelope protein 


541 


64 


7151 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 


106 


85 








RHAla(AF078683) 






7152 


X55683 


Lycopersicon 


extensin (class I) 


70 


38 






esculentum 








7153 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


194 


61 








similar to pogo element 






7154 


AF130089 


Homo sapiens 


PRO2550 


89 


60 


7155 


AF 100742 


Homo sapiens 


M-phase phosphoprotein homolog 


714 


94 


7156 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


78 








6619. 






7157 


AL390114 


Leishmania 


probable (hhv-6) ul 1 02, variant a 


174 


79 






major 


DNA, complete virion genome 






7158 


AL031230 


Homo sapiens 


dJ73M23.3(KIAA0319) 


214 


45 


7159 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


155 


37 


7160 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


129 


46 








encoded by gene 17. 






7161 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


53 








7874. 






7162 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


55 








4435. 






7163 


M13043 


Mus musculus 


alpha- 1 (IV) collagen 


80 


42 


7164 


ABO 11004 


Homo sapiens 


UDP-N-acetylglucosamine 


257 


90 






pyrophosphorylase 






7165 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


111 


47 








gene 22 clone HMSJW18. 






7166 


AF1 19851 


Homo sapiens 


PRO 1722 


279 


61 


7167 


G02538 


Homo sapiens 


Human .secreted protein, SEQ ID NO: 


136 


51 








6619. 






7168 


W73327 


Homo sapiens 


Bipolar affective disorder associated 


287 


63 








gene fshl6 product. 






7169 


Y35946 


Homo sapiens 


Extended human secreted protein 


1231 


99 








sequence, SEQ ID NO. 195. 






7170 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


76 








4446. 






7171 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


184 


60 


7172 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


64 








6619. 






7173 


AF 129507 


Homo sapiens 


transcription factor ICBP90 


245 


55 


7174 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


56 








7871. 






7175 


AF161356 


Homo sapiens 


HSPC093 


107 


55 


7176 


AF2 18028 


Homo sapiens 


unknown 


116 


55 


7177 


G02654 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


52 








6735. 






7178 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


38 








6953. 






7179 


AF130089 


Homo sapiens 


PRO2550 


124 


47 


7180 


AP0006I6 


Oryza sativa 


similar to RING-H2 finger protein 


109 


66 








RHAla(AF078683) 






7181 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


65 








7795. 






7182 


Y36156 


Homo sapiens 


Human secreted protein #28. 


127 


62 
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SEQ ID 

NO: 


Accession No. 


Species 


Description V *' 


Smith- 
Score 


% 
y 


71S3 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6688. 


121 


75 


7184 


X92485 


Plasmodium 

vivax 


pval 


129 


60 


7185 


AF 130089 


Homo sapiens 


PRO2550 


116 


53 


7186 


AF130089 


Homo sapiens 


PRO2550 


137. 


64 


7187 


X51760 


Homo sapiens 


zinc finger protein (583 AA) 


236 


56 


7188 


X92485 


Plasmodium 
vivax 


pval 


131 


50 


7189 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


152 


57 


7190 


AF068294 


Homo sapiens 


HDCMB45P 


163 


50 


7191 


AF1 19855 


Homo sapiens 


PRO 1847 


55 


50 


7192 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


142 


57. 


7193 


M15530 


Homo sapiens 


B-cell growth factor 


106 


53 


7194 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


121 


41 


7195 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


124 


60 


7196 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


94 


68 


7197 


U60315 


Molluscum 
contagiosum 
virus subtype 1 


MC054L 


116 


36 


7198 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


116 


67 


7199 


Ml 5530 


Homo sapiens 


B-cell growth factor 


146 


59 


7200 


AF090894 


Homo sapiens 


PRO0113 


165 


65 


7201 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


160 


57 


7202 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


129 


53 


7203 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


92 


65 


7204 


AF 126 163 


Homo sapiens 


HHLA3 protein 


13t 


78 


7205 


X92485 


Plasmodium 
vivax 


pval 


116 


42 


7206 


AF090931 


Homo sapiens 


PRO0483 


136 


83 


7207 


Y36156 


Homo sapiens 


Human secreted protein #28. 


127 


49 


7208 


Y12961 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


158 


55 


7209 


Y36203 


Homo sapiens 


Human secreted protein #75. 


124 


56 


7210 


M69008 


Homo sapiens 


alpha- 1 type XIII collagen 


90 


34 


7211 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


144 


71 


7212 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


133 


60 


7213 


AF130089 


Homo sapiens 


PRO2550 


163 


86 


7214 


AF1 30089 


Homo sapiens 


PRO2550 


111 


52 


7215 


AF 118082 


Homo sapiens 


PRO 1902 


150 


63 


7216 


Y53663 


Homo sapiens 


A human B-cell growth factor related 
protein (BGFRP). 


83 


66 


7217 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


141 


68 


7218 


AF220264 


Homo sapiens 


MOST-1 


123 


66 


7219 


AF229067 


Homo sapiens 


PADI-H protein 


121 


52 


7220 


AF1 18086 


Homo sapiens 


PR01992 


130 


61 


7221 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


179 


100 
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SEQ ID 

NO: 




Species 


Descuption 


Waterman 

.JlSH 


1 den tit 

y 








SEQ ID NO: 163. 






7222 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


154 


78 


7223 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: J 63. 


200 


95 


7224 


L27428 


Homo sapiens 


reverse transcriptase 


127 


55 


7225 


AB008227 


Adiantum 
capillus-veneris 


Extensin 


121 


43 


7226 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


113 


78 


7227 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


153 


96 


7228 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


112 


74 


7229 


M15530 


Homo sapiens 


B-cell growth factor 


137 


44 


7230 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit HI 


140 


87 


7231 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


162 


46 


7232 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


123 


90 


7233 


XI 5769 


Mus musculus 


UlRNA-associated 70-kDa protein 


105 


29 


7234 


Ml 5530 


Homo sapiens 


B-celJ growth factor 


113 


61 


7235 


AF 194537 


Homo sapiens 


NAG 13 


249 


65 


7236 


R20313 


Homo sapiens 


Gly(5) beta-globin mutant. 


296 


80 


7237 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


123 


50 


7238 


Y36156 


Homo sapiens 


Human secreted protein #28. 


97 


80 


7239 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


395 


81 


7240 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


122 


70 


7241 


AP000616 


Oryza sativa 


similar to R1NG-H2 finger protein 
RHAla(AF078683) 


123 


90 


7242 


AF090942 


Homo sapiens 


PRO0657 


118 


47 


7243 


AF119851 


Homo sapiens 


PRO 1722 


112 


62 


7244 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


100 


53 


7245 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


144 


66 


7246 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


129 


78 


7248 


X92485 


Plasmodium 
vivax 


pval 


108 


58 


7249 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


170 


74 


7250 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


153. 


70 


7251 


U01849 


Trypanosoma 
brucei 


ORF2 


93 


39 


7252 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


113 


54 


7253 


G03473 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7554. 


110 


83 


7254 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


167 


66 


7255 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


179 


83 


7256 


AF 13 0079 


Homo sapiens 


PR02852 


147 


70 


7257 


AF1J8082 


Homo sapiens 


PRO 1902 


80 


45 


7258 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


137 


55 


7259 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


90 


65 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 


7260 


X92485 


Plasmodium 

vivax 


"pval ~ — 


-y^p 




7261 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO*. 
7879. 




76 




AF1 30089 


Homo sapiens 






43 


7263 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene z./. cione mviaj wi6. 


114 


60 


7264 


AF1 16661 


riomo sapiens 


nn/\i .1 1 o 

rtsAJ Ihoo 


T39 


67 


7265- 


AF 104021 


Bovine viral 
Le^hni V ' rUS " 2 — 


polyprotein 


161 


40 


7266 


AL 1 60493 


eis ania 


probable (hhv-6) ul 102, variant a 


. 

16 


43 


7267 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene *r4 cione ri i u/\ljjc.z. 


1 12 


62 


7268 


M15530 


Homo sapiens 


B-cell growth factor 


154 


60 


7269 


G03_>56 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437- 


95 


54 


7270 


AF1 18082 


Homo sapiens 


PRO 1902 


139 


62 


7271 


S52010 




orn j oi tpojK. 


93 


33 


7272 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


50 


7273 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


61 


7274 


AF1 30089 


Homo sapiens 


PRO2550 


124 


43 


7275 


X92485 


Plasmodium 


pval 


147 


44 


7276 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
f. 438 ' : 


81 


59 


72 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


67 


7278 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


89 


67 


7279 


AF 130087 


Homosa iens 


PR024 1 1 


— 




7280 


Y40039 


Homo sapiens 


human secreted protein. 


-po" 





7281 


J03507 










7282 


AF 11 6661 


Homo sapiens 


PR01438 


117 


48 


7284 






Human secreted protein, SEQ ID NO; 
6613. 






7285 


AF1 16715 










7286 


AF130079 


Homo sapiens 


PR02852 


122 


52 






Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


120 


50 




G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


338 


75 






Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


1 14 


72 


7290 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2 


146 


59 


7291 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


156 


58 


7292 


G02532 


Homo sapiens 


Human secreted protein SEQ ID NO' 
6613. 


~Wi 


~6~r — 


7293 


X92485 


Plasmodium 


pval 


122 


71 


7294 


AF130089 


Homo sapiens 


PRO2550 


87 


62 


7295 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


140 


50 


7296 


X92485 


Plasmodium. .. 


pval 


152 


75 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Wa term an 


% 
y 






vivax 








7297 


AF090894 


Homo sapiens 


PRO0113 


172 


65 


7298 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


121 


61 


7299 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


114 


66 


7300 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


228 


80 


7301 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


111 


73 


7302 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


119 


46 


7303 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


168 


76 


7304 


U58658 


Homo sapiens 


unknown 


102 


48 


7305 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


118 


53 


7306 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


116 


63 


7307 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


149 


58 


7308 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


112 


52 


7309 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


115 


54 


7310 


M15530 


Homo sapiens 


B-cell growth factor 


131 


54 


7311 


AF 118082 


Homo sapiens 


PRO 1902 


168 


46 


7312 


L78671 


Homo sapiens 


CoxIl/D-loop DNA fusion protein 


412 


92 


7313 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


326 


74 


7314 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


149 


59 


7315 


AF289099 


Maackia 
amurensis 


ENOD2f 


107 


32 


7316 


U 17009 


Phytophthora 
infestans 


SecY-independent transporter protein 


135 


41 


7317 


AF220264 


Homo sapiens 


MOST-1 


133 


72 


7318 


U49973 


Homo sapiens 


ORF2: function unknown 


111 


53 


7319 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


274 


80 


7320 


AB047600 


Macaca 
fascicularis 


hypothetical protein 


79 


56 • 


7321 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO.T63. 


205 


95 


7322 


AC005175 


Homo sapiens 


TA2R HUMAN, BETA ISOFORM; 
TXA2-R; PROSTANOID TP 
RECEPTOR 


127 


57 


7323 


R48296 


Homo sapiens 


Human PGF-2/NT-3. 


89 


93 


7324 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWI 8. 


157 


60 


7325 


AF1 19855 


Homo sapiens 


PRO 1847 


172 


66 


7326 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


141 


50 


7327 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


156 


56 


7328 


Y40039 


Homo sapiens 


Peptide sequence derived from a 
human secreted protein. 


98 


56 


7329 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


161 


64 


7330 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


111 


72 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
q""" C 


% 

Identit 

y 


-J221 


AFI261 63 


— : 

omo sapiens 


riJrii^A^ procein 


~"T^ ~~ 


47 


7332 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7339. 


122 


61 


7333 


AF 130079 


Homo sapiens 


PR02852 


136 


54 


7334 


AK023563 


*j? mo ^ ap | ens 


unnamed protein product 


~ttI 


73 


7335 


G02532 


omo sapiens 


Human secreted protein, SEQ ID NO. 




61 


7336 


Ml 5530 


Homo sa iens 


~B — 11 r wth f 


— — 


— 


7337 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


To8 


~ 


7338 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


339 


80 






Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


170 


74 






Homo sapiens 


NADH dehydrogenase subunit 3 


400 


82 


7341 


L00016 


Homo sapiens 


urf4 


295 


72 




D381 12 


Homo sapiens 


NADH dehydrogenase subunit 3 


350 


86 


7343 


M36647 


Homo sapiens 


mitochondrial hinge protein precursor 


286 


82 


7344 
— 


X15917 


Paramecium 
aurelia 


ND2 protein (AA 1-193) (unusual start 
co don) 


132 


34 


73 




Y86248 




Homo sapiens 


Human secreted protein HCHPF68, 
oby ID N«J:loJ. 


136 


82 


7346 




— : 

Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


89 


69 


~7347 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

CCA TT~» \IA' 1 6.1 


13j 


69 


~7348 


p381 12 


___ ; 

Homo sapiens 


NADH dehydrogenase subunit 3 


— k 




71 


7349 


D381 12 


Homo sa^rT 


cytochrome c oxidase subunit 1 





?? 


7350 


V00662 


omo sapiens 


cytochrome oxidase I 





.11 — 


735 1 




omo sapiens 


NADH dehydrogenase subunit 3 


— =2 


8 ? 


-^H? — 


- p. - 

~ 


Homo sapiens 


NADH dehydrogenase subunit 3 




79 




D ^ sn ~ 


Homo sapiens 


NADH dehydrogenase subunit 1 


270 


86 


7354 


j-^sns 




NADH dehydrogenase subunit 3 


-i|Z 


- 90 


7355 




PrrTrxog^od^es — 


cytochrome c oxidase subunit 1 






7356 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


362 


84 


yill — 


P3g| ^2 


Homo sapiens 


NADH dehydrogenase subunit 3 


188 


72 


^ 




Homo sapiens 


NADH dehydrogenase subunit 3 


j35 


72 






Homo sapiens 


cytochrome b 


275 


80 


-— q 




Homo sapiens 


NADH dehydrogenase subunit 3 




73 


~7§6I 


API 1 R07R 


Homo sapiens 


PDA I SMC 


_ 95 .__ .. . 


58 


7362 


AB0171 16 


Homo sapiens 






89 


7363 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene cione n i Ur\ uzz. 


146 


42 


"7364 


D381 12 


— — : 

Homo sapiens 


NADH dehydrogenase subunit 1 




82 


7365 


D3S112 


Homo sapiens 


NADH dehydrogenase subunit 1 


206 


84 


— 


pgg| 


Homo sapiens 


NADH dehydrogenase subunit 3 


408 


82 






Homo sapiens 


NADH dehydrogenase subunit 3 




86 


7368 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


481 


85 




X8603 1 


Sus scrofa 


homologue of proline/arginine-rich 
antibacterial protein 


150 


53 


7j70 


AB0171 16 


Homo sapiens 


AD 1 


145 


91 


7371 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

LLJ INU.lDJ. 


173 


85 


7372 


AF 118082 


Homo sapiens 


PRO 1902 


122 


54 


7373 


G00492 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4573. 


163 


62 


7374 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


53 


7375 


AF090894 


Homo sapiens 


PRO0113 


128 


64 


7376 


AK000241 


Homo sapiens 


unnamed protein product 


131 


67 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 


7377 


AF1 19855 


Homo sapiens 


PRO 184 7 


164 


62 


7379 


AFO 16099 


Mus muscuJus 


endonuciease/reverse transcriptase 


114 


37 


7380 


U57362 


Rattus 
norvegicus 


collagen XII alpha 1 


148 


47 


7381 


D381 12 


Homo sapiens 


ATPase subunit 6 


423 


76 


7382 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW 18. 


135 


58 


7383 


AB040938 


Homo sapiens 


KJAA1 505 protein 


983 


76 


7384 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


102 


48 


7385 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


122 


75 


7386 


U63295 


Homo sapiens 


seven in absentia homolog 


260 


85 


7387 


AE003499 


Drosophila 
raelanogaster 


CG 12706 gene product 


103 


32 


7388 


X03342 




rpL32 (aa 1-135) 


123 


51 


7389 


G02S72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


129 


58 


7390 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


154 


100 


~739\ 


AKUZOoUU 


-rj : 

Homo sapiens 




"T37 


— 


7392 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


1 17 


— 


7393 


G04072 


Homosa iens 

onio sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


89 


70 


7394 


Y 8624 8 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


161 


80 


7395 


Ml 5530 


Homo sapiens 


B-ceJl growth factor 


98 


57 


7396 


L10908 


Mus musculus 


Gcapl gene product 


105 


51 


7397 


AF1 19851 


Homo sapiens 


PRO 1722 


127 


55 


7398 


X99256 


Hylobates lar 


cytochrome oxidase subunit III 


319 


73 


7399 


D38I 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


497 


77 


7400 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


399 


80 


7401 


L78671 


Homo sapiens 


CoxII/D-loop DNA fusion protein 


170 


83 


~7^o3 — 




Homo sapiens 







91 




Y86248 




Human secreted protein HCHPF68 
SEQ ID NO: 163. 






7404 


G02639 


Homo sapiens 


Human secreted protein SEQ ID NO* 
6720. 


115 


40 


"7405 


D83006 


Saccharomyces — 
cerevisiae 


oligosaccharides/ has high homology 
with YJR061w 


" iTI 


~38" 


7406 


AF 144636 


Gallus gallus 


osteoblast regulatory factor 3 A 


108 


44 


7407 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


166 


70 


7408 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


407 


93 


7409 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 


140 


87 


7410 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


1 1 1 


59 


7411 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 


138 


82 










43 1 




7413 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


127 


56 


7414 


AB036396 


Ran a 

nigromaculata 


cytochrome b 


109 


87 


7415 


AF090930 


Homo sapiens 


PRO0478 


121 


76 


7416 


B03148 . 


Homo sapiens 


Human neuronal differentiation factor- 
1 CNDF-1). 


171 


96 


7417 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


446 


91 



327 
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SEQ1D 

NO: 
— 


Accession No. 




Description 


Smith- 

VV'aterman 

Score 


% 

Idcntit 

y 


418 


ABO 18440 


Echinococcus 
multiiocu Earis 


NADH dehydrogenase subunit 2 


1 12 


33 


7419 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


123 


66 


7420 


Y 86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


116 


50 


~74T] 


A p i557 ~2 — 


Pisum sativum 


extensin 


128 


37 


7422 


AF161356 


Homo sapiens 


HSPC093 


89 


48 


7423 


D381 15 


Pongo pygmaeus 


cytochrome c oxidase subunit 3 


162 


81 


7424 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


171 


79 


7425 




AF275953 



synthetic 
construct 


Cox8ND6gfp fusion protein 


214 


93 


7426 


Dj8 L 12 .. 


Homo sapiens 


cytochrome c oxidase subunit 1 


156 


73 




G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


130 


59 






Homo sapiens 


FLJ00047 protein 


82 


66 


7429 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


212 


82 




S79410 


Mus musculus 


nuclear localization signal binding 
protein 


124 


48 


7431 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


195 


90 


7432 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


217 


84 


7433 


U87145 


Toxoplasma 
gondii 


ORFD 


89 


36 


7434 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


213 


72 


7435 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


135 


57 




G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


151 


82 


74j7 


Y 8624 8 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


190 


88 


7438 


D38112 


Homo sapiens 


NADH dehydrogenase subunit I 


14! 


78 


7439 


AF250284 


Amsacta moorei 
entomopoxvirus 


AMV012 


88 


31 


7440 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 


149 


85 




U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


114 


52 




AF275807 


Homo sapiens 


PNAS-1 10 


157 


48 


7443 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


126 


56 


7444 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 4 


475 


95 


7445 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


146 


71 


7446 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


130 


60 


7447 


X92485 


Plasmodium 
vivax 


pval 


153 


45 


7448 


AF1 19851 


Homo sapiens 


PRO 1722 


96 


55 


7449 


AF130079 


Homo sapiens 


PR02852 


153 


73 


7450 


AF1 19851 


Homo sapiens 


PRO 1722 


238 


52 






Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


113 


38 


7452 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


147 


57 


7453 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7324. 


136 


63 


7454 


R14163 


Homo sapiens 


Cellular DNA-binding protein RSI . 


270 




7455 


X63526 


Homo sapiens 


homologue to elongation factor 1- 


387 


80 | 



328 
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PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Wntelma 


% 


NO: 










Idcntit 










Score 










gamma from A.salina 






7456 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


80 








7253. 






7457 


AF1 19851 


Homo sapiens 


PRO 1722 


102 


55 


7458 


G02756 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


83 


55 








6837. 






7459 


AF1 19851 


Homo sapiens 


PRO 1722 


132 


64 


7460 


L10908 


Mus musculus 


Gcapl gene product 


106 


64 


7461 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


54 








6913. 






7462 


X77816 


Rattus - 


PR-Vbetal 


142 


51 






norveeicus 








7463 


Y34068 


Homo sapiens 


Histone HI isoform HI. 5 pANCA- 


96 


35 








reactive fragment (residues 69-226). 






7464 


AB046048 


Macaca 


unnamed portein product 


302 


62 






fascjcularis 








7465 


X92485 


Plasmodium 


pval 


1 15 


47 






vivax 








7466 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


65 








6619. 






7467 


W50192 


Homo sapiens 


Amino acid sequence of salivary 


91 


35 








protein CON-1. 






7468 


AF161361 


Homo sapiens 


HSPC098 


92 


57 


7469 


AF2 16389 


Homo sapiens 


semaphorin Rs 


288 


88 


7470 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


153 


73 


7471 


AF130089 


Homo sapiens 


PRO2550 


108 


48 


7472 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


58 








7870. 






7473 


AF2 17536 


Homo sapiens 


truncated mevalonate kinase 


115 


71 


7474 


X61296 


Rattus 


open reading frame 2 


134 


49 






norvegicus 








7475 


U22376 


Homo sapiens 


alternatively spliced product using exon 


165 


46 








13A 






7476 


AF1 19851 


Homo sapiens 


PRO 1722 


133 


46 


7477 


AL359782 


Trypanosoma 


possible (hhv-6) ul 1 02, variant a dna, 


127 


69 








complete virion genome. 






7478 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 4 


533 


87 


7479 


S79410 


Mus musculus 


nuclear localization signal binding 


122 


59 








protein 






7480 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


168 


84 


7481 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


181 


80 








7519 






7482 


G022 1 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


56 








6292. 






7483 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


37 








7874. 






7484 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


100 








6836. 






7485 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


277 


72 


7486 


M29294 


Rattus 


small nuclear ribonucleoparticle- 


132 


42 






norvegicus 


associated protein 






7487 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


140 


50 








gene 22 clone HMSJ W 1 8. 






7488 


AF090901 


Homo sapiens 


PRO0195 


101 


62 


7489 


X97249 


Homo sapiens 


leucine-rich primary response protein 


304 


93 


7490 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


447 


84 


7491 


AK024455 


Homo sapiens 


FLJ00047 protein 


122 


63 


7492 


U87269 


Homo sapiens 


pl20E4F transcription factor 


244 


66 


7493 


A31036 


Nicotiana alata 


PRP2 


87 


31 



329 



WO 01/64835 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Iclentit 


— — 









Score 


y 


7494 


8 


Homo sapiens 


Human secreted protein HCHPF68, 


206 


97 





■ 




SEQ ID NO: 1 63 . 






7495 


8 


Homo sapiens 


Human secreted protein HCHPF68, 


166 


91 









SEQ ID NO: 1 63 . 








Y86248 


— ; 

Homo sapiens 


Human secreted protein HCHPF68, 


161 


66 





■ — — — 


_ . 


ot-O I-U NU.JOJ. 






7 


AK024455 


Homo sapiens 


FLJ00047 protein 


67 


61 


7498 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


174 


84 








SEQ ID NO: 163. 






7499 


X75438 


Sus scrofa 


homologue of proline/arginine rich 


116 


44 








antibacterial peptides 






7500 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


166 


89 








SEQ ID NO: 163. 






7501 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


137 


61 








gene No. 107. 






7502 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


213 


60 









SEQ ID NO: 1 63 . 






750j 


AB042770 


Pteropus 


cytochrome c oxidase subunit 3 


169 


89 


_ ■ 




dasymallus 










A T7 1 0£ 1 AH 

Ar izolco 


Homo sapiens 


HHLA3 protein 


1 10 


62 


7505 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


299 


90 


7506 


D38J 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


194 


88 


7507 


W48351 


Homo sapiens 


Human breast cancer related protein 


128 


72 








BCRB2. 






7508 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


162 


86 








SEQ ID NO: 163. 






7509 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


116 


65 


7510 


L3890S 


Nicotiana 


extensin 


114 


36 






tabacum 








7512 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 


273 


73 








similar to pogo element 






7513 


Y36156 


Homo sapiens 


Human secreted protein #28. 


168 


44 


7514 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 


171 


96 








1 (NDF-1). 






7515 


Y8624S 


Homo sapiens 


Human secreted protein HCHPF68, 


110 


46 


— — 







SEQ ID NO: 1 63. 










Pongo pygmaeus 


cytochrome c oxidase subunit 3 


170 


84 


7S17 
7517 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


43 









7344. 






— — — 


H 


Homo sapiens 


NADH dehydrogenase subunit 3 


350 


82 


~* 




Gorilla gorilla 


NADH dehydrogenase subunit 3 (ND3) 


141 


84 


— |— 


U 12690 


Homo sapiens 


cytochrome oxidase subunit II 


410 


85 


7521 


D38112_ 


Homo sapiens 


cytochrome c oxidase subunit 1 


1 12 


71 


7522 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 


96 


46 





— — — 


- |j""y ce j — . 


protein rhala. 






7523 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


57 


— — 






7871. 






7524 


— t,- 

AF220264 


Homo sapiens 


MOST-1 


86 


62 


7525 


AF130089 


Homo sapiens 


PRO2550 


109 


35 


7526 


G00381 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


106 


79 








4462. 








U49973 


Homo sapiens 


ORF2: function unknown 


105 


68 


7528 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


122 


62 








encoded by gene 17. 






7529 


X05831 


Ratals 


fibronectin precursor 


100 


64 






norvegicus 








7530 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


175 


60 








6613. 






7531 


X92485 


Plasmodium 


pval 


120 


59 



330 



WO 01/64835 
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SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










_Score 


y 














"7532 


D3SI 15 


r ongo pygmaeus 


cytochrome c oxidase subunit 3 


383 


— 


7533 


D381 12 


Homo^sapiei^s 


NADH dehydrogenase subunit 3 


321 


67 


7534 


AL359782 






126 


72 






bruc^ n0S ° ma 


complete virion genome. 






■ 

~7^36 


AC] lOR^I 


__ : 

iiOITlO S3pi6DS 


PRO 1 722 


TTo ' 


~67 




D381 12 


Homo sapiens 




53 1 


81 


— — 


G03714 


Homo sapiens 


Wnman cprrptpH nrnfpin RPO TO "MO* 




75 








7795. 






- R 


yo 74 og 


~di TP 


pval 


Til 


~48 














■ 


/\r 1 177UU 


Homo sapiens 




— 


— 


Tn 

7540 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


~ToK 


-— 










-— 









Homo sapiens 


B-cell growth factor 




— 

— | 


7542 


X53581 


Rattus 


ORF4 


— — 

174 








norvegicus 









7543 


Y86248 


Homo sapiens 


„. — =— 

Human secreted protem HCHPF68, 


138 




74 












— 


- ■ 


p ^ 81 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


—7 




■ 


JV11 jjjU 


Homo sapiens 


B-cell growth factor 


— 


— 


7546 


X92485 


Plasmodium 


pval 


"TZo 


— 

















AF1 1 3685 


Homo sapiens 


PRO0974 


~TT6 


~4R 

— 




7548 


U49973 


Homo sapiens 


ORF1, MER37, putative transposase 


-~^z 




< — Q ' 






similar to pogo element 


— 


— 





c^cviift 


Homo sapiens 


CoxII/D-loop DNA fusion protein 






7550 


S79410 


Mus musculus 


nuclear localization signal binding 


~Tn7 

10 


— 

70 








protein 




-— 






Homo sapiens 




— ^ 


— 1 


7552 


R95913 


Homo sapiens 


Neural thread protein. 




56 




U93569 


Homo sapiens 


putative pi 50 


171 

—— 


—Z 


7554 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




75 








44^5. 






7555 


L78671 


Homo sapiens 


CoxII/D-loop DNA fusion protein 


207 


56 


7556 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


59 






7795 - ; : 






7557 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




125 


43 








— -j— -j : 


— 




7558 


9 


— - . 

Homo sapiens 


Human gene 42-encoded protein 


95 


— 

71 








fragment SEQ ID NO'360. 






— ?~o 

7559 





-— 

Human 


U88 


— 


— — 






herpesvirus 6 










. 75 . _. 




A18812 


Brassica napus 


extensin 


____ 

-— l 


— 




7561 


D^8112 


Homo sapiens 


ATPase subunit 6 




-5Z 


7562 


AF 130089 


Homo sapiens 


r>r> rtifcn 
VjsXjZ j_>U 


— 7T 

—Li 


* 6 . 


7563 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 


106 


61 








isoform 


— 





— FT 

7564 




G00361 


— s 

Homo sapiens 


Human secreted protein, SEQ ID NO: 












^f^' -j 77: -tz 






— — 

7565 





Caenorhabditis 


C elegans collagen dpy-10 gene (Levy, 


"TTvi 


— 








A D Yang J. and Kramer^ J.M. Mol. 












Biol Cell 4 3 '803- 17, 1993)' 






7566 


V00672 


Pan troglodytes 


reading frame protein 4 


294 


73 


7567 


A18812 


Brassica napus 


extensin 


145 


48 


7568 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


241 


69 


7569 


L00016 


Homo sapiens 


urf4 


284 


71 


7570 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


115 


50 


7571 


G00673 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


65 



331 



WO 01/64835 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








4754. 






7572 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


126 


65 


7573 


X92485 


Plasmodium 


pval 


157 


49 


7574 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


151 


54 


7575 


AF1 30089 


Homo sapiens 


PRO2550 


348 


70 




R95913 


Homo sapiens 


Neural thread protein. 


1 13 


45 


7577 


L26953 


Homo sapiens 


chromosomal protein 


148 


38 


7578 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


116 


46 


7579 


AB047948 


Macaca 
fascicularis 


hypothetical protein 


173 


67 


7580 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


149 


57 


7581 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


95 


51 


7582 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


125 


85 


7583 


AC004079 


Homo sapiens 


human HOXA3; 95% similarity to 
e307530 (PID.g 1888441) 


127 


100 


7584 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


171 


71 


7585 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


141 


80 


7586 


Y 8624 8 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


222 


100 


7587 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


194 


71 


7588 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 1 


115 


80 


7589 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


127 


77 


7590 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


112 


60 


7591 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


133 


72 


7592 


A49807 


Neisseria 
meningitidis 


IGA-LPHA3 


93 


33 


7593 


Z67990 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF01484 (Nematode cuticle collagen 
N-terminal domain), Score=27.5, E- 
value=0.0001,N=l 


102 


42 


7594 


U43627 


Arabidopsis 
thaliana 


extensin 


99 


44 


7595 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


103 


74 


7596 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


112 


49 


7597 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


190 


92 


7598 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


148 


96 


7599 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


117 


95 


7600 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID-NO:J63. 


192 


85 


7601 


AP000616 


Oryza sativa 


similar to RTNG-H2 finger protein 
RHAla(AF078683) 


129 


91 


7602 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


333 


71 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 
Wnterman 


y 


7603 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


190 


86 


7604 


U09500 


Homo sapiens 


cytochrome b 


279 


93 


7605 


Z14016 


Nicotiana 
tabacum 


pistil extensin like protein, partial CDS 


84 


40 


7606 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


108 


71 


7607 


G03636 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7717. 


55 


52 


7608 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


1 14 


54 


7609 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


97 


62 


7610 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


434 


87 


761 1 


Y21418 


Homo sapiens 


Human high mobility group protein 
HMGI-C mutant fragment 2. 


97 


46 


7613 


X92485 


Plasmodium 


pval 


106 


58 


7614 


AC003058 


Arabidopsis 
ttialiana 


unknown protein 


191 


88 


7615 


S7422 1 


Homo sapiens 


UC factoi^cytokine down-regulating 
HLA class II 


153 


65 


7616 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


148 


74 


7617 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


169 


56 


7618 


L35013 


Homo sapiens 


spliceosomal protein 


95 




7619 


V00662 


Homo sapiens 


URF 3 (NADH dehydrogenase subunit) 


441 


85 


7620 


AF1 16638 


Homo sapiens 


PRO 1546 






7621 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


146 


46 


7622 






Human secreted protein SEQ ID NO' 
7795. 






7623 


G02538 




Human secreted protein SEQ ID NO - 
6619. 






7624 


U4 1 03 1 


elegans 


proline rich 


92 


5 ^ 


7625 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7534. 


130 


55 


7626 


LI 9098 


Choristoneura 
fumiferana 


cytochrome oxidase I 


102 


66 


7627 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


99 


76 


7628 


AFI26163 


Homo sapiens 


HHLA3 protein 


117 


67 


7629 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


69 


53 


7630 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


83 


55 


7631 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


87 


64 


7632 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


120 


48 


7633 


R95913 


Homo sapiens 


Neural thread protein. 


126 


61 


7634 


V00662 


Homo sapiens 


cytochrome oxidase I 


396 


85 


7635 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


205 


84 


7636 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


464 


85 


7637 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


165 


94 


7638 


AF1 18086 


Homo sapiens 


PRO 1992 


120 


68 


7639 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


176 


84 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


' Smith- 

Waterman 
Score 


% 

Idcntit 

y 


7640 


W88627 


Homosa iens 

omo sapiens 


-j : — 

secreted protein encoded by gene 94 
clone HPMBQ32. 


95 


39 


7641 


G03714 


Homo sapiens 


Human secreted protein SEQ ID NO' 
7795. 


~\A3 


68 


7642 


D381 12 


Homo sapiens 






91 


7643 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


416 


80 


7644 


AF004341 




cytochrome c oxidase subunit HI 


133 


84 


7645 


U93570 


Homo sapiens 


putative pi 50 


206 


42 






Homo sapiens 


ATPase subunit 6 


192 


80 


7647 


AF090942 


Homo sapiens 


PRO0657 


128 


71 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


134 


73 


7649 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


115 


52 






Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


130 


61 




X71442 


Rattus 
norvegicus 


ORF 1 ; putative 


105 


52 




L07545 


Leishmania 
tarentolae 


A 'c' was inserted after nt 369 (=nt 
10459 in genomic sequence (M10126)) 
to correct -1 frameshift probably due to 
gel compression 


111 


32 


7653 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


89 


53 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6554. 


291 


49 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


165 


63 






Homo sapiens 


NADH dehydrogenase subunit 3 


470 


88 


7657 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


367 


80 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


108 


95 


7659 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 
protein rhala. 


109 


50 


7660 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


112 


77 


7661 


AP002460 


— T j-r-j : 

Arabidopsis 


gene_id.FlD9.26~unknown protein 


121 


40 


7662 


AP000616 


Oryza sativa 


similar to RTNG-H2 finger protein 
Kii/\ l a (ArU / aooi) 


137 


96 


7663 


AF2 1 7973 


~Ho : 

Plasmo^um"" 


unlcnown 


124 


51 


7664 


X92485 


vivax 


pval 


118 


59 




API 1 ROR9 


Homo sapiens 


PRO 1 902 


123 


44 


7666 


ABO 14563 


Homo sapiens 


KIAA0663 protein 


267 


90 




(JUUOJ / 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


114 


49 


Z||| 


Ar I lyiOD 


Homo sapiens 


PRO 1847 


152 


72 




AXOzDl lo 


Homo sapiens 


unnamed protein product 


127 


65 


7670 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


140 


55 


7671 


R14847 


Homo sapiens 


Protein associated with biochemical 
pathway involving cAMP. 




89 


7672 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


93 


68 


1613 


U37426 


Homo sapiens 


kinesin-like spindle protein HKSP 


247 


81 


7674 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


125 


75 


7675 


S67633 J Homo sapiens 


class I major histocompatibility antigen 


103 


59 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


Identit 














7676 


AF126163 


Homo sapiens 


HHLA3 protein 


146 


72 


7677 


AC005192 


° m ° g a | ens 

omo sapiens 




143 


87 








protein PC4; 96% identical to PI 9 182 












(PID:gl35861) 






7678 


X07882 


Homo sapiens 


Po protein 


126 


39 


7679 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


148 


60 








encoded by gene 17. 






7680 


AJ242540 


Volvox carteri f. 


hydroxyproline-rich glycoprotein DZ- 


177 


37 






nagariensis 


HRGP 






7681 


AF090930 


Homo salens 


PRO0478 


go 


58 


7682 


X55654 




cytochrome C oxidase 11 subunit 


1 15 


65 


7683 


AB046048 


Macaca 31316 " 5 


unnamed portein product 


145 


67 






lasciculans 













Homo sapiens 


PR02852 


1 17 


60 




7685 


1VI 1 J I U 1 


Rattus 


unlcnown protein 


1 14 








norvegicus 














omo sapiens 


Secreted protein encoded by <>ene 18 


134 


~74 








clone HCACJ81. 






7687 


AF126163 


Homosa iens 


HHLA3 protein 


141 


84 








Homo sapiens 


^F^V^FT~~^ P rote '" 




61 


7689 


M37197 


Homo sapiens 


CCAA1 -box-binding tactor 


"Tfii 




7690 


A I ion 1 1 A 


Leishmania 


probable (hhv-6) ul 102, variant a 


"Q7 


~5l 






: 













Homo sapiens 


MOST- 1 


TT7 

— 


— 




nrUOJOOO 


Melanoplus 


ORF IVlS'v'233 hypothetical protein 




-— 






sanguinipes 












en omopoxvirus — 














Homo sapiens 


— — : 

Human secreted protein, oEQ 11-' NC/r 


~QR 


— 














7694 


A T?1 1 OflQO 

Ar I loUoz 


-— : 

Homo sapiens 


DDA1 OftO 

rKUl yUz. 


— — 


— 

— 


7695 


G00517 


Homo sapiens 


xiuman secretca proiein, jcy iu inw. 


"776 










4598 : r . -. — - — 


—— 


— 


— — 

7696 


— ■ — 

Y15163 


— — 

Mus musculus 


putative splice variant containing 












region C 


— — 




7697 


2 n 

Y64890 


— r : 

Homo sapiens 


Human 5' EST related polypeptide SEQ 




— 

52 








ID NO: 1051. 








698 





— : 

omo sapiens 




~Ts5 


~77 








Hnnp HPAPTR1 




— 




^ 


ivii JJJU 


— : 

omo sapiens 




~\06 






G03790 


Homo sapiens 


Human secreted protein SEQ ID NO" 


~7o8 


~T0 








7871. 






7701 


G02538 


—q : 

omo sapiens 


Human secreted protein, SEQ ID NO: 


166 


62 














77QO 

~ 


AF09093 1 


Homosa iens 


PRO0483 


125 


68 




G03432 


Homo salens 

omo sapiens 


Human secreted protein, SEQ ID NO: 


1 14 


79 








7513. 












H mosa iens 

omo sapiens 


Human secreted protein, SEQ ID NO: 


175 


63 








6613. 








Zl^r 




u 

Bo o sal ans 


NADH dehydrogenase subunit 5 


141 


-^r? 

"4r — 


7706 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 
















7707 


X92485 


Plasmodium 


~PaT 


126 


42 






vivax 








7708 


AK023563 


Homo sapiens 


unnamed protein product 


217 


72 


7709 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 


124 


58 








RHAla(AF07S683) 






7710 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


124 


56 








gene No. 107. 






7711 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


62 
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Accession No. 


Species 


Description 




% 


NO: 








Waterman 












Score 


. y 








7795. 






7712 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 1 


116 


72 


7713 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


194 


58 


7714 


X92485 


Plasmodium 


pval 


123 


68 






vivax 








7715 


G03797 


Homo sapiens 


Human secreted protein, SF.Q ID NO: 


137 


48 








7878. 






7716 


AF1 19851 


Homo sapiens 


PRO 1722 


168 


60 


7717 


U93570 


Homo sapiens 


putative pi 50 


133 


35 


7718 


AF1 16712 


Homo sapiens 


PR02738 


94 • 


68 


7719 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


107 


55 








terminal, clone XEH.8c} 






7720 


•U49973 


Homo sapiens 


ORF2: function unknown 


200 


85 


7721 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


.234 


73 








similar to pogo element 






7722 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


76 








4478. 






7723 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


140 


65 








6953. 






7724 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


70 


73 








gene 22 clone HMSJW18. 






7725 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


55 








7870. 






7726 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


130 


75 








terminal, clone XEH.8c} 






7727 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


94 


78 








encoded by gene 90. 






7728 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


163 


65 








7881. 






7729 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


111 


63 








gene 22 clone HMSJW 18. 






7730 . 


WS0406 


Homo sapiens 


A secreted protein encoded by clone 


119 


69 








dh40 3. 






7731 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


84 








7075. 






7732 


Y08769 


Rattus 


microvascular endothelial 


94 


38 






norvegicus 


differentiation gene 2 






7733 


U09500 


Homo sapiens 


cytochrome b 


463 


84 


7734 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


334 


92 


7735 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


399 


92 


7736 


Y12950 


Homo sapiens 


Amino acid sequence of a human 


128 


55 








secreted peptide. 






7737 


AC003O58 


Arabidopsis 


unknown protein 


167 


66 






thaliana 








7738 


W48353 


Homo sapiens 


Human breast cancer related protein 


86 


70 








BCFLT2. 






7739 


D86854 


Catharanthus 


extensin 


154 


40 






roseus 








7740. 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


57 








7324. 






7741 


X86031 


Sus scrofa 


homologue of proline/arginine-rich 


99 


49 








antibacterial protein 






7742 . 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


35 








6834. 






7743 


V00662 


Homo sapiens 


cytochrome oxidase I 


568 


89 


7744 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


145 


93 


7745 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


154 


100 


7746 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


133 


61 








RHAla(AF078683) 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Idcntit 

y 


11 Al 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


126 


71 


7748 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


327 


92 


7749 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


150 


71 


7750 


AF 130079 


Homo sapiens 


PR02852 


147 


81 


7751 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


104 


49 


7752 


U43627 


Arabidopsis 
thaliana 


extensin 


1 16 


37 


7753 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


133 


71 


7754 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 1 


551 


83 


7755 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


88 


56 


7756 


AF119851 


Homo sapiens 


PRO 1722 


97 


56 


7757 


U80955 


Caenorhabditis 
elegans 


similar to RE repeat region of 
PIR:S27806 D. melanogaster homeotic 
protein BarH2 protein (dual bar 
protein) 


89 


100 


7758 


AC002045 


Homo sapiens 


Unknown protein product CIT987SK- 
A-589H1_1 splice form 2 


97 


94 


7759 


Y02775 


Homo sapiens 


Human secreted protein encoded by 
gene 1 2 clone HFTCU 19. 


288 


98 


7760 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


94 


80 


7761 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


199 


76 


7762 


AF090931 


Homo sapiens 


PRO0483 


153 


84 


7763 


X92485 


Plasmodium 
vivax 


pval 


114 


62 


7764 


AF09093 1 


Homo sapiens 


PRO0483 


107 


86 


7765 


G02752 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6833. 


103 


43 


7766 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


168 


73 


7767 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


148 


72 


7768 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


222 


83 


7769 


Y40039 


Homo sapiens 


Peptide sequence derived from a 
human secreted protein. 


68 


72 


7770 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


J56 


62 


7771 


U93569 


Homo sapiens 


p40 


188 


43 


7772 


AL049934 


Homo sapiens 


hypothetical protein 


113 


47 


7773 


AF090931 


Homo sapiens 


PRO0483 


86 


65 


7774 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


137 


92 


7775 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


143 


65 


7776 


AF126163 


Homo sapiens 


HHLA3 protein 


141 


71 


7777 


G00423 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4504. 


93 


47 


7778 


AF090942 


Homo sapiens 


PRO0657 


148 


74 


7779 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


194 


83 


7780 


U14966 


Homo sapiens 


ribosomal protein L5 


315 


81 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 

. y 


7781 


R59843 


Homo sapiens 


ApoE4Lx2 protease. 


126 


47 




AF 1 1 9855 


Homo sapiens 


PRO 1 847 


131 


51 


~778l 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


79 


53 






Homo sapiens 


NADH dehydrogenase subunit 3 


405 


82 


7785 


G00331 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4412. 


64 


42 


77 

7786 




Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


141 


59 


7787 


AF 118082 


Homo sapiens 


PRO 1902 


131 


48 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


88 


64 






Strongylocentrot 
us purpuratus 


histone HI -delta 


1 1 1 


43 






Homo sapiens 


NADH dehydrogenase subunit 1 


139 


85 


7791 


V00662 


Homo sapiens 


cytochrome oxidase I 


522 


80 


7792 


Y36j66 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


153 


72 


7793 


G02832 




Homo sapiens 



Human secreted protein, SEQ ID NO: 
6913. 


84 


53 


~7794 


W48j5 1 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


82 


46 






Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


156 


76 


7796 


D3 8112 


Homo sapiens 


ATPase subunit 8 


166 


77 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


159 


48 




AF090931 


Homo sapiens 


PRO0483 


132 


75 


7799 


G03714 

. ... .. 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


97 


57 


800 




Homo sapiens 


similar to human 22kDa, SM22 mRNA 
(HUM22SM). 


99 


84 


7801 


X64707 


Homo sapiens 


BBC1 


232 


77 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


104 


33 


7803 


^86^4 8 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


1 12 


50 


"7804 


Y36203 


Homo sapiens 


Human secreted protein #75. 


142 


56 


7805 


AFU9851 


Homo sapiens 


PRO 1722 


88 


46 


7806 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


57 


7807 




X68600 


Hordeum 
vulgare 


pZE40 


90 


37 







Homo sapiens 


unnamed protein product 


138 


54 




nf IJUUo? 


Homo sapiens 


PRO2550 


375 


71 


7 g— 




Homo sapiens 


Human secreted protein, SEQ ID NO. 
5330. 


124 


65 






Homo sapiens 


ApoE4L 1 protease. 


156 


58 


7812 


AF0 16446 


Caenorhabditis 
elegans 


similar to C. elegans cuticulin precursor 
CUT-2 (SP:P34682) 


95 


42 


7813 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


154 


65 


7814 


AK025047 




unnamed protein product 




71 


7815 


W34499 


Homo sapiens 


Obesity receptor C protein. 


309 


89 


7816 


AJ222673 


Hepatitis C virus 
type 2c 


core protein 


95 


42 


7817 


ABO 11483 


Arabidopsis 
thaliana 


contains similarity to root cap 
protein~gene_id:MUF9. 1 6 


104 


61 


7818 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


154 


74 
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SEQ1D 

NO: 


Accession No. 


Species 


Description 


Score 


% 

Iclcntit 


7819 


X92485 


Plasmodium 
vivax 


pval 


112 


61 


7820 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


103 


60 


7821 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


174 


65 


7822 


AF130089 


Homo sapiens 


PRO2550 


107 


67 


7823 


AF 1 18086 


Homo sapiens 


PRO 1992 


143 


73 


7824 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


132 


63 


7825 


AL390114 


Leishmania 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


142 


59 


7826 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


109 


42 


7827 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


128 


65 


7828 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


104 


58 


7829 


AF132961 


Homo sapiens 


CGI-27 protein 


154 


49 


7830 


X70343 


Nicotiana 
sylvestris 


extensin 


105 


30 


7831 


AL 160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


100 


62 


7832 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


129 


51 


7S33 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


99 


46 


7834 


G02469 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6550. 


74 


65 


7835 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


163 


55 


7836 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7563. 


72 


52 


7837 


AF090942 


Homo sapiens 


PRO0657 


126 


59 


7838 


Z33426 


Human 
respiratory 
syncytial virus 


glycoprotein 


110 


34 


7839 


D63480 


Homo sapiens 


The KIAA0146 gene product is novel. 


158 


89 


7840 


W48352 


Homo sapiens 


Human breast cancer related protein 
BCFLT1. 


88 


63 


7841 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


156 


60 


7842 


W90847 


Homo sapiens 


Human lymphocyte targeted peptide 
#15. 


145 


100 


7843 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


127 


66 


7844 


M28016 


Homo sapiens 


cytochrome b 


214 


78 


7845 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


283 


82 


7846 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


94 


73 


7847 


D381 12 


Homo sapiens 


cytochrome b 


525 


83 


7848 


L27428 


Homo sapiens 


reverse transcriptase 


210 


67 


7849 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


156 


96 


7850 


X92485 


Plasmodium 
vivax 


pval 


121 


54 


7851 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


147 


59 


7852 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


191 


93 
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SEQID 

NO: 


Accession No. 


Species 


Description 


Smilh- 
Watermnn 


% 
y 


7853 


W48808 


Homo sapiens 


Homo sapiens clone CG1091 protein. 


72 


69 


7854 


Y 8624 8 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


142 


76 


7855 


U43627 


Arabidopsis 
thaliana 


extensin 


124 


35 


7856 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


133 


95 


7857 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


.95 


57 


7858 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


157 


91 


7859 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


108 


45 


7860 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


159 


54 


7861 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


155 


63 


7862 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


265 


77 


7863 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


145 


56 


7864 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


222 


100 


7865 


D381I2 


Homo sapiens 


cytochrome c oxidase subunit 1 


409 


87 


7866 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


112 


50 


7867 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


169 


74 


7868 


AL031004 


Arabidopsis 
thaliana 


RSZp22 splicing factor 


127 


35 


7869 


U83771 


Accipiter 
superciliosus 


cytochrome oxidase I 


168 


73 


7870 


U83771 


Accipiter 
superciliosus 


cytochrome oxidase I 


186 


81 


7871 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


203 


56 


7872 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


130 


71 


7873 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


106 


44. 


7874 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


138 


69 


7875 


AF 119855 


Homo sapiens 


PR01847 


174 


70 


7876 


AF2 10651 


Homo sapiens 


NAG 18 


164 


76 


7877 


Z93244 


Homo sapiens 


bKl 1 6F5.2 (PUTATIVE RhoGAP 
(CDC42 GTPAse Activating Protein) 
LIKE protein) 


111 


45 


7878 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


227 


79 


7879 


AK024455 


Homo sapiens 


FLJ00047 protein 


120 


68 


7880 


Y36156 


Homo sapiens 


Human secreted protein #28. 


85 


68 


7881 


AF1 16661 


Homo sapiens 


PRO 143 8 


138 


61 


7882 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


141 


55 


7883 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


111 


63 


7884 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


106 


64 


7885 


AK022821 


Homo sapiens 


unnamed protein product 


175 


72 


7886 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


117 


59 
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SEQ ID 

NO: 


Accession No. 




Description 


Smith- 
Waterman 
Score 


% 

Idcntif 

y 


7887 


L27428 


Homo sapiens 


reverse transcriptase 


165 


63 


7888 


G0253S 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


139 


61 


7889 


MI 5530 


Homo sapiens 


B-celi growth factor 


132 


74 


7890 


Y01181 


Homo sapiens 


Polypeptide fragment encoded by gene 
12. 


81 


64 


7891 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


150 


61 


7892 


G01249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5330. 


154 


73 


7893 


AF 130089 


Homo sapiens 


PRO2550 


224 


61 


7894 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


127 


69 


7895 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


142 


62 


7896 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


149 


50 


7897 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


108 


68 


7898 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #153. 


144 


65 


7899 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


105 


83 


7900 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


104 


57 


7901 


X03717 


Homo sapiens 


pot. unidentified reading frame 


105 


46 


7902 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


94 


67 


7903 


X61046 


Hydra sp. 


mini-collagen 


107 


46 


7904 


AF124726 


Homo sapiens 


acinusL 


111 


44 


7905 


U77049 


Ovis aries 


bactinecin 1 1 


1 16 


38 


7906 


AFI 18086 


Homo sapiens 


PRO 1992 


130 


73 


7907 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


321 


48 


7908 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


124 


83 


7909 


AC004450 


Arabidopsis 
thaliana 


putative extensin 


167 


34 


7910 


AFI 30079 


Homo sapiens 


PR02852 


110 


42 


7911 


AF090942 


Homo sapiens 


PRO0657 


144 


66 


7912 


AF090931 


Homo sapiens 


PRO0483 


96 


80 


7913 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


128 


46 


7914 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


87 


68 


7915 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


105 


75 


7916 


X92485 


Plasmodium 
vivax 


pval 


102 


67 


7917 


AF090942 


Homo sapiens 


PRO0657 


126 


76 


7918 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


108 


47 


7919 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


156 


63 


7920 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


138 


61 


7921 


L26251 


Trypanosoma 
brucei 


CR5 


95 


37 


7922 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


176 


73 
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SEQ ID 

NO: 


Accession No. 




Description 


Waterman 


Identit 

y 


7923 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


174 


61 


7924 


L27428 


Homo sapiens 


reverse transcriptase 


121 


33 


7925 


Y36156 


Homo sapiens 


Human secreted protein #28. 


137 


67 


7926 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


119 


73 


7927 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


138 


65 


7928 


Y02693 . 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


138 


60 


7929 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


166 


61 


7930 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit 111 


144 


87 


793 1 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


176 


53 


7932 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


179 


60 


7933 


V00662 


Homo sapiens 


URF 3 (NADH dehydrogenase subunit) 


279 


92 


7934 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


132 


63 


7935 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


101 


64 


7936 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


159 


55 


7937 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


320 


80 


7938 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


159 


71 


7939 


AF 144054 


Homo sapiens 


apoptosis related protein APR-4 


127 


65 


7940 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


314 


73 


7941 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1 7. 


121 


57 


7942 


AB040936 


Homo sapiens 


KIAA1503 protein 


149 


71 


7943 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


107 


83 


7944 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


74 


60 


7945 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


68 


35 


7946 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


126 


50 


7947 


AP000735 


Arabidopsis 
thaliana 


extensin protein-like 


214 


40 


7948 


AF161356 


Homo sapiens 


HSPC093 


134 


75 


7949 


AF090942 


Homo sapiens 


PRO0657 


110 


48 


7950 


AJ249395 


Globodera 
pallida 


cytochrome b 


117 


36 


7951 


AF1 16661 


Homo sapiens 


PR01438 


135 


49 


7952 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


189 


92 


7953 


Ml 5530 


Homo sapiens 


B-cell growth factor 


143 


65 




G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


137 


68 


7955 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


130 


62 


7956 


AF 130089 


Homo sapiens 


PRO2550 


153 


62 


7957 


AL 160371 


Leishmania 
major 


probable (hhv-6) ull02, variant a 
DNA, complete virion genome 


99 


48 


7958 


AF161356 


Homo sapiens 


HSPC093 


154 


56 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smilh- 
Watcrman 


% 

Idcntit 


7959 


G02832 


Horno sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


131 


57 


7960 


X92485 


vivax 


pval 


107 


53 


7961 


AF210651 


Homo sapiens 


NAG 18 


113 


77 


"fHi — 


AF 13 0051 




PRO0898 


84 






R59842 


Homo sapiens 






~68 


7964 


AF090931 


Homo sapiens 


PRO0483 


129 


56 


7965 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
ID NOT 051. 


151 




~7966 


j^gggjg 


— -. 

Homo sapiens 


Neural thread protein. 


TTo 


— 


7967 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


180 


50 


7968 


AF090894 


Homo sapiens 


PRO0113 


104 


58 


7969 


Y7 [ 040 


Homo sapiens 


Human glial fibrillary acidic protein 
GFAP mutant fragment 49. 


285 


100 


797Q 


AK001452 


Homo sapiens 




346 


56 


7971 


V00672 


Pan troglodytes 


reading frame protein 4 


134 


90 






Homo sapiens 


SEQ ID NO: 163. 










Homo sapiens 


Human breast cancer related protein 
BCRB2. 








AF1 16661 


Homo sapiens 








7975 


AF1 19882 


Homo sapiens 


PR02492 


112 


48 


7976 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 






7977 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


124 


89 


7978 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


125 




7979 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 






7980 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


289 


84 


7981 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


— 




~7982 


^ 778}6 


Rattus 

norvegicus 


PP VV-iota 1 

jr iv- v ueid i 


~Tol) 


"69 


7983 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


149 


73 


7984 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


128 


72 


7985 


R59842 


Homo sapiens 


ApoE4Ll protease. 


130 


82 


— 




Homo sapiens 










AO /OS i 


Homo sapiens 


proline-rich protein G 1 


"ITi 


~4T 


7988 


G03790 


Homo sapiens 
. 


Human secreted protein, SEQ ID NO: 


123 


53 
— 


.. .. 


AF090894 


Homo sapiens 


rKUU 1 1 j 


~To4 




7990 


Z93244 


Homo sapiens 


bKI 16F5.2 (PUTATIVE RhoGAP 
(CDC42 GTPAse Activating Protein) 
LIKE protein) 


100 


61 


7991 


AK024455 


Homo sapiens 


FLJ00047 protein 


131 j 


56 




W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 






7993 


AF210651 


Homo sapiens 


NAG 18 


133 


61 


7994 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1 7. 


92 


42 


7995 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


171 


82 


7996 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


113 


54 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

> - 


7997 


AF090928 


Homo sapiens 


PRO0470 


86 


76 


7998 


AJ251579 


thaliana 


cef protein 


1 13 


37 


7999 


G0379O 


Homo sapiens 


Human secreted protein, SEQ ID NO: . 
7871. 


119 


60 


8000 


R95913 


Homo sapiens 


Neural thread protein. 


105 


70 


8001 


AF229067 




PADI-H protein 




61 


8002 


WS8609 


Homo sapiens 
— 


Secreted protein encoded by gene 76 • 

.U-o UFIYI -71 

cione rir».ij\L-/j. 


83 


65 


8003 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


85 


61 


8004 


S79410 


Mas musculus 


nuclear localization signal binding 
protein 


123 


43 


8005 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


93 


68 


8006 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


147 


64 


8007 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


147 


38 






Homo sapiens ■ 


Human secreted protein, SEQ ID NO: 
7795. 


150 


72 


8009 


/\JrVJUU010 


Oryza sativa 


similar to RING-H2 finger protein 
K-ri/Y 1 a (^ArU / Soto ) 


123 


71 


8010 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


92 


54 


8011 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


59 


8012 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


66 


8013 


AF161356 


Homo sapiens 


HSPC093 


185 


64 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


107 


54 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
4494. | 


.71 


65 






Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. • 


139 


66 






Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


94 


77 








Human breast cancer related protein 
BCRB2. 


108 


46 




G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


178 


61 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7720. 


85 


65 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


108 


51 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


202 


77 


8023 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


195 


68 


8024 


G03786 


Homo sapiens 


Human secreted protein, SEQ. ID NO: 
7867. 


158 


53 


8025 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


58 


8026 


X92485 


Plasmodium 
vivax 


pval 


128 


56 


8027 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


135 


61 


8028 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


105 


66 


8029 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. " 


126 


59 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identic 

y 


8030 






7874. 






8031 


G02455 




6536. P 


126 




8032 


G04072 


Homo sapiens 


Human secreted protein, SEQ JD NO: 
8153. 


125 


41 


8033 


Y40039 


Homo sapiens 


Peptide sequence derived from a 
human secreted protein. 


113 


69 


8034 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


102 


66 


8035 


Y86248 


Homo sapiens 


Humsn secreted protein UCHPF6S, 
SEQ ID NO: 163. 


221 


95 




AF041330 










8037 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


55 


8038 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 

.| ni ,B TJ/— A TO 1 

cjone ri v^yAv^J o i . 


108 


64 









Homo sapiens 


neuronal thread protein AD7c-NTP 


— — 

-ii^ 


— 




G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 


~ 

86 


8041 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein $153. 


170 


56 


8042 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


95 


58 


8043 


AK024455 


Homo sapiens 


FLJ00047 protein 


152 


68 




AF220264 










8045 


AK024455 


Homo sapiens 


FJLJ00047 protein 


~T51 


~5l 


8046 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein $153. 


181 


78 


8047 


Y27868 


Homosa iens 

p en 


gene No. 107. 


147 


61 


8048 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 


102 


43 


8049 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


117 


59 




G02538 




Human secreted protein SEQ ID NO* 
6619. 








D00570 










8052 


AF040964 


Homo sapiens 


unknown protein IT1 


143 


50 


8053 




Homo sapiens 


Human secreted protein 
SEQ ID NO: 163. 


J 68 


8j 


8054 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


108 


70 


8055 


G02538 


m ap 


Human secreted protein, SEQ ID NO: 
6619. 


134 


62 


8056 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


1 12 


65 


8057 


AF090895 


Homo sapiens 


PRO01 17 


1 10 


77 


8058 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


124 


90 


8059 


AF1 18082 


Homo sapiens 


PRO 1902 


1 17 


50 


8060 


AK022217 


Homo sapiens 


unnamed protein product 


145 


53 


8061 


AF 130089 


Homo sapiens 


PRO2550 


160 


76 


8062 


AF119851 


Homo sapiens 


PRO 1722 


177 


54 


8063 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


362 


73 


8064 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


426 


85 


8065 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


344 


90 


8066 


AF068294 


Homo sapiens 


HDCMB45P 


146 


60 


8067 


V00662 


Homo sapiens 


URF 3 (NADH dehydrogenase subunit) 


461 


87 


8068 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


504 


90 ) 



345 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 
._. 


Accession No. 

■ 




Description 


Smith- 

Waternian 

Score 


Idcntit 

y 


8069 


X8603 ] 


Sus scrofa 


homologue of proline/arginine-rich 
antibacterial protein 


97 


40 


8070 


R59842 


Homo sapiens 


ApoE4Ll protease. 


147 


47 




AF1 19851 


Homo sapiens 


PRO 1 722 


3 39 


59 


8072 


X92485 


Plasmodium 


pval 


183 


45 


8073 


AF1 16712 


Homo sapiens 


PR02738 


95 


52 




AF213465 


Homo sapiens 


dual oxidase 


184 


so 


8075 


AF090895 


Homo sapiens 


PRO0117 


153 


57 


8076 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


162 


73 


8077 


T85403_cdl 


Homo sapiens 


19-APR-1993 Human cadherin-11 
coding sequence. 


284 


98 


8078 


Y64890 


Homo sapiens 


Human 5" EST related polypeptide SEQ 
1DNO:1051. 


99 


63 


8079 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


162 


73 


8080 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


142 


52 


8081 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


130 


37 


8082 


X05285 


Drosophila 
melanogaster 


fibril larin 


100 


34 


8083 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


142 


65 


8084 


G02216 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6297. 


380 


82 


8085 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


146 


64 


8086 


X92485 


Plasmodium 
vivax 


pval 


138 


64 


8087 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


117 


82 




D381 12 


Homo sapiens 


NADH dehydrogenase subunit 5 


180 


79 


8089 


U27143 


Homo sapiens 


protein kinase C inhibitor-I 


119 


41 


8090 


AF1 19851 


Homo sapiens 


PRO! 722 


85 


47 


8091 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


112 


73 


8092 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


133 


64 


8093 


AF1 19854 


Homo sapiens 


PRO 1843 


522 


100 


8094 


X92485 


Plasmodium 
vivax 


pval 


115 


82 


8095 


D38112 


Homo sapiens 


NADH dehydrogenase subunit I 


598 


86 


8096 


AF097473 


Mus musculus 


ORF1 


109 


46 


8097 


AF097473 


Mus musculus 


ORF1 


109 


46 


8098 


Y17833 


Human 
endogenous 
retrovirus K 


pol protein 


281 


73 


8099 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


156 


69 




G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


126 


42 


8101 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


126 


65 


8102 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


145 


70 


8103 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


163 


71 
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SEQ ID 


Accession No: 


Species 


Description 


Smith- 


% 


NO: 










IdenM 










Score rm!ln 




8104 


AFI30089 


Homo sapiens 


PRO2550 


130 


42 


8105 


G0341 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


57 








7492. 






8106 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 


132 


61 








derived protein #153. 






8107 


AF090942 


Homo sapiens 


PRO0657 


123 


41 


8108 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


134 


38 


8109 


AL160371 


Leishmanial 


probable (hhv-6) ul 102, variant a 


101 


77 








DNA, complete virion genome 






8110 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


340 


90 


8111 


AF130089 




PRO2550 


123 


86 


81 12 


AP000616 


Oryza sati va 


similar to RTNG-H2 finger protein 


122 


90 








RHAla (AF078683) 






81 13 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


182 


47 








6953. 






81 14 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


77 








4435. 






8115 


G00452 


Homo sa iens 

omo sapi ns 


Human secreted protein, SEQ ID NO: 


72 


50 








4533. 






81 16 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


459 


88 


81 17 


L 1 73 1 8 




proline-rich proteoglycan 


1 10 


45 






norvegicus 








g . . g 


S794 . Q 


Mus musculus 


nuclear localization si nal bindin 


125 


62 








protein ' 






g . -o 


G03639 


~H : 

omo sapiens 


Human secreted protein SEQ ID NO" 


~7o6 


~7S 








TJt?^,, j-t— i r — 








-|— 


D3gll2 


— ; 

Homo sapiens 


NADH dehydrogenase subunit 1 


"Tnrl 


— 




GQ7490 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


— - 


"J? 

















D86854 


ih 

a aran us 


— 

ex ensm 


T42 


— 














g]23 


x92485 


Plasmodium 


— val 

pva 


T44 


— 






vivax 








8124 


G03789 


Homo sa iens ' 

omo sapien 


Human secreted protein, SEQ ID NO: 


138 


59 








7870. 






~8125 


G02532 


Homo a iens 

omo sapiens 


Human secreted protein, SEQ ID NO: 


163 


~67 








6613. 






8126 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


145 


62 








gene 22 clone HMSJW1 8. 






8127 


X07881 


Homo sapiens 


proline-rich protein Gl 


142 


37 


8128 


X70343 


Nicotiana 


extensin 


102 


29 






sylvestris 








8129 


W4835 1 


Homo sapiens 


Human breast cancer related protein 


113 


48 








BCRB2. 






8130 


AF090895 


Homo sapiens 


PRO01 17 


150 


7] 


8131 


D00570 


N4us musculus 


open reading frame (251 AA) 


130 


41 


8132 


G02532 


Homo sapiens 


Human secreted protein SEQ ID NO* 


106 


61 








6613. 






"8133 


Y 14482 


Homosa ien 

omo sapiens 


encode^Dy genTl 7 Se ° reted P rotein 


"ns 


in — 














8134 


AF130079 


Homo sapiens 


PR02852 


181 


77 


8135 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


58 








7795. 






8136 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


64 








7437. 






8137 


W50192 


Homo sapiens 


Amino acid sequence of salivary 


112 


43 








protein CON-1. 






8138 


AF 119900 


Homo sapiens 


PR02822 


113 


57 


8139 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


427 


86 
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SEQID 

NO: 


Accession No. 




Description 


Smith- 
Waterman 


Identit 






Homo sapiens 


cytochrome c oxidase subunit III 


155 


88 


8141 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


163 


58 


8142 


D70836 


Homo sapiens 


Zinc-finger protein 


156 


47 


8143 


AC00229 1 


Arabidopsis 
thaliana 


extensin 


153 


37 


8144 


X55683 


Lycopersicon 
esculentum 


extensin (class I) 


85 


34 


8145 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


100 


76 


8146 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


108 


63 


8147 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


110 


79 


8148 


M22332 


Homo sapiens 


unknown protein 


156 


40 


8149 


U23455 


Caenorhabditis 
elegans 


similar to D. melanogaster homeotic 
protein BarH2 (PIR:A41726) 


92 


76 


8150 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4497. 


131 


66 


8151 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


441 


76 


8152 


G02S72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


155 


67 


8153 


AF130089 


Homo sapiens 


PRO2550 


129 


42 


8154 


AF 1 30089 


Homo sapiens 


PRO2550 


173 


50 


8155 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


140 


64 


8156 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


171 


82 


8157 


L27428 


Homo sapiens 


reverse transcriptase 


181 


50 


8158 


U11271 


Homo sapiens 


thromboxane A2 receptor 


138 


73 


8159 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


145 


53 


8160 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


139 


58 


8161 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


155 


59 


8162 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


141 


64 


8163 


AF161361 


Homo sapiens 


HSPC098 


89 


66 


8164 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


117 


58 


8165 


AF130089 


Homo sapiens 


PRO2550 


121 


63 


8166 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


148 


85 


8167 


X92485 


Plasmodium 
vivax 


pval 


97 


56 


8168 


X92485 


Plasmodium 
vivax 


pval 


115 


57 


8169 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


124 


61 




A rrt I A 1 A A 

Ar010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


123 


60 


8171 


X92485 


Plasmodium 
vivax 


pval 


105 


44 


8172 


Y 19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


75 


64 


8173 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


106 


65 


8174 


X86031 


Sus scrofa 


homologue of proline/arginine-rich 
antibacterial protein 


108 


44 
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SKQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


8175 


Y36203 


Homo sapiens 


Human secreted protein #75. 


150 


51 


8176 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


1 14 


41 


8177 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


135 


68 


8178 


AF090928 


Homo sapiens 


PRO0470 


119 


72 


8179 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
S ene22cloneHMSJW18. 


141 


75 


8180 


AF220264 


Homo sapiens 


MOST-1 


94 


61 


8181 


AF1 19851 


Homo sapiens 


PRO 1722 


143 


73 


8182 


W48351 


Homo sapiens 


BCRB2. ^ 


134 


64 


~8\S3 


"G0""4l"i 


: 

omo sapiens 


Human secreted protein SEQ ID NO* 
7492. 


—p; 


— 


8184 


X92485 


Plasmodium 


pval 


127 


72 


8185 


X92485 


Plasmodium 


pval 


130 


65 


8186 


AF1 19851 


Homo sapiens 


PRO 1722 


285 


64 


81 87 


AF090942 




PRO0657 


137 


38 


8188 


AF090931 


Homo sapiens 


PRO0483 


154 


66 


8189 


Y0269j> 


Homo sapiens 


gene 44 clone HTDAD22. 


148 


43 


8190 


U49973 


Homo sapiens 


similar to pogo element 


268 


79 


8191 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


1 10 


43 


8192 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


129 


78 


8193 


AF1 13685 


Homo sapiens 


PRO0974 


86 


52 


8194 


Y36203 






132 




8195 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


225 


77 





S801 19 





homoto^uTcrf'^ rich 


105 


— 


gl97 


X86031 


Sus scrofa 

us scro a 


ant^acter^rprotein 16 ar8mme ~ nc 


124 


"39" 


8198 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


121 


60 


8199 ' 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


91 


68 


8200 


Y36203 


Homo sapiens 


Human secreted protein #75. 


131 


56 


8201 


Y36203 


Homo sapiens 


Human secreted protein #75. 


136 


44 


8202 


AF010144 


Homo sapiens 


neuronal thread protein AT")7c-NTP 


166 


48 


8203 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


125 


50 


S204 


Y36203 


Homo sapiens 


Human secreted protein #75. 


127 


58 


8205 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7324. 


139 


61 


"8206 


Y0267 ^ 


Homo sapiens 


gene 22 clone HMSJW18. 


190 


~92 


8207 


AF09093 1 




PRO0483 


J60 




8208 


AF126163 


Homo sapiens 


HHLA3 protein 


107 


67 


8209 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


93 


41 


8210 


Y36156 


Homo sapiens 


Human secreted protein #28. 


100 


62 


8211 


AF161356 


Homo sapiens 


HSPC093 


73 


60 


8212 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


107 


34 


8213 


AB046048 


Macaca 


unnamed portein product 


146 


61 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Idcntit 










--Score 


y 






— : : 

fascicularis 











AT I JUU iy 


Homo sapiens 




71 







Y7j966 


Homo sapiens 


Human prostate tumor EST fragment 


~T59 


~ 














— — — 






— — : 

Homo sapiens 




— 

"Tfil 


-— 


8217 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO". 




— — 

60 








TJ^ 5 ' — r : — -j : 






- 





Homo sapiens 


Human breast cancer related protein 


-— 


— - 








BCRB2. — 









ivi J J I 


-= 


proJine-ricn salivary protein 


~uyy 


~Z5 






norvegicus 




-— 









Homo sapiens 


Human secreted protein, SEQ ID NO: 




— - 








Q^^jl 









x - 22 ^ 5 


Homo sapiens 




Ud 







|r ~ — 


Ar 1 jUUiSV 


Homo sapiens 




T59 


— - 




ArUoctzy4 


Homo sapiens 




161 




8224 


M623 87 


Oryctolagus 


ubiquitin conjugating-protein 


219 


90 






cuniculus 








8225 


AF1 13685 


Homo sapiens 


PRO 09 74 


1 j6 


65 


8226 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 3 


430 


85 


8227 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


1 15 


58 








encoded by gene 90. 






8228 


Y36156 


Homo sapiens 


Human secreted protein #28. 


117 


50 


8229 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


1j0 


44 


8230 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


50 














8231 


AP000616 


. - 

Oryza sativa 


similar to RJNG-H2 finger protein 


105 


46 








RHAla (AF078683) 






8232 


S6^928 


... . _ 
Homo sapiens 


PRB1M protein precursor 


} lj 






AB047600 


Macaca 


hypothetical protein 


42 


- 






fascicularis 


- 








8 


Homo sapiens 


Human secreted protein. SEQ ID NO: 


104 


48 








6953 - : : 






— 




V 12 . 69 . 5 


. 

Pan paniscus 


cytochrome oxidase subunit II 




259 


84 




K0 1 664 


Drosophila 


Bkm-Iike protein 


116 


61 






melanogaster 










8237 


, 

AF1 16661 


Homo sapiens 


r,^, a 

PRO 1 43 8 


; 

135 




56 


8238 


P70494 


Homo sapiens 


Sequence of human B-cell growth 


107 


53 








factor (BCGF). 










Homo sapiens 




168 


49 ....... 


8240 




Homo sapiens 


Human secreted protein encoded by 


100 


72 








gene cione rxivisj wio. 






— — — 

.. 824 J. . 


Z70^92 


Homo sapiens 


chemokine CC- 1 


— — 


. 

83 


8242 


AL 160493 


Leishmania 


probable (hhv-6) ul 102, variant a 


1 19 


82 






major 


DNA, complete virion genome 






8243 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


97 


83 








^r?r\ T"T\ A1/"V 1 A.'i 

JC,K^ ID 1NI_A joj. 






8244 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


1 14 


78 


8245 


G00338 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


65 








4419. 






8246 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 


74 


60 








protein fragment 59. 






8247 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


115 


58 








gene 22 clone HMSJW 18. 






8248 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


58 








6619. 






8249 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


84 








7870. 






8250 


AF 118082 


Homo sapiens 


PRO 1902 


136 


50 
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NO: 


Accession No. 


Species 


Description 


Waterman 


y 


8251 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


196 


59 


8252 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


106 


81 


8253 


AF130089 


Homo sapiens 


PRO2550 


143 


53 


8254 


AK024455 


Homo sapiens 


FLJ00047 protein 


151 


68 


8255 


AK022821 


Homo sapiens 


unnamed protein product 


140 


68 


8256 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


143 


53 


8257 


M24509 


Mus musculus 


ferritin heavy chain 


347 


69 


8258 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


142 


57 


8259 


U93570 


Homo sapiens 


p40 


254 


57 


8260 


AF2 18028 


Homo sapiens 


unknown 


129 


71 


8261 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


77 


68 


8262 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


604 


90 


8263 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


132 


60 


8264 


AL1 39298 


Streptomyces 
coelicolor A3(2) 


putative membrane protein 


92 


34 


8265 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


138 


70 


8266 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


100 


64 


8267 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


113 


38 


8268 


X92485 


Plasmodium 
vivax 


pval 


102 


63 


8269 


X61048 


Hydra sp. 


mini-collagen 


110 


46 


8270 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


88 


82 


8271 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


121 


75 


8272 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


107 


48 


8273 


AF090944 


Homo sapiens 


PRO0663 


122 


42 


8274 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


81 


85 


8275 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
7871. 


140 


54 


8276 


AF 130089 


Homo sapiens 


PRO2550 


275 


56 


8277 


L27428 


Homo sapiens 


reverse transcriptase 


105 


27 


8278 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


112 


54 


8279 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5327.' 


159 


73 


8280 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


109 


71 


8281 


AF I 04921 


Homo sapiens 


succinyl-CoA synthetase alpha subunit 


188 


42 


8282 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


117 


39 


8283 


X55683 


Lycopersicon 
esculentum 


extensin (class I) 


96 


36 


8284 


AF225910 


Mus musculus 


DAZ-associated protein 1 


117 


33 


8285 


AF130089 


Homo sapiens 


PRO2550 


347 


75 


8286 


AB046100 


Macaca 
fascicular is 


unnamed protein product 


140 


45 


8287 


X55684 


Lycopersicon 


extensin (class I) 


67 


45 
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SEQ ID 
MO: 


Accession No. 




Description 


Smith- 

VVaterman 

Score 


% 

Uentit 

y 






esculentum 








— — 

8288 


At /U 


Homo sapiens 


■ j : 

Rad5 lO truncated protein 




-— 






Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 




-~ 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 






8291 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


50 


8292 


D89999 


Gallus gallus 


muscle derived protein 


1134 


66 




LI 1566 


Homo sapiens 


ribosomal protein LI 8 







8294 


W61 170 


Homo sapiens 


Partial polypeptide sequence from 
CP 140 partial cDNA. 


"tfr 








Homo sapiens 


HPlHs alpha=25 kda chromosomal 
autoantigen 






8296 


AF1 1 9882 


Homo sapiens 


PR02492 






8297 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


156 


54 


829S 


AK025116 


Homo sapiens 


unnamed protein product 


259 


82 


8300 


X92485 


Plasmodium 


pval 


162 


47 


8301 


G00397 

L. . 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
44 „*i; ; 


157 


76 


8302 


AF092176 


— = : 

Homo sapiens 


MHC class II antigen 




280 


T 

_. 84 


8j03 


AF 1 1 8082 


Homo sapiens 


PRO 1902 


174 ._ 




8304 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 




8305 


AK023563 


Homo sapiens 


unnamed protein product 


244 


60 


8j06 


G0j714 


Homo sapiens 
— 


Human secreted protein, SEQ ID NO: 

_ZZ£jL . 


136 


66 


~8307 


y 1564 j 


Mus museums 


reverse transcriptase 


To7 


~38 


8308 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


110 


40 


8309 




Rattus rattus 


Dl"T A ID 
rL I J-\ 1 rvxlZ 




~H — 






Mus musculus 


Ig heavy chain precursor 







8311 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


82 






— : 

Homo sapiens 


Human secreted protein C\v38j 1. 


"^12 


Too — 


8313 


AFl 13685 


Homo sapiens 


PRO0974 


116 


39 


83 14 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 






8315 


Z28407 


Homo sapiens 


ribosomal protein L8 




78 


83 16 


AE003536 


Drosophila 
melanogaster 


CG6451 gene product 


237 


51 


8317 


M19419 


Mus musculus 


proline-rich salivary protein 


115 


39 


8318 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 






8319 


Y29862 


Homo sapiens 


Human secreted protein clone 
du515 21. 


399 


81 


8320 


Z34278 


Homo sapiens 


mucin 


103 


25 


8321 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) u3102, variant a dna, 
complete virion genome. 


142 


91 


8322 


Y 19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


82 


62 


8323 


Z46791 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF01391 (Collagen triple helix repeat 
(20 copies)), Score=67.1, E- 
value=1.2e-16,N=3; PF01484 
(Nematode cuticle collagen N-terminal 
domain), Score=82.0, E-value=3.9e-21, 
N=l 


122 


37 


8324 


AF021232 


Homo sapiens 


TRAIL-R4-A 


113 


100 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 
y 


8325 


AB0460S5 


Macaca 
fascicularis 


unnamed protein product 


292 


75 


8326 


AF 113944 


Muntiacus 
muntjak 


prion protein 


96 


39 


8327 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


125 


63 


8328 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


123 


62 


8329 


AJ005897 


Homo sapiens 


JM5 


275 


73 


S330 


AF 119900 


Homo sapiens 


PR02822 


138 


76 


8331 


AF163573 


Homo sapiens 


CARKL 


359 


84 


8332 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


103 


37 


S333 


AF026246 


Homo sapiens 


HERV-E integrase 


354 


78 


8334 


J03756 


Homo sapiens 


hGH-V2 


362 


82 


8335 


AB001517 


Homo sapiens 


KNP-I alpha protein 


126 


76 


8336 


X97490 


Mus musculus 


PNG protein 


469 


98 


8337 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


159 


74 


8338 


V18354_cdl 


Homo sapiens 


17-NOV-1992 Human R20 seven 
transmembrane (7TM) receptor 
genomic DNA. 


379 


92 


8339 


V18354_cdl 


Homo sapiens 


17-NOV-1992 Human R20 seven 
transmembrane (7TM) receptor 
genomic DNA. 


402 


93 


8340 


Z11502 


Homo sapiens 


intestine-specific annexin 


385 


89 


8341 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


110 


54 


8342 


AC003113 


Arabidopsis 
thaliana 


F240I.6 


68 


50 


8343 


X12517 


Homo sapiens 


C protein (AA 1-159) 


321 


82 


8344 


D86549 


Homo sapiens 


p97 homologous protein 


279 


96 


8345 


Y73384 


Homo sapiens 


HTRM clone 2284580 protein 
sequence. 


1513 


100 


S346 


AP001745 


Homo sapiens 


human cDNA DKFZp586F0422, 
Accession No. AL050173 


118 


40 


8347 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


121 


49 


8348 


AF091090 


Homo sapiens 


unknown 


125 


44 


8349 


AF 106542 


Homo sapiens 


pregnancy-specific beta-1 glycoprotein 
1 


145 


100 


8350 


AF090942 


Homo sapiens 


PRO0657 


252 


65 


8351 


L23116 


Homo sapiens 


galctocerebrosidase 


185 


85 


8352 


AJ243803 


Streptomyces 
coelicolor A3(2) 


hypothetical protein 


97 


36 


8353 


Y11339 


Homo sapiens 


GalNAc alpha-2,6-sialyltransferase I 


306 


100 


8354 


U09477 


Homo sapiens 


p53-binding protein 


388 


100 


8355 


L27584 


Homo sapiens 


Ca channel B3 subunit 


197 


100 


8356 


AF071569 


Homo sapiens 


multifunctional calcium/calmodulin- 
dependent protein kinase II delta2 
isoform 


574 


100 


8357 


W54282 


Homo sapiens 


Protein sequence of the di-alpha 
haemoglobin gene contained in pSS I . 


321 


81 


8358 


Y12916 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


759 


100 


8359 


AF038961 


Homo sapiens 


SL 1 5 protein 


166 


100 


8360 


W75093 


Homo sapiens 


Human secreted protein encoded by 
gene 37 clone HFVGS85. 


328 


90 


8361 


R95326 


Homo sapiens 


Tumor necrosis factor receptor 1 death 
domain ligand (clone 2DD). 


386 


97 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 


8 j 62 


.. ., 

AF026954 


Bos taums 


1 d h d a k — in 

re^utatory subunif precursor ^DPr^ 


269 


88 


— ~ 


X07855 


— — 


G-Protein 


289 


98 


Pll — 


AT 1 1 COA~l 




hypothetical protein 


265 


97 





A Vtr\d£.17A 
ADU^O J /** 


Homo sapiens 


K1AA 1 554 protein 


3619 


99 


8366 


M10050 


Homo sapiens 


fatty acid binding protein 


97 


100 


8367 




Homo sapiens 


vdupT cyspecific beta '£ 1 y c °P rotein e 


1097 


99 




aesuj iyu i 


Homo sapiens 




662 


98 


8369 


U72678 


Mus musculus 


EF-9 


176 


65 


8370 


AL1 17237 


Homo sapiens 


hypothetical protein 


~To~l 


60 


8371 


M10617 


Homo sapiens 


L-FABP — _ 




84 


8372 


AJ249980 


Homo sapiens 


hypothetical protein 


a — 




8373 


AF003136 


Caenorhabditis 
elegans 


contains weak similarity to an AMP- 
binding motif - 


~242 


r ~5~5~ 


8374 


X91638 


Gallus gallus 


BRM protein 




|_90 


8375 


W75093 


Homo sapiens 


Human secreted protein encoded by 
gene 37 clone HFVGS85. 






8376 


Z36714 


Homo sapiens 


cyclin F 


519 


95 


8377 


X86401 


Homo sapiens 


L-arginine: glycine amidinotransferase 






8378 


M12413 


Homo sapiens 


apolipoprotein B100 


1278 


99 


8379 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 






8380 


AF151860 


Homo sapiens 


CGI- 102 protein 






8381 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


255 


92 

— 


8382 


AF017807 


Homo sapiens 


Arp2/3 complex 16kDasubunit 






8383 


U28727 


Homo sapiens 


pregnancy-associated plasma protein-A 
preproform 


196 


97 


8384 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


128 


85 


8385 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


203 


69 


8386 


AF090931 


Homo sapiens 


PRO0483 


145 


77 


8387 


S63758 


Mus sp. 


metallothionein-I gene transcription 
activator 


324 


98 


8388 


W89030 


Homo sapiens 


Polypeptide fragment encoded by gene 
165. 


174 


66 


8389 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO:. 
4581. 


93 


70 


8390 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 






8391 


V29245_cdl 


Homo sapiens 


15-NOV-1996 Nucleotide sequence of 
the human selenium-binding protein. 






8392 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 






8393 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


99 


77 


8394 


AK023563 


Homo sapiens 


unnamed protein product 


285 


75 


8395 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 






8396 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


121 




8397 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 






8398 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


57 


8399 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


108 


63 


8400 


W88607 


Homo sapiens 


Secreted protein encoded by gene 74 
clone HHSEG23. . 


146 


. 55 



354 
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NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 
y 


8401 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


90 


69 


8402 


AF1 19851 


Homo sapiens 


PRO 1722 


133 


65 


8403 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


67 


76 


8404 


L06498 


Homo sapiens 


ribosomal protein S20 


170 


94 


8405 


G037I4 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


58 


8406 


AF 118086 


Homo sapiens 


PRO 1992 


142 


80 


8407 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


153 


81 


8408 


AF090919 


Homo sapiens 


PRO0327 


96 


74 


8409 


G0253S 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


127 


62 


8410 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


150 


100 


8411 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


103 


75 


8412 


AF116715 


Homo sapiens 


PR02829 


158 


76 


8413 


AF229067 


Homo sapiens 


PADI-H protein 


137 


56 


8414 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


131 


77 


8415 


AF068294 


Homo sapiens 


HDCMB45P 


132 


54 


8416 


W34499 


Homo sapiens 


Obesity receptor C protein. 


251 


77 


8417 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


272 


57 


8418 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


1 13 


54 


8419 


AK025047 


Homo sapiens 


unnamed protein product 


128 


58 


8420 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


130 


46 


8421 


AL 132841 


Caenorhabditis 
elegans 


Y15E3A.3 


135 


56 


8422 


AF1 30051 


Homo sapiens 


PRO0898 


164 


72 


8423 


G00092 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4173. 


161 


96 


8424 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


108 


100 


8425 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


134 


75 


8426 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


91 


48 


8427 


AF 194641 


Homo sapiens 


immunoglobulin lambda light chain 
variable region 


174 


69 


8428 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


146 


68 


8429 


M64322 


Homo sapiens 


protein tyrosine phosphatase 


201 


100 


8430 


AP000616 


Oryza sativa 


similar to RTNG-H2 finger protein 
RHAla(AF078683) 


135 


55 


8431 


AF090931 


Homo sapiens 


PRO0483 


158 


87 


8432 


J04495 


Macaca mulatta 


alpha-globin 


150 


100 


8433 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


150 


78 


8434 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW 18. 


137 


48 


8435 


AJ272324 


Homo sapiens 


adaptor molecule- 1 


333 


100 


8436 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


91 


48 


8437 


Ml 8085 


Homo sapiens 


glycoprotein lib 


154 


TOO 



355 
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NO: 








Waterman 
Score 


Idcntit 

y 


8438 


X92485 


Plasmodium 
vivax 


pval 


138 


64 


8439 


G03136 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7217. 


126 


56 


8440 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


105 


72 


8441 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


122 


55 


8442 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


139 


69 


8443 


Y36203 


Homo sapiens 


Human secreted protein #75. 


103 


70 


8444 


AB034633 


Mus musculus 


deubiquitinating enzyme 


129 


100 


8445 


U01849 


Trypanosoma 
brucei 


ORF2 


95 


41 


8446 


J05497 


Rattus 
norvegicus 


snRNP-associated polypeptide N 


115 


37 


8447 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


137 


50 


8448 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


222 


100 


8449 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


168 


94 


8450 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


152 


63 


8451 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


162 


91 


8452 


W78226 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1. 


804 


97 


8453 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


109 


100 


8454 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


138 


88 


8455 


AF099505 


Homo sapiens 


colon carcinoma related protein 


100 


63 


8456 


AF217511 


Homo sapiens 


uncharacterized bone marrow protein 
BM034 


130 


96 


8457 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


155 


70 


8458 


LI 7007 


Didelphis 
marsupiaiis 


protamine 1 


101 


80 


8459 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


145 


62 


8460 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


121 


75 


8461 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


110 


81 


8462 


AF016371 


Homo sapiens 


U-snRNP-associated cyclophilin 


108 


70 


8463 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


80 


93 


8464 


AF130089 


Homo sapiens 


PRO2550 


190 


65 


8465 


AF197913 


Helicoverpa 
armigera nuclear 
polyhedrosis 
virus 


basic DNA-binding protein BDBP 


159 


55 


8466 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


183 


70 


8467 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


97 


60 


8468 


AF1 18082 


Homo sapiens 


PRO 1902 


72 


42 


8469 


AF2 17449 


Schistosoma 
mekongi 


NADH dehydrogenase subunit 6 


107 


40 


8470 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


75 



356 
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SEQID 
NO: 


Accession No. 




Description 


Smith- 
Waterman 


% 

Identit 

y 








7339. 






8471 


AF09093 1 


Homo sapiens 


PRO0483 


161 


79 


S472 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


127 


56 


8473 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795 


137 


50 


8474 


AF0227S9 


Homo sapiens 


ubiquitin hydroly2ing enzyme 1 


140 


100 


8475 


X03 145 




pot. ORF VI 






8476 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




152 


65 


8477 


R59842 


Homo sapiens 


Apoc4L 1 protease. 


"773 


-_ 


8478 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

pT 95 " — — -. 


112 


56 


8479 


K01 664 


Drosophila 

melanogaster 


J3Km-like protein 


~140 


. - 


8480 


G02455 




6536. 






8481 


AF09093 1 




PRO0483 


-p5^ 


-|Z — 


8482 


Y27854 


Homo sapiens 


gene No. 101. 






8483 


M69297 


Homo sapiens 


ORF 3 







8484 


G02532 




6613. 




~~7"2" 


8485 


G03793 




Human secreted protein, SEQ ID NOi 
7874. 






8486 


AF032906 




cathepsin Z precursor, CTSZ 






8487 


X15917 




Paramecium 
" re ia -. 


ND2 protein (AA 1-193) (unusual start 
codon) 


94 


35 


~8488 




Homo sapiens 


putative p 1 50 


109 


35 


~8489 


GO" 807 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7888. 




72 


8490 


X92485 


Plasmodium 


pval 


120 


85 


849] 


Y02693 


Homosa iens 

omo sapiens 


Human secreted protein encoded bv 

gene 44 clone HTDAD22. 


— — 

109 


42 


8492 


Ml 5530 


Homo sapiens 


B-cell growth factor 


126 


~64 


8493 


AF090895 


Homo sapiens 


PRO0117 


123 


~60 


8494 


G02403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6484. 


166 


100 


8495 


AF1 16661 


Homo sapiens 


PRO 143 8 


126 


63 


8496 


AF1 30089 


Homo sapiens 


PRO2550 






8497 


AF090895 


Homo sapiens 


PRO0117 


113 


75 


8498 


AF090930 


Homo sapiens 


PRO0478 


~T1§ 




8499 


AF126163 




nnL/Hj protein 




-~ 


8500 


G02360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

j p : : 


109 


75 


-g501 


S794 1 0 


Mus musculus 


nuclear localization signal binding 
protein 


98 


48 




ATT sQ7»9 


1 rypanosoroa 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 




63 






Homo sapiens 


PROl 722 




53 


8504 


G03714 


Homo sapiens 


Human secreted protein SEQ ID NO* 
7795. 


-~ 




8505 


AF090931 


Homo sapiens 


PRO0483 


128 


50 


8506 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


126 


56 


8507 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


154 


82 


8508 


X92485 


Plasmodium 


pval | 


83 


70 
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SEQID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

identit 

y 






vivax 








8509 


AL3901 14 


Leishmania 
major 


extremely cysteine/valine rich protein 


144 


50 


8510 


AL 160493 


Leishmania 
major 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


124 


86 


8511 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


141 


51 


8512 


AF229067 


Homo sapiens 


PADI-H protein 


155 


76 


8513 


G02S72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


170 


68 


8514 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


157 


66 


8515 


AF090930 


Homo sapiens 


PRO0478 


88 


80 


8516 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


134 


59 


8517 


K01 664 


Drosophila 
melanogaster 


Bkm-like protein 


102 


92 


8518 


M31145 


Homo sapiens 


insulin-like growth factor binding 
protein precursor 


460 


96 


8519 


M18193 


Homo sapiens 


inter-alpha-trypsin inhibitor heavy 
chain old gene name 'ITT 


118 


41 


8520 


AF 116689 


Homo sapiens 


PR02168 


127 


92 


8521 


AF1 30051 


Homo sapiens 


PRO0898 


133 


85 


8522 


L27428 


Homo sapiens 


reverse transcriptase 


201 


72 


8523 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


108 


74 


8524 


AF161356 


Homo sapiens 


HSPC093 


127 


60 


8525 


Y02926 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 101. 


95 


83 


8526 


AF1 18086 


Homo sapiens 


PRO 1992 


95 


64 


8527 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


108 


50 


8528 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


154 


62 


8529 


AF 130089 


Homo sapiens 


PRO2550 


150 


70 


8530 


Y03000 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


111 


55 


8531 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


108 


65 


8532 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
7881. 


172 


59 


8533 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


395 


91 


8534 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


100 


45 


8535 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


107 


95 


8536 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


128 


89 


8537 


Z28407 


Homo sapiens 


ribosomal protein L8 


242 


79 


8538 


AF 1 30089 


Homo sapiens 


PRO2550 


139 


71 


8539 


Q55625_cdl 


Homo sapiens 


22-JUN- 1 992 Human beta globin 5'- 
UTR-CDS-3'-UTR. 


274 


76 


8540 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


154 


61 


8541 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


69 


8542 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


149 


64 


8543 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


240 


61 
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SEQ ID 

NO: 


Accession No. 




Description 


Waterman 

_ .Score 


Identit 

y 








6953. 






8544 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


1 17 


85 


8545 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


1 10 


95 


8546 


AF1 19851 


Homo sapiens 


PRO 1722 


146 


80 


8547 


U49973 


Homo sapiens 


ORF2: function unknown 


105 


86 


8548 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


146 


71 


8549 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


152 


53 


8550 


G02S32 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


80 


40 


8551 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


140 


74 


8552 


V00672 


Pan troglodytes 


reading frame protein 4 


342 


92 


8553 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


164 


63 


8554 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


148 


71 


8555 


AB046100 


Macaca 
fascicularis 


unnamed protein product 


144 


58 


8556 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


322 


73 


8557 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


140 


62 

i 


8558 


U30888 


Homo sapiens 


tRNA-Guanine Transglycosylase 


276 


86 


8559 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


114 


71 


8560 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


108 


72 


8561 


G03136 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7217. 


110 


57 


8562 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


140 


83 


8563 


W34499 


Homo sapiens 


Obesity receptor C protein. 


188 


92 


8564 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


124 


56 


8565 


L07946 


Volvox carteri 


histone HI -I 


108 


38 


8566 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


132 


77 


8567 


M13941 


Human 
herpesvirus 4 


nuclear antigen 1 


100 


43 


8568 


Y36I56 


Homo sapiens 


Human secreted protein #28. 


136 


67 


8569 


X70343 


Nicotiana 
sylvestris 


extensin 


92 


34 


8570 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


381 


87 


857J 


V00672 


Pan troglodytes 


reading frame protein 4 


146 


93 


8572 


D38116 


Pan paniscus 


ATPase subunit 6 


116 


96 


8573 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


132 


52 


8574 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


217 


100 


8575 


M58009 


Pan troglodytes 


cytochrome c oxidase subunit II 


271 


84 


8576 


D38112 


Homo sapiens 


ATPase subunit 6 


117 


96 


8577 


D38112 


Homo sapiens 


ATPase subunit 6 


180 


93 


8578 


D38112 


Homo sapiens 


NADH dehydrogenase subunit I 


179 


91 


8579 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


189 


88 
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SEQID 


Accession No. 


Specks 


Description 


Smi Ill- 


% 


NO: 








Waterman 


Identit 










SC °™ 


y 


"8580 




M15530 


— : 

riomo sapiens 


B-cell growth lactor 


_J09 


71 




Y 86''48 


Homo sapiens 


Human secreted protein HCHPF68, 


1_7 


77 






__ 


SEQ ID NO: 1 63 . 






8582 


AL359782 




probable similar to ring-h2 finger 


— — — 


47 






bruceT° S ° ma 


protein rhala. — 






8583 


Y 8624 8 


Homo sa ien 

omo sapiens 


Human secreted protein HL-HPFoo, 


— — 


— 

71 






__ . 


CPfi TFl NTfV 1 f.~\ 
ocy 1U V*\J . 






8584 


Y91577 


omo sapiens 


Human secreted protein sequence 


— 


— 








encoded by gene 2 SEQ ID NO:250. 






8585 


Y86248 


Homosa iens 

omo sapiens 


Human secreted protein HCHPF68, 


— — 


— 

80 








SEQ ID NO: 1 63 






8586 


M28016 


Homo sapiens 




"129" 


— 

~§1 


8587 


M58009 


Pan troglodytes 


cytochrome c oxidase subunit II 


153 




8588 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68 




— 








SEQ ID NO: 163. 






8589 


Y08769 


Rattus 




To9 


-— 








differentiation e e2 






8590 


L00016 


Homosa iens 


urf4 m ge " e 


—— 


— 


8591 


AF1501O5 


Homo S ^ PieJ1S 

omo sapiens 


W~- — r ft 7~- 


~vri 

— ■ 


— — — 
_ioo — 






Homo sapiens 


NFX1 Z1 " C L "° er ' 6 6m 


123 .. ... .. 


95 


8594 


R445 1 9 


Homo sapiens 


r~r TTT c 

Adult human alpha-globin V107S. 


222 


78 


8595 




Homo sapiens 


Fragment of human secreted protein 


128 


70 









encoded by gene 121. 






— — 


S75895 


— — : 

Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


151 


90 





-A.O/050 


Homo sapiens 


transketo lase 


123 


84 





t nnni a 


Homo sapiens 


" rf4 


130 


85 




Y36366 


Homo sapiens 


Fragment of human secreted protein 


109 


54 








encoded by gene 3. 






8600 


y- , 7 


— : 

omo sapiens 


Human secreted protein HCHPF68, 


— — 

159 




89 









CCA T r*l *M/*\« 1 






"8601 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


54 








6953. 








RO6800 


~Ho : 

■ omo sapiens 


Human histiocyte-secreted factor HSF. 





43 


8603 


AP000616 


ryza sa iva 


similar to RJNG-H2 finger protein 


120 


95 







— 


"PT-T A 1 i f A T7fnff<OT\ 

KxiAia (/vru /aooj ) 









S79980 


Bos taurus 


ribosomal protein L37 


109 


84 




UUjCU / 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


48 














8606 


D38L 12 


Hq -. 

Ho " 1 ° sa P* ens 


A i Pase subunit 6 




_r2r. 


100 


8607 


Y86248 


omo sapiens 


Human secreted protein HCHPF68, 


150 


42 








cprv TT"k "Mf"V 1 fi,"X 
OCy IU fNv_/.10j. 






"8608 


G00333 


— : 

.Homo sapiens 


Human secreted protein, SEQ ID NO: 


T7 

127 


58 








ll^rti u = 






8609 


AF229067 


— : 

Jriomo sapiens 


PADI-H protein 


— — 

12 


78 


8610 


AF 130089 


Homo sapiens 




125 


86 


-|>?2 




Homo sapiens 


chorionic somatomammotropin CS-5 


710 


97 




AT 70m 14 


Eeishmania 


extremely cysteine/valine rich protein 


104 


41 






*" a J Qr : 








8613 


Y86248 


Homo sapiens 


Human secreted protein HCHPFoS, 




1 12 


84 








SEQ ID NO: 163. 






~S6\4 


ArUUUOlO 


— . 

Oryza sativa 


similar to RING-H2 finger protein 


116 


70 








RHAIa (AF078683) 






8615 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


123 


91 








RHAla(AF078683) 






8616 


M28016 


Homo sapiens 


cytochrome b 


148 


93 


8617 


AF 118082 


Homo sapiens 


PRO 1902 


94 


66 


8618 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


182 


82 








SEQ ID NO: 163. 






8619 


X02585 


Xenopus laevis 


unidentified open reading frame 1 (166 


170 


72 
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NO: 


Accession No. 


Species 


Description 


Smith- 

Wiiterman 

Score 


% 

Identit 








aa) 






8620 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


125 


57 


8621 


AC004450 


Arabidopsis 
thaliana 


putative extensm 


140 


4S 


8622 


Y86248. 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


195 


79 


8623 


AF130089 


Homo sapiens 


PRO2550 


150 


78 


8624 


L00016 


Homo sapiens 


urf4 


280 


92 


8625 


AF 130051 


Homo sapiens 


PRO0898 


161 


76 


8626 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7563. 


123 


49 


8627 


L00016 


Homo sapiens 


urf4 


201 


95 


8628 


D38112 


Homo sapiens 


NADH dehydrogenase subunit4L 


169 


82 


8629 


API 19851 


Homo sapiens 


PRO 1722 


171 


65 


8630 


X55654 


Homo sapiens 


cytochrome C oxidase II subunit 


120 


92 


8631 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


108 


56 


8632 


X55654 


Homo sapiens 


cytochrome C oxidase II subunit 


107 


95 


8633 


G03043 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
7124. 


132 


86 


8634 


AJ224997 


Rattus 
norvegicus 


huntingtin 


73 


59 


8635 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


155 


80 


8636 


M19061 


Ateles geoffroyi 


delta-globin 


100 


40 


8637 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


102 


65 


8638 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


104 


50 


8639 


AF116715 


Homo sapiens 


PR02829 


158 


76 


8640 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


142 


62 


8641 


Y09010 


Mus musculus 


Ser/Thr kinase 


193 


95 


8642 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


227 


79 


8643 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


1 16 


95 


8644 


AP000616 ' 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


134 


88 


8645 


V46311_cdl 


Homo sapiens 


12-JAN-1998 Human secreted protein 
cDNA from clone D157. 


197 


100 


8646 


Y17379 


Homo sapiens 


NADH:ubiquinone oxidoreductase 51- 
kD subunit 


240 


97 


8647 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


142 


68 


8648 


AJ388520 


Canis familiaris 


Ribosomal protein 


134 


100 


8649 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


136 


56 


8650 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


1 16 


80 


8651 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


99 


36 


8652 


G00492 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4573. 


136 


64 


8653 


AF229067 


Homo sapiens 


PADI-H protein 


134 


69 


8654 


AF090942 


Homo sapiens 


PRO0657 


103 


42 


8655 


Y8624S 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


84 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

IOcntit 

y 






Homo sapiens 


MRE1 1 homologue hMrel 1 






8657 


AL160371 


Leishmania 


probable (hhv-6) ul 1 02, variant a 
DNA, complete virion genome 


113 


60 


8658 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


124 


83 





Ar 1 61356 


Homo sapiens 




- -- 

-±zt 






G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


121 


44 


8661 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


107 


37 


8662 


U 19098 


Lycopersicon 
chilense 


proline-rich protein 


72 


48 


8663 


AF 11 8082 


Homo sapiens 


PRO 1902 


95 


75 




AF130089 


Homo sapiens 


PRO2550 




79 


8665 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


115 


40 


8666 


X55683 


Lycopersicon 
esculentum 


extensin (class I) 


102 


32 


8667 


AK026841 


Homo sapiens 


unnamed protein product 


276 


100 


8668 


AL 1 09963 


Homo sapiens 


dJ 1 1 88J2 1 . 1 (FSH primary response 
(LRPR1, rat) homolog 1) 


597 


100 


8669 


Y01 1 81 


Homo sapiens 


Polypeptide fragment encoded by gene 
12. 


123 


58 


8670 


W48352 


Homo sapiens 


Human breast cancer related protein 
BCFLT1. 


103 


65 


8671 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


121 


61 


8672 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


82 


8673 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


21 1 


93 


8674 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


126 


61 


8675 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


114 


66 






Lycopersicon 
esculentum 


glycine-rich protein 


85 


38 


8677 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
bbQ ID NO:loi. 


188 


80 


8678 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


183 


75 


8679 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


71 


8680 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


96 


82 


8681 


U80761 j 


Homo sapiens 


CTG26 alternate open reading frame 


136 


88 


8682 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


137 


93 


8683 


AL451017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


107 


45 


8684 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

bfct^ 1JL> INU. 1 0 J . 


141 


90 


8685 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


127 


55 


8686 


W64486 


Homo sapiens 


Human DR3 protein. 


139 


100 


8687 


Ml 8093 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


134 


33 


8688 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


119 


35 


8689 


U36448 


Homo sapiens 


Ca2+-dependent activator protein for 


107 


91 



362 
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NO 3 ' 0 


Accession No. 


Species 


Description 


Smith- 


% . 










Waterman 


Identic 




















secretion 




... 


8690 


G02409 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


80 


43 








6490. 






8691 


Y36366 


Homo sapiens 


Fragment of human secreted protein 


147 


74 








encoded by gene 3. 






8692 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


126 


95 








RHAla(AF078683) 






8693 


AF 130089 


Homo sapiens 


PRO2550 


126 


35 


8694 


M58009 


Pan troglodytes 


cytochrome c oxidase subunit II 


233 


97 


8695 


Ml 1902 


Mus muscuJus 


proline-rich salivary protein 


98 




8696 


W34499 


Homo sapiens 


Obesity receptor C protein. 


263 


83 


8697 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


181 


73 








SEQIDNO:163. 






8698 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


125 


48 








SEQ ID NO: 163. 






8699 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


329 


88 








RHAla (AF078683) 






8700 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


1 1 1 


51 








SEQ ID NO: 163. 






8701 


Y8624S 


Homo sapiens 


Human secreted protein HCHPF68, 


165 


73 








SEQ ID NO: 163. 






8702 


Y36366 


Homo sapiens 


Fragment of human secreted protein 


133 


62 








encoded by gene 3. 






8703 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


184 


92 








SEQ ID NO: 163. 






8704 


D381 12 


Homo sapiens 


ATPase subunit 6 


117 


96 


8705 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


58 








6913. 






8706 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


79 








7795. 






8707 


AF 118086 


Homo sapiens 


PRO 1992 


167 


81 


8708 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


114 


51 








SEQ ID NO: 163. 






8709 


LI 0908 


Mus musculus 


Gcapl gene product 


179 


38 


8710 


S79410 


Mus musculus 


nuclear localization signal binding 


135 


69 








protein 






8711 


AF 13 0089 


Homo sapiens 


PRO2550 


399 


78 


8712 


X55654 


Homo sapiens 


cytochrome C oxidase II subunit 


107 


95 


8713 


AF1 18082 


Homo sapiens 


PRO 1902 


123 


58 


8714 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


173 


81 








6613. 






8715 


V35554_cdl 


Homo sapiens 


21 -NOV- 1996 Human tumour antigen 


258 


83 








protein (clone 3) encoding cDNA. 






8716 


AF130089 


Homo sapiens 


PRO2550 


143 


51 


8717 


AL359782 


Trypanosoma 


probable similar to ring-h.2 finger 


92 


52 






brucei 


protein rhala. 






8718 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


147 


71 








gene No. 107. 






8719 


AF153208 


Homo sapiens 


GC-rich sequence DNA -binding factor 


115 


70 








candidate 






8720 


AF090931 


Homo sapiens 


PRO0483 


145 


"87 


8721 


X92485 


Plasmodium 


pval 


125 


79 






vivax 








8722 


U72543 


Sus scrofa 


ubiquitin-like/S30 ribosomal fusion 


201 


97 








protein 






8723 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


223 


85 








6908. 






8724 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


121 


60 








clone HPMBQ32. 







363 



ISDOCID: <WO_ 



_0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 4 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Idenrif 

y 


8725 


AF130051 


Homo sapiens 


PRO0898 


102 


72 




G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 










Homo sapiens 


nun vd.ii an i n-j o i/*+o i/ot jut iho 
18F21L 167S 171D 175T 179T. 






8728 


00407^ 


Homo sapiens 


8153. 


92 


72 


8729 


X53375 


HeJianthus 
annuus 


anther-specific protein SF 1 8 


114 


39 


87^0 


S74221 


Homo sapiens 


HLA class II 


159 


42 


8731 


AC003058 


Arabidopsis 


unknown protein 


166 


89 


8732 


AB027508 


Homo sapiens 


hVPSll 


194 


50 





U2897 1 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) ■ 




■ 


^jt 5 




Homo sapiens 


rectachrome 1 


— — 


_Zz 




AL 132841 


Caenorhabditis 
elegans 




"723 


60 






Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW 18. 






8737 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

443 5 : — -— : — : 


79 


56 


~8738 


U 9 ~ 565 


— — 

Homo sapiens 


putative pi 50 


~~ 190 


~T5 


8739 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


117 


57 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 










Homo sapiens 


Human secreted protein encoded by 
gene 75. 






8742 


M58009 










8743 


Ml 7887 


Homo sapiens 


acidic ribosomal phosphoprotein (P2) 


106 


74 




ATT ^Q7C> 
A.L.J Jy IOC 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rha la. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 








M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


94 


40 






Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


173 


76 




G0^53^ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 




67 






Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


272 


57 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 




71 • 


8752 


AL451017 


Neurospora 


related to U 1 SMALL NUCLEAR 
KIBUN UCLbOrKO 1 blN L. 


123 


37 


8753 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


180 


55 


8754 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


105 


44 


8755 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


103 


78 


8756 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


284 


87 


8757 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


190 


89 



364 



WO 01/64835 PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 




Description 


Smith- 
Waterman 
Score 


y 


8758 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


103 


40 


8759 


G03797 


Homo sapiens 


Human secreted protein, SEQ JD NO: 
7878. 


120 


61 


8760 


AF132984 


Homo sapiens 


nuclear pore complex interacting 
protein NPIP 


196 


94 


8761 


AF130079 


Homo sapiens 


PR02852 


125 


65 


8762 


AF155232 


Pisum sativum 


extensin 


177 


42 


8763 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


128 


95 


8764 


X75438 


Sus scrofa 


homologue of proJine/arginine rich 
antibacterial peptides 


96 


38 


8765 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


157 


96 


8766 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


137 


75 


8767 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


207 


95 


8768 


AL 16 1543 


Arabidopsis 
thaliana 


extensin like protein 


100 


38 


8769 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


110 


47 


8770 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


178 


78 


8771 


D83407 


Homo sapiens 


a thyroid hormone responsive gene in 
human skin fibroblasts 


133 


100 


8772 


V00662 


Homo sapiens 


URF A6L (NADH dehydrogenase 
subunit) 


172 


73 


8773 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


107 


66 


8774 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


117 


100 


8775 


AF090931 


Homo sapiens 


PRO0483 


175 


88 


8776 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


125 


58 


8777 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


126 


73 


8778 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


154 


70 


8779 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 1 02, variant a dna, 
complete virion genome. 


149 


50 


8780 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


170 


91 


8781 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


103 


65 


8782 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


184 


70 


8783 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


169 


97 


8784 


U55376 


Caenorhabditis 
elegans 


F16H1 1.2 gene product 


141 


90 


8785 


U55376 


Caenorhabditis 
elegans 


F16H11.2 gene product 


150 


100 


8786 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


127 


96 


8787 


AF266479 


Homo sapiens 


rectachrome 1 


148 


61 


8788 


U23455 


Caenorhabditis 
elegans 


similar to D. melanogaster homeotic 
protein BarH2 (PIR:A4 1726) 


106 


95 


8789 


AL132841 


Caenorhabditis 
elegans | 


Y15E3A.3 


162 


96 


8790 


AC006693 


Caenorhabditis \ Hypothetical protein W02H5.e 


180 


94 



365 



VSDOCID: <WO 0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID ; 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Identit 










Scorc"" 3 " 








elegans 








8791 


U28971 


Caenorhabditis 


similar to RD tandem repeat region of 


159 


80 








RD protein (nuclear rna-binding 












protein) 






8792 


AF150105 


rioino sapiens 


small zinc finger-like protein 


505 


98 


8793 


AL390935 


Leishmania 


probable pro0195 


85 


31 














8794 


Y 19767 


i sa iens 


SEQ ID NO 485 from W09922243. 


115 


55 





AF1 18082 


H mosa iens 

omo sapiens 


PRO) 902 


123 


75 






omo sapiens 




438 


82 


— — 


AF1 1 8086 


Homo sapiens 


PRO 1 992 


156 


81 





Y53037 


Homo sapiens 


Human secreted protein clone 


196 


100 








dx2 19 13 protein sequence SEQ 10 












NO:8o7 






8799 


X92485 


Plasmodium 


pval 


132 


63 






vivax 








-ggQQ 


G02532 


H m sa iens 

omo sapiens 


Human secreted protein, SEQ ID NO; 


170 


65 








6613. 






8801 


G00397 


Homo 

omo sapiens 


Human secreted protein, SEQ ID NO: 


123 


58 








4478. 






8802 


AF1 16909 


Homosa iens 


unknown 


1 16 


63 


8803 


X92485 


-p| jj£ 

mo iu 


pva 


127 


82 














8804 


G03438 


Homosa iens 

omo sapiens 


Human secreted protein, SEQ ID NO: 


140 


67 














8805 


295 i i 4 


Homo sa iens 

omo sapiens 


bK212A2.2 (apolipoprotein L, 2) 


294 


100 


-||^ 


AF143369 


usmuscu us 


msgl related protein 2 — ■ 


348 


83 


8807 




Homo sapiens 


Human secreted protein, StQ ID NO: 


1 10 


76 

















AP000616 


— -. 

ryza sa lva 


srrnUar to RING H? finger protein 


129 


91 








RHAIa (AF078683) 






8809 


AF230279 


~Cae — rhabditi 

aenor a i is 


SWI3-like protein; PSA-1 


126 


37 














"ssTo — 


D38U2 


Homo sa iens 


NADH dehydrogenase subunit 4L 


168 


82 


88 1 1 


Y86248 


Homo salens 


Human secreted protein HCHPF6S, 


122 


54 








SEQ ID NO: 163. 






8812 


D381 12 


Homo sapiens 


ATPase subunit 6 


207 


100 


88 13 


W99376 


Homo sapiens 


Human fibroblast growth factor 2 24 


93 


41 








kD isoform N-terminus. 






gg 14 


"G03043 


Homo sa iens 

omo s pien 


Human secreted protein, SEQ ID NO: 


157 


57 








7124. 






8815 


X54978 


Bos taurus 


17,000 dalton myosin light chain 


133 


96 


8816 


M58459 


tiomo salens 




260 


100 


|||^ 




»x° m0 ^ apie " S 


NADH dehydrogenase subunit 1 


256 


90 







ivlycopjasma 


VsaDl 


1j2 


31 






Homo'sa iens 








g8 


2 


omo sapiens 


Human secreted rotein SEQ ID NO- — 


140 


69 








7253 r ' 








" 


L00016 


Homosa iens 

omo sapiens 


llrff" : 


205 


90 




X55683 


Lycopersicon 


'■ — 7~ i t\ 

extensin (class 1) 


111 


4 






esculentum 








8822 


AL1 60493 




probable (hhv-6) u 1 1 02, variant a 


94 


43 






major 


DNA, complete virion genome 






8823 


X66285 


Mus musculus 


HC1 ORF 


94 


53 


8824 


AF061340 


Artibeus 


Fl ATPase subunit 6 


97 


90 






jamaicensis 








8825 


Ml 8093 


Phaseolus 


hydroxyproline-rich glycoprotein 


123 


36 






vulgaris 








8826 


AF1 19855 


Homo sapiens 


PRO 1847 


113 


52 



366 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 




Description 






NO: 








Waterman 
Score 


Identic 

y 


8827 


M28016 


Homo sapiens 


cytochrome b 


140 


93 


8828 


YS6248 


Homo sapiens 


Human secreted protein HCHPF68, 


142 


56 








SEQ ID NO: 163. 






8829 


AK024455 


Homo sapiens 


FLJ00047 protein 


145 


59 


8830 


D381 12 


Homo sap/ens 


ATPase subunit6 


194 


95 


8831 


D381 12 


Homo sapiens 


ATPase subunit 6 


184 


90 


8832 


AP000616 


Oryza sativa 


similar to RING-H2 Finger protein 


134 


60 








RHAla(AF0786S3) 






8833 


AF1 30079 


Homo sapiens 


PR02852 


115 


88 


8834 


AB003784 


Drosophila 


Histone H3 


135 


100 






simulans 








8835 


Y91577 


Homo sapiens 


Human secreted protein sequence 


173 


S2 








encoded by gene 2 SEQ ID NO:250. 






8836 


X70343 


Nicotiana 


extensin 


112 


37 






sylvestris 








8837 


L17318 


Rattus 


proline-rich proteoglycan 


142 


38 






norvegicus 








8838 


Y36366 


Homo sapiens 


Fragment of human secreted protein 


148 


74 








encoded by gene 3. 






8839 


W34625 


Homo sapiens 


Human C3 protein mutant FT-3. 


277 


100 


8840 


AF1 16638 


Homo sapiens 


PR01546 


78 


41 


8841 


W48351 


Homo sapiens 


Human breast cancer related protein 


138 


61 








BCRB2. 






8842 


AF 130087 


Homo sapiens 


PR024 1 1 


131 


67 


8843 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


72 








5327. 






8844 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


60 








4433. 






8845 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


66 








7795. 






8846 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


68 








7795. 






8847 


Y0267J 


Homo sapiens 


Human secreted protein encoded by 


146 


70 








gene 22 clone HMSJW1 8. 






8848 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


131 


68 








SEQ ID NO: 163. 






8849 


Y91577 


Homo sapiens 


Human secreted protein sequence 


273 


79 








encoded by gene 2 SEQ ID NO:250. 






8850 


K02I20 


Bovine leukemia 


gag 


201 


54 






virus 








8851 


AF229067 


Homo sapiens 


PADI-H protein 


162 


57 


8852 


AF090931 


Homo sapiens 


PRO0483 


125 


82 


8853 


R14584 


Homo sapiens 


TGF beta 1 binding protein encoded by 


189 


100 








clone BPA 13. 






8854 


U52197 


Mus musculus 


poly(A) polymerase III 


183 


85 


8855 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


193 


72 








6908. 






8856 


G00549 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


66 








4630. 






8857 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


93 


64 








encoded by gene 17. 






8858 


Q55625_cdl 


Homo sapiens 


22-JUN-1992 Human beta globin 5- 


412 


98 








UTR-CDS-3'-UTR. 






8859 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


152 


67 








gene 22 clone HMSJW 18. 






8860 


G02828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


112 


72 








6909. 






8861 


AF 13 0079 


Homo sapiens 


PR02852 


160 


64 


8862 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


61 



367 



iSDOCID: <WO 01 64835A2_I_> 



WO 01/64835 



PCT/US01/O4927 



SEQ1D 

NO. '' 


Accession No. 


Species 


Description 


Smith- 
Waterman 

■ Sc - r ' 


% 

Idcntit 

_y 








~4488 






"8863 


~X647d7 


Homo sapiens 


BBC " 


216 


Too — 


8864 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


127 


65 


8865 


G00092 


Homo sa iens 

omo sapiens 


Human secreted protein, SEQ ID NO: 
4173. 


210 


100 


8866 


AL137351 


Homo sapiens 


hypothetical protein 


327 


100 


8867 


AF090942 


Homo sapiens 


PRO0657 


105 


38 


8868 


AK023050 






137 


60 


8869 ' 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


101 


65 


8870 


AF2 16751 


Homo sapiens 


CDA14 


282 


96 




AL i jjZO^ 


Homo sapiens 


QjJlyUAZ.. I (CL»Cj-!ike protein J 






8872 


V32779_cdl 


Homo sapiens 


1 1 -MAR- 1997 Novel human FO ATP 
synthase subunit encoding DNA 
sequence. 


267 


81 


8873 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


— 


.. . 

58 


8874 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


89 


43 


8875 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


63 


8876 


AK001S73 


Homo sapiens 


unnamed protein product 


373 


100 




X52 138 


Homo sapiens 


L7a protein 






8878 


X52138 


Homo sapiens 


L7a protein 


273 


87 


8879 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


147 


57 


8880 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


128 


57 


88S1 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


71 


8882. 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


67 


8883 


AF130089 


Homo sapiens 


PRO2550 


134 


79 


8884 


U36898 


Rattus 
norvegicus 


pheromone receptor VN6 


14j 


53 


8885 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


143 


64 


8886 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


138 


92 


8887 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


72 


8888 


R4S296 


Homo sapiens 


Human PGF-2/NT-3. 


118 


66 




UIO YjOJ. 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 










Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 






8891 


G03714 


Homo sapiens 


Human secreted protein, SEQ iD NO: 

"^95. — ; — — — 


121 


77 


— 

8892 


. —— -— 

X53375 


— r 

Hehanthus 

annuus 


anther-specific protein SF18 


o 

109 




64 


~8893 


r\r \jyj\JO 1 O 


Oryza sativa 


. ; 

similar to RXNG~xi2 linger protein 
RHAla(AF078683) 


~Ti9 


— 


8894 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


136 


68 


8895 


AF090894 


Homo sapiens 


PRO0113 


114 


60 


8896 


AF161356 


Homo sapiens 


HSPC093 


148 


51 


8897 


AF1 16638 


Homo sapiens 


PRO 1546 


91 


45 


8898 


U22231 


Felis catus 


ribosomal protein S3a 


201 


ioo 



368 



WO 01/64835 PCT/USO 1/04927 



NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Iclentit 


8899 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID MO: 
6908. 


114 


75 


8900 


AF090894 


Homo sapiens 


PRO0113 


114 


61 


8901 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


63 


8902 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


126 


95 


8903 


R90288 


Homo sapiens 


Modified pigment epithelium-derived 
factor (rPEDF). 


258 


88 


8904 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


155 


86 


8905 


AP000616 


Oryza sativa 


similar to RTNG-H2 finger protein 
RHAla(AF078683) 


118 


66 


8906 


X70343 


Nicotiana 
sylvestris 


extensin 


122 


37 


8907 


G02386 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6467. 


162 


70 


8908 


AF 130089 


Homo sapiens 


PRO2550 


125 


85 


8909 


AF220264 


Homo sapiens 


MOST-1 


113 


47 


8910 


Y25769 


Homo sapiens 


Human secreted protein encoded from 
gene 59. 


119 


77 


8911 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


77 


52 


8912 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


162 


83 


8913 


Y27S6S 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


104 


77 


8914 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


138 


64 


8915 


AF090944 


Homo sapiens 


PRO0663 


73 


58 


8916 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


151 


76 


8917 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


84 


47 


8918 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


140 


64 


8919 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


254 


82 


8920 


AJ245416 


Homo sapiens 


G7b protein 


360 


86 


8921 


D90228 


Homo sapiens 


mitochondrial acetoacetyl-CoA thiolase 
precursor 


190 


100 


8922 


AF229067 


Homo sapiens 


PADI-H protein 


157 


65 


8923 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


132 


65 


8924 


Y76198 


Homo sapiens 


Human secreted protein encoded by 
gene 75. 


62 


68 


S925 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


116 


63 


8926 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


140 


65 


8927 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


59 


8928 


AF287892 


Homo sapiens 


sialic acid binding immunoglobulin- 
like lectin 8 long splice variant 


994 


76 


8929 


AF030131 


Mus musculus 


Plenty of SH3s; POSH 


181 


77 


8930 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


814 


98 


8931 


U13866 


unidentified 
cloning vector 


non-functional lacZ alpha peptide 


112 


59 



369 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Iden tit 


8932 


U67540 


Methanococcus 
jannaschii 


conserved hypothetical protein 


98 


56 


8933 


Z74472 


Caenorhabditis 
elegans 


predicted using Genefinder-contains 
similarity to Pfam domain: PF0I391 
(Collagen triple helix repeat (20 
copies)), Score=84.3, E-value=8.1e-22, 
N=2; PF01484 (Nematode cuticle 
collagen N-terminal domain), 
Score=34.2, E-value=9.9e-07, 
N= 1 -cDNA EST yk 1 OOd 1 0. 3 comes 
from this gene; cDNA EST 
EMBL:D65979 comes from this 
gene-cDNA EST ykl00d4.3 comes 
from this gene; cDNA EST ykl23g7.3 
comes from this gene-cDNA EST 
yk58e6.3 comes from this gene; cDNA 
EST yk67a5.3 comes from this 
gene-cDNA EST EMBL:T01590 
comes from this gene; cDNA EST 
yk60cl 1.3 comes from this 
gene-cDNA EST yk89cl0.3 comes 
from this gene; cDNA EST ykS5g9.3 
comes from this gene-cDNA EST 
yk83e2.5 comes from this gene; cDNA 
EST yk75f2.3 comes from this 
gene-cDNA EST CEESK90F comes 
from this gene; cDNA EST yk79f5.5 
comes from this gene-cDNA EST 
yk64bl2.3 conies from this gene; 
cDNA EST yk64h9.3 comes from this 
gene-cDNA EST yk74dl.3 comes 
from this gene; cDNA EST yk75e4.3 
comes from this gene-cDNA EST 
yk83e2.3. comes from this gene; cDNA 
EST yk91cl2.3 comes from this 
gene-cDNA EST yk98f2.3 comes from 
this gene; cDNA EST yk60h4.5 comes 
from this gene-cDNA EST yk60cll.5 
comes from this gene; cDNA EST 
yk64h9.5 comes from this gene-cDNA 
EST yk64a9.5 comes from this gene; 
cDNA EST yk64b!2.5 comes from this 
gene-cDNA EST yk67a5.5 comes from 
this gene; cDNA EST yk74dl.5 comes 
from this gene-cDNA EST yk75e4.5 
comes from this gene; cDNA EST 
yk75f2.5 comes from this gene-cDNA 
EST yk79e6.5 comes from this gene; 
cDNA EST yk80d2.5 comes from this 
gene-cDNA EST yk85g9.5 comes 
from this gene; cDNA EST yk89cl0.5 
comes from this gene-cDNA EST 
yk89c4.5 comes from this gene; cDNA 
EST yk90el 1.5 comes from this 
gene-cDNA EST yk91cl2.5 comes 
from this gene; cDNA EST yk96e2.5 
comes from this gene-cDNA EST 
yk98c2.5 comes from this gene; cDNA 
EST yk98f2.5 comes from this 
gene-cDNA EST yk98hll .5 comes 


101 


38 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Sinith- 
Watermnn 


% 

Itlentit 

y 








from this gene; cDNA EST ykl07e7.5 
comes from this gene-cDNA EST 
yk313g9.3 comes from this gene; 
cDNA EST yk3 13g9.5 comes from this 
gene-cDNA EST yk312gl.3 comes 
from this gene; cDNA EST yk312g].5 
comes from this gene-cDNA EST 
yk293c4.3 comes from this gene; 
cDNA EST yk293c4.5 comes from this 
gene 






8934 


Y8624S 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


127 


63 


8935 


Y86248 


Homo sapiens 


Human secreted protein HCHPF6S, 
SEQIDNO:I63. 


125 


48 


8936 


X03741 


Homo sapiens 


myosin heavy chain (635 AA) 


1343 


99 


8937 


M29295 


Rattus 
norvegicus 


small nuclear ribonucJeoparticle- 
associated protein 


106 


52 


8938 


D00097 


Homo sapiens 


serum amyloid P component 


188 


100 


8939 


W34499 


Homo sapiens 


Obesity receptor C protein. 


255 


83 


8940 


L00016 


Homo sapiens 


urf4 


293 


83 


8941 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


164 


94 


8942 


AB007148 


Homo sapiens 


ribosomal protein S3a 


153 


73 


8943 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO.T63. 


213 


95 


8944 


M10119 


Homo sapiens 


ferritin light subunit 


129 


100 


8945 


M74718 


Homo sapiens 


SEF2-1A protein 


118 


84 


8946 


AL355178 


Homo sapiens 


dJ947L8.1.6 (novel CUB and Sushi 
(SCR repeat) domain protein) 


366 


51 


8947 


Y56033 


Homo sapiens 


PoJyproline-rich domain from dynamin. 


70 


51 


8948 


X06547 


Homo sapiens 


glutathione S-transferase (GST-Pi) (AA 
1-210) 


172 


97 


8949 


AF119851 


Homo sapiens 


PRO 1722 


154 


78 


8950 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


123 


88 


8951 


X54802 


Homo sapiens 


cytochrome-c oxidase subunit IV 


116 


95 


8952 


AJ011580 


bacteriophage 
PS34 


gp23 


338 


98 


8953 


Y59441 


Homo sapiens 


Human deltaS fragment #5. 


271 


96 


8954 


J02459 


bacteriophage 
lambda 


B (capsid component;533) 


807 


99 


8955 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 ' 


172 


94 


S956 


W34499 


Homo sapiens 


Obesity receptor C protein. 


166 


89 


8957 


X92485 


Plasmodium 
vivax 


pval 


135 


71 


8958 


AB010340 


Mus musculus 


mszf81 


122 


60 


8959 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


103 


51 


8960 


X14898 


Cricetulus sp. 


ORF (AA 1-286) (1 is 2nd base in 
codon) 


108 


54 


8961 


AF090931 


Homo sapiens 


PRO0483 


119 


82 


8962 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWI S. 


130 


72 


8963 


AF 170724 


Homo sapiens 


cell cycle checkpoint protein CHFR 


1364 


99 


8964 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


151 


43 


8965 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


250 


74 
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SEQ ID 

NO: 


Accession No. 




Description 


Smith- 
Waterman 


% 

ldentit 


8966 


Y87233 


H mosa iens 

omo sapiens 


Human signal peptide containing 

protein HSPP-10 SEQ ID NO. 10. 


445 


"TOO 


8967 


AF 127085 


Mus musculus 


semaphorin cytoplasmic domain- 
associated protein 3B 


1 12 


80 


8968 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO. 163. 


109 


46 


8969 


AF090894 


Homo sapiens 


PRO0113 


153 


61 


8970 


Y19730 


Homo sapiens 


SEQ ID NO 448 from W09922243. 


15S 


49 


8971 


U38544 


Mus musculus 


alpha- 1 type 1 collagen 


102 


52 


8972 


X92485 


v!vax° diUm 


pval 


1 13 


50 


8973 


AF241228 


Xe o uslaevis — 
enopus aevis — 


Friend of GATA 


143 


"Too" — 





AF090930 




PRO0478 




-— 




AF238374 


Homo sapiens 


receptor 3; FGFR3 


"Tf5 


~50 






Homo sapiens 


Human secreted protein 3 SEQ ID NO! 
6613. 






8977 


AB001838 


Homo sapiens 


recoverin 


179 


80 




ACT 1 RfW} 


Homo sapiens 


PRO 1 QH9 









A PI 1 ^7T ^ 
1 lO/lJ 


Homo sapiens 


I y**\J J, OAS 


~Hl2 




8980 


AF090942 


Homo sapiens 


PRO0657 


132 


61 




G03714 


Homo sapiens 


Human secreted protein, SEQ ID NOi 
7795. 










Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear rna-binding 
protein) 






8983 


A00127 


Homo sapiens 


melanoma associated antigen P97 


366 


94 




G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 








Y59778 


Homo sapiens 


Human normal ovarian tissue derived 
protein 55. 










Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 






8987 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


50 


8988 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


80 


51 


~8989 


Y24025 


— : 

Homo sapiens 


Amino acid sequence of" the human 
MMSC I protein. 


-f6~(j 


Too" — 


8990 


G02538 


Homo sapiens 


Human secreted protein SEQ ID NO' 
6619. 


1j0 


67 


~~899~j 


D3gl 12 


Homo sapiens 




165 


~82 


8992 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 


111 


62 


8993 


M28016 


Homo sapiens 


cytochrome b 


130 


100 




Y 86248 




Human secreted protein HCHPF68 
SEQ ID NO: 163. 










Homo sapiens 


NADH dehydrogenase subunit 4L. 






8996 


Y 16430 


Mus musculus 


ribosomal protein L35a 


101 


65 


8997 


Y45318 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 18. 


96 


76 


8998 


AL049659 


Arabidopsis 
thaliana 


putative protein 


126 




8999 


AE003500 


Drosophila 
melanogaster 


CGI 23 79 gene product 


285 


67 


9000 


R37991 


Homo sapiens 


Sequence of a new cytokine which 
inhibites induction by gammainterferon 
of expression of Class II 
histocompatibility antigens. 


124 


48 



372 
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NO: 






Description 


Waterman 
Score 


[dentil 

y 


9001 


AF090930 


Homo sapiens 


PRO0478 


193 


89 


9002 


AP000381 


Arabidopsis 
thaliana 


gb|AAF57656.1~gene_id:K17E12.13~s 
imilar to unknown protein 


193 


29 


9003 


M17885 


Homo sapiens 


acidic ribosomal phosphoprotein (P0) 


364 


66 


9004 


D63424 


Homo sapiens 


glycogen synthase kinase 3alpha 


126 


81 


9005 


AP000616 


Oryza saliva 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


1 13 


75 


9006 


AL1 17637 


Homo sapiens 


hypothetical protein 


257 


92 


9007 


AE001381 


Plasmodium 
falciparum 


hypothetical protein 


141 


26 


9008 


W34499 


Homo sapiens 


Obesity receptor C protein. 


152 


86 


9009 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


199 


80 


9010 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID MO: 
7645. 


202 


94 


90U 


AF090931 


Homo sapiens 


PRO0483 


135 


68 


9012 


M11717 


Homo sapiens 


heat shock protein 


696 


100 


9013 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


123 


75 


9014 


R59842 


Homo sapiens 


ApoE4Ll protease. 


86 


85 


9015 


AL 160371 


Leishmania 
major 


probable (hhv-6) u ] 1 02, variant a 
DNA, complete virion genome 


96 


65 


9016 


AF229067 


Homo sapiens 


PADI-H protein 


119 


54 


9017 


AF090944 


Homo sapiens 


PRO0663 


138 


54 


9018 


AL 160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


103 


50 


9019 


X92485 


Plasmodium 
vivax 


pval 


101 


73 


9020 


G 02 532 


Homo sapiens 


Human secreted protein, SEQ JD NO: 
6613. 


82 


72 


9021 


AF073519 


Homo sapiens' 


small EDRK-rich factor 1, long isofonm 


116 


66 


9022 


AF229067 


Homo sapiens 


PADI-H protein 


127 


45 


9023 


M33112 


Homo sapiens 


amyloid-beta protein 


105 


67 


9024 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


79 


68 


9025 


AF161361 


Homo sapiens 


HSPC098 


130 


55 


9026 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


169 


72 


9027 


AF130051 


Homo sapiens 


PRO0898 


338 


77 


9028 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


132 


49 


9029 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


117 


58 


9030 


AC002310 


Homo sapiens 


Unknown gene product 


155 


85 


9031 


X92485 


Plasmodium 


pval 


112 


42 


9032 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


130 


52 


9033 


G00487 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4568. 


123 


69 


9034 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


138 


61 


9035 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


111 


66 


9036 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


173 


59 


9037 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


100 


54 


9038 


AF220264 


Homo sapiens 


MOST-1 


96 


80 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 


9039 


AF130051 


Homo sapiens 


PRO0898 


1 19 


68 


9040 


AF1 16715 


Homo sapiens 


PR02829 ■ 


121 


63 


9041 


L76200 


Homo salens 


Huma^secretecrprotein SEQ ID NO" — 


193 


~fi — 


9042 


G03415 


omo sapiens 


7496. 


125 




9043 


AF152513 


Homo sapiens 


protocadherin gamma A6 short form 
protein 


334 


100 


9044 


AF1 30079 


Homo sapiens 


PR02852 


~T>5 


~33 


9045 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


128 


64 


9046 


AF090942 


Homo sapiens 


PRO0657 


126 


~65 


9047 


AF284223 


Homo sapiens 


terra-like protein 


228 


100 


9048 


AP000616 


Oryza sativa 


RHAla (AF078683) 


128 


68 


~9049 


S75g95 


Homo sapiens 




~\22 


~92 


9050 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


123 


75 


9051 


AL034344 


Homo sapiens 


dJ118B18.1 (forkhead box CI) 


323 


100 


9052 


Y 86248 




Human secreted protein HCHPF68 
SEQ ID NO: 163. 






~9~0"§4" 


A fl 

/\i —±.jy i u 














AF 130087 


Horn ^sapiens 


PR024 1 1 






9055 


W80400 


Homo sapiens 


A secreted protein encoded by clone 


858 


97 


9056 


X70343 


Nicotiana 
sylvestris 


extensin 


113 


33 


9057 


AP000616 


Oryza sativa 


similar to R1NG-H2 finger protein 

ISJrxA. I a ^ArU / oDoj ) 


128 


57 


9058 


R95913 


Homo sapiens 


Neural thread protein. 


142 


38 




AruUUO i O 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 




56 




G02538 




6619. P ° n ' SEQ1DN °' 










Nicotiana 
sylvestris 








9062 




sylvestris 


extensin 


129 




9063 


Y 86248 


Homo sapiens 


Human secreted protein HCHPF68 
SEQ ID NO: 163. 


~m 


— 


9064 


AF266164 


norvegi cus 


densm- 1 80 variant D 


144 


96 


9065 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


135 


68 


9066 


M28016 


Homo sapiens 


cytochrome b 


178 


87 




M^S016 










9068 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO' 163 


148 


60 


9069 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


111 


100 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 




72 


9071 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


147 


64 


9072 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


109 


69 


9073 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


171 


85 


9074 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


88 


88 


9075 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


137 


62 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 
y 








encoded by gene 17. 






9076 


L38941 


Homo sapiens 


ribosomal protein L34 


139 


88 


9077 


GO 1 984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


164 


86 


9078 


X55684 


Lycopersicon 
esculentum 


extensin (class T) 


72 


35 


9079 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


176 


82 


9080 


G02S72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


229 


59 


9081 


AFI 30089 


Homo sapiens 


PRO2550 


118 


71 


9082 


X92485 


Plasmodium 
vivax 


pval 


125 


71 


9083 


AF2 16650 


Homo sapiens 


MTAP 


154 


73 


9084 


Y 86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


124 


78 


9085 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


191 


73 


9086 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


138 


61 


9087 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


130 


75 


9088 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


144 


61 


9089 


AK024455 


Homo sapiens 


FLJ00047 protein 


145 


82 


9090 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


181 


60 


9091 


AF090895 


Homo sapiens 


PRO0I17 


87 


66 


9092 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


211 


86 


9093 


AFI 30079 


Homo sapiens 


PR02852 


157 


74 


9094 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


180 


82 


9095 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


97 


42 


9096 


X6555I 


Homo sapiens 


antigen of the monoclonal antibody Ki- 
67 


106 


65 


9097 


W34499 


Homo sapiens 


Obesity receptor C protein. 


231 


80 


9098 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


127 


63 


9099 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


129 


78 


9100 


AF229067 


Homo sapiens 


PADI-H protein 


162 


59 


9101 


G00673 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4754. 


157 


65 


9102 


Y60010 


Homo sapiens 


Human endometrium tumour EST 
encoded protein 70. 


69 


63 


9103 


R95913 


Homo sapiens 


Neural thread protein. 


124 


72 


9104 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


104 


68 


9105 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


156 


88 


9106 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


170 


57 


9107 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJ W 18. J 


163 


89 


9108 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


123 


41 


9109 


L41944 


Homo sapiens 


interferon receptor 


162 


66 



375 



WO 01/64835 PCT/USO 1/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: ' ' 








Waterrann 


Idcntit 

y 


91 ]0 


AK000496 


Homo sapiens 


unnamed protein product 


138 


78 


911 1 


G0^797 


Homo sapiens 




81 


62 








7878. 






91 12 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


52 








6619. 






91 13 


M26361 


Mtis musculus 


smaH EDRK^ch factor l* ton" isoform 


"T05 


-~ 


91 14 


AF073519 


Homo sapiens 




1 19 


56 


9115 


AF1 19900 


Homo sapiens 


PR02822 


140 


"is — 


91 16 


G00397 


Homo sapiens 


Human secreted protein SEQ ID NO' 


158 










4478. 






9117 


L00016 


-j-r = 

omo sapiens 




"Tfi 

—Li- 


— 


91 18 


ALUUOz /O 


Homo sapiens 




f 3 .r 


— 

100 


9119 


W4cO->l 


Homo sapiens 


Human breast cancer related protein 


131 


72 








BCRB2. 






■ ■ 

~ 


API 1 QC;C< 
nrj J 70JJ 


Homo sapiens 


T>1?/~11 QA1 


. 

. \ 5;> ... 


.. 

8] 




AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


105 


84 


017? 


Y87103 


Homo sapiens 


Human secreted protein sequence SEQ 


15 


78 








in MO - 1 AO 






9123 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


53 









44 ?8. : 








9124 


G00397 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


40 














9125 


AL3901 14 


Leishmania 


probable (hhv-6) ul 102, variant a 


96 


63 









DNA, complete virion genome 









A PI TftORQ 


Homo sapiens 


rKUxjJll 




-4P 




,. 78 ... 




V^4835 1 


Homo sapiens 


Human breast cancer related protein 


98 


56 




-— — — 




BCRB2. . — — -— 






v 




— : 

' Homo sapiens 


Human secreted protein, SEQ ID NO: 


— — 

107 


75 










Jy—— j : — 










Homo sapiens 


Human secreted protein, SEQ ID NO: 


— — 

131 


— — 

49 


— 


— — — — — — — 




^f 36 " : 








AK024455 


-77 ■■ 

Homo sapiens 


FLJ00047 proteui 


130 


78 


9131 


W88435 


Homo sapiens 


Disease associated protein kinase 


336 


100 









DAPK.-4. . 






9132 


U63332 


— : 

Homo sapiens 


super cysteine rich protein; SCRP 


144 


59 




Y94922 


Homo sapiens 


Human secreted protein clone pv6 1 


154 


90 








protein sequence SEQ ID NO:50. 






~9l34 


AF3 1 7425 


Homo sa iens 

omo sapiens 




— — 


— 


"orvs 




Momo sapiens 


— j-j — : 

Human secreted protein, SEQ ID NO: 


— — 1 

174 


— 

54 














Q _ . 

~ 


A PI 1 £71 < 
r\t I I D / 1 D 


Jtiomo sapiens 


PR028' 7 9 • 


—ryp 


— 




UUjoU / 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


2 


65 















mi — 


api ^nnRQ 


Homo sapiens 


rlvU/JjU 


— a 

140 




62 


9139 


AF161221 


Homo sapiens 


kallikrein-Iike protein 6 


204 


90 


9140 




Trypanosoma 


possible (hhv-6) ul 1 02, variant a dna, 


110 


48 






brucei 


complete virion genome. 






9141 


U27486 


Pseudorabies 


EP0 


84 


44 






virus 








9142 


Y91577 


Homo sapiens 


Human secreted protein sequence 


150 


96 








encoded by gene 2 SEQ ID NO:250. 






~9l43 


GQ2827 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 












6908. 






9144 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


163 


84 








SEQ ID NO: 163. 






9145 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


180 


77 








SEQID NO: 163. 






9146 


U52077 


Homo sapiens 


mariner transposase 


365 


78 


9147 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


143 


67 



376 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


5 « 


NO. 








Walcrman 


Identit 




















clone HPMBQ32. 






914S 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


100 


81 








RHAla(AF078683) 






9149 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 


118 


66 








derived from DNA35638. 






9150 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


105 


54 








SEQ ID NO: 163. 






9151 


AL3 59782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


101 


70 






brucei 


complete virion genome. 






9152 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


163 


66 








6613. 






9153 


Y 86248 


Homo sapiens 


Human secreted protein HCHPF68, 


160 


72 








SEQ ID NO:163. 






9154 


Y0267I 


Homo sapiens 


Human secreted protein encoded by 


106 


63 








gene 22 clone HMSJ Wl 8. 






9155 


Y02926 


Homo sapiens 


Fragment of human secreted protein 


95 


80 








encoded by gene 101. 






9156 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


67 








7870. 






9157 


G03924 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


265 


80 








8005. 






9158 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


72 








4457. 






9159 


AF130089 


Homo sapiens 


PRO2550 


180 


61 


9160 


AF116715 


Homo sapiens 


PR02829 


184 


82 


9161 


KOI 664 


Drosophila 


Bkm-like protein 


200 


69 






melanogaster 








9162 


D88146 


Homo sapiens 


UDP-galactose transporter 2 


101 


100 


9163 


AF 107406 


Homo sapiens 


GW128 


155 


60 


9164 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


51 








6983. 






9165 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


60 








7795. 






9166 


AF2 10651 


Homo sapiens 


NAG 18 


83 


65 


9167 


V00672 


Pan troglodytes 


reading frame protein 4 


119 


95 


9168 


Y01155 


Homo sapiens 


Secreted protein encoded by gene 4 


225 


74 








clone HrBCW32. 






9169 


AF161356 


Homo sapiens 


HSPC093 


100 


75 


9170 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


174 


66 








7795. 






9171 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


115 


86 








RHAla(AF078683) 






9172 


AF1 19855 


Homo sapiens 


PRO 1847 


178 


59 


9173 


AF130087 


Homo sapiens 


PR02411 


129 


48 


9174 


AF130079 


Homo sapiens 


PR02852 


149 


59 


9175 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


90 








■7214. 






9176 


AF090894 


Homo sapiens 


PRO0113 


118 


65 


9177 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 


105 


46 






brucei 


protein rha 1 a. 






9178 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


52 








7867. 






9179 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


161 


70 








7795. 






9180 


AF116715 


Homo sapiens 


PR02829 


127 


64 


9181 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


71 








5327. 






9182 


AF229067 


Homo sapiens 


PADI-H protein 


164 


64 


9183 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


80 



377 



WO 01/64835 



PCT/USOi/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

ldcntit 

y 








6613. 






9184 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla (AF078683) 


106 


64 


9185 

.. ... 


G02538 
- 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


132 


57 




G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


1 13 


65 




AF090931 


Homo sapiens 


PRO0483 


117 


68 


9188 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


136 


67 


9189 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


130 


47 


9190 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


269 


85 


9191 


Ml 8094 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


150 


40 


9192 


G00552 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4633. 


153 


73 


9193 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


62 


9194 


Ml 8093 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


103 


31 


9195 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


98 


47 


9196 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


107 


48 


9197 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


111 


68 


9198 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


164 


83 


9199 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


235 


85 


9200 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


1 17 


65 




X67640 


Homo sapiens 


HS24/P52 


222 


75 


9202 


X67640 


Homo sapiens 


HS24/P52 


231 


81 


9203 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


128 


66 


9204 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


107 


74 


9206 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


186 


66 


9207 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


136 


56 


9208 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


160 


67 


9209 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


131 


68 


9210 


AF1 16715 


Homo sapiens 


PR02829 


123 


81 


9211 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


135 


70 


9212 


AF1 18086 


Homo sapiens 


PRO 1992 


84 


57 


9213 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


115 


62 


9214 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


247 


92 


9215 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


74 


76 


9216 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953 


134 


74 



378 



WO 01/64835 PCT/US01/04927 



SEQID 


Accession No. 




Description 


Smith- 
Waterman 
Score 


% 

Identit 


9217 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


137 


53 


9218 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


130 


44 


9219 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


110 


68 


9220 


L41944 


Homo sapiens 


interferon receptor 


141 


50 


9221 


AF130079 


Homo sapiens 


PR02852 


170 


70 


9222 


G00381 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4462. 


100 


85 


9223 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


142 


56 


9224 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear rna-binding 
protein) 


104 


63 


9225 


U80761 


Homo sapiens 


CTG26 alternate open reading frame 


92 


84 


9226 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


157 


65 


9227 


AK027208 


Homo sapiens 


unnamed protein product 


138 


68 


9228 


B25722 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 9 SEQ ID NO: 111. 


125 


87 


9229 


G00381 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4462. 


127 


88 


9230 


AK025047 


Homo sapiens 


unnamed protein product 


179 


64 


9231 


AF090894 


Homo sapiens 


PRO01 13 


195 


57 


9232 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


176 


54 


9233 


AB008227 


Adiantum 
capiilus-veneris 


Extensin 


93 


34 


9234 


AF130089 


Homo sapiens 


PRO2550 


98 


73 


9235 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


135 


52 


9236 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


162 


62 


9237 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


126 


59 


9238 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


126 


52 


9239 


AE003499 


Drosophila 
melanogaster 


CG 12706 gene product 


166 


37 


9240 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


161 


64 


9241 . 


X92485 


Plasmodium 
vivax 


pval 


112 


61 


9242 


AF055985 


Onchocerca 
volvulus 


pyrrolidone-rich antigen 


69 


42 


9243 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


140 


80 


9244 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


187 


92 


9245 


M22332 


Homo sapiens 


unknown protein 


117 


40 


9246 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


109 


66 


9247 


AL 1361 73 


Homo sapiens 


dJ914B9.1 (novel protein (HSPC 162) 
similar to Rattus norvegicus 
bithoraxoid-like protein) 


305 


76 


9248 


AF1 16661 


Homo sapiens 


PRO 143 8 


141 


53 


9249 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


144 


76 



379 



WO 01/64835 



PCT/US01/04927 



SEQ JD 


Accession No. 




Description 


Smifh- 
Waterman 


% 

Idcntit 

y 


9250 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


245 


89 


925 1 


J02459 


bacteriophage 
lambda 


E (capsid componentj341) 


245 


92 


9252 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


128 


63 


9253 


GO 1 246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


124 


72 


9254 


AP000616 


Oryza sativa 


similar to RTNG-H2 finger protein 
RHAla (AF078683) 


110 


90 


9255 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No 1 07 


132 


65 


9256 


X52164 


Mus musculus 


Q300 protein (A A 1-77) 


106 


62 


9257 


AF2 17374 


planci haSter 


cytochrome oxidase subunit I 


259 


98 


9258 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


94 


64 


9259 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
471 8. 


107 


54 


9260 


AP000616 


Oryza sativa 


similsv to RJNG-H2 finger protein 
RHAla (AF078683) 


129 


66 


9261 


M28016 


Homo sapiens 


cytochrome b 


148 


93 


9262 


Y86248 




Humsn secreted protein HCHPF68 
SEQ ID NO: 163. 






9263 


R55749 


Homo sapiens 


affmtty IR^^rrn^^rece^rtor 311 


1 16 


8j 


9264 


K02403 


Homo sapiens 


complement component C4A 


9025 


99 


9265 


X92485 


vivax 10 ^ 1 "" 


pval 


161 


55 


9266 


AK024455 


Homo sapiens 


FLJ00047 protein 


98 


83 


9267 


G04091 




Human secreted protein, SEQ ID NO: 
8172. 






9268 


AF130089 


Trypanosoma 3 


PRO2550 






9269 


AL359782 


brucei 


protein rhala. 


"57- 





9270 


Y 19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


96 


81 


9271 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


132 




9272 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 






9273 


AF229067 


Homo sapiens 


PADI-H protein 






9274 


Y36203 


Homo sapiens 


Human secreted protein #75. 


154 


77 


9275 


X92485 


vivax 








9276 


G02872 




Human secreted protein SEQ ID NO' 
6953. 








G02987 




Human secreted protein, SEQ ID NO: 
7068. 










Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 






9279 


Y20772 


Homo sapiens 


Human neurofilament-M mutant 


1 12 


70 


9280 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


146 


59 


9281 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


103 


63 


9282 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


89 


60 


9283 


X92485 


Plasmodium 


pval 


324 


'79 



380 



WO 01/64835 PCT/US01/04927 



SEQ1D 

NO: 


Accession No. 




Description 


Smith- 
Waterman 


% 

Idcntit 






vivax 








9284 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


95 


75 


9285 


AF1 30087 


Homo sapiens 


PR024 1 1 


138 


78 


9286 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


140 


54 


9287 


AF004340 


Homo sapiens 


ATPase 6/8 


98 


100 


9288 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


102 


68 


9289 


J02459 


bacteriophage 
lambda 


J (tail component;223) 


457 


97 


9290 


AC0O3058 . 


Arabidopsis 
tha liana 


unknown protein 


177 


84 






elegans 


protein BarH2 (PIR: A4 1 726) 


107 


87 


9292 


D381I2 


Homo sapiens 


NADH dehydrogenase subunit 4L 


311 


91 




S794 1 0 




protein 


139 


61 




R28916 






117 


33 


9295 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRJB2. 


126 


77 


9296 


AF1 30089 


Homo sapiens 


PRO2550 


137 


52 




G02987 




Human secreted protein, SEQ ID NO: 
7068. 


93 


79 








13A y P P S 


239 


77 


9299 


AL3 59782 


bruceT OSOma 


possihle (hhv-6) ul 1 02, variant a dna, 
complete virion genome. 


134 


69 


9300 


AL3901 14 


Leishmania 

major 


extremely cysteine/valine rich protein 


105 


24 


9301 


D90282 


Homo sapiens 


carbamyl phosphate synthetase I (EC 
6.3.4.16) 


276 


96 


9302 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


231 


93 


9303 


Ml 5077 


Photinus pyralis 


Luciferase 


557 


100 




L00016 






303 


93 


9305 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


141 


69 


9306 


L46721 


Homo sapiens 


mucin 


204 


80 


9307 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


110 


84 


9308 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


218 


93 


9309 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
6613. 


78 


66 


9310 


AF 13 0075 


Homo sapiens 


PR02532 


92 


69 


931 1 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


97 


51 


9312 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


203 


79 


9313 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


133 


95 


9314 


AK024455 


Homo sapiens 


FLJ00047 protein 


128 


68 


9315 


D38112 


Homo sapiens 


ATPase subunit 6 


223 


81 


9316 


AL1 60493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


113 


85 


9317 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


117 


88 


9318 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


257 


72 



381 
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SEQID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 


9319 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


61 


9320 


AK001116 


Homo sapiens 


unnamed protein product 


96 


40 


932] 


G02535 




6616. 






9322 


AF037081 










9323 


G03800 




Hu^an ^ecreled protein 0 SEQ ID NO" — 
7881. 


j~~ 


-1^2 — 


9324 


AF1 53056 


H^mo^sapTens^ 3 


tegumental antigen 


60 




9325 






Human secreted protein, SJEQ ID NO; 
6908. 






9326 






Human secreted protein, SEQ ID NOi 
6836. 






9327 


AF1 1 8086 




PRO 1992 






9328 


G02360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


73 


9329 


AF 107406 


Homo sapiens 


GW128 


142 


56 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


156 


75 


93j 1 


X92485 


Plasmodium 


pval 


121 


56 


9332 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


179 


52 


9333 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


67 


9334 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

oJtit,; ID INU.lO-5. 


108 


46 


9335 


M22332 


Homo sapiens 


unknown protein 


158 


60 




G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


95 


46 


93j7 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


160 


76 


9338 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone riiNorLJ— y. 


150 


68 


9339 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


169 


59 


9340 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 







9341 


AK024455 


Homo sapiens 


FLJ00047 protein 






9342 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


191 


76 


9343 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


82 


9344 


AF032457 


Homo sapiens 


BimEL 


123 


100 


9345 


G00j57 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


46 


9346 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


75 


9347 


X53020 


Plasmodium 
falciparum 


erythrocyte membrane-associated 
antigen 


68 


75 


9348 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


120 


70 


9349 


AF130089 


Homo sapiens 


PRO2550 


116 


69 


9350 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


165 


72 


9351 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
447S. 


94 


82 


9352 


AL080242 


Homo sapiens 


bA554C12.1 (RBX1 or ROC 1 (ring- 
box or ring finger protein 1)) 1 


184 


60 



382 



WO 01/64835 
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SEQ ID 

NO: 


Accession No. 




Description 


Smith- 
Scor"" 13 " 


Identit 


9353 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


125 


64 


9354 


X71442 


Rattus 
norvegicus 


ORF 1; putative 


96 


40 


9355 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


98 


46 


9356 


AF090931 


Homo sapiens 


PRO0483 


148 


76 


9357 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


144 


56 


9358 


AC005175 


Homo sapiens 


TA2R HUMAN, BETA ISOFORM; 
TXA2-R; PROSTANOID TP 
RECEPTOR 


153 


63 


9359 


AF116715 


Homo sapiens 


PR02829 


125 


66 


9360 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


143 


72 


9361 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


178 


75 


9362 


Y14437 


Homo sapiens 


Human secreted protein encoded by 
gene 27 clone HSAWA27. 


90 


57 


9363 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7437. 


86 


76 


9364 


AF229067 


Homo sapiens 


PADI-H protein 


116 


51 


9365 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


248 


64 


9366 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


104 


76 


9367 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


95 


76 


9368 


G 0245 5 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


105 


74 


9369 


R86406 


Homo sapiens 


Human matrix metalloprotease 
MMPmla. 


108 


83 


9370 


AF181977 


Hepatitis GB 
virus C 


E2 protein 


95 


38 


9371 


AF090931 


Homo sapiens 


PRO0483 


151 


62 


9372 


AF090894 


Homo sapiens 


PRO0113 


133 


57 


9373 


AK002154 


Homo sapiens 


unnamed protein product 


226 


55 


9374 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1S. 


140 


71 


9375 


U77494 


Homo sapiens 


RANBP8 


166 


100 


9376 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


155 


52 


9377 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


126 


63 


9378 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


151 


7S 


9379 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


97 


68 


9380 


AL 160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


117 


62 


9381 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


104 


47 


9382 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


57 


9384 


AB047936 


Macaca 
fascicularis 


hypothetical protein 


95 


41 


9385 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


57 


64 


9386 


U04810 


Homo sapiens 


tasrin 


151 


100 


9387 


U79260 


Homo sapiens 


unknown 


97 


79 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 










Waterman 


Identit 














9388 


U18339 


Variola virus 


D4L 


103 


65 


9389 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


75 








7881. 






9390 


AL390935 


Leishmania 


possible dopap 


99 


88 






major 








9391 


AB015633 


Homo sapiens 


type 11 membrane protein 


108 


100 


9392 


AF1 13685 


Homo sapiens 


PRO0974 


98 


62 


9393 


AF064819 


Homo sapiens 


serine protease DESC1 


227 


69 


9394 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


219 


68 








7867. 






9395 


GO 1400 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


100 








5481. 






9396 


AK024455 


Homo sapiens 


FLJ 00047 protein 


153 


68 


9397 


AF033260 


porcine 


reverse transcriptase 


155 


5j 






endogenous type 












C retrovirus 








9398 


X12517 


Homo sapiens 


C protein (AA 1-159) 


147 


55 


9399 


AB029948 


Homo sapiens 


mitochondrial seryl-tRNA synthetase 


1203 


100 


9400 


Y87075 


Homo sapiens 


Human secreted protein sequence SEQ 


234 


93 








IDNO:114. ' 






9401 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


58 








6536. 






9402 


M15530 


Homo sapiens 


B-cell growth factor 


113 


74 


9403 


X56932 


Homo sapiens 


-23 kD highly basic protein 


1 14 


85 


9404 


ACO 18748 


Arabidopsis 


Contains similarity to PI 1 protein from 


75 


65 






thaliana 


Drosophila melanogaster gb|X59691 . 






9405 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


1 14 


~88 








SEQ ID NO: 163. 






9406 


AF220264 


Homo sapiens 


MOST- 1 


124 


70 


9407 


AF1 19851 




PRO 1 722 







9408 


X92485 


Plasmodium 


pval . 


J 

















9409 


X92485 


Plasmodium 


~pval : " 


165 


77 






vivax 


: 






9410 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


55 








4433. 






9411 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


64 








7795. 






9412 


AF 130089 


Homo sapiens 


PRO2550 


147 


55 


9413 


AF1 30089 


Homo sapiens 


PRO2550 


175 


65 


9414 


L14848 


Homo sapiens 


MHC class I-related protein 


225 


97 


9415 


R95913 


Homo sapiens 


Neural thread protein. 


107 


74 


9416 


AB001684 


Chlorella 


ORF49b 


68 


53 






vulgaris 








9417 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 1.1 


50 








6613. 






9418 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 


107 


73 








sequence. 






9419 


AL050285 


Homo sapiens 


hypothetical protein 


194 


100 


9420 


AF090944 


Homo sapiens 


PRO0663 


132 




9421 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


61 








6953. 






9422 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


70 








6613. 






9423 


M20030 


Homo sapiens 


small proline rich protein 


62 


41 


9424 


AF1 16718 


Homo sapiens 


PRO2900 


396 


100 


9425 


Y27854 


Homo sapiens 


Human secreted protein encoded by 


102 


35 








gene No. 101. 






9426 


X61047 


Hydra sp. 


mini-collagen . . .. .. . 


92 


45 



384 
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SEQID 

NO: 


Accession No. 


Species 


Description 


Sinith- 
YVatcrraan 

Score 


y 


9427 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


105 


71 


9428 


LI 0908 


Mus musculus 


Gcapl gene product 


107 


44 


9429 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


79 


87 


9430 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


123 


61 


9431 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJS1. 


111 


80 


9432 


X12580 


Medicago sativa 


put. nodulin (soybean N-75 homolog.) 


87 


33 


9433 


AL160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


106 


48 


9434 


AC003028 


Arabidopsis 
thaliana 


30S ribosomal protein S31 


79 


42 


9435 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


106 


75 


9436 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


207 


75 


9437 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


163 


85 


9438 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


84 


57 


9439 


AF1 18086 


Homo sapiens 


PRO 1992 


145 


75 


9440 


AF229067 


Homo sapiens 


PADI-H protein 


140 


64 


9441 


M36913 


Zea mays 


cell wall protein (put.); putative 


107 


38 


9442 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


161 


59 


9443 


AF229067 


Homo sapiens 


PADI-H protein 


147 


85 


9444 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


171 


79 


9445 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


155 


57 


9446 


X92485 


Plasmodium 
vivax 


pval 


97 


55 


9447 


AF1 19851 


Homo sapiens 


PRO 1722 


123 


55 


9448 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


130 


88 


9449 


AL391688 


Homo sapiens 


bA524D16A.2.1 (novel protein similar 
to mouse granuphilin-a) 


258 


97 


9450 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


130 


91 


9451 


X924S5 


Plasmodium 
vivax 


pval 


123 


88 


9452 


X83000 


Myocastor 
coypus 


lipase related protein 2 


168 


43 


9453 


M35603 


Mus musculus 


Hox^3.1 protein 


447 


78 


9454 


M35520 


Canis familiaris 


GTP-binding protein (rab5) 


271 


98 


9455 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


362 


86 


9456 


A13595 


synthetic 
construct 


immunosuppresive protein PP1 5 


111 


100 


9457 


X67788 


Rattus 
norvegicus 


ezrin, p81 


185 


97 


9458 


U28068 


Mus musculus 


neurogenic differentiation factor 


132 


89 


9459 


AF155140 


Homo sapiens 


gonadotropin-regulated testicular RNA 
helicase; GRTH 


556 


88 


9460 


AF155140 


Homo sapiens 


gonadotropin-regulated testicular RNA 
helicase; GRTH 


379 


94 


9461 


AF 169689 


Homo sapiens 


protocadherin alpha 10 alternate 
isoform 


208 


95 


9462 


U73193 


Homo sapiens 


inward rectifier potassium channel 


236 


96 



385 
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SEQ ID 

NO: 


J Accession No. 


Species 


Description 


Waterman 
Score 


Idcntit 

y 








Kir 1.2 






9463 


U83278 


Bos taurus 


neural specific protein CRMP-2 


157 


61 


9464 


W48352 


Homo sapiens 


Human breast cancer related protein 
BCFLT1. 


74 


57 


9465 


AL031673 


Homo sapiens 


dJ694B14.3 (PUTATIVE novel 
haloacid dehalogenase-like hydrolase 
family protein similar to (archaea) 
bacterial proteins) 


444 


100 


9466 


X67247 


Homo sapiens 


ribosomal protein S8 


519 


90 


9467 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


115 


51 


9468 


U80761 


Homo sapiens 


CTG26 alternate open reading frame 


95 


84 


9469 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


145 


71 


9470 


AF130056 


Homo sapiens 


PRO 13 67 


98 


76 


9471 


X71354 


Homo sapiens 


vesicular monoamine transporter 


102 


95 


9472 


U66372 


Bos taurus 


ribosomal protein S29 


120 


95 


9473 


D32002 


Homo sapiens 


nuclear cap binding protein 


583 


96 


9474 


W54282 


Homo sapiens 


Protein sequence of the di-alpha 
haemoglobin gene contained in pSSl . 


469 


91 


9475 


V49572_cdl 


Homo sapiens 


13-NOV-1996 Human stomach cancer 
clone HP10122 cDNA #1. 


562 


100 


9476 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 4L 


247 


88 


9477 


AB043820 


Homo sapiens 


platelet glycoprotein VI-2 


95 


47 


9478 


AB020236 


Homo sapiens 


ribosomal protein L27A 


437 


86 


9479 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


149 


52 


9480 


AF205633 


Homo sapiens 


ninjurin2 


364 


100 


9481 


D14886 


Homo sapiens 


TFIIA-37 


196 


100 


9482 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


166 


76 


9483 


AF 178842 


Homo sapiens 


Rhesus blood group-associated 
glycoprotein 


128 


75 


9484 


AF134895 


Homo sapiens 


glyoxylate reductase 


838 


95 


9485 


AC004832 


Homo sapiens 


similar to 45 kDa secretory protein ; 
similar to CAA10644.1 
(PID:g4164418) 


562 


98 


9486 


Z35761 


Homo sapiens 


TEL/ABL 


230 


71 


9487 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


149 


72 


9488 


AL03263 1 


Caenorhabditis 
elegans 


predicted using Genefinder 


159 


31 


9489 


AJ271079 


Oenothera elata 
subsp. hookeri 


Ycf2 protein 


113 


36 


9490 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


135 


40 


9491 


Y53005 


Homo sapiens 


Human secreted protein clone pm749_8 
protein sequence SEQ ID NO: 16. 


124 


61 


9492 


Y21091 


Homo sapiens 


Human p53 cellular tumour antigen 
mutant protein fragment 28. 


85 


41 


9493 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


153 


76 


9494 


AF1 19851 


Homo sapiens 


PRO 1722 


94 


54 


9495 


R04932 


Homo sapiens 


Interferon-gamma receptor segment 
from clone 39 responsiblefor binding 
the target 


142 


100 


9496 


U40952 


Caenorhabditis 
elegans 


C03B1.10 gene product 


103 


75 


9497 


M63838 


Homo sapiens | 


interferon-gamma induced protein 


128 


92 



386 
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SEQ ID 


Accession No. 


Species 


Description 






NO: 








Waterman 


Identit 










Score 


y 


9498 


AL451017 


Neurospora 


related to Ul SMALL NUCLEAR 


147 


44 






crassa 


RJBONUCLEO PROTEIN C 






9499 


AF108831 


Homo sapiens 


K:C1 cotransporter 3 


244 


77 


9500 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 


154 


68 








(hhem) gene preoduct. 






9501 


G0341 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


63 








7492. 






9502 


AE003499 


Drosophila 


CGI 2706 gene product 


180 


36 






melanogaster 








9503 


X02344 


Homo sapiens 


beta-tubulin 


363 


94 


9504 


D7S014 


Homo sapiens 


dihydropyrimidinase related protein-3 


189 


84 


9505 


U59446 


Brassica napus 


myrosinase-binding protein related 


1 16 


40 








protein 






9506 


AB010437 


Rattus rattus 


cadherin-8 


239 


50 


9507 


AF229067 


Homo sapiens 


PADI-H protein 


178 


74 i 


9508 


AJ224819 


Homo sapiens 


tumor suppressor 


291 


85 


9509 


AFO 17275 


Mus musculus 


growth factor independence- 1 B 


242 


40 


9510 


AF 177203 


Homo sapiens 


cerebral cell adhesion molecule 


388 


52 


9511 


AY007160 


Homo sapiens 


similar to Homo sapiens CGI-57 


1533 


99 








protein mRNA with GenBank 












Accession Number AF151815.1 






9512 


AL160371 


Leishmania 


probable (hhv-6) ul 102, variant a 


104 


66 






major 


DNA, complete virion genome 






9513 


X52164 


Mus musculus 


Q300 protein (AA 1-77) 


92 


59 


9514 


X80035 


Oryctolagus 


cysteine rich hair keratin associated 


96 


40 






cuniculus 


protein 






9515 


D86491 


Xenopus laevis 


Nfrl 


1777 


76 


9516 


Y70761 


Homo sapiens 


Human beta-amyloid peptide (BAP) 


168 


80 








binding protein, BBP3. 






9517 


AK024500 


Homo sapiens 


FLJ00109 protein 


1714 


100 


9518 


AF008220 


Bacillus subtilis 


YtaG 


104 


41 


9519 


AF099977 


Mus musculus 


schlafen4 


163 


35 


9520 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 


162 


72 








derived from DNA35638. 






9521 


AL035068 


Homo sapiens 


dJ116G19.1 (novel protein) 


102 


76 


9522 


B24527 


Homo sapiens 


Human secreted protein sequence 


729 


90 








encoded by gene 15 SEQ ID NO: 1 53. 






9523 


AF129756 


Homo sapiens 


MSH5 


182 


100 


9524 


AJ238706 


Drosophila 


monocarboxyiate transporter 1 


126 


32 






melanogaster 


homologue 






9525 


Y76198 


Homo sapiens 


Human secreted protein encoded by 


250 


97 








gene 75. 






9526 


D86604 


Mus musculus 


Bach2 


200 


92 


9527 


AB032918 


Hylobates 


dopamine receptor D4 


91 


36 






moloch 








9528 


AB007830 


Homo sapiens 


CSR2 


995 


99 


9529 


AC004472 


Homo sapiens 


PI. 11659 4 


449 


98 


9530 


AF273047 


Homo sapiens 


CTCL tumor antigen se20-7 


104 


29 


9531 


AF 13 0079 


Homo sapiens 


PR02852 


153 


67 


9532 


X07881 


Homo sapiens 


proline-rich protein Gl 


105 


43 


9533 


AK001845 


Homo sapiens 


unnamed protein product 


1368 


63 


9534 


S79410 


Mus musculus 


nuclear localization signal binding 


94 


41 








protein 






9535 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


185 


86 








6613. 






9536 


X99583 


Homo sapiens 


CHL1 protein 


635 


98 


9537 


AL022170 


Homo sapiens 


dJ501N12.1 


111 


37 


9538 


X66179 


Xenopus laevis 


p70 S6 kinase 


120 


75 



387 



MSDCCID: <WO 0164835A2_L> 



WO 01/64835 



PCT/US01/04927 



NO: 


Accession No. 


Species 


Description 


Water roan 


% 
y 


9539 


U68380 


Gallus gallus 


csdp 


289 


98 


9540 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


97 


90 


9541 


AB013918 


Homo sapiens 


CAD 


369 


78 


9542 


U29380 


Caenorhabditis 
elegans 


similar to adenylate cyclase 


501 


42 


9543 


AF020312 


Mus musculus 


proline-rich protein 9-1 


95 


51 


9544 


G03435 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7516. 


142 


96 


9545 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


142 


66 


9546 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


110 


47 


9547 


AB044546 


Homo sapiens 


mitogen-activated protein kinase kinase 
kinase 


407 


60 


9548 


U52111 


Homo sapiens 


Ca2+/Calmodulin-dependent protein 
kinase I 


660 


100 


9549 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


129 


68 


9550 


Y87852 


Homo sapiens 


Human FGF-5 protein fragment. 


146 


100 


9551 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


109 


50 


9552 


D45131 


Homo sapiens 


basigin 


311 


66 


9553 


A14656 


synthetic 
construct 


protein antigen 


450 


89 


9554 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


133 


52 


9555 


X73458 


Homo sapiens 


protein kinase 


568 


85 


9556 


AJ131890 


Homo sapiens 


DNA polymerase lambda 


1549 


100 


9557 


S56555 


Homo sapiens 


paraoxonase/arylesterase 


224 


85 


9558 


M22332 


Homo sapiens 


unknown protein 


90 


38 


9559 


B24598 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 47 SEQ ID NO:224. 


66 


44 


9560 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


95 


51 


9561 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


77 


76 


9562 


X55687 


Lycopersicon 
esculentum 


extensin (class II) 


69 


60 


9563 


AC002333 


Arabidopsis 
thaliana 


putative SF16 protein {Helianthus 
annuus} 


111 


31 


9564 


AF1 18082 


Homo sapiens 


PRO 1902 


106 


61 


9565 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


85 


52 


9566 


U10323 


Homo sapiens 


NF45 protein 


480 


82 


9567 


AF100757 


Homo sapiens 


COP9 complex subunit 4 


686 


90 


9568 


AL035608 


Homo sapiens 


dJ479J7.2 (transmembrane 4 
superfamily member 6) 


293 


90 


9569 


G00413 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4494. 


118 


63 


9570 


U47856 


Araneus 
diadematus 


fibroin-4 


98 


44 


9571 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


83 


54 


9572 


AJ006239 


Homo sapiens 


dihydropteridine reductase 


461 


84 


9573 


G00821 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4902. 


248 


75 


9574 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


90 


36 


9575 


V00488 


Homo sapiens 


alpha globin 


508 


84 


9576 


AF1 16719 


Homo sapiens 


PR02987 


483 


91 



388 



VISDOCID: <WO 0164835A2 I > 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Watcrma n 
Score 


Identit 

y 


9577 


L77967 


Ovis aries 


small proline-rich protein with paired 
repeat 


62 


38 


9578 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


474 


92 


9579 


P90387 


Homo sapiens 
(Human) 


N-terminal of human serum albumin 
polypeptide. 


274 


67 


9580 


U22961 


Homo sapiens 


similar to human albumin, Swiss-Prot 
Accession Number P02768; Method: 
conceptual translation supplied by 
author 


271 


88 


9581 


U 15779 


Homo sapiens 


p70 


101 


57 


9582 


M19419 


Mus musculus 


proline-rich salivary protein 


102 


36 


9583 


L22030 


Glycine max 


hydroxyproline-rich glycoprotein 


114 


40 


9584 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


105 


32 


9585 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


141 


61 


9586 


U93569 


Homo sapiens 


putative pi 50 


232 


51 


9587 


AF 144054 


Homo sapiens 


apoptosis related protein APR-4 


120 


52 


9588 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7S71. 


128 


58 


9589 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


90 


61 


9590 


L27428 


Homo sapiens 


reverse transcriptase 


120 


46 


9591 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


85 


62 


9592 


AJ133489 


Canis familiaris 


albumin 


204 


60 


9593 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


294 


68 


9594 


Ml 5386 


Homo sapiens 


gamma-globin 


316 


77 


9595 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


134 


68 


9596 


AF116719 


Homo sapiens 


PR02987 


309 


89 


9597 


AF1 16719 


Homo sapiens 


PR02987 


485 


94 


9598 


AF068294 


Homo sapiens 


HDCMB45P 


131 


34 


9599 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


152 


70 


9600 


AF 118082 


Homo sapiens 


PRO 1902 


151 


59 


9601 


GO 1984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


186 


68 


9602 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871.. 


176 


65 


9603 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


261 


80 


9604 


AF194537 


Homo sapiens 


NAG13 


154 


61 


9605 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


96 


67 


9606 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


100 


33 


9607 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


134 


48 


9608 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


129 


67 


9609 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


101 


32 


9610 


AF1 18080 


Homo sapiens 


PRO 1880 


237 


100 


9611 


AF1 19851 


Homo sapiens 


PRO 1722 


138 


81 


9612 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


178 


55 


9613 


AF090930 


Homo sapiens 


PRO0478 


88 


71 


9614 


AF068294 


Homo sapiens 


HDCMB45P 


121 


49 


9615 


G01657 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


79 



389 



0164835A2_L> 



WO 01/64835 PCT/US01/04927 



SEQ1D 


Accession No. 




Description 


Smith- 
Waterman 


y 








5738. 






9616 


AF090931 


Homo sapiens 


PRO0483 


143 


63 


9617 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ IDNO:387. 


92 


44 


9618 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492: 


121 


56 


9619 


AB046061 


Macaca 
fascicularis 


unnamed protein product 


93 


44 


9620 


Y87116 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 155. 


122 


71 


9621 


AF 113685 


Homo sapiens 


PRO0974 


156 


65 


9622 


AF097178 


Equus caballus 


glyceraldehyde-3-phosphate 
dehydrogenase 


119 


75 


9623 


U83303 


Homo sapiens 


line- 1 reverse transcriptase 


134 


62 


9624 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


1 12 


J 


9625 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


123 


77 


9626 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


164 


73 


9627 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


236 


71 


9628 


AF 130079 


Homo sapiens 


PR02852 


138 


50 


9629 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO" 
4658. 


130 


60 


9630 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


167 


61 


9631 


G03102 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7183. 


87 


50 


9632 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


128 


41 


9633 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


134 


81 


9634 


AF 11 8082 


Homo sapiens 


PRO 1902 


114 


57 


9635 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


133 


54 


9636 


L07543 


Leishmania 
tarentolae 


MURF4 


85 


42 


9637 


U49973 


Homo sapiens 


ORF2: function unknown 


163 


44 


9638 


AF090895 


Homo sapiens 


PRO0117 


77 


41 


9639 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


108 


71 


9640 


Y144S0 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


90 


46 


9641 


AF 118078 


Homo sapiens 


PRO! 848 


1 14 


66 


9642 


AK024455 


Homo sapiens 


FLJ00047 protein 


131 


79 


9643 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


130 




9644 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


141 




9645 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


69 


62 


9646 


AF055985 


Onchocerca 
volvulus 


pyrrolidone-rich antigen 


69 


35 


9647 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


478 


90 


9648 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


155 


50 


9649 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


60 



390 



0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



NO: 


Accession No, 


Species 


Description 


Waterman 

Score 


Idcntit 

y 








6619. 






9650 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


173 


38 


9651 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


118 


36 


9652 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


162 


64 


9653 


D00570 


Mus musculus 


open reading frame (196 AA) 


182 


51 


9654 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


138 


54 


9655 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


105 


74 


9656 


W88947 


Homo sapiens 


Polypeptide fragment encoded by gene 
118. 


82 


70 


9657 


W88947 


Homo sapiens 


Polypeptide fragment encoded by gene 
118. 


79 


39 


9658 


M24732 


Homo sapiens 


lamin-like protein 


76 


31 


9659 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


161 


73 


9660 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhem) gene preoduct. 


149 


48 


9661 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


135 ■ 


62 


9662 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


126 


71 


9663 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


141 


63 


9664 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


153 


65 


9665 


R96S00 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


142 


93 


9666 


AF1 13685 


Homo sapiens 


PRO0974 


112 


73 


9667 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


133 


62 


9668 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


102 


47 


9669 


B12310 


Homo sapiens 


Human secreted protein encoded by 
gene 10 clone HDPGP94. 


80 


41 


9670 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


106 


58 


9671 


AF090942 


Homo sapiens 


PRO0657 


118 


66 


9672 


AFI61356 


Homo sapiens 


HSPC093 


94 


63 


9673 


AF090930 


Homo sapiens 


PRO0478 


167 


64 


9674 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


98 


50 


9675 


G02640 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6721. 


87 


70 


9676 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


119 


60 


9677 


X13412 


Rattus rattus 


flk protein 


614 


69 


9678 


AJ006770 


Cicer arietinum 


extensin 


163 


48 


9679 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


120 


67 


9680 


G00494 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4575. 


106 


58 


9681 


U63542 


Homo sapiens 


FAP protein 


120 


68 


9682 


M15530 


Homo sapiens 


B-cell growth factor 


146 


76 


9683 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


180 


62 


9684 


AF321379 


Callicebus 
moloch 


gamma2-gIobin 


108 


u 



391 



WO (11/64835 



PCT/USOJ/04927 



NO: 






Description 


Wnterman 


ldentit 

y 


9685 


L27428 - 


Homo sapiens 


reverse transcriptase 


122 


39 


9686 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


77 


51 


9687 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


104 


58 


9688 


G02363 


Homo sapiens 


Human secreted protein, SEQ JD NO: 
6444. 


123 


58 


9689 


X81713 


Homo sapiens 


smallest subunit of TFIIA 


210 


91 


9690 


AF193330 


Human 
respiratory 
syncytial virus 


G protein 


100 


25 


969) 


AF150100 


Homo sapiens 


small zinc finger-like protein 


205 


83 


9692 


AF068294 


Homo sapiens 


HDCMB45P 


122 


54 


9693 


AF1 89307 


Homo sapiens 


unknown 


96 


50 


9694 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


165 


50 


9695 


AF130089 


Homo sapiens 


PRO2550 


198 


68 


9696 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 


170 


80 


9697 


AK024455 


Homo sapiens 


FLJ00047 protein 


137 


63 


9698 


M24732 


Homo sapiens 


lamin-like protein 


92 


42 


9699 


AF116715 


Homo sapiens 


PR02829 


119 


75 


9700 


AF1 30089 


Homo sapiens 


PRO2550 


234 


57 


9701 


V00662 


Homo sapiens 


ATPase 6 


255 


77 


9702 


L 10908 


Mus musculus 


Gcapl gene product 


93 


35 


9703 


AF1 30089 


Homo sapiens 


PRO2550 


208 


78 


9704 


AF194537 


Homo sapiens 


NAG 13 


89 


51 


9705 


U79260 


Homo sapiens 


unknown 


98 


77 


9706 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


7S 


43 


9707 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


70 


92 


9708 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


136 


60 


9709 


AF1 94537 


Homo sapiens 


NAG 13 


99 


48 


9710 


AF 130089 


Homo sapiens 


PRO2550 


87 


58 


9711 


X03145 


Homo sapiens 


pot. ORF III 


135 


41 


9712 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


97 


35 


9713 


AF 116695 


Homo sapiens 


PR02221 


185 


50 


9714 . 


AF130051 


Homo sapiens 


PRO0898 


108 


68 


9715 


AF1 30089 


Homo sapiens 


PRO2550 


143 


50 


9716 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


124 


49 


9717 


L27428 


Homo sapiens 


reverse transcriptase 


133 


50 


9718 . 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


88 


43 


9719 


L27428 


Homo sapiens 


reverse transcriptase 


127 


56 


9720 


M74009 


Theropithecus 
gelada 


cytochrome c oxidase subunit II 


283 


67 


9721 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


292 


84 


9722 


AF130051 


Homo sapiens 


PRO0898 


96 


57 


9723 


U93564 


Homo sapiens 


putative pi 50 


144 


49 


9724 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


373 


76 


9725 


AK000496 


Homo sapiens 


unnamed protein product 


155 


55 


9726 


G02640 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6721. 


94 


62 



392 



-ISOOCID: <WO 0164835A2J_> 



WO 0 1/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 




AF1 30089 


Homo sapiens 


PRO2550 


154 


40 


9728 


API 30089 


Homo sapiens 


PRO2550 


127 


61 


9729 


M 1 94 1 9 


Mus musculus 


proline-rich salivary protein 


98 


39 


9730 


D90053 


Sus scrofa 


destrin 


119 


43 


973 ) 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


138 


50 


9732 


AF090942 


Homo sapiens 


PRO0657 


72 


45 


9733 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID "NO: 
6065. 


110 


44 


9734 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


159 


94 


9735 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


150 


85 


9736 


L27428 


Homo sapiens 


reverse transcriptase 


217 


33 


9737 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


89 


65 


9738 


AF007064 


Saccharomyces 
cerevisiae 


basic protein 


108 


31 


9739 


B01372 


Homo sapiens 


Neuron-associated protein. 


128 


60 


9740 


G03140 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7221. 


109 


50 


9741 


AF 194537 


Homo sapiens 


NAG 13 


123 


54 


9742 


Y69166 


Homo sapiens 


A mature human N-acetylglycosaminyl 
transferase protein. 


101 


77 


9743 


X92485 


Plasmodium 
vivax 


pval 


106 


41 


9744 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


122 


76 


9745 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


128 


50 


9746 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


87 


64 


9747 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


163 


69 


9748 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


124 


74 


9749 


Y 14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


97 


66 


9750 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


156 


71 


9751 


G03925 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8006. 


157 


71 


9752 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


97 


51 


9753 


X92485 


Plasmodium 
vivax 


pval 


91 


39 


9754 


Y07786 


Vibrio cholerae 


Rfc-like protein 


92 


32 


9755 


AF068294 


Homo sapiens 


HDCMB45P 


125 


63 


9756 


L27428 


Homo sapiens 


reverse transcriptase 


91 


34 


9757 


AK024455 


Homo sapiens 


FLJ00047 protein 


143 


59 


9758 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


103 


41 


9759 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


96 


67 


9760 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


179 


58 


9761 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


154 


65 


9762 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


78 
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NO: 




Species 


Description 


Smith- 
Waterman 
Score 


Identit 








7888. 






9763 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


119 


60 


9764 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


111 


85 


9765 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


116 


36 


9766 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


165 


74 


9767 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


162 


74 


976S 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


117 


52 


9769 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


92 


50 


9770 


AF 13 0089 


Homo sapiens 


PRO2550 


104 


31 


9771 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


154 


62 


9772 


U52077 


Homo sapiens 


mariner transposase 


194 


56 


9773 


YS7297 


Homo sapiens 


Human signal peptide containing 
protein HSPP-74 SEQ ID NO:74. 


345 


100 


9774 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


73 


36 


9775 


M11901 


Rattus 
norvegicus 


proline-rich salivary protein 


96 


32 


9776 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7214. 


124 


53 


9777 


L27428 


Homo sapiens 


reverse transcriptase 


251 


53 


9778 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


141 


40 


9779 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


111 


58 


9780 


AJ224997 


Rattus 
norvegicus 


huntingtin 


64 


64 


9781 


AF283769 


Homo sapiens 


similar to GenBank Accession Number 
AC021163 


119 


53 


9782 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


81 


50 


9783 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


120 


68 


9784 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


85 


65 


9785 


AF1 44054 


Homo sapiens 


apoptosis related protein APR-4 


83 


53 


9786 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


75 


41 


9787 


M64792 


Rattus 
norvegicus 


salivary proline-rich protein 


122 


45 


9788 


D90053 


Sus scrofa 


destrin 


146 


43 


9789 


A18812 


Brassica napus 


extensm 


104 


34 


9790 


AF1 30089 


Homo sapiens 


PRO2550 


102 


66 


9791 


AF130089 


Homo sapiens 


PRO2550 


326 


70 


9792 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


495 


79 


9793 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


343 


75 


9794 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


106 


58 


9795 


X9248S 


Plasmodium 
vivax 


pval 


108 


48 


9796 


M15530 


Homo sapiens 


B-cell growth factor 


119 


70 


9797 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7214. 


148 


65 


9798 


X61046 


Hydra sp. 


mini-collagen - 


101 


55 
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NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 


9799 


U93571 


Homo sapiens 


p40 


82 


36 


9800 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


170 


52 


9801 


X92485 


Plasmodium 
vivax 


pval 


113 


45 


9802 


Y 144 79 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


63 


44 


9803 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


583 


89 


9804 


G01736 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5817. 


150 


47 


9805 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


132 


71 


9806 


X92485 


Plasmodium 
vivax 


pval 


106 


53 


9807 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ IDNO:490. 


163 


67 


9808 


AF1 19851 


Homo sapiens 


PRO 1722 


341 


68 


9809 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7563. 


77 


57 


9810 


AP000616 


Oryza sativa 


similar to RTNG-H2 finger protein 
RHAla (AF078683) 


133 


72 


9811 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


113 


65 


9812 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


118 


37 


9813 


G03703 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7784. 


121 


72 


9814 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


72 


92 


9815 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


71 


71 


9816 


AF1 18082 


Homo sapiens 


PRO 1902 


93 


45 


9817 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


207 


59 


9818 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


99 


46 


9819 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


130 


69 


9820 


LI 0908 


Mus musculus 


Gcapl gene product 


99 


37 


9821 


A00469 


Homo sapiens 


growth hormone 


254 


81 


9822 


U79260 


Homo sapiens 


unknown 


85 


39 


9823 


R96S00 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


125 


62 


9824 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


103 


74 


9825 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588. 


126 


51 


9826 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


148 


44 


9827 


Y24789 


Homo sapiens 


Human secreted protein bf377_l. 


265 


100 


9828 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


72 


60 


9829 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


118 


45 


9830 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


104 


51 


9831 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


118 


74 


9832 


AF151866 


Homo sapiens 


CG1-108 protein 


115 


31 


9833 


U93563 


Homo sapiens 


putative pi 50 


127 


61 
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Accession No. 






Wntermflii 
Score 


Identit 

y 


9834 


AE003859 


Xylella 
fastidiosa 


hypothetical protein 


97 


30 


9835 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


138 


65 


9836 


B01372 


Homo sapiens 


Neuron-associated protein. 


150 


88 


9837 


G03115 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7196. 


115 


57 


9838 


AF068294 


Homo sapiens 


HDCMB45P 


90 


65 


9839 


S77772 


Homo sapiens 


aspartylglucosaminidase, AGA {C- 
terminal, alternatively spliced} {EC 
3.5.1.26} 


71 


66 


9840 


AF090942 


Homo sapiens 


PRO0657 


115 


76 


9841 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


75 


54 


9842 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


156 


67 


9843 


AF 130089 


Homo sapiens 


PRO2550 


216 


75 


9844 


M15530 


Homo sapiens 


B-cell growth factor 


113 


67 


9845 


U93567 


Homo sapiens 


p40 


248 


75 


9846 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


151 


55 


9847 


AK024455 


Homo sapiens 


FLJ00047 protein 


151 


64 


9848 


AF 118082 


Homo sapiens 


PRO 1902 


58 


81 


9849 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


129 


65 


9850 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


329 


77 


9851 


AF090944 


Homo sapiens 


PRO0663 


122 


80 


9852 


X71442 


Rattus 
norvegicus 


ORF 1 ; putative 


109 


48 


9853 


AF 130089 


Homo sapiens 


PRO2550 


126 


60 


9854 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


98 


62 


9855 


U93563 


Homo sapiens 


putative pi 50 


136 


48 


9856 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


276 


74 


9857 


AFO 17777 


Drosophila 
melanogaster 


la costa 


99 


43 


9858 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


99 


37 


9859 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


106 


37 


9860 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


138 


71 


9861 


AF1 16909 


Homo sapiens 


unknown 


103 


43 


9862 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


113 


38 


9863 


Y21811 


Homo sapiens 


CPF polypeptide 36PRO. 


131 


58 


9864 


U93567 


Homo sapiens 


p40 


252 


70 


9865 


AF090895 


Homo sapiens 


PRO0117 


164 


60 


9866 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


109 


47 


9867 


AF090930 


Homo sapiens 


PRO0478 


106 


84 


9868 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


167 


84 


9869 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


83 


60 


9870 


AF1 19900 


Homo sapiens 


PR02822 


134 


48 


9871 


AF1 16715 


Homo sapiens 


PR02829 


111 


81 


9872 


U52077 


Homo sapiens 


mariner transposase 


332 


76 
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% 


NO: 








Wntermnn 


Identit 










Score 


y 


9873 


Y69166 


Homo sapiens 


A mature human N-acetylglycosaminyl 


95 


85 








transferase protein. 






9874 


U93570 


Homo sapiens 


putative pi 50 


243 


44 


9875 


L27428 


Homo sapiens 


reverse transcriptase 


127 


54 


9876 


S80119 


Rattus sp. 


reverse transcriptase homolog 


163 


43 


9877 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 13 


64 








4484. 






9878 


AF 187823 


Zea mays 


transposase DOPA 


132 


38 


9879 


W40353 


Homo sapiens 


Human unspecified protein from 


101 


41 








US5702907. 






9880 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


43 








7496. 






9881 


LT49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 


101 


59 








similar to pogo element 






9882 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


52 








4718. 






9883 


GO 1249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


57 


65 








5330. 






9884 


X92485 


Plasmodium 


pval 


129 


43 






vivax 








9885 


M64792 


Rattus 


salivary proline-rich protein 


98 


37 






norvegicus 








9886 


Y02749 


Homo sapiens 


Human secreted protein encoded by 


89 


59 








gene 1 00 clone HNFIU96. 






9887 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


54 








8153. 






9888 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


40 








8144. 






9889 


D38II2 


Homo sapiens 


NADH dehydrogenase subunit 2 


346 


68 


9890 


G03I33 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


38 








72 14. 






9891 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


66 








7879. 






9892 


X53581 


Rattus 


ORF7 


81 


56 






norvegicus 








9893 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


53 








7348. 






9894 


AE003859 


Xylella 


hypothetical protein 


99 


27 






fastidiosa 








9895 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


157 


58 


9896 


L26953 


Homo sapiens 


chromosomal protein 


107 


68 


9897 


AL390114 


Leishmania 


probable (hhv-6) ul 102, variant a 


95 


51 






major 


DNA, complete virion genome 






9898 


U93569 


Homo sapiens 


putative pi 50 


106 


52 


9899 


U 15647 


Mus musculus 


reverse transcriptase 


118 


47 


9900 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


118 


70 








similar to pogo element 






9901 


U93563 


Homo sapiens 


putative pi 50 


116 


39 


9902 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


81 








4409. 






9903 


X03145 


Homo sapiens 


pot. ORF V 


148 


54 


9904 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


65 








7868. 






9905 


AF066073 


Dictyostelium 


SP85; PsB 


113 


52 






discoideum 








9906 


AF090895 


Homo sapiens 


PRO0117 


125 


58 


9907 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


57 








4478. 






9908 


M24732 


Homo sapiens 


lamin-like protein 


95 


35 
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SEQID 


Accession No. 


Species 


Description 


Smith- 

Wattrman 

Score 


y 


9909 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


96 


55 


9910 


AF1 18080 


Homo sapiens 


PROl 880 


120 


66 


991] 


AF1 16719 


Homo sapiens 


PR02987 


216 


72 


9912 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


150 


75 


9913 


G03703 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7784. 


129 


76 


9914 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


72 


58 


9915 


G00233 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4314. 


103 


71 


9916 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


1 14 


75 


9917 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


75 


83 


9918 


AF1 16712 


Homo sapiens 


PR02738 


106 


69 


9919 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


90 


77 


9920 


AF068294 


Homo sapiens 


HDCMB45P 


296 


59 


9921 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7077. 


58 


54 


9922 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 


143 


69 


9923 


U38979 


Homo sapiens 


hPMSR3 


135 


46 


9924 


AB002317 


Homo sapiens 


KIAA0319 


294 


75 


9925 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


1 69 


42 


9926 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


153 


59 


9927 


G00214 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4295. 


261 


97 


9928 


Y 14483 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1 8. 


863 


76 


9929 


AF130051 


Homo sapiens 


PRO0898 


128 


60 


9930 


AF130089 


Homo sapiens 


PRO2550 


123 


41 


9931 


AF064597 


Homo sapiens 


LINE-1 like protein 


101 


48 


9932 


AF027144 


Homo sapiens 


zinc finger protein 


76 


42 


9933 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
4770. . 


120 


44 


9934 


U93569 


Homo sapiens 


putative pi 50 


105 


68 


9935 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


133 


51 


9936 


Y73344 


Homo sapiens 


HTRM clone 0258181 protein 
sequence. 


145 


46 


9937 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


123 


77 


9938 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


159 


40 


9939 


D63643 


Homo sapiens 


clathrin coat assembly protein-like 


232 


69 


9940 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


75 


65 


9941 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


176 


71 


9942 


X04588 


Homo sapiens 


cytoskeletal tropomyosin (AA 1-248) 


379 


81 


9943 


G03683 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7764. 


58 


52 


9944 


AF130114 


Homo sapiens 


PR02459 


117 


67 


9945 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


297 


64 



398 
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NO 3 ' 0 


Accession No. 


Species 


Description 


W^ternnn 
Score 


Identic 

y 








similar to pogo element 






9946 


D00570 


Mus musculus 


open reading frame (251 AA) 


214 


63 


9947 


X03717 


Homo sapiens 


pot. unidentified reading frame 


76 


35 


9948 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


127 


64 


9949 


J02621 


Homo sapiens 


high mobility group protein 14 


90 


50 


9950 


AF090931 


Homo sapiens 


PRO0483 


114 


75 


9951 


AF1646I2 


Homo sapiens 


Gag protein 


132 


63 


9952 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


112 


65 


9953 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


118 


59 


9954 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


138 


74 


9955 


G00974 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5055. 


94 


38 


9956 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


124 


61 


9957 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


177 


51 


9958 


X92485 


Plasmodium 
vivax 


pval 


137 


44 


9959 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


142 


53 


9960 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


174 


68 


9961 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


123 


38 


9962 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


124 


56 


9963 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


80 


57 


9964 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


136 


82 


9965 


Y27908 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


92 


53 


9966 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


76 


40 


9967 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


136 


38 


9968 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


101 


67 


9969 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


90 


83 


9970 


Y36156 


Homo sapiens 


Human secreted protein #28. 


170 


68 


9971 


U79260 


Homo sapiens 


unknown 


142 


67 


9972 


L27428 


Homo sapiens 


reverse transcriptase 


10S 


63 


9973 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


119 


60 


9974 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


78 


85 


9975 


W88947 


Homo sapiens 


Polypeptide fragment encoded by gene 
118. 


79 


39 


9976 


AF090944 


Homo sapiens 


PRO0663 


109 


73 


9977 


AF116715 


Homo sapiens 


PR02829 


120 


56 


9978 


U93563 


Homo sapiens 


putative pi 50 


132 


57 


9979 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


97 


80 


9980 


U42580 


Paramecium 
bursaria 


A658R 


70 


40 
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SEQ1D 


Accession No. 


Species 


Descriplion 


Smith- 
Waterman 
Score 


% 

Identit 

y 






Chlorella virus 1 








9981 


G03371 


Homo.sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


87 


61 


9982 


G02363 


Homo sapiens 


Human secrcted'protein, SEQ ID NO: 
6444. 


135 


70 


9983 


X61047 


Hydra sp. 


mini-collagen 


90 


36 


9984 


G03564 


Homo sapiens 


7645. 






9985 


Y41552 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 80. 


90 


42 


9986 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


132 


52 


9987 


G03787 


Homo sapiens 


7868. 


— — — 


— 

41 


9988 


AF042832 


Homo sapiens 


forkhead-related transcription factor 
FREAC-9 


89 


36 


9989 


X03717 


Homo sapiens 


pot. unidentified reading frame 


~7ol 


— 


9990 


AL390935 


Leishmania 


possible dopap 


118 


74 


9991 


AF201949 


Homo sapiens 


60S ribosomal protein L30 isolog 


109 


58 


9992 


AF1301 14 




PR02459 




68 


9993 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


148 


83 


9994 


AF068294 


Homo sapiens 


HDCMB45P 


196 


43 


9995 


Y95028 


Homo sapiens 
_ . 


Human clone vp7_l ORF2, SEQ ID 


117 


54 


9996 


Ml 1759 


escu°enrujr| 0n 


cell wall hydroxyproline-rich 


74 


53 


9997 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


132 


55 


9998 


AF102771 


Homo sapiens 


(RalGDS/AF-6) domain family 1 
protein isofbrm ID 


"148 


Too — 


9999 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


213 


88 


10000 


D381 12 


Homo sapiens 








10001 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


97 


39 


10002 


AK000496 


Homo sapiens 


unnamed protein product 


154 


74 


10003 


S58722 


Homo sapiens 


terminal, clone XEH.8c} 






1000A 


B07702 




Protein encoded by the endogenetic 
fragment of HERV-W. 




56 


10005 


Y36366 




Fragment of human secreted protein 
encoded by gene 3. 


97 


65 


10006 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


154 


72 


10007 


AF021231 


Mus musculus 


acetylcholinesterase-associated 


109 


41 


10008 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

f5 44- — ■ — - — — 


69 


78 


10009 


G03714 


Homosa iens 

omo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


170 


78 


10010 


AF130051 


Homo sapiens 


PRO0898 


134 


67 


10011 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


100 


44 


10012 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


95 


78 


10013 


AF073519 


Homo sapiens 


small EDRK-rich factor 1, long isoform 


90 


52 


10014 


AF 13 0052 


Homo sapiens 


PRO0956 


104 


50 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 


10015 


X92485 


Plasmodium 
vivax 


pval 


119 


46 


10016 


AF 130089 


Homo sapiens 


PRO2550 


171 


56 


10017 


AF1 30089 


Homo sapiens 


PRO2550 


172 


58 


10018 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


146 


38 


10019 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


109 


54 


10020 


G02584 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6665. 


109 


43 


10021 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


70 


51 


10022 


AF090942 


Homo sapiens 


PRO0657 


113 


67 


10023 


Ml 1759 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


77 


33 


10024 


X55681 


Lycopersicon 
esculentum 


extensin (class I) 


95 


35 


10025 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


93 


61 


10026 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


153 


42 


10027 


Y73377 


Homo sapiens 


HTRM clone 1645941 protein 
sequence. 


1180 


84 


10028 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


108 


59 


10029 


G00366 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4447. 


112 


65 


10030 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


163 


82 


10031 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


131 


66 


10032 


G02451 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6532. 


74 


48 


10033 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7001. 


104 


85 


10034 


G03093 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7174. 


192 


60 


10035 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


.153 


55 


10036 


Y 14479 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


128 


76 


10037 


AF1 18086 


Homo sapiens 


PRO 1992 


148 


68 


10038 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


125 


47 


10039 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


137 


65 


10040 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


139 


45 


10041 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


128 


75 


10042 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


100 




10043 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


102 


64 


10044 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


135 


71 


10045 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


148 


46 


10046 


X92485 


Plasmodium 
vivax 


pval 


98 


58 
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WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Speccs 


Description 


Smilh- 
Waterman 


% 

Idenlit 


10047 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW 18. 


102 


87 


10048 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7437. 


63 


84 


10049 


U09202 


Homo sapiens 


ornithine decarboxylase antizyme 


202 


67 


10050 


X92485 


Plasmodium 
vivax 


pval 


77 


34 


10051 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


87 


40 


10052 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


114 


57 


10053 


AC005396 


Arabidopsis 
thaliana 


putative proline-rich protein 


100 


33 


10054 


G03600 


Homo sapiens 


Fluman secreted protein, SEQ ID NO: 
7681. 


91 


94 


10055 


X73434 


Ovis aries 


KAP5.4 keratin protein 


76 


45 


10056 


G 024 85 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


82 


62 


10057 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


115 


66 


10058 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


116 


55 


10059 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


130 


73 


10060 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


100 


42 


10061 


L27428 


Homo sapiens 


reverse transcriptase 


107 


43 


10062 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


92 


47 


10063 


G03112 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7193. 


93 


46 


10064 


AF220264 


Homo sapiens 


MOST-1 


135 


48 


10065 


AF130114 


Homo sapiens 


PR02459 


87 


68 


10066 


D86853 


Catharanthus 
roseus 


extensin 


115 


35 


10067 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4581. 


79 


71 


10068 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


102 


54 


10069 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


93 


77 


10070 


AJ233597 


Mus famulus 


reverse transcriptase 


65 


36 


10071 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


94 


67 


10072 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


114 


51 


10073 


AK024455 


Homo sapiens 


FLJ00047 protein 


146 


66 


10074 


X92485 


Plasmodium 
vivax 




121 


36 


10075 


G 03 790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


130 


64 


10076 


AF055985 


Onchocerca 
volvulus 


pyrrolidone-rich antigen 


73 


43 


10077 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


165 


75 


10078 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


106 


72 


10079 


AF130050 


Homo sapiens 


PRO0872 


102 


63 


10080 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


.130 . 


44 
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SEQ1D 
NO: 


Accession No. 




Description 




y 








8153. 






10081 


Y87156 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 195. 


285 


78 


10082 


AF068294 


Homo sapiens 


HDCMB45P 


144 


68 


10083 


U41038 


Caenorhabditis 
elegans 


Similar to cadherin-type repeat 


98 


48 


10084 


X92485 


Plasmodium 
vivax 


pval 


108 


53 


10085 


AF 11 8082 


Homo sapiens 


PRO 1902 


117 


57 


10086 


AF090942 


Homo sapiens 


PRO0657 


88 


53 


10087 


AF118086 


Homo sapiens 


PRO 1992 


172 


71 


10088 


Y01400 


Homo sapiens ' 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


119 


68 


10089 


K02576 


Homo sapiens 


salivary proline-rich protein I 


117 


34 


10090 


W4835 1 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


154 


77 


10091 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


138 


56 


10092 


G03I19 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7200. 


90 


56 


10093 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


134 


66 


10094 


Ml 5530 


Homo sapiens 


B-cell growth factor 


116 


64 


10095 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


124 


65 


10096 


M15530 


Homo sapiens 


B-cell growth factor 


87 


68 


10097 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


84 


79 


10098 


G00369 


Homo sapiens 


Hiiman secreted protein, SEQ ID NO: 
4450. 


150 


66 


10099 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


157 


70 


10100 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7563. 


103 


60 


10101 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


114 


70 


10102 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


130 


53 


10103 


G03786 ' 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


156 


52 


10104 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


87 


60 


10105 


L27428 


Homo sapiens 


reverse transcriptase 


220 


38 


10106 


W90834 


Homo sapiens 


Human lymphocyte targeted peptide 
#2. 


91 


43 


10107 


AF161356 


Homo sapiens 


HSPC093 


•227 


61 


10108 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


123 


64 


10109 


AF130089 


Homo sapiens 


PRO2550 


126 


77 


10110 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO.250. 


118 


60 


10111 


M63274 


Plasmodium 
falciparum 


malaria antigen 


81 


54 


10112 


AF090944 


Homo sapiens 


PRO0663 


133 


70 


10113 


L27428 


Homo sapiens 


reverse transcriptase 


91 


32 


10114 


Y17221 


Homo sapiens 


Human secreted protein (clone fk3 17- 
3). 


112 


64 


10115 


W88947 


Homo sapiens 


Polypeptide fragment encoded by gene 
118. 


85 


44 
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NO: 




Species 


Description 


V^Mterman 


% 

Identit 


10116 


L27428 


Homo sapiens 


reverse transcriptase 


112 


44 


101 17 


G02501 


JHomo sapiens 


Human secreted protein, SEQ ID NO: 
65S2. 


1 15 


61 


101 18 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


79 


34 


101 19 


AF 194537 


Homo sapiens 


NAG 1 3 


295 


51 


10120 


D00570 


Mus musculus 


open reading frame (196 AA) 


97 


40 


10121 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


122 


47 


10122 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


148 


55 , 


10123 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


92 


66 


10124 


AF090942 


Homo sapiens 


PRO0657 


127 


78 


10125 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


145 


68 


10126 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


113 


49 


10127 


U93563 


Homo sapiens 


putative pi 50 


94 


35 


10128 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HC AC J81. 


146 


79 


10129 


AF 130089 


Homo sapiens 


PRO2550 


208 


75 


10130 


M15530 


Homo sapiens 


B-cell growth factor 


102 


73 


10131 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


122 


30 


10132 


L27428 


Homo sapiens 


reverse transcriptase 


197 


40 


10133 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


99 


73 


10134 


AF 194537 


Homo sapiens 


NAG 13 ' 


220 


48 


10135 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


116 


75 


10136 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


117 


61 


10137 


YS6573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


223 


5 P 


10138 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


178 


75 


10139 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


170 


52 


10140 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


133 


64 


10141 


AF072164 


Homo sapiens 


HFSE-1 


80 


60 


10142 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


103 


74 


10143 


Ml 5530 


Homo sapiens 


B-cell growth factor 


91 


40 


10144 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


103 


61 


10145 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


162 


65 


10146 


G00379 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4460. 


128 


53 


10147 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 


112 


61 


10148 


X71975 


Drosophila 
melanogaster 


put. homologue to S.cerevisiae GAR1 
gene 


118 


39 


10149 


S58722 


Homo sapiens 


X-Iinked retinopathy protein {C- 
terminal, clone XEH. 8c} 


123 


49 


10150 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


163 


81 



404 
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NO: 








Waterman 


Idcntit 

y 


10151 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


106 


83 


10152 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


117 


52 


10153 


AF1 13685 


Homo sapiens 


PRO0974 


133 


62 


10154 


U41038 


Caenorhabditis 
elegans 


Similar to cadherin-type repeat 


129 


67 


10155 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


89 


50 


10156 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


131 


41 


10157 


ABO 11099. 


Homo sapiens 


KIAA0527 protein 


94 


35 


10158 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


78 


72 


10159 . 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


139 


71 


10160 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


161 


68 


10161 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


162 


88 


10162 


Ml 5530 


Homo sapiens 


B-cell growth factor 


137 


63 


10163 


AF090895 


Homo sapiens 


PRO0117 


93 


67 


10164 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


109 


60 


10165 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


126 


52 


10166 


AF1 18082 


Homo sapiens 


PRO 1902 


111 


55 


10167 


AF090895 


Homo sapiens 


PRO0117 


112 


75 


10168 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


135 


69 


10169 


AF090942 


Homo sapiens 


PRO0657 


149 


67 


10170 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


145 


68 


10171 


AL 160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


85 


51 


10172 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


136 


62 


10173 


G 02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


158 


63 


10174 


AF090942 


Homo sapiens 


PRO0657 


75 


44 


10175 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


97 


47 


10176 


Y01154 


Homo sapiens 


Protein sequence Seq Id 54 from 
WO9901020. 


108 


.75 


10177 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


168 


80 


10178 


M22332 


Homo sapiens 


unknown protein 


131 


62 


10179 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


160 


60 


10180 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


65 


68 


10181 


AF 11 9900 


Homo sapiens 


PR02822 


132 


72 


10182 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


114 


56 


10183 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


124 


59 


10184 


AF2 10651 


Homo sapiens 


NAG 1 8 


150 


65 


10185 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


109 


46 


10186 


X71442 


Rattus 


ORF 1; putative 


91 


47 
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SEQ ID 

NO. 


Accession No. 


Species 


Description 


j Smith- 
Waterman 


Identit 






norvegjcus 








10187 


AK024455 


Homo sapiens 


FLJ00047 protein 


122 


54 


10188 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


82 


72 


10189 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


78 


61 


10190 


AF220264 


Homo sapiens 


MOST-1 


165 




10191 


Y91617 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 1 9 SEQ ID NO:290. 


54 


44 


10192 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


1 19 


80 


10193 


X92485 


Plasmodium 


pval 


128 


57 


10194 


X61296 


Rattus 
norvegicus 


open reading frame 2 


79 


72 


10195 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


1 16 


62 


10196 


AF1 30089 


Homo sapiens 


PRO2550 


144 


62 


10197 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


124 


56 


10198 


AF090942 


Homo sapiens 


PRO0657 


133 


76 


10199 


S79304 


Rattus sp. 


cytochrome oxidase subunit Ij COX I 


280 


66 


10200 


S61070 


Homo sapiens 


reverse transcriptase homolog=poJ 
{retroviral element} 


188 


49 


10201 


G03790 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
7871. 


125 


51 


10202 


AF068294 


Homo sapiens 


HDCMB45P 


174 


65 


10203 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


174 


50 


10204 


U93572 


Homo sapiens 


putative pi 50 


77 


40 


10205 


L27428 


Homo sapiens 


reverse transcriptase 


133 


50 


10206 


J02621 


Homo sapiens 


high mobility group protein 14 


94 


57 


10207 


AF130079 


Homo sapiens 


PR02852 


144 


73 


10208 


R95913 


Homo sapiens 


Neural thread protein. 


134 


51 


10209 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


154 


76 


10210 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


162 


60 


10211 


AF090931 


Homo sapiens 


PRO04S3 


128 


79 


10212 


AF130051 


Homo sapiens 


PRO0898 


144 


76 


10213 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


95 


54 


10214 


AFI30089 


Homo sapiens 


PRO2550 


158 


70 


10215 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


96 


55 


10216 


M15894 


Homo sapiens 


chorionic somatomammotropin 
precursor 


116 


63 


10217 


AK024372 


Homo sapiens 


unnamed protein product 


85 


56 


10218 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


1 1 8 




10219 


M37679 






"it 


~ 


10220 


G03806 


Homo sa ' ns 

mo sapiens 


Human secreted protein SEQ ID NO' 
7887. 




~ 

■ 


10221 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


127 


47 


10222 


Y36505 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 29. 


102 


48 


10223 


AF090940 


Homo sapiens 


PRO0644 


105 


76 


10224 


U49974 


Homo sapiens 


mariner transposase 


155 


76 


10225 


K02576 


Homo sapiens 


salivary praline-rich protein 1 


154 


42 



406 



0164835A2_I_> 



WO 01/64835 



PCT/USO 1/0492 7 



NO: 


Accession No. 


Species 


Description 


Smith- 
Score 


°/o 

y 


10226 


AF293639 


Homo sapiens 


dioxin receptor repressor 


819 


98 


10227 


L27428 


Homo sapiens 


reverse transcriptase 


137 


44 


10228 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


106 


70 


10229 


U93566 


Homo sapiens 


p40 


206 


80 


10230 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


114 


66 


10231 


API 18082 


Homo sapiens 


PRO 1902 


76 


52 


10232 


AF 11 8086 


Homo sapiens 


PRO 1992 


99 


75 


10233 


X53375 


Helianthus 
annuus 


anther-specific protein SF1 8 


95 


64 


10234 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


156 


72 


10235 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


100 


35 


10236 


L27428 


Homo sapiens 


reverse transcriptase 


244 


50 


10237 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


93 


70 


10238 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


142 


72 


10239 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


149 


80 


10240 


S52010 


Mus sp. 


1st Met is at position 21 


98 


42 


10241 


AL1 10147 


Homo sapiens 


hypothetical protein 


125 


35 


10242 


X98485 


Plasmodium 
vivax 


putative 


85 


51 


10243 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


70 


65 


10244 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 


125 


49 


10245 


U49973 


Homo sapiens 


ORF2: function unknown 


150 


81 


10246 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


113 


56 


10247 


AF 13 0089 


Homo sapiens 


PRO2550 


139 


73 


10248 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


160 


59 


10249 


U93565 


Homo sapiens 


putative pi 50 


109 


43 


10250 


AF 130089 


Homo sapiens 


PRO2550 


104 


54 


10251 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


78 


36 


10252 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


110 


58 


10253 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


123 


54 


10254 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


95 


57 


10255 


X03 145 


Homo sapiens 


pot. ORF V 


76 


63 


10256 


Y01405 


Homo sapiens 


Secreted protein encoded by gene 23 
clone HDPBA48. 


116 


65 


10257 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


113 


68 


10258 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


138 


68 


10259 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


135 


60 


10260 


G02528 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6609. 


95 


59 


10261 


M86246 


Homo sapiens 


EHS-2 


96 


62 


10262 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


180 


66 


10263 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


62 



407 



0 1 6483SA2 J_J 



WO 01/64835 



PCT/USOI/04927 



NO: 


Accession No. 


Specs 


Description 


Smith- 

Warcrmnn 

Score 


% 

Identit 

y 








7871. 






10264 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


313 


84 


10265 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


144 


56 


10266 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


150 


61 


10267 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO: 125. 


408 


89 


10268 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


74 


26 


10269 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


133 


52 


10270 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


105 


61 


10271 


X61 048 


Hydra sp. 


mini-collagen 


121 


40 


10272 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


150 


66 


10273 


L24521 


Homo sapiens 


transformation-related protein 


134 


51 


10274 


G03806 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7887. 


96 


47 


10275 


U59446 


Brassica napus 


myrosinase-binding protein related 
protein 


115 


39 


10276 


GO 1480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5561. 


482 


98 


3 0277 


GO380O 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


143 


81 


10278 


U93564 


Homo sapiens 


putative pi 50 


130 


46 


10279 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


123 


44 


10280 


AF090944 


Homo sapiens 


PRO0663 


156 


73 


10281 


AF 130079 


Homo sapiens 


PR02852 


146 


75 


10282 


X89401 


Homo sapiens 


ribosomal protein L21 


108 


48 


10283 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


65 


66 


102S4 


Y25821 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 41. 


1602 


98 


10285 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


308 


43 


10286 


AF090931 


Homo sapiens 


PRO0483 


131 


74 


10287 


Y48480 


Homo sapiens 


Human breast tumour-associated 
protein 25. 


203 


90 


10288 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


116 


44 


10289 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


160 


64 


10290 


X67863 


Mus musculus 


T2 


147 


37 


10291 


AF229067 


Homo sapiens 


PADI-H protein 


123 


43 


10292 


AF078844 


Homo sapiens 


hqp0376 protein 


125 


59 


10293 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


123 


37 


10294 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


103 


64 


10295 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


162 


70 


10296 


V00662 


Homo sapiens 


ATPase 6 


268 


88 


10297 


AB007861 


Homo sapiens 


KIAA0401 


424 


92 


10298 


M24732 


Homo sapiens 


lamin-like protein 


183 


40 


10299 


D90053 


Sus scrofa 


destrin 


123 


48 


10300 


X61047 


Hydra sp. 


mini-collagen 


86 


43 



408 



WO 01/64835 



PCT/US01/04927 



NO: 


Accession No. 


Species 


Description 


Waterman 


JJentir 

y 


10301 


LT49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


180 


50 


10302 


AF1 18078 


Homo sapiens 


PRO 1848 


157 


63 


10303 


X55681 


Lycopersicon 
esculentum 


extensin (class 1) 


87 


36 


10304 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


105 


62 


10305 


X70944 


Homo sapiens 


PTB-associated splicing factor 


84 


32 


10306 


AF068294 


Homo sapiens 


HDCMB45P 


220 


42 


10307 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


96 


41 


10308 


AJ005559 


Mus musculus 


SPR2A protein 


64 


42 


10309 


AL353995 


Arabidopsis 
thatiana 


AtAGP4 


98 


38 


10310 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


213 


92 


10311 


M15386 


Homo sapiens 


gamma-globin 


614 


88 


10312 


V00488 


Homo sapiens 


alpha globin 


574 


90 


10313 


AL049730 


Arabidopsis 
thaliana 


pEARLI 1 -like protein 


117 


38 


10314 


AL356299 


Homo sapiens 


bK3216D2.1.1 (S- 
adenosylhomocysteine hydrolase 
(SAHH), isoform 1) 


256 


70 


10315 


AF1 94537 


Homo sapiens 


NAG 13 


113 


63 


10316 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


99 


62 


10317 


AF090931 


Homo sapiens 


PRO0483 


86 


68 


1031S 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


102 


77 


10319 


L29219 


Homo sapiens 


clkl; putative 


341 


100 


10320 


V00488 


Homo sapiens 


alpha globin 


314 


89 


10321 


AJ223953 


Homo sapiens 


hPTTG 


354 


89 


10322 


X79389 


Homo sapiens 


glutathione transferase Tl 


78 


75 


10323 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


101 


66 


10324 


AF1 16715 


Homo sapiens 


PR02829 


128 


73 


10325 


AF1 16619 


Homo sapiens 


PRO 1051 


125 


77 


10326 


Y14318 


Homo sapiens 


peroxisomal ABC-transporter 


249 


100 


10327 


U73844 


Homo sapiens 


ESE-la 


185 


90 


10328 


AB051901 


Homo sapiens 


VDUP1 


132 


83 


10329 


ALI63300 


Homo sapiens 


human ubiquitin conjugating enzyme 
G2 EC 6.3.2.19. 


720 


100 


10330 


AF1 17237 


Homo sapiens 


prefoldin subunit 2 


729 


100 


10331 


D42054 


Homo sapiens 


KIAA0092 gene product is distantly 
related to smooth muscle myosin. 


352 


84 


10332 


AF038965 


Homo sapiens 


26S proteasome ATPase subunit 


431 


100 


10333 


Y94960 


Homo sapiens 


Human secreted protein clone mil 1 7 1 

protein sequence SEQ ID NO: 1 26. 


399 


96 


10334 


X60376 


Brassica napus 


proline-rich protein 


114 


31 


10335 


M34427 


Homo sapiens 


T-plastin 


343 


91 


10336 


V00488 


Homo sapiens 


alpha globin 


445 


88 


10337 


M15386 


Homo sapiens 


gamma-globin 


402 


85 


10338 


X61123 


Homo sapiens 


BTG1 


367 


90 


10339 


U22961 


Homo sapiens 


similar to human albumin, Swiss-Prot 
Accession Number P02768; Method: 
conceptual translation supplied by 
author 


278 


87 


10340 


A03992 


synthetic 
construct 


growth hormone 


145 


61 


10341 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


57 



409 



WO 01/64835 PCT/US01/04927 



NO: 


Accession No. 


Species 


Description 


Smith- 
Score 


% 
y 








4518. 






10342 


U02680 


Homo sapiens 


protein tyrosine kinase 


140 


92 


10343 


AF 180681 


Homo sapiens 


guanine nucleotide exchange factor 


1425 


88 


10344 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


123 


69 


10345 


V00488 


Homo sapiens 


alpha globin 


318 


93 


10346 


M17885 


Homo sapiens 


acidic ribosomal phosphoprotein (P0) 


329 


98 


] 0347 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


100 


66 


10348 


U29953 


Homo sapiens 


pigment epithelium-derived factor 


359 


74 


10349 


G01790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5871. 


85 


66 


10350 


V00488 


Homo sapiens 


alpha globin 


361 


90 


10351 


U93567 


Homo sapiens 


p40 


203 


84 


10352 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQIDNO:220. 


253 


85 


10353 


AF1 94537 


Homo sapiens 


NAG 13 


196 


41 


10354 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


297 


81 


10355 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


144 


65 


10356 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


94 


54 


10357 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


153 


68 


10358 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


136 


53 


10359 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


95 


77 


10360 


X61296 


Rattus 
norvegicus 


open reading frame 2 


87 


40 


10361 


L24521 


Homo sapiens 


transformation-related protein 


103 


32 


10362 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


101 


55 


10363 


Y95028 


Homo sapiens 


Human clone vp7 1 ORF2, SEQ ID 
NO:128. 


89 


80 


10364 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


101 


5S 


10365 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQIDNO.220. 


322 


85 


10366 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


101 


59 


10367 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


125 


53 


10368 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


120 


59 


10369 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


60 


83 


10370 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


169 


81 


10371 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


80 


75 


10372 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


60 


83 


10373 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


89 


77 


10375 


AF130089 


Homo sapiens 


PRO2550 


139 


78 


10376 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


62 



410 



0164835A2J_> 



WO 01/64835 
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NO: 






Description 


Waterman 


Identit 

y 








7874. 






]0377 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


96 


51 


10378 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


132 


58 


10379 


W8S627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


117 


71 


10380 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


125 


54 


10381 


U93569 


Homo sapiens 


putative p 1 50 


160 


60 


10382 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


340 


77 


10383 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. . 


83 


71 


10384 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


137 


63 


10385 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


67 


50 


10386 


X62691 


Homo sapiens 


ribosomal protein homologous to yeast 
S24 


117 


58 


10387 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


141 


45 


10388 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


153 


70 


10389 


U93570 


Homo sapiens 


putative pi 50 


112 


58 


10390 


AF090942 


Homo sapiens 


PRO0657 


126 


68 


10391 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


123 


64 


10392 


AF130089 


Homo sapiens 


PRO2550 


144 


71 


10393 


U40739 


Homo sapiens 


cyclin C 


464 


86 


10394 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


151 


75 


10395 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101 . 


137 


55 


10396 


W88947 


Homo sapiens 


Polypeptide fragment encoded by gene 
118. 


165 


66 


10397 


AF090930 


Homo sapiens 


PRO0478 


133 


80 


10398 


AK024455 


Homo sapiens 


FLJ00047 protein 


153 


62 


10399 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


142 


69 


10400 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


73 


51 


10401 


AB030033 


Dictyostelium 
discoideum 


AmiB 


82 


33 


10402 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


113 


63 


10403 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


148 


77 


10404 


AK024455 


Homo sapiens 


FLJ00047 protein 


138 


71 


10405 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


128 


59 


10406 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


143 


76 


10407 


AF090942 


Homo sapiens 


PRO0657 


105 


64 


10408 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


93 


54 


10409 


J02621 


Homo sapiens 


high mobility group protein 14 


140 


64 


10410 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


87 


46 


10411 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


.79 


56 
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Species 
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Smith- 


% 


NO: 








Waterman 


Identit 










- Scorc 


-1 








7879. 






10412 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


79 


66 








terminal, clone XEH.Sc} 






10413 


AF1 30079 


Homo sapiens 


PR02852 


184 


90 


10414 


Y86305 


Homo sapiens 


Human secreted protein HLYES3 8, 


237 


77 








SEQ ID NO:220. 






10415 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


122 


64 








gene 44 clone HTDAD22. 






10416 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


89 


78 


10417 


S80864 


Homo sapiens 




1 84 




10418 


Y30681 


Homo sapiens 


Splice variant ZAP-1 B protein or the 




— 








human tumor suppressor gene ZAP-1 . 






10419 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


160 


90 








7068. 






10420 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


59 








6588. 






10421 


AF090942 


Homo sa iens 


PRO0657 


~J54 


— 


10423 


L78669 


Homo salens 


PTH^e*" 10 ^ fUSi ° n prot ^'" 


"106 


— 


10424 


AF095770 


Homo sapiens 

omo sapiens 


-responsive os eosarcoma 


—pr— 


— 






— 




— — : — 




. . — 


AT T^QTHO 


Trypanosoma 


— — — — ; : — — 

probable similar to nng-h2 finger 


130 




64 














10426 


Y02693 


Homosa iens 

omo sapiens 


Human secreted protein encoded by 


— 


— 








gene 44 clone HTDAD"^2 






10427 


AF068294 


Homo sa iens 


HDCMB45P 


"256 


-— 


10428 


AF194537 


Homo sapiens 




~Tjp 


— 


10429 


AF09093 1 


Homo sapiens 


PRO0483 


108 


~J9 


10430 


G03714 


Homo sapiens 


Human secreted protein SEQ ID "NO 1 


93 


~~68 








7795. 






1043 1 


AF095770 


Homosa iens 

omo sapiens 




-— 


— 








protein 




■ 


10432 


X67920 


Homosa iens 

m s pien 


tr^tophany I-tRN A synthetase 


~ T8 


— 


10433 


Y86305 


omo sapiens 


riuman secreted protein rlLYEojo, 


~904 


— 

78 








oca rr\ Xlrf'VOTfi 








AF095770 


Hq — - ■■ 

omo sapiens 


PTH -responsive osteosarcoma Dl 


1 

27 


87 








protein 






1Q43S — 


G02490 


Homo sa ien 

omo sapiens 




— — 




75 








6571. 






Q436 — 


s _ 9 0 


JVlus musculus 


nuclear localization signal binding 


— — 

117 


— 

72 














10437 


G00577 


Homo sapiens 


Human secreted protein, SEQ IDNO: 


120 


— 






__ _ 


4658. 






10438 — 




Oryza sativa 


similar to RING-H2 finger protein 


— — 

140 


-— 

69 








RHAla (AF078683) 






10439 


AFI30089 


Homosa iens 


PRO2550 





— 


10440 


Y86305 


-rj — - — —■ 

om sapiens 


Human secreted protein HLYES38 


Tfi 


— 














10441 


W94294 


Homosa iens 

P s 


Human phosphatidylinositol transfer 


— — 


— 














10442 


AP000616 


ryza sa lva 


smiiter k^RING H2 finger protein 


— — 


-— 

87 








PT-t A 1 n /'A f?ft7SfiS3\ 






10443 


AF095770 


Homo sapiens 


PXH-responsive osteosarcoma Dl 


127 


87 








protein 






10444 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


67 








7519. 






10445 


U93567 


Homo sapiens 


P 40 


262 


69 


10446 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


112 


92 








protein 






10447 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


120. . 


83 



4V2 



0164835A2 I > 



WO 01/64835 PCT/US01/04927 



NO: 






Description 


Waterman 


Idcntit 

y 








protein 






10448 


G02597 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6678. 


104 


68 


10449 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


115 


80 


10450 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


117 


80 


10451 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


104 


80 


10452 


AF130089 


Homo sapiens 


PRO2550 


315 


68 


10453 


G02530 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6611. 


110 


71 


10454 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


86 


34 


10455 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


1 13 


85 


10456 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7637. 


180 


83 


10457 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


119 


96 


10458 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


133 


58 


10459 


Y41710 


Homo sapiens 


Human PR0618 protein sequence. 


894 


71 


10460 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


84 


71 


10461 


K02401 


Homo sapiens 


chorionic somatomammotropin 


471 


88 


10462 


G02113 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6194. 


241 


73 


10463 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


93 


68 


10464 


U 15647 


Mus musculus 


reverse transcriptase 


110 


47 


10465 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


97 


57 


10466 


G02754 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6835. 


145 


93 


10467 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


137 


57 


10468 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


62 


55 


10469 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


93 


86 


10470 


S58722 


Homo sapiens 


X-Jinked retinopathy protein {C- 
terminal, clone XEH.8c} 


119 


63 


10471 


AF068294 


Homo sapiens 


HDCMB45P 


117 


43 


10472 


Y86573 


Homo sapiens 


Human gene 9 1-encoded protein 
fragment, SEQ IDNO:490. 


335 


72 


10473 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


121 


62 


10474 


X79238 


Homo sapiens 


ribosomal protein L30 


126 


58 


10475 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


127 


87 


10476 


G02640 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6721. 


106 


61 


10477 


AC006693 


Caenorhabditis 
elegans 


Hypothetical protein W02H5.e 


119 


41 


10478 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


87 


66 


10479 


AF1 30079 


Homo sapiens 


PR02852 


97 


54 


10480 


AK024455 


Homo sapiens 


FJLJ00047 protein 


135 


69 



413 



WO 01/64835 



PCT/USOl/64927 1 



SEQID 

NO: 


Accession No. 


Species 


Description 


Waterman 


Identic 


10481 


K02401 


Homo sapiens 


chorionic somatomammotropin 


.463 


80 


10482 


G00398 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4479. 


69 


83 


10483 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


132 


61 


10484 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


119 


80 


10485 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


90 


54 


10486 


U79260 


Homo sapiens 


unknown 


105 


90 


10487 


Y87156 




IDNO:195. 


243 




10488 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


90 


78 


10489 


AF095770 


Homo sapiens 


protein 


120 


64 


10490 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


123 


60 


10491 


Y48576 


■ Homo sapiens 


Human breast tumour-associated 
protein 37. 


471 


84 


10492 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


176 


77 


10493 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


135 


64 


10494 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


123 


44 


10495 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


86 


46 


10496 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


145 


40 


10497 


AL359782 


bmcei" 050 " 13 


protein rha la. 


111 


50 


10498 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH,8c} 


96 


73 


10499 


AL 132841 


Caenorhabditis 


Y15E3A.3 


142 


96 


10500 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


120 


49 


10501 


AF220264 


Homo sapiens 


MOST-1 


93 


83 


10502 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


U6 


57 


10503 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


100 


70 


10504 


AF090931 


Homo sapiens 


PRO0483 


88 


80 


10505 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


89 


.72 


10506 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124.- 


121 


81 


10507 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. • 


188 


~82 


10508 


AF130050 




PRO0872 


101 




10509 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


176 


~6~I 


10510 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


120 


96 


10511 


AK026249 


Homo sapiens 


unnamed protein product 


97 


68 . 


10512 


X95276 


Plasmodium 
falciparum 


ORF91 


75 


29 


10513 


X17093 


Homo sapiens 


leukocyte antigen F 


397 


91 


10514 


W48351 


Homo sapiens 


Human breast cancer related protein 


11.5. 


66 



414 



_0164835A2_I_> 



WO 01/64835- 



PCT/US01/04927 



NO: 


Accession No. 


Species 


Description 


Waterman 
Score 


Identit 

y 








BCRB2. 






10515 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


154 


68 


10516 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


86 


47 


10517 


AF090944 


Homo sapiens 


PRO0663 


1 18 


48 


10518 


S5S722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


131 


77 


10519 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NQ:103. 


230 


58 


10520 


AF095770 


Homo sapiens 


PTH -responsive osteosarcoma Dl 
protein 


127 


87 


10521 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


130 


40 


10522 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


127 


87 


10523 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


139 


69 


10524 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


125 


100 


10525 


J04495 


Macaca mulatta 


alpha-globin 


144 


90 


10526 


G02535 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6616. 


105 


61 


10527 


AF 130089 


Homo sapiens 


PRO2550 


132 


67 


1052S 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
IDNO:1051. 


90 


53 


10529 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


112 


75 


10530 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


157 


79 


10531 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


151 


59 


10532 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


121 


63 


10533 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


112 


43 


10534 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


131 


65 


10535 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


95 


60 


10536 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


157 


79 


10537 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


199 


90 


10538 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


113 


75 


10539 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


120 


65 


10540 


AF130089 


Homo sapiens 


PRO2550 


195 


67 


10541 


Y60522 


Homo sapiens 


Human normal bladder tissue EST 
encoded protein ] 94. 


73 


72 


10542 


G01175 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5256. 


83 


100 


10543 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


81 


54 


10544 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


147 


75 


10545 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


109 


66 



415 



MSDOCID: <WO 0164835A2J_> 



WO 01/64835 



PCT/US01/04927 



SEQID 
MO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identic 


10546 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


123 


46 


10547 


AF130089 


Homo sapiens 


PRO2550 


133 


63 


10548 


Y86305 


Homo sapiens 


Hum 3D secreted protein HLVES38, 
SEQIDNO:220. 


246 


80 


10549 


G 003 82 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


122 




10550 


J02459 


bacteriophage 
lambda 


J (taihhost specificity;1.132) 


772 


97 


10551 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


158 


80 


10552 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


12 j 


50 


10553 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


184 


71 


10554 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


107 


60 


10555 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68 
SEQ ID NO: 163. 


-j^O 


69 


10556 


U49973 


Homo sapiens 


ORjF2 .* function unknown 


91 


70 


10557 


AF151850 


Homo sapiens 


CGI-92 protein 


459 


63 


10558 


X66285 




HC1 ORF 






10559 


S58722 


Homo sapiens 


terminal, clone XEH.8c} 


"757 


~6~7" 


10560 


AF1 30050 


Homo sapiens 


PRO0872 






10561 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


335 


80 


10562 


W48351 


Homo sapiens 


BCRB2. 


126 


~58 


10563 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NXP 


127 


76 


10564 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


158 


62 


10565 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


118 


77 


10566 


V00488 


Homo sapiens 


alpha globin 


464 


89 


10567 


G02490 


Homo sapiens 


Human secreted protein SEQ ID NO' 
6571. 






10568 


S5S722 


Homo sapiens 


terminal, clone XEH.8c} 






10569 


G03040 


Homo sapiens 


Human secreted protein SEQ ID NO* 
7121. 






10570 


G03807 


Homo sapiens 


Human secreted protein SEQ ID NO' 
7888. 


125 


65 


10571 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101 . 


79 


88 


10572 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


158 


80 


10573 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


138 


56 


10574 


AF 119900 


Homo sapiens 


PR02822 


154 


55 


10575 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


87 


72 


10576 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7496. 


131 


66 


10577 


G03112 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7193. 


80 


47 


10578 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


121 


53 


10579 


Y91577 


Homo sapiens 


Human secreted protein sequence 


335 





416 



0164835A2 I > 



WO 01/64835 



PCT/USO 1/0492 7 



NO: 








Watermnn 


Idcntit 

y 








encoded by gene 2 SEQ ID NO:250. 






10580 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


104 


47 


10581 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


324 


68 


10582 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4670. 


129 


66 


10583 


AF090895 


Homo sapiens 


PRO01 17 


137 


63 


10584 


L27428 


Homo sapiens 


reverse transcriptase 


98 


50 


10585 


G00418 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4499. 


101 


68 


10586 


GO0382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


108 


82 


10587 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


166 


76 


10588 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


100 


70 


10589 


X03145 


Homo sapiens 


pot. ORF I 


120 


43 


10590 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


139 


70 


10591 


AF144054 


Homo sapiens 


apoptosis related protein APR-4 


92 


50 


10592 


AK024455 


Homo sapiens 


FLJ00047 protein 


111 


75 


10593 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


143 


58 


10594 


G03636 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7717. 


116 


54 


10595 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


196 


57 


10596 


AF130089 


Homo sapiens 


PRO2550 


140 


66 


10597 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


115 


52 


10598 


G00481 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4562. 


79 


71 


10599 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


146 


79 


10600 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


119 


67 


10601 


AF090895 


Homo sapiens 


PRO0117 


106 


68 


10602 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


83 


72 


10603 


AF090930 


Homo sapiens 


PRO0478 


119 


70 


10604 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


120 


83 


10605 


AF 132949 


Homo sapiens 


CGI- 15 protein 


114 


96 


10606 


Y95028 


Homo sapiens 


Human clone vp7 1 ORF2, SEQ ID 
NO:128. 


176 


77 


10607 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


155 


74 


10608 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


127 


63 


10609 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


88 


60 


10610 


AF 130089 


Homo sapiens 


PRO2550 


86 


90 


10611 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


149 


72 


10612 


L27428 


Homo sapiens 


reverse transcriptase 


311 


52 


10613 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


83 


84 


10614 


D00570 


Mus musculus 


open reading frame (196 AA) 


116 


69 



417 



JSDOCID: <WO 0164835A2J_> 



WO -01/64835 



PCT/US01/04927 



NO: 


Accession No. 




Description 


Smith- 
Waterman 


% 

IcJcntir 


10615 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


290 


62 


10616 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


125 


51 


10617 


AF1 16695 


Homo sapiens 


PR02221 


131 


56 


10618 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


101 


48 


10619 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


104 


65 


10620 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


112 


58 


10621 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


336 


85 


10622 


AF 118082 


Homo sapiens 


PRO 1902 


80 


73 


10623 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


136 


50 


10624 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


109 


67 


10625 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


92 


75 


10626 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


121 


62 


10627 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


104 


52 


10628 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


137 


66 


10629 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


129 


67 


10630 


U93569 


Homo sapiens 


putative pi 50 


104 


52 


10631 


AF090895 


Homo sapiens 


PRO0117 


136 


70 


10632 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


89 


57 


10633 


AF1 13685 


Homo sapiens 


PRO0974 


130 


64 


10634 


AK024455 


Homo sapiens 


FLJ00047 protein 


103 


63 


10635 


M24732 


Homo sapiens 


lam in-like protein 


80 


65 


10636 


U49974 


Homo sapiens 


manner transposase 


116 


72 


10637 


AK024455 


Homo sapiens 


FLJ00047 protein 


158 


67 


10638 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


121 


64 


10639 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


137 


57 


10640 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


123 


72 


1064] 


L27428 


Homo sapiens 


reverse transcriptase 


150 


54 


10642 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


68 


81 


10643 


U93565 


Homo sapiens 


putative pi 50 


282 


57 


10644 


AF090942 


Homo sapiens 


PRO0657 


153 


74 


10645 


Y48333 


Homo sapiens 


Human prostate cancer-associated 
protein 30. 


196 


90 


10646 


Y86573 


Homo sapiens 


Human gene 9 1 -encoded protein 
fragment, SEQ ID NO:490. 


242 


73 


10647 


U79260 


Homo sapiens 


unknown 


95 


77 


10648 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


149 


64 


10649 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


158 


87 


10650 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO.250. 


339 


89 



418 



01 64835A2_L> 



WO 01/64835 



PCT/US01/04927 



NO: 








Watcrninn 


Identif 

y 


10651 


U15183 


Mycobacterium 
leprae 


proline-rich antigen 


102 


37 


10652 


AF2 17536 


Homo sapiens 


truncated mevalonate kinase 


138 


70 


10653 


AF1 19855 


Homo sapiens 


PRO 1847 


74 


100 


10654 


M15530 


Homo sapiens 


B-cell growth factor 


94 


52 


10655 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


144 


54 


10656 


M15317 


Plasmodium 
lophurae 


histidine-rich protein 


124 


51 


10657 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


143 


66 


10658 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


114 


77 


10659 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


172 


55 


10660 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


150 


78 


10661 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


160 


81 


10662 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


151 


71 


10663 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


160 


81 


10664 


AK000496 


Homo sapiens 


unnamed protein product 


140 


77 


10665 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


147 


75 


10666 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


160 


81 


10667 


G0032S 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


149 


78 


10668 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


303 


78 


10669 


D00526 


Rattus 
norvegicus 


L-gulono-gamma-lactone oxidase 


108 


73 


10670 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


143 


65 


10671 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


409 


84 


10672 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
4770. 


110 


65 


10673 


G00423 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4504. 


123 


43 


10674 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


85 


36 


10675 


U93565 


Homo sapiens 


putative pi 50 


115 


45 


10676 


AF 109907 


Homo sapiens 


S164 


205 


65 


10677 


G02620 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6701. 


118 


73 


10678 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


106 


62 


10679 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


127 


87 


10680 


Y87156 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 195. 


248 


54 


10681 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


88 


52 


10682 


S58722 


Homo sapiens 


X- linked retinopathy protein {C- 
terminal, clone XEH.8c} 


126 


77 


10683 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


123 


72 



419 



J)164835A2_I_> 



WO 01/64835 



PCT/US01/O4927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








terminal, clone XEH. 8c} 






10684 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH. 8c} 


123 


72 


10685 


W54966 


Homo sapiens 


Synthetic human type III collagen 
SYN-C3. 


99 


40 


10686 


AF068294 


Homo sapiens 


HDCMB45P 


182 


54 


10687 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


104 


53 


10688 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


277 


76 


10689 


G00233 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4314. 


1 18 


65 


10690 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


143 


71 


10691 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


97 


81 


10692 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
<*ene No. 101 


160 


68 


10693 


AF1 30089 


Homo sapiens 


PRO2550 


133 


56 


10694 


S58722 




terminal, clone XEH. 8c} 






10695 


G00437 




Human secreted protein, SEQ ID NO: 
4518. 






10696 


G03133 




Human secreted protein SEQ ID NO* 
7214. 






10697 


G00648 




Human secreted protein SEQ ID NO" 
4729. 






10698 


GO 1246 




Human secreted protein, SEQ ID NO" 
5327. 






10699 


G00517 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4598. 


92 


38 


10700 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


132 




10701 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 






10702 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


66 


73 


10703 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


124 


44 


10704 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


113 


78 


10705 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


119 


50 


10706 


AK025116 


Homo sapiens 


unnamed protein product 


160 


84 


10707 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


160 




10708 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


160 


81 


10709 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


230 


76 


10710 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


96 


45 


10711 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


174 


70 


10712 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


127 


50 


10713 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


148 


45 


10714 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


151 


80 



420 



,0164835A2J_> 



WO 01/64835 



PCT/US01/O4927 



SEQ ID 
MO: 


Accession No. 


Species 


Description 


Smith- 


% 
y 


10715 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


157 


67 


10716 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


109 


68 


10717 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


151 


52 


10718 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


84 


80 


10719 


Ml 9973 


Rattus 
norvegicus 


cytochrome PB24 


104 


45 


10720 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


103 


72 


10721 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


161 


52 


10722 


L27428 


Homo sapiens 


reverse transcriptase 


159 


42 


10723 


S58722 


Homo sapiens 


X-Iinked retinopathy protein {C- 
terminal, clone XEH.8c} 


127 


75 


10724 


Y00311 


Homo sapiens 


Human secreted protein encoded by 
gene 54. 


84 


50 


10725 


AF090895 


Homo sapiens 


PRO0117 


170 


71 


10726 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


105 


42 


10727 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


123 


38 


10728 


U93563 


Homo sapiens 


putative pi 50 


162 


64 


10729 


X14576 


Murine leukemia 
virus 


gag fusion protein 


172 


41 


10730 


AF090895 


Homo sapiens 


PRO0117 


151 


60 


10731 


AF1 19851 


Homo sapiens 


PRO 1722 


127 


54 


10732 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


152 


61 


10733 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


156 


76 


10734 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


135 


73 


10735 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


151 


65 


10736 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


141 


75 


10737 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


74 


10738 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


124 


69 


10739 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


91 


60 


10740 


AF1 30051 


Homo sapiens 


PRO0898 


90 


63 


10741 


AF090895 


Homo sapiens 


PRO0117 


129 


55 


10742 


AF1 19900 


Homo sapiens 


PR02822 


164 


85 


10743 


AJ005560 


Mus musculus 


SPR2B protein 


88 


35 


10744 


Z79996 


Homo sapiens 


cB33F2.1 (PUTATIVE novel protein 
similar to C-terminaJ parts of APOL 
(apolipoprotein L) and TNF-inducible 
protein CGI 2-1) 


185 


100 


10745 


G03 140 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7221. 


94 


88 ■ 


10746 


X82208 


Homo sapiens . 


bera-centractin 


134 


96 


10747 


D49744 


Mus musculus 


farnesyltransferase alpha subunit 


228 


60 


10748 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


127 


50 


10749 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


62 



421 



0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO. 








Waterman 


ldentit 




















4523. 




— 


10750 


AF1 30089 


Homo sapiens 


PRO2550 


147 


62 


1075] 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


71 








4484. 






10752 


D 17652 


Homo sapiens 


HBp 1 5/L22 


416 


72 


10753 


AB002389 


Homo sapiens 


KIAA0391 


486 


86 


10754 


G03093 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


238 


73 








7174. 






10755 


S79410 


Mus musculus 




Tbl 


~S7 








protein ' OCa '' zation signal bindin § 






10756 


AF 1 30089 


Homo sapiens 


PRO2550 


T04 


— 


10757 


X75421 


Hypocrea 




197 


66 






jecorina 








10758 


AF298594 




arabino°alactan protein 


~T07 


— 


10759 


U94S32 


Homo sapiens 


KSRP 


— 


-— 


10760 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


179 


82 


— 







_ 4473 \ . . ...... 








573 


Homo sapiens 


Human gene 91 -encoded protein 


271 


80 














10762 


Z g 1S2 8 


~C hab df 

aen r a i is 


predicted using Genefinder — contains 


— — 


— 

27 






e egans 


similarity to Pfam domain - PF00939 
























transmembrane region), Score = 2.8, E- 












value=1.2, N=l; PF0I757 (Domain of 












unknown function), Score = 556.0, E- 












value=8.1e-164, N=l 






10763 


V00662 


Homo sapiens 


ATPase 6 


593 


83 


10764 


A06977 


Homo sapiens 


albumin 


610 


81 


10765 


X67863 




T2 






10766 


AF137030 


Homo sapiens 


transmembrane protein 2 


620 


96 


10767 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


66 








7878. 






10768 


XI 5324 


Homo sapiens 


a n g i otens inoge n 


529 


~ 


10769 


M64983 


Homo sapiens 


fibrinogen beta chain 


604 


87 


10770 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


381 


83 


10771 


G02532 


Homo sapiens 


Human secreted protein SEQ ID WO* 


133 


73 








6613. 






10772 


M641 10 


Homo sapiens 




~346 


~94" 


10773 


AB014566 


Homo sapiens 


KIAA0666 protein 


145 


78 


10774 


AL138810 


Homo sapiens 


dJ179L10.2 (Similar to CGI-29 


216 


90 








protein) 






10775 


AJ223953 


Homo sapiens 


hPTTG 


353 


82 


10776 


U92288 


Human 


HN1 


] ]3 


39 






herpesvirus 6 








10777 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 


143 


— 








RHA 1 a (AF078683) 






10778 


XI 2796 


Bos tauru ' 

Homo'sa^iens 


riivivj i protein ^/\/\ i - z i jj 


—— 




-— 

-|| 


10779 


U86782 


omo sapiens 














homolog 






10780 


M60047 


Homo sapiens 




514 


— 


10781 


Y 14480 


Homo sapiens 


^rao^ent^'hvm^ secreted protein 




~48 








encoded by gene 14. 






10782 


AF1 30079 


Homo sapiens 


PR02852 


175 


48 


10783 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


64 








6574. 






10784 


Ml 1147 


Homo sapiens 


ferritin light chain 


439 


83 


10785 


AF09093I 


Homo sapiens 


PRO0483 


145 


55 


10786 


AF072441 


Homo sapiens j 


calcineurin binding protein cabin 1 


498 


90 


10787 


AF1 16715 


Homo sapiens . | 


PR02829 . ... . . . 


127 


88 



422 



JSDOCID: <WO 0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



NO: 


Accession No. 




Description 


Waterman 
Score 


% 

IUcnrit 

J" 


10788 


AF201949 


Homo sapiens 


60S ribosomal protein L30 isolog 


631 


91 


10789 


AC005253 


Homo sapiens 


R26445 1 


403 


84 


10790 


M15386 


Homo sapiens 


gamma-globin 


617 


82 


10791 


Y95005 


Homo sapiens 


Human secreted protein vc57 1, SEQ 
ID NO:50. 


716 


95 


10792 


GO 1285 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5366. 


165 


52 


10793 


X67863 


Mus musculus 


T2 


128 


50 


10794 


Y82326 


Homo sapiens 


Human arginase I SEQ ID NO: 17. 


507 


80 


10795 


Y00755 


Homo sapiens 


extracellular matrix protein BM-40 
(AA 1 - 303) 


559 


87 


10796 


AF119851 


Homo sapiens 


PRO 1722 


117 


71 


10797 


AF090930 


Homo sapiens 


PRO0478 


146 


63 


10798 


V00488 


Homo sapiens 


alpha globin 


509 


84 


10799 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


131 


42 


10800 


X83618 


Homo sapiens 


hydroxymethylglutaryl-CoA synthase 


422 


78 


10801 


AF1 32952 


Homo sapiens 


CGI- 18 protein 


341 


88 


10802 


M17375 


Gallus gallus 


type XII collagen 


88 


42 


10803 


K02401 


Homo sapiens 


chorionic somatomammotropin 


436 


85 


10804 


K02401 


Homo sapiens 


chorionic somatomammotropin 


410 


86 


10805 


U61232 


Homo sapiens 


cofactor E 


359 


77 


10806 


D86438 


Homo sapiens 


Ibal (ionized calcium binding adapter 
molecule 1) 


406 


77 


10807 


AF 118082 


Homo sapiens 


PRO1902 


105 


38 


10808 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


148 


60 


10809 


AF090942 


Homo sapiens 


PRO0657 


163 


52 


10810 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


116 


86 


10811 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


104 


45 


10812 


X61296 


Rattus 
nbrvegicus 


open reading frame 2 


156 


41 


10813 


U82303 


Homo sapiens 


unknown 


103 


64 


10814 


G01478 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5559. 


75 


77 


10815 


U52077 


Homo sapiens 


mariner transposase 


131 


85 


10816 


D381 12 


Homo sapiens 


ATPase subunit 6 


251 


74 


10817 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


103 


80 


10818 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


102 


52 


10819 


G0036I 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


94 


66 


10820 


Y21811 


Homo sapiens 


CPF polypeptide 36PRO. 


200 


62 


10821 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


161 


60 


10822 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


163 


70 


10823 


U22961 


Homo sapiens 


similar to human albumin, Swiss-Prot 
Accession Number P02768; Method: 
conceptual translation supplied by 
author 


337 


63 


10824 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


122 


61 


10825 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


122 


68 


10826 


J03634 


Homo sapiens 


erythroid differentiation protein 
precursor 


222 


68 



423 



WO 01/64835 PCT/USO 1/0492 7 



NO: 


Accession No. 


Species 


Description 


Smith- 


id 

y 


10827 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


452 


78 


10828 


V00488 


Homo sapiens 


alpha globin 


132 


86 


10829 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


131 


70 


10830 


AF090942 


Homo sapiens 


PRO0657 


122 


43 


10831 


AF090931 


Homo sapiens 


PRO0483 


14] 


72 


10832 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


138 


63 


10833 


G03806 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7887. 


88 


57 


10834 


AF 11 8082 


Homo sapiens 


PRO! 902 


109 


62 


10835 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


135 


44 


10836 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


152 


86 


10837 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


143 


47 


10838 


U52077 


Homo sapiens 


mariner transposase 


259 


67 


10839 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


341 


83 


10840 


Y44361 


Homo sapiens 


Human cell cycle regulation protein-2. 


144 


60 


10841 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


96 


56 


10842 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


565 


80 


10843 


AP0006 16 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


1 18 


100 


10844 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


157 


81 


10845 


AF014889 


Homo sapiens 


NADH dehydrogenase subunit 2 


416 


85 


10846 


AF1 94537 


Homo sapiens 


NAG 13 


177 


70 


• 10847 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


108 


40 


10848 


Ml 5530 


Homo sapiens 


B-cell growth factor 


92 


67 


10849 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


98 


52 


10850 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


121 


76 


10851 


AF1 16712 


Homo sapiens 


PR02738 


84 


57 


10852 


V00662 


Homo sapiens 


ATPase 6 


278 


84 


10853 


U01849 


Trypanosoma 
brucei 


ORF1 


93 


37 


10854 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 1 00 clone HNF1U96. 


93 


61 


10855 


AFI30089 


Homo sapiens 


PRO2550 


126 


72 


10856 


AF130079 


Homo sapiens 


PR02852 


154 


75 


10857 


AK025116 


Homo sapiens 


unnamed protein product 


113 


70 


10858 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


114 


71 


10859 


G03703 


Homo sapiens 


Human secreted protein, SEQ ID NO- 
7784. 


97 


56 


10860 


M36647 


Homo sapiens 


mitochondrial hinge protein precursor 


207 


78 


10861 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


83 


50 


10862 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588. 


126 


46 


10863 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


148 


63. 


10864 


B0 1372 


Homo sapiens 


Neuron-associated protein. 


106 


38 


10865 


U38964 


Homo sapiens 


hPMSR2 


152 


68 


10866 


AF1 19855 


Homo sapiens 


PRO 1847 


142 


67 
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NO: 


Accession No. 


Species 


Description 


Waterman 
Score 


Id en tit 

y 


10867 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


99 


75 


10868 


U79260 


Homo sapiens 


unknown 


94 


47 


10869 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhem) gene preoduct. 


151 


52 


10870 


AJ0 10025 


Homo sapiens 


unr-interacting protein 


100 


64 


10871 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


no 


44 


10872 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


136 


71 


10873 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


126 


61 


10874 


AF 130089 


Homo sapiens 


PRO2550 


115 


88 


10875 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


114 


59 


10876 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4581. 


83 


48 


10877 


AF1 18086 


Homo sapiens 


PRO 1992 


145 


82 


10878 


AF1 16712 


Homo sapiens 


PR02738 


1 10 


55 


10879 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID.NO: 
7253. 


123 


77 


10880 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


111 


61 


10881 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 


106 


51 


10882 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


107 


67 


10883 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


101 


65 


10884 


U79260 


Homo sapiens 


unknown 


111 


56 


10885 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


153 


60 


10886 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


107 


57 


10887 


G03O43 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


148 


80 


10888 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


142 


58 


10889 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


156 


67 


10890 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


105 


52 


10891 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


187 


65 


10892 


A67510 


Mus musculus 


MUS MUSCULUS GENOMIC DNA 
CONTAINING N ALLELE OF FV1 
GENE. 


120 


37 


10893 


AJ223410 


Homo sapiens 


EBIl-ligand chemokine 


174 


77 


10894 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


475 


83 


10895 


V00662 


Homo sapiens 


ATPase 6 


316 


95 


10896 


V00662 


Homo sapiens 


ATPase 6 


427 


82 


10897 


U09500 


Homo sapiens 


cytochrome b 


403 


92 


10898 


U93564 


Homo sapiens 


p40 


215 


91 


10899 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


69 


77 


10900 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


159 


60 


10901 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


93 


43 
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SEOID 


Accession No. 


Species 


Description 


Smith- 
Wnteiman 


kicnlit 


10902 


MI 0546 


Homo sapiens 


cytochrome oxidase I 


302 


66 


10903 


G00447 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4528. 


102 


58 


10904 


AK024455 


Homo sapiens 


FLJ00047 protein 


118 


74 


10905 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


108 


64 


10906 


AF090942 


Homo sapiens 


PRO0657 


149 


74 


10907 


J01415 


Homo sapiens 


cytochrome oxidase subunit 3 


167 


91 


10908 


D38112 


Homo sapiens 


ATPase subunit 6 


355 


68 


10909 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


145 


72 


10910 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


] 14 


72 


10911 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


144 


53 


10912 


V00662 


Homo sapiens 


ATPase 6 


289 


69 


10913 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


93 


51 


10914 


V00662 


Homo sapiens 


cytochrome oxidase III 


509 


86 


10915 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-2 


459 


80 


10916 


Z28029 


Saccharomyces 
cerevisiae 


ORF YKL030w 


96 


32 


10917 


G03102 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7183. 


94 


56 


10918 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


1 14 


36 


10919 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


1 17 


88 


10920 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


103 


39 


10921 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


99 


95 


10922 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


75 


63 


10923 


AP000616 


Oryza sativa 


similar to RING-H2 Finger protein 
RHAla(AF078683) 


125 


85 


10924 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


140 


73 


10925 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


77 


38 


10926 


W03642 


Homo sapiens 


Human cannabinoid GPR N-termina! 
sequence. 


1 1 1 


58 


10927 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


139 


75 


10928 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


128 


78 


10929 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


77 


57 


10930 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


93 


53 


10931 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


83 


54 


10932 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


262 


84 


10933 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


113 


72 


10934 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


120 


62 


10935 


AP000616 


:Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


116 


69 
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SEQ ID 

NO: 


Accession No. 




Description 


Smith- 
Waterman 


Identit 

y 


10936 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


120 


56 


10937 


AB028893 


Homo sapiens 


ribosomal protein SI 1 


279 


53 


10939 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


127 


55 


10940 


AF1 19900 


Homo sapiens 


PR02822 


154 


53 


10941 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7637. 


127 


62 


10942 


K02576 


Homo sapiens 


salivary proline-rich protein I 


109 


41 


10943 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


164 


61 


10944 


AF116715 


Homo sapiens 


PR02829 


119 


69 


10945 


M10546 


Homo sapiens 


cytochrome oxidase I 


303 


95 


10946 


L27428 


Homo sapiens 


reverse transcriptase 


112 


51 


10947 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


118 


90 


10948 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


121 


83 


10949 


AF1 16715 


Homo sapiens 


PR02829 


134 


67 


10950 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c) 


139 


83 


10952 


AF130079 


Homo sapiens 


PR02852 


119 


79 


10953 


Z38128 


Mus musculus 


histone HI 


126 


35 


10954 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


135 


53 


10955 


L22029 


Glycine max 


hydroxyproline-rich glycoprotein 


126 


37 


10956 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4581. 


81 


69 


10957 


AF 118082 


Homo sapiens 


PRO 1902 


90 


51 


10958 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


116 


53 


10959 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


82 


51 


10960 


U79260 


Homo sapiens 


unknown 


97 


52 


10961 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


194 


66 


10962 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


124 


48 


10963 


G03084 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7165. 


66 


73 


10964 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


129 


87 


10965 


Z28029 


Saccharomyces 
cerevisiae 


ORF YKL030w 


99 


70 


10966 


V00672 


Pan troglodytes 


reading frame protein 5 


107 


70 


10967 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


138 


58 


10968 


K02401 


Homo sapiens 


chorionic somatomammotropin 


478 


77 


10969 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


642 


86 


10970 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


156 


57 


10971 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


126 


56 


10972 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


103 


51 


10973 


K02401 


Homo sapiens 


chorionic somatomammotropin 


491 


82 


10974 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


146 


93 


10975 


D381I2 


Homo sapiens 


NADH dehydrogenase subunit 4 


236 


85 


10976 


Y02785 


Homo sapiens | 


Human secreted protein encoded by 


108 


50 
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SEQID 


Accession No. 




Description 


Smith- 
Waterman 


Identit 

y 








gene 51 clone HUKEX85. 






30977 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-2 


435 


73 


10978 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


119 


74 


10979 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


137 


73 


10980 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


133 


63 


10981 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1 7. 


90 


65 


10982 


X92485 


Plasmodium 
vivax 


pva] 


139 


47 


10983 


Y15913 


Homo sapiens 


COL1A1 and PDGFB fusion transcript 


64 


47 


10984 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


463 


77 


10985 


U49973 


Homo sapiens 


ORP1 ; MER37; putative transposase 
similar to pogo element 


309 


65 


10986 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


145 


87 


10987 


AF130089 


Homo sapiens 


PRO2550 


88 


31 


10988 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


127 




10989 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


137 




10990 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 






10991 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 






10992 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


138 


69 


10993 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO. 1 63. 


120 




10994 


AJ277092 


Orycto3agus 
cuniculus 


rranslationally controlled tumor protein 
3 


172 




10995 


AF05283 1 


Trypanosoma 
cruzi 


unknown 


92 




10996 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


1 19 


"33 


10997 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4738. 


123 




30998 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


165 


52 


10999 


S52010 


Mus sp. 


orfl 5" of EpoR 


93 




11000 


AF130079 


Homo sapiens 


PR02852 


131 


81 


11001 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


111 




11002 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


132 


83 


11003 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7873. 


151 


60 


11004 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6633. 


159 


58 


11005 


K02401 


Homo sapiens 


chorionic somatomammotropin 


579 


89 


11006 


K02401 


Homo sapiens 


chorionic somatomammotropin 


464 


87 


11007 


Ml 5894 


Homo sapiens 


chorionic somatomammotropin 
precursor 


370 


74 


11008 


K02401 


Homo sapiens 


chorionic somatomammotropin 


467 


88 


11009 


K02401 


Homo sapiens 


chorionic somatomammotropin 


537 


91 


11010 


K02401 


Homo sapiens 


chorionic somatomammotropin 


475 


87 


11011 


K02401 


Homo sapiens 


chorionic somatomammotropin 


460 


88 
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SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 

Score 


y 


11012 


G03639 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
7720. 


101 


61 


11013 


AF266166 


Gillichthys 
mirabilis 


ribosomal protein S13 


223 


54 


11014 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


176 


61 


11015 


AF116715 


Homo sapiens 


PR02829 


121 


69 


11016 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


92 


75 


11017 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


113 


78 


1101S 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


114 


40 


11019 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


101 


48 


11020 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


151 


51 


11021 


X14963 


Homo sapiens 


collagen-like protein (447 AA) 


107 


47 


11022 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


151 


72 


11023 


AF1 16715 


Homo sapiens 


PR02829 


140 


75 


11024 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


125 


70 


11025 


AP000001 


Pyrococcus 
horikoshii 


58aa long hypothetical protein 


99 


52 


11026 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


601 


81 


11027 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


434 


74 


11028 


Z97333 


Homo sapiens 


RHCE protein 


149 


51 


11029 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


113 


48 


11030 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


120 


77 


11031 


U61234 


Homo sapiens 


cofactor C 


226 


63 


11032 


M10546 


Homo sapiens 


cytochrome oxidase I 


116 


88 


11033 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


584 


93 


11034 


LI 09 10 


Homo sapiens 


splicing factor 


108 


58 


11035 


M26361 


Mus musculus 


LINE/Ig H-chain fusion protein 


140 


48 


11036 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


101 


79 


11037 


AK024455 


Homo sapiens 


FLJ00047 protein 


122 


48 


11038 


AF126163 


Homo sapiens 


HHLA3 protein 


96 


68 


11039 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


144 


79 


11040 


AK024455 


Homo sapiens 


FLJ00047 protein 


151 


62 


11041 


L06498 


Homo sapiens 


ribosomal protein S20 


321 


73 


11042 


AP000616 


Oryza sativa 


similar to RTNG-H2 finger protein 
RHAla(AF078683) 


123 


84 


11043 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


125 


68 


11044 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


208 


91 


11045 


G00552 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4633. 


79 


62 


11046 


D38112 


Homo sapiens 


ATPase subunit 6 


297 


90 


11047 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


416 


73. 


11048 


G00491 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4572. 


73 


80 


11049 


AF090944 


Homo sapiens 


PRO0663 


96 


70 


11050 


AF1 30089 


Homo sapiens 


PRO2550 


152 


71 


11051 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


108 


81 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 

--Score 


% 

Identit 

y 








gene 22 clone HMSJW18. 






11052 


WS9031 


Homo sapiens 


Polypeptide fragment encoded by °ene 
167. 





--— 


11053 


AF 109907 


Homo sapiens 


S164 


136 


- — 


1 1054 


AF217536 


Homo sapiens 


truncated mevalonate kinase 


1 17 


58 


1 1055 


X72004 


Halichoems 
grypus 


ATP synthase subunit 6 


160 


78 


11056 


V00662 


Homo sapiens 


ATPase 6 


300 


71 


1 1057 


AF228021 


Bos taurus 




-?7r! 


~ir 


1 1058 


L27428 




revers^transcriptase 






1 1059 


U09116 


Homo sapiens 


ORF1, encodes a 40 RDa product 


~4^4" 


... 


11060 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


145 


68 


11061 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


301 


85 


1 1062 


Y27S54 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 






1 1063 


Y08061 


Homo sapiens 








11064 


U50188 


Cavia porcellus 


CGRP-receptor component protein 


152 


88 


1 1065 


G03107 




Human secreted protein, SEQ ID NO: 
7188. 






1 1066 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 12 


64 


1 1067 


AP000616 


Oryza sativa ' 


similar to RING-H2 finger protein 
RHAla(AF078683) 




71 


1 1068 




Drosophila 
melanogaster 


CG 1 2706 gene product 


145 


43 


1 1069 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


1 60 


65 


1 1070 


D38112 


Hom° SaP j e " S 


cytochrome c oxidase subunit 1 


416 


67 


1 1 071 






cytochrome c oxidase subunit 3 


220 


86 


1 1072 


G02532 


Homo sapieni 


Human secreted protein, SEQ ID NO" 
6613. 






1 1073 


G03411 


Homo sapiens 


7 4 9 2 P ' SEQ 1D N °' 






11074 


G03787 


Homo sapiens 


7868. 


117 


65 


1 1075 


D38112 


Homo sapiens 




— — 




1 1076 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


~588 


ih — 


11077 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c) 


106 


86 


11078 


Y86248 


Homo sa iens 

omo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


. 

90 


50 


11079 


AF1 32200 


Homo sapiens 


PRO 1751 


~97 


— 


11080 


AF1 16661 


Homo sapiens 


PRO 143 8 


109 


51 


11081 


AK024455 


Homo sapiens 


FLJ00047 protein 




48 


1 1082 


G03786 




Human secreted protein s SEQ ID NO! 
7867. 


Jj| 




1 1083 


J02963 




platelet glycoprotein lib precursor 




80 


11084 


AF068294 


Homo sapiens 


HDCMB45P 


229 


59 


1 1085 


Ml 0546 


omo sapiens 


cytochrome oxidase 1 


137 


73 




Y86248 


rlomo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:l63. 


105 


49 


11087 


L26953 


Homo sapiens 


chromosomal protein 


122 


75 


11088 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


103 


46 


11089 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


112 


56 


11090 


AF130089 


Homo sapiens 


PRO2550 


172 


60 


11091 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


107 | 35 
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NO: 


Accession No. 


Species 


Description 


Wntcrman 
Score 


Idcntit 

y 


11092 


AF090930 


Homo sapiens 


PRO0478 


146 


75 


11093 


R96800 


Homo sapiens 


Human histiocyte-secrcted factor HSF. 


163 


82 


1 1094 


V00662 


Homo sapiens 


ATPase 6 


130 


87 


11095 


B01372 


Homo sapiens 


Neuron-associated protein. 


122 


82 


11096 


D38112 


Homo sapiens 


ATPase subunit 6 


152 


62 


11097 


U18339 


Variola virus 


D4L 


96 


60 


11098 


AP000616 


Oryza sativa 


similar to RTNG-H2 finger protein 
RHAla(AF078683) 


113 


60 


11099 


AF 130089 


Homo sapiens 


PRO2550 


128 


75 


11100 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


218 


62 


11101 


U49973 


Homo sapiens 


ORF1; MER37;. putative transposase 
similar to pogo element 


232 


79 


1 1 102 


AF090930 


Homo sapiens 


PRO0478 


154 


73 


11103 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


133 


65 


11104 


K03036 


Mus musculus 


alpha- 1 type I procollagen 


105 


43 


11105 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


142 


59 


11106 


L13129 


Mus musculus 


synexin 


119 


35 


11107 


AF280111 


Homo sapiens 


cytochrome P450 subfamily III A 
polypeptide 43 


122 


52 


11108 


J00314 


Homo sapiens 


beta-tubulin 


606 


94 


11109 


X05196 


Homo sapiens 


aldolase C 


412 


89 


111 10 


D00570 


Mus musculus 


open reading frame (196 AA) 


127 


85 


11111 


D50532 


Homo sapiens 


macrophage lectin 2 


165 


65 


11112 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


125 


74 


11113 


M15530 


Homo sapiens 


B-cell growth factor 


163 


82 


11114 


V00662 


Homo sapiens 


ATPase 6 


217 


91 


11115 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


165 


100 


11116 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


119 


36 


1 1 1 17 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


132 


59 


11118 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


103 


67 


11119 


G00379 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4460. 


94 


42 


11120 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


127 


75 


11121 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


104 


64 


11122 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


90 


59 


11123 


AF161356 


Homo sapiens 


HSPC093 


119 


64 


11124 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


102 


70 


11125 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


149 


60 


11126 


AF068294 


Homo sapiens 


HDCMB45P 


272 


59 


11127 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


120 


79 


11128 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


101 


78 


11129 


AF130089 


Homo sapiens 


PRO2550 


124 


80 


11130 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


109 


51 . 


11131 


U40265 


Trypanosoma 
cruzi 


ATPase subunit 6 


102 


31 
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NO: 


Accession No. 


Species 


Description 


Smith- 


% 
y 


11132 


X01677 


Homo sapiens 


glyceraldehyde-3-phosphate 
dehydrogenase 


555 


88 


11133 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


108 


37 


31134 


M81757 


Homo sapiens 


SI 9 r-ibosomal protein 


232 


76 


11135 


U93565 


Homo sapiens 


putative pi 50 


173 


51 


11136 


AF061944 


Homo sapiens 


kinase deficient protein KDP 


369 


100 


11137 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


145 


61 


11138 


AB030255 


Homo sapiens 


serine/threonine phosphatase ] gamma 


404 


83 


11139 


AL049610 


Homo sapiens 


dJ1055C14.2 (KIAA0026 
(transcription factor-like protein 
MRGX)) 


248 


77 


11140 


Y36421 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 8. 


81 


53 


11141 


Ml 3692 


Homo sapiens 


alpha- 1 acid glycoprotein precursor 


1 12 


41 


11142 


M16961 


Homo sapiens 


alpha-2-HS-glycoprotein 


543 


89 


11143 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


605 


88 


11144 


AF090944 


Homo sapiens 


PRO0663 


137 


67 


11 145 


X92485 


Plasmodium 
vivax 


pval 


102 


57 


11146 


AF130089 


Homo sapiens 


PRO2550 


113 


58 


11147 


U63542 


Homo sapiens 


FAP protein 


154 


57 


11148 


W03988 


Homo sapiens 


SH2 domain from human SH-PTP2 
(amino acids 1-106). 


162 


73 


11149 


AF116695 


Homo sapiens 


PR02221 


173 


44 


11150 


M15530 


Homo sapiens 


B-cell growth factor 


105 


45 


11151 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


160 


64 


1 1152 


M15530 


Homo sapiens 


B-cell growth factor 


124 


75 


11153 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


117 


67 


11154 


AF1 18082 


Homo sapiens 


PRO 1902 


79 


52 


11155 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


123 


72 


11156 


Y27893 


Homo sapiens 


Human secreted protein encoded by 
gene No. 116. 


266 


96 


11157 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


148 


84 


11158 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


154 


72 


11159 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


105 


58 


11160 


X60376 


Brassica napus 


proline-rich protein 


111 


39 


11161 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


111 


37 


11162 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


149 


53 


11163 


AF090930 


Homo sapiens 


PRO0478 


146 


73 


11164 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


90 


60 


11165 


AF242772 


Homo sapiens 


mesenchymal stem cell protein 
DSCD28 


141 


5] 


11166 


X82629 


Homo sapiens 


Mox-2 


307 


83 


11167 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


168 


93 


11168 


X55684 


Lycopersicon 
esculentum 


extensin (class I) 


71 


66 


11169 


AC079041 


Arabidopsis 
thaliana 


hypothetical protein 


98 


45 


11170 


AF090930 


Homo sapiens 


PRO0478 


156 


50 
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SEQ ID 

NO: 


Accession No. 




Description 


Watcrrann 


Identit 

y 


11171 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


99 


70 


11172 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


124 


54 


11373 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


587 


82 


11 174 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


127 


66 


11175 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


157 


68 


1 1 176 


AF09093 1 


Homo sapiens 


PRO04S3 


146 


87 


11377 


M15530 


Homo sapiens 


B-cell growth factor 


153 


75 


11178 


AF090930 


Homo sapiens 


PRO0478 


129 


52 


11179 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


98 


77 


11180 


AF1 94537 


Homo sapiens 


NAG 13 


165 


48 


11181 


AF068294 


Homo sapiens 


HDCMB45P 


146 


53 


11182 


AF09093 I 


Homo sapiens 


PRO0483 


153 


78 


11183 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


141 


65 


11184 


Y 13396 


Homo sapiens 


Arr-r»o acid sequence of protein 
PR 532. 


452 


92 


11185 


AF090930 


Homo sapiens 


PRO0478 


132 


51 . 


11186 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


178 


79 


11187 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


237 


70 


11188 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


117 


83 


11189 


AL080206 


Homo sapiens 


hypothetical protein 


103 


71 


11190 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


135 


77 


11191 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


112 


65 


11192 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


127 


54 


11193 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


150 


65 


11194 


M92046 


Trypanosoma 
cruzi 


surface antigen 


160 


26 


11195 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


126 


72 


11196 


G03224 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7305. 


98 


75 


11197 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


117 


50 


11198 


D88548 


Homo sapiens 


24-kDa subunit of complex I 


160 


47 


11199 


AF090852 


Antilocapra 
americana 


prion protein 


115 


38 


11200 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


136 


62 


11201 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


139 


75 


11202 


U93563 


Homo sapiens 


putative pi 50 


239 


43 


11203 


U93570 


Homo sapiens 


putative pi 50 


173 


45 


11204 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


166 


68 


11205 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO. 163. 


155 


68 


11206 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


87 


56 



433 
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Accession No. 


Species 


Description 


Smith- 




NO: 










ldcntit 










Score 


y 


11207 


W48808 


Homo sapiens 


Homo sapiens clone CG109_1 protein. 


123 


89 


11208 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


619 


87 


11209 


W48808 


Homo sapiens 


Homo sapiens clone CG109_1 protein. 


125 


89 


11210 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


521 


83 


11211 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


76 








6566. 






11212 


D38U2 


Homo sapiens 


cytochrome c oxidase subunit 1 


547 


81 


11213 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


92 


74 








7878. 






11214 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


63 








7253. 






1121.5 


AF162149 


Mycoplasma 


variable surface lipoprotein 


118 


34 






bovis 






11216 


AF 130089 


Homo sapiens 


PRO2550 


147 


96 


11217 


AF090931 


Homo sapiens 


PRO0483 


1 12 


77 


11218 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


163 


70 








6613. 






11219 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


55 








4497. 






11220 


AF130051 


Homo sapiens 


PRO0898 


122 


63 


11221 


AF 130079 


Homo sapiens 


PR02852 


171 


64 


11222 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


55 








4414. 






11223 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


130 


71 








gene 22 clone HMSJW18. 






11224 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


69 








7870. 






11225 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


85 








6613. . 






11226 


G02360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


90 








6441. 






11227 


Ml 5530 


Homo sapiens 


B-cell growth factor 


122 


55 


11228 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


. 121 


67 








encoded by gene 17. 






11229 


M15317 


Plasmodium 


histidine-nch protein 


116 


43 






lophurae 








11230 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


131 


78 








7878. 






11231 


W19932 


Homo sapiens 


Alzheimer's disease protein encoded by 


74 


76 








DNA from plasmid pGCS55. 






11232 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


134 


67 








SEQIDNO:163. 






11233 


AK023045 


Homo sapiens 


unnamed protein product 


100 


69 


11234 


A P0006 16 


Oryza sativa 


similar to RING-H2 finger protein 


127 


82 








RHAla(AF078683) 






11235 


AP000616 


Oryza sariva 


similar to RING-H2 finger protein 


125 


65 








RHAla(AF078683) 






11236 


AP000616 


Oryza sativa 


similar to RTNG-H2 finger protein 


109 


53 








RHAla(AF078683) 






11237 


AF029403 


Homo sapiens 


oxysterol 7alpha-hydroxylase 


144 


72 


11238 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


112 


44 








6953. 






11239 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


161 


96 








SEQ ID NO: 163. 






11240 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


43. 








6566. 






11241 


L36529 


Homo sapiens 


protein p84 


343 


90 


11242 


M15530 


Homo sapiens 


B-cell growth factor 


122 


55 


11243 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 


130 


65 . 
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NO: 


Accession No. 




Desc ription 


Waterman 
Score 


Identit 

y 








clone HCACJ81. 






11244 


U92698 


Rattus 
norvegicus 


ribosomal protein S2 


229 


74 


11245 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


116 


41 


11246 


W75859 


Homo sapiens 


Human secretory protein of clone 
DM340- 1. 


123 


37 


11247 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


132 


73 


11248 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7S70. 


93 


56 


11249 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


132 


56 


11250 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


128 


72 


11251 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


115 


46 


11252 


AK021455 


Homo sapiens 


unnamed protein product 


146 


81 


11253 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


88. 


56 


11254 


S80916 


Homo sapiens 


parotid "o" protein, Po=salivary 
proline-rich protein {exon 3} 


113 


43 


11255 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


136 


45 


11256 


G02382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6463. 


107 


67 


11257 


AF1 94537 


Homo sapiens 


NAG 13 


153 


76 


11258 


Ml 5530 


Homo sapiens 


B-cell growth factor 


130 


70 


11259 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


131 


72 


11260 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


104 


59 


11261 


G 02922 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7003. 


316 


92 


11262 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


111 


81 


11263 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


142 


71 


11264 


AF1 30089 


Homo sapiens 


PRO2550 


111 


66 


11265 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


118 


71 


11266 


M15530 


Homo sapiens 


B-cell growth factor 


158 


76 


11267 


AF 130079 


Homo sapiens 


PR02852 


124 


68 


11268 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


104 


50 


11269 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


125 


57 


11270 


G037S7 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


107 


63 


31271 


AF 118082 


Homo sapiens 


PRO 1902 


87 


47 


11272 


Ml 0546 


Homo sapiens 


cytochrome oxidase 1 


305 


74 


11273 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


108 


53 


11274 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


235 


81 


11275 


U93570 


Homo sapiens 


putative pi 50 


182 


54 


1 1276 


X03145 


Homo sapiens 


pot. ORF V 


144 


60 


11277 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


103 


57 


11278 


AL035681 


Homo sapiens 


dJ756G23. 1 (novel Leucine Rich 


269 


100 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 








Protein) 






] 1279 


G00379 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4460. 


103 


61 


11280 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


115 


70 


11281 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


87 


71 


11282 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


162 


81 


11283 


AF1 16715 


Homo sapiens 


PR02829 


141 


71 


11284 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


III 


76 


11285 


U52077 


Homo sapiens 


mariner transposase 


198 


78 


11286 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO' 
6613. 


120 




1 12S7 


G03798 


Homo sapiens 


Human secreted protein SEQ ID NO* 
7879. 






11288 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


63 


60 


11289 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


123 


72 


11290 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


120 


70 


11291 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


106 


46 


11292 


AF1 16715 


Homo sapiens 


PR02829 


130 


63 


11293 


AF090930 


Homo sapiens 


PRO0478 


134 


60 


11294 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


1 19 


65 


1 1295 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 




50 


11296 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


141 


49 


11297 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


154 


72 


11298 


B08918 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 28 SEQ ID NO:75. 


126 


54 


11299 


AK025270 


Homo sapiens 


unnamed protein product 


150 


54 


11300 


AF 11 8078 


Homo sapiens 


PR01848 


121 


63 


11301 


AF090942 


Homo sapiens 


PRO0657 


81 


43 


11302 


AF1 16715 


Homo sapiens 


PR02829 


131 


60 


11303 


AF090931 


Homo sapiens 


PRO0483 


121 


71 


11304 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


108 


82 


11305 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


122 


59 


11306 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


93 


89 


11307 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


108 


65 


11308 


G 02 872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


114 


51 


11309 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


67 


55 


11310 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


105 


71 


11311 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


104 


73 


11312 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


88 


73 
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SEQ ID 

NO: 


Accession No. 




Description 


Smith- 

Watermnn 

Score 


% 

y 


11313 


U40265 


Trypanosoma 
cruzi 


ATPase subunit 6 


94 


31 


11314 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 1 07. 


157 


53 


11315 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


113 


54 


11316 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


165 


90 


11317 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


149 


74 


11318 


AP000616 


Oryza sativa 


similar to RJTNG-H2 Finger protein 
RHAla(AF078683) 


144 


55 


11319 


AF161356 


Homo sapiens 


HSPC093 


112 


56 


11320 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


102 


53 


11321 


AF1 16661 


Homo sapiens 


PRO 143 8 


131 


74 


11322 


AK000496 


Homo sapiens 


unnamed protein product 


137 


59 


11323 


G03703 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7784. 


74 


57 


11324 


U63542 


Homo sapiens 


FAP protein 


151 


79 


11325 


Ml 5530 


Homo sapiens 


B-cell growth factor 


192 


45 


11326 


AF1 16715 


Homo sapiens 


PR02829 


164 


78 


11327 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


127 


62 


11328 


AF090895 


Homo sapiens 


PRO0117 


1 16 


69 


11329 


AF090942 


Homo sapiens 


PRO0657 


132 


62 


11330 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


190 


72 


11331 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


112 


49 


11332 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


119 


38 


11333 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


131 


52 


11334 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


145 


53 


11335 


X92485 


Plasmodium 
vivax 


pval 


105 


38 


11336 


M15530 


Homo sapiens 


B-cell growth factor 


148 


73 


11337 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


126 


100 


11338 


AF 118086 


Homo sapiens 


PRO 1992 


123 


80 


11339 


G02532 . 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


79 


66 


11340 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


162 


73 


11341 


AF 13 0079 


Homo sapiens 


PR02852 


145 


80 


11342 


M19419 


Mus musculus 


proline-rich salivary protein 


104 


44 


11343 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


123 


62 


11344 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


120 


91 


11345 


U94832 


Homo sapiens 


KSRP 


122 


51 


11346 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


139 


56 


11347 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


188 


87 


11348 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


173 


59 


11349 


W48351 


Homo sapiens 


Human breast cancer related protein 


111 


68 
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Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 








BCRB2. 






11350 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


149 


62 


11351 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


267 


6S 


11352 


AF161356 


Homo sapiens 


HSPC093 


134 


51 


11353 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


139 


47 


11354 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


1 15 


64 


11355 


G0253S 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


126 


67 


11356 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588. 


123 


59 


11357 


AF130089 


Homo sapiens 


PRO2550 


114 


62 


11358 


AK000496 


Homo sapiens 


unnamed protein product 


126 


41 


11359 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


173 


81 


11360 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


128 


64 


11361 


AF130056 


Homo sapiens 


PRO 1367 


74 


57 


11362 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7496. 


116 


63 


11363 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCSl 1037. 


104 


91 


11364 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


137 




11365 


Z30643 


Homo sapiens 


chloride channel (putative) 






11366 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


107 


68 


11367 


AF068294 


Homo sapiens 


HDCMB45P 


281 


57 


1 3368 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


1 10 


71 


11369 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


136 


89 


11370 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


129 


79 


11371 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


137 


57 


11372 


U18339 


Variola virus 


D4L 


95 


61 


11373 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


100 


52 


11374 


M15530 


Homo sapiens 


B-cell growth factor 


135 


61 


11375 


X92485 


Plasmodium 
vivax 


pval 


111 


50 


11376 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 






11377 


W48351 


Homo sapiens 


BCRB2. P 






11378 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


96 




11379 


G02567 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6648. 


83 


54 


11380 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


110 


85 


11381 


GO3790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


156 


64 


11382 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


121 


79 


11383 


R13556 


Homo sapiens 


Protein encoded downstream of hhcjvl 
oncoprotein. 


104 


41 
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NO: 


Accession No. 






Waterman 


y 


11384 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


86 


53 


11385 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF0786S3) 


114 


40 


11386 


AF068294 


Homo sapiens 


HDCMB45P 


158 


48 


11387 


B08976 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 28 SEQ ID NO: 133. 


136 


46 


11388 


AP090931 


Homo sapiens 


PRO0483 


123 


74 


11389 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


156 


75 


11390 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


102 


48 


11391 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


117 


54 


11392 


W54966 


Homo sapiens 


Synthetic human type III collagen 
SYN-C3. 


134 


43 


11393 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


111 


46 


11394 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


128 


49 


11395 


AF090930 


Homo sapiens 


PRO0478 


148 


70 


11396 


AF2 10651 


Homo sapiens 


NAG 18 


100 


62 


11397 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


121 


56 


11398 


AF063243 


Bos taurus 


ribosomal protein L30 


165 


77 


11399 


AF162149 


Mycoplasma 
bovis 


variable surface lipoprotein 


104 


37 


11400 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


175 


71 


11401 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


341 


92 


11402 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


137 


90 


11403 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


115 


70 


11404 


M10126 


Leishmania 
tarentolae 


NH2 terminus uncertain 


101 


41 


11405 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


126 


69 


11406 


X82385 


Homo sapiens 


RNA polymerase 11 subunit 


296 


100 


11407 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


151 


68 


11408 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1 7. 


141 


44 


11409 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


118 


77 


11410 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


169 


77 


11411 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


169 


82 


11412 


AK025116 


Homo sapiens 


unnamed protein product 


133 


44 


11413 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


113 


63 


11414 


AK024455 


Homo sapiens 


FLJ00047 protein 


127 


65 


11415 


AK024455 


Homo sapiens 


FLJ00047 protein 


87 


63 


11416 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


94 


57 


11417 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


123 


58 


11418 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


148 


69 
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SEQID 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Iilcntit 

y 


11419 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


113 


53 


11420 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


166 


68 


11421 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


124 


51 


11422 


GO 1249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5330. 


113 


70 


11423 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


88 


62 


11424 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


136 


72 


11425 


AK002154 


Homo sapiens 


unnamed protein product 


292 


75 


11426 


Y02887 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


95 




11427 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


143 


64 


11428 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


131 


81 


11429 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


130 


81 


11430 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


172 


61 


11431 


G0032S 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


1 12 


74 


11432 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


128 


52 


11433 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


128 


58 


11434 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


147 


63 


11435 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene22cloneHMSJW18. 


139 


83 


11436 


X63220 


Bos taurus 


NADH dehydrogenase 


154 


57 


11437 


AF090895 


Homo sapiens 


PRO0U7 


122 


59 


1 1438 


J04655 


Ascaris suum 


collagen 


90 


46 


11439 


AF090942 


Homo sapiens 


PRO0657 


166 


52 


11440 


AK024455 


Homo sapiens 


FLJ00047 protein 


100 


55 


11441 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


116 


49 


11442 


M15530 


Homo sapiens 


B-cell growth factor 


105 


75 


11443 


G03043 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
7124. 


133 


64 


11444 


Y19730 


Homo sapiens 


SEQ ID NO 448 from W09922243. 


115 


67 


11445 


AF090895 


Homo sapiens 


PRO01I7 


79 


61 


11446 


AF1 18086 


Homo sapiens 


PRO 1992 


109 


63 


11447 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


160 


80 


11448 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


97 


75 


11450 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


136 


72 


11451 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


89 


55 


11452 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


144 


71 


11453 


AF090931 


Homo sapiens 


PRO0483 


140 


78 


11454 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


129 


65 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Idcntit 


11455 


G00383 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4464. 


104 


63 


1 1456 


AF1 18086 


Homo sapiens 


PRO 1992 


151 


71 


11457 


U73168 


Homo sapiens 


partial CDS, human putative tumor 
suppressor (TJ23946) 


121 


95 


11458 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


96 


73 


11459 


AK000496 


Homo sapiens 


unnamed protein product 


233 


63 


11460 


AK024455 


Homo sapiens 


FLJ00047 protein 


90 


73 


11461 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


132 


64 


11462 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


103 


86 


11463 


K03207 


Homo sapiens 


salivary proline-rich protein precursor 


135 


36 


11464 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


127 


88 


11465 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


104 


77 


11466 


AF1 15549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


141 


48 


11467 


AF090942 


Homo sapiens 


PRO0657 


89 


58 


1 1468 


AC005698 


Arabidopsis 
thaliana 


T3P18.6 


128 


47 


11469 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


101 


48 


1 1470 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


136 


81 


11471 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


91 


45 


11472 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


102 


77 


11473 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


145 


55 


11474 


AF2 10651 


Homo sapiens 


NAG 18 


1 17 


57 


11475 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


117 


62 


11476 


AF1 16715 


Homo sapiens 


PR02829 


124 


77 


11477 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


136 


45 


11478 


G02828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6909. 


85 


78 


11479 


X91940 


Homo sapiens 


WNT-8B protein 


204 


97 


1 1480 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


131 


71 


11481 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


117 


63 


11482 


X92485 


Plasmodium 
vivax 


pval 


98 


52 


11483 


AF217518 


Homo sapiens 


uncharacterized bone marrow protein 
BM042 


895 


100 


11484 


U79260 


Homo sapiens 


unknown 


104 


80 


11485 


L27428 


Homo sapiens 


reverse transcriptase 


3 69 


52 


11486 


M15530 | 


Homo sapiens 


B-cell growth factor 


122 


67 


11487 


U49974 


Homo sapiens 


manner transposase 


156 


62 


11488 


AF090942 


Homo sapiens 


PRO0657 


110 


44 


11489 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


156 


65 


11490 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


115 


52 


11491 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


51 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identic 








7214. 






1 1492 


AF090930 


Homo sapiens 


PRO0478 


145 




11493 


X63726 


Phoca vitulina 


cytochrome c oxidase subunit I 


556 


87 


11494 


AL021897 


Mycobacterium 
tuberculosis 




1 19 




11495 


X98485 


Plasmodium 
vivax 


putative 


92 




11496 


Y01 158 


Homo sapiens 


clone HCACJ81. 






11497 


G02538 


Homo sapiens 


6619. 






1 1498 


G02532 




Human secreted protein SEQ ID WO* 
6613. 






11499 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. ' ' ■ 






1 1500 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO' 
7879. 






11501 


AE003568 


Drosophila 
melanogaster 


CGI 2566 gene product 






1 1502 


G03043 




7124. 






1 1503 


U12390 


p Sport 1 


beta-ga.lactosidase alpha peptide 






1 1504 


Q03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 






1 1505 


G03714 




Human secreted protein, SEQ ID WO: 
7795. 






1 1506 


W48351 


Homo sapiens 


BCRB2 breaSt CanCCr re,ated P rotem 


101 




11507 


M81321 


Macaca 
fascicularis 


proline-rich protein 


128 


48 


11508 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


110 


78 


11509 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


98 


~60 


1 1510 


X07882 


Homo sapiens 


Po protein 


1 IS 




11511 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


133 


79 


11512 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


112 


68 


11513 


AF000381 


Homo sapiens 


non-functional folate binding protein 


~2lr 


— 


1 1514 


X92485 


Plasmodium 
vivax 




~TT3 


~44 


11515 


S58722 


Homo sapiens 


terminal, clone XEH.8c} 


109 




11516 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


141 


78 


1 1517 


AF090894 


Homo, sapiens 


PRO0113 


"To8 


— 


11518 


M15530 


Homo sapiens 


B-cell growth factor 


"93 


— 


11519 


G03084 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7165. 


66 


73 


11520 


AF 117065 


Homo sapiens 


male-specific lethal-3 homolog 1 


200 


61 


11521 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


335 


87 


11522 


M15530 


Homo sapiens 


B-cell growth factor 


104 


74 


11523 


AF1 16715 


Homo sapiens 


PR02829 


122 


64 


11524 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


145 


68 


11525 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


103 


61 



442 



WO 01/64835 



PCT/U SO 1/04927 



NO: 








Waterman 


Identit 

y 








gene 22 clone HMSJW18. 






11526 


X92485 


Plasmodium 


pval 


111 


42 


11527 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


101 


80 


11528 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:387. 


134 


58 


11529 


AF090931 


Homo sapiens 


PRO0483 


163 


76 


11530 


AF1 19900 


Homo sapiens 


PR02822 


139 


44 


11531 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


126 


73 


11532 


U41038 


Caenorhabditis 
elegans 


Similar to cadherin-type repeat 


126 


50 


11533 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


107 


46 


11534 


AF090930 


Homo sapiens 


PRO0478 


120 


81 


11535 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


105 


64 


11536 


X92485 


Plasmodium 
vivax 


pval 


123 


56 


11537 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


122 


72 


11538 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


108 


64 


11539 


AE003834 


Drosophila 
melanogaster 


CG8054 gene product 


456 


84 


11540 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


135 


53 


11541 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


99 


70 


11542 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


77 


63 


11543 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


123 


73 


11544 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


78 


58 


11545 


U79260 


Homo sapiens 


unknown 


128 


43 


11546 


Y95028 


Homo sapiens 


Human clone vp7 1 ORF2, SEQ ID 
NO: 128. 


122 


56 


11547 


X92485 


Plasmodium 
vivax 


pval 


116 


46 


11548 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


111 


55 


11549 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


95 


75 


11550 


U67056 


Acanth amoeba 
castellan ii 


myosin I heavy chain kinase 


111 


44 


11551 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


159 


60 


11552 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


127 


50 


11553 


Z34975 


Homo sapiens 


ldlCp 


470 


86. 


11554 


AF2 10651 


Homo sapiens 


NAG 18 


101 


65 


11555 


X78677 


Homo sapiens 


ketohexokinase 


241 


92 


11556 


G04039 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8120. 


254 


72 


11557 


AB032976 


Homo sapiens 


KIAA1 150 protein 


565 


75 


11558 


U93572 


Homo sapiens 


p40 


127 


37 



443 



_0164835A2_I_> 



WO 01/64835 PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smitli- 

Waterman 

Score 


% 
y 


11559 


U27488 


Pseudorabies 
virus 


glycoprotein gX 


88 


38 


11560 


R80095 


Homo sapiens 


Human superoxide-dismutase-4 
polypeptide. 


113 


63 


11561 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


83 


65 


11562 


AF 130089 


Homo sapiens 


PRO2550 


111 


33 


11563 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


146 


60 


11564 


AF090895 


Homo sapiens 


PRO0117 


172 


77 


11566 


V01512 


Homo sapiens 


c-fos 


416 


82 


11567 


S5S722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


72 


33 


11568 


XI 6074 


Mus sp. 


L-34 protein (AA 1-264) 


117 


35 


11569 


Y45381 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


131 


77 


11570 


AK025395 


Homo sapiens 


unnamed protein product 


683 


98 


11571 


D78255 


Mus musculus 


PAP-1 


134 


80 


11572 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


110 


70 


11573 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


131 


78 


11574 


J01415 


Homo sapiens 


cytochrome oxidase subunit 3 


141 


59 


11575 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


102 


58 


11576 


AC007842 


Homo sapiens 


BC331191 1 


409 


86 


11577 


Z22636 


Sinorhizobium 
meliloti 


Thi 


103 


36 


11578 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


83 


36 


11579 


G03703 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7784. 


115 


60 


11580 


AF162149 


Mycoplasma 
bovis 


variable surface lipoprotein 


167 


44 


11581 


AF 13 0079 


Homo sapiens 


PR02852 


117 


77 


11582 


AF229067 


Homo sapiens 


PAD1-H protein 


136 


75 


11583 


R10755 


Homo sapiens 


Non-A non-B hepatitis specific 
antigenic protein encoded by 
phageclone lambda HC2533. 


132 


72 


11584 


Z70292 


Homo sapiens 


chemokine CC- 1 


386 


93 


11585 


R10755 


Homo sapiens 


Non-A non-B hepatitis specific 
antigenic protein encoded by 
phageclone lambda HC2533. 


107 


67 


11586 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


134 


75 


11587 


X05006 


Homo sapiens 


S-protein 


237 


100 


11588 


Y21850 


Homo sapiens 


Human signal peptide-contianing 
protein (SIGP) (clone ID 1 880830). 


365 


77 


11589 


U43959 


Homo sapiens 


beta 4 adducin 


114 


54 


11590 


AF030162 


Homo sapiens 


inner mitochondrial membrane 
translocase Tim23 


104 


95 


11591 


X03557 


Homo sapiens 


56-KDa protein (aa 1-478) 


491 


74 


11592 


AF151074 


Homo sapiens 


HSPC240 


151 


73 


11593 


AF1 16719 


Homo sapiens 


PR02987 


558 


89 


11594 


AF090942 


Homo sapiens 


PRO0657 


120 


41 


11595 


X55039 


Homo sapiens 


centromere autoantigen B (CENP-B) 


160 


32 


11596 


X98475 


Mus musculus 


vasodilator-stimulated phosphoprotein 


101 


43 


11597 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4670. 


140 


73 



444 



_0164835A2J_> 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smitli- 
Watermnn 


1 % 
Idcntit 


11598 


U07151 


Homo sapiens 


ARJL3 


1 13 


35 


11599 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


123 


79 


11600 


AL050084 


Homo sapiens 


hypothetical protein 


1 19 


100 


1 1601 


AF090931 


Homo sapiens 


PRO0483 


124 


73 


11602 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


132 


60 


] 1603 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


156 


82 


11604 


AF224494 


Mus musculus 


arsenite inducible RNA associated 
protein 


473 


79 


11605 


M60047 


Homo sapiens 


heparin binding protein 


597 


95 


11606 


G00933 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5014. 


721 


97 


11 607 


W12091 


Homo sapiens 


C57S derivative of antigen non-specific 
human GIF. 


573 


98 


1 1608 


AF229439 


Mus musculus 


zinc finger protein 289 


535 


80 


1 1609 


M86637 










11610 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


93 


70 


11611 


G 022 24 




6305. 






1 1612 


G02532 




Human secreted protein SEQ ID NO' 
6613. 






1 1613 


AK024372 


Homo sapiens 




1 05 




11614 


Y94918 


Homo sapiens 


Human secreted protein clone 
dd504 18 protein sequence SEQ ID 
NO.-42T 


310 


62 


11615 


Y48547 


Homo sapiens 


Human breast tumour-associated 
protein 8. 


641 


96 


1 1616 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


108 


81 


11617 


L340S1 


Homo sapiens 


bile acid CoA: Amino acid N- 
acyltransferase 


356 


82 


11618 


G01399 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5480. 


570 


98 


11619 


AF020038 


Homo sapiens 


NADP-dependent isocitrate 
dehydrogenase 


272 


83 


1 1620 


AB051901 


Homo sapiens 


VDUP1 


247 


-— 


1 1 62 1 


Y48547 


Homo sapiens 


Human breast tumour-associated 
protein 8. 


691 


— 


11622 


U22961 


Homo sapiens 


similar to human albumin, Swiss-Prot 
Accession Number P0276S; Method: 
conceptual translation supplied by 
author 


382 


71 


11623 


A06977 


Homo sapiens 


albumin 


562 


92 


11624 


X73460 


Homo sapiens 


ribosomal protein L3 


502 


84 


11625 


AL035398 


Homo sapiens 


dJ796I17.1 (PUTATIVE novel protein) 


373 


98 


11626 


D50310 


Homo sapiens 


cyclin I 


175 


100 


11627 


AE0O0950 


Archaeoglobus 
fulgidus 


thermosome, subunit alpha (thsA) 


131 


25 


11628 


D88315 


Mus musculus 


tetracycline transporter-like protein 


137 


81 


11629 


Y36204 


Homo sapiens 


Human secreted protein #76. 


406 


100 


11630 


AF 130089 


Homo sapiens 


PRO2550 


110 


63 


11631 


J02982 


Homo sapiens 


gjycophorin B precursor 


95 


100 


11632 


AF1 16719 


Homo sapiens 


PR02987 


558 


93 


11633 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


137 


92 


11634 


U09823 


Oryctolagus 


elongation factor 1 alpha 


483 


85 



445 



0 1 6483 5A2_ l_> 



WO 01/64835 PCT/US01/04927 ' 



SEQ ID 

NO: 


Accession No. 


Species 


Description 

•'• '■>; ■ ■; r 


Smith- 
Score 


Identtt 






cuniculus 








11635 


G01922 


Homo sapiens 


Human secretedprotein, SEQ ID NO: 
6003. 


157 


73 


11636 


AF1 16719 


Homo sapiens 


PR02987 


491 


90 


11637 


D14421 


Rattus 
norvegicus 


b isotype of B regulatory subunit of 
protein phosphatase 2A 


208 


86 


11638 


AF116719 


Homo sapiens 


PR02987 


544 


93 


11639 


AF 130079 


Homo sapiens 


PR02852 


127 


81 


11640 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


150 


59 


11641 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


100 


66 


11642 


G0341I 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


119 


71 


11643 


Y86574 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:491. 


64 


100 


11644 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


82 


70 


11645 


G02360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6441. 


109 


74 


11646 


AF 13 0089 


Homo sapiens 


PRO2550 


128 


52 


11647 


AK024455 


Homo sapiens 


FLJ00047 protein 


138 


61 


11648 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


73 


84 


11649 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


128 


60 


11650 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4518. 


83 


78 


1 1651 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


105 


64 


11652 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


103 


62 


11653 


AF 130089 


Homo sapiens 


PRO2550 


116 


69 


11654 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


176 


54 


11655 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


129 


63 


11656 


AF130089 


Homo sapiens 


PRO2550 


135 


77 


11657 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


146 


93 


11658 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


123 


61 


11659 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


117 


47 


11660 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


92 


84 


11661 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


70 


75 


11662 


U00029 


Saccharomyces 
cerevisiae 


Yhr217cp 


101 


50 


11663 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


107 


88 


11664 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


110 


90 


11665 


U63542 


Homo sapiens 


FAP protein 


119 


69 


11666 


AF130114 


Homo sapiens 


PR02459 


78 


38 


11667 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


106 


64 


11668 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


66 



4.46 



-ISOOCID: <WO 0164835A2_L» 



WO 01/64835 



PCT/US01/04927 



NO: 




Species 


Description 


Waterman 


Identit 

y 








4478. 






11669 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


77 


63 


11670 


G034U 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


116 


68 


11671 


AK021618 


Homo sapiens 


unnamed protein product 


97 


43 


11672 


G02482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6563. 


108 


55 


11673 


S75997 


Rattus sp. 


nucleoporin p62 homolog 


95 


40 


11674 


AF229163 


Homo sapiens 


natural resistance-associated 
macrophage protein 1 


104 


54 


11675 


M15530 


Homo sapiens 


B-cell growth factor 


89 


81 


11676 


M92357 


Homo sapiens 


B94 protein 


143 


100 


11677 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


132 


74 


11678 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


126 


64 


11679 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


166 


66 


11680 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


108 


85 


11681 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


145 


92 


11682 


M15530 


Homo sapiens 


B-cell growth factor 


127 


64 


11683 


AF090931 


Homo sapiens 


PRO0483 


152 


60 


11684 


AF130087 


Homo sapiens 


PR02411 


150 


68 


11685 


U18339 


Variola virus 


D4L 


94 


70 


11686 


S58722 


Homo sapiens 


X-Iinked retinopathy protein {C- 
terminal, clone XEH.8c} 


104 


80 


11687 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


134 


68 


11688 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


131 


70 


11689 


AF090944 


Homo sapiens 


PRO0663 ' 


129 


73 


11690 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871. 


111 


67 


11691 


U79260 


Homo sapiens 


unknown 


98 


44 


11692 


W78226 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1. 


361 


64 


11693 


G00962 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5043. 


269 


100 


11694 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


132 


83 


11695 


Y94526 


Homo sapiens 


Human lysine-rich statherin protein. 


136 


82 


11696 


AF1 16637 


Homo sapiens 


PRO 1489 


232 


95 


11697 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


108 


50 


11698 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


120 


81 


11699 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


150 


80 


11700 


AK024455 


Homo sapiens 


FLJ00047 protein 


148 


68 


11701 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


131 


72 


11702 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


105 


45 


11703 


AC005545 


Homo sapiens 


delta-adaptin, partial CDS 


127 


38 


11704 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


143 


81 



447 



0164835A2_I_> 



WO 01/64835 PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smirb- 

Watcrman 

Score 


Identit 

y 


] 1705 


G02532 




Human secreted protein, SEQ ID NO' 
6613. 


109 




1 1706 


G00637 


Homo sapiens 


4718. 


153 


67 


] 1707 


Y 02 671 


Homo sapiens 


Human secreted protein encoded by 
g ene22cloneHMSJWl8. 


129 


— 


11708 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


89 


69 


11709 


AB047600 


Macaca 
fasciculai'is 


hypothetical protein 


95 


63 


11710 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


100 


76 






Homo sapiens 


platelet glycoprotein lib precursor 






11712 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


58 


11713 


AF008196 


Homo sapiens 


bax epsilon 


140 


71 






Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 










Homo sapiens 


Human secreted protein; SEQ ID NO: 
4478. 


125 


67 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 






1 1717 


Y30822 


Homo sapiens 


Human secreted protein encoded from 


125 




U7]8 — 


G02480 


Homosa iens 

omo sapiens 


Human ecreted rote n SEO ID NO- 

uman secrete protein, Q 


— 


— 


1 1719 


Y20717 


— 

omo sapiens 


— '■ — — rr-j 

rotein fra" merit 59 W ' 


— — 


— 


1 1720 


AJ243883 


Periplaneta 

americana 


putative transcription factor 


1 18 


~57 


1 1721 


AF220264 




MOST-1 


121 




1 1722 


AF 130051 


Homo sapiens 


PRO0898 


169 


~7~2~ 


1 1723 


K0240 1 






507 


95 


11724 


K02401 


Homo sapiens 


chorionic somatomammotropin 


461 


85 


1 1725 


U93564 








"7^ 




AF 130089 


Homo sapiens 




~To~7~ 




11727 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


79 


11728 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


109 


90 


1 1729 


Y51824 




Human OSBH protein 






11730 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


80 


11731 


K02401 


Homo sapiens 


chorionic somatomammotropin 


459 


94 


117j2 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


9j 


42 


11733 — 


QQO447 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


"Til 


~67 








Fragment of human secreted protein 
encoded by gene 27. 








S79410 


Mus musculus 


nuclear localization signal binding 
protein 










Homo sapiens 


PRO0483 


3 2 <3 


9 § 




K0240 1 




chorionic somatomammotropin 






11738 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


73 


11739 


G02624 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6705. 


131 


57 


11740 


X55695 


Lycopersicon 
esculentum 


glycine-rich protein 


164 


48 


11741 


AK024455 


Homo sapiens 


FLJ00047 protein 


89 


77 



448 



0164835A2J_5 



WO 01/64835 



PCT/US01/O4927 



SEQID 

NO: 


Accession No. 


Species 


Description 


Smith- 

WfltcrmBn 

Score 


% 

Idcntit 

y 


11742 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO: 125. 


415 


90 


11743 


S58722 


Homo sapiens 


X-Jinked retinopathy protein {C- 
terminal, clone XEH.8c} 


105 


94 


11744 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


116 


87 


11745 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


131 


69 


1 1746 


W13S31 


Homo sapiens 


Human cyclin E delta 9 mutant. 


140 


52 


11747 


AB001431 


Mus musculus 


motor domain of KIF14 


179 


84 


11748 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


141 


72 


1 1749 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


99 


63 


11750 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


88 


64 


11751 


AF090931 


Homo sapiens 


PRO0483 


118 


68 


11752 


AF150105 


Homo sapiens 


small zinc finger-like protein 


194 


68 


11753 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


127 


69 


1 1754 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


100 


80 


11755 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


109 


65 


1 1756 


K02401 


Homo sapiens 


chorionic somatomammotropin 


427 


87 


1 1757 


U52077 


Homo sapiens 


mariner rransposase 


113 


82 


11758 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


133 


54 


11759 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


111 


80 


11760 


AF090944 


Homo sapiens 


PRO0663 


143 


76 


11761 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 6 


529 


99 


11762 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


171 


58 


11763 


AF 13 2200 


Homo sapiens 


PR01751 


127 


66 


11764 


AK025116 


Homo sapiens 


unnamed protein product 


141 


80 


11765 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


75 


62 


11766 


AF090930 


Homo sapiens 


PRO0478 


106 


59 


11767 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


127 


80 


11768 


AF090931 


Homo sapiens 


PRO0483 


164 


60 


11769 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


102 


81 


11770 


AF161356 


Homo sapiens 


HSPC093 


131 


68 


11771 


AF130089 


Homo sapiens 


PRO2550 


160 


82 


11772 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


140 


79 


11773 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


136 


71 


11774 


AF090942 


Homo sapiens 


PRO0657 


153 


78 


11775 


AJ223410 


Homo sapiens 


EBIl-ligand chemokine 


108 


52 


11776 


K02401 


Homo sapiens 


chorionic somatomammotropin 


395 


91 


11777 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


107 


70 


11778 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


99 


62 


11779 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


84 


64 



449 



WO 01/64835 



PCT/US01/04927 



NO: 


Accession No. 




Description 


Smith- 
VVHlerman 


% 

Identit 


1 1780 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4433. 


1 16 


68 


11781 


AF1 94537 


Homo sapiens 


NAG 13 


201 


65 


11782 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


139 


68 


1 1783 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


149 


59 


11784 


G0341I 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


112 


78 


117S5 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


99 


68 


11786 


AF130089 


Homo sapiens 


PRO2550 


108 


83 


11787 


S58722 


Homo sapiens 


X-Jinked retinopathy protein {C- 
terminal, clone XEH.Sc} 


124 


88 


11788 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


103 


100 


11789 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


146 


60 


11790 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68 
SEQ ID NO: 163. 


165 


73 


11791 


AL035494 


Homo sapiens 


dJ635G 19.2.3 (novel protein 
(PUTATIVE PARTIAL isoform 3)) 


106 


56 


11792 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


100 


66 


11793 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


125 


61 


11794 


A20770 


synthetic 
construct 


(Aspl-Thrl61)-EPI 


95 


76 


11795 


S 74221 


Homo sapiens 


IK factor=cytokine down-regulating 
HLA class II 


108 


61 


1 1796 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


96 


77 


11797 


S58722 


Homo sapiens 


X-luiked retinopathy protein {C- 
terminal, clone XEH.8c} 


111 


65 


11798 


Y75995 


Homo sapiens 


Human skin cell protein, SEQ ID 
NO: 173. 


324 


87 


1 2 799 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-5 


120 


75 


11800 


L00693 


Homo sapiens 


carcinoembryonic antigen 


96 


85 


11801 


AL 160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


80 


44 


11802 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


125 


64 


11803 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


142 


82 


11804 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


107 


85 


11805 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


152 


80 


11806 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


122 


61 


11807 


U91985 


Homo sapiens 


DNA fragmentation factor-45 


585 


92 


11808 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


125 


60 


11809 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


170 


89 


11810 


Y87156 


Homo sapiens 


Human secreted protein sequence SEQ 
IDNO:195. 


191 


76 


11811 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


120 

- 


100 



450 



..0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQID 


Accession No. 




Description 


Smith- 
Waterman 


y 


11812 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


156 


55 


11813 


P92219 


Homo sapiens 
(human) 


CR1 protein. 


121 


84 


.11814 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


107 


75 


11815 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


74 


78 


11816 


M15530 


Homo sapiens 


B-cell growth factor 


14S 


68 


11817 


AF090942 


Homo sapiens 


PRO0657 


139 


60 


11818 


W90847 


Homo sapiens 


Human lymphocyte targeted peptide 
#15. 


117 


100 


11819 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


133 


73 


11820 


Z72853 


Saccharomyces 
cerevisiae 


ORF YGR069w 


94 


42 


11821 


G03710 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7791. 


176 


65 


11822 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


129 


60 


11823 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCSl 1 037. 


104 


68 


11824 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


99 


51 


11825 


W90172 


Homo sapiens 


Human heart muscle specific protein. 


256 


34 


11826 


ABO 17007 


Homo sapiens 


PMS2L16 


248 


89 


11827 


AL 160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


127 


86 


11828 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


135 


71 


11829 


AF1 19851 


Homo sapiens 


PRO 1722 


129 


60 


11830 


AF 132200 


Homo sapiens 


PRO 1751 


120 


77 


11831 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


133 


83 


11832 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


128 


64 


11833 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


152 


100 


11834 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


156 


73 


11835 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


155 


88 


11836 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


134 


95 


11837 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO: 125. 


411 


91 


11838 


AK000496 


Homo sapiens 


unnamed protein product 


113 


78 


11839 


AF1 19855 


Homo sapiens 


PRO 1847 


84 


68 


11840 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


128 


88 


11841 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


113 


69 


11842 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


151 


55 


11843 


AF090942 


Homo sapiens 


PRO0657 


139 


65 


11844 


AF220264 


Homo sapiens 


MOST-1 


88 


69 


11845 


AF184612 


Drosophila 
melanogaster 


split ends 


95 


47 


11846 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


136 


82 



451 



WO 01/64835 



PCT/USO 1/0492 7 



NO: 


Accession No. 




Description 


Waterman 


Identit 

y 


11847 


ABO 17007 


Homo sapiens 


PMS2L16 


310 


100 


1 1848 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


100 


79 


11849 


W03642 


Homo sapiens 


Human cannabinoid GPRN-terminal 
sequence. 


111 


61 


11850 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


1 10 


72 


11851 


AF090931 


Homo sapiens 


PRO0483 


164 


80 


11852 


AF1 16661 


Homo sapiens 


PRO 143 8 


124 


67 


11853 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


119 


72 


11854 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


135 


59 


11855 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


100 


84 


11856 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


120 


54. 


11857 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


135 


68 


11858 


AK024455 


Homo sapiens 


FLJ00047 protein 


123 


69 


31859 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


94 


66 


11860 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029, 


94 


66 


11861 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


108 


75 


11862 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


108 


86 


11863 


AF 130050 


Homo sapiens 


PRO0872 


81 


58 


11864 


AF1 19900 


Homo sapiens 


PR02822 


152 


90 


11865 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


78 


45 


11866 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-2 


526 


93 


11867 


AP000616 


Oryza sativa 


similar to RTNG-H2 finger protein 
RHAIa(AF078683) 


123 


80 


11868 


G00591 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4672. 


119 


58 


11869 


Y 10830 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


246 


100 


11870 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


101 


59 


11871 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


120 


59 


11872 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


129 


84 


11873 


D28U3 


Homo sapiens 


MOBP 


458 


80 


11874 


D28114 


Homo sapiens 


MOBP 


306 


68 


11875 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


101 


85 


11876 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


110 


64 


11877 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


189 


57 


11878 


M15530 


Homo sapiens 


B-cell growth factor 


98 


62 


11879 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


130 


100 


11880 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


99 


70 


11881 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


101 


53 



_016483SA2_L> 



WO 01/64835 PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 




Description 


Smith- 
Waterman 


ldentit 

y 


1 1882 


G 003 92 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


219 


75 


11883 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


131 


61 


11884 


U93563 


Homo sapiens 


putative pi 50 


139 


75 


1 1 885 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhem) gene preoduct. 


154 


52 


11886 


AF090944 


Homo sapiens 


PRO0663 


149 


80 


1 1887 


AP000616 


Oryza sativa 


similar to R1NG-H2 finger protein 
RHAla(AF078683) 


107 


94 


1 1888 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


138 


62 


31889 


L26953 


Homo sapiens 


chromosomal protein 


123 


75 


1 1890 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


105 


69 


1 1891 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


129 


83 


11892 


R10755 


Homo sapiens 


Non-A non-B hepatitis specific 
antigenic protein encoded by 
phageclone lambda HC2533. 


126 


78 


11893 


G00481 


Homo sapiens 


Human secreted protein, SEQ JD NO: 
4562. 


82 


70 


1 1894 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


149 


90 


1 1895 


S58722 


Homo sapiens 


X-Iinked retinopathy protein {C- 
terminal, clone XEH.Sc} 


118 


75 


1 1 896 


G02451 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6532. 


167 


86 


1 1897 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 6 


136 


77 


11898 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


126 


58 


1 1899 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


158 


78 


1 1900 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


137 


71 


11901 


G00613 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4694. 


85 


62 


11902 


G02558 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6639. 


242 


73 


11903 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


127 


59 


11904 


G03472 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7553. 


182 


85 


11905 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


135 


75 


1 1906 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


128 


79 


1 1907 


U93574 


Homo sapiens 


putative pi 50 


169 


64 


11908 


AF090944 


Homo sapiens 


PRO0663 


128 


75 


11909 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


100 


84 


1 1910 


AF090930 


Homo sapiens 


PRO0478 


159 


76 


11911 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-2 


496 


88 


11912 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-2 


551 


92 


11913 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 


168 


76 


11914 


K02401 


Homo sapiens 


chorionic somatomammotropin 


450 


85 


11915 


K02401 


Homo sapiens 


chorionic somatomammotropin 


478 


87 


11916 


K02401 


Homo sapiens 


chorionic somatomammotropin 


454 


82 



453 



NSDOCID: <WO 0164835A2_I_» 



WO 01/64835 



PCT/USO 1/04927 



NO: 




Species 


Description 


Waterman 
Score 


Identit 

y 


11917 


G02382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6463. 


121 


■73 


11918 


X65121 


Mus musculus 


alpha 1 (X) collagen 


112 


35 


11919 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


115 


62 


11920 


AK024455 


Homo sapiens 


FLJ00047 protein 


102 


76 


11921 


AF1 50087 


Homo sapiens 


small zinc finger-like protein 


181 


52 


11922 


U55376 


Caenorhabditis 
elegans 


F16H1 1.2 gene product 


122 


78 


11923 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
4473. 


256 


80 


11924 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


127 


74 


11925 


AF090944 


Homo sapiens 


PRO0663 


144 


75 


11926 


AJ223475 


Escherichia coli 


InsA protein 


477 


100 


11927 


M15386 


Homo sapiens 


gamma-globin 


616 


92 


11928 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
7871. 


105 


72 


11929 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


122 


92 


11930 


AF090895 


Homo sapiens 


PRO0117 


80 


71 


11931 


AF 130089 


Homo sapiens 


PRO2550 


129 


68 


11932 


AK025I16 


Homo sapiens 


unnamed protein product 


113 


53 


11933 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO: 125. 


114 


70 


11934 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


69 


76 


11935 


AF1 16695 


Homo sapiens 


PR02221 


336 


70 


11936 


AF090931 


Homo sapiens 


PRO0483 


133 


77 


11937 


AF130089 


Homo sapiens 


PRO2550 


135 


72 


11938 


L06237 


Homo sapiens 


microtubule-associated protein IB 


143 


36 


11939 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


124 


72 


11940 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


158 


76 


11941 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


130 


53 


11942 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


111 


65 


11943 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


106 


59 


11944 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


117 


65 


11945 


D28113 


Homo sapiens 


MOBP 


186 


62 


11946 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


353 


76 


11947 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


113 


72 


11948 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 6 


139 


87 


11949 


AJ006591. 


Homo sapiens 


cysteine-rich protein 


148 


76 


11950 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminai, clone XEH.8c} 


113 


88 


11951 


AF 116695 


Homo sapiens 


PR02221 


205 


57 


11952 


Y95028 


Homo sapiens 


Human clone vp7 1 ORF2, SEQ ID 
NO: 128. 


158 


75 


11953 


AF1 13685 


Homo sapiens 


PRO0974 


107 


72 


11954 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


107 


80 


11955 


AK026107 


Homo sapiens 


unnamed protein product 


129 


84 


11956 


AF 109907 


Homo sapiens 


S164 


140 


96 



454 



SDOCID: <WO 01 64835A2J_> 



WO 01/64835 
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SEQID 

NO: 


Accession No. 


Species 


Description 


Waterman 
Score 


% 

luentit 

y 


11957 


U23455 


Caenorhabditis 
elegans 


similar to D. melanogaster homeotic 
protein BarH2 (PIR:A41726) 


99 


100 


11958 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


124 


41 


11959 


K02401 


Homo sapiens 


chorionic somatomammotropin 


497 


90 


11960 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


155 


69 


11961 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


126 


85 


11962 


M15530 


Homo sapiens 


B-cell growth factor 


114 


86 


11963 


K0240I 


Homo sapiens 


chorionic somatomammotropin 


472 


87 


11964 


AK024455 


Homo sapiens 


FLJ00047 protein 


95 


61 


11965 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


145 


78 


11966 


L27428 


Homo sapiens 


reverse transcriptase 


144 


50 


11967 


G 025 32 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


187 


76 


11968 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCS 1 1 037. 


124 


81 


11969 


G 0063 7 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


112 


64 


11970 


AK024455 


Homo sapiens 


FLJ00047 protein 


120 


70 


1 1971 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


105 


61 


11972 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


106 


70 


11973 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


70 


75 


11974 


G00491 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4572. 


86 


60 


11975 


G 03 790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


161 


72 


11976 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


105 


42 


11977 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


121 


58 


11978 


AK024372 


Homo sapiens 


unnamed protein product 


104 


80 


11979 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


137 


77 


11980 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


96 


90 


11981 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


100 


56 


11982 


R59842 


Homo sapiens 


ApoE4LI protease. 


129 


62 


11983 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


132 


53 


11984 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


105 


90 


11985 


M15530 


Homo sapiens 


B-cell growth factor 


117 


43 


11986 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


99 


65 


11987 


R59843 


Homo sapiens 


ApoE4Lx2 protease. 


117 


84 


11988 


L26953 


Homo sapiens 


chromosomal protein 


124 


79 


11989 


U93564 


Homo sapiens 


p40 


523 


88 


11990 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


155 


82 


11991 


AF090894 


Homo sapiens 


PRO0113 


126 


56 


11992 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


129 


6 3 


11993 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


144 


67 


11994 


U71363 


Homo sapiens 


zinc finger protein zfp6 


213 


68 



455 



0 1 64835A2_I_> 



WO 01/64835 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

luentit 

y 


11995 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


82 


65 


1 1996 


AF130089 


Homo sapiens 


PRO2550 


145 


68 


1 1997 


R95913 


Homo sapiens 


Neural thread protein. 


114 


88 


11998 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


122 


56 


11999 


X62677 


Oryctolagus 
cunicuJus 


retrovirus related reverse transcriptase 


147 


81 


12000 


AF090931 


Homo sapiens 


PRO0483 


152 


81 


12001 


AF090931 


Homo sapiens 


PRO0483 


145 


81 


12002 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


124 


64 


12003 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


103 


78 


12004 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


133 


51 


12005 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


108 


63 


12006 


LI 9527 


Homo sapiens 


ribosomal protein L27 


547 


93 


12007 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


127 


82 


12008 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


127 


82 


12009 


M55409 


Homo sapiens 


pancreatic tumor-related protein 


257 


98 


12010 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


127 


64 


12011 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


291 


89 


12012 


X03717 


Homo sapiens 


pot. unidentified reading frame 


126 


43 


12013 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


131 


80 


12014 


AB011148 


Homo sapiens 


KJAA0576 protein 


135 


84 


12015 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


114 


87 


12016 


AK024455 


Homo sapiens 


FLJ00047 protein 


102 


70 


12017 


L27428 


Homo sapiens 


reverse transcriptase 


144 


87 


12018 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


135 


87 


12019 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


137 


93 


12020 


AK001363 


Homo sapiens 


unnamed protein product 


490 


100 


12021 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


111 


79 


12022 


W00838 


Homo sapiens 


Tumour necrosis factor-related gene 
product. 


106 


87 


12023 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


157 


58 


12024 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


87 


66 


12025 


R95913 


Homo sapiens 


Neural thread protein. 


157 


45 


12026 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


138 


52 


12027 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


116 


60 


12028 


V00488 


Homo sapiens 


alpha globin 


191 


100 


12029 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


102 


87 


12030 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


114 


79 


12031 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


142 


86 


12032 


X01703 


Homo sapiens 


alpha-tubulin 


534 


93 


12033 


U09823 


Oryctolagus 
cuniculus 


elongation factor I alpha 


557 


90 j 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 

Wotermnn 

Score 


% 
y 


12034 


X01703 


Homo sapiens 


alpha-tubulin 


619 


94 


12035 


AF255556 


Notothenia 
coriiceps 


alpha tubulin 


430 


85 


12036 


X53618 


Paracentrotus 
lividus 


alpha-tubulin (AA 1-452) 


437 


87 




IV10Z51U 


Homo sapiens 


mitochondrial transcription factor 1 


- 


95 






Homo sapiens 


Human secreted protein #28. 




75 


12039 


X01703 


Homo sapiens 


alpha-tubulin 


604 


97 






Homo sapiens 


alpha-tubulin 


517 


90 


12041 


S70154 


Homo sapiens 


cytosolic acetoaceryl-coenzyme A 
thiolase, CT {EC 2.3. 1.9} 


533 


94 


12042 


X01703 


Homo sapiens 


alpha-tubulin 


472 


85 


12043 


M23613 


Homo sapiens 


nucleophosmin 


428 


80 


12044 


AL03 1 1 74 


Schizosaccharom 
yces pombe 


hypothetical protein 


102 


52 


12045 


Y94653 


Homo sapiens 


Human netrin-like protein (NEL) 
amino acid sequence. 


152 


82 


12046 


AFO 16507 


Homo sapiens 


C-terminal binding protein 2 


• 172 


100 


12047 


X05196 


Homo sapiens 


aldolase C 


454 


93 


12048 


X03796 


Mus musculus 


aldolase C (aa 1-227) 


351 


74 


12049 


X05196 


Homo sapiens 


aldolase C 


408 


88 


12050 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


147 


70 


12051 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminaJ, clone XEH.8c} 


169 


55 


12052 


AF090944 


Homo sapiens 


PRO0663 


119 


59 


12053 


V01577 


Homo sapiens 


variable region 


223 


93 


12054 


G0341 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


144 


74 


12055 


AF090931 


Homo sapiens 


PRO0483 


121 


74 


12056 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


105 


76 


12057 


U18339 


Variola virus 


D4L 


100 


52 


12058 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


135 


62 


12059 


ABO 01 684 


Chlorella 
vulgaris 


ORF41c 


81 


75 


12060 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


128 


74 


12061 


AK024455 


Homo sapiens 


FLJ00047 protein 


70 


46 


12062 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


130 


69 


12063 


AF090944 


Homo sapiens 


PRO0663 


110 


66 


12064 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


109 


53 


12065 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


94 


79 


12066 


AF09093 1 


Homo sapiens 


PRO0483 


123 


70 


12067 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


111 


75 


12068 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


144 


77 


12069 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


76 


12070 


G03639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7720. 


114 


83 


12071 


AF1 16715 


Homo sapiens 


PR02829 


138 


63 


12072 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


127 


82 


12073 


U79260 


Homo sapiens 


unknown 


98 


73 


12074 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


132 


80 
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SEQ JD 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 
y 








clone HPMBQ32. 






12075 


J02459 


bacteriophage 
lambda 


T (tail component; 144) 


555 


98 


12076 


AK024455 


Homo sapiens 


FLJ00047 protein 


127 


65 


12077 


X00911 


Rattus 
norvegicus 


pot. MSA-precursor 


190 


73 


12078 


AF1 64797 


Homo sapiens 


ribosomal protein L17 isolog 


538 


95 


12079 


A06977 


Homo sapiens 


albumin 


602 


87 


12080 


AF000198 


Caenorhabditis 
elegans 


Similar to cuticular collagen 


118 


38 


12081 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


142 


86 


12082 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


103 


50 


12083 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


142 


86 


12084 


G00413 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4494. 


64 


54 


12085 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


132 


53 


12086 


A 14829 


Homo sapiens 


preproapolipoprotein 


619 


82 


12087 


AF1 19900 


Homo sapiens 


PR02822 


107 


76 


12088 


U52077 


Homo sapiens 


mariner transposase 


277 


74 


12089 


AP000616 


Oryza sativa 


similar to RING-H2 Finger protein 
RHAla(AF078683) 


126 


100 


12090 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


134 


92 


12091 


W05278 


Homo sapiens 


Tumour necrosis factor-related gene 
product CL6.5.40. 


115 


42 


12092 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


112 


81 


12093 


G03343 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7424. 


114 


64 


12094 


AJ237660 


Bacteriophage 
21 


Ren protein 


187 


94 


12095 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


135 


80 


12096 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


161 


49 


12097 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


122 


92 


12098 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


148 


59 


12099 


AF090942 


Homo sapiens 


PRO0657 


115 


70 


12100 


AF090931 


Homo sapiens 


PRO0483 1 


112 


74 


12101 


AF090944 


Homo sapiens 


PRO0663 


103 


37 


12102 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


76 


58 


12103 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


130 


80 


12104 


AL 160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


106 


84 


12105 


R95913 


Homo sapiens 


Neural thread protein. 


105 


75 


12106 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


149 


75 


12107 


Y09561 


Homo sapiens 


A'l P receptor 


188 


85 


12108 


AF090895 


Homo sapiens 


PRO0117 


107 


83 


12109 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


117 


65 


12110 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


65 
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SEQID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Itlcntit 

y 








6292. 






12111 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


117 


69 


12112 


Y 12478 


Homo sapiens 


congenital heart disease 5 protein 


291 


62 


12113 


AF220264 


Homo sapiens 


MOST-1 


142 


76 


12114 


Y12077 


Homo sapiens 


Human 5' EST secreted protein SEQ ID 
NO: 390. 


109 


95 


12115 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


171 


63 


12116 


AF064869 


Rattus 
norvegicus 


brain-enriched guanylate kinase- 
associated protein 2; BEGA2 


294 


98 


121 17 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


128 


36 


12119 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


102 


73 


12120 


U79260 


Homo sapiens 


unknown 


98 


65 


12121 


AF161356 


Homo sapiens 


HSPC093 


118 


67 


12122 


AF130087 


Homo sapiens 


PR02411 


115 


80 


12123 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


280 


75 


12124 


S48406 


Mus sp. 


alpha 1 (XII) collagen {triple-helical 
domain COL2} 


59 


41 


12125 


AB030816 


Homo sapiens 


H-REV107 protein-related protein 


420 


79 


12126 


R96418 


Homo sapiens 


Partial human transforming growth 
factor beta receptor type II. 


169 


100 


12127 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


91 


46 


12128 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


109 


83 


12129 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


127 


57 


12130 


AF1 19900 


Homo sapiens 


PR02822 


108 


79 


12331 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


127 


70 


12132 


AF303828 


Mus musculus 


ubc-like protein MMS2 


131 


43 


12133 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


127 


58 


12134 


U49974 


Homo sapiens 


mariner transposase 


124 


80 


12135 


X52164 


Mus musculus 


Q300 protein (AA 1-77) 


106 


94 


12136 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


151 


69 


12137 


AF192913 


Homo sapiens 


zinc finger protein ZNF1 80 


581 


94 


12138 


X58907 


Homo sapiens 


steroid 2 1-monooxygenase 


128 


58 


12139 


AF09093 1 


Homo sapiens 


PRO0483 


121 


70 


12140 


AK024455 


Homo sapiens 


FLJ00047 protein 


84 


60 


12141 


AF090931 


Homo sapiens 


PRO0483 


122 


71 


12142 


G009S5 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


153 


60 


12143 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


121 


100 


12144 


G02922 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7003. 


191 


62 


12145 


U57092 


Homo sapiens 


Rab30 


257 


87 


12146 


AF1 30089 


Homo sapiens 


PRO2550 


129 


74 


12147 


AF090931 


Homo sapiens 


PRO0483 


138 


80 


12148 


G02382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6463. 


135 


60 


12149 


G02532 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6613. 


138 


60 
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SEQ ID 

NO: 


Accession No. 


Specks 


Description , , 


Smith- 
Waterman 


% 


12150 


U51723 


Plasmodium ■ 
vivax 


V-SERA 1 


107 


37 


12151 


G 03 789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


82 


68 


12152 


Y23884 


Homo sapiens 


Amino acid sequence of FK506 
binding protein 65 (FKBP65). 


124 


69 


12153 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID "NO: 
6571. 


93 


70 


12154 


AF229067 


Homo sapiens 


PADI-H protein 


162 


68 


12155 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


125 


86 


12157 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


113 


39 


12158 


U89439 


Bos taurus 


ubiquitin-like protein 


146 


76 


12159 


G03725 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7806. 


677 


98 


12160 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


124 


90 


32161 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


109 


80 


12162 


AF090931 


Homo sapiens 


PRO0483 


125 


76 


12163 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


191 


64 


12164 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


96 


61 


12165 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


115 


68 


12166 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


219 


93 


12167 


AF1 18086 


Homo sapiens 


PRO 1992 


155 


78 


12168 


X12789 


Mus musculus 


cytokeratin 8 (AA 1 - 489) 


174 


63 


12169 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


94 


48 


12170 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


150 


57 


12171 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


141 


78 


12172 


AF068294 


Homo sapiens 


HDCMB45P 


184 


56 


12173 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


116 


50 


12174 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


107 


59 


12175 


X92485 


Plasmodium 
vivax 


pval 


108 


44 


12176 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


102 


84 


12177 


X83703 


Homo sapiens 


nuclear protein 


330 


68 


12178 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


147 


50 


12179 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


103 


73 


12180 


AF132200 


Homo sapiens 


PRO 1751 


131 


64 


12181 


AC005534 


Homo sapiens 


CTF5 


261 


90 


12182 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


156 


75 


12183 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


137 


75 


12184 


P92219 


Homo sapiens 
(human) 


CR1 protein. 


101 


74 


12185 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
66S8. 


156 


56 



460 . 



_0164835A2_L> 
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NO: 




Species 


Description 


Waterman 
Score 


Idcntit 

y 


12186 


G02213 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6294. 


626 


97 


12187 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


96 


51 


12188 


. AF 118067 


Homo sapiens 


PRO 1578 


89 


73 


12189 


M63154 


Homo sapiens 


intrinsic factor 


130 


96 


12190 


AF090942 


Homo sapiens 


PRO0657 


144 


72 


12191 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


76 


12192 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


134 


63 


12193 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


159 


76 


12194 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


117 


76 


12195 


AF090931 


Homo sapiens 


PRO0483 


134 


62 


12196 


AF 130089 


Homo sapiens 


PRO2550 


117 


74 


12197 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


113 


57 


12198 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


126 


70 


12199. 


J02459 


bacteriophage 
lambda 


Fi (DNA packaging;] 17) 


203 


95 


12200 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


135 


58 


12201 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


132 


86 


12202 


AF 119900 


Homo sapiens 


PR02822 


148 


68 


12203 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


128 


59 


12204 


R59842 


Homo sapiens 


ApoE4Ll protease. 


118 


91 


12205 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


121 


72 


12206 


S75997 


Rattus sp. 


nucleoporin p62 homolog 


131 


65 


12207 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


145 


65 


1220S 


S58722 


Homo sapiens 


X-linked retinopathy protein (C- 
terminal, clone XEH.8c} 


145 


86 


12209 


AJ242956 


Homo sapiens 


El fusion protein 


96 


88 


12210 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


132 


78 


12211 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


125 


81 


12212 


K03179 


Homo sapiens 


pro-alpha- 1 type-I collagen 


96 


37 


12213 


G01895 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5976. 


310 


57 


12214 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


493 


82 


12215 


R13144 


Homo sapiens 


Deleted in Colorectal Carcinomas. 


700 


98 


12216 


AF 132200 


Homo sapiens 


PRO 1751 


109 


61 


12217 


AB013897 


Homo sapiens 


HKR1 


324 


64 


12218 


AF071172 


Homo sapiens 


HERC2 


123 


86 


12219 


L38593 


Homo sapiens 


integral membrane protein 


78 


62 


12220 


AF036233 


Homo sapiens 


cdc25B phosphatase 


204 


51 


12221 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1 7. 


107 


60 


12222 


AL049795 


Homo sapiens 


dJ622L5.9 (eukaryotic translation 
initiation factor 3, subunit 2 (beta, 
36kD) (TR1P-1, TGF-beta receptor 
interacting protein 1)) 


163 


86 


12223 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


120 


76 
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I Accession No. 


Species 


Description 


Wurman 




NO: 










Idenlit 
en i 










Score"" 3 " 




12224 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


64 








7871. 






J2225 


AF130079 


Homo sapiens 


PR02852 


103 


80 


12226 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


163 


S6 








4457. 






12227 


AF119855 


Homo sapiens 


PRO 1847 


96 


78 


12228 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


195 


87 








4473. 






12229 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


103 


90 








gene 44 clone HTDAD22. 






12230 


U49957 


Homo sapiens 


LIM protein 


212 


100 


12231 


AK022759 


Homo sapiens 


unnamed protein product 


646 


100 


12232 


Y0115S 


Homo sapiens 


Secreted protein encoded by gene 1 8 


110 


70 








clone HCACJ81. 






12233 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


109 


74 








gene No. 107. 






12234 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 


123 


76 








sequence. 






12235 


AF220264 


Homo sapiens 


MOST-1 


104 


85 


12236 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


113 


87 








terminal, clone XEH.8c} 






12237 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


84 








7881. 






12238 


AF090942 


Homo sapiens 


PRO0657 


103 


66 


12239 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


72 








4435. 






12240 


Y36495 


Homo sapiens 


Fragment of human secreted protein 


135 


75 








encoded by gene 27. 






12241 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 


136 


95 








RHAla(AF078683) 






12242 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


148 


72 








terminal, clone XEH.8c} 






12243 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


118 


39 


12244 


AL390114 


Leishmania 


probable (hhv-6) ul 102, variant a 


78 


100 






major 


DNA, complete virion genome 






12245 


AC005200 


Homo sapiens 


plasmalemmal porin 


131 


79 


12246 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


65 








6571. 






12247 


W48351 


Homo sapiens 


Human breast cancer related protein 


90 


55 








BCRB2. 






12248 


Y02999 


Homo sapiens 


Fragment of human secreted protein 


77 


64 








encoded by gene 121. 






12249 


U63542 


Homo sapiens 


FAP protein 


108 


56 


12250 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


101 


75 








terminal, clone XEH.8c} 






12251 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


139 


63 


12252 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


71 








5066. 






12253 


AF1 18086 


Homo sapiens 


PRO 1992 


73 


70 


12254 


AF083929 


Mus musculus 


ES18 


108 


44 


12255 


AF130089 


Homo sapiens 


PRO2550 


124 


88 


12256 


AF090931 


Homo sapiens 


PRO0483 


132 


80 


12257 


X95190 


Homo sapiens 


branched chain acyl-CoA oxidase 


166 


86 


12258 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


53 








7868. 






12259 . 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


88 








7124. 






12260 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


85 



462 
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NO: 


Accession No. 


Species 
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Waterman 
-Score 


Idcntit 

y 








4409. 






12261 


G0221 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


133 


72 


12262 


GO 1 828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


110 


70 


12263 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


125 


85 


12264 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


126 


65 


12265 


AP000616 


Oryza sativa 


similar to RTNG-H2 finger protein 
RHAla(AF078683) 


114 


95 


12266 


AK024455 


Homo sapiens 


FLJ00047 protein 


108 


63 


12267 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


166 


73 


12268 


AF090931 


Homo sapiens 


PRO0483 


108 


82 


12269 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


120 


68 


12270 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


120 


66 


12271 


Y8624S 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


148 


74 


12272 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


129 


60 


12273 


AF 130089 


Homo sapiens 


PRO2550 


134 


79 


12274 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


118 


80 


12275 


AF 194537 


Homo sapiens 


NAG 13 


151 


52 


12276 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


75 


68 


12277 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


113 


60 


12278 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


111 


70 


12279 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


119 


92 


12280 


G037J4 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


108 


90 


12281 


U66464 


Homo sapiens 


hematopoietic progenitor kinase 


102 


83 


12282 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


118 


65 


12283 


W00838 


Homo sapiens 


Tumour necrosis factor-related gene 
product. 


162 


82 


12284 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


130 


77 


12285 


AF1 16661 


Homo sapiens 


PRO 143 8 


104 


74 


12286 


G00379 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4460. 


98 


66 


12287 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


105 


74 


12288 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


129 


78 


12289 


X53581 


Rattus 
norvegicus 


ORF7 


111 


47 


12290 


AF090931 


Homo sapiens 


PRO0483 


147 


87 


12291 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


106 


69 


12292 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


137 


53 


12293 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


342 


56 



463 



WO 01/64835 



PCT/USOl/04927* 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 
Waterman 


ltlentit 

y 


12294 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


117 


60 


12295 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


1 12 


66 


12296 


G00397 . 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
447S. 


140 


62 


12297 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


144 


70 


12298 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


95 


5j 


12299 . 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


108 


70 


12300 • 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


122 


85 


12301 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


117 


73 


12302 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


1 15 




12303 


AF 130051 


Homo sapiens 


PRO0898 


1 16 


65 


12304 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


119 


60 


12305 


V00662 


Homo sapiens 


ATPase 6 


1 1 8 




12306 


AF 107406 . 


Homo sapiens 


GW128 


98 


66 


12307 . 


A05308 


synthetic 
construct 


gjioblastoms-derived T-cell suppressor 
factor 






32308 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 






12309 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 






12310 


U 12390 


Cloning vector 
pSportl 


beta-gal actosidase alpha peptide 


98 


47 


12311 


Y2757I 


Homo sapiens 


Human secreted protein encoded by 
gene No. 5. 


130 


— 


12312 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. ... 


134 


^76 


12313 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


98 


82 


12314 


U79260 


Homo sapiens 


unknown 


97 


64 


12315 


L27428 


Homo sapiens 


reverse transcriptase 


214 


44 


12316 


D00570 


Mus musculus 


open reading frame (196 AA) 


13 5 




12317 


AF090931 


Homo sapiens 


PRO0483 


104 


74 


12318 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. ' 


123 




12319 . 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 






12320 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


97 


62 


12321 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


95 


58 


12322 


G00362 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4443. 


104 


59 


12323 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


115 


75 


12324 


AF286472 


Homo sapiens 


retinitis pigmentosa GTPase regulator 


121 


53- 


12325 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


162 


75 


12326 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


129 


75 


12327 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. . 


125 


88 



464 
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Waterman 


% 

Identit 

y 


12328 


AF090944 


Homo sapiens 


PRO0663 


122 


76 


12329 


AF09093 1 


Homo sapiens 


PRO0483 


156 


81 


12330 


W00838 


Homo sapiens 


Tumour necrosis factor-related gene 
product. 


107 


80 


12331 


AF116715 


Homo sapiens 


PR02829 


127 


72 


12332 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


145 


49 


12333 


S80862 


Western equine 
encephalomyeliti 
s virus Western 
equine 

encephalomyeliti 
s virus, Peptide 
Partial, 259 aa 


nucleocapsid gene C 


108 


35 


12334 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


141 


83 


12335 


AK024455 


Homo sapiens 


FLJ00047 protein 


133 


71 


12336 


AF090944 


Homo sapiens 


PRO0663 


139 


84 


12337 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


124 


71 


12338 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCSl 1037. 


102 


69 


12339 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


121 


81 


12340 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


149 


54 


12341 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #153. 


134 


66 


12342 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


102 


60 


12343 


AF 118086 


Homo sapiens 


PRO 1992 


80 


63 


12344 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


104 


80 


12345 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7496. 


145 


72 


12346 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


143 


75 


12347 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


116 


37 


12348 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6613. 


176 


76 


12349 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


103 


48 


12350 


S58722 


Homo sapiens 


X-Jinked retinopathy protein {C- 
terminal, clone XEH.8c} 


101 


81 


12351 


Y02926 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 101. 


108 


70 


12352 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCSl 1037. 


101 


84 


12353 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


105 


42 


12354 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c) 


147 


66 


12355 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


125 


65 


12356 


R96S00 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


96 


68 


12357 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4435. 


122 


43 


12358 


AL1 60493 


Leishmania 
major 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


132 


38 


12359 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


90 
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Idenlit 








4478. 






12360 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


113 


56 


12361 


AB001684 


Chlorella 
vulgaris 


ORF49b 


92 


44 


12362 


U93564 


Homo sapiens 


P 40 


257 


84 


12363 


U60269 


Homo sapiens 


putative envelope protein; orf similar to 
env of Type A and Type B retroviruses 
and to class II HERVs 


427 


84 


12364 


G03064 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7145. 


102 


73 


12365 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


102 


68 


12366 


AF090944 


Homo sapiens 


PRO0663 


94 


45 


12367 


Y76198 


Homo sapiens 


Human secreted protein encoded by 
gene 75. 


150 


55 


12368 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


105 


85 


12369 


AF130089 


Homo sapiens 


PRO2550 


112 


78 


12370 


AF090895 


Homo sapiens 


PRO0117 


125 


64 


12371 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


121 


95 


12372 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


145 


72 


12373 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


144 


72 


12374 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


133 


79 


12375 


AP000001 


Pyrococcus 
horikoshii 


58aa long hypothetical protein 


96 


51 


12376 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


105 


76 


12377 


S58722 


Homo sapiens 


X-hnked retinopathy protein {C- 
terminal, clone XEH.Sc} 


123 


78 


12378 . 


Y02671 . 


Homo sapiens 


Human secreted protein encoded by 
gene22cloneHMSJW18. 


126 


83 


12379 


Y53643 


Homo sapiens 


A bone marrow secreted protein 
designated BMS6. 


329 


98 


12380 


U64094 


Homo sapiens 


soluble type II interleukin-1 receptor 


1 14 


100 


12381 


J02459 


bacteriophage 
lambda 


D (head-DNA stabilization;! 10) 


467 


100 


12382 


LI 0908 


Mus musculus 


Gcapl gene product 


100 


89 


12383 


AF220264 


Homo sapiens 


MOST-1 


113 


68 


12384 


U79260 


Homo sapiens 


unknown 


94 


60 


12385 


B0 1372 


Homo sapiens 


Neuron-associated protein. 


148 


52 


12386 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


116 


63 


12387 


G00952 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5033. 


125 


92 


12388 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


130 


65 


12389 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


150 


62 


12390 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


102 


38 


12391 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


113 


56 


12392 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413 


73 


.92 



466 



_016A835A2J_» 



WO 01/64835 



PCT/USO 1/0492 7 



SEQID 

NO: 


Accession No. 




Description 


Smith- 
Walerman 

~f^r 


% 

Identit 

y 


— 

- ^ — 


M69797 


— : 

omo sapiens 


Q 

— J . — — — — 




-Z| 






Homo sapiens 


Human nistiocyte-secreted iactor HSF. 


— — 




P3 95 — 








"rjr 


-~ 


12396 
■■ — 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


62 


l^gg — 


/\r 1 -suuoy 


Homo sapiens 




— 




_ 




AK023 117 


Homo sapiens 


unnamed protein product 




-~ 


12399 


AF1 23652 


Homo sapiens 


FEZ1 


157 


96 


12400 


D3 8112 


Homo sapiens 


NADH dehydrogenase subunit6 


139 


87 


12401 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


306 


60 


12402 


AC00503 1 


Homo sapiens 


neuronal apoptosis inhibitory protein 


143 


96 


12403 


J0307 1 


Homo sapiens 


chorionic somatomammotropin CS-2 


585 


100 


12405 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


125 


100 


12406 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct. 


150 


63 


12407 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


157 


85 


12408 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


120 


75 


12409 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


141 


50 


12410 


G03465 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7546. 


103 


51 


12411 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


147 


71 


12412 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


159 


70 


12413 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


129 


79 


12414 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


156 


72 


12415 


G 02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


74 


72 


12416 


AL390114 


Leishmania 
major 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


102 


50 


12417 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


171 


75 


12418 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


137 


55 


12419 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


1 1 1 


52 




R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


129 


53 


12421 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


130 


55 


12422 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


126 


87 


12423 


U93564 


Homo sapiens 


p40 


350 


93 


12424 


M17885 


Homo sapiens 


acidic ribosomal phosphoprotein (P0) 


484 


88 


12425 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


130 


69 


12426 


X60376 


Brassica napus 


prolinc-rich protein 


89 


47 


12427 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


83 


88 


12428 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


117 


75 


12429 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


107 


67 



467 



0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQID 

NO: 


Accession No. 




Description 


Smith- 
Waterman 


ldcntit 


12430 


AK027208 


Homo sapiens 


unnamed protein product 


108 


64 


12431 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


154 




12432 


AF 130089 


Homo sapiens 


PRO2550 


1 18 


88 


12433 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


90 


55 


12434 


AK024455 


Homo sapiens 


FLJ00047 protein 


1 13 ' 


75 


12435 


AP000616 


Oryza sativa 


similar to RING-H2 Finger protein 
RHAla(AF078683) 


107 


80 


12436 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


1 14 


75 


12437 


AC005498 


Homo sapiens 


R31665 2 


163 


75 


12438 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


132 


100 


12439 


G03I16 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7197. 


145 


84 


12440 


AK024455 


Homo sapiens 


FLJ00047 protein 


136 


73 


32441 


G00397 


Homo sapiens 


Human secreted protein SEQ ID NO* 
4478. 


122 


69 


12442 


AF090942 


Homo sapiens 


PRO0657 


120 


72 


12443 


G02902 




6983. P 






12444 


AF1 30089 




PRO2550 






12445 


AL021395 


Homo sapiens 


isomerase (cyclophilin)) 


225 





12446 


AF09093 1 


Homo sapiens 


PRO0483 







12447 


G02515 


Homo sapiens 


6596. 


130 




12448 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


122 


74 


12449 


AF1 19855 


Homo sapiens 


PRO 1847 


88 


72 


12450 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


147 


84 


12451 


AK024455 


Homo sapiens 


FLJ00047 protein 


78 


68 


12452 


AF116712 


Homo sapiens 


PR02738 






12453 


G03787 


Homo sapiens 


Human secreted protein SEQ ID NO* 
7868. 


109 


iP 


12454 


G037I4 


Homo sapiens 


Human secreted protein SEQ ID NO' 
7795. 






12455 


AK024455 


Homo sapiens 


FLJ00047 protein 






12456 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


166 


72 


12457 


AF090944 


Homo sapiens 


PRO0663 


129 


79 


12458 


G02501 




6582. 






12459 


R22278 


Homo sapiens 


Human gp. -specific component 
phenotype Gel. 






12460 


G02480 


Homo sapiens 


6561. 






12461 


Y87156 


Homo sapiens 


ED NO: 195. ^ ^ ^ 


111 


61 


12462 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


147 


84 


12463 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
7888. 


94 


38 


12464 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


134 


74 


12465 


AF200715 


Homo sapiens 


PTB domain adaptor protein CED-6 


332 


95 


12466 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


64 I 



468 



0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Waterman 
Score 


Identit 

y 








7867. 






12467 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


I 17 


55 


12468 


AF 199023 


Homo sapiens 


phospholipid scramblase 4 


476 


98 


12469 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


104 


89 


12470 


AF108841 


Homo sapiens 


pol protein 


306 


87 


1247 1 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


1 1 1 


79 


12472 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


124 


78 


12473 


U18339 


Variola virus 


D4L 


117 


60 


12474 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


124 


63 


12475 


AF1 30089 


Homo sapiens 


PRO2550 


105 


52 


12476 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


80 


62 


12477 


M91242 


Rattus 
norvegicus 


calcium channel alpha- 1 subunit 


156 


68 


12478 


AFl 30089 


Homo sapiens 


PRO2550 


154 


79 


12479 


Ml 5530 


Homo sapiens 


B-cell growth factor 


154 


70 


12480 


U39904 


Mus musculus 


citron 


718 


97 


12481 


R99364 


Homo sapiens 


Human REST protein DNA binding 
domain. 


229 


37 


12482 


AL160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


97 


54 


12483 


AF161356 


Homo sapiens 


HSPC093 


94 


55 


12484 


Y69166 


Homo sapiens 


A mature human N-acetylglycosaminyl 
transferase protein. 


97 


75 


.12485 


AF090942 


Homo sapiens 


PRO0657 


103 


55 


12486 


U79260 


Homo sapiens 


unknown 


98 


74 


12487 


AL 160493 


Leishmania 
major 


probable (hhv-6) ul 1 02, variant a 
DNA, complete virion genome 


96 


53 


12488 


U37100 


Homo sapiens 


aldose reductase-like peptide 


400 


97 


12489 


U93569 


Homo sapiens 


putative pi 50 


141 


56 


12490 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


104 


66 


12491 


R59842 


Homo sapiens 


ApoE4Ll protease. 


86 


89 


12492 


Y00876 


Homo sapiens 


Human LAPH-1 protein sequence. 


219 


45 


12493 


AK024455 


Homo sapiens 


FLJ00047 protein 


85 


53 


12494 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


95 


63 


12495 


Y36722 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 98. 


216 


60 


12496 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


102 


57 


12497 


Y77551 


Homo sapiens 


C-terminal domain of betalc integrin. 


132 


67 


12498 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


122 


69 


12499 


AF161356 


Homo sapiens 


HSPC093 


98 


51 


12500 


G 02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


150 


81 


12501 


G00492 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4573. 


97 


52 


12502 


AFl 32200 


Homo sapiens 


PRO 1751 


94 


69 


12503 


GO0637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


76 


65 


12504 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


99 


75 



469 



0 1 6483SA2_I_> 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 




NO: 








Waterman 


Identir 


— 









Score 


>' 




ivi i yo j 1 


Rattus 


- ..- — 

fos-related antigen 


132 


92 


-■ — 




norvegicus 








12506 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 13 


48 









4435. 






.^flft — 




Homo sapiens 


KSRP 


1 17 


48 






Homo sapiens 


Human N-proteinase (70 kDa short 


121 


62 


— 






form). 






12509 


ArUUOo I O 


: 

Oryza sativa 


similar to RTNG-H2 finger protein 


124 


66 








RHAla (AF078683) 










Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


80 








7 ^5?" : 






^25Y1 — 




Homo sapiens 


BC-2 protein 


516 


98 


12512 


Y 12950 


Homo sapiens 


Amino acid sequence of a human 


96 


50 






__ . 


secreted peptide. 






12513 — 


rfr 4R7 


Homo sapiens 


Human secreted protein, SEQ ID NO: 




121 


45 






__ . 








12514 


AL 162044 


liomo sapiens 


hypothetical protein 





48 


[|^| — 




Homo sapiens 


PR02852 




96 


73 


^ j-H — 




Homo sapiens 


Human phospholipase 2 HPPL2. 


113 


77 


2517 


D J 2202 


Homo sapiens 


alpha 1 C adrenergic receptor isoform 2 


164 


86 


12518 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


85 


— 


—————— 





7001 ' 






12519 


U00694 


Gallus gallus 


vitamin D3 hydroxylase associated 


187 


45 


... 






protein 






12520 


AB030829 


Rattus 


carbonic anhydrase III 


503 


63 


— 




norvegicus 








— 




Bos taurus 


32 kd accessory protein 


618 


95 


— 


M3634 1 


Homo sapiens 


ADP-ribosylation factor 4 


374 


93 


— 


A T7AQrtO'2 1 

Aruyuyj i 


Homo sapiens 


PRO0483 


163 


82 


'— . ... 


AB007925 


Homo sapiens 


KIAA0456 protein 


130 


62 


125-5 


G02314 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


79 






. 


6395. 






mi~ — 


Ar 1 /ay^o 


Homo sapiens 


TALE homeobox protein Meis2a 


321 


93 


125 


ALJ Ji» loz. 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


.156 


85 








complete virion genome. 






OCTO 

12528 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


57 




- 


.. 


7492. 






— too — 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


68 


- 






6953. 






12530 


AK.U24J /z 


Homo sapiens 


unnamed protein product 


1 12 


56 


12531 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


76 


65 






._ 


encoded by gene 17. 






2532 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


66 


■ — 






4409 - . 








VJUjoU/ 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


53 














— 

~ — 




Homo sapiens 


PRO0657 


83 


60 


535 


Vv'JojjJ 


Homo sapiens 


Human breast cancer related protein 


136 


65 










BCRB2. 






12536 


R95913 


Homo sapiens 


Neural thread protein. 


106 


52 


12537 


AF1 30089 


Homo sapiens 


PRO2550 


148 


84 


12538 


AF102826 


Homo sapiens 


RD1 14/siniian type D retrovirus 


605 


91 








receptor 






12539 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


143 


52 








SEQ ID NO.T 63. 






12540 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


.65 








4435. 






12541 


G03600 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


209 


92 








7681. 







470 



0164835A2_I_> 



WO 01/64835 PCT/USfl 1/04927 



SEQID 

NO: 


Accession No. 


Species 


Description 


Smith- 
■\Vnterman 


Idtntit 


12542 


M31964 


Saimiriine 
herpesvirus 2 


collagen-like protein 


92 


52 


12543 


G03714 




Human secreted protein, SEQ ID NO: 
7795. 


146 


76 


12544 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


155 


72 


12545 


AC004912 


Homo sapiens 


similar to CR16, SH3 domain binding 
proteinj similar to 2205340A 
(PID:gl 587070) 


442 


98 


12546 


V00662 


Homo sapiens 


URF A6L (NADH dehydrogenase 
subunit) 


175 


91 


12547 


AF1 30089 


Homo sapiens 


PRO2550 


112 


40 


12548 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


109 


48 


12549 


X551 10 


Homo sapiens 


neurite outgrowth-promoting protein 


479 


72 


12550 


X98296 


Homo sapiens 


ubiquitin hydrolase 


500 


88 


12551 


R943 I 7 


Homo sapiens 


Hepatocyte proliferation substance HP- 
041V. 


390 


92 


12552 


GO 1623 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5704. 


142 


90 


12553 


D86966 


Homo sapiens 


similarto human ZFY protein. 


161 


56 


12554 


D00570 


Mus musculus 


open reading frame (196 A A) 


122 


67 


12555 


AF1 30089 


Homo sapiens 


PRO2550 


94 


88 


12556 


AF043 1 84 


Homo sapiens 


T cell receptor beta chain 


607 


84 


12557 


U93564 




p40 


154 


100 


12558 


U42026 


Homo sapiens 


plasma membrane Ca2+-ATPase 


200 


97 


12559 


g7941Q 


Mus musculus 


nuclear tocafeaUon signal binding 

protein 


103 


50 




U90552 






356 


98 


12561 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


156 


37 


12562 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


138 


41 


12563 


XI 6454 


Homo sapiens 


carcinoembryonic antigen subdomain B 


353 


98 


12564 


AL03 1 1 86 


Homo sapiens 


region 1 protein) 


1 19 


100 


12565 


M68941 


Homo sapiens 


protein-tyrosine phophatase 


223 


93 


12566 


Y14487 




cytosolic serine 
hydroxymethyltransferase 


145 


84 


12567 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


149 


61 


12568 


AB005047 


Homo sapiens 


SH3 binding protein 


124 


52 


12569 


G0341 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


102 


53 


12570 


X68142 


Oryctolagus 
cuniculus 


elongation factor 1 gamma 


678 


93 


12571 


L20315 


Mus musculus 


MPS1 protein 


337 


77 


12572 


AL359782 


brucei 


possible ul 102, vsrismt 3 dna, 
complete virion genome. 


1 14 


50 


I2574 — 


AF1 30089 




PRO2550 










Homo sapiens 


Human secreted protein SEQ ID NO* 
7492. 


~~R6~ 


~5~T 


12575 


AF005902 


^Monocle Iphis 
domestica 


Rinesin homolog 


534 


82 


12576 


Y56021 


Homo sapiens 


Human CD40 receptor interacting 
protein 4C4. 


711 


96 


12577 


Y99662 


Homo sapiens 


Human GTPase associated protein- 13. 


634 


100 


12578 


Z48008 


Saccharomyces 
cerevisiae 


Soklp 


225 


100 



471 



01 64835A2_L> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description' 


Smith- 


% 


NO: 








Waterman 


Idenlit 










Score 


y 


12579 




- —- : 

Homo sapiens 


translation initiation factor elF-2b delta 


195 


97 




- 




subunit 








M22538 


Homo sapiens 


NADH-ubiquinone reductase 


224 


82 






Mus musculus 


This gene is isolated by means of 


1 12 


44 








differential display method using ttw, 












an excellent mouse model for ectopic 












ossification.; similar to megakaryocyte 












stimulating factor precursor and 












cartilage superficial zone protein 






12582 


U93 . 67 


Homo sapiens 


p40 


364 




93 


12583 


/V i UU iZjj 


Homo sapiens 


phosphoinositol 3 -phosphate binding 


475 


100 








protein-1 






] ? 5g 4 — 


■ 07< - Q9 


— ■ 

Homo sapiens 


peroxisome proliferator activated 


1 13 


■ 

81 


■ ■ — 






receptor 










— — : 

Drosophila 


CG8135 gene product 


. 191 


38 






melanogaster 






12586 


AF1 19851 


Homo sapiens 


PRO 17^2 ~ 


88 


64 


12587 


AF090895 


Homo sapiens 


r isXJy) ill 


144 


59 


12588 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


57 




. 


— 


7645. 






]25g — 




Mus musculus 


Rho-guanine nucleotide exchange 


111 


32 




-— — — 




factor 






}9 — 
" — 




— : 

Homo sapiens 


small nuclear ribonucleoprotein 


383 


• 97 




77Q 


Ovis aries 


small proline-rich protein with paired 


60 


33 


— 


■ 




repeat 










— — ; 

Homo sapiens 


Human secreted protein encoded by 


118 


78 


— 





— — . 


gene 22 clone HMSJW18. 










Homo sapiens 


Fragment of human secreted protein 


123 


83 








encoded by gene 27. 






12594 


AF090944 


Homo sapiens 


rKUOOoi 


152 


82 




yQO^S 


Homo sapiens 


ULBP-3 amino acid sequence. 


852 


100 


12596 




Drosophila 


type I 


159 


42 






melanogaster 








12597 


AF191309 


Mus musculus 


zinc finger protein 


407 


66 


?2cri| — 


W8284 1 


Homo sapiens 


Human cerebral protein-1. 


107 


91 




AF070664 


Homo sapiens 


HSPC008 


128 


89 


yyZQQ 


Uujood 


Homo sapiens 


cytochrome P450 


257 


96 


] 2 60] — 


AB01 5798 


Homo sapiens 


DnaJ homolog 


226 


73 


12607 — 




Arabidopsis 


unknown protein 


88 


40 






thaliana — _ 








12603 


G03267 


Homo sapiens 


— ; 

Human secreted protein, SEQ ID NO: 


80 


35 














12604 


AF200187 


■ u- • 

cercopithicine 


EBNA2-like protein 


109 


28 






herpesvirus 15 








12605 


Ar 1 o 1 53z 


Homo sapiens 


— 

HSPC047 


720 


100 


12606 


M21302 


Homo sapiens 


small proline rich protein 


60 


59 


12607 


Y0267 1 


Homo sapiens 


Human secreted protein encoded by 


107 


47 






-— 


gene 22 clone HMSJW1 8. 






. — 




this . 


RP59 protein 


599 


54 


- — 





norvegicus 








09 




Homo sapiens 


sarcomeric mitochondrial creatine 


763 


96 








kinase precursor (EC 2.7.3.2) 






12610 


G02532 


Homo sapiens 


Human secreted protein, SEQ TD NO: 




87 








6613. 






12611 


W7S175 


Homo sapiens 


Human secreted protein encoded by 


337 


100 








gene 50 clone HSTAG52. 






12612 


S79410 


Mus musculus 


nuclear localization signal binding 


145 


66 








protein 






12613 


AF1 19855 


Homo sapiens 


PRO 1847 


,57 - 


61 
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SEQ ID 

NO: 




Spec.es 


Description 


Smith- 
Wflterman 


% 

Identic 


12614 


U82303 


Homo sapiens 


unknown 


107 


62 


12615 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


146 


49 


12616 


AF I 30051 


Homo sapiens 


PRO0898 


254 


56 


12617 


AF037350 


Rattus 
norvegicus 


NF-E2-related factor 2 


807 


59 


12618 


Y02168 


Homo sapiens 


A facilitative glucose transporter 
protein GLUT8. 


452 


100 


12619 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


158 


84 


12620 


G03812 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7893. 


246 


60 


12621 


X55683 


Lycopersicon 
esculentum 


extensin (class I) 


79 


43 


12622 


AF109907 


Homo sapiens 


S164 


93 


40 


12623 


Y36156 


Homo sapiens 


Human secreted protein #28. 


114 


61 


12624 


AF1 16712 


Homo sapiens 


PR02738 


104 


48 


12625 


G 02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


81 


65 


12626 


AF1 13685 


Homo sapiens 


PRO0974 


104 


43 


12627 


AL1 60371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


1 16 


45 


12628 


AF 107406 


Homo sapiens 


GW128 


133 


58 


12629 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7496. 


100 


57 


12630 


AB047600 


Macaca 
fascicularis 


hypothetical protein 


172 


66 


12631 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


136 


63 


12632 


AF1 18082 


Homo sapiens 


PRO 1902 


138 


45 


12633 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


151 


62 


12634 


X52164 


Mus musculus 


Q300 protein (AA 1-77) 


95 


58 


12635 


AF064597 


Homo sapiens 


LINE-1 like protein 


108 


46 


12636 


Y02693 


Homo sapiens 


Human secreted protein encoded bv 
gene 44 clone HTDAD22. 


116 


49 


12637 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


110 


52 


1263S 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


79 


70 


12639 


M29622 


Mus musculus 


open reading frame 2 


81 


60 


12640 


M26460 


Homo sapiens 


retinoblastoma 1 


100 


45 


12641 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


110 


63 


12642 


AB047936 


Macaca 
fascicularis 


hypothetical protein 


100 


52 


12643 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


146 


41 


12644 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1 7. 


143 


48 


12645 


AF090942 


Homo sapiens 


PRO0657 


157 


54 


12646 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


130 


56 


12647 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


267 


65 


12648 


AF130079 


Homo sapiens 


PR02852 


109 


57 


12649 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


99 


61 


12650 


X55686 


Lycopersicon 


extensin (class II) 


65 


— 
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SEQ ID 

NO: 


Accession No. 


Species 


; Description t'v<&? --, 


Smilh- 
VVaferman 


% 

Identit 

y 






esculentum 








12651 


G03714 


Homo sapiens 


7795. 




84 


12652 


AF1 19851 


Homo sapiens 


PRO 1 722 




• 70 


12653 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18 


133 


50 


12654 


Y45382 


Homo sapiens ■ 


Human' secreted protein fragment 
encoded from gene 28. 




70 


12655 


G00354 


Homo sapiens 


4435. 






12656 


G03789 


Homo sapiens 


7870. 


9j 


52 


12657 


U93563 


Homo sapiens 


putative pi 50 


1046 


50 


12658 


G03789 




7870. 




70 


12659 


Y12950 


Homo sapiens 


Amino acid sequence of a human 


104 . 


70 


12660 


AF1 19851 


Homo sapiens 


PRO 1722 




56 


12661 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


90 


65 


12662 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


59 


64 


12663 


AF1 18082 




PRO 1 902 


91 


66 


12664 


L10908 


Mus musculus 


Gcapl gene product 


113 


50 


12665 


Ml 8093 


vulgaris 


hydroxyproline-rich glycoprotein 


124 


37 


12666 


AF090895 


Homo sapiens 


PRO0117 


76 


57 


12667 


G03789 




Human secreted protein. SEQ ID NOi 
7870. 


103 


43 


12668 


W48351 




Human breast cancer related protein 
BCRB2. 


90 


61 


12669 


M15530 




B-cell growth factor 


90 


80 


12670 


AF 198447 


Aspergillus 
Homo"! iens 


60S ribosomal protein L3 


217 


69 


12671 


G03790 


pi s 


— ■ : — — 

Human secreted protein, SEQ ID NO: 


1j7 


51 


12672 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


77 


68 


12673 


AF055985 


Onchocerca 
volvulus 


pyrrol idone-nch antigen 


67 


40 


12674 


X79417 


Sus scrofa 


40S ribosomal protein S12 


377 


75 


12675 


AF1 07406 






74 


51 


12676 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #15 j. — — 


99 


62 


12677 


G02872 


Homosa ien 

omo sapiens 


riurnan secreted protein, SEQ ID NO* 
6953. 


127 

— ; 


65 


12678 


G00590 


Homo sapiens 


4671. 


99 


62 


12679 


G01984 


Homo sapiens 


Human secreted protein SEQ ID NO* 
6065. 


"128 




12680 


AF194537 


Homo sapiens 


NAG 13 


-LI? 


59 


12681 


R96800 




Human histiocyte-secreted factor HSF. 


155 


77- 


12682 


AF161356 


Homo sapiens 


HSPC093 


104 


50 


12683 


AF 107406 


Homo sapiens 


GW128 


84 


54 


12684 


U83303 


Homo sapiens 


line- 1 reverse transcriptase 


90 


39 


12685 


V40883_cdl 


Homo sapiens 


03 : DEC-1?97 Coding sequence of 
clone BG366 2. 


728 


100 . . 


12686 


M24732 


Homo sapiens 


lamin-like protein 


94 


54 


12687 


X00824 


Gailus gallus 


collagen 


66 


42 
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SEQID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 

y 


12688 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


184 


64 


12689 


AF1 18086 


Homo sapiens 


PRO 1992 


140 


61 


12690 


AL390114 


Leishmania 
major 


probable (hhv-6) u 1 102, variant a 
DNA, complete virion genome 


131 


54 


12691 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


180 


72 


12692 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


104 


72 


12693 


AF130089 


Homo sapiens 


PRO2550 


204 


51 


12694 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


92 


52 


12695 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


110 


52 


12696 


L39103 


Homo sapiens 


glycoprotein lb alpha 


61 


45 


12697 


M15530 


Homo sapiens 


B-ce!l growth factor 


121 


64 


12698 


M36913 


Zea mays 


cell wall protein (put.); putative 


75 


40 


12699 


S58722 


Homo sapiens 


X-Iinked retinopathy protein {C- 
terminal, clone XEH.8c} 


101 


68 


12700 


M24732 


Homo sapiens 


lam in -like protein 


95 


35 


12701 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


67 


50 


12702 


AF298594 


Nicotiana alata 


arabinogalactan protein 


105 


30 


12703 


X92485 


Plasmodium 
vivax 


pval 


97 


38 


12704 


AF2 10651 


Homo sapiens 


NAG 18 


89 


64 


12705 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


190 


53 


12706 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


93 


78 


12707 


U52077 


Homo sapiens 


mariner transposase 


451 


51 


12708 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


244 


77 


12709 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


138 


65 


12710 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


99 


82 


12711 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


104 


34 


12712 


AF242772 


Homo sapiens 


mesenchymal stem cell protein 
DSCD28 


106 


45 - 


12713 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


116 


36 


12714 


AF1 16715 


Homo sapiens 


PR02829 


139 


68 


12715 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


89 


62 


12716 


X55684 


Lycopersicon 
esculentum 


extensin (class I) 


86 


54- 


12717 


Z93891 


Hegeter politus 


cytochrome oxidase 


72 


48 


12718 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


105 


46 


12719 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


74 


46 


12720 


U30221 


Crithidia 
fasciculata 


NADH dehydrogenase subunit 5 


92 


39 


12721 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


131 


82 


12722 


AF 107406 


Homo sapiens 


GW128 


72 


54 


12723 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


55 



475 



0164635A2 I > 



WO 01/64835 



PCT/US01/04927 



NO: 


Accession No. 


Species 


Description 


Waterman 


Identit 

y 








6720. 






12724 


AFl 16661 


Homo sapiens 


PRO 143 8 


145 


59 


12725 


Y53871 


Homo sapiens 


A human brain-derived signalling 
factor polypeptide. 


584 


98 


12726 


LI 0908 


Mus musculus 


Gcapl gene product 


95 


37 


12727 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


124 


37 


12728 


X55686 


Lycopersicon 
esculentum 


extensin (class II) 


66 


36 


12729 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


110 


55 


12730 


L23852 


Homo sapiens 


Putative 3' end of coding region; 
putative 


83 


37 


12731 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


59 


48 


12732 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


88 


69 


12733 


AF090895 


Homo sapiens 


PRO0117 


130 


55 


12734 


R959I3 


Homo sapiens 


Neural thread protein. 


162 


45 


12735 


U72355 


Homo sapiens 


Hsp27 ERE-TATA-binding protein 


972 


57 


12736 


AF247039 


porcine 
adenovirus 3 


163R* 


108 


35 


12737 


M22332 


Homo sapiens 


unknown protein 


148 


45 


12738 


AF090895 


Homo sapiens 


PRO0117 


89 


72 


12739 


B08525 


Homo sapiens 


Protein encoded by a novel gene 
associated with colon disease. 


296 


86 


12740 


X90872 


Homo sapiens 


associated to Golgi apparatus 


116 


54 


12741 


M26460 


Homo sapiens 


retinoblastoma 1 


75 


37 


12742 


S52010 


Mus sp. 


orfl 5' of EpoR 


104 


31 


12743 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


88 


50 


12744 


AB047600 


Macaca 
fascicularis 


hypothetical protein 


111 


47 


12745 


AF090944 


Homo sapiens 


PRO0663 


152 


65 


12746 


Y21106 


Homo sapiens 


Human bc!2 proto-oncogene wild type 
protein fragment 3. 


62 


64 


12747 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


97 


76 


12748 


X66861 


Mus musculus 


Hox-1.4 


92 


34 


12749 


AF166125 


Homo sapiens 


selenoprotein N 


978 


98 


12750 


W87504 


Homo sapiens 


Human N-methyl-D-aspartate receptor 
subunit encoded by clone NMDA24. 


98 


29 


12751 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


91 


78 


12752 


X72963 


Nicotiana 
tabacum 


pAP8 product 


80 


45 


12753 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


123 


61 


12754 


AF146191 


Homo sapiens 


FRG1 


208 


85 


12755 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


154 


75 


12756 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


72 


48 


12757 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


111 


51 


12758 


U83280 


Leishmania 
donovani 


39 kDa antigen 


106 


60 


12759 


AF116661 


Homo sapiens 


PRO 1438 


73 


48 


12760 


AF164615 


Homo sapiens 


envelope-protein 


257 


83 ! 
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SEQ1D 
NO: 








Waterman 


Identit 

y 


12761 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJ W 18. 


135 


70 


12762 


AF073519 


Homo sapiens 


small EDRK-rich factor 1 , long isoform 


111 


50 


12763 


M29622 


Mus musculus 


open reading frame 2 


73 


68 


12764 


L27428 


Homo sapiens 


reverse transcriptase 


111 


60 


12765 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


106 


45 


12766 


AK024455 


Homo sapiens 


FLJ00047 protein 


108 


66 


12767 


M36914 


Zea mays 


cell wall protein (put.); putative 


78 


36 


12768 


L27428 


Homo sapiens 


reverse transcriptase 


159 


59 


12769 


AJ005567 


Mus musculus 


SPR2I protein 


55 


39 


12770 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


115 ' 


82 


12771 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


76 


64 


12772 


AF2 17374 


Acanthaster 
planci 


cytochrome oxidase subunit I 


126 


85 


12773 


AF130114 


Homo sapiens 


PR02459 


129 


50 


12774 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


220 


71 


12775 


AF287482 


Chlorobium 
tepidum 


Orfl22 


166 


68 


12776 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


84 


50 


12777 


AF 13 0079 


Homo sapiens 


PR02852 


258 


58 


12778 


AF090931 


Homo sapiens 


PRO0483 


65 


76 


12779 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


152 


66 


12780 


AB047948 


Macaca 
fascicularis 


hypothetical protein 


59 


37 


12781 


AF 119900 


Homo sapiens 


PR02822 


132 


59 


12782 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


97 


51 


12783 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


77 


77 


12784 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


93 


84 


12785 


L27428 


Homo sapiens 


reverse transcriptase 


183 


39 


12786 


AF1 19851 


Homo sapiens 


PRO 1722 


128 


61 


12787 


AF1 16715 


Homo sapiens 


PR02829 


111 


72 


12788 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


147 


77 


12789 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


84 


66 


12790 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


155 


71 


12791 


AF 124729 


Mus musculus 


acinusS' 


122 


50 


12792 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


86 


52 


12793 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


108 


74. . 


12794 


M62415 


Pseudopleuronec 
tes americanus 


HPLC6 


82 


35 


12795 


Y36156 


Homo sapiens 


Human secreted protein #28. 


133 


48 


12796 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


119 


74 


12797 


L22030 


Glycine max 


hydroxyproline-rich glycoprotein 


170 


32 


12798 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


114 


43 


12799 


Y76184 


Homo sapiens 


Human secreted protein encoded by 


90 


60 
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NO. 


Accession No. 


Spec." 


Description 


Smith- 
Waterman 


% 

Idcntit 








gene 61. 






12800 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


102 


38 


12801 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


98 


69 


12802 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


177 


69 


12803 


AF130089 


Homo sapiens 


PRO2550 


94 


74 


12804 


S77772 


Homo sapiens 


aspartylglucosaminidase, AG A {C- 
terminal, alternatively spliced} {EC 
3.5.1.26} 


83 


68 


12805 


AF063866 


Melanoplus 
sanguinipes 
entomopoxvirus 


ORF MSV233 hypothetical protein 


82 


28 


12806 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


80 


60 


12807 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


122 


47 


12808 


X03145 


Homo sapiens 


pot. ORF V 


150 


34 


12809 


LI 0908 


Mus musculus 


Gcapl gene product 


111 


34 


12810 


Y95028 


Homo sapiens 


Human clone vp7 1 ORF2, SEQ ID 
NO:12S. 


170 


55 


12811 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


101 


62 


12812 


AJ233591 


Mus musculus 


reverse transcriptase 


287 


67 


12813 


U82303 


Homo sapiens 


unknown 


149 


52 


12814 


AF 107406 


Homo sapiens 


GW128 


103 


58 


12815 


U82303 


Homo sapiens 


unknown 


93 


83 


12816 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


161 


54 


12817 


BO 1390 


Homo sapiens 


Neuron-associated protein. 


81 


38 


12818 


AF287482 


Chlorobium 
tepidum 


Orfl22 


174 


69 


12819 


AE003499 


Drosophila 
melanogaster 


CG 12706 gene product 


166 


28 


12820 


M15530 


Homo sapiens 


B-cell growth factor 


124 


71 


12821 


U83303 


Homo sapiens 


line- 1 reverse transcriptase 


161 


53 


12822 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


150 


77 


12823 


G00497 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4578. 


103 


66 


12824 


AF064597 


Homo sapiens 


LINE-1 like protein 


64 


45 


12825 


X71442 


Rattus 
norvegicus 


ORF 1; putative 


113 


45 


12826 


U62039 


Elephantulus 
edwardii 


reverse transcriptase 


74 


46 


12827 


M15530 


Homo sapiens 


B-cell growth factor 


127 


54 


12828 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


94 


84 


12829 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


59 


57 


12830 


AF229067 


Homo sapiens 


PADI-H protein 


184 


61 


12831 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


69 


34 


12832 


AF144054 


Homo sapiens 


apoptosis related protein APR-4 


83 


46 


12833 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


168 


61 


12834 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


142 


40 


12835 


U 19098 


Lycopersicon 
chilense 


proline-rich protein 


81 


38 


12836 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


229 


68 
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SEQID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Score 


y 


12837 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


81 


60 


12838 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


88 - 


71 


12839 


X72963 


Nicotiana 
tabacum 


pAP8 product 


94 


40 


12840 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


315 


56 


12841 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


94 


55 


12842 


AK025947 


Homo sapiens 


unnamed protein product 


124 


57 


12843 


AF1 59055 


Homo sapiens 


leucine zipper-like protein 


69 


55 


12844 


AF040257 


Homo sapiens 


TNF receptor homolog 


98 


50 


12845 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


94 


76 


12846 


X88799 


Oryza sativa 


DNA binding protein 


94 


37 


12847 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


98 


48 


12848 


G01984 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
6065. 


164 


56 


12849 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


75 


59 


12850 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


122 


54 


12851 


M31964 


Saimiriine 
herpesvirus 2 


collagen-like protein 


79 


50 


12852 


AF 11 8082 


Homo sapiens 


PRO 1902 


94 


77 


12853 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


139 


54 


12854 


R59842 


Homo sapiens 


ApoE4Ll protease. 


108 


82 


12855 


G03295 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7376. 


99 


66 


12856 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


89 


56 


12857 


D29833 


Homo sapiens 


proline rich peptide P-B 


64 


52 


12858 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


155 


80 


12859 


G02482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6563. 


88 


61 


12860 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


66 


33 


12861 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


75 


47 


12862 


M29622 


Mus musculus 


open reading frame 2 


74 


62 


12863 


G037S9 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


141 


70 


12864 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


117 


54 


12865 


AF090894 


Homo sapiens 


PRO0113 


108 


63 


12866 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121 . 


121 


54 


12867 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


103 


44 


12868 


S68106 


Ascaris suum, 
Peptide Partial, 
100 aa 


type IV collagen alpha 2 chain, alpha 2 
(IV) {alternatively spliced} 


75 


35 


12869 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


97 


56 


12870 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


146 


62 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 
y 


12871 


G00412 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
4493. 


78 


76 


12872 


G03793 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
7874. 


105 


56 


12873 


AB030234 


Canis familiaris 


D4 dopamine receptor 


61 


58 


12874 


M22332 


Homo sapiens 


unknown protein 


128 


41 


12875 


U05313 


Trypanosoma 
brucei 


CR3 


67 


48 


12876 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


115 


65 


12877 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


66 


54 


12878 


X55684 


Lycopersicon 
esculentum 


extensin (class I) 


78 


34 


12879 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


51 


12880 


AF130079 


Homo sapiens 


PR02852 


125 


75 


12881 


AFO90895 


Homo sapiens 


PRO01 17 


142 


59 


12882 


AFI81251 


Rattus 
norvegicus 


lung Kruppel-like factor 


93 


39 


12883 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


167 


74 


12884 


M26460 


Homo sapiens 


retinoblastoma 1 


136 


41 


12885 


AJ277557 


Homo sapiens 


mitochondrial 5'(3')-. 
deoxyribonucleotidase (dNT-2) 


280 


100 


12886 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


119 


62 


12887 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


145 


43 


12888 


AF1 19851 


Homo sapiens 


PRO 1722 


94 


50 


12889 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


121 


39 


12890 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


95 


68 


12891 


AK024455 


Homo sapiens 


FLJ00047 protein 


77 


51 


12892 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


115 


83 


12893 


L27428 


Homo sapiens 


reverse transcriptase 


89 


29 


12894 


Y01154 


Homo sapiens 


Protein sequence Seq Id 54 from 
WO9901020. 


112 


72 


12895 


G02597 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6678. 


72 


62 


12896 


AK023563 


Homo sapiens 


unnamed protein product 


260 


64 


12897 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


76 


66 


12898 


AC002304 


Arabidopsis 
thaliana 


F14J 16.29 


111 


36 


12899 


Y86445 


Homo sapiens 


Human gene 42-encoded protein 
fragment, SEQ ID NO:360. 


83 


53 


12900 


AF1 16638 


Homo sapiens 


PRO 1546 


123 


57 


12901 


AB010361 


Mus musculus 


mszf47 


64 


43 


12902 


R59842 


Homo sapiens 


ApoE4Ll protease. 


130 


86 


12903 


L06498 


Homo sapiens 


ribosomal protein S20 


249 


56 


12904 


L13635 


Rattus 
norvegicus 


growth response protein 


176 


69 


12905 


AF 156961 


Homo sapiens 


gag 


183 


44 


12906 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


207 


67 


12907 


AF 107406 


Homo sapiens 


GW128 


99 


50 


.12908 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. . 


93 


75 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Waterman 


J den tit 

y 


12909 


AF263540 


Homo sapiens 


MIB006 


1809 


100 


12910 


Y48292 


Homo sapiens 


Human prostate cancer-associated 
protein 78. 


60 


50 


1291 1 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


179 


87 


12912 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


108 


72 


12913 


Z72499 


Homo sapiens 


herpesvirus associated ubiquitin- 
specific protease (HAUSP) 


5242 


99 


12914 


AF092136 


Homo sapiens 


PTD015 


297 


100 


12915 


AF161356 


Homo sapiens 


HSPC093 


123 


42 


12916 


AF044670 


Homo sapiens 


33 kDa Vamp-associated protein; VAP- 
33 


506 


90 


12917 


W27087 


Homo sapiens 


Human transforming growth factor 
alpha HIII. 


1243 


100 


12918 


Y59807 


Homo sapiens 


Human normal ovarian tissue derived 
protein 84. 


111 


43 


12919 


AJ388518 


Canis familiaris 


non-histone chromosomal protein 
HMG-17 


108 


84 


12920 


AK023392 


Homo sapiens 


unnamed protein product 


119 


60 


12921 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


104 


87 


12922 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
termina!, clone XEH.8c} 


124 


81 


12923 


X78444 cdl 


Homo sapiens 


21 -MAY- 1997 Human UCSP-2 cDNA. 


1090 


100 


12924 


AF1 16661 


Homo sapiens 


PRO 143 8 


89 


52 


12925 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


82 


80 


12926 


AF1 19855 


Homo sapiens 


PRO 1847 


120 


63 


12927 


AF090931 


Homo sapiens 


PRO0483 


92 


90 


12928 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


119 


88 


12929 


Y91429 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 1 SEQ ID NO: 1 50. 


108 


38 


12930 


Y12661 


Homo sapiens 


neuro-endocrine specific protein VGF 


2506 


99 


12931 


Y84546 


Homo sapiens 


Amino acid sequence of a fragment of 
human collagen type 1 protein. 


61 


50 


12932 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


.121 


53 


12933 


G00481 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4562. 


104 


52 


12934 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


104 


60 


12935 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


331 


77 


12936 


Ml 5530 


Homo sapiens 


B-cell growth factor 


83 


71 


12937 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


146 


64 


12938 


R59842 


Homo sapiens 


ApoE4Ll protease. 


112 


61 


12939 


AF090895 


Homo sapiens 


PRO0I17 


94 


80 


12940 


X73459 


Homo sapiens 


signal recognition particle subunit 14 


549 


98 


12941 


L23545 


Homo sapiens 


putative 


141 


45 


12942 


AF054178 


Homo sapiens 


CI-B 14.5a homolog 


238 


84 


12943 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


110 


55 


12944 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


143 


55 


12945 


D00570 


Mus museums 


open reading frame (251 AA) 


166 


34 


12946 


U62039 


Elephantulus 


reverse transcriptase 


63 


50 
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NO: 








Waterman 


Idcntit 

y 






edwardii 








12947 


AE003499 


Drosophila 
melanogaster 


CG 12706 gene product 


104 


31 


12948 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


65 


100 


12949 


X70343 


Nicotian a 
sylvestris 


extensin 


102 


38 


12950 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


103 


45 


12951 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


85 


56 


12952 


AF090895 


Homo sapiens 


PRO0117 


139 


68 


12953 


AF090896 


Homo sapiens 


PRO0131 


93 


85 


12954 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


101 


75 


12955 


M96982 


Homo sapiens 


U2 snRNP auxiliary factor small 
subunit 


292 


68 


12956 


X71087 


Homo sapiens 


monocyte chemoattractant protein 
(MCP-3) 


408 


96 


12957 


S80864 


Homo sapiens 


cytochrome c-like polypeptide 


591 


68 


12958 


AF1 09907 


Homo sapiens 


S164 


188 


63 


12959 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


173 


50 


12960 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


168 


62 


12961 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


98 


56 


12962 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


106 


84 


12963 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


125 


44 


12964 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


107 


57 


12965 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


71 


65 


12966 


AC009853 


Arabidopsis 
thaliana 


hypothetical protein 


102 


44 


12967 


AF090942 


Homo sapiens 


PRO0657 


99 


62 


12968 


M15530 


Homo sapiens 


B-cell growth factor 


90 


65 


12969 


AF090895 


Homo sapiens 


PRO0117 


83 


54 


12970 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


119 


54 


12971 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


123 


57 


12972 


L3255S 


Homo sapiens 


sequence is expressed in human Tera-2 
clone 13 (embryonal carcinoma) cells. 
The sequence may contain mismatches 
(one strand sequenced only once). 97% 
identical in 320 bp overlap with human 
54 kDA prot; ORF 


426 


85 


12973 


AF008196 


Homo sapiens 


bax epstlon 


78 


87 


12974 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


96 


80 


12975 


AL031848 


Homo sapiens 


dJ202O8.2.2 (novel rodent HES2 (hairy 
and Enhancer of Split 2) LIKE protein 
(isoform 2)) 


176 


100 


12976 


AJ245905 


Chlorocebus 
aethiops 


HSBPl-like protein 


94 


100 


12977 


AF067519 


Homo sapiens 


PITSLRE protein kinase beta SV1 
isoform 


1923 


93 
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NO: 








Score 


Identit 

y 


12978 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


168 


73 


12979 


X52164 


Mus musculus 


Q300 protein (A A 1-77) 


105 


39 


12980 


Y13141 


Bromheadia 
finlaysoniana 


cxtensin 


53 


36 


12981 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


87 


77 


12982 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6292. 


83 


89 


12983 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


92 


46 


12984 


AF1 13685 


Homo sapiens 


PRO0974 


111 


67 


12985 


U31086 


Gallus gallus 


neuron-glia adhesion molecule 


52 


50 


12986 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


132 


68 


12987 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


91 


71 


12988 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


135 


81 


12989 


AB029042 


Homo sapiens 


ATPase inhibitor precursor 


364 


100 


12990 


G 00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4523. 


174 


72 


12991 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


174 


57 


12992 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


158 


57 


12993 


AFl 18082 


Homo sapiens 


PRO 1902 


77 


78 


12994 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


76 


60 


12995 


M15530 


Homo sapiens 


B-cell growth factor 


105 


76 


12996 


AJ005562 


Mus musculus 


SPR2D protein 


104 


36 


12997 


Y48346 


Homo sapiens 


Human prostate cancer-associated 
protein 43. 


184 


77 


12998 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1S. 


82 


53 


12999 


LI 1645 


Homo sapiens 


alpha-tubulin 


196 


68 


13000 


AFl 13685 


Homo sapiens 


PRO0974 


123 


52 


13001 


S58722 


Homo sapiens 


X-Iinked retinopathy protein {C- 
terminal, clone XEH.8c} 


78 


53 


13002 


AJ005564 


Mus musculus 


SPR2F protein 


76 


44 


13003 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


101 


76 


13004 


AL159178 


Streptomyces 
coelicolor A3(2) 


putative secreted protein 


89 


37 


13005 


L77967 


Ovis aries 


small proline-rich protein with paired 
repeat 


64 


36 


13006 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7324. 


85 


70 


13007 


AJF068294 


Homo sapiens 


HDCMB45P 


237 


52 


13008 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


128 


50 


13009 


AFl 44054 


Homo sapiens 


apoptosis related protein APR-4 


100 


75 


13010 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7534. 


148 


67 


13011 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


152 


66 


13012 


B01390 


Homo sapiens 


Neuron-associated protein. 


99 


32 


13013 


A27282 


Homo sapiens 


TGR-CL3C 


69 


42 


13014 | G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


46 
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SEQ ID 


Accession No. 


Species 


Description ! • Mi - ** •• 


Score rrna " 


% 
y 








4770. 






13015 


AF1 30089 


Homo sapiens 


PRO2550 


SS 


60 


13016 


AF266223 


Gillichthys 
mirabilis 


ribosomal protein L27 


122 


50 


13017 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


116 


40 


13018 


U82303 


Homo sapiens 


unknown 


64 


60 


13019 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


70 


57 


13020 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


169 


91 


13021 


V00488 


Homo sapiens 


alpha globin 


213 


100 


13022 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


146 


59 


13023 


G00381 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4462. 


109 


: 86 


13024 


G03472 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7553. 


151 


84 


13025 


U82303 


Homo sapiens 


unknown 


86 


52 


13026 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121 . 


124 


61 


13027 


G 00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


83 


50 


13028 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


74 


63 


13029 


Y84546 


Homo sapiens 


Amino acid sequence of a fragment of 
human collagen type 1 protein; 


59 


46 


13030 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene22cIoneHMSJW18. 


96 


68 


13031 


S71494 


Mus sp. 


SmD homolog {Gly-Arg repeat} 


93 


55 


13032 


AF 113685 


Homo sapiens 


PRO0974 


100 


48 


13033 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


242 


85 


13034 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


178 


41 


13035 


GO 1129 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5210. 


228 


81 


13036 


X66285 


Mus musculus 


HC1 ORF 


103 


44 


13037 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


90 


55 


13038 


U47924 


Homo sapiens 


RPL13-2 


424 


63 


13039 


AF090942 


Homo sapiens 


PRO0657 


111 


44 


13040 


AK025116 


Homo sapiens 


unnamed protein product 


131 


52 


13041 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. • 


92 


72 


13042 


M11759 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


100 


69 


13043 


AF130089 


Homo sapiens 


PRO2550 


109 


45 


13044 


G02879 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6960. 


116 


52 


13045 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene22cloneHMSJW18. 


162 


70 


13046 


L10908 


Mus musculus 


Gcap 1 gene product 


162 


40 


13047 


L02321 


Homo sapiens 


glutathione S-transferase GSTM5-5 


1072 


93 


13048 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


89 


60 


13049 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


153 


60 


13050 


U33547 


Homo sapiens 


MHC class II antigen 


69 


85 


13051 


AC006014 


Homo sapiens 


similar to mismatch repair proteins; 
similar to PID.g 13 04 125 


652 


91 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 

Wnterman 

Score 


% 

Identit 

y 


13052 


A27282 


Homo sapiens 


TGR-CL3C 


65 


51 


13053 


Y691 66 


Homo sapiens 


A mature human N-acetylglycosaminyl 
transferase protein. 


106 


60 


13054 


M20030 


Homo sapiens 


small proline rich protein 


57 


33 


13055 


AF022117 


Balaena 
mysticetus 


metallothionein 


1 19 


62 


13056 


Ml 1759 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


78 


43 


13057 


R20305 


Homo sapiens 


Val(l) to Met, His(2) deleted, Ala(76) 
to Lys beta-globin mutant. 


274 


98 


13058 


X55686 


Lycopersicon 
esculentum 


extensin (class II) 


56 


39 


13059 


U 10696 


Zea mays 


Ec metallothionein class II protein 


66 


26 


13060 


Y40417 


Homo sapiens 


A human N-acetylneuraminate lyase 
(hNANL) protein. 


1183 


80 


13061 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


125 


41 


13062 


AF002210 


Homo sapiens 


copper chaperone for superoxide 
dismutase 


702 


61 


13063 


AF 11 8086 


Homo sapiens 


PRO 1992 


69 


84 


13064 


M24097 


Homo sapiens 


MHC HLA-C-alpha-2 chain 


1550 


96 


13065 


AC003113 


Arabidopsis 
thaliana 


F2401.6 


51 


40 


13066 


AF161531 


Homo sapiens 


HSPC046 


488 


97 


13067 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


141 


50 


13068 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


116 


46 


13069 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


145 


69 


13070 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
1DNO:103. 


224 


71 


13071 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


153 


52 


13072 


AF2 16965 


Homo sapiens 


ancient conserved domain protein 3 


876 


100 


13073 


Y15917 


Homo sapiens 


COL1A1 and PDGFB fusion transcript 


108 


35 


13074 


X77664 


Homo sapiens 


retinoic acid receptor beta isoform 1 


53 


71 


13075 


AF 156961 


Homo sapiens 


gag 


136 


37 


13076 


R59842 


Homo sapiens 


ApoE4LI protease. 


117 


75 


13077 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


121 


65 


13078 


U23183 


Caenorhabditis 
elegans 


gene lies in inverted repeat and exon 1 
overlaps tRNA; may be pseudogene 


100 


79 


13079 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


125 


56 


13080 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


138 


65 


33081 


Y 12473 


Homo sapiens 


cenrrin 


359 


79 


13082 


G01518 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5599. 


375 


98 


13083 


Y57891 


Homo sapiens 


Human transmembrane protein 
HTMPN-15. 


1030 


99 


13084 


AB046765 


Homo sapiens 


KIAA1545 protein 


161 


93 


13085 


X55686 


Lycopersicon 
esculentum 


extensin (class II) 


66 


40 


13086 


AF 13 0079 


Homo sapiens 


PR02852 


137 


45 


13087 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


86 


33 


13088 


G03812 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 | 44 
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WO 01/64835 



PCT/US0 1/0492 7 



NO: 


Accession No. 


Spec.es 


Description 


Smith- 
Watermon 


% 

Identit 

-2 








7893. 






13089 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


114 


63 


13090 


AL 12 1905 


Homo sapiens 


dJ534B8.3 (novel protein similar to an 
aspartic protease) 


963 


100 


.13091 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


95 


47 


13092 


AB030236 


Canis familiaris 


D4 dopamine receptor 


65 


38 


13093 


AF161356 


Homo sapiens 


HSPC093 


119 


48 


13094 


Z14014 


Nicotiana 
tabacum 


Pistil extensin like protein, partial CDS 
only 


67 


50 


13095 


G01249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5330. 


124 


56 


13096 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


159 


73 


13098 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


62 


13099 


AF130051 


Homo sapiens 


PRO0898 


194 


61 


13100 


Y86445 


Homo sapiens 


Human gene 42-encoded protein 
fragment, SEQ IDNO:360. 


80 


70 


13101 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


111 


43 


13102 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


120 


71 


13103 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


116 


63 


13104 


U28971 


Caenorhabditis 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


90 


64 


13105 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 






13106 


U23455 


Caenorhabditis 
elegans 


similar to D. melano°aster homeotic 
protein BarH2 (PIR: A4 1 726) 


101 


75 


13107 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


100 


65 


13108 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


137 


77 


13109 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


94 


88 


13110 


AF1 19855 


Homo sapiens 


PRO 1847 


103 


49 


13111 


AF1 19851 


Homo sapiens 


PRO 1722 


142 


80 


13112 


Y00311 


Homo sapiens 


Human secreted protein encoded by 
gene 54. 


107 


51 


13113 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 




54 


13114 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


144 


— 


13115 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


138 


50 


13116 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


110 


56 


13117 


AK025653 


Homo sapiens 


unnamed protein product 


188 


100 


13118 


AF 13 0089 


Homo sapiens 


PRO2550 


129 


68 


13119 


AF161356 


Homo sapiens 


HSPC093 


131 


59 


13120 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


121 


45 


13121 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


82 


55 


13122 


GO0637 


Homo sapiens 


Human secreted protein, SEQ ID NO: | 87 . 


45. 
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SEQ ID 

NO: 


Accession No. 




Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








4718. 








... 

Y J94 


Homo. sapiens 


Human secreted protein encoded by 
gene No. 9. 


21 J 


100 


13124 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


55 


13125 


AF1 19855 


Homo sapiens 


PRO 1 847 


109 


45 






Homo sapiens 


Neura! thread protein. 


1 j7 


47 ..... 


1^127 — 


BO 1 390 


Homo sapiens 


Neuron-associated protein. 




47 


13128 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


122 


67 


13129 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


105 


52 


13130 


G 02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


170 


72 


13131 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


500 


90 


13132 


AF138957 


Bos taurus 


type II collogen cyanogen bromide 
fragment CB8 


99 


38 


131 j3 


AF084256 


Homo sapiens 


beta glucuronidase isoform d 


157 


69 


1 j1 j4 


U92817 


Homo sapiens 


unnamed HERV-H protein 


1 13 


48 


13135 


Y 19743 


Homo sapiens 


SEQ ID NO 461 from W09922243. 


967 


99 


13136 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


52 


13137 


AF090928 


Homo sapiens 


PRO0470 


133 


69 


13138 


AF09093 1 


Homo sapiens 


PRO0483 


150 


60 


13139 


AF1 16661 


Homo sapiens 


PR01438 


146 


57 


13140 


AL0223 1 8 


Homo sapiens 


bK150C2.2 (Phorbolin3) 


443 


56 


13141 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


114 


80 


13142 


AF055985 


Onchocerca 
volvulus 


pyrrolidone-rich antigen 


86 


43 


13143 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 


62 


13144 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


146 


66 


13145 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


123 


71 


13146 


AF1 07406 


Homo sapiens 


GW128 


120 


50 


13147 


LI 0908 


Mus musculus 


Gcapl gene product 


91 


42 


13148 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


113 


66 


13149 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


119 


67 


13150 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


168 


86 


13151 


G0041S 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4499. 


74 


58 


13152 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


108 


61 


13153 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


131 


67 


13154 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


113 


54 


13155 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


127 


84 


13156 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


167 


80 


13157 


YI4482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


125 


68 


13158 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


514 


71 


13159 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


130 


66 
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NO? 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Idenrit 








SEQ ID "NO: 163. 






13160 


AF116661 


Homo sapiens 


PRO 1438 


152 


63 


13161 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


105 


53 


13162 


AF1 07406 


Homo sapiens 


GW128 


159 


62 


13163 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


132 


59 


13164 


M15530 


Homo sapiens 


B-cell growth factor 


97 


80 


13165 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


67 


36 


13166 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJ W 1 8 . 


126 


57 


13167 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


95 


53 


13168 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


96 


57 


13169 


AL049795 


Homo sapiens 


dJ622L5.7.2 (novel protein (isoform 2)) 


250 


95 


13170 


G00366 


Homo sapiens 


Human secreted protein, SEQ ID NO - 
4447. 


1 16 


56 


13171 


LI 0908 


Mus musculus 


Gcapl gene product 


75 


53 


13172 


S73853 


Homo sapiens 


NF2=neurofibromatosis type 2 
{alternatively spliced, form A4} 


160 


73 


13173 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


134 


76 


13174 


A52568 


Homo sapiens 


HMGI-C 


84 


44 


13175 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


91 


49 


13176 


AK025116 


Homo sapiens 


unnamed protein product 


130 


44 


13177 


AF090901 


Homo sapiens 


PRO0195 


103 


84 


13178 


AF090894 


Homo sapiens 


PRO0M3 


92 


62 


13179 


AF 130079 


Homo sapiens 


PR02852 


118 


38 


13180 


AK025047 


Homo sapiens 


unnamed protein product 


141 


67 


13181 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


87 


59 


13182 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


220 


74 


13183 


S79980 


Bos taurus 


ribosomal protein L37 


186 


94 


13184 


AP000616 


Oryza sativa 


similar to RTNG-H2 finger protein 
RHAla(AF078683) 


113 


90 


13185 


AF1 55232 


Pisum sativum 


extensin 


95 


36 


13187 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


90 


78 


13188 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO" 
7868. 


1 13 


79 


13189 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


158 


53 


13190 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW 18. 


130 


59 


13191 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla (AF078683) 


131 


66 


13192 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


439 


86 


13193 


Z66499 


Caenorhabditis 
elegans 


T01B7.8 




35 


13194 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


126 


61 


13195 


Y13141 


Bromheadia 
fmlaysoniana 


extensin 


58 


43 


13196 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


95 


76 


13197 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


116 


60 
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% 
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13198 


X58236 


Homo sapiens 


36/8-8 fusion protein with epitope for 
anti-!ectin antibody 


98 


41 


13199 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


102 


58 


13200 


AF220264 


Homo sapiens 


MOST-1 


141 


45 


13201 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


112 


58 


13202 


Y17221 


Homo sapiens 


Human secreted protein (clone £k317- 
3). 


115 


57 


13203 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1. 


91 


39 


13204 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


126 


65 


13205 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


98 


68 


13206 


M15530 


Homo sapiens 


B-cell growth factor 


105 


76 


13207 


W48927 


Homo sapiens 


Schwannomin-binding protein C- 
terminal fragment. 


84 


45 


13208 


L27428 


Homo sapiens 


reverse transcriptase 


205 


49 


13209 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


77 


57 


13210 


M81321 


Macaca 
fascicularis 


proline-rich protein 


104 


48 


13211 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


122 


33 


13212 


M15530 


Homo sapiens 


B-cell growth factor 


159 


61 


13213 


AF 130089 


Homo sapiens 


PRO2550 


126 


56 


13214 


AF2 10651 


Homo sapiens 


NAG18 


125 


68 


13215 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


114 


76 


13216 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


157 


70 


13217 


R59842 


Homo sapiens 


ApoE4Ll protease. 


102 


72 


13218 


K01 664 


TDrosophila 
melanogaster 


Bkm-like protein 


123 


37 


13219 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


101 


55 


13220 


D14167 


Gallus gallus 


ribosomal protein L37a 


126 


40 


13221 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


178 


82 


13222 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


141 


55 


13223 


LI 0908 


Mus musculus 


Gcapl gene product 


114 


53 


13224 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


216 


73 


13225 


G03259 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7340. 


104 


76 


13226 


AF130079 


Homo sapiens 


PR02852 


158 


66 


13227 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


125 


60 


13228 


S77772 


Homo sapiens 


aspartylglucosaminidase, AGA {C- 
terminal, alternatively spliced} {EC 
3.5.1.26} 


100 


46 


13229 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


156 


72 


13230 


M15530 


Homo sapiens 


B-cell growth factor 


162 


72 


13231 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


161 


66 


13232 


AL160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


80 I" 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Idcntit 


13233 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


126 


--— 


13234 


AF126163 


Homo sapiens 


HHLA3 protein 


1 3 1 




13235 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


125 


85 


13236 


AB001684 


Chlorella 
vulgaris 


ORF49b 


55 


45 


13237 


M15530 


Homo sapiens 


B-cell growth factor 


1 14 


~66 


13238 


AL3 55929 


Neurospora 


conserved hypothetical protein 


99 


46 


13239 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF0786S3) 


114 


63 


13240 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


1 10 




13241 


AF1 19851 


Homo sapiens 


PRO 1722 


167 


63 


13242 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


147 


68 


13243 


U28971 


Caenorhabditis 
elegans 


similar to RJD tandem repeat region of 
RD protein (nuclear ma-bindin CT 
protein) 


95 


~65 


13244 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


99 


84 


13245 


Y 14437 


Homo sapiens 


Human secreted protein encoded by 
gene 27 clone HSAWA27. 


104 


62 


13246 


AF1 61356 


Homo sapiens 


HSPC093 


116 


67 


13247 


X61046 


Hydra sp. 


mini -col J a gen 






13248 


V00662 


Homo sapiens 


URF 3 (NADH dehydrogenase subunit) 


458 


-~ 


13249 


MI 1900 


Mus musculus 


15-kDa proline-rich salivary protein 


92 


37 


13250 


ABO 173 62 


Bombyx mori 








13251 


G02532 


Homo sapiens 


Human secreted protein SEQ ID NO' 
6613. 


125 





13252 


G00360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4441. 


122 


58 


13253 


AF2 10651 


Homo sapiens 


NAG 18 


151 


73 


13254 


AF224494 


Mus musculus 


arsenite inducible RNA associated 
protein 


299 




13255 


AK026107 


Homo sapiens 


unnamed protein product 


102 


60 


13256 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


1 13 


61 


13257 


AK024455 


Homo sapiens 


FLJ00047 protein 


1 15 




13258 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


155 


"31 


13259 


AJ251579 


Arabidopsis 
thaliana 


cef protein 


1 15 


39 


13260 


L78671 


Homo sapiens 


CoxlJ/D-loop DNA fusion protein 


360 




13261 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


151 


61 


13262 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


413 


— 


13263 


D23661 


Homo sapiens 


ribosomal protein L37 


487 


94 


13264 


AF 1 30079 


Homo sapiens 


PR02852 






13265 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


109 


61 


13266 


Y01154 


Homo sapiens 


Protein sequence Seq Id 54 from 
WO9901020. 


133 


79 


13267 


LI 0908 


Mus musculus 


Gcapl gene product 


103 


43 


13268 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


151 


70 


13269 


GO4064 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8145. 


97 


51 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Watermnn 

Score 


y 


13270 


AF 130089 


Homo sapiens 


PRO2550 


95 


47 


13271 


AF1 30075 


Homo sapiens 


PR02532 


90 


60 


13272 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


103 


54 


13273 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


77 


91 


13274 


AF 130089 


Homo sapiens 


PRO2550 


104 


57 


13275 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


94 


45 


13276 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


122 


52 


13277 


AF 113685 


Homo sapiens 


PRO0974 


98 


36 


13278 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


85 


68 


13279 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6596. 


139 


89 


13280 


G01I29 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5210. 


243 


79 


13281 


AF1 16715 


Homo sapiens 


PR02829 


108 


56 


13282 


G037S9 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


105 


60 


13283 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


194 


78 


13284 


AL359782 


Trypanosoma 
brucei 


probable granule cell antiserum 
positive 8 


104 


47 


13285 


S79410 


Mus muscuius 


nuclear localization signal binding 
protein 


104 


72 


13286 


AF 13 0089 


Homo sapiens 


PRO2550 


147 


58 


13287 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


133 


58 


13288 


AF 119855 


Homo sapiens 


PRO 1847 


115 


45 


13289 


AF161356 


Homo sapiens 


HSPC093 


131 


50 


13290 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


104 


43 


13291 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


112 


49 


13292 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


150 


69 


13293 


M36647 


Homo sapiens 


mitochondrial hinge protein precursor 


284 


76 


13294 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


444 


85 


13295 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


633 


93 


13296 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJ W 18. 


164 


76 


13297 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 1 02, variant a dna, 
complete virion genome. 


147 


80 


13298 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


84 


38 


13299 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


139 


49 


13300 


AF 13 0079 


Homo sapiens 


PR02852 


98 


60 


13301 


AF090894 


Homo sapiens 


PRO0113 


114 


52 


13302 


AF1 30079 


Homo sapiens 


PR02852 


161 


83 


13303 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


154 


76 


13304 


AF1 18086 


Homo sapiens 


PRO 1992 


120 


75 


13305 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


100 


73 


13306 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


108 


56 
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SEQ ID 
NO: 


Accession No. 




Description 


Smith- 
Waterman 


% 

Identit 


13307 


Y74110 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #297. 


448 


100 


13308 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


85 


65 


13309 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


104 


47 


13310 


AF1 16715 


Homo sapiens 


PR02829 


124 


76 


1331 1 


GO 1 246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


142 


63 


13312 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


155 


61 


13313 


AF1 16661 


Homo sapiens 


PRO 143 8 


142 


67 


13314 


AF161356 


Homo sapiens 


HSPC093 


121 


46 


13315 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


169 


52 


13316 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


98 


90 


13317 


AF 130079 


Homo sapiens 


PR02852 


109 


58 


13318 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


113 


80 


13319 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121." 


81 


61 


13320 


AF1 30089 


Homo sapiens 


PRO2550 


106 


77 


13321 


AF090931 


Homo sapiens 


PRO0483 


143 


63 


13322 


D29833 


Homo sapiens 


proline rich peptide P-B 


59 


33 


13323 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


417 


78 


13324 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


119 


82 ~* 


13325 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWl 8. 


168 


, 


13326 


A3 1038 


Nicotiana alata 


PRP3 


88 


41 


13327 


AF090895 


Homo sapiens 


PRO0117 


87 


55 


13328 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


88 


51 


13329 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


103 


66 


13330 


AL390114 


Leishmania 
major 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


101 


67 


13331 


AF090930 


Homo sapiens 


PRO0478 


93 


36 


13332 


AF159055 


Homo sapiens 


leucine zipper-like protein | 


118 


71 


13333 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


89 


57 


13334 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


121 


80 


13335 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


160 


73 


13336 


AJ005562 


Mus musculus 


SPR2D protein 


86 


44 


13337 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


169 


66 


13338 


G00528 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4609. 


78 


57 


13339 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


102 


91 


13340 


AF090895 


Homo sapiens 


PRO0117 


137 


47 


13341 


K01 664 


Drosophila 
melanogaster 


Bkm-like protein 


115 


46 


13342 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


100 


76 


13343 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


65 
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NO: 


Accession No. 


Species 


Description 


Waterman 
Score 


Identit 








7795. 






13344 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


76 


76 


13345 


Y86248 


Homo sapiens 


Human secreted protein HCHPF6S, 
SEQ ID NO: 163. 


111 


72 


13346 


G00552 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4633. 


143 


69 


13347 


AF1 19900 


Homo sapiens 


PR02822 


93 


58 


13348 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


169 


75 


13349 


W48353 


Homo sapiens 


Human breast cancer related protein 
BCFLT2. 


83 


68 


13350 


AF1 16715 


Homo sapiens 


PR02829 


134 


64 


13351 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


115 


65 


13352 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


118 


59 


13353 


Y02887 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


139 


77 


13354 


AF1 16661 


Homo sapiens 


PRO 143 8 


112 


75 


13355 


G 00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


170 


73 


13356 


AF126163 


Homo sapiens 


HHLA3 protein 


116 


55 


13357 


U18339 


Variola virus 


D4L 


101 


58 


13358 


AF1 30089 


Homo sapiens 


PRO2550 


127 


52 


13359 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


184 


82 


13360 


Y02926 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 101. 


78 


50 


13361 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


126 


71 


13362 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


174 


76 


13363 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


176 


51 


13364 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


99 


41 


13365 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


118 


56 


13366 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


158 


50 


13367 


Y 19467 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


65 


64 


13368 


M15530 


Homo sapiens 


B-cell growth factor 


132 


77 


13369 . 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


119 


63 


13370 


AF1 30079 


Homo sapiens 


PR02852 


108 


61 


13371 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


150 


76 


13372 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


117 


65 


33373 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ lDNO:163. 


118 


57 


13374 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


121 


63 


13375 


AF1 18086 


Homo sapiens 


PRO 1992 


147 


74 


13376 


AF 130089 


Homo sapiens 


PRO2550 


149 


48 


13377 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


123 


55 



■493 
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SEQ ID 

NO: 


Accession No. 


Species 


— 

Description 


Smith- 
Score 


% 
y 


13378 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


120 


53 


13379 


AF090894 


Homo sapiens 


PRO0113 


101 


52 


133S0 


AF 107406 


Homo sapiens 


GW128 


126 


51 


13381 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


156 


84 


13382 


G03I72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


119 


80 


13383 


Y14437 


Homo sapiens 


Human secreted protein encoded by 
gene 27 clone HSAWA27. 


84 


70 


13384 


S79410 


Mus musculus 


nuclear localization sigual binding 
protein 


107 


53 


13385 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


107 


53 


13386 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


141 


65 


13387 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


113 


63 


13388 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


155 


77 


13389 


AF 130089 


Homo sapiens 


PRO2550 


159 


62 


13390 


AF161356 


Homo sapiens 


HSPC093 


154 


74 


13391 


AC006135 


Arabidopsis 
thaliana 


putative vicilin storage protein 
(globulin-like) 


171 


28 


13392 


AF113685 


Homo sapiens 


PRO0974 


226 


57 


13393 


AF1 18082 


Homo sapiens 


PRO 1902 


69 


68 


13394 


D23661 


Homo sapiens 


ribosomal protein L37 


498 


96 


13395 


AF 130079 


Homo sapiens 


PR02852 


109 


54 


13396 


AF] 19851 


Homo sapiens 


PRO 1722 


107 


57 ■ 


13397 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


165 


68 


13398 


AF 13 0089 


Homo sapiens 


PRO2550 


145 


68 


13399 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


134 


54 


13400 


J03798 


Homo sapiens 


small nuclear riboprotein Sm-D 


451 


93 


13401 


D29833 


Homo sapiens 


proline rich peptide P-B 


64 


39 


13402 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


99 


76 


13403 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


159 


76 


13404 


AF090894 


Homo sapiens 


PRO0113 


152 


70 


13405 • 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


101 


45 


13406 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


172 


72 


13407 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


135 


84 


13408 


G03714 . 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


125 


63. 


13409 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


138 


58. 


13410 


AF130089 


Homo sapiens 


PRO2550 


10] 


70 


13411 


AK024455 


Homo sapiens 


FLJ00047 protein 


112 


73 


13412 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


145 


48 


13413 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


95 


50 


13414 


AF090942 


Homo sapiens 


PRO0657 


171 


72 


13415 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} | 


120 


64 
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SEQ ID 


Accession No. 


Species 


Description 


Score 


% 


13416 


M15530 


Homo sapiens 


B-cell growth factor 


121 


45 


13417 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


115 


58 


13418 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


146 


70 


13419 


Y01181 


Homo sapiens 


Polypeptide fragment encoded by gene 
12. 


85 


64 


13420 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


77 


73 


13421 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


149 


62 


13422 


AF130089 


Homo sapiens 


PRO2550 


111 


39 


13423 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


106 


42 


13424 


AF 11 9882 


Homo sapiens 


PR02492 


110 


70 


13425 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


104 


59 


13426 


AF 1 18082 


Homo sapiens 


PRO 1902 


129 


48 


13427 


X70771 


Chironomus 
tentans 


Spl7 


95 


56 


13428 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


164 


66 


13429 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


149 


80 


13430 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


103 


63 


13431 


AF1 16661 


Homo sapiens 


PRO 1438 


128 


54 


13432 


AK024455 


Homo sapiens 


FLJ00047 protein 


117 


69 


13433 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


91 


40 


13434 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


118 


64 


13435 


AF090944 


Homo sapiens 


PRO0663 


115 


50 


13436 


X55687 


esculentum 


extensin (class II) 


73 


32 


13437 


AF090942 


Homo sapiens 


PRO0657 


77 


50 


13438 


AP000061 


Aeropyrum 
pernix 


1 04aa long hypothetical protein 


82 


41 


13439 


G00481 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4562. 


94 


51 


13440 


AF161356 


Homo sapiens 


HSPC093 


115 


53 


13441 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


125 


44 


13442 


Y59784 


Homo sapiens 


Human normal ovarian tissue derived 
protein 61. 


1096 


100 


13443 


U33547 


Homo sapiens 


MHC class II antigen 


120 


73 


13444 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


106 


71 


13445 


W54966 


Homo sapiens 


Synthetic human type III collagen 
SYN-C3. 


115 


34 


13446 


G02256 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6337. 


486 


100 


13447 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


155 


60 


13448 


AF130079 


Homo sapiens 


PR02852 


167 


56 


13449 


AB047890 


Macaca 
fascicularis 


hypothetical protein 


171 


62 


13450 


AK025326 


Homo sapiens 


unnamed protein product 


111 


67 


13451 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


93 


76 



495 
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SEQ ID 


Accession No. 


Species 


Description 


Smitb- 


% 


INO. 








Watcrmnn 


ldentit 














13452 


G 02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


91 








6574. 






13453 


U65650 


Arabidopsis 


blue-copper binging protein III 


103 


35 






thaliana 






13454 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


69 








6613. 






13455 


AF090894 


Homo sapiens 


PRO0113 


122 


56 


13456 


D38585 


Homo sapiens 


TSC-22 


710 


99 


13457 


Y02999 


Homo sapiens 


Fragment of human secreted protein 


1 13 


45 








encoded by gene 121. 






13458 


X55686 


Lycopersicon 


extensin (class II) 


64 


51 






esculentum 








13459 


AF 107406 


Homo sapiens 


GW128 


162 


54 


13460 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


145 


65 








gene 22 clone HMSJW18. 






13461 


X6I045 


Hydra sp. 


mini-collagen 


107 


34 


13462 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


46 








6913. 






13463 


Y35970 


Homo sapiens 


Extended human secreted protein 


189 


80 








sequence, SEQ ID NO. 219. 






13464 


L47668 


Homo sapiens 


alpha- 2 collagen type I 


64 


39 


13465 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


65 








6571. 






13466 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


159 


76 








gene No. 107. 






13467 


Y14437 


Homo sapiens 




-Tqq 


~~70 








gene 27 clone HSAWA27. 






13468 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


69 








6619. 






13469 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


39 








8153. 






2 3470 


M15530 


Homo sapiens 


B-cell growth factor 


~!44 


~57 


13471 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 16 


85 








7881. 






13472 


V00488 


Homo sapiens 


alpha globin 


704 


96 


13473 


AJ012166 


Canis familiaris 


brain-specific synapse associated 


56 


45 








protein, Bassoon 






13474 


AF1 16661 


Homo sapiens 


PRO 143 8 


128 


45 


13475 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


78 








5909. 






13476 


V00488 


Homo sapiens 


alpha globin 


278 


91 


13477 


W65762 


Homo sapiens 


Human HDCAP protein. 


490 


98 


13478 


AB033767 


Homo sapiens 


brain-selective and closely mapped on 


2184 


97 








the counter allele of CMAP in cystatin 


















13479 


AB033767 


Homo sapiens 


brain-selective and closely mapped on 


2184 


97 








the counter allele of CMAP in cystatin 












cluster 






13480 


Y42382 


Homo sapiens 


Amino acid sequence of 1x3 17 11. 


613 


95 


13481 


D29833 


Homo sapiens 


proline rich peptide P-B 


61 


58 


13482 


AK023277 


Homo sapiens 


unnamed protein product 


488 


98 


13483 


X67703 


Drosophila 


Mst84Da 


53 


60 






melanogaster 








13484 


Y27907 


Homo sapiens 


Human secreted protein encoded by 


135 


54 








gene No. 119. 






13485 


AF210350 


Sceloporus 


NADH dehydrogenase subunit 4 


149 


79 






siniferus 








13486 


Y 14544 


Danio rerio 


Hoxc8 protein 


98 


36 


13487 


AP002032 


Arabidopsis 




111 


27 



496 



_0164835A2_I_= 



WO 01/64835 



PCT/USO 1/0492.7 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






thaliana 








13488 


AB047846 


Homo sapiens 


gammal-COP 


4463 


100 


13489 


M37194 


Rattus 
norvegicus 


clathrin-associated protein 17 


739 


100 


13490 


Y59743 


Homo sapiens 


Human normal ovarian tissue derived 
protein 20. 


138 


96 


13491 


U013I7 


Homo sapiens 


G-gamma globin 


198 


90 


13492 


W04180 


Homo sapiens 


Cellubrevin-1. 


459 


98 


13493 


ALI21585 


Homo sapiens 


bA504H3.1 (SNX5 (sorting nexin 5)) 


2058 


100 


13494 


X53777 


Homo sapiens 


putative ribosomal protein (AA 1-184) 


970 


100 


13495 


AJ271881 


Homo sapiens 


bromodomain containing protein 


3378 


99 


13496 


AF1 32970 


Homo sapiens 


CGI-36 protein 


878 


100 


13497 


V73498 cdl 


Homo sapiens 


21-AUG-1997 Human S100P1 DNA. 


273 


100 


13498 


AL12I585 


Homo sapiens 


bA504H3.1 (SNX5 (sorting nexin 5)) 


2058 


100 


13499 


X82456 


Homo sapiens 


L1M and SH3 domain protein 


1420 


100 


13500 


AJ277932 


Homo sapiens 


RPB1 la protein 


588 


98 


13501 


Y53569 


Homo sapiens 


Consensus sequence for human RAD1 
binding proteins. 


633 


84 


13502 


W73434 


Homo sapiens 


Human secreted protein encoded by 
Gene No. 2. 


1903 


100 


13503 


AJ005981 


Sus scrofa 


cAMP-reguJated phosphoprotein 


577 


100 


13504 


AF1 16715 


Homo sapiens 


PR02829 


143 


65 


13505 


U97553 


murid 

herpesvirus 4 


unknown 


84 


35 


13506 


X06323 


Homo sapiens 


put. ribosomal protein L3 (AA 1 - 348) 


1903 


100 


13507 


W82397 


Homo sapiens 


Human UBP protein #3. 


4264 


99 


13508 


M 86246 


Homo sapiens 


EHS-2 


110 . 


71 


13509 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene22cloneHMSJW18. 


121 


50 


13510 


AF073519 


Homo sapiens 


small EDRK-rich factor 1 , long isoform 


91 


44 


13511 


G03812 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7893. 


591 


100 


13512 


Y04369 


Homo sapiens 


Human HUCEP-1 4 protein. 


1389 


100 


13513 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


136 


60 


13514 


AE000232 


Escherichia coli 
K12 


orf, hypothetical protein 


14S0 


100 


13515 


X76717 


Homo sapiens 


MT-11 protein 


382 


100 


13516 


Y00919 


Homo sapiens 


Human Rab protein, RABP-2, protein 
sequence. 


1039 


100 . 


13517 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


136 


67 


13518 


AF090895 


Homo sapiens 


PRO0117 


109 


56 


13519 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


109 


65 


13520 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


145 


71 


13521 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


93 


60 


13522 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


140 


72 


13523 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


130 


73 


13524 


M15530 


Homo sapiens 


B-cell growth factor 


112 


67 


13525 


AF1 19851 


Homo sapiens 


PRO 1722 


135 


57 


13526 


G00541 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4622. 


98 


82 


13527 


W48351 


Homo sapiens 1 Human breast cancer related protein 
| BCRB2. 


117 


41 



497 
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SEQ ID 

NO: 


Accession No. 




1 Description 


Smith- 
Waterman 
Score 


Identit 

y 




Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 




100 


1j529 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


79 


78 


13530 


AJ000536 


Arabidopsis 


COPl protein 


98 


32 


13531 


U50403 


Homo sapiens 


breast cancer suppressor element 
Ishmael Upper RP*^ 


101 


62 


13532 ' 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


156 


56 


13533 


Z80779 


Homo sapiens 


histone H2B 


628 


100 


135j>4 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795 " : . 


151 


62 


— 


Mil 759 


— : 

Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


82 


63 






Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


93 


56 




AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


139 


81 




AF1 30051 


Homo sapiens 


PRO0898 


90 


53 


13539 


X16832 


Homo sapiens 


preprocathepsin H (AA -22 to 3 14) 


1835 


99 


1j540 


Y02671 




Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


100 


80 


i 




— — : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


82 


13 ^ 42 — 


G00452 


-— : ; — 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
4533. 




65 


13543 


W88627 




Secreted protein encoded by gene 94 
clone HPMBQ32. 






13544 


AF130089 


Homo sapiens 


PRO2550 


107 


72 




M 3 6647 


Homo sapiens 


mitochondria] hinge protein precursor 


435 


89 


13546 — 


U638 1 0 


Homo sapiens 


WD40 protein Ciao 1 


505 


100 


13547 


AF090931 


Homo sapiens 


PRO0483 


126 


73 


13548 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


143 


65 


13549 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


82 


62 


13550 


AF1 13685 


Homo sapiens 


PRO0974 


126 


48 




G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


89 


53 




AF1 1 6661 


Homo sapiens 






69 


13553 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


85 


13554 


X16832 


Homo sapiens 


preprocathepsin H (A A -22 to 314) 


1835 


99 






Homo sapiens 


Human secreted protein clone yd73 1 
protein sequence SEQ IDNO:92. 


528 


84 


1j556 


GO0j52 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


126 


65 


13557 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


103 


67 


13558 


G03093 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7174. 


189 


75 


13559 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


131 


53 


13560 


AF 118086 


Homo sapiens | PRO 1992 


157 


71 


13561 


Y00320 


Homo sapiens 


Human secreted protein encoded by 
gene 64. 


101 


64 


13562 


AF159055 


Homo sapiens 


leucine zipper-like protein 


119 | 74 
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016483 5A2_L> 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO. 








Waterman 


Identit 










Score 




13563 


AF 118082 


Homo sapiens 


PRO 1902 


83 


86 


13564 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


234 . 


97 








5327. 






13565 


Y59778 


Homo sapiens 


Human normal ovarian tissue derived 


66 


100 








protein 55. 






13566 


GO] 828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


60 








5909. 






13567 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


152 


96 








SEQ ID NO: 163. 






13568 


X83703 


Homo sapiens 


nuclear protein 


1633 


99 


13569 


G02473 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


242 


45 








6554. 






13570 


G00521 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


90 








4602. 






13571 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


110 


80 








RHAla(AF078683) 






13572 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


114 


46 








RHAla(AF078683) 






13573 


X92744 


Homo sapiens 


hBD-1 


290 


83 


13574 


P70494 


Homo sapiens 


Sequence of human B-cell growth 


107 


68 








factor (BCGF). 






13575 


AF 107406 


Homo sapiens 


GW128 


356 


100 


13576 


W03642 


Homo sapiens 


Human cannabinoid GPRN-terminal 


101 


48 








sequence. 






13577 


AF220264 


Homo sapiens 


MOST-1 


95 


73 


13578 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


129 


68 








SEQ ID NO: 163. 






13579 


AK024455 


Homo sapiens 


FLJ00047 protein 


81 


76 


13580 


AF130051 


Homo sapiens 


PRO0898 


128 


64 


13581 


AF084256 


Homo sapiens 


beta glucuronidase isoform d 


142 


58 


13582 


AF1 18082 


Homo sapiens 


PRO 1902 


85 . 


51 


13583 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


114 


52 








encoded by gene 90. 






13584 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


66 


54 








6536. 






13585 


X52164 


Mus musculus 


Q300 protein (A A 1-77) 


109 


64 


13586 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


109 


77 






brucei 


complete virion genome. 






13587 


AF090901 


Homo sapiens 


PRO0195 


114 


53 


13588 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


66 








4478. 






13589 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


58 








7437. 






13590 


AK000521 


Homo sapiens 


unnamed protein product 


1313 


100 


13591 


AL357374 


Homo sapiens 


bA353C18.2 (novel protein) 


421 


100 


13592 


AL357374 


Homo sapiens 


bA353C18.2 (novel protein) 


421 


100 


13593 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


62 








7867. 






13594 


Y02749 


Homo sapiens 


Human secreted protein encoded by 


103 


56 








gene 100 clone HNFIU96. 






13595 


U16359 


Rattus 


nitric oxide synthase 


109 


76 






norvegicus 








13596 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


127 


88 








RHAla(AF078683) 






13597 


G00325 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


275 


100 








4406. 






13598 


W03642 


Homo sapiens 


Human cannabinoid GPRN-terminal 


108 


63 








sequence. 






13599 


AF116661 


Homo sapiens 


PRO 143 8 


128 


64 
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01 64835A2_I_> 
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SEQID 


Accession No. 


Species 


Description 


SmiJh- 

Waterman 

Score 


y 


13600 


X55682 


Lycopersicon 
esculentum 


extensin (class I) 


58 


43 


13601 


X52164 


Mus musculus 


Q300 protein (AA 1-77) 


106 


45 


13602 


U2S971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


122 


92 


13603 


AF149419 


Oryctolagus 
cuniculus 


eye sodium bicarbonate cotransport 
protein NBC2 


126 


75 


13604 


AJ005567 


Mus musculus 


SPR2I protein 


63 


40 


13605 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


150 


96 


13606 


G 03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


121 


60 


13607 


G02867 


Homo sapiens 


Human secreted protein, SEQ ]D NO: 
6948. 


85 


60 


13608 


G0341J 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


57 


35 


13609 


AF 107406 


Homo sapiens 


GW128 


108 


42 


13610 


D82345 


Homo sapiens 


NB thymosin beta 


167 


100 


13611 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


161 


59 


13612 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


125 


50 


13613 


Y32193 


Homo sapiens 


Human receptor molecule (REC) 
encoded by Incyte clone 044150. 


490 


80 


13614 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


115 


57 


13615 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


124 


58 


13616 


AF 11 9900 


Homo sapiens 


PR02822 


95 


53 


13617 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


124 


5S 


13618 


S79978 


Homo sapiens 


prion protein, PrP {octapeptide 
repeats} 


88 


42 


13619 


AF220264 


Homo sapiens 


MOST-1 


130 


81 


13620 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


95 


65 


13621 


AL1 32841 


Caenorhabditis 
elegans 


Y15E3A.3 


168 


85 


13622 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


118 


65 


13623 


M86246 


Homo sapiens 


EHS-2 


100 


63 


13624 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO. 163. 


104 


46 


13625 


AF067205 


Homo sapiens 


vesicle transport related protein 


198 


69 


13626 


U39529 


Echinometra 
mathaei 


bin din 


72 




13627 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


102 


52 


13628 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


123 


54 


13629 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


113 


46 


13630 


AF161536 


Homo sapiens 


HSPC051 


582 


100 


13631 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


158 


87 


13632 


AF130089 


Homo sapiens 


PRO2550 


105 


66 


13633 


AF1 16661 


Homo sapiens 


PRO 143 8 


116 


53 


13634 


AF1 16715 


Homo sapiens | PR02829 


101 


60 



500 



_0164835A2J_=. 



WO 01/64835 



PCT/USO 1/0492 7 



SEQ ID 

NO: 




Species 


Description 


Waterman 
Score 


Identit 

y 


13635 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


139 


63 


13636 


U33547 


Homo sapiens 


MHC class II antigen 


154 


79 


13637 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22. clone HMSJW1S. 


107 


75 


13638 


AF090895 


Homo sapiens 


PRO0117 


87 


62 


13639 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


131 


69 


13640 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


128 


60 


13641 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


128 


81 


13642 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


210 


67 


13643 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


2S7 


76 


13644 


Y73483 


Homo sapiens 


Human secreted protein clone yll 8_I 
protein sequence SEQ ID NO: 188. 


376 


88 


13645 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


130 


85 


13646 


Y87212 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO:251. 


423 


97 


13647 


U50403 


Homo sapiens 


breast cancer suppressor element 
Ishmael Upper RP2 


103 


86 


13648 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


128 


55 


13649 


X58521 


Homo sapiens 


nucleoporin p62 


2610 


98 


13650 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


141 


69 


13651 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


111 


81 


13652 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


140 


60 


13653 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7344. 


121 


46 


13654 


U 13066 


Nicotiana alata 


arabinogalactan-protein precursor 


92 


33 


13655 


S794I0 


Mus musculus 


nuclear localization signal binding 
protein 


134 


50 


13656 


Y07766 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 23. 


156 


100 


13657 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1S. 


124 


48 


13658 


GO I 246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


171 


80 


13659 


AK000017 


Homo sapiens 


unnamed protein product 


611 


100 


13660 


U90446 


Mus musculus 


RNAse L inhibitor 


3100 


99 


13661 


AE000882 


Methanothermob 
acter 

thermoautotrophi 
cus 


phosphoenolpyruvate synthase 


88 


38 


13662 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


128 


65 


13663 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


97 


34 


13664 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


77 


41 


13665 


Y94890 


Homo sapiens 


Human protein clone HP02798. 


325 


98 


13666 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


65 


13667 


AF090930 


Homo sapiens 


PRO0478 


119 


88 


13668 


AF130089 


Homo sapiens 


PRO2550 


120 


77 



501 



0164835A2_J_> 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identir 

y 


13669 


AC006693 


Caenorhabditis 
elegans 


Hypothetical protein W02H5.e 


165 


85 


13670 


W80293 


Homo sapiens 


Human translocation associated protein 
designated Gp25L-H. 


1003 


95 


13671 


U23455 


Caenorhabditis 
elegans 


similar to D. melanogaster homeotic 
protein BarH2 (PIR:A41726) 


113 


91 


13672 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


160 


60 


13673 


AL 132841 


Caenorhabditis 
elegans 


Y15E3A.3 


178 


75 


13674 


AF118086 


Homo sapiens 


PRO 1992 


85 


75 


13675 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


132 


55 


13676 


AF220264 


Homo sapiens 


MOST- 1 


133 


85 


13677 


Y 19730 


Homo sapiens 


SEQ ID NO 448 from W09922243. 


122 


51 


13678 


Y65416 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
ID NO: 1577. 


467 


98 


13679 


Y16589 


Homo sapiens 


A protein that interacts with 
presenilins. 


2286 


99 


13680 


AL1 60371 


Leishmania 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


104 


52 


13681 


AF090931 


Homo sapiens 


PRO0483 


] 17 


67 


13682 


X55686 


Lycopersicon 
esculentum 


extensin (class II) 


60 


56 


13683 


AF090944 


Homo sapiens 


PRO0663 


93 


90 


13684 


AL096770 


Homo sapiens 


bA150A6.2 (novel 7 transmembrane 
receptor (rhodopsin family) (olfactory 
receptor like) protein (hs6Ml-21)) 


178 




13685 


AF1 16661 


Homo sapiens 


PRO 143 8 


115 


60 


13686 


U62039 


Elephantulus 
edwardii 




86 




13687 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


77 




13688 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 






13689 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


103 




13690 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


107 




13691 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


178 


85 


13692 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


94 


35 


13693 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
7888. 


143 


78 


13694 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


98 


50 


13695 


D63163 


Rattus sp. 


cyclin C 


111 


86 


13696 


Y02671 


Homo sapiens 


gene 22 clone HMSJ Wl 8. 






13697 


AP000616 


Oryza sativa 


similar to RJNC3-H2 finger protein 
RHAla(AF078683) 


124 


80 


13698 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1 7. 


129 


70 


13699 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


93 


58 


13700 


W48353 


Homo sapiens 


Human breast cancer related protein 
BCFLT2. 


100 


73 


13701 


M76744 


Homo sapiens 


BGP 


112 


52 



502 



_0164835A2J_> 



WO 01/64835 



PCT/US01/04927 



NO: 


Accession No. 


Species 


Description 


Waiter i i 


Idcntit 

y 


13702 


Y15155 


Homo sapiens 


phosphorylase kinase beta-subunit 


206 


100 


13703 


AF090901 


Homo sapiens 


PRO0195 


100 


70 


13704 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


131 


79 


13705 


R95913 


Homo sapiens 


Neural thread protein. 


106 


36 


13706 


Ml 5073 


Homo sapiens 


MHC HLA-DR-beta- 1 chain 


74 


77 


13707 


Y86248 


Homo sapiens 


Human secreted protein HCHPF6S, 
SEQ ID NO: 3 63. 


1 10 


35 


13708 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


150 


56 


13709 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


89 


77 


13710 


X70775 


Chironomus 
cingulatus 


Sp 12 gene homologue 


85 


38 


13711 


X80265 


Hordeum 
vulgare 


structural protein 


96 


41 


13712 


AB007922 


Homo sapiens 


KJAA0453 protein 


147 


64 


13713 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


124 


60 


13714 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


108 


65 


13715 


AF1 16661 


Homo sapiens 


PRO 143 8 


120 


77 


13716 


AF1 18086 


Homo sapiens 


PRO 1 992 


127 


64 


13717 


AK024455 


Homo sapiens 


FLJ00047 protein 


93 


56 


13718 


AF1 16661 


Homo sapiens 


PRO 143 8 


137 


55 


13719 


W48352 


Homo sapiens 


Human breast cancer related protein 
BCFLT1. 


93 


59 


13720 


AF1 19851 


Homo sapiens 


PRO 1722 


94 


58 


13721 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


124 


45 


13722 


Y27S68 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


1 17 


58 


13723 


AF161363 


Homo sapiens 


HSPC098 


115 


50 


13724 


Y95829 


Homo sapiens 


Native human Tie receptor signal 
peptide. 


108 


100 


13725 


AF1 18086 


Homo sapiens 


PRO 1992 


166 


75 


13726 


AF 116636 


Homo sapiens 


PR01488 


95 


70 


13727 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


70 


13728 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


149 


60 


13730 


G00673 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4754. 


118 


63 


13731 


U33547 


Homo sapiens 


MHC class 11 antigen 


123 


69 


13732 


R59843 


Homo sapiens 


ApoE4Lx2 protease. 


135 


88 


13733 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


138 


75 


13734 


AF026204 


Caenorhabditis 
elegans 


C30E1.1 gene product 


102 


51 


13735 


Y 12950 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


117 


63 


13736 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


112 


87 


13737 


AL3 59782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


115 


58 


13738 


AF130089 


Homo sapiens 


PRO2550 


122 


70 


13739 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


124 


74 


13740 


AF130051 


Homo sapiens | PRO0898 


85 


70 



503 



0164835A2_i_> 



WO 01/64835 PCT/USO 1/04927 



SEQID 

NO: 


Accession No. 




Description 


Smith- 
Waterman 


% 

Iclentit 

y 


13741 


AF093748 




KJH type splicing regulatory protein; 
KSRP 










Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 












6613.. 






13744 


G03790 


Homo sapiens 


7871. 


91 


48 


13745 


G02538 


Homo sa iens 

omo sapiens 


Human secreted protein SEQ ID WO' 
6619. 


1 1 6 


— 


13746 


AB001684 


Chlorella 
vulgaris 


ORF54d 


70 


66 


13747 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


140 


60 


13748 


AF090901 


Homo sapiens 


PRO0 1 95 


92 


39 


13749 


AF090895 


Homo sapiens 


PRO0117 


148 


67 


13750 


AF1 19882 




PR02492 






13751 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

ff^^' j : — — 


109 


60 




G00397 


: 

riomo sapiens ' 


Human secreted protein, SEQ ID NO: 


TQ 

129 




65 


13753 


AF161356 


Homo sapiens 


HSPC093 


99 


48 


13754 . 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18 


124 


70 


13755 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


120 


51 


13756 


AF1 30089 


Homo sapiens 


PRO2550 


142 


96 


13757 


AF130089 




PRO2550 






13758 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


163 


50 


13759 


Y 19609 


Homo sa iens 

Homo salens 


SEQ ID NO 327 from W09922243. 


— 


— 


13760 


G02832 


omo sapiens 


Human secreted protein SEQ ID NO' 
6913. 


-— 


— 


13761 


U62040 


Elephantulus 
edvvardii 




134 


— 


13762 


AB001684 


Chlorella 


ORF49b 


1 00 


45 


13763 


AF1 18082 


Homo sapiens 


PRO 1902 


129 


50 


13764 


AF220264 


Homo sapiens 


MOST-1 


129 


66 


13765 


R59842 


Homo sapiens 








13766 


AF220264 


Homo sapiens 


MOST-1 


115 


71 


13767 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


73 


13768 


AF 130089 


Homo sapiens 


PRO2550 


97 


34 


— 


U80739 


Homo sapiens 






100 




G03714 


riomo sapiens 


Human secreted protein, SEQ ID NO: 

77^ 5 . -r^-Ti : -r= : : 


128 


55 


13771 


R1Q7g5 


Homo sapiens 


Non-A non-B hepatitis specific 
antigenic protein encoded by 


~T02 





— 


AE004507 


Pseudomonas 

aeruginosa 


hypothetical protein or bacteriophage 
Pfl 


— 

3 




44 


13773 


AB044885 


Canis familiaris 


dopamine receptor D4 


78 


52 


13774 


AF026246 


Homo sapiens 


HERV-E envelope glycoprotein 


108 


53 


13775 


S71805 


Homo sapiens 


RNA-binding protein=TLS/FUS-ERG 


126 


100 


13776 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


156 


82 


13777 . 


ABO 15727 


Mus musculus 


truncated granzyme M . . 


86 


37 


13778 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


74. ... 


52 . . 



504 



0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 





Accession No. 


Species 


Description 


Smith- 


% 


NO: 




















Score 


y 








6913 






13779 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


137 


61 








encoded by gene 17. 






13780 


M14123 


Homo sapiens 


neutral protease large subunit 


246 


55 


13781 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


99 


67 








gene 22 clone HMSJW1 8. 






13782 


AF 132972 


Homo sapiens 


CGI-38 protein 


902 


99 


13783 


G02469 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


147 


63 








6550. 






13784 


G 02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


68 








6613. 






13785 


AF1 30051 


Homo sapiens 


PRO0898 


138 


72 


13786 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 


123 


95 








RHAla(AF078683) 






13787 


Y64890 


Homo sapiens 


Human 5" EST related polypeptide SEQ 


92 


53 








IDNO:1051. 






13788 


U 12206 


Homo sapiens 


unknown 


84 


42 


13789 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


58 








7874. 






13790 


S79410 


Mus musculus 


nuclear localization signal binding 


102 


50 








protein 






13791 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


76 








6292. 






13792 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


72 








4435. 






13793 


AB006006 


Bos taurus 


neurocalcin alpha 


1000 


100 


13794 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


183 


95 


13795 


W34499 


Homo sapiens 


Obesity receptor C protein. 


120 


85 


13796 


U28971 


Caenorhabditis 


similar to RD tandem repeat region of 


127 


67 






elegans 


RD protein (nuclear rna-binding 












protein) 






13797 


U41038 


Caenorhabditis 
elegans 


Similar to cadherin-type repeat 


182 


83 


13798 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


112 


71 








gene 22 clone HMSJW18. 






13799 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


91 


84 


13800 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


67 








6983. 






13801 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


72 








7795. 






13802 


AF 119900 


Homo sapiens 


PR02822 


131 


50 


13803 


AK024435 


Homo sapiens 


FLJ00025 protein 


117 


95 


13 804 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


53 








6613. 






13805 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


48 








6613. 






13806 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: - 


98 


66 








4478. 






13807 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


66 








4478. 






13808 


AL359782 


Trypanosoma 


possible (hhv-6) ul 1 02, variant a dna, 


150 


54 






brucei 


complete virion genome. 






13809 


M37679 


Mus musculus 


Ig heavy chain precursor 


70 


100 


13810 


AL451015 


Neurospora 


putative protein 


96 


55 






crassa 








13811 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


88 


77 








gene No. 107. 






13812 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


71 








7795. 







505 



0164835A2_L> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


Identit 


13813 


AF1 30089 


Homo sapiens 


PRO2550 


93 


75 


13814 


KOI 664 


DrosophiJa 
melanogaster 


Bkni-like protein 


119 


60 


13815 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
7795. 


130 


64 


13816 


AJ013435 


Blackstonia 
imperfoliata 


maturase 


93 


48 


13817 


S58722 


Homo sapiens 


X-Iinked retinopathy protein {C- 
terminal, clone XEH.Sc} 


130 


76 


13818 


AF119851 


Homo sapiens 


PRO 1722 


141 


50 


13819 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


74 


60 


13820 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


76 


72 


13821 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


135 


50 


13822 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


95 


52 


13823 


L10908 


Mus musculus 


Gcap 1 gene product 


103 


50 


13824 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


146 


62 


13825 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


125 


83 


13826 


LI 0908 


Mus musculus 


Gcap I gene product 


96 


43 


13827 


G02971 


Homo sapiens 


Human secreted protein, SEQ ID NO - 
7052. 


102 


64 


13828 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


135 


53 


13829 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


142 


70 


13831 


AP000060 


Aeropyrum 
pemix 


lOlaa long hypothetical protein 


79 


50 


13832 


AF 11 9900 


Homo sapiens 


PR02822 


94 


46 


13833 


Y19767 


Homo sapiens 1 


SEQ ID NO 485 from W09922243. 


145 


40 


13834 


AF2 18028 


Homo sapiens 


unknown 


1 17 


57 


13835 


L10908 


Mus musculus 


Gcapl gene product 


79 


45 


13836 


Z26876 


Homo sapiens 


ribosomal protein 


129 


93 


13837 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


124 


76 


13838 


AF044311 


Homo sapiens 


gamma-synuclein 


603 


99 


13839 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


100 


82 


13840 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


154 


46 


13841 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


122 


64 


13842 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


92 


62 


13843 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
protein) 


93 


60 


13844 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 


63 


13845 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


145 


79 


13846 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7873. 


102 


50 


13847 


G03469 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7550. 


101 


68 



506 



_0164835A2_L> 



WO 01/64835 



PCT/US01/O4927 



SEQID 
NO: 


Accession No. 




Description 


Smith- 
Waterman 


% 

Identit 


13848 


G03240 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7321. 


107 


43 


13849 


U05313 


Trypanosoma 
brucei 


CR3 


98 


40 


13850 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear rna-binding 
protein) 


100 


65 


13851 


API 59055 


Homo sapiens 


leucine zipper-like protein 


103 


55 


13852 


R59842 


Homo sapiens 


ApoE4Ll protease. 


99 


71 


13853 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


142 


65 


13854 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


119 


79 


13855 


R59842 


Homo sapiens 


ApoE4Ll protease. 


93 


69 


13856 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


1 14 


76 


13857 


W48353 


Homo sapiens 


Human breast cancer related protein 
BCFLT2. 


94 


57 


13858 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


105 


58 


13859 


AF289022 


Homo sapiens 


formiminotransferase cyclodeaminase 
form C 


467 


100 


13860 


AF078844 


Homo sapiens 


hqp0376 protein 


488 


100 


13861 


AB032436 


Homo sapiens 


brain-specific Na-dependent inorganic 
phosphate cotransporter 


2968 


100 


13862 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


91 


72 


13863 


AF1 30079 


Homo sapiens 


PR02852 


131 


63 


13864 


Z29701_cdl 


Homo sapiens 


29-MAY-1998 Wild-type human c-Src 
tyrosine kinase cDNA. 


2380 


100 


13865 


L77967 


Ovis aries 


small proline-rich protein with paired 
repeat 


80 


33 


13866 


AL050318 


Homo sapiens 


dJ977B1.5 (myosin regulatory light 
chain 2, smooth muscle isoform) 


904 


100 


13867 


U37690 


Homo sapiens 


RNA polymerase II subunit 


358 


100 


13868 


X13923 


Homo sapiens 


cytochrome c oxidase subunit VIb (AA 
1-86) 


491 


100 


13869 


LI 3848 


Homo sapiens 


RNA helicase A 


6669 


99 


13870 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


133 


46 


13871 • 


K02064 


Bos taurus 


cytochrome c oxidase subunit IV 
precursor EC 1.9.3.1 


96 


72 


13872 


D31763 


Homo sapiens 


ha0946 protein is Kruppel-related. 


4606 


96 


13873 


Z14014 


Nicotiana 
tabacum 


Pistil extensin like protein, partial CDS 
only 


83 


43 


13874 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


117 


70 


13875 


Z52203_cdl 


Homo sapiens 


17-SEP-1998 Human PR0217 protein 
encoding cDNA, UNQ 191. 


2135 


99 


13876 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


106 


64 


13877 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


143 


50 


13878 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


104 


60 


13879 


L25404 


Brassica napus 


cyclin 


124 


42 


13880 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


139 


59 


13881 


AC003113 


Arabidopsis 
thaliana 


F2401.6 


70 


64 



507 
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SEQID 1 Accession No. 
NO: 


Species 


Description 


Smith- 
Waterman 


% 

Idcntit 

y 


13882 


AB000098 


Rattus 
norvegicus 


MIPP65 


905 


50 


13883 


Z95114 


Homo sapiens 


DK212A2.1 (TNF-inducible protein 
CG12-1 (similar to apoiipoprotein L)) 


1639 


100 


13884 


AF132984 


Homo sapiens 


Nuclear pore complex interacting 
protein NPIP 


551 


82 


13885 


AF121862 


Homo sapiens 


Sorting nexin 13 


1453 


99 


1^886 


G02493 


Homo sapiens. 


Human secreted protein, SEQ ID NO: 
6574. 


83 


69 


13887 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


109 


77 




AF217197 


Homo sapiens 


FBP interacting repressor 


2725 


99 


13889 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 6 


187 


94 


13890 


AF1 18086 


Homo sapiens 


PR01992 


85 


62 


13891 


M58664 


Homo sapiens 


Signal transducer CD24 


371 


97 


13892 


AK023443 


Homo sapiens 


Unnamed protein product 


125 


45 


13893 


AB040972 


Homo sapiens 


KIAA 1539 protein 


2271 


99 


13894 


U03750 


Escherichia coli 


DeaD 






13895 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 





"41 — 




AF078851 


Homo sapiens 


Secretogranin III 






13897 


U62039 


Elephantulus 
edwardii 


Reverse transcriptase 


109 


48 


13898 


X07816 


Human 
herpesvirus 4 


Epitope C13 (57 AA) 


55 


53 


13899 


AF1 16661 


Homo sapiens 


PR01438 


146 


48 


13900 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


77 


60 


13901 


Y28643 


Homo sapiens 


Human serine protease inhibitor from 
cDNA clone HETDK50. 


2191 


100 
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TABLE 3 



SEQ ID 

nucleotide 
sequence 


SEQ ID 

peptide 
sequence 


M 
od 


SEQ ID 

USSN 

09/515,1 

26 


Predicted 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 

nucle- 

location 

ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 

E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 

nucleotide insertion 


1 


13902 


A 


I 


114 


434 


AI FKCVEGMFRIAMVNVCFVSSGSLLI * 
PLTY/GVYDEWTHFAYMTIDLLEIPITG 
SHPVVLNALFCLEAP\WISPNTGSFAYP 
VYPKSLIAHDFAVEATMPYIRLSST 














KSNIPNLGDCGWESLFNR\QSVJRSSIjAV 
\NDTYSSKKSNAETFTFHADLCTLiSDKD 
RPITIQTAliAELVKHKPKATYEQLIAVL 
DEF/ANFLKKWWKAYDKENLFCEEG* KL 
CAASN 


3 


13904 


A 


4 


1 


427 


EGFLELLRTRNHSNS QLQLTTG IGLFLN 
EGLiKLVDKFLED V* K * YHS ETFTVNFSD 
TE*AMKHINDYVEKGTQGKIVDLVKELD 
RDTVFDLANYIFFKGKWDRPFEVNDTEE 
EDFHVDQVSTVNEPIMKLLS \MLWIHPC 


4 


13905 


A 


5 


1 


464 


KIKSFYASKDTIKRMRVTDWKKIFAY*! 
SDKELIFTLRTLKT**K*GKQPNLKNGQ 
EI * VP1SPQDIQIAHK\HLEGWSTSLVS 
E LAP CEAP V / RH P LTG LTI AGLQG FG EA 
GRLVRGRWGC*W/VHPF*KYI/WQFLSK 
LHTSLPYDPTTPLLGTCSR 


5 


13906 










TKTCT* IFTVALSVIARAWNQPGRPL.CS 
EWL\KYMVHTME *HSAIKRLNYRYKNNC 
VNLFLG ITLS EKSQTQNVI 


6 


13907 


A 


7 


587 


2 


FLTRETGDPTGRSSSHANTQSRFFPDDP 
PG\ PLNNLGNTHGCGRRAGRCPGTGPDG 
P \ AGCGGPRCWPSGHLAATGD * GPS CGR 
LGANRGEAGPAGFTACSPLSGCRTPYTH 
HFPASRMSCHIjNCASPRTYRSQGNRGCE 
RVAQGSQGAGGERGAKSQVPVPAPARNK 
DPAKCRKPRNRRPGNSGPWRAYRRQR 


7 


13908 


A 


8 


1 


474 


RI LNEEHGKYEGLHE * EVKWHLY I KS PA 
FTDLHLCYQKDMNG I STSASS P AVGTVG 
MDMDEDDDFSKWNFYYSPHSYPDK*LTI 
FKTESRVRESDEVTQIKVNWDEEVISGL 
LTS LKDNVLKATG VLYDVAYKX YLCEHT 
RSTLKEESLKLERNLQNH 


8 


13909 


A 


9 


3 


539 


SQ CS PF I S PACS LTALEE ETEALR VH PR 
LCLSPNIAPSSGPPRPPELAPCPPSSQA 
GLRTCH S WVKGLiHQ P L P VASGM KS T F CN 
KTYTCPYPPP/PPLCPNHSPNALTL.PDS 
VTHAVP FE / L * S PSAPPSS TA* ILGSPS 
\CGASPCNHPHSHPGICPTPPGLWPVCP 
CAPRAWQRDGTRQT 


g 






10 " " 






RL*LGLiEYALLVWGTPKV*H*GGFPIYY 
YIVLLLSYALHQVTEYSMYVS IMAFNAK 
VSDPLIVGTYMTLLNTVSNLIGNWPSTV 
SL.WLVNPLTVKECV* TSYQNCCTPDAAE 
LCKKLGGSCVTALDGYYVESI I CVSIAF 
V/W*VFLVHKFK 


10 


13911 


A 


11 


20 


475 


KMGVPPLLMSDPNRFLFPKNFLREKTIS 
PPKTF* PLKI WVKGQ WVLNFLGFPGFKI 
FFPVFKFFFFFFFF/RDRVSLYHPGWSA 
VSQSELTAALTSPGSGDQVILPSQPPK* 
ENHLNLGGRGCSEPRLPRAEFLDLRS FS 



509 



01B4835A2 I > 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
locution 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspnrtic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, HNHistidine, I-Isoleucine, 
K=Eysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryp to ph an, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














IIVIFP 


]] 


13912 


A 


12 


392 


63 


HIRADPGLEPRPSARTGLGPELGCCTMN 
KLGDAGSATQSLGGSQLSWQLSRREQEL 
EQARWEAQ*QVETLGRVAREKEALAKEH 
AGLAVQLAA\AEHEGRTVSEEATHLQ 


12 


13913 


A 


13 


18 


338 


APHPQYLQMPPMIiLPPRTGPGQFSLPSS 
RHGGHLE/GKEHETSVTLCGGEPPPQTP 
PRD PDPG P * ARRAP C PRRPT * AHPRALS 
RAAPQEPRALAGPRARHPICPGSL 


13 


13914 


A 


14 




371 


TTKQ* KDNPIQTGAKDLKRHFCKEDMQM 
ASNHT\KRSLTSLVSREMHIKTTRMAGI 
KKSDNNKHW*GRGEIRT/LLRCWWDCKV 
VQLLWKTVWQFL*GliYQYIPHDPVISRIj 
GISLILYIALRTF 


14 


13915 


A 


15 


443 


1 


SRTFLDTDMKSMRGFKAS*NRQTLLLGA 
NAAGD \ KLKAMLTWHSENRKILQNYVKC 
TLPMLYKWNNKAWMTAYLFTTQCTEYFK 
PTVETHCSEKKISF\KILLTIYNAPGHL 
RALMQ I YKEIHWF VPVNI PS ILQPI VQ 
NVISTFKS 


15 


13916 


A 


16 


375 


38 


HQQNGFLKKTDPTLLCUQETHFR/CKDT 
KRLKVRRY/ QPNSNQKRAR/VPI LIEDK 
IDFKTKKIFMMIKDLTIINI*ASNTRVP 
KSMKQKLAGLKEEMDNSVIMVGGFSYPV 
SIRK 


16 


13917 


A 


17 


448 


189 


NRDRVSIiCCPGWS * TPGLKRSFHLGLPK "" 
CWDYRRD I Y*LL \ FARHTQM ISTHS *QT 
TNTCS YPAF* KSRP * E PGYNTTHTPHS S 
DML 


17 


13918 


A 


18 


1 


426 


GMSKHARSLI INQL.FKKCSTLFVL/REM 
QIKSLLASSSSSSSRNSVSGQGFETVGT 
HCGQKCKL VQPF\ WKAVW * YLLKL.NVFI 
LFLIYNRNAHLEDTCENVDRALFVIVKN 
WKLS KYP PGVKWR I KL WYSHS VES STAV 
L.PKV 


18 


13919 


A . 


19 




423 


YFETFQPLLRRLGYQGTFFFP\KPWSSC 
LGIEYNSGPDSCA* FFLQNQ I R LVNS AN 
I RLMAMTLKTNQVAIAQFLECKESDQQF 
CIGVTHVKARTGWA*F*SAQGCDLLQNL 
QNVTQ/GAKI PLW* GDFNAEPTQEA* K 
HF 


19 


13920 


A 


20 


10 


443 


LKVDSGDSEVRYVFILQHITLLMCSAYM 
NQLLNIFVRPSLLAVALHMTPGFTKEDV 
Y S CFR F LRDVFADE F I FLPGNTL * D FEE 
SCYLLCKSEAIQVTTKDILFTEKGNTVL 
*FLVGLFKPF/VESYHIICKSLLDEK*A 
PFIEEP 


20 


13921 


A 


21 


11 


426 


VLVETNNLRMGQVTM * PELPDM / S PDA W 
TLSDSPSQKIGHAQQ\KYSI IKWKWYTE 
DWAQACLEDTSKYEQVTQI PMAPNDATL 
/PSS/AHLAFTAS *GAPSDS * LRNTLGL 
ICDGST\KPSSTN* KQTVIALQSHAGIjN 
MKEC 


21 


13922 


A 


22 


1147 


1768 


QLGMSHGYSSKSMPQKLMCFLFNHHLQK - 
GHECLP KVLNSNPPP 1 1 KYLALQDLMLL 
S QYS P S RRQE VFSLS Q PGGH P HNWTAI S 
RECLNLIiNGMTQKLILYQEAAATNGRVS 
SSYPVEPKKLNSPEETAFQTPKSSQMPR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
IJSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
£=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methioninc, 
N=Asparagine, P=Proiine, 
Q=Glutamine, R=Argin>ne, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PSVPPLVKTSLFSSKL.STPDWSPFGTP 
FGSSVMNRMAG I FDVNTCYGS * AT * HSE 
W\I>LNLSLLYHVWLCGVFLI/rTWYVSWI 
L.FKI YATKAHVF PVQ P PFAEGS * VPSKS 
VK* QS S P \ L I KYLALQDLMLLS QYS PS P. 
RQEVFSLSQPGGHPHNWTAISRECLNLL 
NGMTQKLILYQEAAATNGRVSSSYPVEP 
KKLiNSPEETAFQTPKSSQMPRPSVPPLV 
KTSLFSSKLSTPDWSPFGTPFGSSVKN 
RMAGI FDVNTCYGSPQSPQLIRRGPRLjW 
TSAS 


22 


13923 


A 


23 


421 


3 


KIIFRAFKG*KSPSMPGFKA*KDRLILL 
LGTNADNDFKRKPM/ LLYHS KNPRAL.KN 
YIKSIQPLYE*KKA*MTAHLjLKVWVTEY 
FKPIVGTYC*KI/SFKILLLNDDAPG\H 
PKALIEM YRE IN I FMS ANTLF I LQPMEK 
GVI 


23 


13924 


A 


24 


226 


2 


THERTHSKI IHVIIIKITHPLNPSILiRP 
QTTA* IKWRDLGSLQPLP/PG/LKRFSY 
LILPSSWDYRCPPLRPANFCIF 


24 


13925 


A 


25 


2 


611 


FFFFLLLGLLHQI PD VSPTGKYTTLLPL 
MIILMISGIKEVIY/DHK*HGRQNS/VR 
NTKbL*QDSWDTFKWKEVNVGDTVKASN 
GELLPADTVPMC\YIATSNPDRETN/VK 
TRQAIiPETASV 


25 


13926 


A 


26 


1 


443 


ATQWRPS LVPASAENVN KARS FAAG I HA 
LGGTN IND AM LMA VQLLDS I NQDEIiVT * 
GSVLLIIL.LTDGEPTAWETNLMSIQNNV 
REAVRGRYSIiFCLGLGFDVCYAFIiSKLiA 
LDNGGIARRIHEDSDSALHMQDFY\QEV 
AHPLVTAV 


26 


13927 


A 


27 


2 


359 


KAQSKQWLPRGGSGPVK/ARVDEESAKV 
I VRV YWEAQG I LL I DFLKGQRRT I YAYS 
ENILRKPEL* *RNAWGSFSRVLHHDRAP 
AHFSHQ/TRAIG/REF* *KTIRHPHY/S 
PNLNPLDAFCF 


27 


13928 


A 


28 


2 


427 


WRKTVKGHFTDQCRKHKAM/LGN* IHFL 
EYHIHKHS IQYSG I QATQEKFGGLKTSY 
CSKD*YITVLDLTLKISNKDVSN*HENP 
ISMYKN/ I PTVIYGSKVI KDRKLKA* I * 
CP/HRNKKF*YFDYFYVLIjNTMTFIiKIjW 
RIITEI 


28 


13929 


A 


29 


47 


412 


HCDVLLASSRYTCILPYSDRDDGPQDQL 
KMSVDFRSRRTGS/WFPQNSWGHMGVCG 
WGGAGRTLDL.IHLRIPMRGLRSGGFLCR 
RKLVSESYG*EPSPL*K*KGGWGSEPS/ 
LTTVPSQL 


29 


13930 


A 


30 


336 


419 


RLECSGMISAYCNLSLPGLSDPSNSASR 


30 


13931 


A 


32 


1 


408 


KIALKLRSNYSKISGY/MANIQKLI I FL 
YTSNEQIEFEIKNTIPFTLAPPKIK*LG 
TNLiTKC \ VQEL YE K I YKSVMKN I KEIjNK 
WRSSYGKGKSKIKIIKINK*MDIACSWL 
RRLiKIVKISVXjHNIiIYRFNIVPIEIP 


31 


13932 


A 


33 


412 


2 


QGHL.SLQKFIALPFVQL.CPAPRGGVYRG 
RQDS LiS CGGLHP F * AS GLL CL P TQAS AM 
VGTPPPASLL.I.CSSILDCCASNERGSVG 
VE P PEPGTGHNLLVCRLLRPLE KRSIRW 
KSAVGVARFSRCRPSWLPLARKKNSP | 



511 



NSDOCID: <WO 0164635A2 I 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN' 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
C=Cysteine, D=Aspartic Acid. 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine 
K=Lysine, L=Lcucine, M=Mochionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


32 


13933 


A 


35 


304 


33 


KWGEKVW YWQKMTQI VQWDRTES PQ I D 
N*SLTKEIQWRKDSIiFNKW*GNNWTAPF 
SS/RSLNLNKDLTAVTKIKSKWVTDLNV 
KHKTIKLL 


33 


13934 


A 


36 


2 


424 


SKTYS IGHFTYEGKGTTTSLWGKVNVED 
AVGETLGRLLVEYPWTQRFFDSFGKLSS 
ASA I MGN P FVKAHGL KGL TS LGDALQHL 
DDLKG\TFAQLTELHCDKLHVDPENFFL 
LiG*VIiVTVIjAIiHF/ SRRFTP/ELQAS /W 
PKM 


34 


13935 


A 


37 


2 


433 


NKRLPGPGF*KRPNPPGEGGPARYFNPL 
GGPNREIPLGPEV*THFGPQMRPPVFLK 
ILKISGAFWGPPVGPPTWGG *EGLNPRG 
PGYN*P/R/PPAPPPWG*SQAPFPKPAP 
PP/TGIKPNP 


35 


13936 


A 


38 


3 


425 


GAAQLLPVSLPSAQRAIDLQILTTPWEN 
WTSIASLQHKTTIGRSWLITGPISATSS 
TPTSCTVASP\TRGHV*RSASWKYDVLQ 
HSSHMWWHTAVYLGEAFHQVHATGSCH 
HRVLSCCPITGRSGRDAL*QLLPLi/HAL 
TEA 


36 


13937 


A 


39 




463 


LNEVRD I KL.SSDHWPS KTNS FLHS PG FL 
SRFEPQPASVA/PRP*SQQQSI,PGKAST 
SLWPPNPVPFVTSSLSLSALPGLFLWLP 
SIPPLPGSPFFSPE*PEVPLFPGPTYHN 
PSPPD/PTVLEAHPDQAPLP\PGVPTAE 
QRPTPAP*AHRPS/LPLPP 


37 


13938 


A 


40 






K WNLI EPMS FC/TYRETI IGVTDR YTRD 
KIFESGR 


38 


13939 


A 


41 




427 


NLKMKSRTSKENIGEHI YDFDIGRFVNT " 
* SKPERKRLSFKTSVQEKTP * L*GKHK* 
MKLEP\XiPYPKIKSKWIKDNVRSKAIKLi 
LEENRSESSIWQRML/SIMPKAQAIKEK 
VDNLDFFKIKNCVSKDTIKKALDRPSER 
EKI ... 


39 


13940 


A 


42 


30 


448 


FEMRE I I VAFHS I L I R YHGL INLRKFQ * 
MDRRYSKEVQDLLETMKLFARIVGPL+H 
DKFIESHAIjEFELLREIKRLQEYRTAGI 
TNFCSARTYDHLKKTR * EERLKRTMLSE 
VLQYIQDSNACQQWLRRETY\ IESGPNP 


40 


13941 


A 


43 


1 


928 


LiRQRMPPQGLQPCPWPPRYP/PDSPLGC 
PGGLA\ PPNSPSWPKLASLAGSRGSSAP 
PGVWGLVATGA* PLQRP S AP P P CPR / LC 
PGREPCMCP 


41 


13942 


A 


44 


2 


409 


SKSPDPTQQPLRGGSIjTHSAPGPSLiSQP 
LAQLT P P AFGPS TS S L FNLQKS S LS ARH 
PQRKRRGGPSEPTPGSRPQDATVHPACQ 
IFPHYTP\VWHILGPQRHTP*SVD/HPG 
LDKRLLPETPGPCYSNSQPVWLCLTP 


42 


13943 


A 


45 


3 


136 


I,PMTLFTELE*ATLRFIWNHITIQIAKA 
ILTNQK\NKARGITNIC 


43 


13944 


A 


46 


365 


2 


AWGSLRAPRSAVPEPGTPPVWGNALAHR 
QFSPCLDGLSCPSFLRG*NSPPHPGSPG 
LPKHA*APAAVSPWVDPDRTVQPPPPAP 
PST\D*PHCSPPCTPWCPRRLGSAPVMP 
GCPTASYPRR ' 


44 


13945 


A 


47 1 1 


456 • | AELSELYEESSDLQMDVMPGEGDLPKWX 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














EEATGTRAAI PVfVPPATGAQQLEEEGPM 
EE\EEAQPMAAPEGKRSLANGPNAGEQP 
GQPSPGRRTSRAEDEA\EEFDDWEDDYD 
F\PREEPVKGARLRFLPPS*KTPPSFWE 
NRNTPLWGGLKIFY 


45 


13946 


A 


48 


116 


442 


PQNMGMERKPGP PKGL9WASGDLEDNLG 
VMSVGFLLSSPDDAVIWRGPRKTGLIKQ 
FIjRNGTGEKVDYFMGDTQLGTSDEHLSS 
VATWAKAHLGEPGAFTFPQRVSLRE 


46 


13947 


A 


49 


434 


3 


FTVPGPLLTVIPIFISKVPFKFSLPQIF 
NFTPFGFPFQIRV*TIIPISLRFFRKKP 
LFFFF+NKVPLCSPYWNSIARSWVPAAS 
VFQV\KESFYLN1iLSVPPQVPLNVFTjTF 
FFFFFLGRDR/SLPLLPRMVWNSWAOAI 
LPLWL 


47 


13948 


A 


50 


1 


482 


EKPYQCSDFGKAFNMKTQ\LVVHQGVHT 
GNNPYQCRECGKAFGRKEQLTAHLIAHA 
R*KPYGCSECGKAFSSKSYLVIHRRTHT 
GERPYECSSCDRAFCGKSQLI IHQITHS 
TENHYECNECENTYPRKASLKI HQKIDL 
GKKPF/ECNEWGKALIjK 


48 


13949 


A 


51 


> 


470 


REFL.WQEGHRAFATVDEAGEEVTjQILDL 

yadvyeellai pvangtktdkdklaggd 
ytatieaficasgraiqggtshllgqnf 

SAMSEIVLEDPEIPGENQFAYANSWGI/T 
TRTTWVMTIGHGDNMGL/I.LTPRVA*VQ 
/MVI IPGGXPKAFLKRQKT 


49 


13950 


A 


52 


26 


448 


S PGTEREYRIGQQS VTGVTSVDDSNSYW 
RIRGKSATVCERGTPIK*GQPIRLTHVN 
TGRNLHSHHFTS PLYGN / QVAL * GD IVI 
IL*RRKQRLKGFTEEGIKLRFKEVSAFG 
DEGEGDYLDDWTVLCNGPYWVRDGEVRF 
NT 


50 


13951 


A 


53 


3 


495 


AMEVKAFAETHIRGFTLNDAANSRLIIT 
QVRRHYLKEAATTLKTVLDHQHTPSRLA 
VTRVI QALAMKGDV\ DNI * VFHKMLNGL 
EDSIGLSKMDFINNIALAQI *NSNLDAS 
V*HIENMLTS *NNVI EPQYFGSA YLFRQ 
VLEEQLEPTVEKIS IMAERLAKPLQ 


51 


13952 


A 


55 


1 


428 


QERGTKKEMEDRMTLEETK\EQIL,KX.EE 
KL*ALQEEKHQLFLQIiKKVljHEEERRRR 
KEQSDLTTLTSAAYQQSLTVHTGTHLX.S 
VQGS PGGHNRPGTLMAADRAKQMYG PQV 
LTTRRYVG I AAAFAGTPEHGQFQGRPGG 
VYG 


52 


13953 


A 


56 


2 


453 


EDGDLDAFSSDEGLTMAMSYLKDDI FRI 
YITETQECRRYHRPPCAQEAPCNMVHPI 
VICYGCYGPEVGTRYTCSVCPDYDLSSS 
CKGKGLHRGHTKIiAFPSPFGHLSEGFSH 
SRWLRNVKHGLLRWS*WEMGPPGNWSPR 
TSCA/GEARLGP 


53 


13954 


A 


57 


3 


435 


ELNTS I FRSRPI EGLGLNTVLTTDNSNL 
VI\NRIGIV\PSVTEKEYTDPSSDGTYA 
WKI FSHETITKAQI LKLFLS YDYAVNNP 
WUAYPHYKSPEKCPSI ILHDRLYYLNGL 
* CAASAMVMIAI VTYNVALLAYHRWNGL 
TYMID 


54 


13955 


A 


59 


409 


1 


LCCEHRGKTVCVPRGKFTTLNAS INKER 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
• peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
' location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
. nucleotide insertion 














SKISVLSFRLMKLKKEYVK*K* \EKEKI 
NNRIRIEKINKAKS/WFL*HNKIE*TIR 
MINHQKRENIQITDFRNERTELPPDSTD 
I KKA I R K Y * K * F YPNI F YN P EEMNKF 


55 


13956 


A 


60 


1 


393 


GNVSSCGDHPCEG*L,CP\PDKVMS*GI* 
VPEEACTQCIGEDGVHHQFLESWVPDHQ 
PCHICTSLNGRKDNCTTQTCPTGKASTC 
GLCEIiARLRQNADQCCPENECVCDPES C 
DLPPVPN*ERGLQSTLTNP 


56 


13957 


A 


61 


250 


1 


REDCKRVLYKDKVSLCCAGPCSVA*SCL 
KAAS TFQ \ VKHF S LQ S S WVYRHAP PGLA 
NFFHFLQRQVLTTPPGLVLNSWVHAI 


57 


13958 


A 


.62 


365 


3 


IKKKLIWRLFTFPKPSWGKKTKKGIPFP 
*GPKKIPGIKFPQGIKKAFKGNR*TLGK 
GN*KKKKKKKECGPGGDIPCS* IG/RNI 
VKILIIPKAI YSFNT r H I KI PKTFFTE I 
ETTITjKFIj*NH . 


58 


13959 


A 


63 


245 


558 


FLPTQVISYVKRALAEGAQI*CGEGVDK 
LSLPARNQAGYFMLPTEITDIKDESCCM 
TEEIFGPMTCDVPFDSEEDVIERANNVK 
YGLAATVW/SSNVARV 


59 


13960 


A 


64 


106 


532 


ERACQSGTSGQGGVPRRAIGAPR/E/DA 
FTGAVYI YHGDAGGIVPQYSMHPFA*SL 
YPSGQ/SVAARGNLSGDSFNCHN*GLQG 
LLliASSV * R P VMLLNDP 1 1 Y TTKN/ 1 S G 
PKCHE 


60 


13961 


A 


65 




548 


VQKYPQQNRSCVPPVAEWAVPQSSRIiKY 
RQLFNSHDKTMSGHLTGPQARTILMQSS 
1/P QAQLAS I WNLS D I DQDGKIjTAE EF I L 
AMHLIDVAMSGQPLPPVLPPEYIPPSF/ 
RRVRSCSGISVISSTSVDQRLPEEPVFR 
DEQQQLEKKLPGTFEDKKRENS * RGNLE 
LEKRRQALLEQQRQG 


61 


13962 


A 


66 


72 


952 


S S RTYTTSIjPLLKKE VTRKIRKY F I LNN 
N*NTTYHNLWDLTKWFSGKLLF*RSR* 
EYRKDKISDLCFYIQMLE/QRQVIKPKV 
STRKEINKSRTQ*SGKETMERPKLEDKF 
LVNINNINNP**LNL*RKKEGHHCIFYR 
H * KDNKG Y I YANNFN I LDQMDKFFER YV 
TKMD\QEQIVSLNIPIADKSNA*LNNFP 
SSSSSSSSSSSS 


62 


13963 


A 


67 


139 


422 


VNGNEPEPL*KGISRHEHRRQPHNGFRP 
KNKGGAKNQ*ASLGMKTPEAPAHSDKPR 
RRQHAAYSS+AMPFLGICISFSQCNL/C 
PPKLNV 


63 


13964 


A 


68 


274 


1 


NLKNKAVI TKTA*H* QKNTDQWNR I KNP 
EIN\HPFYGQLIFDKDAKNTM/WKR/DS 
NFNK*CWEN* ILTCKKMQLHP/ SLTTNT 
KLNSQWIKDL 


64 


13965 


A 


69 


22 


419 


KIIGEDGDT/PLSEMAKSQRQIFSKDLV 
KLDSTINQVCI IDI YKLHHP\KWTVYTF 
FSSH*GTFSKIDHILLDKTYPNNL*RTG 
IMQHMLSDIKVFKLEISTRKVTEKS*HT 
MSLCNKTLEHPSGNEDASAYLK 


65 


13966 


A 


70 


57 


423 


KDKNSQVTSEEEEQEGKIKENLNSWRDT 
R/CH/WIGR\WIVRLSFLPKLIFIFNTV 
PIKIRTQFFMKLNKPVLKFIY*TKRTRI 
AKSLLKKK/RQEGKISPTNRRTYYEAYS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ n> 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


I Amino acid sequence (A=AI;inine 
| C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidinc, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutatnine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Impossible 
nucleotide insertion 














TLES I GASMPS LDLG 


66 


13967 


A 


71 


394 


1 


KHFRCMLQEDCRILLLWFFSETGSPCAV 
SPRIECSSKIIVHSLKLLGSSDPRASLS 
* VAGTI GVHHHTQP * GI / CN I * GSGRRR 
SEKQWIKEKNVSSEAKRRRTF*NE*SV* 
KGDYNKDQGLRLGTVAHAYN 


67 


13968 


A 


72 


485 


1707 


SALGLSQPRPPGASDAAPDSTPAGV*TA 
SSSAPAGEKDANQMEWARRDPGPLHQAA 
PAPH* SHLVRDKRAQERLRKTSRGPPHS 
HRSGPVDLSPRSSHSRDSTPVHGPYFRS 
APDPRG S APAFRGP I ATRGR / RPRAPRA 
TASHVPLS SSSWKEP EEATAPRF * AS AC 
MKWRLE AGCPGHRGVWPG P PS RRLFMWA 
TATGE KS TP S PFTG VWRFG VE ACT* AGS 
PTAHIQHSQRTWQPVPPCTNGSRSPWTS 
S Y P KNQ S KVGT V P KRG WACQ PIP ET AVR 
QATI SQ* ARKGAARPHHQARQVEHSTQ/ 
QRPSGDSGKFDQRSLCEASTLLQPERPE 
VQGKI CKFRV/GERRRTAS PNS AVPE KR 
TRTQTMVGTAFIGCSGKWRRVYSVRR/R 
SSPRIPPPGVGSS\PLKTPPPP 


68 


13969 


A 


73 


1 


462 


QDHRSKSSHSNKRPSLASSLSENFFQRA 
AKPLQVNKWKKLYSTPLIAIPTCMGFGV 
HQDKYR FL.VX.PS LGRS LQS ALD VS PKHV 
LTKRS VLKVACRLLNALKS S LKMS WVYG 
RVT*KYLL*F/YRTTNRGLL+SMGFAFR 
YLPHGDNPAHLERGN 


69 


13970 


A 


74 


307 


436 


LP * VGC I TEG I PEQDMVRGKHKLLRQE K 
TRLIGPNCPGVINPGECKIGIMPGHIHK 
KGRIG IASRSGTLTYEAVHQTTQVGLGQ 
SLCVGIGGDPFNGTEFIDCLEIFVNDSA 
TEGI ILIGENGA/NAEDNAAECL, 


70 


13971 


A 


75 


I 


435 


EISDSKAQLAAMAL I IDTWERMNCFSE * 
NHEPLRTHCALAAS KLLKKPD \ QAE \RE 
HIiCTS L \ WSGTNTDKNGEELHGGKRVME 
RLKKALI IAHQGMDPSLQVRVFIEILNR 
YI YFYEKENDAVTI QVLNQL I QKI LEDL 
PNLESWK 


71 


13972 


A 


76 


58 


366 


E FPDLVKDMNLHI QEAQCI PNKINLNKL 
MNRHRI I /RLLNTTTKKRILKAAKQK*N 
I/T/IRGSSICMMMDFSSEITEDRRKWH 
S I * KI LK / EKTQNCLPRVFCP VKI 


72 


13973 


A 


77 


1 


445 


YHETG C FLiMG A I VD I TLTFNT YVHF QGK 

MKGFSLLiADPQEFWVDNSTSMSAPMLSG ' 

MGTFQHWSDIH\DNLSVTHVPFTDSACL 

LLIQPHYAFDLDKVEGLTFHQNSLNWMK 

KLSSRTIHLTMPQIiALQGSYDL*DLLDQ 

A*LSDILTP 


73 


13974 


A 


78 


53 


444 


ERGGYGAGPVAWQFLVPSTAPMliQSPPIi 
GFAIDHTPPVPAPAN*APCPLPYAA*RT 
TGPHHIAHIjDTTFGTGDTPHPSSPASPP 
STPWLPDAFPCPLPTWD/RPPCLQPLLS 
SLPTPRLPFLCFLLIAPYAP 


74 


13975 


A 


79 


1 


353 


HI RTEIiDYYGIiTWI IYSDT* EAYNYIY 
IMVT*NVYKPQLVnsTI FDRATMHSQDVRH 
HLLCIRIiMLKNPKHHAV* /VLNGHYAFV 
SR/ S FKHALVQYVQAFRTHPDEPL YS FC 
XGLTFIHM 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


1 Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalnnine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L-Leucinc, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


75 


13976 


A 


80 


1 


350 


LYFYALL.FLSSG/ CVAYVATRDNCCI LD 
ERFGSYCPTTCGIAYFLSTYHIKVDKDL 
QTLEDILHQVEDKTS * FKQLIKAIQLTY 
NPDDSLKPNMIDAATLKSMKML* * IMTY 
EASILT 


76 


13977 


A 


81 


76 


386 


PAYPQVRGPASTPASCIRPTNARVLSTT 
PRGKSVAEAHSVSPSAHRGVTSVIKIiWS 
AKRLH* YGA\KVRPNS\GCTP 


77 


13978 


A 


82 


1 


360 


ESI LQED ITVLNVGAPNNRAS KYMRQ / M 
LVELQRQIDESII ICEDFWTLISEMDST 
RPKISKDIVGLNGTIN*LDIIDIYRLIj* 
PTTAEKTFFSSSRGAFT*MDHIL>CYETH 
IYKFKLHE 


78 


13979 


A 


83 


39 


344 


WGIL.EGGEYIWHVGGGKSLRASGGLWSQ 
PPRQPDDSRPLVL.VPVCHLLPHWIGPTD 
LGHKRQW/MGPGAVAHVCNPSTLGG*GE 
WIT*GREFETSLANMVKPC 


79 


13980 


A 


84 


1 


427 


QQAHIAGHHRIHTGEKP*KCEACDKVYR 
GKSSL/MK/HRRIKTGEKAYKCEECHKV 
YSR/N/ SQTVKDRRIHTGEKP*KCKS\C 
DKAFGHDSHLAQHARIHTGEKPYQCNGC 
GKAFSRQSTLVYRQAIHGVGKLY * GNDC 
HKVFSNAT 


80 


13981 


A 


85 


88 


307 


TWTQRRAKLVRRIGWALLPFPRPSGSKP 
PP*TPPALPPCVP/PQSSPWTPPQPPAP 
DSRE VSKDWTQMRS FKEN 


81 


139S2 


A 


S6 


5 


375 


NSLIiNTHSRGPGASHCTFWLHERASSRD 
LTGAES YG ICRLRRLI SLS I VFSKS IHG 
LACIS ILFLFLFLFLFFVKTGSWLCCPG 
WPQAPELRQSSLLSLLSSWNYRR* PPHQ 
LIF\DFFCRN 


82 


13983 


A 


87 


2 


342 


VIKNEDHYIMIKCLI*QEDQTILNLHSF' 
NKTNINI YKPHMTNLQKAVDKI TITV * T 
CGTSLS I I AVCRLKLVKL/ VEDLSNI IN 
KLDLM/DNT YKTIiHLNI RJDYT FLKHTWN 
IYKN 


83 


13984 


A 


88 


1 


338 


CNEPRSHHCTPAWRQSKTPSKKKKQKKK 
LOTS /CI KASMKS RVQG * LR/ CWAQVMG 
* *GGVLWI>FVK/REFFTLSINLSKEGHS 
RRVPCliGCLKKKKKKKKKKKKKKKKKKK 
KKFF 


84 


13985 


A 


89 


931 


2 


VASPNALPE*LLPVSGHTHDLERVGVAR 
LGQPHHTELTACRARMESRDTCPGVTLH 
P*PPSFPSSSSPSPGGPRTRLTHQGAGL 
EGSQGPLQSQNPSAA/PLGACRGGWEWP 
QGPGS GS *GG* LMLRELRE / WQEAAVQL 
PTPG*PS*ESGRPLPSSASGPTPPGSP/ 
SP*PGTQGIiCGCHPSG1jCATAAGPQDGS 
GPTPTPH\PVQPSCRDSGPGQRLSPTPS 
LTS WPNTRPS PPTG PG PGPGGRATW/ PG 
S/PSPQRESPSPQLPCCTPGSFAGHPCP 
AP\AAPSSVACPLP/PDGFPRAPASSGI 
TTAPSPGPDP 


85 


13986 


A 


90 


66 


464 


LWVYSSFPRLPLHSGAESPEGDILWPGL 
TLFLSPS PS PLS \ CHSHLSPTCRRPRES 
PRFSLCWR PHPPS KPPPG * VKRDCCPS P 
DPVRLSVSENPSAGPR/VFLRPP 


86 


13987 


A 


91 


197 


2 . 


NAEIIPAPQKVGPFIKIWVINFFFPFFF 
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SEQ U> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=A'aline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknowi, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFF * DRVLLCHPGWSAWQS * LTVASNF 
*F\K*SSCLT 


87 


13988 


A 


92 


317 


2 


SRTPDFR*STRLGLPKCWDYRCEPPCPA 
TSSF**HLSLC*QGNL/WGPPDSIRMG 
AG\ TRKTKCWEGCHFLFFLRRSL/NS VA 
QAGVQWRDLCPLQSLTPRFKRLS 


88 


13989 


A 


93 


1163 


0 


FAFGFEM * YCS VAQAS \ VQWYDL.TLLQP 
PLPGVQSDSPASSLPNKLGLQGTSHHAW 
LNFIi 


89 


13990 


A 


94 


3 


363 


VLWPCRPAGPSLGLNFPLYSWLRLQTFA 
AIRPGSTGRRLCIiPNWVFTRNS U?FH*L 
VC/CSSRHNTYLQECTGHREPTYQLNIH 
DIKLIiFLiRFAMEQSFSADTGGGGRESNI 
HLIPYIIHT 


90 


13991 


A 


95 


17 


353 


PEPRYVGFISGGKVDIKKTWKTQGRLER 
TVYYTGMYFINCH\VXiKNTDSSWGTSP* 
* IQQHAGKRDNNYQLL \ KLQG \QFSE AY 
TKCYSNPDSTHDVRKVYQDCPLLAPLND 
TS 


91 


13992 


A 


96 


3 


355 


ANYPRDL.CKVTDEGGYTKQQI FNADQTA 
LYWKKMP \ SRTLMEREE KS VSGVQAS KD 
RLTLLLGGS VAGDVK/LKPML I YHSENP 
RIC*SYSLPVLCKl«I/NKTWMTAHLFTT 
WFTECLNPL 


92 


13993 


A 


97 


367 


2 


SLHRARRGKGVGVRMGERLPS FQSWRLR 
L /RRRfCLCRR WGQGPRGN* Q * QKPDGGA 
RASQTDGGGERRRGVQTGSQAERDTS RE 
T YSPRG I QRE I VRVS KGGKTGRQ WQS QR 
QIQREKRVGR 


93 


13994 


A 


98 


115 


359 


LNFFFYLLNFMLLFHRYYALKVSYFKSS 
LDRKLLELLWNKYWVNTLS S SSLLTVS / 
DYTCKECI * ILSKLHSRLISETLFHRK 


94 


13995 


A 


99 


- 1 


490 


CVDPRVRT I FKKDKVGGLPLLNLCIYYK 
ATVT* TVWYWHKD / RTDQWDRI KSLE IN 
PCIYGQLIFFNYYFFSQ*CQDSSMEGNS 
LFNKWC/CRPLTLYTRIGSKWIKYLM*E 
LKLYRLSLHDPEFGNGFLEITSEVEITK 
EEIGKLDXIAKLKTFC 


95 


13996 


A 


100 


195 


3 


SRCCKQRHIjVQWNRTEKPE/ YGRLIFNK 
GVKTN*WGKNNLFNKWYWDN* ISTCKRM 
WLVPYLSPYT 


96 


13997 


A 


101 


1 


371 


FSLIKISMMLLMKMEK*NLQFIWN/HRR 
LQIAKARLNKKNKTEGITLPDFKIYYKA 
WWYWHKKRHIDHWNRLENSN INRHI CS 
QLILTKVPGANTKDHPFNEWSWENWCT 
KMKPDP/YLSSYTK 


97 


13998 


A 


102 


6 


370 


KKGTIPNYF*RE*TDRSKPN*NYATKEN 
Y I / P I S * NKKIL * KLANKI QQH I KKKPD 
NSLFYKIQFGSILETFITINQISKPKEK 
NHI I I STDAENA \ FDK I QH 


98 


13999 


A 


103 


298 


24 


KCWTS I YLLLFFRDEVSLCCPGCAQTPR 
LK * SFCLS FPKCWPYRREPL VPGLDQ/L 
FLESNRS VSVLKKNVP * WFSNSSGS KSG 
NHDAFCGIS 


99 


14000 


A 


104 


2 


352 


GT I ADFTQCCQAADKAACLLPKLDELP \ 
QNGRLRSAKHRLKCASLQKSGKRTFRAW 
T*AGLIHRFPIAEFAEVSNIjPTEIjTTVH 
TECCHGDIiLECADHRADLAKSICENHDS 
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1 SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

Iv— l^ysine, L^Leucine, IV1— Methionine 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ISSKL 


100 


14001 


A 


105 


139 


361 


KHTEAF*MPCAKYY\LFGGIiNQICALPE 
KTPVSDRKTKCCTESLVNRRPCFSALEV 
DETYVPKELNAETFTFHAK 


101 


14002 


A 


106 


3 


350 


ELEMIMLSEEVMLKAKIRQKLGLL/QPV 
S*WNAKEKFLKEIKSATPVWTQMI \KN 
S L I AD / ME WW I DD QTSHNVPLS QS L I Q 
S / K/ ALTLFS SMKAERGEEAREEKLEGS 
RGWFTRFKER 


102 


14003 


A 


107 


225 


1 


QG I MMDTVE YVGKGEPVRCW WKCKLVAL 
L*KTTWKLLRKLKI * \YDPAVPLLDIHP 
KEIKSIYQRDSCISMFTA 


103 


14004 


A 


108 






LGYSGVRAPLEEAVCRFS DLKLiRAGRTT 
TLFKADRQGHLS LQRFLLTFDSLCP / AP 
RGGVYRGRQASLSCSGLHPVGASWPRCIi 
PTQASAMAGAPAAASLPPCSLI SDCCAS 
NE*GSVSVGPSEPGTGHN 


104 


14005 


A 


109 






KVWGEKVWYWQKMTQIVQWDRTESPQID 
N*SLTKEIQWRKDSLFNKW*GNN#TAPF 
S S /RS LNLNKDLTAVTKI KS KWVTDLNV 
KHKTIKLL 


105 


14006 


A 


110 


135 


_ 


KQAILWPGAVAHA/WSCNPSTLGGPGGQ 
IT*GQEFEKSLTNMVKP 


106 


14007 


A 


111 


140 


338 


IMSIILYYGSICYCYYWLID*LILRWTL 
ALVAQAGVQWRELGSLQPLP/ PCLSLSS 
SWDYRPP 


107 


14008 


A 


1 12 


26 


361 


RSFFWVFFFEKKSLLGIPGGKKGPPLN* 
LEPLAPGPKGFSGLNPLENGNWGPGPPC 
RGDFGFF*KKTPFPF\GPGGVKTPNPGE 
TF/EPKPPKGVGFPGETPGPGPGENLYQ 
WK 


108 


14009 


A 


113 


1 ] 


330 


ASIWLHIFLFLSYFLEKGPRYVAQAGLN 
L.LGSTGPLSASRAAGTTGALHCTQL*TY 
LLLPKHGKAVFF\QETLLLRSTHPFPLK 
HVYTPTYELLLVWDRDSLSRPR 


109 


14010 


A 


114 


350 


2 


GCCPCLL.FPGS PTVFGSL.YS I RLVAFVR 
AVP P VWKAVPG * P P VKM * FFHL F I YLFL 
KQ/VSLWHPGWS*LAQSQVSRFFPRSLP 
SNWDYRGAPPRLTLFFFFFKQSQGIEAM 
VIAL 


110 


14011 


A 


115 


3 


341 


RMVS IS * PRDPPAFASQRAGI TGVSHRA 
RP VYS F LLLSNV P VMD VPQD I Y P FLVDG 
YLSLP LVCCV* V* VLLC YPTWP PGLKRS 
SCLSLPSSWDYRHEPLKPA/SCCVLLSS 
G 


111 


14012 


A 


116 


1 


384 


PSYPGDL.TEIMDEGGYPNQQI FSIDDTT 
FYWKKMPSRTFLTTEKSIS/GF*ASKNK 
LTFLLEANVAGDFKLKPVFTYHALRNPR 
TIjKKYATSTIjPVLCKWNNKA* VT /AHVF 
TTWFTEYFKLSVETCCSE 


112 


14013 


A 


117 


43 


347 


CAAG FG LL * T P PRT * TRKPRRNGT Y KP I 
SLMNINAKILKK/ ILANRIQQHI KTK\I. 
LQSDQVGFI 


113 


14014 


A 


118 


351 


23 


FNK*CWHNWIFTCKGMNLEPYFTPNTKI 
NAIKAIQLLEENKEVNLC*LGLGNGF/S 
RIiPKAQMAKETIDKLDF I KLKALCSK\N 
TIKKVKKSEELEKIFANHIHGTSIQNI 


114 


14015 | A . 


119 


123 


340 


AADSSTHSLTKPAYIiEKQNFFFFQMeYC ) 
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SEQ ID 
NO: of 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Asparttc Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Clutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LIPLAEGPWPNLA*MQPPPTGFKQSSCL 
SLPSTWD*R/HWSPRLAN 


115 


14016 


A 


120 


1 


371 


PSYP*DLTEIMDEGGYPNQQIFSIHDTT 
FYWKKMPSTTFLiTTEKSIS/GF*ASKNK 
LTFLL*ANVAGDFKLKPVFTYHALRNPR 
TLTKYATS TL P VLCKWNNKA* VT /AHVF 
TTWLTEYFKLSVET 


116 


14017 


A 


121 


82 


367 


YS KRSNT I VAGD / FTPLS ALDS SSRHR I 
NKKTSNCTIDPMDVIGIYTVFHTVSTEY 
TFFSSANGPFSKTDHLLGYKTSLKTF*T 
KLiK*HQ/CIFSDHN 


117 


14018 


A 


122 


27 


253 


MKTENI LGEN IGE VFG VGKDS LDMS PKA 
* TI KEKNDKLFF IKVKNFS YS KHTI * KI 
KNQATVWQ / KL LT VHKSDR 


118 


14019 


A 


123 


3 


305 


GTRQGCPLSPL* FNTVLEILVR* ISQNK 
*INKT/SLTADP/MVLHIENPKGSIK*V 
LELINEFSQVAGYKINM/QKTVAFLYTN 
N*LSKKEIKKTIQFIIASKRT 


1 19 


14020 


A 


124 


1 


327 


QVIFCLGLPKCWDYRHEP/RMPGLRGAD 
FFSLI * ILWEHMF* VMCILPHPCGDFWA 
MLNF*EREGMFF/ CLKRWKSHNVSQA/G 
PQTPRFKRFSRLTLLSSWDHRHAPPHLA 
K 


120 


14021 


A 


125 


74 


334 


IFEFGVLKVTTCLSVSSHEVGKLCPFFF 
FFKNGV* FCCPGLQG YNGS * L \ TLELLK 
QSSCLSL*CTQDHRCLPPCLANFNLINS 
SIH 


121 


14022 


A 


126 


381 


60 


TVSQVLAHTCSPSYLKAKAGGSLEPGSC 
YPGCSEL*SCHCTPAWIT\SQTPSL*KI 
H* KKI IVKRAIVE CVCVYVYTY IWLLIli 
KFSDS I P FAQI WG FHEEFP Y WNQ C 


122 


14023 


A 


127 


323 


2 


RGFLGLSDEVSGQQLFSILSEIENQFKN 
NLEGCGGSCL/SIPKCWDYRCQPSRLAY 
A* LI FVFLVE \TKNTG FRHVGQAGLELL 
TSSDLP\ALASQNAWLTGISHHAEPY 


123 


14024 


A 


128 


2 


354 


ENCQINTJLRFHLRKPKKEEYTKPTASRR 
K*IVNIRIERNGIEQK\TIEKINELKSW 
LFENINKIDSHSGKWMVGEE*CVFVCVC 
VRERQSDRERETRLI KI *NERGDITADL 
TEIK* LLILQKLNNY 


124 


14025 


A 


129 


74 


327 


GELAMLPRLILNS* \PCNPPA\SASLVT 
GTTDTPLYPAHYY Y\ I I I I I I I IEMEFH 
S \VAQAAMQWCDLGSLEPL\P PGFRRFS 
CLSL 


125 


14026 


A 


130 


83 


382 


YEFHASDGGSRLSSQHFGRPRWKDYLSL 
EVEGQ PGQHS ETPS LS LS LSF / CFLEME 
SCSIAQAGVVQWPYLGSLQPLPPRFRRF 
* CLSL PKCWNYGHE PQL 


126 


14027 


A 


131 


749 


2 


RQSLALSPRLECSSTISAHHNFCLQGSS 
DS PASASRVAGITGVHCHAKLI FVFLVE 
TRF\THVGQ\AGLELLTSSDLPALASQS 
SG I TGMSHHVQARCVI F/ CGPTFKNQDM 
KY*NKKEKNQSTFLEHLQNQQENRHAAH 
TLRHHTAS/LKSSEFLFAIRTSFLIS*K 
RS GTTGMF * YN / WT I GNG VEDRFVLGPP 
FGLGVQW YHHS LLQ PQS PGLKQS SL FSL 
PSS*DYRHVPTPSFLKFLRRWGLAILLR 
L 
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ing to first 

amino acid 

residue of 

peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


127 


14028 


A 


132 


371 


] 


KKNLNLNLTL*VKINTNWIMDLTIKYKI 
IT*KKQEKVGNLEQGKEFSDLTPKV*SI 
KGNTDK* DV* KPFSL* KPM* GITYRQATD 
* KKMSAKHVSNKYP \LSRIYKPNDKAKN 
KPIRKWAKQMN 


128 


14029 


A 


133 


2 


318 


NHKIILKDAKKAFDKM*HPFTMKILNK* 
GIEQYAT/DI I KAMCDRPTADI ILNGEK 
LKAFPL,RSGRRPLL.\FH* IW+ VLARTVR 
PRKRNKGSQEWWLMALI PSTLGG 


129 


14030 


A 


134 


3 


362 


WSELLGSSDPPALASQIAGI IDMKHYAQ 
PGTDF/STSVYFTLPTQIRL*ALKRKFL 
L/EFK*IFASI * Q F AKC S CMS RNFL PAQ 
PYHHWHFPNE ETE AQGNFSLLWAC / LRE 
DWIPTD 


130 


14031 


A 


135 


12 


398 


KCSTS I VI RE I QVKTRE I YHYTPTKMAK 
IKKPDNAKC/WSGCGATKTLIHCW*NN\ 
SLEIWLFLMKFNICLPCDPEIVPLDLYL 
REMKTYIHENTYTKVSIAPLFK\QPKSP 
STGES INCSI I I PRMECI I 


131 


14032 


A 


136 


309 


1 


QSEAPSQKNTKKQKTKVGGLTS * FQTYH 
NATVIKTVWH/W/YIGIDIDQWNSIQSP 
EINFHIHGK/LISNKAAKTNEWGKNSLF 
NK* KKDNR I FTCKRMKVDSQ PHI 


132 


14033 


A 


137 


2 


251 


QCGKAFRAASVLRMHGRTHPEDKPYECK 
Q*GKAFRSASHL*MHGRTHNQEK/P/HE 
C KE CG KP F RS AQNLR I Q * RTQ AH I RMHS 
G 


133 


14034 


A 


138 


396 


3 


LELLTS * SACLGL.PKCWDYRREPPHPAK 
MIL*SKSS\LPFLCSKCLNTIWALLLLL 
CSLVPLL.TLPLKEFYAAH*SPDSLKYPV 
LQKLSKLEMLVLHETVQRKEQLTYSLYK 
PIFGFIiVGYSFPLFCSYKTS 


134 


14035 


A 


139 


3 


384 


LDF I KMKNCCS S KE LCGKYE KP*T/MEK 
MFAKHISNK*FIFRI*K*LSKLTKKK*A 
KDLNRYS L.KED I QMANKCM / KKCS I * LV 
IREKQTKKTMRYY*NGLS*RS/GQTK* * 
QGWGGTGPL.IHCWWEYPMI 


135 


14036 


A 


140 


63 


388 


FMYLFIY*LER/CLTLAQTKVQWCDHSS 
LQPQTPGLKH\ PPASASGVATTTIMPG* 
FLFFF * KRGPCW VAHARTT /WLNLGSF 


136 


14037 


A 


141 


370 


3 


GPGHFPALFFKGPGLGPWVPPI IPALLE 
G*GGGFPGP/RGSGPPGQPCFFLKFQNF 
PGFV\GAPWPGFPGWI>GRGTALNPEAE 
GSINLKGPPGLQPGGKTKLFFQKKKKKY 
QHFQKTQLILCF 


137 


14038 


A 


142 


363 


2 


REKATEENFATS * DWLMMFKEGSHFHS I 
KVLGET P S AD VET WSC PEDLAKIMDED 
GYTKQIFNVGET\PSRTLITRKEKSVPG 
FKAS KLQR IG * LLLGVNAAGDFKLQS VL 
MDYSENHAE 


138 


14039 


A 


143 


99 


375 


KNKNQNISPPLVRYTLLSPLLLPFIINS 
FFFFFFSF/ CFF*KKSRFEPKVAFQGGN 
LS*LNPRPKGSKEFSCL.SLQSTWITGGP 
PPHWADFDF 


139 


14040 


A 


144 


39 


399 


LQI I YNY I WLYIWliY I FWLVS KI YKELL 
QINNKETNKNGQKT* IDVSSNNMYMNG* 
*LHEKIHDIVIR*MRIKTTVTYHVMSTR 
MVIIKETENKYW*GYGE\KILIYCWWEN 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FEK/SVWPFL 


140 


14041 


A 


145 


358 


3 


REHTAEDKYEASRSWFPRFKEK\SPLRN 
I I VQGEAATTNGEVAVS Y/PEDIAKNID 
EGGYTQQIF\NADETF*YWKMLFRTFIA 
REKS \RLPSRDRLTLDDGANAAGDFKIiK 
PMFIYHSDSC 


141 


14042 


A 


146 


3 


376 


AVMGMYKINWFVPTNTSILiQPID*GVI 
LTFRSY*LINT\FCNTIGAIGSDSSDGS 
GQSKLKTFWKGFT I PKS \ I KNTCDLWEE 
VKISTLMGVWKNVIPALIDLFERLKTLL 
EEVTAYMVNIGRRL 


142 


14043 


A 


147 


374 


161 


EFHRVSQDGLDLLTS * STCLSLPKCWDY 
SKWQRRI*TQ/DLCDSED*SSIKPIQLC 
QELMS * MAEMAHGQP 


143 


14044 


A 


148 


373 


2 


FFFFFFFCDLLLFPKQPRSHASFSLPSV 
VPFPEMPSSSLTLFYLFVYNIiL.R*YL/N 
SVTQAGVQWHDEGSLQPRP\SLZ>SSWDH 
WCTTPCPAN* IFFL*RVGLAMLPRLLLN 
SWAQAILPLWSPK 


144 


14045 


A 


149 


3 


282 


ILKRI IHPDQVFFIPAMQGWFNI * K* IN 
TV\NHIIRIKGK\NRMIISLDTVKAFGK 
I QQPFMI \ * ECLG I KGNFLNLI KGTYENP 
QLAGHSGLCL 


145 


14046 


A 


150 


3 


360 


DII*ESKFQGHHTTGVQKGLQYGIILFI 
I * EVFFFAGFFWAFYHSSLAPMPELGGH 
\YPPTGIFPLKRLEVP/LLNTSVLFASG 
VSVTSAHHSLIEGNRKQI ILALS ITITL 
GIYFTLLQI 


146 


14047 


A 


151 


349 


3 


KKSSGSHLETKSNRLPGPSQGPWCKEIR 
KGSLAWQEKSWPFYKPFLLEVGGGRVIR 
GS RMTRHGHLIR S S KTLI KHH VPK * TVQ 
FNFCLFI FLEMESRS \VAQAGVQWCNIG 
SLOP 


147 


14048 


A 


152 


500 


0 


MPSRLECSGA/ISDAQCNLQNSGAPSEP 
SSALSNPCG*DYRDTLPLPGYFLKFFVK 
TRSRYVTPAGL*LLASS\IPPTSA/FSK 
HWDYRHE 


148 


14049 


A 


153 


373 




QAGFTLL>TSSDPPASAS\QSAGITGMNH 
RTWPGNF * PQKS CDS FVTKLMCTCHKNH 
I YAQS LVT * LGHS Y I FF * DSLQPS PPRF 
KQFSCHSLPSSWDHRHTPPCPARTQLHF 
GYEYFSRHFLRRR 


149 


14050 


A 


154 


332 


3 


THKFQRLTSKSKKLKRKLIFHQGAQKSN 
SLNPDI KKNLKRSQRNME*EKI FTDHLS 
SRIYRELIiKLKAK/INLIKMGKGYFYEG 
DT*MADKHIKRCSISPIIREMQIKTTIR 


150 


14051 


A 


155 


337 


21 


LGLLPFFQRCPAKRGGI *RGSLAAVALL 
NCGELHPV * ASW WLCL\PVRGKLPTEAS 
VMGDAP P P TKLE CPRS TSDCCDGS KNFE 
SVDL.SLLGSVGVGSVDLDPVAP 


151 


14052 


A 


156 


21 


491 


HPGSRG CS E PRSGH CTPAWGT JCVKTPAS 
TKTKQNKKKK* RTVCLTPWHTLL/ RVWC 
DGGDYSS SLLi 


152 


14053 


A 


157 


3 


394 


REDLLS PGFQGCCEL* LCHC I PAWVT/ S 
ETLSQKKKK 


153 


14054 


A 


158 


36 


354 


LS AAFTS QLLGRLRQENG VN \ C S E PRLH 
HCTPAWATE * DS VS KKKKGKPFW I PNPL 
S SLSPAF I SGQGPWESHLKNPGRALQGG 
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sequence 


Predict- 
ed end 
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ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K-Lysine^L^Leud^ 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argmine, S=Serine, 
T=Threonine, V=Vnline, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibfe 
nucleotide insertion 














GFQPLICPRPGRTHMSLKSPWRP 














TITLFFLGNKVLLCCPGWSAVA*-S*STV 
ASN\FGPK* LLCLSPPSSSDSRYARPHM 
ANFK 














HLSPLSIPQNRHCHHGPFSVSCWAKLPD 
GWAGQRGSSLP/M*GRPGRGAPSPPRR 
GGWPGGGLTPHLPSGRGGWPGRGTP\PS 
Q*GRPGRGAPS PPGRGGWPGGGLTPPPP 
SQTGRLAGRR 


156 


14057 


A 


161 


293 


2 


PHIAIRPPTTVDKSALYWKKMPSRTYKV 
RQKSMPGFKSSKDRLiT1jFLGAK\AAANF 
KLKPMLID\HSKNTKALKNYAKSILPVL 
CKWR\HKA*MSPHLF 








162 




136 


livpiliamaflrlterkilgyiqlrtg 
PNVWTPPTGATYRIjLS/P*QP**LFTKE 
P\LEPVTCTI TLYI TDPTLALTI ALVL * 
TPLPI PNP VGNLNLGLLF I LATCSLAVY 
SIL*SNRNGIPKTYRTKNSRLYTTTHRP 


158 


14059 


A 


163 


1 


464 


RQGL/DSVTQAGVQGRNLSS \LQPL/PP 
GLK*SSRFSVSNSWDYRCAPPHLANFFN 
LFFVQRG\FTMLPKL*TRS\GPGD 


159 


14060 


A 


164 


1 


353 


FNYS S S L>GNK S ET P PQKK KKKKKGRG PG 
ENPGTPTFGGAKGGGNPG I KILKTTGPK 
GGNPCFKKKPK+PHRGGRP/HGNPVYRG 
GPGKKNFWPPGGGGSKG*KSPIKPPPGG 
KKGDFF 














FFVCTFCRERSLIiCCRSWSGTPGLK* PS 
CLS LPKFS DYRHKW PCS ARNFLPACIiPA 
CLPPSLPSFFPPS/SPSSSPRQGVTVIQ 
VRVQWRDYGSLQP*PPKTKRSFTLAFQD 


161 


14062 


A 


166 


330 


2 


WDYRCAPPHPVLYLKM/STFNIFFL*RG 
VTMLSRLVSSDPPISASQVARIIDVSHW 
AQL * RS V / CYVFE TGSGSLTQAGVQRYN 
HGSVQPQHSRLS *SSHPSLWNYR YPPPC 


162 


14063 


A 


167 


2 


396 


YSNVLLGIYPKELKTLVHWMFICV*NTC 
IWMFITALFLA\ES*CSSVGE* IGKTWY 
IPTMKYYSVMKRNAISSHEKTWTRLKCI 
LPSERSRCDRLQTV 














IiIFGKIYKINNSSKTDNKRGKTQIQILG 
MKTGDIMTY\QQTSKEYYAHEFNNLEEM 
DQLFKKHKRP/RIHQYEKHHLTGPMTIN 
EFEFINVKTPKNQSPGWDDFIGKFYQMF 

S I T Y+PKPDRQ+KKQSCGPISFM 














GRVDLP PTQESRPPG P P PSD PGVQAPRP 
S IVRNCGILTRGSPGPD\PS PLSS *AQL 
CGSPPPPSFS 


165 


14066 


A 


170 


2 


327 


PGGI G CGELR *HHCTP TCATERDS V/ WN 
KIIHYIIMIiASPNELILPLLNICLEIL* 
AGPLTAICISMFMATIA FTIAQMGKQPK 
CPSTNE W I RKMWY I H I ME YYLHFKMK 


166 


14067 


A 


171 


2 


343 


PGWSQTPDFR*STCLGLPKCWDYRR/AA 
TVPGLFLFLTALC 


167 


14068 


A 


172 


367 


3 


FIRDVQFISALRYLLTPERMA/MIKKSK 
NNRYRCGCGK/RG/TLLHCW* KCKLVQP 
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ing to first 
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Predict- 
ed end 
nucle- 
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location 
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ding to 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, ]=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro/ine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














L**SVWRFLKELPLNSVIPLLGIYPK,/D 
KEVIYEKDSCILVFIAVQFAIA/RNIKP 
TYCPINE* IKKMWYIY 


168 


14069 


A 


173 


102 


345 


YQLQN I PRWVYS KLQNLS FAF VLL* RDR 
VFLCHLGWSSLL/QLKRSFHLYLASSWD 
YRQVTPRPAN* ILFRD/RGLALLPR 


169 


14070 


A 


174 


22 


401 


strlglpecwdyrcesscpatylpf*hs 
ipflsqlkvqvhhrryfhlfpelsnl.pl 
lsknvs*ty*ksfls/wpslrvlfcfvl 

FC f vtg fhs VQ AR VQ WGNEG SLQPQPPS 
LK*SIHLSLPSS*DCR 


170 


14071 


A 


175 


2 


388 


SDLQLRAGRTTALFKAVRQGHLTLQRFL 
LSF\V*LCPAPRGGAYRGRQASLSCGGL 
HPGRAYRLSCLPKQAWAMVGAPTPASLS 
SCSLISDCCASNHRDSVGVGPFEPCAGC 
NLTEHRFLSPSGAWSC 


171 


14072 


A 


176 


334 


1 


KLLELIN* FNKVAVYKINTQNQLHCYTL 
IRTTQKGSLKTILLTIAPKRIKYLGI/Q 
/ LTKE VKDLHS E S YTLLKE I K * DINK* K 
DIPCS*MGMVFFGVCLFVYFCFEMAFRS 


172 


14073 


A 


177 


339 


2 


KNSPYNFPKGKNPGNL*SLKSRIF/CWG 
STFAHIGELFFFKMGKKFPAI/RPFFFF 
EDRVLLCRPGCSGWRSWLIAAST/CLG 
WDYRCTPPCLANFL *RQGLTMLPRLLSN 
SGLQ 


173 


14074 


A 


178 




303 


DLRRAACLILPKCWDYRHVPPRPADNSG 
FLHLI I I * VCLPLLCHAQLVFI FFIETG 
FCHVAWAGLKFLGSSSPLTLA/FSKCWD 
YR 


174 


14075 


A 


179 


342 




INRLNIFTMAIFSTLIYRFNTIPIKIPA 
GFVEMDKLILKLVWKFKAHGIGKTTLKK 
NRVPNSKAYYNATVSKT\YWNEDRNINQ 
WN+IDNLKINPCVCGQLIFNKDAKTNEE 
R 


175 


14076 


A 


180 


2 


323 


S TS Y P I TGSHA FL * P QNWDAE TNS * H I 
NNVNLRLKI IKLLEENTEKNCHDLGLAT 
DYY/S VTPKA *ATTrXIDKLEL I KI KNF 
CTSKDIT*KVKRQLIGENSCKSFM 


176 


14077 


A 


181 


326 


3 


RKRKKKRREREPKKDEERKRLYGKDIIK 
KRRKDTM/DWEKILQKGVRQRMCI *KI + 
RGVTQELKANPIRKGGNNLNKVHQRI*M 
ANKHMKRCPKS*VIREI*IPTIMRYH 


177 


14078. 


A 


182 


5 


326 


TKTIEKAAN*IRRSRRKKIKFKPEIKNR 
KTTTKINES KSCFYKKTNKTDKLLVKLI 
RKKKTKKI ITKDEKNHTI \ *DCTEVKGM 
TECYEQVYANKFDNS \EKMDNFLER ' 


178 


14079 


A 


183 


3 


378 


TVLCSFIVWGLDFFG* IHTIYS*VFCYK 
WKTCVHNFLFRFFSA YI * YLLIFVY * S * I 
MHLCLICY*LLL/CSCIFFWIFYMLYFS 
FWL/CFLYFYFYILCPFIFF/CIFICLL 
FFVYLFLLFIFFFIFLF 


179 


14080 


A 


184 


365 


3 


MQLKCIMKS ISVQAGVQWHNFGSMQGPP 
PW\LHHFPASASLVTGTTWARHQGRVIF 
/ I FFFFLMRRSL/DS VDQAGAQWHDPGS 
LQPLLPGFK*FA/ CPCWDDRRAPQCPAN 
FVFLA\RHGFTILAR 


180 


14081 


A 


185 


51 


293 


PYVNQEASVLTNS FYQAS /I * KKSNYRP 
I S LMN I DAKI LNKTQQ I KLNKVND S RQT 
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nucleotide deletion, \=possible 
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GEDQ I PKYCTVSDGRMMRDYFPLFS S 


181 


14082 


A 


186 


326 


3 


TKICQWILNLGKKREFGFKKKKKIKCPN' 
KRHRLANG I KK* D PTTGCLQD I CFRSKD 
AQR LQ I KG WKKI FQ KTG I *R/GVAVLTS 
DNIEFKIKTVTRDRVANYI IKGSIHQ 


182 


14083 


A 


187 . 


17 


330 


KDNEM * I KRTMRCYYTSFTMAKINLKIN 
NIKC*QG*GTTGT/L./LYC* WKHKIVQP 
FSKTV*QFLMRSNKYLRYNPAILjLLGIY 
SN*VKI/C*KTYMRMFIVFLITKDWKQP 
NYSS 


183 


14084 


A 


188 


124 


357 


slwggdlegggekkergenpggpkkpgk 
kkkkggnlgarg/vfkkrgrpqkekpgk 

KGEKGGP*GAGI*GGGGNGVKKK . 


184 


14085 • 


A • 


189 


333 


124 


RLRQENRLNP/GSRGCREPRSRHCTPGW 
ATE*DSVSKTNKKMYKSIIARDCHAENS 
IVSFLSKRGCVFLF 


185 


14086 


A 


190 


1 


327 


KSWL.IGDNIPSSQSLjIQKKGVNPSFK.SM 
KADRGKEAAEEKSEASRSWFMREKERSH 
LHN/ I K+ VQGKAAS YPEDLAS I I DEGGY 
TKQ* IFNEDY/MWKKM*FRSFLTREKL. 


186 


14087 


A 


191 


53 


400 


VNILGPFHLFFTFVQNALSPNIYMTHML 
TSLGPLHMSSLQTGFPKPSYLKFQPSPH 
SVPVGF/IPGMQR*FSIHKSVNVIYHVI 
RMNDISHMIISKDT*KAFDKIQHIGSLF 
PAXjEL 


187 


14088 


• A 


192 


65 


321 


RARTEIYLIHTLLGVYAGETKTGPYTDC 
M*MFTTAPTTLVKTWI*PKSLSIGERIN 
KL/WILICTYME YYSAI KS VLTHLTT WV 
NPH 


188 


14089 


A 


193 


127 


241 


IIKKDSRRICNLTEKLPLRKINS/WPGA 
VAHA CNPS TLGG * GGE SR I TYNESRNKG 
EQTQNNK 


189 


14090 


A 


194 


1 


337 


DLPASASQSAGIIG*ESIRN+NKLVKKT 
LKCSQKDPKEDLNK/WKDIVCSWLGRLS 
LTKVS I LP KL.I YKFAKIPTKLYYD* KKN 
R PGAVAQAYNSS TLGGRGGQ I 


190 


14091 


A 


195 


2 


333 


NFNSLFFFVEIEKLILKFILNCK\VARR 
ILQRKNKVGRLTLPNFNTDYKATVIKTA 
WH*HKDTHMDQRNRTESTKINTYKGAKT 
I * WRKNSLFNK* CWDK/WITTCKRKKVD 
P 


191 


14092 


A 


196 


299 


2 


PHPQRD WQPS P PS I CRI FS P S S I PG I Q 
SWFNIQKSINVFYHIVILNTQHRSSSSS 
SSSSSSSSSSY*LMIKF/LNKLGIEGNF 
LILIKAIHGKPTVNH 


192 


14093 


A 


197 


2 


199 


LAIiSPKLECTGAITAYCSLELLGSNDPP 
TLASKVF/GITGMSH*AQPQWFIjLLYY 
FIFFFSSSSVL 


193 


14094 


A 


198 


226 


2 


KKFFFPNPNLKNFPLGRVFF/CSPG*KF 
FAPLGFFLVSFLRQKVPHLGSPGKTFFF 
FFFFGDRVWTCHPCWSAVA 


194 


14095 


A 


199 


338 


93 


PQHNGSCL* SQLFERLRAEDPLNLRVQG 
CNEL*LYLCTPSWVNPVSEKKSIQEKNA 
NLDFRE Y/ RD KRREN WTC V * NS I ERSSE 


195 


14096 


A 


200 


1 


361 


PRSHLSQLRVRLSQ/ IKKNQKTIGKGFR 
AIG/MGKKF*GKTLKVPPTRGKVNQWKF 
L KL KS FLiPRKKKTE I TL * TLG KHFR KWE 
KFFATYPS.TRE* ISKICRDLNHFTGKKI 
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G=Glycine, H=Histidine, I=Isoleucine, 
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nucleotide deletion, \=possible 
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I I P I FQRAKGL ' " 


196 


14097 


A 


201 


2 


358 


ENKTTMR YH* I P I RMAEN * VLAVGTL / L 
Q PWWE YKR V+ PL WKTNSLAVTLLN INLP 
CNPTILLVGIYPRERKTYVYTVACTQMF 
LTVLFTVAPNWKQSQGPSTGE * KNK* WH 
IHIMGHYS 


197 


14098 


A 


202 


110 


357 


IWVFIWCHFLLPEKTSFFPGFPQCLDIS 
* KTLFVHSVLFFFKDRVLLCHPGWSAVA 
Q+*FTAASD/FQESSCFSLPGSWDYKRM 


198 


14099 


A 


203 


2 


374 


ILKLHAFFMAETPGACKTPCDLRQVFIF 
IFIYLF/MRLS LTLi vaqag vqwrnlg S 3j 
QPPPPG FKRF * GR CL F YPKDGLVCPGLA 
GS*TGQIGKGIRTLSPDLCLSDGFCGSK 
PfCS/ASASQGHVG 


199 


14100 


A 


204 


375 


3 


LSSILIRGSFNLSTLITTQEHL*LLLPS 
W\PLAI I * FISPLAETNRTPFDHTEGES 
ELVSGFNIEYAAGPFAILFIAEYTNI I I 
USTTLTTTIFLGTTYDAIjSHELYTTYFVT 
KTIPLTSPALMNS 


200 


14101 


A 


205 


393 


3 


SARLGLPKCWDYRR/ASTVPGL\SLMLL 
LKS YVS KMKKCE * L VKPLKAKL * DS CKE 
L*VGNNLIMPSTYDQENDRVDSSLTWLC 
VLLLLLLLLLLLFWRQSFALVARAGMRW 
RDLSSL* PLPPGFRQFSCI 


201 


14102 


A 


206 


2 


377 


FRAVLQGRWSLQKFLLPFVQLCPAPKGG 
VYM * RQRS / SLS CGGLRPVRAS WLLCLP 
TQASAMVNAPPPARLLPPRS I SDCYTSS 
EQGSVSMGPAEPGVGYDLLVCCLLRPLE 
KHSIWVRVSCFSRY 


202 


14103 


A 


207 


327 


1 


I LTGNFKQ IRML I YHS VNPRALKNYAKS 
TLPVNYKWNNKA* MTAHLFI AWPTGCFK 
PTVEPYCSKKKK\ I PFKVLLLIDNAFGH 
PRALMKMPEE I S I VFM PANPAS I L 


203 


14104 


A 


208 


310 




FCWWAPGMGFPLGEWGLILGPGSPFFLK 
EGFPGVGQPGPPKRWGLKG* PPGAHWKG 
F/ I FFFADRVSLCHPCWSAVAQSQLTAN 
PASQVQAIL.LPQPPDYRIR 


204 


14105 


A 


209 


374 


3 


GVRDP/LEEWCPF*DLKLVARRT/SLF 
FKAVRQGHLSLQRFLLPFVWLCPAPRGG 
VCRGTQASLSCSGLHPVRASWPLCLPTQ 
ASAMVGAPSPASLPPCSSISDCCASNER 
GSVGMGHSETDVGY 


205 


14106 


A 


210 


3 


196 


LENLKF/ LDKFLETYNLTRLNQEETE I L 
NRPMTSSKI * SVIKNLARAMAHAYNPST 
LGGKGRRIA 


206 


14107 


A 


211 


34 


339 


IIKSKRINYMSCELYFNINIDNFFFFET 
VFRSVI\KLKCYGAFMVHCTLNLPGSTN 
LSTSASQVSGAINRGYPGQF/lGIiSK*F 
LVKTGFCHVA*AGLEPLGSG 


207 


14108 


A 


212 


200 


312 


HYGQFHV1>FCSLF*MESHSVT/ QAG/ VQ 
WP/SLALQTPPPGFKQFSCFCLVSSWDY 


208 


14109 


A 


213 


300 


124 


NH I FFYFQ I HRVCVCV CVCVCVCLCVCV 
YRY I YCVS P W / RN * * S L * KECLS LVLLT 
PTC 


209 


14110 


A 


214 


1 


357 


QVDHTSDRKANLNM F * KI KVIQTMFS DY 
NGVKSENNRRKTGKLTNIQKLNST/L* I 
SNETNEKPQE I RKYFEVN INENTTYQDL 
WDEVS I VAQAHNPS TLEAEAGRLLEPRS 
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LRSAWTTE 


210 


14111 


A 


215 


360 


1 


LTIETYCSE\KKTPFKILLFIDNLAGHP 

SFNSYYLRTRIHRTFWVAQSWLQCPTT 
AISDIQIGPVAMGLLLIPSAFVLCWLSI 
QQVSKLKH 


21 1 




A " 








SSRSRADGLFF YQCK* QTLYW * HL»WYKT 
QKE KKKTQMKEKKQR I NKAR \ GDKLPE C 
EAVCGKPKNPANPVQRILVGHLDAKGSF 
PWQAKMGSHHNIiTTGATLINEQWLLTTG 
KNLL 


212 












LGNRGLFPNPGFSGPFPKKQVWEGALCA 
PG*KVCTLKF* FI \SLAIPGNGSKNFFT 
LFEGPSPIL 


213 


14114 


A 


218 


3 


397 


GGQG Y SGTGHHGG I WH KM EH*R*PLPPK 
K.KKKKKKKKKKKKKKKKKKGGPPKKKPR 
GGQNLKGGGKEKPPPQKGVKKKTtiSGRI 
IKKDGREKHTRGELWKKTFIW /EGEKIG 
EKPPKKI +DHEGKKKVLRGKG 


214 


14115 


A 


219 


3 


401 


DSWATLHGNIMK*SAAVL*ALGLV\FGF ' 
TVSGPTGIVLSNS*LDIELHDTYYWAH 
FHYVIiSIGAVFAIIGGIjIH* IPLFSGYT 

Ij3GMPRRYSDYPDAYTT*NIL 


215 


14116 


A 


220 


162 


3 


KPAQRNL*SNPEKEDINILKRNQS/WPG 
TVAHACNPS TLGGQGKQI I RS GV * EQP 


216 












WCRDRERERERATERERERERERQTDRQ 
TDRRREEA/EGWAAWAIN*GRAPGTSLE 
AALECSCPRRPPQPAPPEQGPFPRTTAR 
GQPRPPKLIjOPEAPSQTRPHGYPWPLRV 
LPQSGPEVRPRE 














D HMR P KVR G CS E L * S CH CT PA W A T \ S E S 
LS QKKKKKKKKRKKI GEKI * GAFFKVAP 
PFFFFKKKMNLKSPVGIAGVAKNTRWIL 
KHRG 


21S 


I41I9 


A 


223 


2 


410 


ATSPI IEEL I T *HDHAL,IN1 LLM * FLGL 

LPAIILVLIRLPSLR\ILYISDEVNDPS 
ITIES MGHQWY * TYE Y \ TDYSGL I LNCY 
ILPPLFLEPGDLRLLDGDDQVALPI 


219 


14120 


A 








GVGKPGFPWAKKWAXjPPYSKGYPK*" VQN 
LILKVKLLLLKKNPGENSLEATGTWGLP 
PKARAPKLKPNW / DS PKRKNPGAFKKKK 
KRQPTEWEKIFANHASDKRLTSRIYKEL 
QQ*KSKQPIFLIPIYHLNHFKG 


220 




A 








NRTTWVPFKTPPLLYHINTKKK * HTGNK 
KHKTKRAKKRKKTKNAKK/EKEKNPHPQ 
RPQKRKK/EEQKPKRGKKKRGKKKKPFK 
EYSYL*K\YIHTNLQ 


221 


14122 


A 


226 


3 


386 


PS THVS INLAMGI PL * ADA VI I GFHSK I 
KDALAHFLPQGTPTPLIPILAIIETISL 
L I QPIAL S VRLTAN I TAGHLLMHL I GST 
TLTVSTINLSSTLIIFTILILLTIL*IA 
VAL I QA Y / VCPLLVGA Y 


222 


14123 


A 


227 


3 


372 


YSLDSPSLTRFFTFHFMLPFISAAIAAL'"' 
HLLFLHETGSNNPLGITSRSDKSTFHPY 
YTIRDALGLLLFLLSLMTLTLFSPDLLG | 
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| Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=L,ysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DPDNYTGSRNAPVKQPRPH I\KPE*YPL 
FA/YTILDPPYXL 


223 


14124 


A 


228 


2 


263 


PRVRSR WEDCLRPGA* DQ / LCAP * RDC F 
FKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKGGPFKKKPR 
GGQNXiKGGKKKKFFPKRGE 


224 


14125 • 


A 


229 


314 


1 


LIKTI FLYQVSLCCPG * SQTPGLKQS S C 
LSLPKCWDHRHKPPCPAQSFYFTFTERA 
LRVIY* FI *RQS /L.RSVAQAGVRWCGPG 
SLKPPPPRFKQLSCLSLLSSW 


225 


14126 


A 


230 


86 


405 


VSYMS IQDHENGHVSGYFELVGQFCLRV 
GIVGGSTFESSNSHSYIYFFSFYLFIFF 
EMGLSVTQAGVQWHKHGLLQPQPPG\liT 
*SSHLSFPR£WDHRPVSPWPA 


226 


14127 


A 


231 


386 


98 


FFPQRR*FPFGNFPPFFPPPKLG\PSQK 
SPIGDFPPPPFFNPGPGPPFFFPPPFGK 
GFSFPPPL*FGPPQGFF*RAPPFFFFFF 
FFFFFFFFIRLLV 


227 


14128 


A 


232 


2 


362 


TYVL FVI P YVETDLVQ * AG * S YS MDY SN 
LTRLQTLNLNVP FN I AALATLRLLYLHE 
TGSNNPIRITCHSDEVTFHPYYTIKDAL 
G*L\LFLLSLMTLTLFSPDLLRDPDNYT 
LAKPLNTPG 


228 


14129 


A 


233 


5 


365 


KNVTPPVNSSQHDLTDIYRNLYLMATEY 
TFFSN/AHGTYTKIDHNLSHKRNLNKF* 
RLKF*RLKLNSKGVLWPKWNQTTNQ*QK 
DNRKLSKHLKTKNTLUSINP+ 1 KEEVSWE 
ILKMYLREG 


229 


14130 


A 


234 


3 


367 


NRIVPNHDIPYFSIiCVCVX/FFFDRVS\T 
LSIi*GWSAGAGSLLTAAPTSCVQAIIjCIj 
SLihSTWNYRRNPLRPAYL/ SYFKSSKSF 
TL/LPKGIL 


230 


14131 


A 


235 


107 


2 


QEAYE*DKERSSRIYMELK/SIQPKPPN 
NRVKKWTKEINR/HFSKEDTQTDKRHMK 
KC*SLICISLVISD*QRNAN*TPNEIRL 
IP 


231 


14132 


A 


236 


349 


0 


NNL* IFLIFLFTSVFLSLISYLFYYIFI 
RSLPICGLFTYFFS FSYIFLKYIiMTLAI 
YYSF*\FFSLIQVILFALIFI 


232 


14133 


A 


237 


3 


317 


TRMHISDKKTYNTNIKRIFRIHIjMRQTT 
QLKIGKRYEQTLYKSIFEWTINTYKEVK 
QLNS KRKPTNTVI KK* SKDLYKHFSEED 
IQITTRY\liKKC*ASLIIKEM 


233 


14134 


A 


238 


346 




AAAQMPAYQELVEEAIAYGRKLGGSQED 
QTMNANDQRFVMS AAD ILS * I PD WSTE 
VHARLSFDKYAMSARA\RRLIELSKEYG 
LSKGRIVI \KLSSTWEGLQAAKELDEQR 
GIGC 


234 


14135 


A 


239 


1 


279 


LLLPPLPSLLLPPSPSSCLPPSPSS/SP 
PSP/S/PPPSLLHPPPPSLLLPHLPSLL, 
DPPPTSHLLPSIP*LDIPNAPT\NSSNP 
SCPCRPRSTEYFK 


235 


14136 


A 


240 


98 


2 


RPRRPRLVLNS * \LKLSAHI S1.PKCWDY 
RREPP 


236 


14137 


A 


241 


342 


1 


DTAS LLQ PMDQ GA I S TFNVYYL / RNI FH 
EAIAAIDSDSPDGTGHSKLKTF*KGFTI 
LDAIKNICDSW/EEGVKISTLTGVWKKL 
I LTLMDDFEGFKT I VGGVTADWETARE 
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E=Glutamic Acid, F=Phenylalanine, 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
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LEL 


237 


J4138 


A 


242 


3 


339 


RKDSRKRERKNKEKRKGRKEGRKKPIRM 
TVHF S S KTMVAK/ ARRQ + NP VFRMLE E * 
KN\FPPRILYSEIIFFGSESQIKKFSGD 
GNPKGS FPELFFLRI PMEVFNLKEAYLR 
KS 


238 


14139 


A 


243 


216 


3 


NSTGTQTQSQKFMLSLVKKLISFIYL^D 
GVLLFAQDGVKWRDLGSLQPPPP/GPTL 
FKQFSCLSLPSS*GSRI 


239 


14140 


A 


244 


40S 


2 


VLTS * SNIAST \ WYGLYQ/ TQLR/KILA 
YSSI/THIG/WNNPVLQYNHNITILNLT 
IYI ILTLNA/FLVLNFKSS IGTILLSRT 
*NEIT*LPSLIPSTLLSLGGLPPLTGFL 
PKWAI I EEFTKINSLII PTI I ATITLLN 
LY 


240 


14141 


A 


245 


3 


243 


I IMLKAGQMTVFPNEDVVIMAYKHTDRYS 
TSLVMNEM\Q I KTTMGYKYAPITVAKLL 
N/SNTRYWP*RG/CSRVIHCWRKCR 


241 


14142 


A 


246 


2 


337 


FLIMDLQKYINPKIKEPQGETNHKTVSF 
MNINAKILNKILVIQI**FPKKNNVSKF 
G/FIP*/SQGCFNINQCYSLDERRKYMI 
ISKNIEKAFNKIQHSSFMIKTKNRKKLP 
EL 


242 


14143 




247 


347 


1 


HYTP I R M TK I QNTDN I KC W * G CGNP / GT 
LLHCW*ECKMVQAL/WL*TVWQLFTKXiN 
LLLPYNPITFFGIYPNEGRTHVHTKTCT 
LIFI IAALFI IARTQKQ/PRCSSVDEWI 
NKLWYIQ 


243 


14144 


A 


248 


319 


2 


KKIILFIICTERVFLCCPGWSQTPRLKQ 
SSSLGLPKC*DYRRESS\PGLNVIL*CL 
KFHLRVAMLFYVFEV* \IVIFFFLRQSL 
/DSVTRAGVQWYNLSSLQLPPPGFK 


244 


14145 


A 


249 


329 


3 


KFPFFHTGEAKNYAVFVINKRI KKKQHI 
HI IYNYGGHQKAECKEIEAHVHCWWEYK 
MMQ/SL*KTAW*FLKK*TMELPYDPVIL 
LLRIYS KELKAGTRTDVCTS MCS FLFL 


245 


14146 


A 


250 


316 


3 


KTHKTIMGTNHIPNEKIDST*LKNLNIR 
AI T I KLL \ KNNR VS LHDLR I GHGL LDMT 
PRAQAAKKKI KR * I DEFI ETKNFCASKD 
NIKRV/NRQHKE * DTMFANH I S 


246 


14147 


A 


251 


152 


364 


QLTLSNHINNQI KYK* TRHS \NHHHHHH 
HHHHHHHHQ* KRLPE * S YKSKSQEHYTY 
GMDITGPKNLRTI PL 


247 


14148 


A 


252 


230 


1 


PCWICEFI VSSNFC WF VLETG S CYVAHA 
RVQWCDNRSL*PQTPGLKRYSCLSFRSG 
WD* RH/ VP IANF* TLPRGGVL 


248 


14149 


A 


253 


3 


345 


D VGLAGL ELLTS GDP PALAS Q I AG I TG V 
SHHSWPLLFFFFFFF/CFFEGGFFFFLP 
V*SQGGDLGSREPWPYGFKGISRLRFPN 
EGKIGPQPLAPDMFCFFDKTWLSTWPG 
WFH 


249 


14150 


A 


254 


194 


1 


G RVDTKW ANTHER FS KCS / TT VD * WI F K 
SWYLCTMEYYSTINKKEILSFVTTWMHL 
EDIMLSEVS 


250 


14151 


A 


255 


327 


3 


VKTAE F VNKWQKNS TKL WNS QAQ I DKKK ' 
I VNQ I NDLRQTE I W MGDR IMNLE SRI QM 
QCDWNTSDFCVTPQ \ YNETEH* WKKVKR 
HLEGREENLTL* I VKLKEQD FEAS Q 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspnrtic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


251 


14152 


A 


256 


314 


3 


KNFPSLINFLRQSFAFNGQGGVQRVDLG 
SPQPLDSRVSDRIRLCFKTQKKKRFSP* 
QT I LKLS KV/E/ MTER VLKTAREKHLVT 
YK/GKPIRLTMDFSAETLQARRE 


252 


14153 


A 


257 


2 


30S 


RWWWIV\EEEEEEVDCGGEV*FFFFCFFF 
FFFCFYVFFFYLFFFCFFFFIFFYFLFF 
LFLFIFFFFFFVFFIFFFFFDCFFFFFF 
FFFFFFFFFFFFFFFFVFF 


253 


14154 


A 


258 


52 


186 


CFGKLGWDDCIjRSGVQGCSET*ACHCTP 
AWAT\NETLSLKKRRRG 


254 


14155 


A 


259 


1 


269 


IjKRSSHIjSLPRSWNYRCMPPCPANF*TF 
FVETG S HH I AQAHLKL \ P PAS A/ FPKCW 
DYRHESLCPALDS PFTNINSATI I T I LQ 
GKCWHYC 


255 


14156 


A 


260 


387 


3 


PLLQVAKINPKR ILDLNVKPKTI KCLQE 
NTGENCWDFGSGKDFLDMTPKMQSTK*Q 
I S Kit I K I YNFS S KTQ / SFCTTDH ENFLE 
DIVKR I KIQSHKLEKRFVNHI PDKRLIS 
RIYQELFRTQHEKPHTIK 


256 


14157 


A 


261 


62 


72 


LRIIKFSEES MLKAEMG * KL.GLFHQAVN 
KCNSLTVNEWNAKKRFLKKMKSATPVK 
THMIRK\DMEKVLEVWIDQISHNISLSQ 
SLIQRKVLILFSSMKPARGEEAV* * KNY 


257 


14158 


A 


262 


2 


193 


GGRGCSELiRSCHCIPAWVTRAKLSLKKK 
KKKNSKF/RELGNKGQFMGP*L.RKGFTC 
YKKRSPLIF 


258 


14159 


A 


263 


3 


330 


QLKKKYYEELYANKLGNPDEMDKFLE I C 
ILQKLTQHKN* KFFKDSRRKSKMNRPIT 
NR L I Q * P KEL P KRKTLR PDG F TGEL YHK 
HFFFKL.TKPLHKFF / AKI KEEEPFPNS 


259 


14160 


A 


264 


326 


43 


TQETEAGESL\DPGGRGCN+LGSCPCTP 
AWATGMKLPFKKKKKKITSLPELRFLSY 
TCSSSQNFLYLNDDTIIHPVAQAKNL.GD 
LDSSPTDPIQ 


260 


34161 


A 


265 


311 


1 


FPP FP / S FPLERKEKFKGRKKRGGTKRG 
PFPPFFPFSSLPLGFP/SPQRA* FLLFR 
FREFLKFFFPFYLPPKPGLGKFFFFFFF 
FWDGVSLCRPGWSAVARSRLTV 


261 


J4J62 


A 


266 


5 


323 


DS T KAQ KNTWS I E PG EVG TL IH C/ WVQ 
PL* KTVWW FLK/D VK/ LELR YD PAI AHL 
G I / YPKKNKT * TCTQVNI TALFIMTKM * 
K*PKD/PINNEWVT/KLiWYMHTMEY*ST 
IN 


262 


14163 


A 


267 


3 


293 


GSLQPLAPRFK* FSCLSLL/ SAGEWHEP 
WRRSLQRSCHCTPSSLGYRAGLHLKKNN 
NNNNKSEVSRICTIGIFLLFIY/CIYET 
ESHSVT+AGVHWLDF 


263 


14164 


A 


268 


3 


332 


HLS I INLNVNQLNS PLKAYTLPVWI * KK 
KWPNY\CCLQEIHFASKNTYKLKVKELK 
KKFQANE KEKHADKTGFKS KTVK\ KKNG 
HYIMIKRSVQKENIYIYMFLIADPDIC 


264 


14165 


A 


269 


3 


326 


KVLERHDVLKLiTQ/DKICNliNM/PYTKE 
MES I VNNLPKKKTPGLHSFTGEFYQTFK 
NEMI LTSYNFSQKFEAEEILNS FYEAS I 
LilA I PKL.DKDI I RKEN * R/ PNAKI LN KI 
L 


265 


14166 


A 


270 


1 


318 


PYPAKLSSLSKGEIGTF\LDKQMLKEFV 
TTRPALQEILKGALNVL I*/ LERKDHYQ 



529 



WO 01/64835 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQID 
NO: in 
VSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucie- 

location 
correspon 
ding to 
last amino 
ncid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G/uramic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QMQKH 


266 


14167 


A 


271 


1 


318 


LKLLTSGNLPASASQTAGITGINHRAQP 
HL SFSL* LWKE T/CVPG 


267 


14168 


A 


272 


108 


322 


DSHVI FLLCGNLTVFVCLFLRWGlVDS I 
THPGVQWHHHNS1.*P*TPGI J K*SYHL.SI J 
SSIWNYRRTPLRLAFFL 


268 




A 








GFFFPFPFFRQGFPLLAQVK/VPWGNFR 
SLQPLPPGVKQFS CLNLPKN / WG FSLKG 
FFKKGCHFPF * Y * I FFNG VKKPDI KFS I 
FTNPFFFFFETESCSWQAGVQ*HDLGS 
TRP 


269 


14170 


A 


274 


41 


328 


FWLLTPGQETERERESETERESEREREA 
GEAERGRETDGIDSLiIYSKLYSKTIHLK 
I LE Y I \ AT YFKR h F YNWEIi * T * AK/ TR I 
IEKPSVRHQCQQRS 


270 


14171 


A 


275 


3 


322 


EAQELL\DPGSGGCSELRSHHCTSAWAT 
KAKLRFKKKKKKKCPGRYLGNIVTiQ* FN 
I IAKCDKYH I VKL I F/CFSS AYTAS I QK 
LIQVSRTVTHRKWKKTRQYVYILD 


271 


14172 


A 


276 


175 


313 


STLISYFRDRVLLCYPGWSAVGQSQFTA 
A*NFW\VK*SSSLSS\PSSW 


272 


14173 


A 


277 


254 


3 


RPRRQFGIEGSFLNQIKNIYKKSTANVI 
LYVTJRLNAFSLISGTRQGCPLSRLPFNI 
VL*LPVNIIRQ/EKVI*GMQIVKEELNL 
SL 


273 


14174 


A 


278 


186 


1 


PETMQARRQ*SE I FKVLKEKN/ LQHRIL 
YPEKLSFQSEREIKTF\CRLKKFITSRP 
ALQRMIiKE 


274 


14175 


A 


279 


288 


3 


GRLRGKKGFNPEGENS KEFFKPLPSGLG 
AKTQPVFKKKKKKKKTRNPIKK/WAKDP 
NRYLTKEAI *MASRHMKRCSTSCVIREL 
* I KTTMR YDYVL I 


275 


14176 


A 


280 


3 


329 


LKESSRLiSLXiCSWDHRFALPPMLSGLVW 
NSCPQ/ CDPSASAS LS IG I TGMSHHTWL 
*WLF /C*ETGSHPVTQAGMQWYDPSPIjH 
PTKKGVYLT/RRFWRLGSPSAW 


276 


14177 


A 


281 


233 


3 


IITKDKEEHFIMIKRS IHQQGITITNIC 
VPYNRGSK*MK*KLTKLMGENR*/HSII 
IAGDFNTTFS IMDRAI RKSAKNG 


277 


14178 


A 


282 


32 


309 


LPDITPRDHLSPGTMDFIQETGCSKCWR 
GCGTPVCC*WECKL/VIVQSLSRTVWRF 
LKKLKIGQVRWLMPVISALWDVEVGGSP 
EARSSRPAWP 


278 


14179 


A 


283 


1 


302 


CKRNNKAWMTVHLFTAWLSEYFTPTVET 
YCS*KTIPFTIL/LLDDNAPGQPGVLVE 
MHKDMNVAFRPANTAS ILQPMNQGGIST 
FNSYYLRNTFPKAIVAIE 


279 












TTKSASQV\RQFSCLEPPSSWDY 


280 


14181 


A 


285 


131 


15 


DRSNPGRFLWTSNS SLYERPRE I RPSSQ 
APPPVNDPI*T 


281 


14182 


A 


286 


1 


359 


FFFFFFSWFWCFSFFFFKKKGGAPGGG 
GPPPPKKTPLFSPQKKNFFLQKKPPGKG 
FKGPGLPPPNFGPG/RNGGPLKPGDPHP 
PDFTPPRWLFKPPF*KRGEPFFPPPGFP 
KNLFLKKVP 


282 


14183 A 


287 


52 


419 


L.EERAGTSWPKKENKKQHFCMKRKKKKK 
KKKKKKKKKKKKKF F KKGAQ IFSF* GGV 



530 



0164835A2 I > 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


I SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=L,ysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NVH/SGPGPVLKTGGGGRHSPPPPPSP 
PSSLFFPSSSLPSLPFLSLPLPSFLPSS 
PPLLSS/CPPPLFSS 


283 


14184 


A 


288 


236 


325 


LN I RT PGS WLXAVAHACN P S TliG GRGRR 
IT 


284 


14185 


A 


289 


218 


409 


KVKGNKADLIRSANGS *N/ VKGVLCHQY 
L>E KS F CLETE FH S CL PQLVH WPDLGL LQ 
PMPPGIARF 


285 


14186 


A 


290 


3 


426 


HEETGS PRLECSGMI TAHCS LDLPKLK* 
SSHLSLGS /RHVP PCIAHFL Y 


286 


14187 


A 


291 


382 


1 


ANFYVTLVQ\QGFTMLRSWRPA* PCDPP 
TWASQSAGITGVSHHAWPKMSTITLGVY 
SFGESEVFSIFKFYFLETGSSFLPQAGA 
* *CKHSSLYP*TPGL.KQSFHLSIiLSTSV 
YRYLPPCPLCPVLV 


287 


14188 


A 


292 . 


3 


309 


HEVPYFTLNQQLEMIKLSKGGLVKAKMG 
\ISQIVNAKEKFLKEIKSVIPVNTRMIK 
QNSLIA\ETQEVSLVWIEDQTSHNIPLS 
* YLI QSKFLTLFNS PKADM 


288 


14189 


A 


293 


2 


415 


ARDQYKKS TKK* AKDL YRHFKNE AI / HM 
ASN Y K/ KRCS I S * VI RKIE I KS S MTS CY 
rHLQN/ALKLKSDWFKC*QKCTAMGI IL 
SWWECNLVQSLWTW\QNIiLMLNTHKPY* 
SAI/PTSGLYPTEMYRNIEQASLRMFWA 
WRGG 


289 


14190 


A 


294 


335 


108 


IPTPK* PLSSPPIHTPYTTIPKP/RPPP 
P*HPPHIHPPPPPLTP\PLPTHTLPTPT 
PTNLPHIPPLYSIPPSSPKIS 


290 


14191 


A 


295 


266 




GFPKKVFS FPIGPPKLiGFPNSVFS/LFT 
FFNPGLCLKFFQFFPFSAFLPLGFSFPF 
F FFFF FFF * DRAVLCHPGW S AWRS WLT 
ATLV 


291 


14192 


A 


296 


347 


1 


DSLrQP*TRGLN*FFHLSFLTRWEYRCAP 
QCPANFC/VFL*RWGFAMLPKLVSSDLP 
ALASQS AG I TGVSHCAWLVFLLFSLYF I 
HFS KASETPWAQNQSQTPSEY PATYNFK 
IYSC 


292 


14193 


A 


297 


284 




CRQGFVLCRLVSNS CDQVLCVRLCV* VL 
GL.QACAT* LGAEGVS IEFSCFHKGRGCL 
CRHRSS/HSMFSAWC* PHTHTHTHTHTH 
THTPSPMQIjV 


293 


14194 


A 


298 


1 


96 


GTRLCLGAITTLFAAVCALTQKDLTKIV 
DFST*SQLGLILVTIGINQPHLSFLHIC 
THALFKAILSMCYITI IHILNNDQDSQT 
I RRLLNTMHINS TS \ LTIDSLS * L /GKP 
FLN/GYY+QKDLTKIVDFST 


294 


14195 


A 


299 


3 


134 


HEGRD*PGNHGDTLSLLKIQKLAGSGGV 
CL \ KLRKLMCENHLNP 


295 


14196 


A 


300 


2 


333 


GVGREGGSREGGEEESGWNGGIRERRSG 
KREEEGKEYRDEVDRKERGEEGGRERER 
GRGWEEEQKGAS * C/EEGRVK 


296 


14197 


A 


301 


354 


1 


TENELLSRIYKNFYQIN/RYKQKNSIKK 
* R K* OA I / TTET I Q T VNKHLKKCS I S DI 
IREbQTKTR* ILYT*LVEKKLKTNGIYC 
WECCRELGSC 


297 


14198 


A 


302 


1 


282 


GTRGMVAGAYNLGLPSSWDYR/ * CLP PC 
IANFF VSLVERGFTRLTTLVLLS * LYDS 
PASRSETAGIPLIHTSCIVYWSLRNNAD 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 

correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=L.ysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Protine, 
Q=Glutnmine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 
















298 


14199 


A 


303 


361 


3 


LPKWWDYRHEPLCLATFFFF*AFPPVPL 
CMYVPPN/RYVAFFFSLSSF* ICCTFTL 
YIFPQLMDG*AVSSFLFLQPVGVKFKKK 
FFLSFFK\ KRRSLGMLPRLILNSWAQVI 
HPPWPRPRA ' 


299 


14200 


A 


304 


2 


352 


ARMVS VS * PCDP PA* ASQS AE I TGMSHH 
AQPI I IS * TIAYVS FSLSSLGT* ISSVR 
S FTKS MSQLL YF YYF VS L/PSFW 










334 


1 


KN S F F F F F KR S LGIj IjAQI * / VQWGDF KT 
LQPLP PGVKQ I SRLNLLKKWDY* RGPSG 
LGKFW / 1 FL* KQGFPQLFQWLNFFFF/ 
C FFETES RS VAQ AGVQWHDLCS LQAPP P 


301 


14202 


A 


306 


270 


2 


DWCAGGGGDGVPRRQVIFVILVETGFWR 
VGQAGFQLLASS YLPALA/ FPKCLDFRH 
* PPHSALKALFFF*DGVSLLLPRLECNG 
ASSPRA 


302 


14203 


A 


307 


306 


2 


H I FQCVCVCVCVCC WCVCWWCVCVCV 
KIiVISQT*LSPLCSGP*A/CTFFS VCVC 
VCVCCVVCV\CVCVCVCVCKVSDLTNLS 
TLCSGP*A*HFFSVCVCSC 










1 17 


332 


S P PVLLRC S LSLS VQLNGKT I RFLK* LK 
MEL*FHSAIPLLGIYPKGKKFLYQKDT/ 
CHS I F I TALFI I AK 


304 


14205 


A 


309 


1 


351 


GTRKTNN*KWAKDLSRHFSKEDIRNGQ* 
V* /HMKRCSASLI IRIM* IS ITV\RYNL 


305 


14206 


A 


310 


1 


352 


PSPPNSPPSPLPSRPPQPPPYPPPPPRP 
PRSPL*YTPPAPPYW/PSPILTPSP/PS 
DPPEPRSSRP 


306 


14207 


A 


311 


1 


405 


FKPSP* PLTGALLGLLMTSGLAM* FHLH 
S I T\ LL ILC \ LLTNT \ LTI YQ * WRDVTR 
ERTYQGHHTPPDQKGIjRYGIILiFITSEV 
FFFAGYF'AFYHSSLAPTPQLGGHWPPT 
GMTPLN P LEVLLLNTS V I LAS * VS I 


307 


14208 


A 


312 


3 


176 


HE I LGSNF/ CGETQVS IHCPG * S * TREL 
KQS SHLNLP KCWHLRA * TTVPGLTT I FH 
LGKPKHKRVR 


308 


14209 


A 


313 


298 


327 


SLTLSPRL * KIKK* RKK* NKCWQGCKE I 
GAFTHCWWGCKMVQPLWKS\W*LPKKVK 
*KLSYDPAIPLV 








314 


344 


3 


HSTSLVIREM*IKTRMTYPFTTTKMAII 
* KNRS * \ WLG CGKTGS LI HSW * KC * MVQ 
SLWNTFGQFFIKLI*ELTIPLLDLYLRE 
MKTYDHTKTCI*MFIVALiFI IAQNVKMS 
LV 


310 


14211 




315 


18 


321 


WAIiFVYCLEKKEFGQPPRLACFFFWDRV 
SLCLPGWRAVMGS WVT / VRPKLLAFN* S 
SHLSLPSSWDYRLIPPCSAKDSICEKNR 
VSTVDGADKKVRRGKEGD 


311 


14212 


A 


316 


289 


52 


ETS CDVPS KGI LHFGKRNTHIMPEKWAK 
GMR * YFH\ KEDK*MAH I HVKRC* TSLVI 
MKMQI KSS TP YHF I PPRLTKPLKS ANTK 
CW * TCI WAI YLSSSMKILSHALCPFFWH 
DVC I SLTEVKNAFTGHITRRFS 


312 


14213 


A 


317 


57 I 324 


YCVTFFFFGKQSFVLSPRLNCMGPFWVN 
CSLSLLSTWNYGLVPGTQPFF\EFLVQR 



532 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 

0(1 


SEQ ID 
NO: in 
USSN 
09/51 5, 1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino ncid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidinc, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methiortine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














GFHYVAQVGLKLLTSGDLLP* GSKGAGF 
TGLTHC 


313 


14214 


A 


318 


65 


341 


QWLLKSRSCYFFFL*DRSWAVAQA*VQW 
R/NPPPPGFKQF/SCVSLLSSWDYRCLP 
PHLADFWI FSRDQPDQHSGTSSLLiKKKK 
KREIPNLPGLG 


314 


14215 


A 


319 


2 


182 


ARGDYRHM PPYT \H * LETGS CF VI QAGL 
ELLDSSHPPALTSQCWDCTCEPQSWSAP 
CVS I 


315 


14216 


A 


320 


345 


3 


MPPRQAHF\VFFIEMGFHHVARAGLELL 
GSINPPTSASQSSWDHR*SQSAWITGVS 
HHAQLGTITS YH I LLFLKKGRAHAC*SQ 
HDPTTCLHVNLCCSLLLAISRPPTTTNI, 
PRA 


316 


14217 


A 


321 


122 


466 


QSFSTYC*KNWVTI/CFLKSIDSYLVPY 
TKIDSKWI I DLNVKP KTI KLIiGGNMGEN 
LCHLMLS KDS S DVTQ KA* SIEKQI NKFN 
SHGGTRLRSQPLKRAEVGRVRLSPGGRS 
YSDL 


317 


14218 


A 


322 


3 


345 


HEDAVSAPCNLHLPGSSNCPASASRVAE 
MTVPA\PCPANFFFFFFLEREFGFCCPG 
/ CK * K * VFQKKPKGKKKGR I EGKKGVWG 
PKLKERERKKRKKERRKRGKERKEVRTE 
MNGG 


318 


14219 


A 


323 


206 


446 


GNLHQCPYKWTHLAKYMCEKQDS IC* IL 
KECYQKPLL.EKSHL IAQVENDEKPADW / 
PLLRGEYVEWND 


319 


14220 


A 


324 


62 


327 


VERLLRDLRENFCRNPDGSEAPWCFTUR 
PGMRAAFCYQIRRCTDDVRPQGEAQAWG 
LQSRAGSLEPEGRGEVSA*W/SARTRHR 


320 


14221 


A 


325 


441 


1 


RKKK* ILQGL/EFRLEHWASPHRDHRV 
WPRQEGKLFSEGKNKATRMTVLYPEE/S 
SKKLGSRS *GSE/CTSVFRITLS VGYVA 
YTGAPVS I SERPSAAI YHRMNKW*HQHR 
RHLLGFSEQEPVQGVEPLQGSSQGPKEP 
QPSPSNPAPRA 


321 


14222 


A 


326 


2 


358 


ARARTLRIMVNLNYAKSTLPVLCKGKNK 
VWVTAHL F TAS FAE YFKPAVET YCLE K I 
FFKILQLIGDAPCHPRPLMDMHKEVNVF 
VPAICTS IQQPMD \ I S P FKS Y Y* E I TFQA 
GCCGSCL 


322 


14223 


A 


327 


1 


430 


ARENMPGHLiHRCI IEQDWRIHMLI PKLN 
TQMIKIV*YWYKDRYVDTE/YNNIESTE 
VNL Y ICGELIFSRSAKII QWWG KRIVLN 
KWCWDD/WNL/SCKSM*MDSYI.TLCr\K 
ITPWIINPTGRAKTMRLLP/EKTGVSFC 
DFG 


323 


14224 


A 


328 


146 


374 


KCLMLTKPYNIMRLT* PYEIR* / CGDTF 
P * S YYA * TGTAVRTWGLT P VI P AL WE AE 
AGGSRGQEMETILANTEKPRL 


324 


14225 


A 


329 


2 


322 


ARASRTFIVRKTQCLASKDKLTLVRG* C 
SCDFQLKSMLIDYSENPRALK/NMIiYKR 
NNKAWMTLGLFTAWSTE\KPTVETYCSQ 
*\KITLKIWL 


325 


14226 


A 


330 


350 


47 


EMGFLHAGQAGLELLTSGDPPASAFQSA ' 
GTTGVSHRARPAN* KKKF/CLKT/RVFL 
CCPGWS*TPGLK*SSCFGIjSSHWNYRHE 
PPRPAQFLLIDFYLRHYRES 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQ ID 
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USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 

nucle- 

location 
correspon 
ding to 

acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T= Threonine, V=Va/ine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


326 


14227 


A 


331 


753 


458 


FFFVE*GVL\HVAQAGLKLLTSRDLPTS 
AS*VAETTSTHHHA*LHLFNFEKYFCKE 
QILLCCPGWSRTPALKRYSHIiCLPNSWD 
YRHEHCNEPEEISLI 














TLV1.MQSCLSLLSSWDYRHKPP/*LSNI 
FNLPMGLSGHNPI INGGISVSLDRVWL 
FLFC/ L.CFETG S HS VAQAGVQWHDHGS L 
HSC 














FI*QKLFFISDGQYANVI J TWFEGGTFHT 
*A*RPNHHSYKLRKDQTIIFIYLFF/LR 
WSL/NSVAQAGVQ/WHNLGSLQSLPPEF 


329 


14230 


A 


334 


3 


329 


HEDWSPFWPGWSQTPDLR* SACLGLPK 
CWDYRREPPHLALNAFLT* NI FNL* W/C 
LSG£NPIIRSICIRFLGKGG*LLFIFIF 
WFLTQSLTHSVTQAAMQWHDLGSLQ 










26 




VCFWLGHLLPVW FRC / 1 LYLF / CWSLG 
FNFCFSLVL/CWFLLCFFCAFMFCIjFY* 
W FVI CHFLFFFFLFYFFVMFLFF 


331 


•14232 


A 


336 


3 


348 


DEVFKS/YH*HPNLKC*GDYSLiLFHGES 
DLDTTQILTHPSTTAMYFVHYCQSP* il 
YGTIDT*PPVIHRNPIHTRTPYPCL,QAS 
T AINLQ L S HMN CN S KATPH S LG YHQT Y P 
PLTV 














MALiLFLR*GL/NSVTLARVHWYNHGSM* 
A*PPGLRCMSXiCPA/NFLYFLVE/IGFA 
TLPRLQAICLGSSDLPTSTSGSAEITGV 


333 J 


14234 


A 


338 


60 


354 


NGKECKKL I YLS I YLS I YLS I YLS I YHL 
C*YL/SCLSI*LSN*LAVYYLSVCLSVS 
VCLi\VASCL*ISIYLSIHLSFYSIDRIL 
AHHCVPPTCSLPKATH 


334 


14235 


A 


339 


1 


351 


RDADVMLLQI LVI FQALALGVQTEGMAG 
VTHHDRVLVNWVRFPQCIj/HRV* SNGT\ 
P*VGTSASERPGLKQSSHPSLPSSWDYR 
CEPPCPVNF/CFFV 


335 


14236 


A 


340 


190 


358 


FGSWLVFFF/CFLETDSRFAP*AEMQGP 
NFG*LNPPPPGLRGFFGLSLPGTGDYGP 


336 


14237 


A 


341 


306 


119 


LRWENHLSLGN*GCSEP*FRHCTPTWTT . 
E*DSVSKQKQKQNNLELKKPGSS\CWF 
LPETLFT 


337 


14238 


A 


342 


1 


122 


GTRGCGEPRSRHCT1A\W*QSKTPSKKK 
KKKKKKKTGP FFF 














ICSSFSF*KGCM/WLGAVAHACNPSTLG 
GQSGWIT * GHE/ FKTSLA 


339 


14240 


A 


344 


83 


329 


ATAPAPCYFLILCI CMCVCVCMHTSVCT 
/ CVIDCMCMGLCHIL * TCVA/ CTCiiGMR 
FLLPAHI I ICVKRFCLMCGSGNSGSLCL 


340 


14241 


A 


345 


127 


357 


YGVLSRGRVLPCWPG*SRTSGF/ I SSPEA 
NSWP PCLANI LYLVETGFCHVAQAGLE L 
LDSSHPPTS ASQS AE I TSMN/HRA 


341 


14242 


A 


346 


312 


1 


LMNLNRS VGALLTRGVKCHKAFRST I S L 
LIGALK* *QS KSDLGISLSKWKDLNND 



534 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Ataninc 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalnnine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Aspnragine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














I YRSLLRQSEEDTS \KWKDI PCS * TGTP 
NI VKMS I LiTKAI YRFNAS C 


342 


14243 


A 


347 


350 


243 


FHHVGQDGLNLLTL* ST/PCLGLPKCWD 
YRCEPPRLAY 


343 


14244 


A 


348 


165 


338 


HTLDPASHEGPTFLFSL/PVEQCLK*LT 
LKVATIVXFCFCF*DRVTLCHPGWSAVM 
QSQ 


344 


14245 


A 


349 


328 


2 


SVCPHGSINLSPAETTGACH/RRLA*NI 
DSRV* AKTFKGK* TNCSYNNNNKNNNNN 
NNNKPFLLLRLAGDRHPDQLCPPPRRCH 
FNGATPAGPPSCLRRSHHSSAVCSC 


345 


14246 


A 


350 


3 


338 


HEIEELITFHDRALIGNILICSLVLYAL 
FLTLTTKLTNTN I LHAKE I DTVRT ILPA 
1 1 LI L I A\LPSLR I LY I TDEGNDPSLT I 
KSIGHQWYTYEYTDYGGLIFNSYILPP 


346 


14247 


A 


351 


124 


382 


NTFKS *NSNKTKNAI RT* AKDTKQHFTG 
ED IQI AOTOIMKRCL I P LGKR \ SLKTQE I S 
PRVRINKIRPGMVAHTCNPNTLGGWGRQ 
IT 


347 


14248 


A 


352 


3 


368 


RDRASLFCPGWS*HPELK*SSCLGLiPKC 
WDYRR/AATAPGLLCL 


348 


14249 


A 


353 


390 


161 


FSRDRI S PRWSGWS P / DLRLS P / CLSLP 
KCWDYR / R * ATMPGT F FHFNLKNQDARN 
QIITCPNLTNKYIYMTSSFILVHS 


349 


14250 


A 


354 


123 


345 


LAPQHFASLRLARP/ PAGPPCDRPLAPP 
RPS*FSPAPL.PPGADRSVPLSPSVPP*S 
LPPPAGAPPR/PPPPRDSPRR 


350 


14251 


A 


355 


37 


393 


T KHFVS TCYVPEFI AG TNQR KRN R I S VL 
ANVKDKQI ILLRVRTAITEGYRNYHVLF 
LVFVCFFS PPKQGL / DS AFQAGVW WHNH 
NSL * P * TPGLKLSSHLDLQSS WDYRRVP 
/RMSSYF 


351 


14252 


A 


356 


2 


335 


ARETS KDRLTLLLG ANAGGDF \ KL.KP VL 
LYHSENPRPLKNYAWST/LPVL*KWNKA 
*MT\HVFTEWLAEYFKPTVQTYCLEN\K 
FSILLINNPLSHPRALMDRFKETIGVLM 
PA 


352 


14253 


A 


357 


199 


377 


VGSSCFLKVCQ/C*AVCVCVCVCVCVCV 
CVCLRCV/CVCL/CV\CVGFFFFLVLCG 
CGCVCVC 


353 


14254 


A 


358 


1 


335 


GTRTTCMYHVPPRSANF*I,VLERWGLPM 
LPRLVANSPPQTILLLQPPTVLPNPSRT 
S ALY/ YRPLLDLS PTS LS PRI PSS PLTS 
SLSIPAPPRSHLAPPPPRPPPHPPIRPS 
A 


354 


14255 


A 


359 


3 


369 


KPS P * PLTGALSALLMTSGLA N M * FG FH 
S I TLLI LGLLTNTLT I YQ * WRDVTRE S T 
YQGHHTP PVQ KGLRYG 1 1 LF I TSEVFFF 
AGFF*AFYHSSLAPTPQL*GHLPLIVII 
LLNSL* FPTLY 


355 


14256 


A 


361 


337 


121 


L WSQLLGRLRQEDCLS PRSQGQNKP * CH 
CTPAWVTE *DS VSKKGKWYIHNKKS SL 
KKK\NSCNMQQFGGT 


356 


14257 


A 


362 


256 


375 


EKKTIVQYPHTYGINV* KNLPAKKTPGP 
DGFIDKLYNTFRGEITTSPHILFH*FKE 
VAVLLNSFSK\AA£ITLTPKPNKDIMRK 
ENYS P I S S Y I WNQCS FFLEQIFPLS FKL 
NCPGLIIPH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 
location 
'correspond 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 

ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine T I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X—Unkno'wn, M — Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


357 


14258 


A 


363 


170 


1 


SQLLRRLRSQGACHHAWLIFVFSVDTGS 
HHIGQAGLKLLS \ *AIHPPCPPKILARA 


358 


14259 


A 


364 


330 


3 


RIMSASASQSAVISDMSHDVRLIDI+HE 
P * F YQAL YS FR Q * ITSSPSYR*SYLLSY 
KWQDEARC/AFVTAWQAEAGGMLEPSRL 
VLDS * PRDLPTSASQSAGITDMSHHSC 


359 


14260 


A 


365 


1 


341 


SQYGLDFHDLSDLPTMASQSAGISGVSH 
RAWPASG+FYVSREVICSYRL*GYTPSL 
LVLE S GS S S VTQAGAQWHNHS I L YPHTH 
GLK*SSCLSITTR/WDYRHESP 














ARMVS IS* PRDLPTMASQSAG I TGVSHR 
AWPATGKFYVSRGVICFII *KDTHLFF/ 
CFLETGSSSVTQAGAQWHNHSLL*PQTP 
GLK*SSCLSIPKC/WDYRHESP 


361 


14262 


A 


367 


2 


337 


ARAG ILKH* QRKCKS I R \ PLWNNVGY Y I 
LKLNIGITHDPAIPFLVLYPSEMHTHAY 
*KTSMRMCITTLFIMAIN*KFKCS*SEE 
E / IGKFWYFT* * LLYSNENEHLQLHTTW 
IN 


362 


14263 


A 


368 


59 


348 


SAFGIQENEMPAGRGGSHL*SQHFGRPR 
RVDHLRSGVPD * LGDR/GET PS KKKKKK 


363 


14264 


A 


369 


350 


3 


GWSQ I PDLRWSTHLS LPKC* D YRRGPHA 
QL* TLMYKYLFEHLLSVLLGI CPEVEWC 
/DHMAILCLISLGTVILSSKALHHYTFS 
PVTCKCSNFSISLSTLVTFHCFLNACYF 
PLFSRA 


364 


14265 


A 


370 


908 


30 


ECSGTILAHCKLPLPGSCHSPASAS*VA 
GTTGVCHRARIjIFLYFLVETGFHRGLNL 
lts * st / s asasqsae i ts vghhaql / c 
LELELK*STCLNLPKCWDYPA*ATVPSQ 
FGSVFLRIF*PMFISNIGQLFSCSAFVW 
P*HQGNAGIiIE*VRKY FLLFKFLKEFA\ 
RTGINFR*MSGRTDW*NHPVRGISLLKG 
F * L VI * CS Y * L* VYP YFLLF * E VS LLS P 
RLECNGVHS E SAG I TGGS HHAQLVY P Y F 
FFVCDSVLVGFVFLEICHFIWLFNLLVY 
LCLWYYL I I TLIAVKSQ 


365 


14266 


A 


371 


164 




SCRQVESYGIKRITCISLASSWNYRHAP 
PRPANF\ * FPAEMG \ FLHVDQASLDSC 


366 


14267 


A 


372 


3 


383 


MVS TPAEDAVNI VDMTTN/ D * EYSLNLV 
DKAVAGLERIDSNFEISSTVGKMLPNSI 
I CYKE I FHERKNQLMQQKSLLPYFNKLP 
QSIQHSAITTLISRQLSTIWQS /PPPTK 
/RFQLTEGSDD 


367 


14268 


A 


373 


1 


349 


TLLGNDH I YNVI VTAHAFWI FWIVIPI 
I IGGSGN*LIPLI IGAPVMAFARINKXiR 
LRLLPTS \ I LLL\LAYAI LEAG ARTG * T 
V*PALTRNY*NPEAYGHLSMFSLHLTGV 
SSISR 


368 


14269 






443 


2 


SSGSRS ECRFLPLLFNWLE I LAG A IRQ 
/EKE/IKGIHIGKVKVSIKLTEVINKFI 
KVA* YKIM\QKS WCLYTNNEQLEKKFK 
IPFKIASKRI KYLE INH * GKDL YNENLK 
TLLKEMKENKW IN I P CS W I RRLNNVN I S 
ILSKVTYKF 


369 


14270 


A 


375 


3 


201 


LWIKKLDIIPIIPS*LFFNTELDKLILK 
LIWKFRGPR/ITQNS FEKPIVGGLILSD 
FKTYYKATVLK 



536 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 
beginning 
nucleotide 
location 


Predict- 
ed end 

otide 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
OGlycine, H=Histidine, I=Isoleucine, 








26 


correspond 
ing to first 
amino acid 

peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


370 


14271 


A 


376 


11 


324 


DKFLEHPPLLVLQAVAGKGS PLHCLSTL 
PTSLFI + DRVLLCS PAWS AVAQT W FTAT 
AS WAQ * SSNHSPSD * R/ PHTTTGHFFCR 
DKVL/ TMLPRLVSNS WTQAI 


371 


14272 


A 


377 


85 


364 


YQHNYS FCFISQY VYRLiFFFF* TFFKKK 
GFWF*RLNFQGGNMG/SMEP*PP+IRQF 
FCLNPLN/ SPDYRGPPRGANF*IFLKKK 
GFQVFPSLSKIP 


372 


14273 


A 


378 


1 


382 


GTSGTSYSTIFAGTLITALSSH*FFT*V 
GLE INMLAFI PVLHQKYT\ P * YTHAAIM 
HFLTDSGA/SVLLIEIbI ) LYSNSLGE*S 
S I I ELQLI S I FLSMML VI PN KFGAAYYS 
I HALCLDRPQLLH I LL I 


373 


14274 


A 


379 


24 


323 


IPGLKRSSRLGLLKCWDYR*DNFQHMCL 
KNFNNWAAY YFVCKI CCAVD PHGCS IKV 
S FFLS F / CFFFTKSRTVPQAEVQGGDLG 
* LEPLPPGLMPFSGLS L 


374 


14275 


A 


380 


2 


317 


AGWFQTPDL,R*SPCLSLPKC + DFRG+PP 
CQKLFLCPIKIF*TRLK/CYLNTL*QSL 
PLMHFKKNVIYFILYKAALFFFFLRRSL 
/ HS VAQAGVQWHDLG / S / LQT PSPGFK 


375 


14276 


A 


381 


1 


323 


VKRQPTE WEKI FA/ TI» YPSGKGL I TRI Y 
KGLKQLEGKN/KSNNLILKWAKEDIQTA 
NRYMKRCSTSLIIREIVMQIKTTTRYHL 
TPVKMAFIKR* /GNNEC**GYSEKRTLI 


376 


14277 


A 


382 


2 


248 


TQPLI LRCLPPRS I YRFNS I P IKIQVNF 
F\ *EKEKSLL»KFIQNL>KGP* ITKTILRK 
KKVDGKTFPDFTMYYKATVI KTVS WYQ 


377 


14278 


A 


383 


178 


1 


HNPPIiAISFCFFPFFFFIiRQGL/NSFAQ 
AIVKWYDHDLLQP* PLGL.KRFSHLSLLS 
SWDU 


378 


14279 


A 


384 


312 


26 


FLRGVFFFFFLRRSL/DSVAQAGVQWPS 
FGSLQAPPPGFMPFSCFSLL/SFFFFFL 
VE / MGFTMLARKVS I S * P FG P PAS ASMP 
VEITGMSHCLANMFF 


379 


14280 


A 


385 


1 


849 


FFFFKQTKFI KLSKYKNI IKKS /SAFLY 
ISNYLKMKFKKIPST*L*FEVNLTKKLK 
HLTFYSKEHYTN * VTHKWNNI THS *TG I 
FNS * I FVLHKMI CRYNATS IKI PVTYFI 
D I F/EKA YLKFI WYHKTP * IAKAZ KTKE 
G I / LPDFE I HYKT WTKTVWHLNKNRDI 
GQWSRRKREQKYI S VFTAN* F * IQVTFF 
FKGNNS I FNK* CLENFMSTCR\KKK* DP 
HLTPYVKINSK*ISHLNVRPKTLKLL\H 
QKI E* KPHNI GLGS KFFDL.T + 1 S QDTKG 
RTSQSDHF 


380 


14281 


A 


386 


3 


318 


LREMQNALESLSNRTEQVEETTSELKDK 
AFE*TQLNKDK\KKRI *KNE*RLQEVWD 
CVK*PDLRirGVPEEEGKSKYLENMFEE 
I I EQNFPGLARDLD I Q I QVAQR 


381 


14282 


A 


387 


1 


382 


FTPTRTAWKK/ SNDWCWRGCGS IGTLR 
HCSWECKMAQLLW * TV/WTFPRKIRQPS 
DVCCDMVIGLPYDPAVLLLGICPREMKT 
YVHTADMS VITSVSLVI X \ ADSENKLNV 
PSADEWINEMWYIHTVDY 


382 


14283 


A 


388 


1 


341 


HKLENLENIVKFL*TH/TLPRLNQEKIQ 
TLNRSITSSKIKSLIKNTPTRKEKKK\N 
PGP KG FPREFFPRA \ KKQG VPTLGNP FQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla!anine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KFRERGSLLNYFYKPGLFRIPKLGKNPR 
GKKK 


383 


14284 




389 


168 


1 


XKFFFFSFFF * GDRVLLCHPGWNAVMQT 
RLTAAS \TPGLKQSSHFTLPSTAGYTG 


384 


14285 


A 


390 


337 




TGITPLNPLEVXPLINTSVLLASGVSIT 
*-AHHSVIESIGNLI IQALLITILLGLYF 
TLVQAS E * FE S P FT I S DG I YG S T * LVAT 
GFHGLRVIIGSTFLTICSIRQLIFHFT 


385 


14286 


A 


391 


1 


235 


IjNFSYESSMYFALFTIVF\WVFbNF*KF 
FMN * I YHLCYVL * YFLLV\ FVCLLTWFM 
VFCFFFFFFFFFFFFFFFCFFFS 


386 


14287 


A 


392 


1 


258 


SCDRLFANHLSNKELVSRKYIYIF*KSQ 
DST I RKQTDKK* AQELNRRFSTKDLQMG 
NKHMKRCSTPIiAI /REMQIKTMLRYHCI 


387 


14288 


A 


393 


2 


317 


LAYCNLCL.PGSSDPPTSSSRVAGNYRG\ 
HHDS VF * RAED INMHE IQ F I S FL FLNR D 
EVSLCCTGWS*TPGLKRASCLDLPKCWN 
YRHEPLCLAFFFNGSWFQCQI 


388 


14289 


A 


394 


91 


408 


LGAEFDVRAYLTSGRLTGPGVPFLTGFY 
SKDHII\ETANISYTNA*ALSITLIATS 
LrSAYSTRXZIiLTLTGQPRFPTLTNINE 
DNPTLLNPIKRLAAGSLFAGFL 


389 


14290 


A 


395 


3 


165 


RNKKLKNNRHWRGC+EKGTLIHCCWWCK 
LVQPLWKAVW*VLR* /LKTELPFDPAIP 


390 


14291 


A 


396 


661 


1 


LCPLSSFYRKCALLGFWFVCLFVFQIRV 
SLCCPRWLQTPGLKGSSHLSLPISWDYR 
HLPPRLAIGAVFLI FVLMRTCLR* FADP 
rSLiSFE*QQRN*LHIIjISLIj*MVDICLT 
QFTKWI FL* PKNS * * LKRQCKLPKITEL 
/PKKRIQIYPTD*MSSIPHSLSFYLVCL 
FVCLF * MESCFVT / TMAAVQ WHDLGS \ L 
QPLPPGFKRFSCLSLLRSWDYRRPSPC 


391 


14292 


A 


397 


320 


1 


PDS KQQ I FNVDET VFC/ WKKMPSRTFLA 
REEKSMPGLKAS KDRLI LLVQANPAGGF 
KLKPVLTYH/SENPRAIKNYAKSXTVL* 
KWNS KVWMTGHL FTALMVLS PRRAL 


392 


14293 


A 


398 


1 


158 


CIGPMWENRLILGGRGCSEL*SCHCTPA 
WAT \ S KTLS QKEKKK *NMRKYM I QC 


393 


14294 




399 


264 


1 


LINEFSQVAGYKINKQKSLVFLYTKYNY 
S KLSEK* I KKAI P FTI PAKKKE I KYLG I 
NI J TKDVKDL.YNENYKILKKIEDT\KKWA 
DTPC 


394 


14295 


A 


400 


3 


343 


HEQKRQSKVREVRELSQGLNMDRKRWSQ ' 
D LNPG CRT VALS P YHHTRLALNC PGRQW 
FVGVNFF*RKKIFFFFKFWDRVLLCHLG 
WSAWPSWLTAALI SW\VK* SFRLGLLS 
SW 


395 


14296 


A 


401 


1 


345 


GTRKNTDNTKCR * VCEETGYFIHCW WGY 
KMVYPLGKXiVWHFLKKVS IHLPYGTSAL 
LSIjMXEKLTFT/ CHTKTCTQM / FHWLiF 
P I VKKWKQPKCLPVGELLNKLWYIS IHT 
IYSAI 


396 


14297 


A 


402 


102 


355 


DRVIRLANFCIFGRDRVSPCCPSWARTP 
GLKRSTSLSLPKCWDHT*ATAPG/LRAI 
LLYFWDYRCLSPDLVNFACEHFSVELVA 
FF 


397 


14298 


A 


403 


156 


3 


NNKNGFP * SMLFQPSNGLVS I *NI \ PLK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

n u ele- 
ct id e 
location 
cor res pon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucrne, 
K=Lysine, L=L.eucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutaminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophau, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














*QKWFSFFFEERVSL.FHPGWSAW*SWLi 
TACSLDLPAQVILPPLSLPNSSC 


398 


14299 


A 


404 


1 


364 


GTR S E VI YKJjLLQT PWVI* HMRTRLLLQ 
ACGAR * MWM I L \M 1 1 LGLVTCRLT I YQ * 
WRDVTRESTYQGHHTPPVQKGL.RYGIML 
/FITS*WFLSCVI \WVFF 


399 


14300 


A 


405 


3 


365 


heqtshniplsqspi*skilmllnsmka 
ergeeaaeekleasrgwfmrfkkrshvh 
s \ kvqge aanad veatas yp edlvrt i d 
edgdtnkqif\|nvnktafcwknmpsrts 
tareekqclp 


400 


14301 


A 


406 


3 


356 


HERSDQLYAN/KLDNL,DKRSKFL\EGYK 
LQKLTQKEIQNTNRPTTRI /E ISKKKKK 
KKKKKKKNFS PGGFTGE I P PTFKGQFKK 
I FKKFFKNLGGEKTLP 1*3 YGAG/ 1 KLL 
PKIKK/DPSKKK 


401 


14302 


A 


407 


116 


362 


YKYSLTPQKL YNHS YI KKNHNKQNNRNH 
STTTTTKQPPPPGFKRYSCLSFPLS*DY 
RCTPS CPVNLF / C VFLVETGFHHVG 


402 


14303 


A 


408 


3 


360 


HEVRLWDFAFERNEGGGENEEKVDWLNY 
RMWFHLIF*AECYSICRLHAYYSCILGP 
VLLLLFVLIILLFCCL/SFFDYF/VFMF 
FIFFYSFFFLLFSLFFSLFL/CFIFLYF 
FIF/CFFLYFFFF 


403 


14304 


A 


409 


388 


1 


ALLKFPFPKVLKKPP*GSFFP*GF*LLS 
/ LI FPPYFGRETLFFFF*NKVPLCPPDW 
SSMARSRVPAGSVFPVK\ESLSLSLLSV 
PPQVQVNGFLTFFFFFFSW*RQGLPLL>P 
RMVWNS WAQA I hPh WPHAS 


404 


14305 


A 


410 


1 


386 


VFNAEESAY/YWEIMPQRKFIR/EEKQA 
P*FRQERIS*YCANAWFTIRTTHICKP 
ANPQALKKKKKKKHQLTVFWLYTKKS * T 
MRTLFLDWFCQCFVPKVRKYLASKTLPL 
KVLL I LDNAPGHLKPH I FN 


405 


14306 


A 


411 


2 


417 


AHHI FTARIDVDALAYFTYVTI I IDI PT 
GDEPFS *LATLR* INMK*SGAVL*ALRL 
I FLFT VRGLTG I ALPDL*LD I VLHDTYY 
VGAHFHYGLSVGAVFAILGGFID* WPLF 
\SCYTLDRPYAKIHFTIIFIG/VDLAFF 
P 


406 


14307 


A 


412 


295 


397 


WQWPGTVAHACNPRTLiGS*GGRV/TLRS 
GVRDOP 


407 


14308 


A 


413 


437 


3 


PGFGSLIGNLIPASGNGRKSKSCVCVCV 
CVSVCFKS CT* S LCEHLFTCLCPQ I CVR 
LQLMV/CPQNCVCWCPESGLFLDESVCV 
RLCVCAWMAVWWGGSGSG * VCGCGC\KC 
ICVGPLLDSELCLCVC/LLGQSKCDCGL 
RCRLPAWCV 


408 


14309 


A 


414 


2 


392 


HLQFIFFWILKIFHYL/FLWFWFYLTSM 
ALFYLQ*QKRNCMYEVFNRGLFF/SCGE 
GVSGS PAS SSSSPS CS SSTRGGGAVGGG 
GLGFVCFLLLFWGFWFVFLFCWFL.VCVG 
WVWC VWWVFFLLGV/ CCWCFFF 


409 


14310 


A 


415 


376 


1 


GFQASKDRL/LLLG/ATAAGDLKLKPML 
I YNSKNPRVLKNFAKCTLPVLYRLYR * K 
NYAWMTAHLLKSWFTEDVKSTIQ* KISF 
EMLLIINNVPGHPRTPMEMYKELNFF\M 
PANTS IP* PMDQG I VL 
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SEQ LD 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


Ml 
eth 
od 


SEQ ID 
NO: in 

■USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to. 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartt'c Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H = Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=IY1ethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
YV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


410 


14311 


A 


416 


409 


2 


FFPIKKKKFAGALVFFFFFRKIFFFFPK 
GEGKRGVLVSLNLLPLC*KDFFSPPPPG 
EGVLWPPPPGPIFFFFFLKKGFCLFCQG 
GFYNPSLYLIPLSPFKIWG+TRGFT/LP 
PPALFFYFFFF*DRVSLCHPGWNA 


411 


14312 


A . 


417 


425 


3 


RELLAFWQNFKLRPQGPFFSP\GPGGEK 
KR*SLPKTPPKGGPPGP/SPNGRLPSGG 
QLPPPRGG /SPQGPPPLPKGPQGWGPPF 
P*RSPPCPPHRIGVPQVSSSPTPGL.VFP 
RGPLNPGNLGGTKKKKKISSKAARDLEL 
VRTRG 


412 


14313 


A 


418 


1 


346 


L.L PDRNLNTT FFD\ P AGGGDLI LYRHL F 
* IFGHPEGYMLMLPGFGI ILHIVTYYSG 
KKEPFGYIGMV*AMISIGFLGFIV*AHH 
IFTV* IDVYTRAYFTSATI I IAIPTGVK 
VFS 


433 


14314 


A 


419 


2 


382 


LiFS TNHTDI GTLYLLFGA * AGVLGAALS 
LiLI RAE LiGQPGNLLiGNDH 1 YNVIGTAHA 
FVINFFIVIPIII\GGSGN*LLPLIIGA 
PDMAIARIlWIISFRLLPPSLLIiLLASAI 
VEV*SRTG*IIYSSL 


414 


14315 


A 


420 


261 


378 


KKNFFFF*KLNFF*KFFLIFFPPKKKIF 
FKKKKKFFFYKIFF/I*KNIFFSPQKNI 
*PFLFFFI*YPFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFTLN 
SLYFSGRVGGRV 














KPAATHACAT I FMCLDQEAI I SN*H*AT 
TQTTEVSL\SFKLDYFSIIFIPVALFVT 
WSIIEFSL,*YINSDPNINQFFKYLLIFL 
I TI L I LVTANNL FQLF IG 


416 


14317 


A 


422 


8 


355 


PVFS YNHS TLLTFSLSLSFFFFFFAPGK 
KGEKNGKPGPFRGGFFPKKIKKPFPPP/ 
GGPKKPPGPGFPQKPHFPFKGGPWPSKT 
LFGRGGKPNFGGPQKPPLV* NP PSQG P / 
HGPSGGG 


417 


14318 


A 


423 


356 


3 


KPLGIDLTKKVKKLSKKNYKTLMKKIED 
DSN*KDNSCL*IR/MTIIiPKAIYRFNTI 
S VN I PMRFLPDAWADAW 


418 




A 








PSVRLGGHWPRSGITPLNPVE/VGLLNT 
IRLLASG V3I TGAHHRLIECNRHQ IIQA 
LL I T I LLGL Y \ LTLLQ AS * Y FQAP FT I S 
DGIYG*TFFGSTGFQGLHVIIGSTFLTI 
CF 


419 


14320 


A 


425 






GPFTPWSLC*GDLQR* P /RAVKFFLKKK 
KKKKKGNP IRN* RDIS S * FLKNLETAVR 
NFWTFFSYFKSKKLIQKSDHPLKNVKKI 
LQNDKRYLKLDWG 














DRRRFCTYHKDIGALYLiLLGA* AGWCT 
AVSLLIRAELGQPGWVLLGNDHVYNVIV 
SAHAFVIMFF\MEYPIINGGFGN*LVPL 
ISGAPDMECSGINNISFWLLPPYVLLLL 
ASAIVEAGSRT 


421 


14322 


A 


427 


359 


74 


I CADYTRKP P YMGAP I EFSCATCVL * LT 
STQRECVCVCVCVCVCVCVSM*VAI J IPD 
SN/CIPLSAYVCVCVCVCVCVCVCVSI * 
VALLLDSNSYH 


422 


14323 


A 


428 


52 


361 


NYPQLSEI* **VS*LHY/LFSFLFFYFF 
*LVSR F VARLECSGAWH \ I IAHCSLDLL 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
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ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H— Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Volinc, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GSS /DSSPASASLVAGTTGMCHQNQI IF 
LFKKVFVGRVQWLTSVTPATWE 


423 


14324 


A 


429 


105 


361 


SRLFFFFFFFKTNFGFVPKVGVK/WAFF 
GLME P FAFRVK * FSGUHPRTWNYRNVP 
HCPVHLEF* VKTGFNLVDKAGPKLLT* K 
DFP 


424 


14325 


A 


430 


1 


379 


HAYH I VKPSP * PLTGALSALLMTCGLAM 
* * RLHS I TLL I LCLLTNTL \ S I YQ* WRD 
VSRESTYQGHHTPPVQKGL* YWIVLFIT 
SEGVFFAGIF* AFYHSSIAPTPQLiGGHW 
PTTG I TPLNALEG PL 


425 


14326 


A 


431 


1 


390 


KXVKKWKNLIFMIGRLNNVNMSVLHKVI 
YKFRAI CLR/TFHFHRKKKNLTWA* KHK 
KAL IPKTI LKKNG KSGG I TLS DYKM* YK 
TTITTTIRFWWKF/DINKYNKITrHEIY 
FHIYGQMYFFKIAKTFQGIK 


426 


14327 


A 


432 


1 


392 


TRTRGRTQWD* T* LRPI / WTKKKKKKKK 
GKKRKKKKGVFFFYFFLGF* FFLWVFFF 
FFFFLGFFFFFWFFFFFFFFFFFWGIFF 
FFFLLIIYNFFCKF*FFFFFFGFPNFFF 
FFFWFFFFLDFFFYFFFPRV 


427 


14328 


A 


433 


382 


1 


RGGKFQNQKQKRKVSPPPPKKGFFPPAA 
PKNIKGGGKK*TPPKKGGVS\PPPPKKR 
KSPPHKK/IGNFFPPPRGKGGPPKNPKK 
PGPPFFFFKNPPPFFFFPFFFFLFFFFA 
HKDGLLAREQTQAEVKT 


428 


14329 


A 


434 


375 


1 


KYSQLI FDKTAKAI K* SKDCLLKEWFWN 
KWTSTHKSLDL/SPFTKINSK+ ITGLNL 
KCRIITLLENNIEENLDRLGFGNDFLTT 
LPEAQSKTELISY/DFIQTNFCSANATV 
K*MKRQATEWEKIF 


429 


14330 


A 


435 


1 


459 


PTRPPTRPSTRTLGFTMLAKLGSNARPY 
/DYPSYSASHSAGITCVKHCARAVIHDF 
NGVHRPLLI W * EFL VE IS FI FDFFLETN 
PGFITKGELQGHNLGSLQVSPPGLTLFS 
CLSLQKSGFYG/HLAKYLVKF/ CFGIFS 
KRGF*GGYPGGAQSPPP 


430 


14331 


A 


436 


14 


382 


MARNTSQKDIR I DPNNKCLWLVKKKXKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKRKNYFHVI \LKNKAVNNGSTITS 
TNVMS PGLHIRL 1 1 1 LSLM I YKKSTTE * 
VEKQTCYYILM 


431 


14332 


A 


437 


3 


192 


TPGLK * SPCLGLPKCWDYRR / AAAVPGL 
♦DILAIFFHHIIiSESCLFLLVFIHCFNA 
NLFLRWSS 


432 


14333 


A 


438 


1 


364 


DRI ALGTVDDLPGRPTRP \AHP I I LKVS 
LADRDAI *NLWQIPIVAS*YI PLGF/WS 
KAMPCS VD I YSS FEKKKKTVFFDRGQGP 
TPE IQVLGKAEGVKPFEPRGS KPS WRIM 
AKPKFYKRFKL 


433 


14334 


A 


439 


336 


2 


GMIPEMKGNAGP/ SG/PRKPGFFLG*KG 
KVKAWLNFKPPFRAFWPKKKILLPLGFS 
GILGPGPKGNLSRPGPPPQAFPLGFCQV 
LGP/SGPGVGFPKPGQIGWSNNPCLTLQ 
KNK 


434 


14335 


A 


440 


1 


333 


HKLENLENI VKFL * TH/ TUPRLNQEKIQ 
TLNRSITSSKIKSLVKNTPTRKEK/SKK 
PL.VPKGFTIEFSQLCKKLRVPVFMKPFQ 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
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peptide 


M 
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od 


SEQID 
NO: in 
USSN 
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Predicted 
beginning 
nucleotide 

correspond 

amino acid 
residue of 


Predict- 
ed end 
nucle- 

location 

ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














KYGERGLFLNGFYEPGLTLIPKFGKNPG 
G 














WGINNWLS P * EKKGRPFLIj \ SHPKINS K 
WIKDLNVKGKI *KP3jEGQIG* YLMIFGV 
EKGLFKGGPKYTLYKRKY*NRELLFIKR 
YFS LTEKS PAE I CHI KKLAKKK 


436 


14337 


A 


442 


8 


391 


ERILRHDQVGIIPGM/QGWVRI*ESVDE 
I/QHV /NQLPKITHTITSVSVGKARDKF 
NIYS*FKLGKTEKERNFLHWIKSICTSP 
AASILLEGKESPF\PFRSGTRKD/CLL* 
P/LGSGMVLEALGTAERRTRKEK 


437 


14338 


A 


443 


363 


2 


HHVGQACL.ELLTSGDPSNLASQSAEITG 
MSYRAQLSIVTFSAYFLW*KLSHARGL 
ML * QS 1 1 1 /HVLHFHQVKKAFHVS /PEN 
SQPLHNVEPEDWIF*EHQRKTVPPIHTA 
RKLQDliEPCLV 


438 


14339 


A 


444 


1 


900 


DSSAGIT/GICHHAQII/IiFVFWETGF 
HHVGQAGLELLTSGDPPASASHTGGDYR 

SLPKLDPNKRKFSFPP*LF/IGS*TLIW 
SLFCFRNVCTQLADPTKSIAYQSSLMKP 
FGQKS ICFGSEKFPDKVYLFSSDRKAKE 
QVL WWWEY * I * DMSACFQ *ALGFC Y 
CLjSIWN/YELPEKFEVIjACSLPSRNNDL 
ILSLKKKSQNSFFVCVFFFFKTKSNFVP 
OAEGQGPLFG*LKLPLPGFRNISCLNLP 
GSC\QTGAVPPPPVNFGFL>KKNRVSP 


439 


14340 


A 


445 


3 


346 


QIGKVKKLNK*VPHEPSKNKL\FLEASS 
LILCNNNEPFLSRIVTWDENWILYDNH* 
QPAQLLD * EAPKPNLHQKKKKKKKGLAP 
FWGAFSRGNPFYFFNPQQKLHF*KFSSQ 


440 


14341 


A 


446 


2 


246 


FKCGKTRALMHCWWEGKMLRQL \ RKVWQ 
LLAMLNMELPRE PAAPLLS isprerk/s 
TTAKTCA*MCIATVCITVKKWKHRLLG 


441 


14342 


A 


447 


38 


393 


VILHRQGLSL/ VTQAEVQWYDYNSL* P * 
TPGLKQS SCLS FLKSYCHG*1>FVWWF 
KWM\GFTMLPGIjILNSWPQ/CNPPAVDA 
Q I AG I RG FHS / VGQAGVQWHDliS S ZjQPL 
PPGFKQFSHLS 


442 


14343 


A 


448 


54 


540 


RIPFEHASGFLQSSHQKPHCLLHPLSGQ 
VSSDGQ\FRKFGISRLGNSGIYKAAFPL 
HDCKFRRQSEDPSCPNERYLLYREWAHP 
RSIY*KQPLDLIMKYYGEKIGIYFAWLG 
YYTQMLLLAAWGVACFLYGYLWQDNCT 
WSIEVCHPDIGGMI IMSAQRDRL 


443 


14344 


A 


449 


2 


310 


FFFLRQSL/ DSVAWAGVQWRDLGSLQPP 
PPGFK* FSCLSLPSSWNYRHAPARPANF 
FLYF * * RQGVTMLTRMVL I S * PRDLPS S 
AS QS AG I TG VSHHAW AKI S L 


444 


14345 


A 


450 


2 


466 


KQKIFSVDETAFYWKKMPSKTFHS*REG " 
TASKLHTAG* ILLGPNAAGDLKLKPVFG 
DDSGNLRALKNYAE S PLPVL YKWNNKAW 
MTAHLFTAWFTEYFKPSLRS\FRKISWK 
I * LFMDNGPHHPRALMEMCKE/NAVFMP 
ANTTSILQPMDKGVILI 


445 


14346 


A 


451 


670 


212 


SSSSP*GS* YQNRTNTIPSPKKKQTKK\ 
KNNHYKS I FFTDAEILSNrLANAIQQS I 
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peptide 
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ed end 
nucle- 
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location 
correspon 
ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibte 
nucleotide deletion, \=possible 
nucleotide insertion 














KRFIRHEQLGFILVVLGWVNIQTPTNAT 
CYTNRLKKRNHMTILINAEVLYKIQPSV 
I*KLFPLRGAF 


446 


14347 


A 


452 


407 


219 


PL I SLRWENHLGPGVQGCSEP * LCLCTP 
AWMTE * YPIS PSQNKTKKQ \HTRKQNQH 
KNKCVXN 


447 


14348 


A 


453 




395 


WFLW FRERSHLH / R VQNEAAS ADVEAAA 
SYPEDLAKI IDAG \ AKQI VS VEETAFHW 
KKVPSRTFITREKTTSALKG/RC*LSLV 
DNAAGHF*VEA\MLIYHSDNPRTL/KN/ 
YAESTLPVLYKWNNKAWMTACLFTS 


448 


]4349 


A 


454 


424 


38 


EETEPL/HRP I SSAEVELVI KNLPT/ KH 
KSPRPDGFIAEFYKMQKE*LVPILLKLC 
QKI KEEGLLPNS FYEAS ISL IQ \NSGRD 
TV/RKENFRPIFPMNIFAKILNKILANC 
IQQH INKLI HHYQVLFS SLHSR 


449 


14350 


A 


455 


2 


309 


PRVRSQTPGHKRSTCLGLPKCWDYRHEP 
LHPATAS FLVAAV PGM/ FAD P P / CNMHL 
NE * MNE* MNGDDASE ILS FEMRSHS VTQ 
TGGQWCSHSSPQPQPPGPPK 


450 


14351 


A 


456 


3 


441 


DAWGLVLDRERP FF I FFFFFFLGNGGYF 
RGPGGRPGGEGPPMETSVSPVKKILRGP 
GQGGGKARKPHPLGGPRG PNHKTGG * KR 
AYPTGENPVLTEKPKFTGPGGNGPETPV 
I GKAGAGKPLN PG \ MGGSRN PKWAHCP * 
TGGKKGK 


451 


14352 


A 


457 


1 


234 


PTRPNHLGLGDRG CRE PRS SHCT PAWAT 
ERDWVS KKKTKG / EKF * KGRTK* MS * VE 
QGGSGGKGIFTLAISNSPIjPFF 


452 


14353 


A 


458 


42 


470 


KRIPQLKKSPLPLKNPPGEWVGKINFPP 
PRER PKNFF* KKKTLKTPPKQKFLRKKS 
P FKKPH FLL Y S KATKKQKRG KKKRAP P P 
KKKKKKENPKI VI FRTE ITVAS PVLSWA 
VKPIIHIFFPREKKPHA\KPPPP 


453 


14354 


A 


459 


2 


393 


DRPI EQWNKRERPR INLY I DGQM I FSKG 
TTAFE / WK* NLCNKYVWNAWI C\NRRKN 
P YLTSSRKINLR * I IDIKVK/PKAIKLP 
EQNIRCSL*VLGVGRDFLENMNYTRKK\ 
IGKLDFIKISQLGMVADTSNPI 


454 


14355 


A 


460 


338 


33 


GDEKIEKSFFFPISSLCFVP/AYL*QQI 
RLLI FTRNSRLLF I Y/ CLFLEMGSHFVT 
QTGVQHCHHSTLQPRPLGLKQSSHLSLL 
SSWDYRHL P WL KWT E ACAV 


455 


14356 


A 


461 


77 


435 


AKEVEASLSTLARPISIQQQQQQQQNEK 
KNKN* PRIAKAI \ LS * KGEITLP /ELQL 
CYRAMITKTAWY*HKNRHIDQWNRRENP 
ETNPHTYSELIFDKGPKSI\SLFNK*CW 
EYWIFICTR 


456 


14357 


A 


462 


157 


3 


NGRVDLK I QKLARCGGACLQ S QL/RQEN 
HLNPGEKG CS ES * LHHCTPD * VTKQ 


457 


14358 


A 


463 


363 


3 


PS VAQAG VQWCHHS S LQ POP PRLR \ HP P 
ASA/SQ*LGLQGQATALYIHKYILFPCN 
VLISF I FP FFFLEMRSHS V/SQARVQWH 
DGSPQPPPPELKRFSWEAELAVSRDRAT 
ALQPRRQ 


458 


14359 


A 


464 


3 


396 


LKEI AEKVKKI IKFKKQLRL*MKKI INR 
YLKEN/LNQLLEIKGTLRELQNAVESFN 
NRL*QIEEGISKLEDKAF*WTESGKIRK 
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T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














INK*TK*TDYIKEPNLRIIDVPGGEEKA 
KSLKNLFEVIIEENFPGLAR 


459 


14360 


A 


465 


399 


1 


PPGVLLNGPPFFFFFLiILGPGWVFGQPIj 
KTPLVFFPNFQYH I FPFKISRI F/LPTR 
FPFQVPPLCP*PFSSPWGLKWFILLSF 
FFFL* DRVSLCHPGWNAVM * SQLTAASN 
TW\ VK* S SHLSLLS S * DYRCVLS 


460 


14361 


A 


'466 


279 


1 


TTNIFNPRRVGSTDAGPLNMKEKEKENV 
KRRQATGWEKIVAKDTSDKGLLSKIYEE 
LLPCLKNEKTNK* / ILdjKWSKGFNRYFA 
KEYIQMANK - 


461 


14362 


A 


467 


2 


436 


RGELPQLDKYLKKTTAIILNGEEL/E/A 
C PLRSRTRQGCS LS PLL* KKFFP P KSH I 
ESLLFFNI I LEVLYSAVRQENE IKIVEI 
GKEEMKMYLFTDGNHLCKNSERIRN/YS 
KVAGYKVNV*T* ITFLYTSREQVEFEIK 
KTLSMNRFF 


462 


14363 


A 


468 


2 


420 


RTTALF * AVRQGRLS LQRLLLS FC \ CLC 
PAARGEAYIG\RQASMSRCGLQAVQASC 
L.LCL P KQAWAMAG AP P S AS LLPCSLISD 
R CAS S QPDS VGVG PSEAGVG YNLWRGL 
L.SRSEKR\NIRLGVTRFSRCV/LSPLSL 
TRR 


463 


14364 


A 


469 


1 


416 


PSP* PLTGALSALLKTCGLAM * FHFHS I 
TL.GILCLLTNTLTIYQ*WRDVTRESTYQ 
GHHT\QAVQKG\LRYGIILFITSEGSFF 
AGLV*AFYDSRLPPTLQLRGHWPSTGIT 
P LN \HLQVAVLGE P VLLAS * VS I T * ADH 


464 


14365 


A 


470 


211 


398 


I FFFPKMGGGLS 1 1 LCWRRKRHPQKX.K* 
STCLGliPKWWEYRCEPP/VPGQKKNFRP 
*KTKESPPLVSGGASSASNIKL*IALPP 
FLEKKKFKKQGFKDPPFPLFFTQNPGQG 
NFFFGKI* YCPPIFFFFLRRSFTLVAQA 
GVQWHDLGSLQPPPTGFKRF 


465 


14366 


A 


471 


62 


424 


TLMHCW W KWKMVHLLWKTVLNLL I KIN/ 
RTIiNHT* PCNPAIPLLGVSLREMITYVH 
KKPCP*MFIVALFMVTKNSKKLKCSLTC 
KW I NKLWSLYTMKYYLATKRKELLNHNR 
SCRNYSSDMS 


466 


14367 


A 


472 


427 


20 


LGLRLPFVEQEENKPYYPLAPFSPPEPP 
MSLYKNSP I YPGKGFLFPSKNNKI PPLN 
FLKMSNLF* / FALLLGNKDFKLLLLGGP 
IGIMGDNMATTSQMFYSRVPIFF/CFFE 
TGSHSVAQAGVQWCNLSIiLQSLPPRLK 


467 


14368 


A 


473 


424 


98 


NWYHjYYYYYYHLLF* RQGL/DSAAQAGV 
QWHHHCSLQPRRHRLKQILPPQLL\TSW 
NYRHKPPRLS P YVAQAGL * LLGSSDLPD 
SVSQSAGIIGNEPLHLASIj*FLFAFP 


468 


14369 


A 


474 


415 


75 


NHFLiKFFPQTQANGPGGKNKFFFLiKTRF 
CFFPPGKRPWAYYKSLQPPNSGGQQI\S 
APTP*IKGAPKGGPPTRVKFFFFFLF/L 
RWSLRSVAQAGVQWHDLGSLQAPPPGFM 
PFS 


469 


14370 


A 


475 


3 


413 


PVQKGLRYGI \ILCITSEV\LFFAGFF* 
AFYHSSLAPTPQLGGHWPPTSITPLDPL 
EGPLLITS VLLASGVS IT* AHHSLIEN\ 
NRDQIIQALLITILLGLYLTLLQA*EYF 
ESPFTISDGMYGSSFFVATGFHGLHVI 



544 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


470 


14371 


A 


476 


1 


440 


ITLTDRELSYY*ANRLI.*LAYTITFIV* 
I PLYGLHL* L\PKAHVEAP IAGS IVLAA 
VIXKLGGFGVIRLTLILNPLTKHIAYPL 
LVLSL* GI IITSS I CLRHKDLKSLIAYS 
S I SH I AL WTAI L I Q/TP * SFTGAVILI I 
AHGLTSS 


471 


14372 


A 


477 


2 


397 


L FVYNH I DVGALYLL FGA* TGVLGTGLS 
LiLIRAELGHPGNLLGNDHIYNVIVTAHA 
FVIILFIEIPIIIGGFG\N*LVPLIIGA 
TDMAWPRINNISIj*IjLPPYLIj1iIj t jASAI 
AEDGAGTGGTDYPPLTGNYS 


472 


14373 


A 


478 


442 


44 


SSSSPFTPPRGKFFFKKTPRKKFFS/SP 
GNKGFFS PLSP * KFFFFFNPFFFFGGFF 
PNFPPPKKNFFFKNSPGFFFFPPLKKKI 
FFFPPPLNFAPPKVFFKSPPPFFFFFFF 
FFFFFFFFFIiRGHPWGREGI 


473 


14374 


A 


479 


1 


439 


PTRSPTRPLVLDRERPPFFFFFFKKGPP 
SVPPAGGGGANLG*GNPPPL/GVKKFFG 
PKPPKIGE*RPWPPPPGQPECSFFLKKN 
GLPHGGQGR*N\PPPRP 


474 


14375 


A 


480 


420 


0 


YSPPSPPK/YRAPGKKFF*KKPRKEKF* 
KKKILGFFFPLSPLKFFFFPKAFKFFGG 
VGPNCPPPKKRFFSKNSPGGFIKPPLKG 
KNFFFPAPVKFGPPRGFFKGPPSFFFFF 


475 


14376 


A 


481 


371 


2 


NKI'NKFSKLNSKKSSSSSSSSSSSSSS 
SSSSSSSSSPSVNK*IKKCSTSL,TISSS 
SSSSSSSSSPT*IAKIKNIDSPKCWQGC 
RETGLTV\LVQPLWWQFLLKLSMQI I *D 
SANMILSMHARG 


476 


14377 


A 


482 


3 


335 


HASGKDRHTDQRNRIKNPETDT* I YSTF 
♦QKC/RLI *WRKDSLVNKWC*SNWA\SP 
MKKIKLDLSSSSSSSSSSSSSSSSV*NV 
KLU3NNVGGNLQYRGLRVHTVDVKAQH I 


477 


14378 


A 


483 


1 


418 


GVR*FSPi.NPPSRWGPKHGPPNL\IJSIFF 
FFFFVFLVETRFHPVGQAGLELLGSRAP 
PASA/FPK 


478 


14379 


A 


484 


1 


356 


FCANAVRSMI KTVLIDKATKP /RAWKGN 
YKYHLPVFNCKTGRT\DSGNPLNWFYQC 
FVPEIRKYIiARVGLPFNVFLILDNAPGH 
PEPHEFNTEGFRWFLTPDMPLIQPLEQ 
GVLRTLTA* YQCFVPE IRKYLARVGLPF 
NVFLILDNAPGHPEPHEFNTEGFRWFL 
TPDMPLIQPLEQGVLRTLTA 


479 


14380 


A 


485 


166 


406 


FIFWGSVSSSAEGGGWSWSSLPRVTVR 
PDETVDVTIHIiI J KE*CRPGAVAHACNPS- 
T LGGRGGR I / TLR S GVQDQPSQHG 


480 


14381 


A 


486 


107 


429 


FWVTQTFGFFWLNPPGGIiELWPPPPCPG 
NFGGNFKKKGGFPLWPGGVQTPGPRGIT 
PPGPPRGGK*RGGPP/ SPGPGFLGKPKG 
GGP 


481 


14382 


A 


487 


399 


2 


GSPAPGLPKGWGFRGGPPGPGKFVFFKL 
PKGGFPGAEI FLGEFFQRGKKGLGPI FP 
PVFLKTEEGGTLS/NSFFKGRVFLVFKP 
KAGPKKKKKKTNYKPIYLNHTDTKFFNN 
MLANRI *QCI KEVMQYDRTRG 


482 


14383 


A 


488 


419 


2 


PPPPPPPREKKGGLYFNPKIFLGKSFPP 
RNP KDPS * KPRVG I FFEG P KP P KKKKF* 
FGPWGKPPGDG*RKNPFK\PFFFFFFFF 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 

od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 

ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G = Glycine, H = Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 

Q=Glutamine, R=Argininc, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 

nucleotide insertion 














* DGVLLCRPGWI AMAQSRPGLQE WNS I S 
KKRSGPCVLGISSKEVPDAWADAWADAW 


483 


14384 


A 


489 


629 


0 


SSSSR*SLLCKCFKN*GEKIRNPIEKWA 
KVMHRLFTKRCINI *KDVRSTS1jMVRET 
* ITTTLRYHFSPIRLSKT+KLNRTLYG+ 
DCGETGIFIHCWWECKMF*VL*RE/VWQ 
YLIKLL/LHIPVDTA 














K^JK PfciMDPuM *GY* VNDEGDTT YQQGNE 
ELRS / WCWDDNRLAKVKLDLYFPPHITI 
EINSKWI *DLNIKNPPIHVLEEWMN/2* 
F*HVGIGKHYLTI*LKI*NP*K**IWLH 
ENKMYCL 












507 


NLAK* IQ* * IQTIMHHDQVSFIPGIQGW 
LN I LKS I N I I Y YTNRL * D KTH I T I S IDA 
EKAFDKI*YLFMKGK\KPLSQIjGIEGN/ 
YLKLIKGIH 












417 


REGKKS RVHHFN I KQGRVMSTKH* KTQL 
/SHSNAHSNKAQISKSQPHGLHDFFKKK 
KKKKKKKKKKKKRGEKKKKKKRGKKKKK 
KKKRGGGPP*KGVREALFCFFWRKKFFF 
VGGGGGKTPLGCLQADTPLWGA 














FFSSPPPPIjPFFFPHLFPPPPNFFFFFS 
PPPFFFF/PPP*KTKFPPPPPLFFPPPP 
LFFFSPPPPPFFFFFSPPPPFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFWL 














KKKKKKKKKKKKKKKKKKGGGPLKKKPG 
GAKKKG / EKKKKKF F * KKGE KKKP PGK F 
GKKKKI WGGEKRAKTPQKKKTP * GKKKI 


489 


14390 


A 


495 


274 


1 


IYRIDCAYMKKVERSKISILS FHIRKLE 
N/QLNPK*/RRRREIKIGAEINEIENRK 
* IEKINETKSYF\WKI SKPLAKLIKEKT 
Q I TNTRNRAY 


490 


14391 


A 


496 


336 


1 


VFQYTYNKLVSMFYYCFFL*RWGLiTVtiL 
RLVSNSG\PSD\LPSSAS*VAGTTDAHR 
HTQLCFTTLSDLLEPYFHLGSFIFSLL.* 
FFFLNTESHSWQVGV+WHNLGSLQPLP 


491 


14392 


A 


497 


445 


382 


PR VTLI FFFLTPRGPPPPP PPNKVFFPPP 
PPQNFFFPPPPPPSWGGFAPK/PPPPPK 
SFFPPKPPPVFFSPPPKEKKFSFPPPPH 
FAPPPVFF*PPPPQ 


492 


14393 


A 


498 


430 


2 


SPPPKPTRM* \KVKFGR* KKGGGGGGGS 
PPL.SPPLWGSKPWP*VPKGR/PLPG*A 

GQKNGFTPEWLLL+PKFSPFPSSLVDR 
RIYCLLKMLEYKSVMMILLFASCLNLYT 


493 


14394 


A 


499 


363 


3 


KKLiVTPARVLGDI I PRNRFLQMPQQREN 
FLCQVWMTKP PTT I FV KTKTG KWYL I S L 
KRFCPAKKKT / 1 KI LVNRVNRQPPEWEK 
I FANYASDKGLISSLYKDLKQVYKRKNN 
PITK*VKGMY 


494 


14395 


A 


500 


145 


435 


VFMCINSSFLLFIPWYEYTTVWPVTCRR 
T*MFIAALFVIARNWN* SRCPSTGNW/L 
KLWYIHTME 


495 


14396 


A 


501 


162 


1 


FYNYTIIFIYLFIFIFLRRNL/DSWHT 
GV*WRHLGSLQPLPPKFKPFPRLSliP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ [D 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 
otide 

correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanir> i 

C=Cysteine, D=Aspartic Acid, 

E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Hisridine, l=Isoleuctne, 
K=Lysine, L=Leucine, M=!Vlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
\V=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


496 


14397 


A 


502 


343 


2 


IKKFRWGPKPPLKEPPGV/SPDFPII/N 
FGNPGPVPGKDFKFPNFPKKKKWGGRAP 
PWPTTPGG*MGRFP*PPAFGVPRGGDG 
SPPPGAPKEGPLSKKKKKENEKEKERKE 
KSARL 


497 


14398 


A 


503 


23 


405 


KGRNYLWEKKKKNMLVRGKIGGGPPPRI 
PKF*KVIFKTPPGPPFFFFGPPLNFFFF 
KTDAPLiFF*NHPPKSKIWAliAPPKKKIF 
LNPKKPPPFFFPPHFFKKNFPRAFFKNF 
\AFSPRGGASPSPPP 


498 


14399 


A 


504 


418 


247 


PPKWGF\FPKPPRGFFFPPPKGKKFFFP 
PPG*FGPPQGFF*RPPPLFFFFFFFFFF 


499 


14400 


A. 


505 


26 


426 


GCTGXil>HS * M YAKWCCTDH PIT* VLS P 
AFISRISYSPTHPQLILFF* IFFLFLVE 
TESCHIAQAGLK/LPASSSLPVWASQSA 


500 


14401 


A 


506 


337 


2 


IPNLKESA/CPNLPKGWGFKD*PPPPSQ 
I SL I FKYPKFLKFE I * KKKPPLXKGLFW 
WV*RKKAPVGF*G\EMALGEKFFFFFFF 
FFF F FF FFFPRRS FAFVAQAGVQ WRD LG 
S 


501 


14402 


A 


507 


454 


2 


TS KTGQ PGRRGS PFPI* WAAGQKRSS P P 
RRGSRAEALLTSQTGWPGRDTPHFPDDG 
RPGRGAPLFPDGAAGQRRSSLPRRG/VP 
GRGAP \QSQMGCQLGRGAPHFPDGVAGQ 
RCFPFPRWGSQVEALPTSQMGQPDRGTP 
HIP DG VAAGQRR 


502 


14403 


A 


508 


390 


13 


RIPPPKSRWKGKGPF*VS*NPRPQI*KN 
FLPPPP*K/YGDPRGPPPPPIKFL.PLKK 
KGAPP I CPGCFE I PAPRES PPLAPPKS * 
NSRGNPPPPPFFKKKNPLFWGGKTKKLK 
IFFFFFEAGSCYVA 


503 


14404 


A 


509 


2 


282 


WQFLTKLNILPYEPVIMLCIYPNESKT* 
IYTKTYA* ILLAALFIMAKTWKQGVLKK 
V\TDKVWHIHMTEY 


504 


14405 


A 


510 


400 


250 


LLSVTQAGVQWHDHSLLQPQTPRLiK\HP 
PTSAS * VAGTIGAYHHAWI IFFF\ IFIY 
CRDAVLLYWLGWFPKGLLKCWDYWCEP/ 
RMSGLFVFWSDLGFRSEVS*ILSSWDYR 
CVPPCLDNFFF 


505 


14406 


A 


511 


99 


242 


VTKEKEGHFIRIK*LIHQEDTTTINIGV 
PNNRTLKCMKQKLTELKKEI 


506 


14407 


A 


512 


139 


374 


SLWGKRIFFFFGADPHFVPQAGGA\WGN 
HG*LQPTPVGLKKSSLLTFPISWYYRLG 
TPPPANFKIFCKNGVKPCCPACF 


507 


14408 


A 


513 


23 


401 


STCLRLPKCWDYRRESPI,PAI.CMI J FI/*M 
KGKYVYMGFFI\LFFFLFLHKI * LLQEN 
FVYVCLKKNKTKLKA*TKKQKKNETG*N 
I LVDLNVLS * KK WKCLLWG F I L F YF/NF 
LRQSL/NSVAQAGVQW 


508 


14409 


A 


514 


7 


386 


FFVFSTHITLLFLFLFFFFLFPRPPGFF 
IAGKKNTPPAPPEKPPPP\PPKKKRGPP 
FFFFGPPQKKPPNS PRGGRGPPPPRPPR 
KMEKGGPPPPREIPPPPFFF*PPKTPPP 
PPGGGGGGKKTPPfCR 


509 


14410 


A 


515 


370 


2 


FVFFQ WLPE *NS FLFSPF* KGLC * RDL 
AFKRFFPWPPNKKLWPSLF*GFLKNS*K 
FNFSLKSLEVPFFF\CFPKFFPWI*AFP 
RFFPFFPPPP*RG* IFFF\*DGVTjIjCHP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


1M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
. nucleotide 
location 

ing to first 
amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

last 8 amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D-Aspartic Acid, 
E=Gtutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

T=Threonine,V=Valhie, 
YV=Tryptophnn, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














GWSAWQFWLT 


510 


1441 1 


A 


516 


1 


163 


NTREKLSGGGGTQIjS * /LLGRPRQKNHIj 
NLGGGGCSEPRS\HCIPAWMTE*DPVSK 


511 


14412 


A 


517 


1 


390 


RIVRVYYKHLC/Y*GSVNPREKIDNFLD 
TFNLSRLNQEEIETLSTPISSSIIETVI 
KSPPT\KQKSPGPE 


512 


14413 


A 


518 


47 


312 


E YTGL I LRLY I E / LFTTQH * KQKALNQA 
K*LAKGLNKHFIKRGCMNDPKHMKKSSA 
SLAI RKM* I KTMMRCHY I LT * MAKI ENK 
TESTRP 


513 




A 








QNP INKQ* KKNNKKYRLV* HKKPPPFTT 
. QPHQSNQHQ\PN++PQKHT*TKNHHTPP 
PPP*SPPPPSLSPSPPFPPSPPSSSHPP 
L.LSPSPPPPSSPPSSPPPRTSP 














RNVKHTKRCPT S \ RELQ I KTRY Y YI S I R 

* RVKYKKRDNTKC* *GRETTGTLVIHYS 

* EYKMV* PLWETV* QFLTKL/E * S Y/P * 
DPAITLLGIYSKELK 














GKKIPMLYFAEIEKCILKFV\KRS*KAK 
* ILPKNNKAC-GLKFPDFKTYYKAAASKH 
QVGGLWKEL.RSTLQAQQTWVPREELRST 
SPS 














KAKT*NNDNIKCWQ*CRETGPLIYCC*E 
CT/LVQPL * KTV* QFLIKVNM\DPAFVL 
I FIPKK* KHMFTHKKNCKHTFRAALFVM 
ANTR I I PNI FQFVE WLNKLSYKHI VDY 














TYIYIKTCPRMFT / AVMLTITKKQKQPK 
YPPTDE*NV\YIHSNAYYSEMKRNEVLI 
HATTWMWPKNIMLNKASSRKRPHTX*LH 
IYEMSRII QYMVLCNCLLLPSMF 


518 


14419 


A 


524 


426 


9 


aelpasptpgtctpqplgsgrdqvp\ga 

VGGTHPGGSGLAGSPP*GGLGMAGCKS * 
ALPRGEVTEAWRQFKCGERRQASSTGGP 
GAPSAAAGPGAKPLTAWGQRHQPAAPSA 
GPAEPLPTQNWCWPASNPGSRTRLSLKT 


519 












I C I W R KMN I L L P Y T KLj N S R W I AD L YV KG 
TTIMFLEVSVREHLHDSQKKIF* *DAKL 
MNQKE+LDILDSINIKSFVH*KAPLREN 
KGKLQSAIHIYEKGLVSRIYEELLQ\TY 
KETLKANKKKIDNPIEKWAKNLNRHFTR 
GCPSIYKHVK 


520 












ATG S L C C P G WS I V A I HG FNH S AL * L L T P 
GLiKH P P AS ASQ VAGTAGMS DCTQLCKIN 
FFVFVFli/RRQSLAVTQAEVQ 


521 


14422 


A 


527 


3 


438 


AVSHDCTTGLQPG*QREILYQRKKKGTF 
S PbP P PS PGIGGKPAIiPFWGPKTWPPPG 
IPPFLFSPPPPSQSGGGPRPLETLAGSF 
SQNPWNLRDGAPPPPGGGFFG/CPPFGP 
PGGLG/PPGAPPPPLKNFPKKKGGGGGF 
LFPPQAKG 


522 


14423 


A 


528 


2 


616 


FFFDTEWRSVAQSGVQWRDLGSLQAPPP 
GFTPFSCLSLQSTWDYRRPPPRPANFFL 
YF+ * RRGFTVZiARMVS IS* PRHLPALAS 
QSAGITDVSHRAI,QVCFITTLL*LSK/H 
QFKKAGVTL/PHLQCLHEIGLDCVLHKH 
/WSVHHSPP/HKTNVC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


Ml 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence <A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 


523 


14424 


A • 


529 


57 


485 


RHSSLGNKSETQSQMKKKKREKKKGKEI 
SFDPAISLS I * PKENKSLYQKD/TCI CM 
FITALFIIAKTQNQPKCPSTDE* INKRV 
\IYIYIYIYIYIMKHYSPIKKNEIM/SF 
AATWMELEAIIIS3TMQKQKAKYCLFSC 
I ■ ' 


524 


14425 


A 


530 


399 


3 


FFFFFFFFWENPPPPPK/WRGGGKKKAP 
FFKFFKYKK*LFFF*GGGGGHQQKKKKK 
KSSSFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
F* FKGFLiIiGTTHASAHASAHAS 


525 


14426 


A 


531 


491 


4 


SKWI KDLNliRTK/ TTKFVEET* VKI FMN 
liGLGNDFLSMTPKA/LKIDKSDFMKIKN 
/FCSSKD/TIKEVKRQHTEWVKIFTSYI 
FDKGSTSRTCKEL/RKQQQNKPIQKCSK 
VLNRFFFREDIQLSNKLMKGCSTSLNH 


526 


14427 


A 


532 


184 


509 


PQWPAHSFLPALGSSGTGPY* WRQIFD 
SKDKESSQ/WSHETSDRPKPADHRRRSR 
PSLATSP\PRLEPHPSLPNHSGLPILSS 
LPWGAVALAPTHFSALAWP *RPLPCNSQ 
GEKF FFW FGF F\ E RESHS ATQAG VLGHD 
LGSLKPPPWGSKGFFCPSYSGG/WNQKN 
HLSP 


527 


14428 


A 


533 


401 


3 


VIREMQIKPLHIQ/PGWL* SKSVKKY*Q 
GCGQS/NVLIHY/WLKT* IMGPFWKIVW 
QFLIKLPYGTVILLSGIYIIiWRMEKKPP 
KFCTQIFIAAFFIIQ*PKSENNPNIHQP 
KNQ * DVHMYIHTME YYLAI RMNIH 


528 


14429 


A 


534 


419 


93 


SLAIRQM*IKIPPRYTRVAK\IEKSHNT 
KYWKRHGATGTL I HCWWECKMA* S * KI W 
*FLIKSNINLPFNPVILL*GIYPREMKT 
CLYKD\ CM*MFMAVSFIEQNGKQPKCL 


529 


14430 


A 


535 


414 


2 


NFLARGYINCGPQFFFFPPRKNLKWGVP 
LSNFPPKL*KGGIFGEGPKKVKKNFFFF 
FLFFFFF* KKPPCS PRLKAK/ WKIFGPL 
KPPPPGLKKFSFLTPKKNGDKRGGPPRR 
GNFFFFFFLVFLVKTEFHYVGQAGLK 


530 


14431 


A 


536 


32 


435 


DRATALQLG * KSE TS S YKKKKKKKKKKK 
DGGAPLKKNPGGAKKKPGEKKKNFSPKR 
GGKKNPPGNFEKKTNFGGGKNGAKPPQK 
KKT / AWG KKKNLKG KRG KKN P KTLGAKK 
FPPRG*KKKKKPPAARPGKASS 


531 


14432 


A 


537 


348 


1 


EYIKSTHQMGKNYPLKNTVSSGQVQWLS 
PVISAQFAIAKSWSQPKCPSLNEWIKKL 
WCV/WCVCVCVCVCVCMMEYHSAIKRNE 
LMAFAVT+ MRLET I I I SEVTQE WKTKHR 
MFSL 


532 


14433 


A 


538 


523 


66 


TDQTSHNIPLSQSLIQSEVPTLFNSVKA 
E*CEEA/AEHKFEASRGWPTRLKERSRL 
RNMSV\GEAAGSPEDPATV/ INGGGRTQ 
PQIFSVAEATLNWKKTPCRPFIGREKSM 
PGFRASKDSLTLCRDFKTHLCLVFHYWN ' 
AKMVQVYILLKVTAKI 


533 


14434 


A 


539 


413 


2 


RDKV*PCCRSWSQTPELK*STRLCLPKC 
*DYR/R*APAPGPFI>L*VHRHVSIFKSG 
PIiSCRCSNFQD/HDS I KPS FPTIQYAHG 
NKFQL* TPDTLI FFLKWSFTFVAQTGVQ 
WHNLS S LQLLPPG FKRSDAWADAWADAW 



.549 



WO 01/64835 



PCT/USO 1/0492 7 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


1 M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=1soleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Scrine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


534 


14435 


A 


540 


3S5 


2 


MASKHIKT/CARQLAIREVQIKTTQYHV 
IPTRMAKVKKT /DNECWHGC*NTLiINCW 
WDCKMMQLLWKK/ SVWHSSRGKMYVQLP 
CTPAIPLLGIYTELKTCSHKN/T/CT*T 
FIAALLVIAEKWKA KCPSAEEW 












398 


RPPFFFFFKKRPIiWKKKQGFPPPA*RGE 
IPSAKK\PPLGPVSGPGNPPGP*KP/PQ 
NPPPLGGSPFGKKPPLDIPFPGGPIKGK 
KWGGPPLiAGGATTKN/PPPGGNFWKGGK 
APPFPSQKF 


536 


14437 


A 


542 


370 


10 


FFLRTKVSL.FPRVEG\KVQSQLLAAPPS 
WGQV I P PI.QS PE * LRQKD P F S PGG* GCS 
EPCSCPCLPAWMTEPNSVSKS*KKKRKK 
E KN * * KKKKS PI NTTGKIR FR I I H 


537 


14438 


A 


543 




370 


FLLiRHILiLCHPGFCSVATAYCSRDPGSS 

d/plppqaplpdq * prlqa/ whrlapph 
sanff* * rqgftvlarmvs i sqpcdpph 
wasqgatatkaddyqk: 


538 


14439 


A 


544 


50 


395 


IPGLTRQWLLDPCASPSTPPYT/P*VQP 
STPQNSSPSPKTHNQKGLPMPLSPTPKP 
STAWKKAILEHTSSSSSSSSSSSTCRNR 
NGYTYTVPEHRPARGHTASQTRKQVIAA 
THKP 


539 


14440 


A 


545 


1 


370 


LCSVTQAGVQRRKL.NSVQPPPPRFKQVS 
CLSLLFFFFF* KRVFTLFPRLEGRGP II 
FNGSPTLRG*GDPPV*ASKELRTKGGPH 
QGQL INLF / CLGATGPTYGAQGG FKSPG 
LRRWAPLGPPRA 


540 


14441 


A 


546 


178 


365 


YKKTDATKTKMDK*DSIKLKSFSTVKET 
INKGNR \QPTEWEKI FASHTS DMGIISQ 
ICKELKQ 


541 










236 


KGTTKXiWKLSDFKSYFKAIVIKTVRN / 
WHKYK\HIDQWNRIQNPEINPHIYGQMI 
FKKRAKNKQWKKDGLHKKR*WSNDFQET 
CQEQTMEKGRSSQEKMM 


542 


14443 


A 


548 


355 


3 


I KKKNLGRKRFCFFQEEKTWP * TPKKKP 
LENPGGVFPNPGFPPFFKAKIPEGPFPG 
V/SFF*RGGFKGYPPPLFFFFFFL*DGV 
SLCHPGCSAVAQSRLTASSASQVHAILL 
RHVDEGR 


543 


14444 


A 


549 


1 


373 


CFLI CRHTSHMWR I WFQTTAINQI WQ* K 
ESQKWSQ \LYI * KLHLLYE VQ * HFV* K 
YTQ *LLLENADTK * AHAVGKMVL I DLL K 
AALSL I FNL * KNK * NLKNWPGMVAHACN 
PSTLGGQGGWLTS 


544 


14445 


A 


550 


446 


~2 


NIDKAPTVLGKMVSIFPTSRSPPLGPPK 
CWGHRRE P P R PAY FG I * I L FLT P P * NPL 
NNSPQHLSGKISGIFSDPSLSVFFSSFF 
FAPPPKKFMLCF*PFFFFFF*DRVSL>CH 
PGWGAVERSWLTAAPISQ\IR*SFHLSL 
LSSWDHRHA 


545 


14446 


A 


551 


342 


26 


WAPPIFFFPPYKRSPQKPFPPPGENNSR 
GF* IFG/RGGFFKKEGLSQFFFFFIjKKV 
FFFSPGGEPQGYFPPPKASFLKRIFFPP 
PPIKKGDPGRGSPPRGNNNPFF 


546 


14447 


A 


552 


3 


167 


QPHLQDCL*QQNPVAVG*AFSSHPRDSL 
KNPW\WPGAVAHACNPSTLGGRGGRI I 


547 


14448 


A 


553 


3 


413 


TL I TALS SH* FFT* VGLE I NMLAF I PVL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
LSSN 
09/515,1 
26 


I Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylnlanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K= Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutnmine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Lnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TKKINPRSTEAAIKYFLTQATASIILLI 
AILFNN I LSGQ* T I TNTTNQYS SL 1 1 IM 
AIAIKLG IAPFHF* VPEVTQGTPLTSGL 
LLLT*QKL\APISFIYQISPSIiKKK 


548 


14449 


A 


554 


441 


1 


SSPPPQARGEIFF*KNPGEEISQHNKKN 
EVFPPPPP*KFFFSPKAFIF/RGGGGPK 
RPPPKKKFFPKKTPRVFYKPPQKXKKFF 
FPPPVKFG PPRIFFKS PPPL FFFFFFFF 
FFFFFFFFFFFFFFRMGCDRWHGKFWIL 
RDGF 


549 


14450 


A 


555 


3 


455 


NTNSSMYVCIYVFILRQSL/DSVTQAGV 
QWRDLSSLQPLiPP\GSWDHRHLAQQ*YIj 
FMVLLLEV \ FVYERGT IMVP I L * DCNVN 
CRVFGTRSLS VYCVMG I FYRC/H IESML. 
CDRCFIFFDIGSHSVTQAGVHWCNLGlA 
MKPLPPG*RNFSGLN 


550 


14451 


A 


556 


1 


299 


RRMRQENRWNPGGTGCSEPRSHHCTLAW 
ATEHDSVSKKfCKKIRSSLGKKT* FYHNI 
LKAMG/CITGIHKGPEGAWQREGHFTPP 
QGPKKAPQRGQTYPDP 


551 


14452 


A 


557 


3 


391 


PVWWNS FEAS SGWL I GFKKRS C I CNI KV 
QGETASASVEAGVSYPEDLAKITDEDGY 
IKQQIFNVBKTT/YY* KKISSRTFI VRE 
KSMSGFKASEDRLM* FLGANAASNF*LiK 
PILICHSKI PMTFKS RAKS 


552 


14453 


A 


558 


337 




TRFPPFLGGFSTKFFYRPLVRFPLL/RE 
RKIFPLPPVFVGGFPPALGPFWGFLFFN 
FSKKRPFGFFWGFFF/ CFPKFliPFKPFL 
GFFPSFRRGFFFFFETVSLCQPS*SAW 
Q 


553 


14454 


A 


559 


117 


419 


I P P LLLGVGL FFFFFIRK KRGV FS PRWR 
GGGSNFGLLETPPLG I DPFSGLTP PKSW 
ELRAP P P P P I KF * KFFLKKNGF * WVS PG 
GLEISALLVFPA/SASQ 


554 


14455 


A 


560 


347 


1 


X YKELKQLCRKKSNYL I KKWAKNPNRHF 
SEEIQMANRQMKRCS\TTPVIIREMLIK 
TTLR Y PAS P VKMAF I * DR* * *MF/ WQRC 
GEKGT/H/CWWECKSVQPL*RAVWRFLK 
KLKILLP 


555 


14456 


A 


561 


2 


375 


IPPPLQNGEKIFMII *TGAEKAFYKrPF 
PFPIK/TIiNKVGIKGNFPHIIRAL*EKP 
PAY I FHGEAE S FPVRSGTRbCP LLLLLF 
HP VLEVLVRAI CPLK/E / IKGTQI GKEE 


556 


14457 


A 


562 


391 


2 


ASGSKGKKKYPPFLRGSPPLPGNPLFFL 
EGGEGKFP* P/RNWGPPPKIFPQKGQGN 
PFFFFFLKGPKGGGFFPPQGEKGVGFPL 
PGEKSQRPF*GEKGGVPPP*RKKPPPFF 
FFFFFSETVLLLPKLECS 


557 


14458 


A 


563 


384 


3 


I SDSGVHPLGLPKR I LLCQLSYHVWP * P 
KCFC/PHLCLFMTLFQPP * GLPS VCAPS 
KQEEHGCFP I SVARQEC* PLFPKNLDT* 
HFVGNFLKFFFFFF*DGVSLGHPGWSAI 
LTHCNLCLPGSSNSLPQ 


558 


14459 


A 


564 


522 


120 


SLFLPTLE CNGT I S AHYNLHLLGS SDCP 
AS AS IiVAG I TGM CNL CL PS S I DPS TS AS 
♦VAIDLRK* * GARGNC PVS / VLE KGWPE 
AEMGRGESASWWDPRLGRPGVAKRI CRR 
HCCPVQQPQVSMEPSGPASPN 



551 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
cth 
od 


SEQ ID 

NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 

peptide 
sequence 


Predict- 
ed end 

location 
correspon 
ding to 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H = Histidine, I = Isoleucine, 
K=Lysine, I^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine V=Valine 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KDRHIDQWNRlESPQIKSMTIHLiWSPDF 
FLOG Y * * KDRH I DQWNRI ES P Q I \NL * L 
YIYGHLIFFYKGTE/ IQWRKKTLiSSKWF 
WDNWIFKCKQ 


560 


14461 


A 


566 


413 


1 


EKKKKVPSQVLTPKPQVLGNFPRFSEEQ 

LCFGKSPGSQQIWF/RLP/EDIVTTVOA 
S YS KKRLFLSLLDFQYVLQRREGA VNS A 
NLSLAPW 


561 


14462 


A 


567 


397 


1 


FLGQDLTVAWARVQWCSHGSLKPRLPGL, 
KGSSRSATG/SASPYLVPMFLSNLFWHV 
YIL*LYKTMTGI*MEM*FTPSCPMYSW 
RDV * I FFS FETEACPVAQAGLQWRSLGL 
PQPPPPGPKRFSCL.SLL.TSWD 










437 


1 


KFSEDPAKLIGEDGYSKQQVTjN AN/ETA 
LYWKNMLSSTFLARKETSMPSKliQGTG* 
LLLiGANAAGDFQLKSMIiTCHFKNPRALK 
NYATS PLP VUHKWNNKAWMTVHL.MTALL 
TEYFKPTI KT YT YHNTTGS LTT PHAS AH 


563 


14464 


A 


569 


234 


1 


FFPPFPFKASSPPQGTSSSRGVF/PPFF 
PPPKKGFFPKIPPGSSSPPPF*EKTYFR 
FPPFFLAPPGVFFR 


564 


14465 


A 


570 


2 


396 


FS * AFYHFGLALTPQIjGGDWS PAGITQ V 
KS LQVPWNT\SGLXiASGEE I t* adhnv 
IQNNRKRRIQALLITVLLGLYFTLLQAS 
ENFKI P FT! SDG I YGS TFFECAGLHGLH 
VIIGSALLTTCFIPQLTFDC 














HASGLPSSWDYRRPPPRPANFFVFLiVEM 
GFHHLWKAIIKSFACNEIQPI.SAVSVAG 
LVGCV* VCKCLFPVL*Q*LFQF* /S/HS 
I ANWMRE W P LRLS LF * L I C * GE RMSG FA 
TQSRRDP CS £i PG FL YQ VLS LAiCFR 


566 


14467 


A 


572 


100 


371 


YKSNDFYVYGILHFL»TTI\FFFFFFKKK 
KFPPVFQVGGQGGNLS*LNPLFLGLKGF 
SCLKLPRSWNNRGAPPPPPNFGFFSKNG 
VSPCNPG 


567 


14468 


A 


573 


371 




REGARES/THSSSHTPVQKGLRYGIILF 
ITSEVFFFAGFF*AFYHSSLAPTPQLGG 
HWPPTGITPLNPLEVPLLNTSVLLASGV 
SIT*AHHSLIEN 


568 


14469 


A 


574 


353 


103 


SLQLQPPGI-N*SSHIjSLLSSWIYRHKFP 
CPANVRFFVFFCRDG VLLVAQAGLRA/ S 

D 














FCIKKIIGFIPGK+GWFNISQPV*SLTN 
RIKEKIHISMTAEPIISIITEKVFDKIQ 
HPLILQKP FGQK* E YKFLNL I KG I CEKN 
/PIPN3 IYNGEILKQGCLLSPFLFNIVL 
EDIVIQSVKIKEGINIETQELR*SLTNR 
IKEKIHISMTAEPIISIITEKVFDKIQH 
PLILQKPFGQK 


570 


14471 


A 


576 


157 


255 


YISPTPPFPPDLPF*NYLPNPKLTSPAP 
NFQIY 


571 


14472 


A 


577 


365 


3 


PPHPSVFFF*KSPPPPPPLFFFPPPPFL 
FRGGPLYYSPP/SPPLFFSRRGRDTNPP 
PYSGLSVAGQTPPPPPRRTPPLVSPPPP 
PFFFF*RGGPKNKKAKKQTITFKSPPGT 



552 
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SEQ ID 
NO: of 
nucleotide 
sequence 


S£Q ID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginrne, S=Serinc, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NKPKHKNDRDI 


572' 


14473 


A 


578 


2 


323 


VGSWEPDEKVSHRELLLSFLFYPFFSHT 
S S * E TL I DFALTS TD I WAL WHDAENQTV 
VKYINFE/HVWVT* FCHPTSPQAISOE 
DERPGAVSHACNPNTLRGRGRQ I T 


573 


14474 


A 


579 


369 


1 


AHLKGTLGGFPDFPFINLE FRGGLTSCP 
NPLNNWE *R/RPPPPPGKI FGAFFFFFF 
FKVETGFHRVDLLVLIS*PCDLPASASQ 
SAGITGVSHRARPTPG*FKKKTMTF/CL 
♦KWWSHYVAQIG 


574 


14475 


A 


580 


394 


125 


AAEGEQGREGWRNRPRGRERASDRERER 
VR/ERGEERERGEERAPQ*SERWR/E/R 
SRERERVRELWSDSDRALEREALLR 


575 


14476 


A 


581 


416 


1 


KGQDLYGEV* KVLLKVIKQVLNKWKDMS 
CS\KINILISFPIRIALKFFF/ELDEl/ 
CSFV* * NKC* R I AKE VLKKKQ * CLGGHL 
/TSPHIRRSYNTTFSQSIWCCYRTKQID 
* WTRTQNPE KDLRI * GDL I YDLRCCSS S 
RA 


576 


14477 


A 


582 


3 


403 


NCFSQFNVIIMEIPAKFFIDINKLILKC 
FCKGRS I LKKKR * EDS / QRRNFLVIKTV 
WYWQRE CH I DQWDR I NNPE I NXiHKYS KL 
ILDKGAKAIR*TKES/l/YSK*C*NWKF 
CM*EK/ IDPKLNPLYK 


577 


14478 


A 


583 


402 


121 


QSLIHS KALTLFNS M KAEEAAEGKVEAS 
RGWFMRF * ERNHLHN I KVQVEATS AGVE 
AAASS POD *GKTTDE /GGYTTQQ I FNVD 
ETAFYWKTMPS 


578 


14479 


A 


584 


255 


517 


IYIFGFI FRGRVLL / CTHPS * SWLQS * 
LTAALNFWAQVS/LPSSWDYRCTPPCLA 
FFFFFCRKGGSTLCPRLVGNSNPLKFPP 
LGTPCF 


579 


14480 


A 


585 


1 


414 


SSHSCCSKA*SSMGPSPAFYKLVPLFLC 
LFIWDGVWLCHPGWSAWQSLLTAASTS 
Q\VHPSS\CLSLRNSW/EYRYVP 


580 


14481 


A 


586 


392 


43 


CWDYRCEPPCLA\TETGSSYISQAGLEL 
LASSNPPMVCHPKCWDYSHETMHSAQNF 
FLKGIS*F*LCCSHFIHNHLLWLGKVIH 
TYTHAHTGLEKYKTQCLDVKCI YSDLLD 
GAIK 


581 


14482 


A 


587 


1 


394 


GTR/YGINLFITSEVFFFAGIF*AFYHS 
SLAPTPQLGGHWPPTGITPIEPLEGPLL 
NTSPLLASGVX.IT* AHPSLIENNRNQGI 
QALLITI VLGLYFTLLQASEDFEDPFTI 
CDGI YG* TFFWTGLHGLHV 


582 


14483 


A 


588 


190 


3 


DGINLKIPGGIFFFLKARSHSVTEAGMR 
W/P/NYGSLQPRPPGLK*SSHLSLLSSW 
DHRHAPPSC 


583 


14484 


A 


589 


157 


387 


TKKRGRVGGWWIKILGINLTKEIKDLY 
SENYKTSLKEIKER/DTNKWKNIPCF+T 
ERVNTVKMS I LPKAI YRFNVI P 


584 


14485 


A 


590 


891 


0 


PPPPFFFFLPALIFFFPPPPQ/PKTPKK 
KKNPPPPPPP\PPPFSHIP*QFLSPPPQ 
HHHFIHHPPPQPPFHPFYYNPFKFFFF* 
NINVPPPP*K*IYFF*TKFFFFIPNIFF 
F\TKKKIFLSPPPKKFPPLLTNLSLPIL 
IKNLLSKPPPPQI 


585 


14486 


A 


591 


1 


496 


GTRAYQIVKPTP*PLTGALSALLMTSGL 



553 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W— Tryptophan, Y— Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PMRVHFHS ITLL I LRLLTNTL.TI YQRWR 
DVSRESTYQGHHTPPGQKGLPYGIMLFI 
TSKGFFFARFL*AFYH\SSLTPTPQLGG 
HWP PTGMTPLNTLKDPLLNT CVL1ASG / 
VSIT*AHHSLIENNRNRI IEALLMTIV 


586 


14487 


A 


592 


175 


2 


PFFFFFLLRHSLTLAQAGVQWCDLCSIiQ 
PPSPGFKRFSRLSLP/H*PGMVANFCIF 


587 


14488 


A 


593 


107 


733 


AAAAAS KVLM * REGQLPGAT \ GTGG VQA 
*APGSVA/AEGASVEGPGFGDTAPAHQG 
LSPTRSHGQGGAGRAS/SSQQGSPGGRG 
DGASEVWSGAL/SPGGGKDGASASVPRG 
PYAEAEKGGWALRGLGGVAAPGPPSRAG 
QAPSGS / YTGPNARPAPWP I PGQGGGLR 
RDQAG * VSS WTG STEPGAHTAHRAPGHG 
GKGGS PQQPHPQGPGQ I PT 


588 


14489 


A 


594 


10 


435 


FKWLLKSHAICFWTRS*SYCDNVCVPSL 
WAHHLGIRTEIPEFFLSKFLCTSIIPHF 
TYRRQLRLIQGSTE*EA*EDKLEQK*AL 
GAAQFTIjPGMDVFVCFVFCF/ CLFEMES 
HS VT * AR VQ W CDLGSLQPLP LG FKQ FS C 
LGL 


589 


14490 


A 


595 


437 


3 


DEPKKWKTIPCSWIERTNI/VL,KMAILP 
KAIYRINA/VPIKLPTSFFTEVGTFSQN 
*KTTILKFRWNQ\KRA*IAKATQSRKNK 
ASS I T/PDVASNYKTTVTKTARHWYNWR 
HVDQWNTIENTEIKLHTYSQ* ILSKAGT 
SKQWGKEHV 


590 


14491 


A 


596 


2 


498 


FFPFLGKTKPPTHLFFFFPPPFPIFFFK 
IFFGPLFKKKNPPK/PFFKRPPSFPKNF 
FFFSPPPPFNPLFSKAPPPIFFFFFKKI 
FFFPFPPLFKKSPPKTPL* ILGFFPSFP 
FFPPGFPFFSLF*GGVSLCHPSWNPWL 
SRPSSKLASAFRMPPVEG/SPFPPSP 


591 


14492 


A 


597 


1 


311 


RRVSSESRWRSLESRGRSLEIQGRMERP 

* VPVAQLGEPGAQLGDPGAHGEATVPEV 
QGE / P AALLPGTAK\ PGGEGS PGFIPPR 
HCRGTGEEGSPGFTQGRETSKAQQRTGC 
LLSCRGCLCLSVSGDS 


592 












FLRFFHHKKFTKALNRHFSKDILILLSS 
NHMKGCSMSLVIRD/MLLKIT/MKYQFI 
PTRMA/ 1 IKKTDRYWQKN/CEKI * TF IH 
CW * \ NE KMVQP S W KAVWQFLRK 














HTWPPPPRSPTAPPARTQPPSLQSAXjPA 
PQPGKKKKALRNEKG*NGSKKG/RGQPR 
PPPLRG PNGRTRS PAG I CKGGGGF CPG V 
SKAQGAPRPGGRETWPGFFGRRPPLPR 
GPGPPGVLWTSRHCPLR 














EGFFFFFPRWVGGQRALFWSPHFPPPGV 
KIFFPPPPPGWGGLRGLPPPPFNFFFFL 
KXKGFSPFCPVFSQLPPAGVPPPPPPP/ 
TGL£)LRG*PPGPPPFFFFFFFFF 


595 


14496 


A 


601 


393 


1 


VSPPKGCVSENKIPPPHNYYFAKTFVHV 
KL FG VAP L FLQ I L.F * G P FLTS I RGL I L \ 
WIPYS PGY\TLTLL IAPRHFLRTI IPVF 
RSVLPKTYLGLSGMPRRYSDYPDAYTT* 
NI GAFR IARES WGRKLDL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


596 


14497 


A 


602 


1 


155 ■ 


FCF VT * S G VQ WRNHDS LQPLS PG FK * FS 
/CSSWDCRHAPPHLASFCIFRSTRP 


597 


14498 


A 


603 


170 


455 


KPNASLKRVLILCIQLQSSRQQKNNPT/ 
VKKWAKDLNILFSTEDL*TVNRYIKKYL 
TSLVIKKYHFIPTRLSKMKKTEHSKCWE 
SCGEIETLTCSW 


598 


14499 


A 


604 


3 


449 


TLLS NLEEAKKKKEDALNETRES * TKL K 
ELPGVCNETMMALWEECKPCLKQTCMKT 
YARVCISGSGLVGRQLEEFLNQSSPFYF 
WMNGDRIDSLLENDRQQTHMLDVMQDHF 
SRAFS I IDELF\QDRFFTREPQDTYHYL 
PFSLPHRRP 


599 


14500 


A 


605 


169 


435 


KYENTKINRR/VAPN*ISLSPEKKKKKK 
KKKKKKKKKKKKKKKSASS S KAS PSSSR 
GG 


600 


14501 


A 


606 


453 


29 


TSPPPPPGENFF*KKPP*K/HFFPPPQF 
RFFFPPFPLKIFFFPPSLFFFWGVFPHF 
PPPPKKVFFPKSPRGFFFPPPLKKKFFF 
SPPLFFFPPPGFFLSPPPFFFFFFFFFF 
FFFSPFFFFFFFFTVKNFYLLCYFYKN 


601 


14502 


A 


607 


170 


408 


NEYDHFSIIKERRDFIVCVFFFIFSR*S 
FTWTQAGLSSL.QPPIjPGYKRFSCL.SFL 
CSWDY/RWHPQGSANFFVFLV5MG 


602 


14503 


A 


60S 


2 


448 


SLHPVIYSEGIKSRQSPCWR**KTVKKK 
KKKNPNQKPKKKKN/RPKNPLGGGKKFF 
*PEKKPGPGKKILKEKGKKTCPFPPQKK 
NFKILKKKKKGAPLKKNP*GAQNFPGMK 
KINFFP*REVKKNPIGIFKKKPLFWGGP 
IC3ANPPPENL 


603 


14504 


A 


609 


441 


28 


GGTFLKGIKSAPKI *KGFPGWGGARFPA 
RDPPQFGG/PKGGGSLSPRGLNPPGPPG 
♦TLFFFKKPKLTGGGGPPPLIPALRGVR 
P*NSLYPQRGGGAQIPPGPPPPGGKRAP 
CFQKKKKPRRKEKKK 


604 


14505 


A 


610 


445 


1 


LWLKKNTGGGGGSPPLFPPLGEPKRLVg 
FPRGRGFG PPPPP I KNPPFFL KNQNNPG 
GGAQPG I PGPWGG* GG EFPLPP\SPGFH 
* PRFGPFPPPRGTKREPPFQKKKKI ANC 
FLLSDKSLLLEEAWGQWGPS PLEPTVA 
PKPNPRGKAQ 


605 


14506 


A 


611 


253 


437 


KKKKVQDMFSENFKML/NEIKATLNKWK 
DISYS*VRRLCWKMAILPT/ IQSYRLN 
IIPIKI 


606 


14507 


A 


612 


186 


2 


KKKICVQDMFSENFKML /NEI KEWLNKWK 
DISYS *VRRLCWKMAIL\ P\ KQSYRLN 
IIPIKIQA 


607 


14508 


A 


613 


37 


453 


KTPPPGENFFLKKPQKKNFFPPGNWGVF 
FPPSPLKFFFFP * TFI FFGGVWPNFPPP 
KKKFFFKNSPGVFF/SPPP*KKKIFFSP 
PRYFWPP/HGFFFKGPPP 


608 


14509 


A 


614 


1234 


0 


KKPPNTPN * * WAKDLNRHFIKDSS I KDD 
QYY*SLGQCKLISQYNFTCTRIAIIKKT 
NNNKSW*GCGETGNLIH/WIECKMVQ 


609 


14510 


A 


615 


327 


188 


FFSSSLLIISIPKFNHVTSLLRNLK*AM 
LGSSHL*LQLLRRLRREDHLSPGVSGCS 
KL * SH \ THCTP VWVTE * DGRKGLAGDGS 
HSVTQTGVQ 


610 


14511 


A 


616 


103 


446 


KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla!anirie, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threon»ne, V=VaJine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DGGGALKKNPGGAQKNRGEKKKFFFF+G 
G\KKKTPGGFWKKNPFLGGGNFAPPPPK 
KKKPLEKKKNF*GGRGEKPSPLPCGKEK 
FSHKKK 


611 


14512 


A 


617 


3 


429 


GLIiSIIYKELLQINKISNLVGKWTKDK/ 
MKQFLKKEIHRLAK\YMKRYSTSLVTIE 
M * LKTRY YFHPLNKI KHDNN I HC * *GYK 
EVGILHILLEA* IGKPFWKAITVTVLNA 
PVFCL * FCFEMEFHSCCPSLKCRAPTS F 
I GAS 


612 


14513 


A 


618 


427 


6 


WGGPPPPPPIFFFFPPPPXPYFPFFFLQ 
ALFS*RVFFFLTPPPPKNFPPP*GPPPP 
PPPPPPFCV\FFFPPPPPPPPPPFFFFP 
PPPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFLVENCVF INS I TSDIEQK I S I 
SC 


613 


14514 


A 


619 


68 


434 


YSKD* PINTKKXKKKK^KKKKRGGAFKK- 
KPGGGKNKGGKKKKNFF^GGGKKNPRG 
NFGKKTLFGGGKKGEKPPQKKKSLEGKK 
KI*RGKGGKKT*NP/ CGGKKFSFGGFF* 
KKFPPGGGGKY 


614 


14515 


A 


620 


454 


90 


NFPTPEKFGPPQGNL,*KAPPFFFF*KGI 
PFFFPG*KQRGGFKSPQKPPPQGKTIFR 
PHPPKKGGPQGPPPPGGKIFFFLEF\KK 
KGGFPG* PRRFFFPGPGKPPPRPPKKAG 
IQGETPRPGP 


615 


14516 


A 


621 


432 




PEAFLSSL.LHPAPGEKFFLKKTPEEKFL 
TAKKYRVFLPPFPLKIFFFP/liRALIFL 
GRFAQ I FPPQKKGFFPKI PRGVFFCPP * 
KKKNFFFLSG* IFAPPGIFFKGAPPFFF 
FFFFFLDRWJLCYPGWSAVARSRTS 


616 


14517 


A 


622 


422 


2 


GGGPPLPPPRGGLPPPK**KA/VGGGKK 
P PPPAPL * NP P P PKKKIGGGGGGKNPP P 
PPLACFFFGFFLPPPPPSFWGGEKFFFF 
FLSGGPPP PQKKKKKKKKKKKKKKKKKK 
KNKKKNQTKKKKKKKRAAARDPRVRPRV 


617 


14518 


A 


623 


176 


401 


KFSMFI IIjWKAYYTL.CVCVCVCVCVCVC 
VCVIMYLFISSQ/RRLCFLGEGKICSIS 
LTVLRRGSRVCV* SQDLPVC 


618 


14519 


A 


624 


1 


355 


HS SGLDNI.TALAHSHLCCGV/ CLCFLVL 
VCVCV C VCVCV CL.CVC VCL / CLYPGPQV 
PK*SKNSSTSCLLFHFTSARTICP/CC/ 
CVYFCFCFFA/CYQTCL*ILTSCb/AIC 
TASGVCL * ANKYSM 


619 


14520 




625 


2 


400 


HTRLI FCRDGVLLCCPGWLQTPELKSLL 
FSFGFLSLFFFFLERDPGSHGP\VGGGG 
PNLG* WKFWPLG * KQFS FLTLWRRGNT \ 
RG P P PPP PGLVFWF * KKTG FCFVAQ AGL 
EQRPFGDQPVWASQGAGITGVP 














VFFRLPXjTQGPGIjFACFCPPPPCRFFFS 
FFPGSNSPFFP/L*KLNFWAFEPPPFFF 
FFFFFS ILFKDRVLLCHPGWNAW* S * * 
P * TPGLN * S SHLTL.P SGRV 


621 


14522 


A 


627 


1 


391 


NPPPLGGAKEGGPPRARGSNP PYPYWET 
PFFF*NPKIFPGGGGHPVIPSFPGG*GR 
KFPLPREGGVPL/RPNFPP 


622 


14523 


A 


628 


191 


375 


LILIFKTI FNFLILNFNF*SGGATQAGG 
QGRNLG*LQPPPPWLKRFSCLNLLSSW\ 
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sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=!VIethionine, 
N=Asparagine, P=Proline, 
Q=Gluramine, R=Argin»ne, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrpsine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DYRGWP 


623 


14524 


A 


629 


3 


382 


HHCQ I RL F I LL I TASE VLFC PVLFFFET 
GPCSVTEAGVQ*CN/HLQPLSPG/PPTS 
AA* VAWTADVHHHAWLFVCL.LI *DGSWS 
VTQAGVQ*RDPSSVQPPTPGLK*ASC\P 
AFQKCLDYR 


624 


14525 


A 


630 


396 


2 


CRIENVLKKKKLEMSNLSEEGMLKANIG 
QKLSLLAPQQWNAKESLLKEIRSALPL 
NTQMIRMQNSLSADMEDIL/WWIR/DP 
TNFNI PL>SQSIIQSNA/L/TLFNFMKP* 
RGEEAA*EKFDAISG*FTTFKERS 


625 


14526 


A 


631 


47 


392 


LHSFFFLFLGKKIFTPGGGGGGQKPFF* 
NPPPPG*RNFFFLPP*KTWFMGPAPPPR 
KIFGFIiKKKGFPLGGNL/ S IGTPPPRGT 
PPPLPPRGGCYGGGPPPPPFIFFFRTFK 
NFSN 


626 


14527 


A 


632 


39 


382 


LFFSPFFFKNFFFFFGRFFFWGGVAPIF 
PPPKKI FFSQFP/LQVFFFSPP*KKKFF 
FFPP* IFAPPKTFFSIPPPPFFFFFFFF 
FFFFFF 


627 


14528 


A 


633 


2 


213 


LDKPGKHSKISSLQKI/HNGVCLQS*LL. 
GWLRQEDCLiSEGS*GCSEL* FYHCTPAW 
ATK*DPVSKKKKGFLK 


628 


14529 


A 


634 


2 


206 


QENGMNPGGRACS* PRLRYCTSSWATER 
DSVSKKNE/TNKTTL.LREI *HFVGGPNG 
KKGLLKTVKGGLT 


629 


14530 


A 


635 


205 


2 


KRSLGLIAQI * /VQWGDFKTLQPLPPGV 
KQISRLNLIjKKWDY*RGPSGLGKFW/IF 
L* KQGFPQFFRWFN 


630 


14531 


A 


636 


3 


399 


QVQQTPASCPLDSDLSEDEDLQLAMASM 
TFPKKKKKKKKKKKKKRGGAP * KKTRGG 
PKK/IRGKKK 


631 


14532 


A 


637 


122 


373 


VSNILWTQSLLFFFFFLKGSSLFVPQPG 
GQGLDLG*TKFPPRGLKEFSCLTLRISG 
NYGLAPP / HPG* FCFFI KKGVFPCCPGW 
F 


632 


14533 


A 


638 


390 


3 


LLVLFLPQDGWS PFAFPHEQKLPEVS PK 
ANAAMI.PVQPAKP* AHPTFFFY* FPSFR 
FFFARMG * S S \YFVPNLTKLSE I LKGPI 
NDMREVFFP I FRLL.ILFFFGDRVL-LCGP 
GWSSWQ*L*LIAASA 


633 


14534 


A 


639 


392 


42 


PSGPGKPGGKTFLRKPPFQEACKRQGFY 
PLFPPKPLKNPKAPQNRKNLGPICPPPK 
P / QGPLRI S PQNSQSGS L*GPNKGNCFP 
A*TWGPPGAP*RGPPKGPPFFFFEKVME 
AMGLA 


634 


14535 


A 


640 


390 


1 


LSFSEEGMSTAEAGQKLGPLCQ/TSQVI 
NAKETFLKEIKSVIPVNIQMIRK*NSFI 
ADKEKI LW* I *DQTSPN/ IPL/SQCLI 
QSNVLTLFSSKKAERGEEAAEEKFINFS 
♦RLRKEAASADREAAESYPEDL 


635 


14536 


A 


641 


368 


3 


KKGGYGVPPPFSPPPGFLGEGSFLPPKI 
KVQKTL/SWPPPPPPGGKKKTPFFKKKK 
RKNFLTGFFFFP * KKGPFFFFF * CP W* 
KFWGGGFKPIFFFFFF/RDRLSLCCPGW 
SAW*SQFTPRV 


636 


14537 


A 


642 I 134 


413 


QKDQRNRIESLE/IR/PHTHNTLIFDKG 
VTG FQYGKDSL.FNKWC WDN * RE / 1 TQNL 
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M 
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peptide 
sequence 


Predict- 
ed end 

location 

ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L?=Leucine, M=Methionine, 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 














SLDPAHTTTNVNL.KWTRCLKARP/VKTL 
VENMGENL.RD 


637 


14538 


A 


643 


2 


436 


GRVESINLPLGCRLFKKRDNIKCGOAG* 
RAGGSL I HCW * EC/ E WQ PLWKR I QQ FL 
I KIjNI HLPRDSTI IiIjLjD IDLDGHPKTS A 
LFIITHNCCFIHNHP*LEQPKCPSTDVW 
MHKP WP IHSMEHYSAI KSAAVWDNFKG I 
LL 














LDLLTS *SACLGLPKCWDYRSEPPRPAF 
NF*/STTK*SLWSSHKQDYICRFLLSYT 
ATQKFSTLTTY*SHLKTF*NAQMPRLYL 
QSLILGSIF+KSNLPR+FQCAAKAENH* 
*SDLYCQIMLFSFM 














KGVPRKCKNSYVGPPSGPPIK* ILFF* I 
F*DRISLCSPGWNARVFS*LPV\T*TTG 
VKKWWPLSLLNI*GYKGVPPRPGIFFFF 
FFFRDR / S LTML P RLVTWVQ V I FQ 


640 


14541 


A 


646 


3 


348 


QFSSFFVLIYFTLPFMVSFCF/WCFSNL 
VLFCEI IFFLIFY\YYSLVLSFFYCLLV 
LFLSKLSFF/C*YLNITVFICSMTSFF+ 
RIL/CCLNFIKTSFNFLL** FGMVLiESC 
FFFFFFWP 


641 


14542 


A 


647 


2 


553 


AI LI PDKI DLKK \ VTGDKE * NVTM I KGS 
IHQEDVTTINIYASNSR/APKYMKQKLK 
GKEKQ ISP* K/WG I FNTP FS I / DRAPR 
QKINQEIEDLNNI IRQIDLKDI YRTFHP 
TSEYIFFSSAYET/FSKIGHKLKHKTSL 
NKYK* TE I MQSMPSD/HHGMKLENNE /N 
VGKLTNMWKSDTLVSDWLGAVAHA 


642 


14543 


A 


648 


388 


1 


GPPR*SPPFGKPRGAVPQ/GGGGLKPPG 
PQGENPFFFKKPKITIjGPGWGP/RNPPP 
LGG* KGKI P * /PPGGGGSNKPNFS PSGP 
PGKQKQNFFSQKKKKKKEKKKEKPES * C 
PGTS PQPVTDGSQAINAS IS S 


643 


14544 


A 


649 


387 


25 


PGMRGGFPPFPLKNFFFPPGP* /MSGGG 
GGPNGPPPKKGFFPKNPPGVFFSPPKKK 
KI FFS PPPENLGP PRDFLKGP PLFFFFL 
QKNPGVFFRAGQGNPPQKIAKIFPLPPG 
GGGGAVRHI 


644 


14545 


A 


650 


91S 


2 


LGLKGLTIYKILHSTIADHTFFSSSHGT 
FAMTDHILGHKIHIjSTFSKE*EIIPSTS 
FQ/HHSRNLN* K* INNKVNWKI PK\ FWR 
LiNKTLLNNT* TKE /GLKRHKNILS * TKN 
I T YQ I LWD AG KAVLRGK F I VLT T * IRKE 
ERSK\TTSFNTRKRNKKLIINIRTEINE 
IENRKSEKILN\TKSNFFEKKKSIKSIK 
TLARLNKRKRKETQ I PKHQK* RRGITTG 
NMANKKIKRNIHEQPYTHKLGNLDNIGQ 
FLEKRYLPKLKQGEI*SGWAYIN/SKEW 
ESI INTLPKRKAQDL KMFSQS E * YQTFK 
EKK 


645 


14546 


A 


651 


282 


265 


GVFFLKKRPTVF* KKKI * LGPHPQLKGP 
PGVFQ/HFPI *NFGISGRGDLFFFFFFF 
FFFFFFLFLRQGQVSVAQTGAQWHNQGS 
LQPPGLK*FS 


646 


14547 


A 


652 


6 


223 


LYAHKFNNLiDEMDQFLQRHNTPKLIQEE 
/HRI/TVIK*IKSIVINLPKQKA*GPDGF 
SGEMYQLLKEEIIYNVF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 


M 
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NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alaninc 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
C=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, JL=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


647 


14548 


A 


653 


128 


337 


FXFFFXXFFFXFFXXXFFFXXFFXFXFF 
FXXXFFFFFFFXXFFFFFFFFFXFFXLI 
FFXFFLFFFFFLFF 


648 


14549 


A 


654 


238 


2 


MAAASTTLLALVI ISTTF*LPQVNGYIE 
KSTPEECGFDP\SPARVPFSIKFLLVGS 
P I QTHALD IALFLSHTRAL PHTHT 


649 


14550 


A 


655 


325 


3 


CSEHKKNVF*LRAKKTNNPNLKWAKDL,N 
RHFSRENIQVAEKHI*ICSASLGLREMQ 
IQTTKC/WHPVWKTV/WTILQN/LHTRLi 
P YD /PAI IiLLG I YQRELKTCSHKNMYTS 
V 


650 


14551 


A 


656 


3 


1655 


FFLGMESPSVAQAGVQRCDLCSLQALPP 
EFK * FS WLTLL S WD YRRL P P CLANF VFL 
VETGF\TMLSRMVLIS*PCDPPTSASQS 
GGITGVSHRVQPAL.LFFLSCLCVKMIVH 
APLVKTCRPSTVGSCL/SVPAFWE 


651 


14552 


A 


657 


519 


84 


SRVRCQSA\QMGGASHLGYSGVRDPLEE 
AVFPFSDLKLHAGITTTLFKAVRQGHLS 
GFAKDF* RPAEQ /GT* YTS KLA/STKQN 
ILHEE* ITFVLLLPKHEGIWMPSVPPKP 
FLFFS*GMTNSGSAGILDFVPNHPSKDG 
*HYVL.ITSH 


652 


14553 


A 


658 


38 


383 


EKC I FFFFFLERGFTFGGPPGGRAQFWF 
NKTPPP/HKKREFPALPPPGGGNNGLGP 
PARENLVF* KKRGFPLGGKRGLNPRPQG 
NPPP * PPKGG/GNNGGGPPPRQKK 


653 


14554 


A 


659 


98 


394 


KPPGKKRETPSQ/NKKKKKKKKKKKPFF 
PPRGGKKNPFKKKRGG*KNPPPRGGGKG 
NFPPVMG*RGKNLPPPRKP1.FPPPQGGG 
KN * KK / RA* KNS PQNFF 


654 


14555 


A 


660 


214 


567 


GFSVINIGLKMMMMIIIILRDSL.TVTQI 
GVQWHNLSSLQPPPPGFK* FSCLHLLSS 
RDY*HAPPCPVNFCILS/ERQFHHVGQA 
GL KLS AS SD P PTLAPQRAG I T VP SHQ 


655 


14556 


A 


661 


463 


392 


S*HFHPSSSSSSSSSSVINYY*/FGL,LL 
C/QVAHFYNS I DQQMIQS QR PMMLQS AL 
AFEQ I I KVNGLL I LL 


656 


14557 


A 


662 


205 


242 


GTGWQGGGLDG * QMTLGW PGS GVGGS AA 
P*G I /GPGLPGLPAPPAPS P 


657 


14558 


A 


663 


63 


351 


QSETPSQFKKKTQ I ELLYE PAVSLLG I Y 
QKERKS VLKKHYTPMFTAALFTI \ VWDQ 
PR/CPKYMKR*NVMY/HVYTHTHTMEYY 
SATE KNE I LF\ FATTW 


658 


14559 


A 


664 


1 


358 


FFLRQGL / DSAAQAGVQWCNHSSQRAP P 
LGLGRS SH \ PAS LAG WS AVA* S RLAATS 
TFRVQVILLSQPPE *LGLQSCS ITQVGV 
QQWRDLGSLQPPLPGFKRLSCLSLPSN* 
DYKRLAPPP 


659 


14560 


A 


665 


10 


257 


GMNERGNI TKE I KKI I WEYYE * LGTSQL 
DTL/ELNKFPERYKL/ PRTNSQS IENMN 
RTITSDYISNFKRLPKKSPGLDGFTGEF 
I 


660 


14561 


A 


666 


360 


2 


ENKKIFNSAPPPGSPFLFPPLWEIRPEG 
FLKPRVLNPPWPPN*APPFKPPMGPPFF 
PLFRGVRVGDPLSPPGLRLP* PGLPP\G 
PPG* PWGQNPTFFS KXKKKS WLDS SGFL 
EPNISVNF 


661 


14562 


A 


667 


328 


1 


NLFAENYECPEKKKKNFQKKKIHQ I WQN 



559 
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nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
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09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methion»ne, 
N=Asparagine, P=Proline, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SEFMKVTEYKLNYQN/SQLYLYMPEKNT 
* K V/ 1 LEKHP L * PQI TKYFGE I LKKDVL 
NLYTDKYKISLREVKGEQNKWNNVSS 


662 


14563 


A 


668 


405 


566 


IHCQWECKTEQILWKTWfWFFTKLNILL 
PGDSA I MLLG I YPKELKMY I HTETCT 














T T MT 1TCT IHTOPPPPQglTliWZUfT * TV J r-a 

GAHANFLPSFFPSPPFFSLSFFFFYNRV 
FLCCPGWSAWRSQLT 














KASRRKE IIKI*AEI* K\VQRSNRENK* 
KKWFFEKINKIDKPIARLTKKWRT*ITI 
VRKETGTITKDPADTKRIMKEY/YKLLY 
MHSF*NLjSRKLKRYREVIEKINERSGSIj 
KRSIKLTNL 


665 


14566 


A 


671 


200 


3 


S LCHIj P J WAS KATLE T \g l vehm w * d f f 
CFFETESHCIAQSGVQWCNLGSLQPLPP 
RFKLFSCLSL 


666 


14567 


A 


672 


94 


374 


PKFRPQETTEQTSQFLQQ INCKGRKRQG 
KRT YRLGE I QKQS Q P MAT F E LYLD / H * F 
* LAKETL* LGAVAHTCNPSTLGGQAGW I 
TRSGVQDQPGQ 


667 












WKMGQLPGRPQLPQIjSQEKILSLiNSPNV 
FKKKKLK* * *NILPKKKRPDCFTHKFYQ 
TFKEEIXPF/L/HKLT*EFEKKEILLKS 
F\YEGSW*AAKLNADINKKTTH 














AKLGHNLVY\ KANLNKFKWLQVI SNMFL 
DRDRIALKINNKK/ PSSPLKYLQTHLLL 
NDPRI KEGSKREI I KDFALNDNATY*NL 
WCL*NGTLRKVYTTKCLFRKEGPQM 














PERFNGPAFFKIAHGLTSSLVFCLANSN 
YERTHSRI IILSQGLQTGFAIKTFRKLL 
ASLANLALPPTINLLGELS I 


670 


14571 


A 


676 


346 


1 


KMVKIiIj*NIVWKLLiK\GLTEVPHGSAIF 
pffipifp kemetnvh i kt ctwafp aaf 

FPMAKKKKQSGNNP ttde * i qkk \ w * th 
IIDYYSAIKRNELLTHATSCNITLSKRS 


671 


14572 


A 


677 


357 


6 


AAGDSELKPMFIYCFQKPMPL+IMLKST 
LLVLYKWNHKAWMTACLFTAWFTE/HCK 
PKLETYFSEKRFLSNIT*LMKASGHPRA 
LTEMYKEINWFKPSNTTCTLHPMDQEV 
ISTFNSY 


672 


14573 


A 


678 


401 


155 


YYS VAQAWQWRDLSS LQAP P PG FE * FS 
C\PASWDYRHKLLTSGDLPASASQSAGI 
TGVSHCAQPLSLFVFFLTNQRFVAALS 














Q*KISFEMLLIITNVPGHPRTPMEMYKE 
LNFF\MPANTSIP* PMDQGIVLTFKSYY 
A/RNVFCKAIAV /DSDSFDGFGQSKLET 
FWKGFTI*DVIKNIDDL/WRGVKIPILT 
GVWEKLIPTLINNFEVFKASLEEI 


674 


14575 


A 


680 


275 


3 


WPRQASIoAXiNT *S IHI YIHTDMRTHAHT 
YI FGMKS VTQAGVQWHNHGS 1>EP*S PGL 
RRSSHPSLPTSGEHRRTPPR/IADFLHC 
L*R*GFTML. 


675 


14576 A 


681 


330 


507 


GSV* PVNLIRNCQPFVQSGCA/ SLHSHQ 
E VWYMCSPHPRHYLVASVIFILAILFFF 



560 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnninc 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Lcucine, M=Methiomne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LRRSL 


676 


14577 


A 


682 


390 


43 


HHLPIRYICLSISSILCLSFIMYLS/lY 
LS I YLS I YLS I YLCQRCQI PLVFLCLLL 
S *LS * 1 1 LILNFS CFFVGRT I VLS 


677 


14578 


A 


683 


396 


63 


ADPTEIQRIISDYYDQRHTNKLiENL* KM 
DKFLKTYNLPRLN* EEVETLIRFIANHE 
I ALI I KS LP I KRS PVLGGFS VGF*HIt>K 
EELTPVLGLFQK\IKKEEILANSFYESR 


678 


14579 


A 


684 


1 


396 


EETLPLFADDMILYIIKKKQKTKKTTKE 
STKR Y * N * * M \NVFGKVAGYKVNTQKS I 
N*QYTI*KVKLAS\FSSSPQKNKIGINL 
TKEIQNVYSENYKTLKEIKDLNK+ESIP 
CSQIRR FNI VKMT VLLKLI YR 


679 


14580 


A 


685 


283 


3 


NCVESSNTHLWEPSQKPLSSVSLTDVCP 
PMLS AALFAI A\ RS YTLPTRSS IDE* IK 
KMWYIHTMEYYSAFKKKRIMSFLMPRME 
L.KIFRVNKIR 


680 


14581 


A 


686 


53 


324 


HCFCDR I RAAFC FYILDCPFFISITHKL 
1*1 FFFFFLERES I FVPG VGGGGPNFGS 
LNPLPPRLRRFSCL.TLPRGGDYGLGPPC 
PT/NFCVF 


681 


14582 


A 


687 


39 


208 


NIFFCREGFAMLARLVSNS *NWPGKVTH 
IHNP/STLGGLGGRITQTQEFETSLVDA 
GA 


682 


14583 


A 


688 


47 


341 


SSGRVFVL.FCFFETGSHS / VLFRLAYSG 
GIS AYCSCHLLS SGDSP I SATR/GTTGM 
CCSAQLGFCVCVCVCF/ I FVF* VKIGFC 
HVAQAGLELLDSSNPPTS 


683 


14584 


A 


689 


229 


1 


GRVDGRQ I ACQE FKARLAS QSAGI TGVS 
HRAQPFFCFC/LLFWFEMGSHSVTQAG 
VQWCAPGSLQPLPLRFK*FSC 


684 


14585 


A 


690 


339 


1 


KKGPPGGPTKPEGGRFPRGPPGGGKEVF 
PPPPPGGKGEKKRGGAPPPFFFLKKKRK 
SGKGG\N* FFPPGGGKKGGQKKKKRVFF 
PKKKKKKKKKALSIiSLSLSLSLSIjSIjIiG 


685 


14586 


A 


691 


372 


0 


YDYLYIKKL*N/LREKIDKFLDTYNLSR 
LNQEEIENLNTPITSNKIETVIKSPPT\ 
KQKSPGPE 


686 


14587 


A 


692 


117 


297 


APLLESSSASYLHFSN*DLQMAH/KHTK 
RCSTSLAFREM* IKTTRRHHFTPTKMAT 
SKRHT 


687 


14588 


A 


693 


122 


322 


EQTNSWSIDFFFQRCHSNAMGEKNCLFN 
KW * WDNYVAI \AKKMKLSLFKP YI KTNS 
K/WIKDLNIRAKT 


688 


14589 


A 


694 


21 


342 


RSHFRHIjNEYYATQQGNSDVHPIjLiIHSN 
ISTAIIWQLQRQERRKEKRLKEVKNEKA 
L.KE I FFLRGWVLLCHLGWS PWGS * FI V 
TLN\ FGLRQSSHLSLPSSWDYRPV 


689 


14590 


A 


695 


2 


343 


KFVPLHSGLGNRSQKQTLSQRKKKKGGV 
FVAWAKVQWHNHG*QGP*TGLRQSSCLS 
Ii*GGWVKIj/PGGRHHAWGFHHVKKVGLE 
LLTSDDAPSSASQKAGITGMSHTAPGPK 
NLV 


690 


14591 


A 


696 


336 


3 


IPPVKSPKFPILLKKKKFSLFCFSGAPF " 
PCYS PPRAQKKG W * KI FS PPPK\ QKKPG 
ALFFS P KPAFFP PP FKKKKKKKKAKEGH 
YLMIKGLIQQEKIILNIYASNIGARRRG 


691 


14592 


A 


697 | 374 | 2 


AFY* KKMVSRTFI AREKSMPRFKASKDR 
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sequence 


SEQ ID 
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peptide 
sequence 


M 
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09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Gluramine, R=Arginine, S=Serine, 

W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stt>p codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LTFLjLjGAGNFKIjKPVIjVYHSENTRAIjKN 

yakstlsvlyqrttkawmtshlftewft 
effepXtvetycsekkipskillltdya 

PRHPRTliMEMYSC 


692 


14593 


A 


698 


361 




PQMCPPPPR*QGIIFL*L*THCFFCTVK " 
ARLFLFPPF/S*RKGGFFFFE*RKFFKY 
FGWELFQIRFKVFFFCDRVSLCCSGWSA 
GART * LTAAST S PAQAS LLPLTLP S SWD 
YSHVLPRLV 


693 


14594 


A 


699 


75 


362 


KDCAYG I CS KKQKG FCLTQTGAWQQYEE 
I*GLET/GFSRSVHSMGQRRYWDYRHAL 
THPANF VFliVETRFLHVRQAG IEL.PP/S 
SSQSGGITGVSQ 


694 


14595 


A 


700 


2 


277 


FFLYTSNEQSEKKI /RETNWFI ITSK* * 
NRNVNKEI *DLYNENSKTFLKEIKEDLN 
KWKGILSS* TEKLN I IK/ PTAI * * FNAI 
LIK/IPMVFDK 














GTRKHFSKEDRQYHKSLRKQ I KT/ TVSY 
CFTPTKLTTVGKTANSK/CWAGCKVIGT 
LIHCQ*ECEMVWLLWT/SVQQFL/RNVK 
LYETAI S LLGICTRE I KTPLHI 1 1\ 1 1 A 

A 


696 


14597 


A 


702 


209 


3 


SLNRRLSFFFY+RQGLALLI/EPGWGAV 
ARSWL.TPASN\ FELQRSSCLSLRSSWDY 
RRVPLHRVSLRFSC 


697 


14598 


A 


703 


308 


3 


TTGLKRFARVGP PKCWDYRCE P PGLATS 
SVLKAFQPIGSVPPRLSRILSFM*NQ\* 
YRMLI TSTKYFPRMVS I SRPCDLPALAS 


698 


14599 


A 


704 


603 


2 


PLPP\GLQVESP/CVSLPSSWDYRHAPP 
RPANFS VF* +RRGFTMIAKM/VLIS *PC 
DPLiASASQSAGISGVSHHAWPKQT*LiLD 
TDKTEGIFLTNHLIiRIPPPMFYQYILIK 

IS*AYIKFTTFFFF\ETDSRSATQAGVQ 
WHGLSSLQPPPPGFK*FSCLSLLSSWDY 
RSMPPRPA 


699 


14600 


A 


705 


3 


366 


ARVTYYSGKKEPFGYLGMA*AMISIGFL 
GVI V*AHH I FTVGIDADTRAYFTSATII 
MAIPTGVKIFS*LATLHGSNMK*SAAVL 
* ALRFI FLFTVGGLTG I CLSNS SLDIVL 
H/DTYYVGAHF 


700 




A 






3 


GFHHVGQAGLKLiLTS * S ARLRLPKC WDY 
KRE PPHLAPDRKYFS PLVFARDNFVPQG 
P* \HVWINICLSQVKAATGIYPGEAGAL 
RNRLHPH*RIFWPQRPRVPIKSPWTLRV 
ISTLCSC 


701 


14602 


A 


707 


358 


67 


QLSSTIWY* *KN/RIDQWNKIAQNTSVH 
RKLISDKVTKPIQWSKDSLFNKECWNN* 
TSICKK/LS LDSALTLFTXTNS K * I TEP 
GSVAHTCNPTTLGG 


702 


14603 


A 


708 


34 


363 


RTVFFFFFFFFFFFPKKTPFFFPGPRGG 
* IPPPEKKIFPPVFRGKFKKPPHLDPGG 
GNFPRKPPFFFLGP\SFFFLGKTLGGGG 
WIPFPPPWPNLFFKKIFWGAGGPSPF 


703 


14604 


A 


709 


1 


380 


STCKRMTLDSYFTPYTKINSKWNKDLTV 
RGRTIKLIEENIGVNLHDLGFGSGFSDM j 



562 



JSDOCID: <WO 0164835A2J_= 
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PCT/USO 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etli 
od 


SEQID 
NO: in 
USSN 
09/515,1 

26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














TPKAQAKRKIN/ WDF I KYLKFCA* EDS I 
ERV/N/RRSTEWEKIFANKIQ*GN*NIQ 
RILKLSKIi*ARLKWAEDL 


704 


14605 


A 


710 


382 


30 


PGTTGVHHHTRLTFVF/CGRAKISICWL 
VWNS*AQSACLSLPKCWD*EPQHPARFV 
SFNTRSGIPIISTNPSLSLSAFPSQGFV 
FLFPHFPASHPETAXiAEFFLGTFSLSPP 
GPFTPAS 


705 


14606 


A 


711 


1 


377 


LLVICGMQMKITVRCN*TXiNKIAIIEKK 
TAKCC*HYGGLKTIiIHQWCKSNMVQPLC 
KTV*QF*KKLNIQLLYLLK*KKAYIHPS 
NCTQMFIA/ ALFSVLAPNWKQSAY 


706 


14607 


A 


712 


1 


390 


LKYKIMLGMAAWACNPNYSGGPGNITA+ 
TW*AYMALTRLEPGRGDHTAALQPGRQS 
TTPF* KKKAVPFRA/RPVKMRERSI * KP 
FSPE\RTYSCAQEGPGRTFGSAQDLEAA 
GGRGHHRMG A VWQ EPHRLLG 


707 


14608 


A 


713 


2 


393 


5CVRRQI I WVDTATDS IGI KW 1 1 WKHYV* 
LYANKFNYLAEMDTYVERYKLP \ KE I DV 
MNSLVPAKEIIWVKILLTKTPDPHIFT 
NEVYQTFKEYY/PVLYKX.FQKIEDERTF 
LSSFCEAG INL I S I CKRFFF 


708 


14609 


A 


714 


219 


1 


PRPGAVAHACNPSTLGGQGGRI * TQSS1 
IKACCLIHDLiAIQTQKQMT\WPGAVAHA 
CNPSTLGGQGGRITRSGARDQPSQHSKT 
PSLLKIQKKLAS WAGACNPGYFENY IQ 
KLLRGPGT 


709 


14610 


A 


715 


381 


126 


I IAGGGGVCT*SQLFKRLRQEDHL,SPGV 
RGYKEL* SYPCTIiAWVTE*DPVF* kkny 
/ TLNTSP KETNKK* GVRCI KTGKVLI FV 
AAG 


710 


14611 


A 


716 


417 


1 


CYCCWGATNKILLLTDNASGRQIVLMDM 
YKKNNWFVPVNTRS FVQLMDQG I ILNF 
KSYLRN/TF/HKCIAVINSNSS*ESGQS 
KLKALWKGFTVLYAIQNI CDSGQVKMPT 
LTEV/WKEVILPLMGDFVRNRTSEKQNY 
KLS 


711 


14612 


A 


717 


406 


122 


MFSRDAVSLCCLGWS*TPELKLSSSLSL 
PKCWDYRGEPPCSAAKDF*IIRKR/HSC 
RNIIRNSDAWQR*TTL.VSYDRNPP*FPS 
PPPNSPLCPAA 


712 


14613 


A 


718 


535 


1 


HNIjNKI V*NL.HVENYKMLVKDKKT* ISC 
LWVRR\LVKIPVLPKLFYSVIVFPIRSL 
VR YFIE VCTFTLKC I PKEKDLKQ PNRF * 
KKKKKVHLPFIKVYY\KLQ*YIATVIKT 
VWY/W/ YSKIDKAKWYKTEKAEV* PHEY 
DQLILAEVQINH\NLFNKCY*SN*TTIG 
KNMNLNLSFTSYT 


713 


14614 


A 


719 


255 


1 


GKLQPNKGliISLIY\NL*KLVRKSNPKE 
KWAKGMNR * LTEKG IQI AFRHMKRCS PR 
FIMKAMHIITTWSYHFSSKRKTRIQRLT 


714 


14615 


A 


720 


381 


0 


KFIAWEYI IKEIDLPINNLTLYLKElV 
EREELTLCKASRAEVI \KIRAKI IRELT 
PIRKTDNQSWFFLKIHKIGQ/PFKIjD*L 
Q KPCKR VRE KVP I TKLR YENRVT TTNFTE 
/IKRIIREYYDH 


715 


14616 


A 


721 


3 


381 


iyrewrdltrdgtyqghhappgekg/vc 
rygii:lfitsegfffagff*pfnhcsi j a 



563 



SDOCID: <WO 0164835A2_L5 
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PCT/USO 1/0492 7 



SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
LISSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F=Phenylalanine, 
G = Glycine, H=Histidine, I~lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

T=Threonine,'v=Valfne , , neS=Ser " ,e ' 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PTPQLGGPWPPTGITPLNPLKVPLLDTS 
VLLASGES IT* AHHS \L I ENNRNQL I QA 
LLIT I LLGLYFTLLQA 


716 


14617 


A 


722 


56 


455 


KTI INIKPPAIIQYQTPLFV*SVLITAE 
\ LVLLSLPGLAVGITI LLTDRNLNTTFL 
DPAGGGDP ILYQHLF * PFGHPEVYI LI L 
PGFGIISHIVAYYSGKKEPFGYIGMV*A 
M I S IGFLGFIG * AHH I FTGGI A 












3 


HQLPVFWQYNKKAWTTRPLiFVDWIjH * CL 
VFEVSKYPASKGLPFKWliMLDNAPD /H 
PTQNSMSS/DSKGIEVIYLPPNTMSLI+ 
FLDQGVIRTYR/RHYPQYSMQRSIRPMQ 
EISNKENIIKVWKNSTTDDAIVA 


718 


14619 


A 


724 


318 


14 


TPPFFLNIPNLFFIiKFPSPPPPWFLTKS 
LFFPLKIiKPKVFGK*SLSKF/ CPFPKFN 
PKPQKKNFFFPPPPFFFFFFFFLRHNL/ 
VCHPAWSALAPSQLTATSAS 


719 


14620 


A 


725 


2 


371 


APRCKRFSCLSLPSGWDYRCMPPRPVNF 
F/SIFSRAGGLES*PPMIPPA*A/FPK/ 
CWDYRHE 








726 


430 


97 


PKKNPFWKRA* NGGAQKDVPLiSLiP / SGW 
AGGSPWGPGF*PPLGPKGGPPFSKKPPG 
GVGPFLMPPTPEGGGGKLA* PWPPGFNP 
PVSIFLPRPPRAGGKKGTPFPKKKKKEK 












1 


GGAPPPPFFFFFFIKKGVFPF*QEGFFF 
PGLKN/LPP*PPKKVGVKGGSPPPGGFF 
FFFKKKVFFFFPGLRTRAQI KVPSTLVG 
QG*KILiPPQVSQEGGVKREGPQAQLFFF 
FFFFEME SHFVTQAGLQWRD IGS P 


722 


14623 


A 


728 


98 


368 


KPHKNAKCPIRVAI VADMENNKYW * GYG 
EMGTL/LHY* WEGNIEQPWE /T/VWQIX 
EKLNTELPCDPATPLLGIYPKTLESRDS 
NRFFIFFFSF 


723 


14624 


A 


729 . 


19 


404 


VC VI S I CLATE ILFFLHTRPCVCI YCVC 
VC VCIiAVWGTLC VYVCI ES C VCVCFHL T 
GVI J C/V*VCVISICLATEILFFLHTRPC 
VC I YC VCVCVCLAVWGTLC VYVCIES CV 
CVCFHLTGV\CVCVCVCI ESS VCVCPPT 
CGTVCVCVRPPGVLCVCVCVCQAAWVAV 
SMCLTP/CPC/VCVCVCVCV 


724 


14625 


A 


730 


432 - 


12 


FFHKPNFPAARKGRLFFPIYPSKCLISP 
KAL*F\FGGVGPFFPPPKEGFFPKVPR* 
CFFRPL I RKKQ I L FLPPLNLAP PGVI * M 
/PPPPIVFFFFFFKRDRVSLYCPGWSAV 
A*S *L/TAVRTRGIjK*SSCLSLPKCWEY 
KCEP 


725 


14626 


A 


731 


332 


7 


NKTRKEKFLTRSISRF/CFPPFPLKIFF " 
FPLRA* FFWGGLPQ I FPPPKKVFF/ PKI 
PKVFFKWPPLRKKIFFFFSPLILGPPRV 
LiLKGPPLFFFW*VFFF*DRALIjCHPY 


726 


14627 


A 


732 


70 


294 


FLRFCVETAINKI AI I LANF *NFL*RWG 
\ FTMFPRL VSKL * DSSDPPTS VSQS VG I 
TWSHHARLNFFYYQNSRR 


727 


14628 


A 


733 


1 


355 


FLLRHILLCHPGFCSVATAYCSRDPGSS 
D/PLPPQAPLPDQ*PRLQA/WHRLAPPH 
SANFF * * RQGFTVLARMVS ISQPCDPPH 
SGPQGA 


728 . 


14629 


A 


734 


50. 


395 . 


iPGLTRQWIiLDPCASPSTPPYT/P*VQP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Gfutamic Acid, ^Phenylalanine, 
G=G)ycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














STPQNSSPSPKTHNQKGLPMPLSPTPKP 
STAWKKAILEHTSSSSSSSSSSSTCRNR 
NGYTYTVPEHRPARGHTASQTRKQVIjAA 
THKP 


729 


14630 


A 


735 


451 


0 


GPSSSPTNIPTPVSWNSFK*SSQVAGAT 
GTCYHAWLIFVFSVEMDFY/HPVA*ASQ 
SAQLCQYFMI FLRQGVSLCPPGWSAVAW 
SKLTT/CLCLPGSWDHRCALAHPANFLY 
FWWR/HSC 


730 


14631 


A 


736 


2 


477 


PSHVSNKRLISKIYKELIHLNSKKVSNP 
I * K* GQS LNRD FS KAS FLP P * KC\QM VN 
RYIKRCSTSLI IKEMQIKTTMRCHLTPV 
RLAI MKKSKDNKPS LVWDC * IRVLKLF/ 
CK/WI CQYVQL / FWLLLSNFLVKQP F \ Q 
LPAPPNS PTLG 


731 


14632 


A 


737 


3 


2344 


AAGGPTAQS PAQJLAGRALRIiARWRAVAV 
GACRPG AGS PC S VQGGAAS E LS PRPQTW 
IGSLKP*TFGAAAG*AHRGCGGSALIiN* 
ATPRPAPGLPASPTSSQALPAPLGAWGH 
SDHQPRAFP*SPQASTAIRKEKKQRAQP 
GRASVCPASNPFISSRALPVLQHGPPAI 
SGAGSAVASQAPGSS /GSHAESGS PALA 
HTP*GS*EPHSLIVESTRKS\ELPSSSQ 
GRLLLPLLTPGVAS /PVGTKLPGATAAT 
AGALHQPGLRLSSLQGVGGAKNKQTGCC 
CLQLPTTGLPQAPGALRPLGRLGPAAAP 
GEHRQRTSPQGTVPYGGIRQGWEQPQRL, 
RAYGTALPPPHTPPGSSGP/RQAPGCRG 
SGAGEAAGIRDTGGGGPGRRAPCSQASP 
GRGGWQAQVGCETCRGCAQSS /GGGAVQ 
PGLP RKPYPHS / AR* ENLWP FPCSPRT 
RAQE PQTQGEE / GVEGPQG S PCAPGAVR 
GRGIQLS SEPG K1ARQG / PASGDGP * EG 
TGQEPSQAFSSARHPWPSEARMPDTAIC 
RNQAQ*LQLTSSQPSMGPQLRKSLPATP 
Q PNS YWDGGNS ATLGRTTNTRRHCGMSN 
FGARGDLL.GTVPT+ QPLMQRRKEKPRVG 
GEPVQSHTVCG* \ PAGVSRG WPLRPML P 
ERWRPLSASSGSQGRPGLHPPSLA\CGP 
SSSPHRTCSP/GLDPGLFDAGSIKPPSL 
VGAGQGAGST/GLD/GPGLSLIiSPGKSL 
LPPSSPATGLSGLGWAQSAF* SL.LTVA* 
WLNPVPNGPSDTADCTPAQAPTAPAMLE 
NQANKSDFFFH 


732 


14633 


A 


738 


37 


450 


NQKKWPPPRSSKPARPLFFFL*QKYIKD 
P*KTYQGCFFFSPPPKKKNPPPPKGGGF 
FFFF*KKKGGGPPP/QAKKRGGGGPPQK 
GGAKKNPPRGFTRVFKGPFKKRGPP 


733 


14634 


A 


739 


2 


426 


QEFVLGSAPGCGCWVGGEGRVSS PWGDR 
KVGGPSNSRHWNKGAGPSCPPGPSLAAG • 
LPGRRAGS WAPVLW \ PGPP * GLAQLPS P 
LW/PPSPPQEREPPLPD 


734 


14635 


A 


740 


21 


392 


AQEFKSSLGNMERPLPYQKKKKKKRPGG 
GA/RPLETQPFGGPGGGEKKGGKKKTSR 
P / IRGKPPFL+KKKN\ CLAWGGGPAL* P 
QFFGGVGEKKTFNPGGGGLGKPKPPPFT 
PTRGKNQTPQKKKKK 


735 


14636 


A 


741 | 1 


408 


PSSQGG*EAGG*LEPRSSRPAWAI*RDP ] 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny!alanine, 

K=hysine, L=Leucine, M=Mlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














/GSPEKGKKKKKKKKKKKKKKGGGGPKK 
TPGGAKIKGGGEKKNFFFKGGEKKNLGG 
ILEKKLFFGGGKNGEKPPKKKKGLREKK 
KF*RGKGGKKAENPGGKKI FQKKKK 


736 


14637 


A 


742 


394 


I 


RDPLEEAVCPFSDLQLRAGRTTALFKAV 
RQGHLiSLQRLLRLFVCLjCPAPRGGAYRS 
RQASLNRGGFHPVRAYLLLCLPKQASEM 
AGAPSPA*LPPCSLISDCCASNQ*DSVG 
/VGPSEPGEGYNLWRRFLS 


737 


14638 


A 


743 


384 


1 


FTGPYLKNNGYSFIjFLYPPPSVLGNWQP 
phpgrvfp p * dvnfnqw i fvs l / 1 s slk 
DEPAFSSTLYSF*QHFFFITYLPPP/CF 
FLIESCSVAQGGVQWCNFGSLQPPPPGY 
KRFS CVGL P SS WDYRCV 


738 


14639 


A 


744 


2 


410 


T PLiN P LEV P LLNT S VLLAS G VS MA* ADH 

ASEYFESPFTISDGIYGSTFFVATGFHG 
LHVIIGSTFLTICFIRQLIFHFTSKHHF 
GFEAA V* YWHFVN / VRGLFL YGS I YW 














PPPPPRGRGSPPPPFSRKGGPKGKPPPP 
GNFFFFFKKRVFPF\CPGGFKS*V*GVP 
P PFP P KI FKFKGGTP FPGLWS PLKS KKS 
QFFWVPKPGERKKliFPPFFFFFFLRPCL 
SLSPRLECSGAISAHRNL 














KNIjCRENYKYE* KKLKTMKDLiCKEYYKV 
* MKK I KENM* KDKLMNFK\NNI KI STLI 
KVIYKFNAI PIKIPMTFFAE\LK* ILKC 
V*NYKRP* IVKAI 


741 


14642 


A 


747 


59 


364 


CRFFFFFGLGGGGGGGGGGPPPFF*KNP 
PNPPKKIPNWGFFFRGPGFAPKKKKGPF 
IiGEPPPWGAPLYNPPPPFGGGGAKFPPI 
FFPRAGGPQKRVG/RPPPPPPPPPPRPK 
KKKNLH*LECVEGTEEPPTRPPTRP 














SSCLDLPKCWDYRHELLHLAHWLVLGIF" " 

LLLLFCFVLRW/RSCSWQTGV*WSDLT 

SLQSLLP 














KVLTLFNSMKAERD * EG PEEKFGASTGW 
FMRFQE I S QLYN / 1 DV * GEAAS ADGE AA 
ATCAEDPAKIPDEGGSPK*YIFNVD*AT 
FIRDLSWI FCTSCCS FS ISTCCFTLiHVM 
L 


744 


14645 


A 


750 


249 


2 


KIFKPPAFPVSFPPFPL*DSSSPPEP*I 
SGGRVGPI\FPPPKKGFFPKNPPGVFFS 
PPLRKKILLVPPPLNLGPPKGPLKRPP 


745 


14646 


A 


751 


1 


288 


WNHMTDEGLVSRI Y * QPLQL \KRKTEN 

PASRAIRGTQSKIIMRCHFTHTRMARIK 
KEKEAGPGGSCL 






A 








ETGSWFGPQRL>EVQVVWIKSSLQP*PPW 
APVILPPQ/I J SSG*DHRHVPSRPG*LKK 
KFFL*RWGLTMLPRLLLNNLPTSASQNA 
AITGVSHCARSASVFLILRQGLWVWEE/ 
GAQW*SALSHLPV* /LSR*WCQP*PLG* 
DSGSGSPLWRRLAYLPIiCALWKEVTMCS 


747 


14648 


A 


753 


1 


533 


YLSPRLECSGGITAHCN/LRPPGLK*SS 
QLSLQ\SSWDCRPAPPRPAARLIFFCRF 
EEGRRSHYVAQACLKLPGS\SNPPTVA/ 



566 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

Inst amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylala.nine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vn»ine, 
W*=Tryptophan, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














FPKCWGYRCEPPHTGLCS/ GLSEFGFCL 
WWGACSRSSHKNPGCGTA/CPAVFLHSV 
GYASHPSSTSRTAFTLGWVLVFVFRDKG 
LTVFS QAGSTVGVI LG 


748 


14649 


A 


754 


1 


346 


ELS KSTLPALCGGNDKAWMTAHRFIAWF 
TDYLKPTVENYW* EKKIRLKVLLLIDNA 
PGHPKALLEMYTEIHWFMPADTTSIL* 
PTHQGVI S TLKFC YL * NTFHKVQ \CYID 
SDSS 


749 


14650 


A 


755 


1 


323 


EDQTSHNVIiLSIiNLiIQSKALMIjFNSMEA 
E /R/SEEAVEEKFEASRE * FMRLiKERSC 
LQNTKVQGEEASADGEPPACDPEDLAEI 
TDEGDCIK*QIF I VD KTA FYS KKM P S 


750 


14651 


A 


756 


1 


344 


ARQQPGDPARGGGRGRRL \QPRQAAS FC 
TG TLARARHTCEGAG VLRRPAD KLASLN 
*HFSQKKKKKKKKKKKKKKKKKASSSSS 
KKKKDSSYSSSSSRGGSSSSSSSSPPPP 
PPP 


751 


14652 


A 


757 


119 


317 


■ NRYFPFIiRVKKFPWVEGPFFFLR* SLTL 
LTLVAQAGLQWRNLSSLKSPLPGFKPFS 
/PSA/LSRWDYRR 


752 


14653 


A 


758 


2 


363 


CILAIVKSAINRIAN*YI*KCSMSIITK 
RQA/ 1 KRKNKTRRYQti I PVRMTL IKKKK 
RW*RCEEKGRLAHCWF\ECK*RQPD*KT 
K*RFL,KKLKL,P\FITAIALLDIYPKQIK 
S E / CKKH / C AI L F I ALFT I AK 


753 


14654 


A 


759 


358 


1 


KADFCFPPTFSPRVKKIFPPPFPKKWG* 
KGP/LPPPGENCVFFFFKKKRGFSPLGG 
GFFYFFPPGFPPPWPFKKLGFKGLTPPP 
GPPQKILVFFFFFFFFFFE/DGVliIjCRP 
GWSAVAQS 


754 


14655 


A 


760 


316 


2 


KATRSQDIRRIQVKLiKEIEPQKPliQKTN 
KSRSYFF*KH* *NRTLTRVMENKREK/N 
EIDPIQNHKEDITTDPTEIQTIRKYYKH 
LYAHKLKKLEEVDKLLDTNTL 


755 


14656 


A 


761 


3 


321 


FPLGLHGSLGAKPEGQAPLYPPTLSKRV 
VPLiC/DKGRSAVTFNTVYSSGSSRWFP 
PPCM* IYRLCIKGRSSDPEQKKKKKKKK 
KKKKKKASKKKKAPKDSSSPKT 


756 


14657 


A 


762 


315 


2 


KTERW/VFEKINKMGQS * DGLTKKNREN 
/ V/QINKIQNERGGLSTGSTE I QRVIRG 
FCEPL*AQILDN1jEKMD* FLQPFHLPRQ 
NYKKKKI I *KKPISKVIELiVIKNL 


757 


14658 


A 


763 


226 


329 


NPTTLEGQGGRIS KG PE F / CSRDEVS PC 
WPGWFQTPDLWRSVRLGLPKWWDF/RR* 
APPPGLNLFFRMLSTHGSW/CQHPCFKF 
PTSAFQYIYIYIF/SFL.SRSFTFVAQAV 
VQWHDLGSPQPPPPGFKQ . 


758 


14659 


A 


764 


335 


I 


QSQERPDSFQVMNLTLPRAPSMFLVSPR 
TELSKSP*\PPAFLCPCVWPHSTLLSQT 
LKY*IKYP*SPTPFPSFVLF*RDRVLLF 
HPGWSTLAGS * LTAASNS WAQVTLL 


759 


14660 


A 


765 


100 


310 


HFGRPKRENCL>SP*V*DQTLHTHTHTHT 
HTHTHTHSRSENGGMRLTPHI PERVRW\ 
GIIC 


760 


14661 


A 


766 


315 


3 


IFPNPPRIFIQNPHPFLFGFFPKKKEGE 
KNWAMETPRFS PPPLGGKKAPTGFPPPP 
IGF*GPKPLIKGRGP* /PP*KNTKIFFF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T= Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














FFFFF*DRVLLCHPG*SAVARW 


761 


14662 


A 


767 


1 


309 


KEPYKLTELHNEIRVACGHLSKKKKITW 
VSDS*ASCPSKTRGNFIGPSAGSRAPSE 
KLEAHFHGCGSVNRVHLT/CJOHKRRPL 
GVAHLEVSDQKSGRTSLALD 


762 


14663 


A 


769 


1 


316 


GRAPPPPPPPPPTPRVHTSSKSESEPER 
/DGRE*EPTRSLERL/LFFASYFCLEAR 
QSTSALPPLPLSPRSGPT\VLLCPSPPL 
PGERPPNPTKLSREKQTKQTHRAR 


763 


14664 


A 


770 


2 


366 


AREIIILTF*QMITFLNTKGRT*SLIIjA 
SLI IFIATTN\LLGLLPYSFTPTTQLFI 
NLTMAIPL*AGAWIGFRSKIKNALGHF 
LPQGTPTPLKPILPIIETIILLiIQPIAL 
AVRLTANITA 


764 


14665 


A 


771 


2 


364 


NAFPVISGAWTEYPLSHFIQRCRQKKE/ 
LNGLQIGKEEEIFLFKDAML*IESPKES 
VQKRLDVIYKFSRIAVCKINIQKSNIW 
YVWNTQFENE VKT I * DPQ /D I KH WG 1 1 L 
SQRKKE 


765 


14666 


A 


772 


3 


376 


HEPLGKLKLSLLFILATYSLTVYSIL*S 
G* ATNSNYAL IGALRAVAQT I S YEVTLA 
IILLSTLLRRGCFNLSTLMTTQEHL*LL 
LPS * PLAI I * F I CTLAETNRTPFDLAER 
ESELLSC/FNIEYA 


766 


14667 


A 


773 


3 


350 


HEFFFFIL.KMYLGQAQYFICFLFFFLGF 
HLK/HKSCSVTQAKVQRRHLGSMQPPPP 
GFMQFS VAE IKD VHHHAQL I F Y I FR I F I 
F+ F /NFLRHSLALVAQDGVQWRDLGTLQ 
PLPPPG 


767 


14668 


A 


774 


1 


359 


GTRYAAMLiSALG F I FLFTARGLTG I VLA 
NSSLDIVLHDTDYVGAHFHYVliSIGAVF 
ALIP\GFIH*FPLFSGYTLDQTYAKIH\ 
FTI IFIGANLTLLPQHFFGLSGMPRQYS 
DYPDAYTTW 


768 


14669 


A 


775 


2 


369 


ARGS ICLRQTELKTVIAYSS IRHIGL W 
TAI LI OTP * SLTGAVIL 1 1 AHGLTYS LL 
CCLAWSNYERTHS \RI I ILSQGLETLLP 
LI TF * * LLARLANLALP PT I NLLGELS V 
LVTTFS*ANIT 


769 


14670 


A 


776 


2 


353 


ARGTGA* VDS * LTTLHGSNMK* CAA\ LL 
\WTLRCKILFTVRGLTGMAITNSTLDIA 
LHDT YY WAH FHYVLS I GAG FA 1 1 RGF I 
H*FPLFSGYTLDQTYAKIHFTIIFIGVK 
ITFFPQ 


770 


14671 


A 


777 


3 


353 


HEGLHL*L.PKAHVETPMDG* IALDAGLL 
RLRGYGIIHVTLILNPLRKHILHPFLVL 
SI*GIIITSSICLRQTDLKSLIAYSSIS 
Hl\SLEVrAILIRTPL\SFTGADILIIS 
HGLTCS 


771 


14672 


A 


778 


367 


2 


FCP I/CPNQK FMG VGAVGP AL Y PNP FGG 
LGGLV/TPGAGVLNPAGPPG+TPPPPKN 
PNLGGGPALFFPPLKGLGWKIALTPEAK 
GSINPNSPPALPGGGPNQTFSKKKKRKK 
LMLVYSIELTSRA 


772 


14673 


A 


779 


3 


432 


HEPLHWLSSCVCPACVCVCVCQ/ CLWLQ 
VGVQ* CNYSSWQHEPLH* VHTRVYVCVC 
LCHSVAQAGVQ* CNYS / CTAACNSW 


773 


14674 


A 


780 


213 


466 


DDILPVWNYSISFLFFFENRAFFLPPGW 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














GAGTQL * LPAASN* GPKGPSRLTLL * W \ 
DNRHGPPC/LDNFIFLQKKNLTLLPRLA 
LN 


774 


14675 


A 


781 


304 


373 


R*VFFFFFLKKEFHFFPQAGGQNQEMG* 
LHPLPPGAG/RF*CLTPPHSWDNGEGPP 
RPPNFCFFRGNKVSPCGP 


775 


14676 


A 


782 


389 


1 


PFPKN*CRVNSRPNVKHKTI/QLLEGNL 
GASVDDLEFGDDFLDTIPKVQSMKERS/ 
WDFI KI KNVCFAKDNVKRRGPATNWKKI 
S VKDLSNKGLL PK I YEELLKLNNTET * R 
LT*KWSKVLNRQLTRETRA 


776 


14677 


A 


783 


1 


383 


GTSP* PLTGALSYLLMTCGLAM* *HFHS 
ITLLILGLLANTLTIYPGRGDVSRQSAY 
QGHHTPPVQKGLLYRIILFITSEAFFFA 
GFF*SF\YHSSLSPTPQLGGHWSPTGIA 
PLNSL*VPLLNTWRLL 


777 


14678 


A 


784 


1 


389 


GTS I VI PTGVKVFN * LATLHGSNMK* SA 
AEL * ALGFI FLFTGSGLTG I VLANSSLN 
IVLHDTYYGEAHFHYVLSIGAVFAIIGG 
FIH * FPLiFSGYTL>DQTYAKIHIjT\ 1 1 F I 
GVNLTFFPQHFLGLSGNA ■ 


778 


14679 


A 


785 


374 


1 


ELNAYWNVMNLQNLIWNAQPLSIMQI FQ 
I LI KSQ IQNTLWS ISDTGYLPGIDKWN 
CI*LKILCIVKVTINRARMPVIDWENTF 
S TYTNDKGLI PKYKELKHS\ KQTNNLI K 
KWAKGLHSHSRA 


779 


14680 


A 


786 


.1 


363 


GTRLYHANTN*KKL.RVAILISEK\TDFT 
VKKIRNKEGHYIMIKRSIL*EGITILYV 
GTPSNRWNYIRQKLIKLPGEIDESTII 
LRDFNTPLSVIDASTRKKISKNIVESNN 
IISQLDLID 


780 


14681 


A 


787 


1 


361 


GTLFSSERKNPTWVSIiNQKL.EMIKL.IEE 
GMLKAETGHKLRLLQQ/TSQWNAKEKF 
LKE I KS ATPVNTRKTRK*DSLLATJTGKV 
LVAC I ED* TSHNVLFS * SL IQSKALTLL 
NCMKPERGE 


781 


14682 


A 


788 


1 


352 


GTRNYAKSTKSKLYRWNYKAWMTAYLFT 
AWCTEYFKPTVETYCSEGL/ SLKILLLI 
DNASSH*RALMEMYKQINWFMLDNRIS 
LLQPVDQ*VILTFKSYLRNTFHKALAAR 
DNDSSD 


782 


14683 


A 


789 


365 


128 


PLDQHGETPLLLKIQKLARRNGAHLYSQ 
LLGRLRHENLLHPGGRGC/SHCTPAWVI 
E * DCLKNQTKTKAPRRVS I SHSA 


783 


14684 


A 


790 


217 


257 


W S GG VAH P V I P ATQE AEAE ELS LGGRD C 
SELRSCHS\CTPA*VTQAGVQ 


784 


14685 


A 


791 


2 


363 


LAILHQTVS * F VHAJCEKFWKDLL \ KS AT 
PVAI *MIRNLNSLYNLIMETV* W*KED 
QTSHLIPLS*SLTQSKALIFFKAMKTDR 
GKGAVE*KFEATRGWVMRLKEKFCLS/K 
HIKVQG 


785 


14686 


A 


792 


2 


362 


GQKLGLLHQTVS KFVNAKEKFWKELL \ K 
SATPVDI*MIRNRTS\LISDMETV*W* 
KEDQTS PL I P IR + SLTQSKAVNLFRAMK 
PVRGKGAVE*KYEASRGWFMQYKEKSC/ 
RMCNIKIQG 


786 | 


14687 


A 




2 


360 


ARAGSTMAFKNYAKSTLPVLYKWWKKA* 
MTAHLFAARVTKYVKS / YCLNKKI PFKI 
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nucleotide 
sequence 


SEQ ID 
NO: of 
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od 


SEQID 
NO: in 
USSN' 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of" 

peptide 

sequence 


j Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Gtutamic Acid, F=Phenylalnnine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=l)nknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














*VFIDIVPSHPRALIGIYKE/ INVFMPA 
NTTSTLHPTDQEVISTFKCYYFKNRFRK 
AIQLPYSNSY 




14688 




794 


2 


359 


areklylsiltplqrslviafaatel.tr 
yyilfettlipslaistr*gd\qperln 
agtyllfytlvgslpllialiythntlg 
slgi llltltgqe lsns * annl i * layt 
iafivkip 


788 


14689 


A 


795 


157 


365 


GIIEEKGYLPEQIFNAKCSGMISAHCSP 
RLPGSSYSTSA* TDF*CQVQWYDLSSLQ 
PPPPRFKLFYLSDPSTWDYILAPP/RPS 
**FVFFVEMGFRQTPE 


789 


14690 


A 


796 


224 


372 


IFFIF I FI YLFTEWRQSFALVA VQAGVQ 
WRNLGSL* PTHPPPRVKRLSCL 


790 


14691 


A 


797 


482 


1 


NSFFFFFFLFPEAKDHKDFPSPPFPVFK 
KLKGFC+APP*VQKPVFYLKSF/CLKDY 
MLSFPESKVNFPSPPFL/SPSPLFMVGD 
PPLPPETYFSFCF*DPHSINFFLPPGPF 
LFHFFFFLRWVLLCHPG*STAV*SLLTA 
TS\VK*FFHLSLPSSWVYRCTSC 


791 


14692 


A 


798 


] 


415 


NLGGGGCSELRSYHCSPAWATE*DSISK 
QTKTL/NKDHTRAGWERA 




14693 




799 


2 


401 


VQTGFHHVGQAGL* LLTSGN P PAS AS / Q 
SAGITGMSHRARPQ 


793 


14694 


A 


800 


73 


307 


PMALEHHGCGMCLD FLPTFGKSHCFVLR 
CAEMETRSFLPSWSAGA*Y\CLLQPPPP 
RFS*LRLPSRWDYRHLLPCPAN " 


794 


14695 




801 


87 


401 


SLIEIWTLKKSTCHNLYNWLS*HSLKA 
IVLARHS / VESLTHVLKMCLV*NFVFAF 
S*RSLCFFLKISRPVWAHVCNPSTLRG 
♦GGHITRSRDRDHPSQHGEIPS 


795 


14696 


A 


802 


3 


354 


LRKYTP\PG*QSETLPFKKKKKKKKKKK 
KLSFFPPPKFLKKKKGLFKNPFKKKGKI 
FFNPPPPKKKGFFLIVNPPPKKKNPPPL 
GGGGP PKK I YF* KT FFFAPPPKFENPFF 
FFSPRF 


796 


14697 


A 


803 


1 


830 


VETGFLHVRQAGLKLLTSGDLPTLAYQS 
AG I TGVS H CA WL FF F F * * CLAVTQTE VA 
PS * LT I ASN \ PGLKLSS FFTL PHRARL I 
FKI FSRNEVLLFSR \ P SQTPNLMQS SCL 
SLPKCWDYRCEPLYPAESLSF\FIKLSC 
MLKCLE VKCNDVCNLL * NTL IKWINEWI 
EGWIGRLTCVKQIEKNVICRI +VMLKAT 
QPVGTEPS * ESRSTNSGEAMW* S.KDNTP 
FPVFVSVF VLRQS /LCFVAQAGVQWSHL 
SSLQPLSPRLK*SSCLSLLSSWDLRARA 


797 


14698 


A 


804 


389 


3 


F I WP F KG AP ENS F F VFM W I KGH KC P KKG 
FGRKKNQLGPKIPF*KKKKKGAFPLWLK 
RFQGPFFFFFEMEFHS \ IAQAGVQWCDL 
SSLQPLPPRFKRFSCLSLPSSWDYRHAP 
PRPANF\CVFSRDEVSPCW 


798 


14699 


A 


805 


161 


21 


RGGGCLQSQFR/RRLRHENRLNPGGGGC " 
TE PRSYRCTPAWARE * DSVSK 


799 


14700 


A 


806 


405 


82 


FFLRWTL/ DTVTRGG IQWCNLGS PQPP / ' 
PPRFKRFSCLSLSSSWDYRRPPPCPANF 
LYF+ *RRGFTMLARLVLNP* PRDPPASA 
SQSAGITGVGHRAWPMPIFENRFDL 


800 


14701 


A 


.807 . . 


938 


2415 


KITFWETFWIT TVH P HL CKERE AI AG I L 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RRWS*NKW/MIHTHTKNLNPYLTPDTKA 
TFK*IIGLT*RAKTMQL/LESFCQKENT 
GENLSDLGVGKDFLRHKKHGS I KGEKIA 
KLDF I QVKNF * S L KDT FKKMKKYALG W E 
KI FAERVS DRGCVS RRYKEL * L / 1 ELKD 
NPIRKGGNNLNKVHQRI *MANKHMKRCP 
KS+VIREI*I*TIMRYHCILPRMAVMN/ 
SD*SHGDKNGGSSGTLIH\CR*ECEMDQ 
LLWKTIGQFLSK*M* RCHTDSS I PFLI I 
QEK*NCISTKKTCTQTFTAA/MYIiIjVIA 
KNWKQLPYPSSVWI/QQIWCIYT\MEYY 
SAIKRTVDTLNN\QMDL.KI IRFNE *S/H 
NLRRVHCMY 


SOI 


14702 


A 


808 


414 


1 


KPKAKLFVPVRQWMAKEKFLKEAKSAA 
PLiN\RMIRKQTSLTADTEQV*VIWIEY* 
TSHSIPLSQSLIQSRVLTL\SMKAETGE 
KAAEENLEDSRG *FTRLKEK3HLHNIKV 
QGEAVS ADGEAAAGYPEDLiAKI IDER 


802 


14703 


A 


809 


399 


1 


TGPPPRNQPFNKPAKGNFWLFLFFFL*F 
/CNFFFFFFFTGAFliSLFGI \lv* clml 
VWGNFSP*PFKYCFTFSFFLSFIFLIFL 
WLLLNTCLYICYTFCSYLTVL*CSFL,FS 
S I FFLLYF* FWRLLLIHPPPTRP 


803 


14704 


A 


810 


1 


392 


I LLAGA I ED AE PG S G * TG Y PT LTG S YSH 
PGAYVYLTILSLHIiAGDSSILRAINClV 
TTLINIKPPAVARYLTPLFV* PVLDTAM 
LLILSLPVLADGVTILLTDRNLYSTFFD 
PAGRGDPIL YQHLL* LLRH 


804 


14705 


A 


811 


66 


411 


RETRAGAQGLSRFSPPWGNPSQKIFLGP 
GFKKNPGPQKKGNFFFFF*KKFPLVSPG 
GRGGGNFRSLQGPPPKVPPFFCFNPPRK 
GG* /MGAPP 


805 


14706 


A 


812 


3 


392 


LSVPQFPPFFLKKTFRNCPPFCFSPPNI 
LGFPLKGSKK*GSFFPFFPPPNYSFFWF 
PS YSF*NQPCLTRGPF*NQKFPRL* KGV 
PIL>RL*SPLFFGPPPHSF/IFFFFDRVS 
LCHPG YS AVAQS / LLTAASS 


806 


14707 


A 


813 


690 


153 


IISIDAKKAFYEIPPPSVISSSSPTDSL 
YYNLG FLKNFKKGRE VKYLN 1 1 KYIYEK 
PTANI I FI SGKLKAFSLRSGTRQI CPLS 
PL.QFNKVL.EVLARATWQEKEIRNIHI EK 
EEVKLSLFVSNILCIENPI J KMP*KNSSN 
* * IQRVAGYKTNI *KIS\AFLYTKNKLF 
CCLSHPIYDIPL 


807 


14708 


A 


814 


32 


376 


IiFGIiARS Y I TEGGRIiPENPT I PHG * REF 
WELCNKCD\TMRPKPSLHCSRCGHCVTR 
MDHHCPWINNCVGEDKH*LFLQLCFYTE 
LLTCYALMISFCHYYYFLPLKKRTLVRN 
VYI 


808 


14709 


A 


815 


2 


423 


YPLYPFKIFIFPKGFNFCREVGPICPPP 
KIKVLSKNSQVGFYTAPYKEKNNTLPAR 
VNFGPPKDSLKRPPLFFFFFF*RDGIL,L 
CYPSWS*TPGLKQSRCLGLLKHWDYIC* 
PLHPAS F * KKHYS LRILP / SLLADAW 


809 


14710 


A 


816 


404 


54 


SLAHFFPPPKKGFFPKIPRGVL*PPP*K 
KKFLFS/HPPVNLGPPRDFLKGPPPSSS 
SSSSSSSSSSPGG*MIFY* /P* * FGPAL 
VQPMACYCKASCKVKAPFFSrHSGIiMVA | 
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SEQ ID 
NO: of 
peptide 
sequence 


M 
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nucleotide 
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correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cystetne, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla)anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possib!e 
nucleotide insertion 














PPTTAQS 


810 




A 








DAEVGGSLEPG\SGGCSEP+SCHCTPAW 
VTG * ETVSKKKKKKKKGFFP PFFFKRGP 
FFREIEKKFGKKFFSKKKKKKKKKKRGG 
PFKKNFWGGKNLRGGKKKFFFFLGGEKK 
KPRGFFEKKTFFWGGKNWATTTQK 


811 


14712 


A 


818 


1 


404 


I P INSLTSKINKLLKRHKL P j HLNQEE I 
DNLTRTIPI *NINF* I YKQNTGPDGS IS 
KFY* TIRKT*Q* FYTSFQKI KNEEVLPN 
S FYEASTPLAS P KKKKKKKKKK 


812 


14713 


A 


819 


421 


2 


LPPKRRCFSPKPPRGFFFAPS*GKKNI/ 
CFPPR*NLAPPGVFLKGPPLFFFFFFFF 
FRGRVG*SWVPAASPLQGQVFLPPQ/SS 
*VGGSPGPPEHARVNFFFFFFFFF*F* * 
RQSLTMLPSLVSNSWA*TILTPQPPKVL 


813 


14714 


A 


820 


84 


418 


I LTCHNARLGEKS Q I VTMQAS QNNP KLV 
NI CE VFLVFF F WK P I FFG TQ VGGQY FNL 
SSL*VRLPGYTHFFCL,TLPKSWDYRPIP 
PS \ P IFFCFFNKKGVSPGYPGGCNILVF 


814 












FSPPPENLGPPRNFLKGPPPFFFFFFFF 
FFFFFFFMFCVLIY\FIFFCTNLLFLDF 
CIT* *T*NWFISSKAVCLFLFLPLQPAS 
QKTKNRCRKSFF 


815 


14716 


A 


822 


416 


2 


ITKLTEEKKIWSGKRKNDYVKYAGASQM 
YTAWIjRPQSMZTLKDSSERKAPHWGEL 

nw\hvhhfiwker* pev* iyies*aid 
gglaswsgpeknkigklgakkv *rrdmk 
tdplectqnini fgfvclfgdrvslcw 


816 


14717 


A 


823 


409 


45 


P PP P FLFFFF * KKKKAPGGGS QG P KF * P 
PPPPKGKPPFFKKKKKKPGGGGGPPNPP 
PPKGGGQK/QFFPPQGGNFNKPKSPPPQ 
PPRGQKKKPPPQKKKKKKKKKKRLNVTG 
PQFLQLESGTY 


817 


14718 


A 


824 


1 


408 


IFSEHWGIKL* INTKRNFRNCTNIWELS 
NMLLKNH\ * VNKEN*NKKF I KTNE/NKN 
ITCQNL*DTTKWLREIYSKKCLLQKKK 
KKKKKKKKKKGGGALKNYLGGGQFYGGE 
GNFIFFFWGGAKKPCGWIFRRRPFFW 


818 


14719 


A 


825 


259 


274 


YL*YLLLYRRYSA* IPL/LGYYLKKRKL 
LYQRDTYTPMFIAALFTIAKMWSRP*SP 
W 


819 


14720 


A 


826 


422 




FFFSKIPWGNFFPPPPKRIFFPPFPPKI 
FFFPPPPFFFWGGFSPFSPPPKKVFFPK 
SPPVFFFPPP*KKKFFFP/HPPLIFPPP 
GFFFKPPPPFFFFFFFFFFFFFFW 


820 


14721 


A 


827 


15 


402 


IKSLGKNIGINLGDLVLNNGF*YTLPKA 
QATKEKIDN/WDLAXAKNI CA* KDTIKE 
VERQPKR*QQTFVNHLFDKGLVSKHV /N 
EL*LINKKTNH\HI J * *WAKD*NRACSKE 
DVHMTDKLMKDVPSLVIREI 


821 


14722 


A . 


828 


420 


3 


KDAHTCPPG CKGG FSGHRALKA I CRNQG 
FIjLTTTEFLTHKCPIjPGQEPWGQHQG*!? 
DS PRRVS P APGT WQ P P CHR ERACLAPS A 
VE/GPS*IQEQEKSLLFFFFSLLRWSL/ 
NSVAQVGVQWRSLGSLQPPPPRFKQFSC 
LS 


822 


14723 


A 


829 


1 


269 


IKIKNIARHGDSCL*SPLLLEMLRLEEN 
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nucleotide 
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ing to first 

amino acid 

residue of 
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Predict- 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














LS PGSRGCSEL/ CQPARQGETLSQTNKQ 
TNKKPHQVQLPCFEIINFKEFITCLVLR 
LPGNSW 


S23 


14724 


A 


830 


432 


1 


GYTTOQQIFDVDQTAFY* KKI PSRTS I AR 
E E KSMLGF KAS KDRL I LLLGAKVAGD F K 
LKPMLVDPSKTPRELNN*ARCTLPVI,/V 
KWN/KAWMTAYLFTV* FIQYFKPTVDTY 
C/FKILLLIDNAPGLPRAVMDR*EESNV 
FMPVNTTL 


824 


14725 


A 


831 


1 


233 


KKLQ I KYLG INLMKYT I HWS I CQ KLLM 
KE INEDLNKWRDVLCLW I * RLNNMS I LH 
KlilN/RINIiLQLRI 


825 


J 4726 


A 


832 


3 


477 


YQTCKEDLTQMLL KIFQKI KGEGI I PNS 
LYDAS IAMMPKSDKDRTKKFXNYRPISL 
MSIDAKILNKILPNQIQQHIQKFIFIPE 
MKG+FNIRK* INLIHLITKMKTKTPMII 
S IGTEKVFDKI KY/PFP ITTIiTECLG I DG 
RSLDAIMVARE 


826 


14727 


A 


833 


3 


402 


RYQTPLLV\*TVLITTALLGLSLPVLTG 
GITIPLTDRRLDTTFFDPAGGGDPILYH 
HLF*FFGRPEAHILILPGWGIISHIGTY 
YSGKKEPFGYTGMR* AI I S I AFLRVI V* 
AHHIYTV* IDRDTRAYLPSLHN 


827 


14728 


A 


834 


7 


395 


DPQRVSCMALSSNSFFFFFWERKSLFFP 
GREGRGQIWVNGTPPLQGKRNSPASPPG 
GGGITKKAPKLHPPKKGWGGGLEKPP*N 
QRGG FG I * S PGGKKGFAGG* KQGQKS KG 
GGKKKGRA\DP+AYIPLK 


828 


14729 


A 


835 


41 


444 


DPRVRKKERENLYRSVSIKEIESVI*NF 
PTKKI*GLDGITSEFIOILPKKKPK\IK 
KERTFSNPFDKTNITLI/SKPETTFTKK 
EN/ VRPVSLMNI /DCKILLKVLANCTIi\ 
HI*REIHHDS/INFIPAIQISFNIQKTN 


829 


14730 


A 


836 


3 


415 


HAY YI VKPS P / W I PKGALSALLiKTYGLT " 
M*MQFQSIKDLRLGLLTNTLTIYQ*WRD 
VTRKSTYQGRHTP PVQKGL* YGI I LF I T 
SEVFFFAGIF*AFYQSSLAPTPQLGGHW 
PPTGNTPLNPLEDPILNTSGLLGIGVS 


830 


14731 


A 


837 


378 


0 


tppkgpggkiflkk/ SPGRKIF*PPGNG • 
/PFFSPLSPLKFFFFPKAFNFWGGGGPQ 
GPPPKKGVFSQNPPPGKKRPPQKEKKNF 
FP/PPGKMGPPQGFF*RAPPP 


831 


14732 


A 


838 


383 


38 


GPGEKIFLKKPREKKLSPPGEKAKFFPP 
SPLKNFFFPQGVFFLGGGGPKRPPPKKK 
GFFQKTPRGF* IAPQKKKKKNFPPRGKF 
GPPQ/RIF*KGPPPFFFFFFFFFFFFFF 
WSL 


832 


14733 


A 


839 


59 


418 


TKISSKDQKQY\TTPGW*MAEISTTFED 
AE 1 1 DALIFPFNLiP I WLHKKPDCS WKVS 
VYCKLK*WSLISATGLDIINT\LLEEM 
KRVSGTWYIATDLAKTFFYI / PDQKEFA 
FSWYG*KDIF 


833 


14734 


A 


840 


5 


335 


IRHLF* INKFENLDKMDTT*KX>TYEGIE 
SLN/RII*NF/PWKSLGSGSFTSEFYS 
LFKEEITAIYKPIWRVEKGGI \LPTCFA 
TRITLIPKPETCIMRKKNCGPVLFMNGH 
F 


834 


14735 | A 


841 


1 


38 


RRLKLEDHLSLY*KINSKWIRELNrR/P 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine V—Valine 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide. insertion 














ET I KLVKKDKGNV\QDTDPGKDFMVKTT 

PIE*GKIFKNYSSGKGLLSMIYNELKQQ 
HRNNNL I * KLEDHLSLY 


835 


14736 


A 


842 


2 


424 


GRVGSIPLSQSLIQSKALTLFSFMKAET 
GQEAAEKKVEANRGWFTRFKERSGLCNI 
KVQ/GAGGDTAASSPADPAKINDEGGHS 
/KQQIFSGNETAFY*KKMSSGTSIAREE 
KSMPIFKASKDRVTLLIGANAAGNF\KL 
KPMP 


836 " 












FFFFFFFFFFFFFKTFLFFGYWFSIFFP 
LLFSISLLLSFSGSFVTFAFQPFF*VSR 
YHYQHFNSTF*Fli\KXi*FIF*IFFLISI 
LFLFHGYNF 


837 


14738 


A 


844 


409 


1 


RCGGTCPWSPVLRRLR*KDHLSLRG*GC 
SEP * LHHCTPAWATE\DS VS KKNKKKG I 
SCRQHIVGSFFFHLKMCLLNGLLSFFTL 
NVIIYWEFKFTILLFLLYSICSLSHLF 
SFPAKSWINSKFESLFVCFNSFS 


838 


14739 


A 


845 


63 


451 


KNQEQESEETLPNL.FYKASITLIPKLDT 
QK* KKRKEKKRKKKK\EN* KP I THRN I Y 
AKILNKILAHQIQQYIGKI IHHDQVGCI 
P/GAFDKIQYRCMIR/TLQKMGIEGTHL 
NIIKAIYIRPTDSIIENREKP 


839 


14740 


A 


846 


60 


460 


RNNEPFLDQIVTYFEKWILYDNR* * / PT 
QW LDG E EG P KHL P KPNLH / Q KKVWTVW 
WSASLTHYSFLNP\ETITSENYSQEIDE 
MH*KLQSLQIiASVNRKGQIPLHDNPRLQ 
VAQPVLQKINELGFEVLPHPPYSPE 


"840 










2 


NFLKFHPGGETFLQKGYTGYFSPFTP*K 
FFFSLKPLNFLGRVGPFFPPPKKGFFSK 
IPQGGLISPPLRGKVFPPPPRLNLGPPR 
VFLKAP/HPFFFFGI\PHGLQPRPMI*K 
PTRP 














RRVAAPPPSKNIFFPPGSYNCGGVWPQK 
SPPPK*GFFPKP\SGVYKNPPQIEKVFF 
FPHARIVPPPGDPIKTPPPIFFFCFNSI 
NFSLRGAQLYVWDFLGSGC 


842 


14743 


A 


849 


94 


470 


LNHFLSFFLSFFLSFFLSSGSLSLSLSF 
FFFWKGSPPPP*GTK/QKPCLKKKKKKK 
GGGGGQNFFTPPGGKKNPPPQGGGGGGS 
PAQKKKP/ EGGG P PG S KKKNPGGG FKKK 
KPPKKPRGGGGKPFFP 


843 


14744 


A 


850 


2 


396 


FFKKENIQMANKHVKTYTTLLVIREMKI 

RTIRILMLC*LQNKMVLLL/WKNWQFLT 
KI KI FL F \ FDPT I S LLGI YPREMKTYVH 
SSFIHGNSKNNRTGNNLNVHN 


844 


14745 


A 


851 


369 


36 


SKGASDILiEKPPLF'EVGLPTPSPPLGE 
KMP/FNYKGGPGKGISLiIjlFPKKKFIPP 
GGHFPFPFP/PFFFMDRVLLCHPGWSAV 
AVSQLTTTSASRL\K*FSCLSYSEG*DG 
RIS 


845 


14746 


A 


852 




367 


PPPPKIKSPSGPGPPSKRGSPGAPPRGG 
KFWFLSQKGHGGP» FPPPPGVKPE /NIP 
* PPRGRVPL/HLN1APAPPPGPPK/PKP 
PSP 


846 


14747 


A 


853 


1 . | 378 


AMLATLI SNS * PQ/CDPPALAPQSAGIT 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalnnine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, JVNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Srop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GMSPCARPTSCSFYP* ILLTVRTFIKIR 
NFFFFFDTGSNF VAQ PGGEGHDHG* * * P 
QPPGPWKSFNLSLPSRWDYGGAPPCPVI 
LCIFCKDEVSSCGSG 


847 


14748 


A 


854 


369 


1 


SACFGLPKCWDYR/R*ATTPSLEIIIYI 
ER*EKTTSPDNANRFDVTKLHHYDCSVM 
DF* KAL * NGDD FP VLMS KMHMQS LNI I \ 
FVFCFLRQGLALSQ/AGVQWRDHGLPQP 
QLPRLQPSSHLSL 


848 


14749 


A 


855 


380 


335 


NPGGRGCSELRLCHCTPAWAT\SETLSQ 
TKIQPKKQEV*LL 


849 


14750 


A 


856 


373 


1 


LRIKILNKKGLANLI P* *R* KI IHSQVE 
FIPEMQA*FIIRK/TPSIKWPIVRTKKK 
KKSQMLISIDTEKIFDKIQKLFLVKPVI 
KLGIEGTFINFIKSIY\KKSTTATIISG 
LNASPLRLKTRQDF 


850 


14751 


A 


857 


369 


44 


KVQNLYFEKKLLNKIKVDLNKW/NSLCT 
RFRKLNIANIIMFKLIHRYN/SKVKIPI 
KIPANCFGEIDKLII,KFI*KFKRPQIAK 
IILKKNKDGYILPDFKTYYIYMQYHLC 


851 


14752 


A 


858 


1 


348 


QWHAPAS/LARPPPPRFK*FSCLSLPNS 
WDYRHAPPRLANFVLFCFVG*GGIHL 


852 


14753 


A 


859 


1 


366 


CTS PTFNQKLEM I KL, / EEGMLKAETG * K 
LCLLCQKV/QWNAKSMFLKEI*SASLV 
NTTMI \RKQSFTADMEKV*W* IEYQTS 
HNIPLSQSLIQSKA1jTL.YSSMKAERGEE 
AAKEK/LEASR 


853 


14754 


A 


860 


12 


366 


PS TLGLRRAS CLSLLSR WD * RHM P PH PA 
N*KNFFCRDG/SLTMLLRLVL\NSWPQA 
IHSPWPPQ 


854 


14755 


A 


861 


424 


78 


NPRPLEGQTASFSLAPNF* TTLGKKVNP 
FF/SLKNPPPPPPPPKIWAPQGPFGPPS 
PGGLNGGFFLPPRVKAPIIHKGAPPPQF 
PPQKKKGPRVFPKKKKKKERKKEKGMTR 
AFCYP 


855 


14756 


A 


862 


327 


3 


SHWFFAAV\GREISM*AMAPDQTKKICP 
RSAEDAI KYFLTQATGS III>IRAILFNN 
RLSEQ* SITNTTNQYSSLI 1 1 MAI AIKV 
GMAPFHF*VPEVTQGSPIiTSGLLVL 


856 


14757 


A 


863 


1 


341 


YDRNKWKDIPGS* IERLNIVKLSMLPTV 
IYRFNVHjIKIPMTFFAETEKSIVKFSR 
D YE * P KQ S * KEQNWKTHT P * FQ / Q F FT A 
TV I KMVW Y * NKDR Y I Y / DQWNR I * S LG I 
NPCI 


857 


1475S 


A 


864 


32 


324 


LVWPFFFFFFFF* KKKFFFFPPF*RKGK 
NFFFFTPPPPGLNLFF/ CPHPLKKWEKR ■ 
APPPPPN* FFFFLKKTGFFFFTQKGLNF 
PPLETPPLTPPKGLN 


858 


14759 


A 


865 


350 


3 


RVKNPRPFWGF*MVLKPLSFFSKKTN*I 
LFPIiKIFSPPKTVPWGKIFLGAL*NPFF 
CFKNPLWFFGF*KL/SFFFPPLYFF*KP 
LAPLKRRFSFFFFFFlj*DGVSLCRPGWS 
AVARS R 


859 


14760 


A 


866 


342 


118 


GS VTQAEVQWCDHGSLQPETPG IK*S/H 
PPTSSYQWGTTGVCHPGLAMLSRLVLS 
S WPQAI LS PWPPTVQGLQV 


860 


14761 


A 


867 




354 


VKPSP * PLTGALSAIiLMTFGLTM * IHFH 
SITLLILGLLSNTLTIY\Q*W\RDVTPE 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G = Glycine, H—Histidine, I==Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q = Glutaroine, K = Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possiblc 
nucleotide insertion 














STYHGHHTPPVQKGLRYGI I L FITS EGV 
FWAGSV*TFYRSSLSPTPQLGGHWPPTG 
ITPLN 


861 


14762 


A 


868 


3 


344 


QIKNPDNISC/WIRCGATGMFL.HCWWEC 
KLLQPLWKTT*HQ*SRR*AVPLiSGMFP / 
NRYSSTCTPASI*KTFLSALFMMTPL/C 
LSRVEWIK*SWCNQKL*Y*SVMKRSEEF 
SLKNA 


862 


14763 


A 


869 


345 


1 


KQAGLKNVKITG/ EGASVNQSR* VS / YL 
GTIKKI TEEKGYIiPERVFNAGASAVF * G 
KKLPQRTFISK/EEKQAP/GSEVGKDRL 
TLLFCANMSRFMISTALLYKAADLQSLK 
GKDKHRL 


863 


14764 


A 


870 


340 


45 


FCSFCRDRVSLCCIiGWSRAPGLK*SSCL 
S LPKC * DHRC /AA / VP AW FQRC I LEKKN 
QIY* CAERIL* SERPHTRHLDSSTVNIIj 
PCPLSKVIPWKKKNQIY 


864 


14765 


A 


871 


209 


72 


KLWDIHTMEYYYSAIKRNKPLKYTTWMD 
LKDIMLCEKANLKRSHTV 


865 


14766 


A 


872 


3 


353 


SFFLGPPPFFFFFFPPPQPLPIFFWRFF 
FCFSPPKKPPP/PFFL*GGSPPPPF*RG 
FFFF*KKKGGVFPPPFFFFFSPKVSPPP 
PKKGGGG/PPPPFFF 


866 


14767 


A 


873 


3 


267 


DFTMLARIjVSNS/WPGWAHTCDPSTLG 
G*GGW I TRSGVRDQPGQHGE I 


867 


14768 


A 


874 


1 


346 


PRRPFFFFFFFLIVFIYFKCVYLF*DKV 
LVCHPGWSAVPHHGSLQP*PLRIKQSSH 
LSLSSS/WDHRHAWLVFVSFYRDEVSPC 
LANFCIFL*RQDFSMLPRQ/VLKAICLP 
WPPSL 


868 


14769 


A 


875 


1 


172 


KLIiSSGSPPASASQSAGITGVSHRARPG 
LLNFFOCAFSVPGPCLGYPDTFTH/LC 
PPS FHQSVTAS LDFSCLS * I. * MS P 


869 


14770 


A 


876 


1 


348 


LLFCFNNYSFFHGVKPKIPAFFVFGPGC 
LiPHFFPPITPAPFFFLDRVLtiFPPGWRE 
MGPF*APPTFSPQG*GVFPPPPPQ*LGP 
RGPPPQPGVFFFFCIFGRDSA/LAILPR 
L 


870 


14771 


A 


877 


1 


203 


GFHYAGQASLEVLTS *STCLSLPKSWDY 
RRGP P / * PGLS YFL YPSLRS * F I CAMS I 
HIPFIKKKKASD 


871 


14772 


A 


878 


343 


102 


EWEDCPSPGGRGCSEPRSHHCTPAWVRE 
TLSPKKYljFLQP*Ki)KNTKIYLSISIP* 
ELET\HjKILjHFKMMNPLHSYNFFF 


872 


14773 


A 


879 


2 


359 


RDITKGDMQMETKHM*SFSTSXjTSGI \h 
NSPIRPSKMKNIDSTRYWRGCRAIRALI 
HCWWROMVQPFGE* ISSFSKKLNMYLS 
YDERPTFRYEK*K/P*VHPKICV*MFMA 
AFFLISPNW 


873 


14774 


A 


8S0 


205 


1 


FFRGVTEGL*EPPYVESVI /AGGTTARR 
PLFFFFFFFFFFLRWSFALVAQAGVQWR 
DLGSPRPPPPGFK 


874 


14775 


A 


881 


150 


2 


CRARVDGVPWRNPGSLKPPSP \GSSDPP 
TSAS QECG I TGAHHHTRLT * VF 


875 


14776 


A 


882 


345 


1 

■ 


KGNQPWKTEKRALFC/ TLKKKKI FKNSP 
PRGNF* KFFVKKKI PLKKEKGFS PTWGK 
KKEFFFKKKKKKKKK* * AKDLNRNFSRE 
DVQMAKKHMKRCSTSLIIREM* IKTTIR 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Scrine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YHF 


876 


14777 


A 


883 


2 


355 


DRLLFSASHLDLGTLYLLFGA* SGVLGT 
SF\SLLIRAELGQ\SGCLL*NDHIYWGr 
ETAHAFVMDLFIVRPIIIGGFGN*LGFP 
NKKGADMAFPRINNISF*LiLPPSLLLLL 
ASAIVEA 


877 


14778 


A 


884 


262 


2 


PTCEQSEREIKKTIQFTTESK*IKFVGI 
N1AQEAKDLYTETI KYKTLLKE IKDTNK 
* KN 1 1 YV\ S WTRRFNNFKMLGWAQWLMP 
VIPA 


878 


14779 


A 


885 


16 


318 


ILRADCADLFFFFFLGGKKGFCFPRLKG 
RGETLLN*TLIFR\VKGNF*PKLPKIW/ 
DFKGWPYHL/AENFVFFLKKGGLTLLPG 
LVLNS * IKEFFHLWPPKGVA 


879 


14780 


A 


886 


298 


322 


KRRTP* YPPGEPHFPPPPPQG*KARGAP 
PPPPKKGFF \ PEKKKKPGVMVQGTTQKP 
PKKTKVRVD 


880 


14781 


A 


887 


353 


3 


FPFWGFLKTLFGKALWLFKGFFFFAHQK 
FFFLPSLPPPP\LGFFLRGLT*FKIFSP 
LF*KGAPQKGISHPLFFFLISPKPPPPF 
FPFFFFPPPPFFFFFFFFFFS\RDRVLL> 
YCPGWGA 


881 


147S2 


A 


888 


22 


341 


IPCTCLKLHGKVHDHDQSQLWWPKPQEE 
MRTLENNLAVLQS I LQWLDI ELPYDRAI 
ICARE I KTYV\QQNCT*TFMLALCITAK 
KW/ KQLKCPSTDEWVSRMWYTCTR 


882 


14783 


A 


889 


51 


338 


ERS QLQWJL.MP 1 1 PSLFFFFETQFLFCCP 
GGRAWAI FKI FEFIA\PK* KQFSCLTLQ 
TI * VYGLNPPPRKNFVFLKERGLFHVGQ 
SGRDLPPSGDPP 


883 


14784 


A 


890 


1 


225 


GRLRPENHLNPGGRGCGELRLCYCTPAW 
VTERDTIS/RHTHT*NYFY*GSI*QLAN 
SCCCNMliNKFNVICILiRWC 


884 


14785 


A 


891 


357 


164 


GKGCSDSRLCHCTPAWAT\TKTLSQKKL 
ILKKQNKFRS+LDD* INMRCTVWPCFHV 
FLFIRAAPLFSDWLYNK*MNRNT 


885 


14786 


A 


892 


207 


302 


EP FSG III NE S IHQEG 1 1 VLNVYTSSNR 
PSKYMKQTLIELKGEKVKSTI IVGD/FY 
THLLVIDRTSR+KR 


886 


14787 


A 


893 


86 


332 


VMRVSCCCLKD* ISLCHPTWSAWQS* L 
TVASNSW\VKQSSFLGLPALWEAELGGS 
LEVRSLRPVWAT*TP\FCKNKKL 


887 


14788 


A 


894 


2 


336 


FFFWPPPQTFLKNPGGPGPKREEGPGGS 
PRGQKQGPPGLHLTGFGGQPGG* KNPEP 
GPGGRAPKGETRGEGGGPTRPQI P/QLI 
NGKSGKPPKVTLNGAWGTIKIFLIKTPV 
G 


888 


14789 


A 


895 


1 


214 


ARESLASFL.PPSLPPFLLGLRFLPSFLP 
FYFS FLPS FLGFGS FLPST* VPS FLPS F 
IiLFFPSFLPWLWFLPS/FLPS FLPSFLP 


889 


14790 


A 


896 


131 


352 


TLHESDSESVPRDFKISDALAVEDDQRS 
PGTLNAAEL.S/SSVRERKKKEK/KPEPG 
L*DQSIKESDSYMVSGGRIQ 


890 


14791 


A 


897 

898 j 


3 


184 


CS VAPAGVQCHDLTSLQS PPP \ S S SNP P 
TSSNPPTSAF*VAGTTGMCHHAHLI FVF 
LiVDAA 


891 


14792 


A 




327 ! 


264 


NRAVSLMNLDAKVL \ Kh I S AS Q I YVYI K 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F=Phenylalanine, 
G = Glycine, H— Histidine, l=lsoleucine 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q = Glutamine, R=Arginine S^Serine 
T=Threonine, V=Valine, 
W=Tryptophnn, Y= Tyrosine, 
X=Unknown, *=Stop codon, /=possible 

nucleotide insertion 














KMIH/HNQIRFTMRM*GQFI I *TAINI Y 
HKLIK+GRKHMIPLIYAVKIFEKI*YHN 
IMIKTLHKXi*IKEPPGSFIYSL 














TKAASHSQ\RANLQFLVGRIH*HLKSRT 
IS PGRVGATAAVNSTAI LEYLTAEVLEL 
ARNAS KDFTVKHTTPHYLQLAI RGDKEL 
GSLFKSTI AGKGVIPHIKKS 


893 


14794 


A 


900 


340 


1 


HLCPHKEHLY/ LL1QTSCPI1AI I * FIST I " 
AEHTKAPFDMAKVESELVSGFNIEYAAG 
PFALFFIAEYPNI IIINTLTVTIFLGPT 

FV 


894 


14795 


A 


901 


1 


252 


LTFFPQHLLGLiSGMPRRYSDYPDAYAT* 
SKRKVLIVEDPSINLE*LYGCPPSFPSF 


895 


14796 


A 


902 


326 


1 


LGPPPPLKEPPGVFPIFPFVFWNFGPV 
GPF* KI LIGPL.I LDFPQFFFGGVSQIVF 

PLFFFFFFL+DGVLLCHLGWSAVA 


896 


14797 


A 


903 


1 


352 


KG I FP VWSCG * KGTQKKGGLSQGGQGNV 
PPCGI HFVKE FLG FPGAQLL IS*GGRLI 
F*KGPKKGFLP*TVFG/RRAQFEAPR 


897 


14798 


A 


904 


325 


2 


RLEEKLNDYWNEMKVKKNTEYPLNLPVE 
DIQKRPDQTWVQCDACLKCRK* PYGMDQ 
HLEK\WYWCNNHDSQFRYCMVPKDPEDM 
DLVHPNYGKPYKKPSKETYWFSQMP 


898 


14799 


A 


905 


1 


329 


I GLAI HG AER I VRGQTAKS LAVHN \ VCE 
QKKIKKKKGGPGREFQNLKWENPKNPGG 
SF*RGJjGPQSNFFY2jKQ/RLGFFF*KKP 
PKI*ILAMEPPNKKIF*KNKNKAHFFF 


899 


14800 


A 


906 


148 


2 


DVDKL.FLLRSLPTL*RPEYGSYMIEGT\ " 
TGQPYGGTMSE FNTLEANMRIR 














ESKRS IFGPPNPPPPGFKGFSRHAPLWLi 
RSREPPHKAGVKKGFPRFRPGS*MPIP* 
SFLQNCLPQRVGFPSLPPFGAPFLKISL 
FLiRQESRSVTQAGV* \YGLLQCRPPRLK 
SFSCLSLPNSWDYNR 


901 


14802 


A 


908 


3 


297 


TKIKSL* INHLIRAKTVKLLEENMGINL 
HDLSSGRQQPFRYDTFQ/SMINTAIKEK 
I MKLS F VKMGS F I S QRTPLRKERQLTK* 
EKIFADVYLIIGRRG 


902 


14803 


A 


909 


169 


373 


ASNILSATDISNTFGPPGSQGFSGREAY 
VEAGT YYTNFS CLGQVKVF * YWMQL I VS 
IVLL*LL*VTQKL*L/GPRQEKFV*YVP 
AST+ASLPLKPCDPGGPKVFEMSVADS/ 
DV*SLWLALIGESKFRCLVF*SKFLPSS 
AYTYSPLEKKLFILLGCFVMICFCFCFL 
RRVL 


903 


14804 


A 


910 


132 


368 


GRIFLFVGQEKGARVSFLFLF*DRVSLC"" 
HPGWSAVAQSQLTTS /TWTQNNSSHLTP 
P * VAENHR CA/ PHT PNVL FL C 


904 


14805 


A 


911 


2 


339 


NSWAQE*AGITGSCHHTQLTLEF*AHDR 
QMMFFLFFLKTNFTFCPPA*RPWPGLGL 
TE\PLLPGLKHFF/CL.TLPSNWDKGHL.P 
P/HPYQI / CGFLRKNGASL.ISG 


905 


14806 


A 


912 


3 


381 


LNFCDTHPLTPRPVSIQRQCLPLVEAGI 
R WRAL I S PQLHPPT / YSL.LSS WDY.KHAA 
PHPANFFFFFFKKGSLTMFPRPGGS*IF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla!anine, 
G^GIycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














I YLLI YLERES Y/ AVAQAGGQG PNFNSG 
QPL.SPRFKQFSYLSLLR 


906 


14807 


A 


913 


394 


1 


MSKGKIG* KLG * KFAQKVS QFVNAKEKL 
LKE I KHFTLMNT * MI * KRNS /LI ADLEK 
ILWWIENKTGYSIP*TQNRIQSKAIiIL 
FYSMMTERGEEA\KKFEARRDYFMRFKE 
RSRLQNMKV*DEAANADGEAA 


907 


14808 


A 


914 


326 


200 


HNWLS I WKKVKSNCFLMPHTKINSRWI K 
DIKI/R/LHRIKILQENVGNHIYKI*RQ 
KRS+PRHWFLHKNCKCLVWHKTNK*NH* 
KI YL * CI * KTFAVFM 


908 


14809 


A 


915 


3 


334 


LVVRVKGPKFKFWLiHHQLGLYC* VISFF 
INKMKEYVLIHRIK*DHKNIF*MGYKKL 
*NVN I W * L VLFG FS S KNHL / W SGAVAHT 
FNPSTLGDRGRR 1 TQGQE FQ I SLTNMVK 














YQKLPQKKSPEPNAFTDKFYKIFK*LTA 
ILQ* LFQKI /E / KETPLPK* SNESSNAL 
I*KPDKDIT/SKNYKPISVSNIHAKILN 
KI * GYQVQQKQLYS K 














SLQPQS CRLKCS S CLSLS S CWDYKHKPP 
CLA/NFF/VFF*EKGICTVKN*NGDYPL 
GR R VT KRDHERG F * NAG K VLL VG W / CG F 
ETQSRFVPQAGGQGRNLGSLQALFPGLM 
PFSC 


911 


14812 


A 


918 


2 


321 


GIISGI YKELSQPKMTDS S I KK \KDLNR 
NFTKE YVQMACKHMKRCSAS LVI RE I K \ 
IKSTMR*H*\TPTRMTKIKD\DKNTKCW 
*GYGAVGMLIHCQW\NTKMVQPLWKN 














FFVFCLFSLVFISLNCGLF /PTFFRIPS 
L/PYLQLI * I YLFFY* LKP I FLKFFRDR 
VSLCHPGWSTVA* S * L I EAS KYW\ VQ * S 
SHLSLLSS*DY 


913 


14814 


A 


920 


239 




DSLIIjSAVQ*HDLGSL.QPPHPGIjKQ/FL 
PSNWHYRCMSPCLAYFLiLFLVETPFCHV 
TQASLELLGLSNLPASASQSAGI 


914 


14815 


A 


921 


1 


242 


PRPRRRLLKYF ILFNFLRQ / RSRSATQA 
E VQ * HDH S S L * PQT PRI»K \ H P P AS ET 


915 


14816 


A 


922 


249 


1 


AbFCALKKTLCGTLMFI AALF I I AE \SR 
HNQTQPTCLSTDEEINKMWHRHAMGNYS 
AI KRNE VL I YAKI * M YLENIML.S E 1 1 


916 


14817 


A 


923 


404 


1 


SRPQKKLGELKCS *RPKGLNKKNQRKPG 
TFFFFETKFHS FCQAG VQ WVYGC * SHP Y 
L LH 1 1 F P * * VF * M FL I L, I T V/ P S RNKDL 
IGNSEKQIHN/WSSLYFYF*LFFI J RWSI J 
TLVPQSGVRWRNIiGSLQPPLiPRFK 






A 








FFCSLLGHGGETPNNSPLPHPWL>*ETKS 
PPKPPHPLKKKKHSLFFKIFILKKKKNG 
PPFFPPAGPNPRG*NPPPPQ/RPPKGPP 
PPKEKINPPPPPQGKNFPKGKRPPLGPF 
PKKGGGY 


918 


14819 


A 


925 


268 


2 


HMYAFVKI DL.TAHLRFS AGKKE I PAGKL 
YF/L*RKNPKNQI GNQKKKKCLPRAGRM 
VAHAWNPSNLGGQGAGTT* SQALETSMG 
NMGKP 


919 


14820 


A 


926 


74 


338 


IAGITGVSH* ESKNS Y * KQFL / WPGTVA 
HSCNS SNLGGQGE W I T * GQE FETSLANM 
VKP 


920 


14821 | A . 


927 


339 


1 


LRLFACPPLPKWWDYRGEPPRQIFFSWH 
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Amino acid sequence (A=Alanine 
C=Cystetne, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucinc, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFKKQGFICPIEKGRELSNCCFQKEVLI 
* KKVI LKI * P FFFFFLMESCS VA* AG VQ 
WHDLQPLPP\NSWDYRCPPSRPAKFCIF 




14822 




928 


3 


416 


RCWWENKIN/PFWK/S/VWPCVTKVNIH 
SPYDLTISLQYLPYRDEYLCSYKNLMFL 
IVLFI ITKNYKQ PKCAS VG * WLR ILWYL 
/YMEBTYSAIKMNK 




14823 


A 


929 


344 


69 


YKRSPLIj\KSA*DGPANWKSLPRS*YKR 
SPLL I KAKPDQ I I SGWEAS PL 1 1 \ PRAL 
LQKLKPAASKF*W/NPKKKRALIS/KAI 
PSKKNKAEGIMFPNFKV*YRDTATKTA+ 
WWYKNSSSL 


923 


14824 


A 


930 


378 


3 


HGKKIDPRFIPYTKI/NLKWFIDLNVKP 

ktikfi.e*nige/tifvidhkk*np*kk 
klvn/wdfiki ks fcfvh* kpavgk* kr 
/qp*trrnyvqthtsdrglisrickelf 
*lnskvi ns i rkwaqsln 


924 


14825 • 


A 


931 


479 


83 


SRKGLGRGLKRGGF FPP P P P KGL.PGG VA 
PPGFEGFFGS1,* KG/DRG * PFLKGVKGE 
KGKRI*EGG/EKGGGFLPPPPPQKGFSK 
NFLGFYAGKKENP*GGSPPPPLFGPAQK 
ISLGKGENKVPPPPGNLFYLFIF 


925 


14826 


A 


932 


36 


440 


TTYASLDEAQLPRAKFNAFLTTF*HI IK 
NGP I LGKIGRKYMIADQ \GHRMKNHHCK 
IiTQVLNTHYVGPKRILLTGTPLQNKIiPE 
LWALLNFLLPTIFKSCSTFEQWFNAPFA 
MTGERVHLYEEET I LI IRRVHKVL 


926 


14827 


A 


933 


8 


398 


CLQGKEETLTTNWHLRYFLPFFFFFWPK 
KGARKNGGRPPGGSPKNQRAPCPKPFEN 
PGEKTPQKRGFFPPKPCPGPGPGPTGGQ 
GKPP/HPQRPQILPFVPQAGPQGRKGG* 
WNPPPPG*KGFPAPTPPRM 


927 


14828 


A 


934 


1 


441 


TRSHPRALKEVYTE INI VFMPANATS I L 
QPMD*GVILTLKAYYLRSTFCKAIAAVP 
SDS/SSDGSGQSQLKTFLKGFLILDAI* 
NIGDSWEEVKISTLTEMWKKLNPIFMND 
FEGFKTSVEEVTADWKITEEVQVESED 
GTEFLQSH 


928 


14829 


A 


935 


3 


383 


PGFKASKDRLTFSSGNNAAGNFKLKPVL 
I YHSENPRVLKNYAKS I LS VL YKWNNKA 
WMTAHLFTAWFTE \ P YVETYCS \ EKNI F 
FKILLLIDSVPSHPRALREI * KQMTIVY 
MP/STTTS ILQPMNQG 


929 


14830 


A 


936 


417 


1 


CFFSRDEVSPCGPGFSPSPD/HHDPPPP 
PSQCWDYRRDPLWVPPHICFLIHKKKRS 
SHMGSSMYDP*KPPHKWMKSPPPVSVLY 
GSIPVQVQIAPPPETNPVYFFSPPPPFW 
GGGRVCVCVCVSSEVCDFFGGDENVIV 


930 


14831 


A 


937 


412 


3 


KPTRVKKI NP P FFKNTKNKGGRVG GSLY 
SPFFGGLGQKNGFTPE AKGS INLKFPTA 
PKSWGTQQKSLFKKKKKKKSILKFIWNQ 
KINPKQI KQS *G I TLPE/FKLYYKEATV 
TKTAWYRYKNRPIDQWNGTDRNKATH 


931 


14832 


A 


938 


I 


416 


KNRHIDQCSGIERPEIDSHKYSQLIFDK 
GAKVI * *RNDNLFNKWYFNKWMSTC\RK 
INLDTDLTYLFQNGSQT*P*NIKLLENS 
MGGNLGDL/GMSQ*VSSSSSSRRIHDKK 
LVS/WDFIKIENICSEKGIIKRMKSQTT 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^-Phenylalanine, 
G=G)ycine,-H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vflline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 














D 


932 


14833 


A 


939 


4 


443 


DFDYNHDHDYAELGTRPGSVGQGSPDPQ 
FTPSRMGREGEGTHS IL* CSSLGMG VI A 
DLSTDPTELEKRALE VAGPDGQASAI S P 
ASPRRKAGDGGHRRALPGCT5LTGETTG 
KSGEAGQDGKPPGD/GPIGPYSLPGSGP 
GSGESMMG 


933 


14834 


A 


940 


3 


404 


LFM FLE FFF *KPRSC*VS QAGM QGC YLK 
ALQAPPPRFTP/SLLSS*DYWSLPPPLA 
NFLYF * ETRGFTGLTRMIS I PQPTEMPG 
LASQTAKLIFF* K/HRVLVES /HG*SAR 
AVHRGDLHILEP*TPGLK*YS\CISL 


934 


14835 


A 


941 


397 


1 


FVPNSQVANAKKKFLKEAESTTPVLSG I 
RKQNTFAADME*V*MVWIKDQPSYSIP:L 
SQSVIQ/SRALTLFDCTKAERNRRGKLQ 
WEPAV\EGSKGWLMRFKERSHLHNIKVQ 
DEAVSYPEDLDKMDAVNTKHQI 


935 


14836 


A 


942 


441 


29 


SVTLCKHTVHVPTFLRRGKRCPLFGQCQ 
PWPSPG/PHPLSMSPV*PRPQ\PIPtiHS 
APPLALGPFIPMLSMAGLPFPFQGSGLR 
NWKPPFPQPSLLQ* *HFPKPLKPIRGLR 
P FP PS TLS I PL PGTRPRAKLG VMRSR L 


936 


14837 


A 


943 


-5 


397 


ARDAAPEPGEHLLQGLSARHGLRPPRDS 
RPGPDXPPXSPPHPLPLPAVPTFVFSFG 
D S WLL * S P FFPAP WA3EGGACAPNPHAL 
VPSPSGQASGLRGGAPSWKAGLDTDGQQ 
AGRQS PGPAPPS P PLPPSQDC 


937 


14838 


A 


944 


3 


449 


LRQVWHEGEMPNKTTLIYHYTPI* ITNI 
KNTDNTKS LWGCAEARS FTHC / WMR I KM 
VKPLWETVQ* YLI * * KLQLPFNPAVALW 
\S I CPRKVKTY/CHRKNKTRS *MFI V/ A 
LFVTAQTKN 


938 


14839 


A 


945 


380 


175 


RAPAVPATQEAEVGGL / H * T WEDRLNPG 
GEV/CSEPKWCKCPPAWATKPNCVSKKK 
KKKEKRINHQHDEK 


939 


14840 


A 


946 


21 


331 


VAPLKYGPSKETI / IQSVRQSTECKKIF 
AYYPSDTGLITRIYEEI.*RLNRKEKLNS 
S VYKYAKDIjI * PFS KEDTQMANRR/H * K 
KCSTSLI IREMRFKTTMRVDAV 


940 


14841 


A 


947 


365 




GGGPFWFFFFFFLGGFLKIGKIWAKKKT 
GPSFPQKKKRGPKKKKPPGFFFFFFF*K 
GFVAPGGGQWEGFGSLQPLPPGVKQFFC 
PRFLRKW/DSRFFFFFLiI KSNAEQFPTL 
YAGVRMRYIK 


941 


14842 


A 


948 


357 


137 


ISAHCELRLPGSHHSPALLCS PGLSRTP 
NLKQSSLFSLPKCWDYR\LATVPGQLLL 
F*ARHCISIDPSLHLNNM 


942 


14843 


A 


949 


53 


352 


REDNHHKCI RNKM INRKWANEVNRCFSE 
E I QMVKKHE KLLAS PPIRSMQI KRYY\L 
NSLA* KKEKKSD\NTKLWQGFGETKSV* 
RYIIK/PYDPTIPLL.GIY 


943 


14844 


A 


950 


937 


3 


KVSPYKINIHK*VAFLCTNNIQAEKK/M 
NMVLFTIATNKKI/ sylvih* IKEVKYL 
YNKNYK \ LLKKNRDWANK*KGTPCS * I * 
RINI I KMS VLPKAT YRLSAISLKL * MQL 
FMELEKE \ T I LKVI WKNKTAHLAKATLT 
\ QSNPGGLTLPGL Y YKATVIHI TW /HCY 
KNTHVDQ * /NRRESPE ISLHFYTQLFSD 
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nucle- 
otide 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteinc, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Vnline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KVNRSKQ/WKNCLSNKWYWESWLIICRR 
RKLKLYLS PYKKCNS K* I KNLTVKLKAI 
KIVEHQKSTVLVTGLWKGFMTNTIKVNA 
TKIKVKN*HLIKLKNFCTPKET/T*QIC 
RQQRVWNKLFSNF 


944 


14845 


A 


951 


246 


1 


AASTKTGVQTKTCT * ML I AALVI TVKRE 
KQSKCSSVE\E*TMWYIHTMEYYSVTKR 
NDVLIRTTWMNLENSMLSKKDSHKG 


945 


14846 


A 


952 




255 


QGCGEKGTFLHCWWEYQPLWKTVW*FLK 
DPETDIPFDQAIPLLGIYPKEYQSLYYK 
DT \ AKT WNQ PKCQ LMVDG I KKMW Y I VDA 
A 


946 


14847 


A 


953 


343 


3 


GS I YTKEME PKVNNLQK* KAS CTDGL * G 
EFY * TFENE I PI LYNLFQKTE AKGTLTN 
SRDEASIILIPKPEKCTRRKENDK/P/I 
SIMS I DAKLLNKI LAN* IQQS R I YSRHT 
RLIQ 


947 


14848 


A 


954 


1 


349 


AQPPFILFILHP\IYCIFSS*S\R*QKT 
LKTT*FQGFIFCFFFFFK/QAKSCSRIK 
GRGPIIAKGTPELPGLRDPPTS PT*VAG 
TKGGGPHTQLKFIFIAK*FYISF*HNGK 
FCSRRGR 


948 


14849 


A 


955 


2 


351 


GLKNYAKSTLPVLYKWNN\KA* MTTHLF 
TAMSTE* FKPTVETYCS /EKKI PFKILQ 
LMDNAPSHPRALMEMYKEINAVFMPVNT 
ISILQPMDQGVISTFKSYYL/RNTFCKA 
IGAIDSDS 


949 


14850 


A 


956 


336 


3 


PTKENFEPDGF\TG+ FYQTFKELIPPQT 
FLKTSREYFSSSSSSSSSSSSSSSSSSS 
SSPYRPI/SLMNTD/AKILNKILTNQVR 
LHI KRNTHHDLVGF I PK\ WFNI RKTDQS 
NN 


950 


14851 


A 


957 


181 


1 


RWDLTMLPWLVSNS\GLRRSSH/LPKCW 
DYRFEPPCTA*GWFLIGPHWNVSTWGS 
PRXSRQ 


951 


14852 


A 


958 


3 


293 


GGLTSPHVKTY YQATVI KTAWNG* RGVC 
MDQ/YNKTENPETDPCKYSQLMFSEVTK 
AIQ*RKDSLVNIENWNN*MSIHKKSSSR 
KHLNQYLTPYTKL 


952 


14853 


A 


959 


350 


3 


NKKKGQSLVFRSPTLFFFFFL* IEMG\F 
IMLARLV*KS*RRNMTTSGSQSVGITGV 
SNHARPKRKLFLFSIITTIGWAGV\WWL 
TLVISALWDYRREPPRPAVLLPFHIQRS 
RMPLSN 


953 


14S54 


A 


960 


349 


1 


GGFPPPLFFFPF/PPPWGKPPPPFFYQ\ 
SFPPKKKGGPKPPFFFGSPWGVHKKQFL 
VKSPLPFFWKKKKPPPPF*KKFFPP\VF 
QKQPGRVKKPPYERNSWLLGPPTKKSPK 
KSTGS 


954 


14855 


A 


961 


379 


160 


PGPQGEPPFFLKNPQKLVGQGGRLF*AP 
LLRGVRQKNSLNPGGGGALKPGS /HLWP 
PSWGKKGDFLF*KKKKKKK 


955 


14856 


A 


962 


184 


2 


TFFSPKKIFPFFFFSKKFFPKGTFFS PR 
ENFFFFFFFF*DGV*L\CHPGWSAWPS 
RLTATS 


956 


14857 


A 


963 


1 


328 


ARLVLKS * PQ/CDLPASASESSGITGVS 
HCASASQSAGITGMSHHIRPKWISLYLG 
FWS FNKNVLHFFCAS S LEGES MNNELLS 
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ing to first 

amino acid 

residue of 
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residue of 
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sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, . 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKNWE FLDKKLHGS TKLGGNHVCVNF 


957 


14858 


A 


964 


1 


334 


HPLGGRVGGVPLGP/ WLNPPPPQ*GTP 
FFLKKPNNPGRGAKPVIPAPRGG*GGKF 
LLPPRGG / PSMNPNCP PAP P P 


958 


14859 


A 


965 


206 


360 


QKSMSLHQQ*QDIHSSQAHKNSSW\PGA 
VAHTCNPSTLGGQGMQITRSGDQD 


959 


14860 


A 


966 


10 


332 


NFFSPGAPPPPFPFGGGFFLKKGPPFF* 
KKKILGGPPPPKKNPPGVFPFSP/AKNL 
GFPPPPPPFFFF 


960 


14861 


A 


967 


1 


282 


RKCAKDLNRHSTKEDIWISTECI *HC*S 
LRKSTLKPRDMPHIY*ND*LYKKNDDIK 
CW*GYGVTGAVFHFHWVSRMVQPLLNNW 
TVPRNVKFTL 


961 


14862 


A 


968 


339 


195 


FFFCRDGVSPCCPGWSRTPRPKQSTCLS 
LPKCWDYR/RL*ATVPSLDYLF 


962 


14863 


A 


969 


3 


342 


IKKGPHPQLKKPPGVFPVFQFKKSYFPP 
PPPLFNNPFFFR\DKASLCHPGWSAVML 
SYFTAAYTSW\VRSSSHLSI»LSS*EWRH 
TQPCLFF*F*FL/CRSK 


963 


14864 


A 


970. 


384 


6 


GRKNVAGFPLSPLYTPPHGHGLGPPQTF 
GAGPPAHKSHQKVGRQKRGPGFVPPRPP 
AFLFFFFFKE*SVLEKKENNLYNSLFAY 
K I LNKVQGETQCEGRAH I C \ VCVCVC VC 
VS VCTC VHVCALA I C 


964 


14S65 


A 


971 


427 


109 


LNSQFELQEKRISNLEDNEKII*SEKES 
KNRMARN / E /QSL.RETWDTFG YTN IG IM 
ECPEGEEK/GKRKRKTI KKKNAANFPHI, 
MKNINLHIQKAL*TPTRITCTGPYGG 


965 


14866 


A 


972 


389 


145 


SLQP*TPRL,MLSSHLSLLSSWDYRCVPP 
HLANFLYF* *R*GFAMLGSSNSSA/SAS 
QSGGI TGVSHHAP WFTFKKKLVADT 


966 


14867 


A 


973 


170 


419 


VSFLFFFFFLERQFYFFSPGGGEGGEFG 
LIEPFP\PGFGNFSCLTLW/RRWD*GAP 
PPLPAYFGFLIK\TGFPLVGRGG1jDEjRP 
SR 


967 


14868 


A 


974 


212 


3 


QNSMIRYSRSMSRSCLCTTILILSLNRS 
LKSPR* /WPGWAHACNPSTLGGRGGRI 
TRSED* DHPGQHGETP 


968 


14869 


A 


975 


28 


356 


VFETSLFRNKKEIVKGTSLPDFRTYFKP 
L.I IESI * CLHGIGQNNAWNRVGNLQIDA 
/ECAENL>IYEKGGISIQW/SLFNK*CWS 
ITIHKEEKKIRPLHCIKIKGTKDFKKXS 


969 


14870 


A 


976 


350 


1 


LFCKEKVSLLCPGRFEPPGMKKSTLLSL 
PKMRGYR/RLAPPPQLEIFFHFFLTSKA 
TPL/CLGRSLKRLNSQMFSPFFSEGVSL 
CSQAGIQWRDFGSLQPAILWFK*FSCPS 
ILSSWK 


970 


14871 


A 


977 


4 


343 


PLHSSLGNIVRPHLLNNSNKNDTTKCW* 
GCEKPGFLIHCIWEYMMV*KLWKTDWQF 
LIKPNIHLPYDSAVAHLGIYPR*MKTFL 
QKIi/ CP*MFIAAVFLIAKNGKQLRCPSV 
SK 


971 


14872 


A 


978 


3 


424 


KRMRRQATDWEKI FAKG I SDKELLSKI H 
KELLKLNKKKS PKQVACFKMGKQT* TAH 
LH * TT YR / CRRVTHRKDHS TRS WLLREL 
QILNT 


972 


14873 


A 


979 


114 


325 


QQQQNN* IKKLAEDSNRDFSN/EDIQMA 
NK*RCSTSLVIMEMQVKVTVRYYYLTLI 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glvcine, H=Histidine, I=Isoleucine, 
K = L*ysine, L^Leucine, !VI=iVlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptoplian, Y=Tyrosine, 
X=Unknown, * = Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














S I A I I KQKKKQNP KK 


973 


14874 


A 


980 


339 




GPTGFKFNQGGPPPKGSFFL*KISPPS* 
NPPFFFGPQKGFKPFFPPPYLjNPGKNFF 
SKKPLFFFF*KKGFFKPPPFFFFFFFF* 
GRVS LCCPG WS AV \ A* SRLTVTSTSHVQ 
T 


974 


14875 


A 


981 


2 


377 


QMANKHMKSFSTSSAFMKM* IKNKMRSH 
DTCIRMANINTNDT\CW*GCRSNKIL,MH 
SW /WECKMVQLL*TMV*KFIiI KY /DTYH 
DPAI S LLGS I KKKS KFCT * / T / CKI L YL 
NVLFIMNPNWKQSNWYSMG 


975 


14876 


A 


982 


1 


457 


INE I EARNTTE KFNKTELTS SSSSSSSS 
PLVGLTKRKKSQN\NEGDITTDTIEI I / 
TIIKDYSE+IYA/NKIEQASSSSSSSKT 
YNLSGPHHEGTESLNRPIMSEEIESLSQ 
NLPTNRSPRADSFTGKFL+TSKDELPPT 
LiLK\ FQKTEQERTLPR 


976 




A 








T'PKAGQMQVTjFPQYGSKVARAPFS wal 
PPPPPFFHFFCFPQPHCSLPPQIPPPFS 
RFFLQVKSSQQGPPPSLLGWGEQ/NTLP 
FFFFF*SHSLALLPRLECSGAISTAA 


977 


14878 


A 


984 


365 


2 


KSSKL.FNYPPLFFFL.KNFGPKKK/SGPF 
FCPFKNFFNPRVPGPNFNFKGFFFLKKG 
PTVF*IKKFWWGPLPPLK*PSGVFRFFH 
FLIWDFRPRPPFFFFFFFLVERGFHHVG 
QASLELL.TS 


978 


14879 


A 


985 


1 


153 


GVQWCNYNSL*PRFPG\SGDFPTSAFQV 
AGITGVHHHTRLFFFFVGFFFNF 


979 


14880 


A 


986 


225 


1 


LYIYRERERERDRDRDRDVSHTHLYIHT 
RTHLDN I LFCH P GWNAVA* S W FT V ALU S 
WV\KSSSRPSLPSSSDSS 


980 












GTDEYILIALIWMVSWVCLFFFFEKKA 
PFCPPAGKTGANFGLRAPPPPG I KEI FW 
PP*PSEEG\EPRGPNPSRGNFWFFKKRG 
GSPL*PRLFG/HPRP*GNRPP* /PPQRG 
GNNKGDPPPPPGGIFF 


9S1 


14882 


A 


988 


55 


335 


HIYIDVFVSGSWLLTV/ISLLELTVFCY 
NVGAL * AS G\QAS ENRI S VSD FL.LPS FY 
LPKAGL* S FPAFMMMGHKI LI PEKVI PH 
MLEEE\TCTERP 


982 


14883 


A 


989 


295 




ILVTRNSEL*VLPEQRTRFLEKAMALRS 
I PLLKR/NGRPGTVAHACNPNTLGGQGG 
RIKRSGV*DQP. 


983 


14884 


A 


990 


2 


240 


KKISMNIiEHFMIiSERSQS * KTT YYMI I F 
T*KLQNRQIYK\TERIYIYIYIYIYIYL 
FFFFFLERKFLFLFPRGEGGGPF 


984 


14885 


A 


991 


463 


0 


GFRQLSCLS\LPSSWYYRHMSPHLANPF 
\TF*VERGFRHVGQAGIjELLTPNDSPSS 
ASQSAAIH/GMSHCTQP 


985 


14886 


A 


992 


82 


365 


E I CHQLYRS FLCSLFC/ DHPSNKSH* DT 
MNCVHFI IRLLNFSFFFFEKKFRFVPQV 
EGQGSNLG* LKVLPPRLKPFS CLTLRRS 
WGYRGPPPPPVN 


986 


14887 


A 


993 


3 


317 


GLKQSSCLGIjPKCWDYRHKPPCPTHIIF 
NTH* 1 1 KVLNVS FS FC I / PLCWS IALSD 
HVQ/PV+LYNMLVSSFLLLLLFFEIEW 
I LACLFCPVGSP ILFFF * YILTF 


987 


14888 


A 


994 


350 


139 


LiFRRLRWEQHLS \QGGQCCSEP * SHHCT 



584 



J>164835A2_I_> 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Asparric Acid, 
E=Glutamic Acid, F=Pheny)a)anine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Voline, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PAWATE *DPVSKI QQNKNQDTFSHTFI L 
ECRTSKGVSAFIAL. 


988 


14889. 


A 


995 


219 


1 


RPRRPLIQKTKAKNHHFTPTRMPVIKDN 
NKC*QGCGEV/WN/CRMVQPLWKT/IQF 
IiKMLNTELP*DRAI PLTGGKE 


989 


14890 


A 


996 


3 


498 


CLPWAAVARQPPSAHPFPQPGP*LRTLA 
PDTATAVNQALQRQES *TGC * S PSGWPA 
VPTPVAPG/PPSAG * GWDPRAWPRHNG 
PPGQHPKEKTYQPVPAP*VQLSPTRQTP 
APMACSSLLLPAIiPPPAPAARPPCPAPP 
FLPSRTSTLPPRRLSSTIKSKTPPGP 


990 


14891 


A 


997 


2 


419 


ADTEAAASYPDNL\ ANDESSYTKQQI FN 
VDKTAIiY WKKTPS RTS I AREGKS KPG FK 
TSKDKLTLLLGANAADDFKLKPMLTYHS 
KNSRAH KNYAKS T L C FRN * NKAWTEH L F 
I AWF / E YFKVTWTNCPEKK/ VLLLTHN 
APG 


991 


14892 


A 


998 


388 


405 


G* FFPL.SASKFFFCP * ALKLWMEGCH I R 
PPPQVRFPLQSSQA/GFISASLKGKGFP 
ASMPANIGPPKGYL*PAPPFFFFSFLRQ 
SHS VTCAG VQNRNS S LQPQP PG / LRRS S 
HLSLPGS WDH /MPP CPANFC I FCTGG 


992 


14893 


A 


999 


1 


423 


WILHD/NliRQPAQWLDQEAAPKHFPKPN 
SH/ QKKVMVTVCW SAAGLIHYS FLNPGE 
TSTIiEKYVQQIDEMH* KLQCLQPAL,INR 
RGP I LPQDNARPRVTQPMLQRLNKLGCD 
VLPHPPYSPDFLPTDYHFFKHFDNFLQG 
KHF 


993 


14894 


A 


1000 


2 


406 


SDLFRAI*ETTSVIVYDVSMDYLEIDSE 
M I RDLLNTFPV\HLE LQQDS SGVIQGAG 
I TEAS TINAKEIMHLLMKGWRQRTQEPT 
AANQTSSRSHAVLQVTVRQRSRVKNILQ 
EVRQGRLFMIDIiAGSERASQTQW 


994 


14895 


A 


1001 


290 


3 


VTQL P ETVPHFLRDR VS FCCSGS / DHS S 
LQPQTPGL.K\YPPTSAT*VGGTCHHAQL 
IFIFLRDGVS\SVAQAGVCWEDLSLL*P 
PPPGFKRliSCLRNR 


995 


14896 


A 


1002 


143 


408 


IKKKIKREI*KYLDISKNK/DTYQNHiWN 
ATKAVL+GKFIGI^A^^IFKNRKN*NK*F 
NSTSGKLKKLEKKEQTKLKISRKETIKI 
RAKINE 


996 


14897 


A 


1003 


84 


409 


DKLAALPSS WTLHPG S PLGVTRAP P P P P 
PPPPPPPPPRVLEPVPRSIiYPGIiAVPW 
PRALH* PPHPGSLPACPAPGLLGGTRQC 
S\QTILPKKKPPPLDADPANEPPPP 


997 


14898 


A 


1004 


392 


2 


EKNAVHFDQKNLVS I WEPLQVPTQVCIQ 
NKTNSWGIAGYPFHFLKRNQRFFNALKV 
PGPFS K * TIHLVNLKNLFFF * DR VS FCC 
PSWSAWKS * FTAASTSlA IKQSSCLNH 
PSS*DCRHTPL)L>FFVRMRS 


998 


14899 


A 


1005 


2 


421 


FVSLLLLITSAIVAAGAPTGRTVYPPLS 
GNYFHPGAFVHLTI FCLHLEGVSS I LRA 
INFIASIISIIPPAITQYQTPLFG*SGL 
ITAGLLLLSVPVLAAGITI \LLTEPSLF 
TTFFDPGGGGDPILYQHLF* FFGHPEVY 


999 


14900 


A 


1006 


1 


428 


D I H YG * 1 1 RYLHADGAS I FC ICLliLHTG 
RGLYYGSFLYSET* SMG 1 1 LLLATIATA 
IIGYVLP*GQISF*GATEITYLLSAIPY 



585 



JSDOCID. <WO 0164835A2_I_= 



WO 01/64835 
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SEQ JD 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
LSSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
locution 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Srop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IGTDLVQ* I*GGYSVDSPTLTRCFTCYF 
ILPVI IARLPALHL\LFLLETGSNKPIG 
LAS 


1000 


14901 


A 


1007 


1 


419 


S PI VP I VMAMACLMLTERI I LGYI QLRP 
G PNVGG P YRLLQP FADAI KFLS KEP\ LK 
PATSTITLiYISGPTLTLTIGLVL*TPIjP 
I PNPLVNLNLGLLFI LATSSLAGYS 1 1, * 
SG*ASNSNYTLIGALRAVAQTISYEVTL 


1001 


14902 


A 


1008 


411 


27 


GQPGQHRKTPSPLSIFYF/ELAGGGDAS 
T*FQLLRRLR*EARLSLGI*GCSEL*L>P 
PCPPAWVTK*DPVFGGRKKKERLSQLRK 
LRPREVKSLVQHYVAHKGGNWDPNPGPM 
ATQPGRSCTTLCVPL.CHR 


1002 


14903 


A 


1009 


21 


377 


M S LGG QQDS AQTTR S PQFAG FFGORSIP 
DRTGLGFHLCVCKCVCACVQVSVCKCVH 
MCACACMCTRTPMCECVQVCECVQVCVR 
VSMCVQVCACVRVCACVCSGACG/CTCV 
T*MSLACG 


1003 


14904 


A 


1010 


1 


412 


HKANTSTYLLTI/VNTFSG*VKACPTTHK 
TAEWASTLIEQI IPRF/GPAFISKIVK 
QVTTTLiDVNW KLHT P YHPQ S S GKVE RAN 
SLVKQHLI KUALKTRQSWVTLL.PFALAW 
LWAAPQSPTGINPFEL.LY/SAPLPLSN 


1004 


14905 


A 


1011 


1 


410 


FRAVAGAS RQENG A \ T VI LRD I ARAREN 
IHKSLAGSSGPGASSGTSGDHGELWRI 
AS LE VEN QS LRG WQE LEQA I S KLE ARL 
NELENS S P VHRSTAPHTQQVS PI APVEP 
PAEKPATP T * DDVDDD I DLFGRDND 




14906 




1012 


269 


32 


NI S R I YKE LLQLNNKKT \ S I S KRAQ YLN 
RNFTKDI *MAKKPMKRCPTLLVIREMQI 
KTTMRYYFTPARMATIKKTDNNEY 


1006 


14907 


A 


1013 


21 


443 


RIRKKNSYPHYVKSIVAYTFI IRLFPTT 
IFMCLDQEVIISN*H*ATTETTQIjSLSF 
KLDYLSIILIPVALVATWAI IQFSLi*YM 
\NSDPNIKQCLKDLLMFLMTILVLSTAN 
NLCQLLIG*EGA*MISFLLIS**YARAD 


1007 


14908 


A 


1014 


1 


309 


S S RAAA IHGGACL WLQ I LHR LR WEG * LS 
VGGQGCSKP * LHH* TLAWATERDFVSQK 
KKKKPQKANP\HLETSCKKKKKKPMLGP 
P K S QR KEG P KAPQGRTLWP 


1008 


14909 




. 1015 


3 


256 


EKNQII.KWTMDLSRHFSKK\NMQVGNRH 
MXKC * TS Li 1 1 RE I Q I KTTMK YHFTP VKM 
AI I WPGAGITGMSHRTQPSPTLLliTHW 
A 


1009 


14910 


A 


1016 


397 


3 


LFFSPPPKWGFFFFPPPRFFAPPPFFFF 
PPPPFFFFFCEKPFFFFFFPPGGGAPPL 
SPPFFFFFFFFPPPPPPFFFFK/SPPPP 
PKKK* KKK* KKKKKKTPNTPKKKPKKPK 
KKKKRAAARDLELVGRVGGRV 


1010 


14911 


A 


1017 


2 


164 


EKERRE * KREE * KERK \RKEGEREGGRE 
RERERKKKRKRERKKDRCMFIVHFRE 


1011 


14912 


A 


1018 


1 


406 


NSSPPKNKN*NK*KQHT**EKNFANYIS 
DI ELVFRT YNELLQLNKTGNP I KNS F* K 
NQFKNKGK * PNFY ITQSF* RNKP I * KF/ 
AKDLNKRFSKEDM* IANKRIKTCITPLA 
IRKRQIKTTVKSSFKA/IYMVWKET 


1012 


14913 


A 


1019 


302 


2 


IVEPLFLGWLCLFLQNPRVKGPGPYFSK 
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SEQ JD 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GKANPVVFPTV\SKQPFKKPAFPVYQKQ 
LPLF/PALVDKKKVL/RFFFFF*DRVLL 
\ CCPGWSAMVQSWLTTTSTS 


1013 


14914 


A 


1020 


47 


382 


AKIALLHSSLGDRVSHTHIHTHKRSNEI 
ELVIKSSPKKKSSGLNGFPIEFYCSFK* 
ELIPXFLKL.FQQ\VEAVGILPNL,F*EVS 
ITLIPK/PRQEQNSNKTIYRPISLMNIG 
GK 


1014 


14915 


A 


1021 


273 




QSKERRQRTQEEMGKTFHNKRENGTL.FG 
FSQSCLKDE I I I I I FETGFRS I SQAGVQ 
W/P/NHGSLQPRFPRLiKQSSCFSLPSSW 
DYMC*LPH 


1015 


14916 


A 


1022 


327 


1 


KFLKGW P IYLKKGFP F VPQGGGQ WANYR 
SLVPQIYGVK\YPSAFASKINSTMGMCP 
PAWVTFGMAQLINGD* IF/IFFLRQSFA 
LVAQAGVQWHDLRSPQPPPPGFKRFSC 


1016 


14917 


.A 


1023 


3 


407 


THLTQR * /HNIQTPP / WV* TGTLSTVLM 
NFYIiWM*FHFQSITLFILRLLTTKLTMY 
QGWLDVTQQRSYQGHHTPPVQKGLRYRI 
ILFITINTFFLEGLF*ALYHSSLFPTLH 
*RGDWT/PTGIIPLKPLEDPIiLNTFE 


1017 


14918 


A 


1024 


3 


425 


LPPQKKRGFPKIPPREF*KTPPKKKKII 
FPPPGKNWPPTK/RFIjKGPPPSRFIPFP 
FFPLPSSPSPP 


1018 


14919 


A 


1025 


430 


1 


GGFPPFPPKNFFFPLSP*IFG/RGVCPK 
FPPPKKRFFSKNPQRGFNNPPQKKKKIS 
SPPPVNFAPPRDLLKRPPPFFFFFFFFF 
FFFFFLFMHPSFPTSVPQWQITSFLCLL 
L I LQCC VCVL IGMWGDQSES S PSHHLQ 
PRWL 


1019 


14920 


A 


1026 


463 


494 


RKTGFPRLT*HLDIjDFFFFFFFWKGIFV 
APRAGGRGRNLG * LKP PPLGWAPFSGLT 
PPKSWN*GAPPPPPFNF/CVF*RKTGFP 
RLTR 


1020 


14921 


A 


1027 


606 


1 


AGGPGPPKWPGAFGWG/ SLNG* S * LKG 
KGS KGAP I PQSGPTVFPGGI P PKES PLR 
PPGGMGGRRGPRWPGASGC/VG*DSVLiR 
QP/GVTGEAPCPGASS PQASQAARRGSM 
GKGGAFHSVWGGCVLSAAWPGWQPPGSH 
G* / PGRRGGQSGCSCPKG* AWWRPCLAG 
SHSSAKHKGMSSEGPAPRGRAISPVSAG 
SIGPIAARGF 


1021 


14922 


A 


1028 


436 


1 


GGAPOGVXKRAAPFFFFWKQNFNFVARI 
IKGKGGVLSPRQFPPLGFKKFWGPTPL/ 
SRWGFKEGLKPPGS FWFFKKRWGFSMWP 
RWV*\IPGPKEL\PPPFSQRGGIQGGT/ 
HRPPIIPPSLF/CFFEMETHSVAQAGVR 
WRDLGSLQPL 


1022 


14923 


A 


1029 


1 


436 


PPKKIIFSPKP*IFWGGGGPNFPPPKKS 
FFSQNPPGVFFSPP*KKKKIFPPPPENW 
GPPKIF/YKRPPPPFFF 


1023 


14924 


A 


1030 


28 


366 


EDHLS PGGWGCNEL *LCHCTPA\ WRQOTC 
TL>S PKEKDHK 


1024 


14925 


A 


1031 


3 


142 


FDCSALQEFGTRLYCPSWSQIPELKRCT 
HLS LPKCWD YR /R* ATTPGLR I VLELiQK 
KLRR*CRELLYTPRSVTPNINDI*HWGG 
TFVTINE I I S IH * Y IA1AGLRFLS SSDV 
LTSASQNVGITGMSHHTWP 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
^V = X^yptopha^ ^ y = Xyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1025 


14926 


A 


1032 


7 


361 


PALRPSARGQQYYDETKNKTLCRNAQND 
SYLPDPNPFSRFSTL/DHSWHQLEAPDA 
* KAP FGLY WNCGARVCR\ QG I SAK* TEA 
CGLGTI KPS FFLI PLKQGELLGYPAYNE 
NLKKKKKK 


1026 


14927 


A 


1033 


472 


3 


TKQQMLNVDRTAFYKKMSSRTFIANAEK 
SMFGFRASKEG*LLGANTTGNF\LKAML 
IYHSKNPRTPKNEAKSTIiLVP*K*NNKA 
LVAAHLFTA * FTE YFK P P I E I QI I TADN 
APGHPR\MYKEMNVFMSAN\TSIL* PMY 
QGI ISCRIPAARIEVKGA 


1027 


14928 


A 


1034 


488 


0 


PLR* ILAQRSGIHSMKT*SGEAAAE*KS 
EARSSWFLRFKEKSHLHNIKAQDKAAST 
DGKAASSYLED\TD*DGNTK* ICNGDKT 
AFY * KEMPSRMFPAREELM PGFKAS KDM 
LTLLLRANAVGDVNLKPMMIYYTENPRA 
LKNYDKTQLC / PVLYKW 


1028 


14929 


A 


1035 


114 


491 


NYFKNFTFSNKLTL,/CLL*LFLL*KLFN 
F*TF*LLC/I*HIA*NTNTLYSCTKIFS 
FFIFLFYKHFLKFIYFFYFLDFII*GAY 
*SLCYY*WYFSCQFCQVLFHVFWGLCC 
HVLVRLLYFQTDEPFY 


1029 


14930 


A 


1036 


114 


496 


LLRAILTYLKYKISAMNLVSA*FIYLHL 
TYHCVF/DHPVQGR*LLNK* INELFCYR 
SFGF*WVFSYSHLSED*ALEEKYLRERS 
RWVK/DLNVTSETVKILEEILGKTLLDI 
GLGKEFMMKTSQANITKP 










381 




KNRGRKKNS * LGGF\SDKFYKTF* KLIQ 
IIYKlrFQKNEKEGTIi\*NSRPISLINIE 
AN I QN / RAVLAM I I QKH I KK I I RHNQ W 
F/IPGR*G*SNTCKSI\NVIHHIKQRRI 
KSIEF 


1031 


14932 


A 


1038 


134 


394 


SFCDKFTNCISESLCQ/SYSPTYIIKIF 
VLFKIRSGSITQPGVQWHDHSSPQP+TH 
SWAQT I FLLS SQWGTKGMCHHT WL /IF 
LFLFVF 


1032 


14933 


A 


1039 


483 


30 


YDGGSE I INYVLESRLIGTE KFHKVTND 
NLL.S RK YTVKGLKEGDTY EYRVS AVN I V 
GQGKPSFCTKPITCKDEL/AYVSTTIYT 
SETCTFVDL**DINKNDLPL*LQILAPP 



VSDOCID: <WO 0164835A2J_> 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine. 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine T 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AHIISQEKEIKKIQI*YKEEKLPLFTSS 
SLF I * VKNLMEFAKKLLEL INE YNKVER 
YKINIKNILLAKNTWTLKF 


1036 


14937 


A 


1043 


50 


457 


TKYQIANKNMKICSTLPIIRE/MQIRPH 
E I S LT P I MMALS E EQ KNM KY WGG CRE VE 
TLVHCWW*CKTVQSLRKTVHRFPK*LKT 
ELP*ELPPK/PGSQPJDICTPMFIAALFT 
IAKRFWKHPKCPSTDE* IKEMWYIQS 


1037 


14938 


A 


1044 


2 


469 


LKQSSCLGLP KCRDYR / R * AT VPG * F I L 
* NTT ED PNLT KN * I LNTNH / WCYSVQL I 
LGESWSTIKTSLSFSQASSHQNTTVQFli 
STSPSFKNCQWLAILPRK*GLTLSFCFA 
FERQGL / NS VAQAG AQWLHHGS LKAQP P 
RLKQ / FLPNC PTLM ECYRLQP 


3038 


14939 


A 


1045 


42 


200 


LICT*MLIVALLVIARNWKQIRC/SST/ 
DEWLNKLW* I PMEYY*AIEKEPTIGTC 


1039 


14940 


A 


1046 


475 


36 


LNVN* LNHP I KRQKLAG/ YVRKHDPTI C 
CLS \QQSRF I FKDTNRLNI KGWRK\TFH 
ANSNQKIA/GVAVYVSNKTN\FTRNKE* 
NYILIKGSIH**CIIIINICATNS*NLK 
I *NKLTEMKEE IHCSKI TVED / YNTLF* 
TMDRQKEFHRMQ 


1040 


14941 


A 


1047 


3 


326 


FFFFI III IFETERDSVSKI I I IIKNKK 
DLF * FLVTQVTPE ITNQYISTLPLQSKT 
KWDRPGTVAHSCNPSTLGGRGRRI/TLR 
SG VRDQ P DQHG 


1041 


14942 


A 


1048 


403 


15 


EVKNSSFSYRLMTEIMPFGYVSKLYNCD 
SGS * LS * S FNVCKMNRY * Y/LEERGNTS 
FKSLLLLMWKLLHSAIFTSA*TPRGL* I 
YFI YMYVtiETWSHS VAQAGMQWHNQGSL 
QFSPPGLKEFHRTDIVYAR 


1042 


14943 


A 


1049 


25 


457 


YMW WNSHHSCM FTLLFDS S FP P PSL CF I 
LRSLFLLYAELPLAFLLVKDSFILFV* I 
MSVFHLIIiKLQ*NIHNIYFTILTIFKCT 
VQWH*VYSHFSVAKTAT\ISRTCLIPSS 
WDYRHMPLCLANFFL*RQGLALLPRLVIj 
TSWLQ 


1043 


14944 


A 


1050 


14 


478 


FEPYCRH*TTRWW\CWWGYELVQSLWKA 
TEQYVLKPDICVF/LEPGVSLL.GMYSKE 
LCTLC\ YQKTRMFI ATLFALVKS * *LPK 
SS*MVE*ITKL*YFHTMEYYTAM/KKE* 
rTI*TTIWMNl/TIILNERSQTQK 


1044 


14945 


A 


1051 


2 


465 


GIDQPSKPIPL/TQSFTQTKA/L.TFFNS 
VKAERGEEAAE/ER/VEASSD*FMRFKQ 
RSHLHNI KVEGEAATADGETAGS \SYPE 
DPAITDEGSHTKQQIFSVDETACCWKIS 
SRTFAAKEKSMSVSKASKDRLTLLLRAS 
AAGDLKE FPRL V 


1045 


14946 


A 


1052 


392 


24 


DFAPRRKKRETRRSKINSLTS * * KEL.EN 
QVKTNQKGSRKQEITKIKVELREIETLK 
TFPKKKVNKSRSWFYEK\INKLDGTLAR 
QINKKRKENQ/DTIRNNKGDIITDPMEI 
♦TTIRGIPPLWR 


1046 


14947 


A 


1053 


204 


488 


CAVCYTQISLTEWVDL*SLDPGFETVSH"~ 
LFDRVNQP * DCRRVTDMLMVF* NLHFVI 
LFFLFWF/DFVFVLRQS FPWTQAGVQW 
RDLGSLQPLPPG 


1047 


14948 


A 


1054 


498 


2 


SWVCLFLS PGITSHTYVPM2FKIGAKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alaninc 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, "=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VHWWKSILFNKSCWRNLISTCIRMKVDP 
M/LTPGTKINTNWIKDLTPSAESIICLK 
ENIG/THFHDIRLGNAFWDMTPKA*ATK 
EN*IPWITSK*QT\FCAAKNTVN*VKR* 
PMD*E/RIFAKHISEKRLICVIYKEQLE 
LN 


1048 


14949 


A 


1055 


28 


209 


I DGRS RGLWW \ CWLASQLLWEAEAGGS 
LEPKSLRLQ*AMIM\DCTPPW 


1049 


14950 


A 


1056 


163 


479 


TSFLGSQSAGITSVSHCPSREVFFLKIil 
HWRQGGQVALLVATPHSPCCPQYRLAPI 
PRARHDFACASLI FVCILLVHVLLMPRS 
VAGRGVGSGLLG / CQAGR + L * TC 


1050 


14951 


A 


1057 


2 


1258 


SLFPWHFQPSRAASLWAKAIRDKVPGT 
RRWPVPTDGRPEGFALSEPHHSCLHGRH 
QGPERCRHMAASP*EAGNK*PKDKDGHP 
GERTETAAGVHTGCGGKGPAAAGGR*AQ 
GGSGEAGAAAGGAGAAVGGAGAAVGGSA 
GGRW PAGLLGQHGAG * GEGP CRQHGP P P 
GVSAGQAASPAKAAGQPGPGT*GAAGQP 
GRG * G PAG PRGG AAAE RAG AG AM P AQG P 
AGAAAEPAEGEARPGAGDYGPAANHPGA 
RTGAGGAEGAGAAAGGLPRPAQAHR\AQ 
I HG P VP LGTGGRSSS/GG I PDNMSHRLW 
QRHRSHGEASAGQRHPHPG P TGGER \GL 
QSMLSKIREVAQQGGLKVGLRGRALGDQ 
EEAPIQQQVFRIiCPGNLWR/RPPHHMRP 
*AVLL*NIFHISRRREDVMDPMPSSPIC 
PL 


1051 


14952 


A 


1058 


2 


437 


ERSVRTACCKQSRGLSSLLCPPPAPRLS 
♦TGSPVTGSSALEPRSPHPQPXPSIjRPR 
SPGPQPLHPLGPRSPSPSPEQTVFSHPA 
RLPSLSPERTQG\PKAQ*SOLPPPSLAL 
AQPAPAVGGREE VAS PACPS WKDKS RLR 
AVPGSA 


1052 


14953 


A 


1059 


47 


456 


TRCYGTLL.RLDAPGKLWTRKM* WASTY 
ARTDRKSSASFRPLLPLPRGFQLSLHFS 
LTSPSCLAFSANTHRGLRGENYHITK/C 
DMAPKV/HN* TQAWQWHDFGS PRP PPP 
GFGRSSCLGLPSGWDYSPIiWHHEQIL 


1053 


14954 


A 


1060 


484 


13 


FPTSASQNTGTTGVSQHAQP 1 1 F/LFVE " 
MGSPQV/GPASLKLPTSRDLPTPASRAA 
ESVSAHHH * FLNFL * RWASGFVAP S WS * 
TFLL\ KQSS CLSLPVC* DCRCYNH WAWP 
HSVFSSQFLSLFPFNFSFLFFLSFSFFK 
DRISLCRPGWEFHQTVQWSR 


1054 


14955 


A 


1061 


1 


494 


GSPGHPVCGRRRSALENPGQPCSLQPGV' ' 

VSGATGAIVRVQRTSSAWATAAAMGAYV 

E /TTRFRYT* KAGVGCDRLWGAWLKADG 

LGETKAEHTLHDG / PPEDAliYGLI EGDD 

TNFTIQGEWHCWYTGTLPDGTVFDTNI 

QTSAQKMM/NAKPLSFKV/GIGKVSH/G 

WOE 


1055 


14956 


A 


1062 


3 


395 


SDPSPCGGIRFDEMEKFLENYNLPKIiKE 
/DKINDPTFVK*IGSVMKIFHTVEP/PS 
HKCFTSE FY * TFKE * I AP I SHKL F * SRG 
IMGTATPPVMFHLQENVIPK\PTEKK*T 
YRPVSLMNIN\KILNKILTYK 


1056 


14957 


A | 1063 


402 


260 


WYCPPKV/LLRFSVYSSPPEVWAVGSIK 
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SEQ JD 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, Lr=Lcucine, M=IYlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AELYMliRPLiFPRTREVD*IF* ICPV/LK 
GS PKNKDF CP * GAKTFEY FY PATLGLNL 
JCI*IFRQRW 


1057 


14958 


A 


1064 


66 


379 


EFGRGCGEPKLRQCTPAWVTEEDPVSGG 
KKRKRKEREKT+REPSVSRPESH/PPSQ 
EAFWWQALSRPVL PNF I SGRS ASGLG K 
SEAIKSLFIiYLAVCSTTEEQA 


1058 


14959 


A 


1065 


223 


437 


KSCMSCDSNKIFFLRQGLSCSL.TQAGVQ 
WHYLGSLQPLPQGPK* * SHLS LLSNWDY 
RR\PPR 


1059 


14960 


A 


1066 


414 


65 


NKS PDRNELiAS * KCTQ* THTS IARDF FA 
KT IQWG KNS LFN I WC * DNW I STS KRI KL 
DPY I I P YTK\TNSKW I RDLHVKAKTI KL 
LE KNG P GAWC P CNP S YSGGLLE ARS LR 
PAWAT 


1060 


14961 


A 


1067 


418 


1 


L.WV FFF S RQ S L / D S VAQAGVQR CDLG S h 
QPPPPRFKRFSCLSLiLRSWDYRCRPPHV 

anfvflvetNgftmlarmvlns * PCDLP 

AAASQSAGITGVSHHARL*FCGF*YIHR 
VAQPSSQSI*DDHPIKKFCIIj*QPTPFR 
S 


1061 


14962 


A 


1068 


24 


440 


EV*NLYSENYKMLLKEMRGDLNKWKSIP 
+S/WI*RLNIVKMAVLPKLIYRFNLIPI 
RI PDE FFVKSTS * L / CKFI WNCKRLRI A 
KTIF*K*RTK/SGDLILTDFKTYYKTMV 
IRTI* Y*QKDRCID* WDRIES/PEMNPY 
IY 


1062 


14963 


A 


1069 


442 


26 


LRNQVSTPSSKTPR FFFFFFLKRQTGAQ 
RHNHSSL* PRTPGLKQPSTSASGVSRTI 
GASHHTPL I FFYFFN* GKEGWGGCSCFV 
AQAGLQL\RLQMILPPQPPLKCWN*RHK 
PPSLTYPIFLKRTP*SCFSKEKATRIPP 


1063 


14964 


A 


1070 


294 


46 


KNKLKRKERKEKNKKKAKIND I KNKSML 
EKIHKDKN* FFERNNKIDKLLATN/RIT 
KKTQ I V I FMNK I R D I TT YPTD I KNV KR 


1064 


14965 


A 


1071 


141 


469 


PKKQGVQLTQNAPLPFRVGGKRGLCPKK 
KKERERDRE I IS I I I I I IKLPTKKTPGP 
DGFTDELYQTYK* LTS I LLKVFLKKRRE 
CFL/PYSMDPAVPAIALNPPP 


1065 


14966 


A 


1072 


49 


472 


EFQ 1 1 K K* NS FFADTE KALWf I ENQNSH 
NMPLS QS L I QSKALTL FNSMNAER VGKD 
AEEKLEAS RGFWFMRFKKPS P +HKVQGE 
AANLDVEATASYPEDLPEIIDEGGHGKQ 
/QIFNMDEIVFYKKMPSRTFPAREERSM 
PG 


1066 


14967 


A 


1073 


82 


419 


LTVGFRETRSSSWVCSSSPKRALHAIjRF 
TPALKF* FFFWSFSRLFFFFLRDRVLLC 
CPGWSPTLGLK*SSCLGLPKCWN\YRYE 
PP 


1067 


14968 


A 


1074 


1 


407 


LEDGFFKITQSDKKEKKRI \KKCK*NLQ 
EVW\DYVKHSNLQVI E I PE * EVKCLENL 
FEEVIEANI / PHLAS YLDIQIQAVQRTQ 
RGYI ARQTS PKHI WRLSKVNMQEKI LN 
LPKEKHLITYKGNSIRLTAKPSPKPT 


1068 


14969 


A 


1075 


3 


396 


LHAYHI VKXSP* PLTGALSE \LLMTSGL 
TM * FHFHS ITLiLILS LIjTNT LT I YQ * WR 
DVTREST YQGHHTP P VQKGLRYG I ILFI 
TSEVFLFAGFF*AFYHSSLAPTPKLGGH 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine; 
Q=Glutamine, R=Arginine, S=Serine, 
T = Threonine, V = Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion' 














WP PTG I TPLNPLEVPLLNTS 


1069 


14970 


A 


1076 


250 


47 


VAACQNAFCTFSCYWL*NLLCRK\MDLD 
ISLLP*TKVNSRWIAGFNVRALTIKILE 
ENIGSAFGNRHSQ 


1070 


14971 


A 


1077 


.2 


393 


PLTGALSNLLLNS GLAMRDDLHS I TLL I 
LGLLTSTLTIYQG\WREVTQESTYQGHH 
TPPGQEGL.RYGTMLFITSEGFLYTGFF* 
AFYHSSLiAPTTQLGGHWPPSGITPLYPV 
ES PLLNTSALLA* GVS I T 


1071 


14972 


A 


1078 


2 


500 


LHVIGVSEGEEN\GMKQNKI FEEIMGPN 
F/PNLVKYINA*/IQEMQ*TPNGIHLKK 
TVHRYI I FQLVRTNNKERIL I VAREKWH 
GIFGGTML>*MNDDFSSEFIKARKIEMMF 
LKYKRGISSSAKHC*SRILCVAQISFVS 
KG/EITTFSDKRKLRAFIISRPAHRNAK 
GK 


1072 


14973 


A 


1079 


419 


1 


LRWSFALAAQAIVRWRNLGSIiQPLPPSS 
SNSPGSWRLQ * ANTAPLRS SLGNKNE TL> 
SQKKRK*KDNSEWEKIFANHASDKGLVS 
K* I KNSYNS I TKN\ DI * KWTKYLNRHFS 
KKD I KMANKHMKR C S TS 


1073 


14974 


A 


1080 


468 


2 


KLPPPRGAGSSAP/PLFFPPTKKGFFSP 
PPP* KFFFSPNPLIFLGGFFPIFPPPKK 
NFFLKNPKGFFFSPP*KKKKIFFFPP*I 
FAPPKFFFKSPPPLFFFFFFFFFFFFFF 
FFFFFFFLRTRGRTRGRTRGRTRG 


1074 


14975 


A 


1081 


467 


1 


PTYKFNAIPIKIQ* *FFKKIENCILKFL 
/WNLKEPQIAKTVXjK*NKVEGFTL>PDFK 
\LTVTKTVWS*HKDRHRDE/WK*TESLK 
IKPYKYDQFFFFFF*FSFL/HFF*YLTY 
SQHHKLI FNRMPKPFNKERLFNK* CWEN 
WIS I CKI I KLDHYLTPYTK 




14976 




1082 


387 


3 


LKQPPPHKSFVKNKNGVSLCCPG*F*TP 
GLKRSS\HFDLPKCWDFRCEAPVLSLLN 
NIKL * L/ CGRGL I KPQFP / S VK* K* * AP 
SWSAHLAETFFFFRQGLTLLPMPGVQWH 
DPGTVQPQTPGHK*SSHFSL 


1076 


14977 


A 


1083 


2 


435 


FHPPTNWGGFSPPPP*KFFFSPKALNFG 
GGVGPNFPPPKKRFFPKNPRGGFIYPPIi 
KKKNFSFPPPLKLAPPRV/SFKRPPPFF 
FF 


1077 


14978 


A 


1084 


447 


47 


PSRKFFFLAPPKRGFFPPFPLKNFFFPL 
GGLFFG/RGVFQFFPPPKKGFFFKKPRG 
VF*PPPKKKKNFFFPPRENLGPPGVFLK 
RPPPFFFFFFFFFFFFFFGRRFVK 


1078 


14979 




1085 


438 


2 


AFCNHPKGFSPFKKFSPPFGGGGIFIRGQ 
Ii*KSFFFFFKKPKFLV* KPPFKI FFF/ 1 
FFFKQRVLAL/ CPPGGNKGVPQGPFPIF 
FSRVGKPLiFFNPPKNWGKKNPPPTPGKI 
FFFFCSFFL*RQGLTTSHRLVLNSWA*A 
ILPLQPPK 


1079 


14980 


A 


1086 


43 


265 


IKHKLEYIKIKNFCASNSNR/TERQPME " 
WEKNMFANLISDKGLIPRIHKQLL*LN/ 
TKQQNSP I * KWVKDQLQWPYL 


1080 


14981 


A 


1087 


11 


412 


IASGLHDFFNKKKKKKKKKGGGPLKKTP 
GGPKFNRGGKKKIFSFKGGEKKTPGGFL 
EKNPFLGGGKMGQNPPKKKKPFGEKKNF 
* GERGEKKPKFPGGKKMSPS PQE /NKAP 



592 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SKQ ID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SKQ ID 
NO: in 
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09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno»vn, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RDDPRPPE 


1081 


14982 


A 


1088 


2 


292 


RS L PAS ADS S S L VAAS LAG VRDRVS T C W 
PGWS/AV/VQTL*SARLGLPKCWDYRCE 
PPCliAEAPSFMRSGKASCTLETVWEDKH 
KYEEAERRFYEHEAT 


1082 


14983 


A 


1089 


376 


3 


HIGLYIASCKTLLKEZKGS* IDILCSWI 
ERLNTVKKAIliLKLIYGFNTIAI/KIliS 
GLFIETERMILKITGKCNRSQTANTILK 
PNKVGRLALPNFKTYYKATVIQTVWSWH 
KDRWYGFFCVPTQI 


1083 


14984 


A 


1090 


277 


407 


Q I KAERSHHKQ I TS MRNVKHC S S ENYDS 
*RKPAPGYIIIK\F*K*LIKRKILNSGR 
LKNRPIKEE*RYAYQDISETIKVRR+EN 
DI FKVIAKEKKS CQPRLLYPLKI S * IE V 
KYFFDKSKLKEVI TNRS PV* EMLNI VLQ 
/INYDSPYEP 


1084 


14985 


A 


1091 


421 


2 


GHPGVIiFKG* *A*NEKSL*NCSFFNY*S 
SFSNMQTRIKNV/WPSTVAHACKFSTLG 
GRGGQ IT*GQEFET SLANTVKP 


1085 


14986 


A 


1092 


369 


3 


FLKEIRSVTPVNT* ENLIAD/ IGEKVLV 
VWIENQTSHNIPL.GQSLIQSKVL/TFNS 
MKTER * /EKLQKFEASRGWFMRFKERSH 
LH / N I KVQGEAE S AD VEAAG VY PDP AKT 
IVKGGYTQ* Q I FNVDV 


1086 


14987 


A 


1093 


71 


506 


FAEDNGL*LHPCSCKRHDLAVFYGCTSF 
VLTFGL*PWFLTQS/FLNPLEFS 


1087 


14988 


A 


1094 


118 


385 


SDLGKDFMAKTSKAQAAKTK\ I YTWDYI 
KLRSFFSAQ*TKQSTE*RRQSTE/WKEK 
IFADYSSNRGLISSRQETQTTQWLKKIN 
KNALCTL 


1088 


14989 


A 


1095 


419 


1 


DAKIPGQMVAR/RIPRPIKKIFCPDQVW 
F I PG MQGGFK I G KS INVAPH I QMG * KS F 
NNI QHPFMI * KKKNLSTGTTHEGDITQH 
/R*MLHEHTTNAILNREKLKAFPLRTGI 
RQGCLL*PLLFSIAVKT*P*AMRQEKEI 
N 


1089 


14990 


A 


1096 


34 


464 


NSSKKEKRKI PHDL,GLRFLDMTPKT*ST 
NGMMD /KDFI KI K/ FCAS KDT INKVKRE 
♦EKLQIIYL/DKRLVPRIYKDSYYPIRQ 
LKKWVNNLNKHCFKEDTQ I ANKHKKECS 
T\SIRE*QTKTR*RSHFISTRVTKIKMS 
G/N/NKYWQGC 


1090 


14991 


A 


1097 


458 


28 


R I I KVDL KMCTHN FDSLE E MNQ F FKKS R 
WPR+NR*FE*SYNH*RN*IHNLKVSKSS 
GPDAFTGEFYQTCEEELAP I LQNLFQKM 
ESIRSNSFYKVRITLIP*GDNGS/TKKE 
NYSWSFMNLDAKVLIKILAN* I*VFIK 
REFH 


1091 


14992 


A 


1098 


41 


500 


FWIRHFLiSRHKQRKNR*MDFIKrKNLYA 
SKDTVKEVKR*ST*WENYFQI/RMFDTG 
LVPRVYKEFI * LS WK\ DNHI * QWGKKAV 
SRGFSKGNTQMAKKHMQRCSMFFVIRKM 
*I\KPKMRYHFTPSSMT*EKNKDNTCFK 
SDGEYGILIYYYWGI 


1092 


14993 


A 


1099 


1 


484 


RRAPFFFFFFKRS FPFWARAP / LQWAFF 
GDVQNPPPRQIPPPGLHPF/SCPNPPKK 
WGYKG P P P TPGKI LGF* * KGGGPRLNKK 
IFISRPCDPPAPTP\QKVGIQGGTPPPR 
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SEQ ID 
NO: of 
nucleotide 


SEQID 
NO: of 
peptide 


M 
eth 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence' 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GlutaniicAcid, F=Phenyla'lanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PFFFFEMVSCSVAQAGVQSCDLRSLQAL 
PPGFEQFSCRGMPLKWIGSHPSPHP 


1093 


14994 


A 


1100 


144 


260 


ILILDSALGHPEPHSFNTKGIEWYLPL " 
NTMYI IQPLDQ 


1094 


14995 


A 


1101 


265 


457 


DPAAPLLVTYTREI \ *MYITKRYTAALF 
IKTKNWK\RPKCPSKGE*INKLWYICTM 
EYCSAIKNE 


1095 












RLRADI IjRI L VYNNFLQRLKKQR YL FTG 
PQYRFLFLELWLCVCVCVCVCVCVRANF 
* I ISRDKVLLCCPV\CFCFLNPPCVCVC 
VCVCVCVRACAI INKLLVETRS CS VAQS 


1096 


14997 


A 


1103 


465 


208 


QNMTKTFQDYRPISLMKIYARKQQYRYR 
Y*RMLHNYQMGFTLGLRGWFNI*K/SII 
IHIN*+EKKIMIILIESEEAFDKC*QSL 
II 




14998 




1104 


57 


250 


MTCSRMQ/RNI KLLRYKSEKL * EENSTS 
RNE I T I LS EEDS FTNVKLEN * MKTVKQK 
KKTSKKTGVYKMAIKKQISEFKGKTSNF 
Y*EQINT*EKKNFKENWSL 












378 


YVD PRQ CGG I L KG V LKGE FT I S NE KAN P 
GRGSPSSVSKD/L*CQNIKTIESKKTCL 
IQKLIKVKAEIYERESRKTMEEINETK/ 
SQFFEKI /NKID* / P LARLT KKKK KT Q V 
LKSEK\GNITTNHTEIKRV 












1 


VSIYVSYNLDSCKGQIRIKSCDDQYIFK 
RLYFFEITFLKFLN*K*VLKKIRDNVLIj 
C/YPGWPQTPGLKQSSCLSLPSSWDYRH 
VPLHLAPINYFLW 


1100 


15001 


A 


1107 


24 


397 


DIDHWEFETVFYHVGQAGLKLLTSDDP "' 
TALASQSAGITGVSHRTRPSSGHFNNTP 
EPPEASSLSSHPKLHKSPVT*NGAGL/Y 
GSSKLLS CVLNGP I SLVHSTLRLRKGGP 
QGNISQISLMAPP 


1101 


15002 


A 


1108 


64 


414 


AQLTPTQPQGRAALHKGHV*RKTAPTCL 
FMAEKNQAASFFLFPFPSICINKE/HFK 
KKKKKKKKKKKKKKKTRQKKKKG PEKPR 
KQPGGGIjLISKIIjWAPPPGFFLTREGP 
PPIFF 


1102 


15003 


A 


1109 


62 


475 


FEPLFYLMCLLNLFPLQLPRHPFIjFLTV 
DIjVNTWGCPLPSS pq * s wllaaphrstp 
pplssgfparrqlepgagargp/hhtqa 
lhlsfffvflrrsl/dsvaqagvqwrgl 
gslqplppgfv\ilssplslpslty 


1103 


15004 


A 


1110 


76 . 


477 


EEWKKLYRWRENISNLISDKELIC* IN/ 
NQTFN I QPLiKTKNPTKDVNDLKTFTNED 
TQKYTYIHQ*R\SHMKR*SP*LFI\KMQ 
MSTALRFLYPPITMDRIQI PENI KFWRG 
CGILIHSW*TRKMFHSL*KAFWPF 














KEERSETLiAKGKLIAAGA/YN*KEERSQ 
I NNLT LHL KGGKKTND KPS KR /QE / I * q 
IRAEISRIRPRKKKEKNNEFVFLKINKN 
FKTHSYIKKKREISNTKirNKRGDSTTD 
ATEMKKITRD*CEQPSSSSSSSSSSSSS 
SSSSSSSSSSSSSSSSSSSSSSSSSSSS 
SSSSPPKKKIPGPDNFTGEFYQTFQEEL 
MPALCKIFQ\NVQKPEGDI/PNPFYQVN 
IYLVPEPDE/QPFVIKTLKELVIYGN/Y 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
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26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valjne, 
W— Tryptophan, Y = Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














L/K* IK1 IHEKLTANIILNGEKLKSFPL 
GSGIR*QHLFLPLLFITVLEFLLRIISQ 
EKVIKRMQIGKEKV\KILFTDNRM*WV/ 
ENP/VNSQNIGKIMKFNKVSGYSNTHKS 
VAFP * TNNEQS ANKI LK 


1105 


15006 


A 


1112 


37 


466 


DRAAEFPTEN PLEL I SKAS KVE EFNVS I 
QRPIVFLCISNKQLENEILKIQFH*/PI 
S VAS KNS K YGG I NLR KYL * DMH I DNK / P 
LNKNRGGLCS W S ATS VLPKL I DNFSAI S 
GNYSRFFFFGE I EKLNLKP I WKYK* VRV 
AKTVSY 






A 








PGLN* * CWKNRISTCKRMKLDPNLAPYT 
KITSKWIQDINIRPVTIKILKENRGKGL 
QH*MWQ*FF*I*PQKQE*TNRTKSDFIK 
TAQQRKQQNERQSVEWEKI /FANQISNK 
ELLS KI KFRLLPLDNNKQLT * KWA 


1107 


15008 


A 


1114 


2 


391 


PLTKSRMAI KKETK \NYKCWQ * SGGIGN 
LiGNCW*ECKMF* PLWKIVLQFIjEKKKRN" 
TELP *NPLIP ILGIH/YKRNVCIQMFTE 
ALFIITK/SWKQPR 














I RD P LE EAVCP FADLKLHAGRTTTLFRA 
VRQGPLSL* KLLLPFVQLSHVPRGGVYR 
GSQASLSCGGLHPVQAS*LLCLPTQASA 
MADAP \ P PAS LP P CS S I SD CCASSERGS 
VGMGPSEPGMGYNLLVCRLLRPLEKPSI 
RVGVSLFSRYHM 


1 109 




A 








PKMKTDSRWILGLFF*FLRQSL/SSVIQ 
AG VE WRN PGS LQ P P P P R FKQ FS LENQG I 
PPHGLWS 


1110 


15011 


A 


1117 


34 


454 


IHISWEFLTS KLYGTLTSQYNKKTNIL 
I K* MGKRFNAH FT * EG I * I VNKHMKRL P 
TS S V I REM QTKTTVG F RS VLTRE AG I KQ 
TDS/NQAWVRKWSNSTLLCCWWKHRVLQ 
PTWN\TAWQLLIYKMIWQLHFYAQKDIY 
VN 


mi 


15012 


A 


1118 


83 


451 


CFLKFFLYRELISPFLL*Q\FVRPSPAF 
RRKPPPWVAFFFSDPS/FSFLVFFPKGI 
V FFLGDAL KKVLT * KKNF F FFGRD * VLL 
CCPGWSRTPALKHSSCLGLSKCWNSTTR 
P 


1112 


15013 


A 


1119 


316 


441 


FIFFFFLKTNFNFFAQVGGHNRNLG*LK 
LPLPGLKQFSCLTL 


1 113 


15014 










RQIDQWTRI ES * ETDPREYSQLI FDKGA 
NPIQWRK\LFNKWFWHNWTSTCKQIKKN 
LDIVLASFTKTSTSHRLRENLYRIHI 


1114 


15015 


A 


1121 


27 


490 


GTQLHSREKKNSPFNK* YWEN* ILTCKR 
MKLDSYFIAYTKINSKWIKDLNVSLLGL 
LAKIK *SPKCKIQNY* TSRR K* KGKDLG 
Q * FXGYDTKCKATE * KNK/LNKWDYIKL 
KNFCIA\NKTINRMKRQPTVWEEIIANI 
ISDKVLISRICKKTNR 


1115 


15016 


A 


1122 


490 


48 


EHTNNKRDTLTVDIGKVLVI * TEDQTSC 
N/ IPSNQSLIQSKALTLFNS I * AERGEE 
ATE/EKLEASKGWFMR* KERSHLYNIKV 
QGEAASADEEATAS YQEDLAE I IDENG \ 
KQQIFNSDKIALHQKKM/PREKSMPHFK 
VSKDRLTLLREF 


1116 


15017 


A 


1123 


2 


259 


IYGQLIFNKGVKTIQ*RKNSVFNRCYWD 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 
nucle- 

location 
correspon 

last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G~Glycine, H—Histidine, I^Isoleucine, 
K=Lysine, L=Leucine, M=l\1ethionine, 
N=Asparagine, P=Proline, 
Q—Giutaniine, R=Arginine, S = Serinc, 
T=Threonine, V=Y'aline, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














N+ISIWKRM\GIiDPYLTPHTKINSKWIK 
DLNIRTKTLICSSWRKKKRENFLKKKRK 


1117 


15018 


A 


1124 


206 


24 


CSIFFCIYLFIYIiFIYLFIYLRQSL/DS 
VTQAGVQWHYFSSLiQPLPPGT\GLF*RT 
SYSWDF 


1118 


15019 


A 


1125 


3 


351 


RRGRGFTILVWAQTPDLM/HPPALASQS 
AGITGVSHCARPICLFFKRQGF/NSVAQ 
ARVQWWDHGSLQLQPLTLR * Y YHLS \S W 
DYRRM/PAMLGFFVFYAYPGITLL.PRWP 
WISGLNELP 


1119 


15020 


A 


1126 


2 


349 


STNHKDIGTLYLLFGA*AGVLGTDLSLL 
I RAE LGQ PGNLLGNDHT YNVI VTAHAF V 
IIFFIVIPIIIGGL\GIWILP 


1120 


15021 


A 


1127 


362 


2 


RVSLHCPGWSQTPELKLSSCLSLPKC*D 
YRCE P PHPAYLTS F I TEDS KNFEYVTTL 
F*NNIY*TMFIEIPP*VHYIGAHLF*DQ 
LSTFF*SF*N*FFVCLFFRQSHSVAQAG 
\VQ*CDLGS 


1121 


15022 


A 


1128 


1 


349 


IYFLTRSHSVSRAGVQWHSSGSLQP*PP 
RFKRSSPPLSLLSSWDHRTWG/HMPP/H 
S S S FS S S LFLRSG WGGKGVXi WS 


1122 


15023 


A 


1129 


367 


24 


INCVFYPCKGVTKKPWYT/HF*RKGFII 
INI FHG PFKLT FVC P EGGKNS ETLYP FC 
FLPLS1AIKKSRPLGVFLV*KGPLSPPL 
QGNY GVG WVG F F F FS AFV I F I KKL VLiS P 
YYKN 


1123 


15024 


A 


1130 


61 


299 


IPRVDNYCGYVREHLFSFLFFFKRSLV/ 
SVTQAGA* WRDLSIjIjQPLPPGFKQF\SC 
FNLLKNHC YRDVPLLLNTLCI I FKT 


1124 


15025 


A 


1131 


2 


261 


FFFFL VFS FFLLRLFLLPS PS S P PPS VF 
FFLPS S FSL/ CLVSRDLLTLSQTKVQWR 
DLGSLQPLPSRFKRFSCLSLPSS*DYGR 
TPLW 


1125 


15026 


A 


1132 


2 


381 


EFYFFF*SSQMESPSIAQAGVHNRDLG/ 
SIAPYASWVQVIILPQPPPPGFESFSCLi 
RFPSTWDNRHAPPRLAKKRNKI 


1126 


15027 


A 


1133 


2 


221 


PSLLKKLARHSGRCL* SQLLRNLRNENR 
/CVPAWVTK/GETPPSKKKKKNPPPKGP 
GAQTFETPGFGKPKGKIKV 


1 127 


15028 


A 








ERGKQSPPLGPPIRRGLSPKPPPPEKGL 
TVFFFFFFRCKDGVSFCCPG/WISRTPG 
LK*SSCLGLPKCWDYRHEPPHPAPAIj 


1128 


15029 


A 


1135 


3 


391 


AVNTQMMRK*KSLIADVEEV*WWIEDQ 
TSHNI PLRQSLVQSKAL/T*RAKPSMKA 
ERGKEAAKGKLEASRG* FMRFKENN\RL 
HNIKVQGDTASADVEAAASYS *DQAKIT 
GGCGYTKQQIFNTDETAFY 


1129 


15030 


A 


1136 


107 


350 


TYSIVAIVMRFYFFIIFYFLRWSLAWI 
QACCCHPGNWHDGSLQPLPPGFHKFSCL 
SLLSS*GYRCPPPCQA\IFIFLVEMG 


1130 


15031 


A 


1137 


35 


297 


TLMHYTNRVKEKNI I ISIEAEKAFDRIQ 
HPHM/ IKYLNTIKAVYNRPTAS I ILSG* 
NLKAFPLKSVTQQGCPLSPLLFNI I LEV 
LSTRP 


1131 


15032 


A 


1138 


378 


1 


PFTKINLKW I IDLNI KHKTI KliLED I IG 
ENI,DNLGHDYDILDPTPKA*AMKKI LIN 
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SEQ1D 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methioninc, 
N=Asparagine, P=Proline, 
Q=Glutamme, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














VALC *NHES FCS ENDTV I RMRK* ATDRK 
KIFAGDMIKDCYSKC/NEKAEVNNMKKN 
NPIYKWAKDLNMLL 


1132 


15033 


A 


1139 


271 


361 


KW/YWNPIHIISQVCL*GPEIYQHVYGQ 
MTKQRCQGNSM/WAKDSLFNKWC*NKWI 
FF/C/RI* NLNS YLTPYRKI K * KW I RDQ 
NITVTTKLLEENMRFSFAFGIGKD 


1133 


15034 


A 


1140 


28 


340 


LLYF* YKRGFTML>SRLVSNSRPQ/CDPP 
TSASHTAGIADGSHHARLLFFLEK\EPA 
FGPPAGRKGANFG+REPSPPGFRGIPP/ 
LPPPGNWD\YGGPPPPKPNFGFF 


1134 


15035 


A 


1141 


94 


128 


GSQMPRHLVE * MTRHLATLRES \ CYSR \ 
VYPRFIEFLHFDIQSTGQKSHR 


1)35 


15036 


A 


1142 


85 


492 


VWVGLLSLEGS PS KFGNFI EFGVLLS SG 
GFSAWRLFFFVYFLRQSL/NSVAQAGVQ 
QWRDLSSLQPLPPGFK*VLKQRGVCLFV 
CFETESHSIAQAGTQWCDLGSLQPLSPE 
FKRFSCLRNSRSLRRDIEPSEGNQC 


1136 


15037 


A 


1143 


372 


3 


KMNR*PISIKEIDFIVKNLPK\KFGPDG 
FTGILY\RHFKKEIIQTVCNLFQKTEKE 
GMLS \ I YKASIAQI PKPEKDKQTKATD / 
YKPSGIEAKIINRIIiANGIQQYI\HDQV 
R FTPGMQG WSN I Q I 


1137 


15038 


A 


1144 


56 


482 


TMKTL LY WQ KNG * VDQ WNG IESPEIDPY 
K\YIQLVFDRKAKTCNV/RKDSLVNKWC 
TYAKKKKTIiNIi FFTPFTKITSKT YLKNW 
N/SVNFLNTPLGKTLGDLGFPRDFKFLN 
TPKAKSLiEKH/MDT1jNFFKGKICS*KNM 
VKKNKKT 


1138 


15039 


A 


1145 


190 


482 


RQGLALSPRLECSGVIIAYRSLKLiVGSS 
DPPTLASQIAGITGIRHCPWPKTLFLAS 
VIMPAHNS*PSLSPVPSP\SLPSPPLIiA 
SQSRRSQ 


J139 


15040 


A 


1146 


138 


485 


IFFPSV* TI FLLLALFFS * *MY1>I FVKS 
SLSIYSVYQF/SFLCFWSPISDPKSQRF 
SLLSFIVCFFVFPRNRVLLCHPGWSAW 
* S + LTAVS NS \ LLKQS S 


1140 


15041 


A 


1147 


101 


426 


GDWKKFYI YKHS ESKS PLILFEKKKGVL 
EEYS S FD I * * AI KVI YHISRKRK\ KNQL, 
ANLTDAEKALNKI * HP FMRKTTQQFSNR 
S FLHLRKG I YKKPTANMI I KEQ X I F 


1141 


15042 


A 


1148 


458 


15 


VFVCLiKI CKHRKGTAKKWY YNFMGLPLY 
MRS WDQNGTM * KMT I YAPN I GAP KY I E 
Q I LMGANRE VNSNT I LLGDFS I PLS TL / 
DKELPELNFT*NKTDLTDIYRTFHPMAA 
KYTYFAGTHGT FS KMCP VGFD I RV I LVE 
FRRSTYGLDW 


1142 


15043 


A 


1149 


1 


507 


KGPPAPPPYYKKKFQGGPKKFPP*SHPF 
KRPRGEDP/YKPRNLKPPWAKKKNPPFP 
KXKQRGKGPKNPPPWKVKPEKSF*PSPK 
KTKIWPPPPPGGQKKKKKPPPPKKKKKK 
KSVSSSPKNI/YRSLFCTNLKKSFPT*F 
*NIiMP/TKLPV*EFPPPPYNSF*SAP 


1143 


15044 


A 


1150 


436 


32 


NLCMYVYMYLIYVRTYVCIYSI YVS I YL 
IDLSLYHHHYLSICMCVSIYPIYLSLIY 
LCrSLIYHLFNLCIYISIYLIYLS/YVC 
M/SS I * SMYVCIYL I YVCI FS I S LiYYLY 
I YLS I I T I I YLYLCMYCLFYLS S I E F 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


1 M 
eth 
[ od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=GJycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methioninc, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vaiine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1144 


15045 


A 


1151 


3 


657 


QDCKIQNEHKIQNTLFLYTRNAHIKIKF 
HLHSKKIIjAVNiiTKHV*NFYAENYTTLM 
KEKI*IERNKDLHRKQ\NLNPCIWIVKL 
NIVKIPVI\PHR+RSLTIPIIIPAK\FF 
GDKEKIILKFIWKGKGIR*LRF+KIRIK 
GGLNLPN\LRLTWTVIKIV*YWWKERD 
RYLNQ\WNKIENS/EKLKPRKYVQLISF 
VFDTSG 


1145 


15046 


A 


1152 


430 


438 


TFWVKKFFLLNLPK* INPGFKKKKNKRG 
G*KRP/SGSQVLKKLRGKIALTPEGKGK 
IRDCFCPPPPPLRKKKKPPIRP*KKKKK 
KRKRDPPFGNHYSNS YRHTS CYFTLLHL 
TLQI FFTSRSPVAMS CRINL 








1153 


104 


478 


ELLLEGSPCLRAESKAGPAGRLVPTLLS 
WRVQ S PG VL C AWE EECEEPPSTGE DAPS 
LAIRETQI KFITRNHLS PIiSMAI FQTNN 
DN\RTKRK*CWHGCGEIG/T/LLHCWWE 
/CKLTFP 


1147 


15048 


A 


1154 


171 


476 


NSSDYYYYFLRQSFTLVPQAGVQWHDLG 
SLQVTSDDPDLR* FARLSLP KCWDYRRE 
P/RMSDSNYFLK*VPVN*KARA*L.LFIP 
S S CQAG / AS AGQTL.S VS 


1148 


15049 


A 


1155 


3 


562 


PDYQTARRRQAPADRCQPGPAATICPR* 
PKARSSSMNRSNPTVTRTAGP * PSWKTG 
AASKSQTGDGDRKELIPPNRTPNN*VST 
AAI K * QVRP YLTQKVPAG * TFCPHQPCLQ 
GVRPLCRGPP\RPFGS PAVTDNRQAKLK 
TPTQ/KPADPPEQKRPPARPPVGRHAEV 
KQPGPPFPAGPRPAPSTDGL 


1149 


15050 


A 


1156 


67 


417 


TLSCETQPGQHGEAPS/ PTKNA* IALPL 
SWARPLYFPAPARFTARGGPVFTSKRVP 
SPPNRRGWRPPP\QFCHAWNPSPKFLAP 
KIPPT/GPLPPNKEPSKENVNPG/PLCP 
L 


1150 


15051 


A 


1157 


446 


87 


PDCINVISCFLTKIYHVFFFL*RQGLiTK 
LPRLVSNS*AQAir.PP/SASQSAGITGV 
SLHTRPIMYFSSYYIIFGDTVLF\ETQS 
H PAAQ AG VQ * CNLG SLQPPPPGFKRFSC 
LGLPSRRMA 


1151 


15052 


A 


1158 


1 


938 


FFFFIFATYLFNKGK\LTGKRQSD*SIC 
KHLKGYYIKKDILTASKH/V/KKCPTSL 
airemqiltgiikcc*rcegngifthyw 
*ARALiAQLLWKNV*QHILICIjSMCITYNP 
TILFLGISNRKVYM/ CCPK/DPCSRMSR 
AAQFVI 


1152 


15053 


A 


1159 


370 


3 


afkaaaegaamsvtggqpvlnswwlgg 
vtttt/ cpttfimpdntv\rgdileltr 
\plgaqvamavhqwldxpek*nkhk/liv 
vteed velm y qkalmmm vqnnkkaag i m 

YTFNAHAAIDSFHR 


1153 


15054 


A 


1160 


55 


471 


S P PPGLPKKI FFWPLS FYVWP* PQRFCP 
L FAP FKQESNP LKVW P P FGALQNKRS KG 
CSPSM/SARQES*PLFPKNIjETPPFGGK 
FLKFFFFFF *DGVSLGHPGWSAI / IDSL 
QPLPPGFKQ/SLCLSLPAS 


1154 


15055 


A 


1161 


3 


467 


GYTNQQIFPVDKTAFY*KMPSRTFMARE 
KSMSGFKPSK/DRAGDFKLRSMLIYHSE 
NPRALKNYPKSTLPVL\KNKAWMKHFCIj 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cystcine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phen>lalanine, 
G=G!ycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryprophan, V=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














QPPVETYCSGKKIPFKILLVIDNVPCHP 
GALMEMYEEINVVNTATNTTS IL* P/ VT 
SGVIS 


1155 


15056 


A 


1162 


216 


464 


S SEGPGS PGELPYGAQEFVMKAPQG I LV 
IR*T* FFETESHSVAEAGVQWCDLSLLQ 
LPPLRFNQFCLSLP\SWDYRRPERHPAN 


1156 


15057 


A 


1163 


29 


433 


AVEFGGDSCSPQHPLNPPLGSPQHSPPP 
LIGICT/REE*GWGGGLPVPCPPALPYP 
STPSWGGICVCYGRGGVAPPP*HPPLT* 
LGGGSTPTSALPAPGAPRHPGHPLMVLQ 
TLPWGPHTPARKPYINKVLSCVDF 


1157 


15058 


A 


1164 


486 


11 


STCLGLPKCWDYRREPPRPALRCPFFLN 
ILLFLKFTLSEINIPTPAFF*LVFAWYI 
FFSLFSF* P/DLEHYI + SGICVDTM+VG 
SSCFFNPT/WSISSF*LVCLDHLYFFLF 
FFFLRQSL/DSVAQAGVOGQDLGSLQPP 
PPWFKRFSCLSRIPAHFEWSRA 


1158 


15059 


A 


1 165 


467 


131 


NRSLEWAKMYKTYDEIKLVTNLPT/RKS 
PGLCKITAKFYHIYfCDGLVLLLNILQEI 
QVGFHPNS * YQ \ 1 1 L I PKYSEGTTKKGN 
CRPIFLVNIETKILHEIRAI *VHKQIRT 
LE 


1159 


15060 


A 


1166 


455 


32 


EEGVLKAKRGCKSGLLHQTVNQVVN/AN 
LSKFLKEIKSDTPVWI *MIRK*NNLTAD 
KEKVA* IQDQTSNNIPLTQSLIQSRALT 
LFNSMKTERGEEAGEEKFEASRG * FMG F 
KEKSHPHKHKSASKAASADVEAVASYLN 
S 


1160 


15061 


A 


1167 


26 


418 


KIWDYVKQTNL*ITGIPERGGEKVNNLE 
NIFEAIVQ/ESVPSIFKEVDTQEQEIQR 
ASSSSSS 


1161 


15062 


A 


1168 


64 


313 


KWCKGNSYISKDLKELKYLGV/QLK*VQ 
/ DL* S ENYQ I LLKE I KEDLNKWKD IPCS . 
* IRRLNI KMVIFS KL I YRFNAI P I KI LA 
A 


1162 


15063 


A 


1169 


464 


36 


QQAEAAESL \ DPGGRGCSELRS CHCTPA 
WATEQS INK* SINQS I K* KSKLC/CWEN 
TLVKHIFHKRL.TSRIYKEL\QQLNKKTN 
NSL*K*EKDVNRYFIKKIYEDI*MAMEN 
IL\IKLVIREIQINLKEWLSWFCCCCCC 
FVLFVF 


1163 


15064 


A 


1170 


290 


487 


GSLPHHTPKR*PFLF/CVFETGSRSVT* 
AGVQWHNHGSL.QP* PPGLKRSSHLSLPV 
LI FLFSVEMGV 


1164 


15065 


A 


1171 


2 


413 


GKVFI * FSTLH/GCNMK* SRS \LL* ALR 
FI FLFTARGLTGI VLANSSLH I TLHDTY 
YMGTHFHYVLY I GAE FA I I GG F IH * LP L 
FSGYTLDQTYSKIHYTIILIDINLTFFP 
HHSLGLSGRPRRY*DYPDAYTTRYILS 


1165 


15066 


A 


1172 


381 


2 


PPKLQKNFFFSSTGKFFLGGGRAFSPPP ' 

kkgffsqiprrfffplpkkkkiyfcppp 
cfgppptfflrappfffffflllfcsfl 
vgvkfilfl/fylfi frdtv*lcypgws 
awqswltaaltprp 


1166 


15067 


A 


1173 


9 


405 


NLDKKGRNRTPQS WFQANPMASMTFSKK 
KKKKKKKKKGGGALKKKPWGAQKKPGKK 
KKNFFLKGG\ E KKTPRG ILEKKP FFGGG 
KKGPNPPKKKKPLREKKKF*GEKGEKKP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


1 Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GJutamic Acid, F=Pneny)alanine, 
G=Glycine, H=Histidine, l=lsoleucine 
K=Lysine, L=Leucine, M=IVlethionine 
N=Asparngine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














*KFFFKKKFPPPSKKTP 


1167 


- 15068 


A 


1174 


410 


0 


FSYYPPPTRGGCPSSPPQIFLPPPLLGV 
FFPFSPLKIFFFPRGFKFFGGWPFFSP 
PKKKVFFKNPRSVYKNPPKKEKK*Sr J QP 
PG*VWAPPGIFKR/PPPPFFFYIFS 


1 168 


15069 


A 


1176 


319 


3 


KEARSVFRVEGRG*KNFSASGPPGGGNP 
GGGRPGRGNFVFLIKKGGPPPGPKG/SP 
FFDF/GGPPPPPPPSGGGSGVNPPPPPF 
FFFFEMESRSVSQSGVPDAWADAW 


1169 


15070 


A 


1177 


385 


1 


SAS FGLPKCWDYRHEPTRPAS VSVTVTS 
SRLSGA+ARSGKGTLVFWAQMVFK\RI P 
LTKYSDHSREPSSLQLCMQSTPSKAEFT 
VAKAD KRLCCTAKS S TAKS I AQ I KCQND 
AGRSPQESLHVSGRV 


1 170 


15071 


A 


1 178 






KLKE I KKLiLE ENAG I NL« YDLRLGSG FLD 
MTPKAKQQKKENLKWDVI RMKNSCAS KD 
TITFYN\ YRSDKGL V/S KKYKELNS I TE 
RQPNF*KDLNKDFSK/EQMAKST*KTTE 
RLFIREMKTKTT/ME/YHFLSTRMAKI * 
KDSNR / RLCRKTG TL I HCW * GTAPANLL 
KNCQPLFPF*GTVWQFLKRLN1,ELPDDP 
AIPPLVICPKEMKT/C/CYTEICTQMFT 
AA*III 


1171 


15072 


A 


1179 


1 


398 


SRSRHCTPAWVRVTIiVLKKKEKEKEKEE 
KKRCPCYIVSHQSL>*KPAWQFIj/RNVK/ 
LELP YDPAI / PLLGKCLKE I * KYAYTKT 
CM*MFIALFIIAKKYKQLKRLSTDEWIN 
KMWYIHAI. 


1172 


15073 


A 


1180 




389 


GYDRVI PNP * PLTGA\ LRALLLTCGLAM 
* FYLQS MWVL I LGLLTDTLT I YHR WS DV 
ARKSTYQGHHS PPAQKGLRYG IML YMTS 
E VLLLGGLF *AFYHS\ SLCPT PQLGGH W 
APTGMTPLDPVEVPLLNT 


1173 


15074 


A 


1181 


354 


2 


ANS SDAC I LDGHRAL PLQLP PCG YS I S P 
AAQSST VPMPLLL I P P PHCNRTPSLWHY 
S PASNPTNPSPY \ * TAPSPP P P I PRNNP 
FS*KFPYYVWVYTSLTTCSQPPVSSPNF 
TQPSVL 


1174 


15075 


A 


1182 


3 


384 


GANVSG/DL/KLKPVLIYYSSNPR/ALK 
NYARSILPMLCKQEKKEKKKK\AWMTAH 
LFTAWFT* YFKSTGET/CAKEKI PFKIL 
VL.IDSACSHPRALMEMCKEINWFMPVN 
*HS I /LFCMQPMDQGALSTYKS 


1 175 


15076 


A 


1 183 


1 




PPIjP*KFFSPGGVQS*KGGRFWPPP* VG 
VLPSS S P / 1 1 FFTPGNWGCFS P FS P* KF 
FFSPKGFIFCGGVGPFFPPPKKRFFSKI 
PPLVFFSPPFKKKIFFFPPPVNFGPPRV 
FFKRPPS I FFF 


1176 


15077 


A- 


11 84 


406 


62 


PPIINMLCSLAPPFFSPPPLRGVLPPFP 
LKNFFF P * GVYFWWGGGPHF / S P P P KKG 
FFPKFPP WVFNP P PLRKK/ YHNF P P PGR 
FGPPRVFFKAPP 


1177 


15078 


A - 


1185 


337 


383 


rsyisfqk*vkdlnkhfskddtqman/k 
hikrystlviremkikpmtryhftptrm 

AVPF 


1178 


15079 


A 


1186 


3 

■- ■ 


471 


LAPSDK* ENRLPGNRPL* EVRS PSARQP 
PHLRSEEPLRPAATPSGK*GASPPSSHA 
VREGGGGQPPPGQPPRPGGEGRLCPAAP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


ML 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TGK*GAPLP/EPPPGLGG 


1179 


15080 


A 


1187 


2 


406 


FLVETEFCYVGQAGLELLTSRDPP.ASAS 
KGAGMTGVSHQVQPQ* * S * LWT* / PSS V 
EAGTS FGLSFLSSS WALS AQEGCLAVPS 
/SGSRGLLVGALLLWTKPSPQLSPVPAS 
QRLSSLSLMPPLPQPQHLTHTS IET 


1 180 


15081 


A 


1 188 


160 


459 


NFMTIDMIjCSAWIHFCSSGLDFQL*KL. 
FKSQ* ENNLINKWAKDLNS FT I FSYG* K 
LAHEKMLNI IRE I YFTAIMS * YTLTRTV 
KVKKTDTKCW*GCTATGNLIHCCKNV* P 
LRNTAWQ\FHKILNIYLPYHPAIPLSDM 
NSRE * KNT /CHAKMCAWI FI 


1181 


15082 


A 


1189 


232 


2 


KAPPPFFFFFFFFFFFFC*SGSHSVSWA 
GI*W/PGV*WHHHGSLQPQSPRLK*SSC 
LSLSSSWDYRYVPPHLANLKK 


1182 


15083 


A 


1190 


2 


402 


PRVRHASGSPSPPPPPPGL\SHTSPSQ* 
VFSWPS Y* TPCLSALTLASVLSLL* QRS 
PR TL F ITNKC / D F / PASHS S CR I PAGL * 
ALGRQGLFSCFFCFFETESHSFAQAEVQ 
WYNLSSAQPSSPEFK*FS/CLSLPSS 














WCVPAVPATWEAEPRRSR \RS KPbTGRQ 
SKSLSY/NKKKKKKKKKKKKKKKTGGGP 
* KKL LGG P KY KGG KKKK F FF F KGE KKKS 
LGGILKKKTFFWGGKKWPTPPKKNKALK 
GKKKFLGGRGGNP P 


1184 


15085 


A 


1192 


485 


112 


QRPDPR \SAEAAIKYFLTOATASI ILLI 
AI LFNNI LSGQ * T I TNTTNQ YS SL 1 1 1 M 
AIAIKLGIAPFHF*VPEVTQGTPLTSGL 
LLLT *QKIiAP I S 1 I YQIS PS LNV 


1185 


15086 


A 


1193 


55 


385 


THAFADAWADAWGLFKGILTENFPNLQK 
YINIQVQGG*RTSSRFNTKKTNSRDLII 
ILPKVKDKLS KKKKKN \ AENKK I LKYRG 
GPF**KPGGAQFYGGGRKSFFFFFGG 


1186 


15087 


A 


1194 


408 


3 


CQSAXLLGGASQLGXSRGSGVRDPLEEA 
VC P FSDLQLHAERTTALLKAVRQGHL s l 
PRLLLS F\VCLCPAPRGGACRGRQASLS 
CGGLHPVRASRLLCLRKRAWAMASVPPP 
ASLPPCSLISDCCVSNQ*DSTGRV 


1187 


15088 


A 


1195 


5 


371 


LFSTNHGDFGPLYL.FP\A*AGVLGTDLS 
LLIRAELGQPGNLLGNDHMYNGIVTAHA 
FVI IFF I VIP 1 1 IGGFGN* LVPL.I IGAP 
DVAIPRISNISF*LLPPSLLLLLACAIT 
EAGAGTG * TVY 


1188 


15089 


A 


1196 


76 


404 


PT PLRTHDQS S KVS RYKVNI Q KVGAFLY 
MLSTRTTGI * \ I KS TS FP FAS P KVKYIj T 
INLTKCVQ/DL WKEKKL/NEI KEDQNK * 
/NVPFSWIKGPNIVN 


1189 


15090 


A 


1197 


50 


334 


ILHMVSIPSISIY*HLLPAG*AGTHIG* 
LPPA* FFWVMG /RDRVLLCHPG WNAWQ 
S*LWASNSW\VK*SSHLGLSKY*D*RH 
EPSYPASGTKLN 


1190 


15091 


A 


1198 


250 


1 


QYYIiVSSNT*SIIDFLQLPQKCIFTVGF 
SK* * CI CGFI FKMI * LFF\ FCLFESESC 
S VAQAGVQWQDLS S * QPLP P VFKQFS 


1191 


15092 


A 


1199 


2 


402 


TDARHHTWLI FVFLLEMGFHHVGRAGLG 
LLTSSDPPRPAKLNFL*R*GHAVM\CPG 
WPHE \TCL,GLPKCWD YRCE /HTAPKPHH 
FLFFFFFSEKNFIPVPRRGGG/WKNLN* 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

rv==l_,ysine, L^Leucine, M=IVIethionine 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MEPPPPG* KGFFCPGPPKY WN* KAP 


1192 


15093 


A 


1200 


50 


437 


STLYLEREKEQISLSVIVSSPLPSTSTG 
PTAAAQSS ISGPSLPLPPHNGAGDAPAG 
LG/YGQGPSGPPWGPS/SGPSPR*ALVC 
PTDPSGAARGGRGG \SRGS CCAPAGPAG 


1193 


15094 


A 


1201 


86 


313 


PPPPGGYPHFSFXXXXXXXXXXXXKSLL 
PPGKGPNPPKVGLPPFXXXXVSPKXXXX 
XXXXX*NPPHSPPRAPGGPP 


1 194 


15095 


A 


1202 


105 


485 


VQQTTMAR I YVCNKPARST PRFTKQ I LL 
dvlk/yidtqti ILRDFNTP/LNSVMSW 

RQKTNKDNLDLNLMliGQLDliIDIS R I LH 
VSTR/YIFFSSLHEIYSKSGHMLSHKAC 
HNNF*KIEIILIILLYHC 


1 195 












PPGFFPQAWPRAGQPPPGGT/EPVFPPP 
KHP*RRNPYPKFGSPPLiPGKIHQGSPGP 
SGQFR\PPRWGKKRAQPTFWAHPPPIFF 
FF*GVLLCHPDWGTVARSR\PPQPPPPG 


1196 


15097 


A 


1204 


1 


269 


WEFETYNLGIK/WS KDMKRNFTKEDLM 
MADEHPRRC\*HHXjTPPRTAQVKLELEP 
NE/KQGCGEAGPATPCWGDAAPVQPLLQ 
IGRRFLKK 


1197 


15098 


A 


1205 


459 


42 


KLKPMIilYHSENPRAFQDYAKSMLSVLY 
K* KNEAGLRAHLFTAGFTENFKP AVETY 
CSE*EISLQI *LLI /GHPR/SLMEMYKE 
MNWFMPANTTSILQSMDQGVALTYEYK 
Y I SNI LYEAIDNTENPL IDLGKANAWVG 
PG 


1198 


15099 


A 


1206 


408 


1 


EMKTLTEMSSPGMPTEKVSELED/DIN/ 
EMHRKLQCREKR / 1 KRN/E INVQE Li* YH 
YKRCNHHVMRMPEKEQRKEEIFKVIMAE 
NFPNLATNNKSKVKEAQRTPERIKTIYI 
YIYIS*XiQKAKEKESIL»KETCQKTSKPK 


1199 


15100 


A 


1207 


7 


383 


LDIGCFWWEDKYSSCYFSLATSGKLKYF 
SFSCLIHCFLVCVCVRAHK\CI*SRAS* 

CQCFL VCVCAQV/CI * S S AS * CVCVCTS 
G I * /CQCFLVCVCVCVC 


1200 


15101 


A 


1208 


405 


214 


AEAGRSLCPGGGGCNEL*SPPCPSAWVT 
\ S ETLS QKQNPHHHKKKTGQVRGQS S Yli 
PLWEVNR 


1201 


15102 


A 


1209 


384 


2 


GVTHH / ARARFFC / LLDTGFHS VFQAA V 
Q *A\NHGSLQPQPPGLKPSS CLS LPSSR 


1202 


15103 


A 


1210 


403 


2 


VPSIQTNGKSLTYFDFFFKQLFKTKIjKL 
D AS FWVCVNVE P FRT FALS KTQ S LLCL 
SLFLLIF*TINCNFCFLRQN/LRSVAQA 
GVQWRDVGLiLQPLPPE FKQFC 


1203 


15104 


A 


1211 


156 


405 


ESKQMLIQCKSQMLYC*RI*TV/CLCVC 
VCVCVCVCVCIiTCGTKRVCIVFLFIWL 
NCK/PLCE PCC* CNRQGWAPWLLPVI ST 
RW 


1204 


15105 


A 


1212 


70 


408 


KAFSI/LPPGVSPGPIiCNPRRPNF I GGGP 
KKKPPP PPQR WTQG FFNP PP FFFFL WG 
G *GQD/ PPPVAQG * MDPP PRPPK\GRGP 
RHEPPPPAF 


1205 


15106 


A | 1213 


410 


3 | KKSMPGFKASKDRMTLL,LEAQAAGDF\N 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 
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NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, J=Isoleucine, 
K=Lysine, L-Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stopcodon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LKPVFI \YHSENPRIL,KNYAKYT/PVLY 
KWSNKAWMIANLFIA*FIKYFKPTVETC 
YSDKKIPFKILLLIENAPDHLRALMEMY 
KEIHAVFMPANTTSILQPMDQGVILT 


1206 


15107 


A 


1214 


3 


340 


KI RVE VNKMYNRKLLEE IKKI KS LFFEN 
INKIDQFLPRLRKTRQKMQINKIRNEQG 
GISINIMEIKYSYKEML*AIICKKLNNL 
HEMNKFL / EHRVPKLTQVE I ENLjNYIYK 
K 


1207 


15108 


A 


1215 


1 


57 


RGLKIQNVNEIKS*CFEKIKIDKFLARQ 
N F KRKRF KFRD \ * KGD I MAD I TE I * G I I 
RAYY*QL*GNYLENLEEMGKFLVTYNLP 
KLNH /DIENLKKPVTT / RE FKS VI KSLP 
L KKS P LHDS FTAQFYQTFE E E * VS VL KR 


1208 


15109 


A 


1216 


350 


1 


QSSFFSIYPNFHLIiSFLFCFKDFLFFYA 
SVLATNSLPLFFPQISRYFP\PFERERE 
RERENWIACCLGWSAVMQS * LTPALNY 
W \ VRQS S LLS LQA S * S YRCG P PH P ANI F 
HFIFCR 


1209 


15110 


A 


1217 


139 


358 


KNTFFVWEAKRLFVPQAGGQGGNLSFK 
FPAPP/NLRG*SASSPSSGDYRPTPPCP 
ANFFFFKKNRISPWGSGW 


1210 


15111 


A 


1218 


361 


50 


WGDHGSLKPQPPLRSKNPPPLASLVTGT 
Q*LG/HPVAPFPIQLIFFFFFLRRSL/D 
S I AQAGAQWCDLSSLQAPPPRFTPLSCL 
SL.KKKGNPVFVITWMNLEDIMLS 


1211 


15112 


A 


1219 


1 


361 


LKQHVSNVEKTAFYWKTQSKTFVARE*K 
GEILSSNRGECLLQSFKGQPLLLGANAA 
GDFKVK\ PMLLDHSEN/ PKALKNYTTS 


1212 


15113 


A 


1220 


292 


362 


HIPWPAIQQAEVGGSLEPRRSRP*PPR 
LK*S£HLSLLNSWDYRNVLP/RLANFCI 
FLCPVR\ FKLLGSNHQNASVSQSARITG 
VSHHAWP ILKLKKLFVCLFEMGSHYVAR 
AGLK/PPGLK*SSRLSLQNCWESRREPP 
HPDTIH 


1213 


15114 


A 


1221 


372 


3 


RLFFLLPPRPKGDFFPTL.LIW + RHGFSP 
PHVFKPPPLNLILGALKKKFFLPS/LPY 
VKFYFFKRAPLFFF/ IFFFRDRVSLCWS 
* PPGLKQYTHLGLPKHWDYRHESACLAN 
NHNNNNTNFFFETE 


1214 


15115 


A 


1222 


3 


291 


RSGDQDCPGQHGETLSLIjSLjTFVNLiSLiI 
CNL * TLS L I FVKI QKLAWHGGVRL * SQL 
/ LRRLRQENHLNQEG \ DC I PAWATE KDS 
VSKKKKKKKGGPF 


1215 


15116 


A 


1223 


379 


19 


HMQILTIMRYYTPIRMAKMKKI GYTK/ C 
W*GCAATGYI \WECKMVQSL/WQWTWAD 
S*KLNTHLSYDIAIQFL/GFYSKEIKAY 
IHTKPCT*MFTATLIIKSQI/MKQYKCL 
STDKWIKQNRGIYI 


1216 


15117 


A 


1224 


387 


40 


KKTLSTP\PEKHPTPPLFHPPPQKGGKA 
PPSFKPPPPKHPPPPKGISPPPP*IPPP 
PIYPPPPPAKPPPDFFL*SPPPPPPFSP 
PPPP*TPPPFFSPPLFFFFFFFFFLLIR 
LLV 


1217 


15118 


A 


1225 


21 


477 


IVSRLITVRLQKPRLDPRVRPRVRRKEN 
YSSV/SFINIDRNVLNKIL./SSQIQQYI 
KKL I RHE * FWF / I PG I QGWFN I LKS VI V 
TYVNK\QKWKKHTI ISVDTEKAFDKIHY 
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SEQID 
NO: of 
nucleotide 


SEQ ID 
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sequence 


M 
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od 


SEQID 
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USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Prcdict- 

lo cation 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, J=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














LALIL. I KKKKKKKRALPQSF I PGDLFKK 
PNGGFPGLKMMGRAPGE 


1218 


15119 


A 


1226 


1 


398 


ER INHTLF FLAE AQKRLPTG I SGRGR PA 
I PHNTPQRAPDHAYL PAALAAQHRAGGQ 
QAPPPG\SSPSSPYDEVKDREGDVTASH 
GLRGNGWGS P * ATSLVLNNLM YMTAKYG 
DEVPGPEMENAWNALANNEKW 


1219 


15120 


A 


1227 


3 


238 


DAWVAGHDG/ RTP* SQLLRKIiRWEHRFS 
SGDRVSDP * SCLCTLAWVAE * DS S S KKK 
KGGPFKGTKFNSRGGGRNYFFYGA 


1220. 


15121 


A 


1228 


207 


2 


NRVSPCCPVQWHDHSSLQ/PRTPGLKGP 
\ S AS AF * VAG TTGVHHHAQL. I FH F F F F Y 
*DRVI QAGVQWRNL 


1221 


15122 


A 


1229 


15 


413 


RKSVNVIIHHSEILKKKNYMIISIDTEK 
SFDKI * YPFMTKLSEN*DSLNLINNI Y/ 
AKP/VANTI LHS KRLNAFP I KQGCLQGC 
\PPFLFNIILEVLASTKGRKEIKSLQIR 
VEEI/KPLSLFADNM/IVYIENPKES 


1222 


15123 


A 


1230 


363 


1 


AADPFACRPVSPHPPLYLVTSSQSLLTS 
VSPKKQPPSAGMQLGVGDSLSGWGWGRT 
KKTRLFFFKTRSL.SAAQTRRQWCEHRSQ 
P*PPGL,IKRSSHL.SLPSS + KHRHMPPHL 
AN\l>CLiFV 


1223 


15124 


A 


1231 


3 


402 


QANSCIFSRDFGSVGQAGLEPLTPGDPP 
ASAS PRCRDYRC \GPHAQLL.IiTFC* I S I 
LILV/CIS*MKDYFITCIYFFNYS*QII 
FYRRAS DFFP FLRQGLALS P MHDHGS LQ 
PHPPRLN\HPPTSAKGVTGT 


1224 


15125 


A 


1232 


3 


396 


FLSQHGFLFLFFAGIDKLILKFIWKDN* 
NNYEKED*RGVITLP/QYKAYSVATVIK 
MVWHWQSDKHVDQWNREPRNRPNYYMS Q 
*FPL*YVSQRNENLPLHKNPYMNVHNGF 
ICNSRKQSRYPSTGEWVNKIj 


1225 


15126 


A 


1233 


3 


395 


LPKCWDYRR* PPHPALF/LFF*KHPKFP 
KMQVK/ WRKH * TENRLVMYFSEKFGD * L 
KDQHKLYYLDMAFQFIFTFTYVMRYS IL 
FKFNIR*LTLSSVKMAVFLVETGF\TML 
ARMVL IS* LRDL PTS ASQNAG I 


1226 


15127 


A 


1234 


416 


2 


KIDKL I L.TF I WKCKRPLLAKI VX.KKKNK 
I*RLPLPSFKTYGKGMV/TK/TKWGWPK 
NRPIKGTGF*VQKKKTPHIYGQLVFDSG 
ARTITN/WY*DN* IST*KRMKL,DLYLTI 
YKINSKWIKDLNVRAKTMTFLEENIWVI 
LD 


1227 


15128 


A 


1235 


384 


22 


FFQE I * NAIPVNTQMVRKQNSHPANIEK 
VL/WWIEGQTSPTNIPLSQSLTQSKALT 
LFNSMKA/E/RGDKAAEEKMETCRGWFM 
RFRERRHVHN I KV * GEAARS CG S LAAG S 
PLKHLLLHNTLIC 














SPYTKIHSKCLKGLNVRPLTFKPLEENP 
G/VMVPDLGPGKKFIS *AP KAWATKTK I 
IQWGYIGIiK/AFCPAKEAI PRVKTWPSE 
SEGIPAGHAFYEGFIFQIFKGIiQPFHSK 
KKKDLIIiKLDLLSADIS 


1229 


15130 


A 


1237 


1 


407 


YRVGQAGLKLLTL* SAHLGLPKCWDCGR 
EPPCLTI IAL* SVFSLVLPAVLIKLINF 
F\CRDGGQTMLPRLVSNSWPQMIC/LPW 
PPKVLGLQ- - ...... 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1230 


15131 


A 


1238 


3 


380 


ELSQRTCCMDIMSYYKAIVITMAWY * FK 
DKQRVQQNEVESPKTLLYIYRLWIYYEG 
DTTDKG*TF\NKWYWDSWI/ST/CKNM* 
FDYHFTQCIKIN/SQWM 


1231 


15132 


A 


1239 


1 


396 


FADDMIVYLENLKDSSKKLLELVNQFSK 
VSGYKSKV/NVHKSVALLYANSNQAE/Q 
I KNPTPFTIAAKKQK/ YKQKNNLG I Y * T 
KEVKDLYKENYKTSLKE I I DDANKWKYI 
PCSWMGRIDIVKMTILPKAI*RFS 


1232 


15133 


A 


1240 


392 


58 


SFSMLARLVSNSR/PSRVLPASASQSAE 
IIGVSHYAWPSKLSF*LTIDQTHLSCNL 
F I YVFFERRS CS VTQAGGQW YGHS SLQP 
*TPGLKQSSCFGLPKCWDYRLEALPRLM 


1233 


15134 


A 


1241 


500 


204 


SLSLS FS*DGVLLCLGWSSTPGLKRFSC 
LS LRS S WDYRC VPSSQTNF \ VFLVEMGF 
HHVGQAGLE LLTS SDNARLGL PKCWDYR 
RVPPRPAAFFLFFKG 


1234 


15135 


A 


1242 


2 


397 


NFM INNLPKKKALCP WFTGE FYLP FKE 
ETIP*VIRISLSL/IFSIIFQKIETEGI 
LPNSFYEGCIILiISKPYKDI * \ENYRLT 
SVMHIDARFLDSILANPIQQCIKVIHCS 
HVAFV/SGTQDWFNIQKSM 


1235 


15136 


A 


1243 


12 


362 


AGFYHVGQDGLKLLTSSDPPASASQSAG 
IKA*ATVPGLSPLNFCEVRFMESWSWKR 
P*RTS\VCSALAKCWDRNASQWDKLPS 
FPT/RM*SPCLHPAVSAFCGSASVLPGI 
IVTTSV 


1236 


15137 


A 


1244 


183 


383 


KQAGRGGSCL* /liPRCWDYRREtiLHLiAF 
MPG*I>KK/FFCKDGGLTMWPRLVLNFWP 
PVILLPRPPKVLG 


1237 


15138 


A 


1245 


410 


2 


FSRDRVSSCWPGWS /PNS * PQSDQtALK 
CWDYRCEPLHPAFC*MNFLKVSC*HGTS 
/DS KY /CQHVSPKNKDI SLCNYNI I I I P 
KKFNI L* YI WFLDFFFFETE FCSCCW * D 
LGSLQPPPPGFK* FSCPSLPSSWVHRHV 


1238 


15139 


A 


1246 


440 


46 


KTEGAN I NKNTT YQ / NL WD ADKA VFRRK 
FRALNAYTRK* ERAQLNLSSTLKTRKKE 
QNKLKADRR /QIMQI** KLGKVENKQTK 
TIQQIH*TKSWFFEKISKIYELi*QA* *R 
KKGEKTT I TNVRNKRYSGRVG 


1239 


15140 


A 


1247 


266 


340 


RKCW P GAVAHT CNLiXTLGGQG GRIT 


1240 


15141 


A 


1248 


3 


391 


DAWADAWGSRSRAVALFFFFWGFLGGGG 
LKTGFYFIPQVERRGLNFD*WNPPPPGL 
RGSSPPT/LPKQWEPGGGPPAPSNFWFF 
FE KRG FPQ VTQAGF KLWN * GD P PAG P S K 
GVGITGGTPSPHPLFLKKR 


1241 


15142 


A 


1249 


3 


323 


MHHHA* LVLVFFCGDKVSLCCPG*S *TP 
DLKRLSHHGLPKHWDYRC/ RATTPGFFS 
LFFFFFFNPPEFLLPWAPPSYSLKQNKI 
FLKRFLE PLAPPKGX.PLKRAKGRI 


1242 


15143 


A 


1250 


3 


405 


QPDSYSPQPGRLSAPPEQEGGPWLILPH 
ACAPSQI CVGPHGASILSYQERKGTKVL 
SCEGHCKL>SSPVGLVGQSFCWQQPDAVQ 
WVPFRRRTQTPGTVAHACNPSTLGG*GR 
RITS/RPGVRDQPEQH 


1243 


15144 


A . 


1251 


407 


2 


P FLKKTKKEL P * GPAI PALG VPS FFP-FF 
FLP Y KRKE I KG I RT P PMR I AAIiFTJI AK I 
LNQPKYPSVDI * LHKRHTSNTTL.LSRMK 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L~Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














H* WIKKT* /HTDMMEYYSAIEKNGILSF 
AATGMS PKDIMLSE I CQAQKNRY 


1244 


15145 




1252 


1 


253 


VADVRESLDPGD*GCSELRLCHCTPAWA 
T\SRTTSQQQLKKQNENTT*KTYGIQLM 
NCQIANYCSKLLYFRKMSNLQSNLKKKK 
K 


1245 


15146 


A 


1253 


3 


372 


KKKAQKSGTIVLPCNPSYSGG*VGENA\ 
WARSQPGL IS* KQQ * QQQNQQQTTSKTD 
PYIYG/YLIHDRGGK\NTQGWDRi,LNRW 
GWG I WLMTQRKI KLELYLTPYTRTNSTW 
I KDLNVAGCGGSCL 


1246 


15147 


A 


1254 


2 


410 


KNLTP I VGLKAATN *ESIPPQT/S RKKK 
KKKKKKKKRGGALKKKKFKGGGGGKKNF 
. F KGGKKKKLGGG VKKRGEG KKPGGKKKK 
RFGKKSFFSRGGKKKKNRSSSSSSSSSS 


1247 


15148 


A 


1255 


61 


487 


GR PGP THAF W3 DS FFMGNEAR F I NHS C 
DPNCEMQKWSVNGVYRIGLYALKDMPAG 
TELTYDYNFHSFNVEKQQLCKCGFHKCR 
G I IGGKSQRVNGLT S SKNSQ PMATHKKS 
GRSY/RE * KKNKKKRGGRFKGSQFSS PG 
MQG 


1248 


15149 


A 


1256 


118 




MFIAELFI IA/RRWK* PKCPLTDEWINK 
M*YSHTM*HY 


1249 


15150 


A 


1257 


390 


1 


TQKNTPTPKPKPTSQQHHQKNKPTPPGF 
FFFSPPGK\GGFFPSPIjFWVPPGFFPPP 
VFKTRPPE F I FGAP * KKFFFS PPRSLNF- 
FFLRGPPSFFFFFFFFS*VGKEGSSPSH 
ENPLFVPTEGW 


1250 


15151 


A 


1258 


378 


3 


GAFFFFFPPPKKGFFFTPFFFFGPGFFF 
SPPFFIPPPQIFFFGPKKKKKFLPPPPQ 
KIFFFLRPPPLFFFFFFFFFFFFFFFFF 
FFFL\DLD*NVLSGTSAPEKNQELLGMV 
AYACNPSTLiGGRGG 




15152 




1259 


462 


287 


RDG\ FHHVGQDGLHLliTS * STRLSLPEC 
WDYRCDLPCPAIPAVTLiYQIQYRPLGLiE 
SKA 


1252 


15153 


A 


1260 


446 


34 


NVRAET/ IKTLEVNTGVNLHDLS* /GKA 
FL.DARPKAQVAKEKSRQPSLHRRLR/FC 
AS KDP IKE VKRQPTE/ WEKINHVS DKGL 
EFKIHKELLEKLSNNPI *KWDKDLNGHF 
S K / ELQ I AS KHMRR CS AS LV I RKCNE I V 
DPD 


1253 


15154 


A 


1261 


2 


383 


GSQRKWFLEMESTPGDNAIDIVEVTTKG 
FNYYMNLVDKAVAGFERIDSKFERSSSV 
SKILSNSIACYR*IFCKKKSQLMQ/QTS 
LWFYFKKLPQPPQPSAATTLISQQliYTL 
RQDSPPAKRL* FTDGL 


1254 


15155 


A 


1262 


476 


50 


FFFFFSDTGSHS W*AAVQWHDHDSLQP 
* PPGP\SDPPTKSSTREF 


1255 


15156 


A 


1263 


143 


472 


TGAVPIRPSWN/RPPAMIFF*NAQGILiF 
AEFLASQRAILWEFFEKAYHESVLRKSA 
KGLAEKCPGKLHQRVLVQYDNALAHFSH 
QTRTTKS S TRQFR 


1256 


15157 


A 


1264 


179 


489 


Y I FFFLSLFFFLWPRPEYRGAI TVHCS L 
NLPGSSDPTASASSVAGTKPH* YI FFFL 
SLF/CFSVAQA*VQGGNHSSLQPQPPGL 
K\YPTASASSVAGTKPHPLjIFFFF*KKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
. peptide 
sequence 


Prcdict- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaJanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q-Clutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FL\ CFVPQAGGQGHDFG * L* PPPPSLKQ 
GGGLTLSRIGDYRGPP 


1257 


15158 


A 


1265 


1 


536 


FRGGWGSVRRAPGTASCYLAiiAGGPPGQ 
CPPAAISCPSPLPWGS*TE/PYVPRRPG 
DPSAAP/PSW+VPRPGVPRNVPGRGRVG 
QDGCGQAQDI QAAASLMADTAPDSPGSA 
■ GSVRALPACVPEISGS/SGLPPGAALP* 
VAA/RPTPGRHVDTHPRQHSFCGGQEGD 
IRAFPTLYLEVYPGPP 


1258 


15159 


A 


1266 


403 


2 


TSLHPRGYMRLLRQGFISAPCGYMRLPR 
GSS LH PRG YMRLLKTGVHRCTPWVHAPS 
QTGVHLCTPWEHTPS*RFISAPLGAASH 
VT/GSLSIQHIYIFVFCLLRQGL/NSVT 
QAGVQWCNLGSLQPKLPQAQVILTK 


1259 


15160 


A 


1267 


2 


401 


FVLNPGGRSCSEPRSLiHCTPAWATRAYL 
QLGKKKKEKKKERGEENKNQGPPRPLKR 
EVGTPGQKKPL/WGGSNGAGQKQPA/QK 
KGGKKKADHKGQRGKN*KRKREGGGSTS 
KNNSRGTGAKA*NPTIWGGGGKKI 


1260 


15161 


A 


1268 


49 


416 


LRGRALDPRLLRECGDLGAPPAPEVALR 
AGTCWTR/CTL*APPRGA/DRSPWPPRS 
PMCKAG/DECQDGI PG\MKAWSCGIjRTR 
QCP WP * PKLPCGPGHPAWRT * PLPQTA/ 
CGPGPAAPCAG 


1261 


15162 


A 


1269 


420 


47 


GPPGWASFRLNFPKARR*GEWKTPGESG 
GAFFS PPGKNF\AGQN*GRPPNPPPPGP 
GKGGI QTRGGAGLGKNPFRFWGGFPNPG 
NK*GGGTKKKEGPPPPLFFFFIjNTDFCN 
LIKRDRLGVGAHL 


1262 


15163 


A 


1270 


404 


-j 


AE ILE LKNVID I L.KNVSES LNS RI DQAE 
ERISE'IiEDRLFENTQSEESK*KRI*KNE 
ARQQDLEWS FKKANLRVIGLKEE VEREM 
GVESLFKW 1 1 1 ENS PNLEKH I HIVQEGY 
RAP \ FNSNKKTSRHS I INSHTK 


1263 


15164 


A 


1271 


387 


2 


KKEKKKEVI IIFVLL>LPLK*FQNIQVWL 
VRELEKKFTGKYW\FA*RKILPKPTQK 
RCTKNKQKRPRIHAPTAAHVAILEDSVF 
PGEIVGRRIRVKWTQQLTRVHSDKAQQN 
NVERKVQTFSGIDKKLR 


1264 


15165 


A 


1272 


1 


393 


FRMGRLPGGGAPHFSDGVAGQRRSSPHR 
RSRGRAEALLTSQTGRIiGRGAPHI SDDG 
RPGRDAP\PS*TGWQPGRGAPHFPD\GQ 
PGRGAPHI PDDGRPGRDAPHFPYGVAAG 
QRLQSRHFGRPRQAAGRWRL 


1265 


15166 


A ■ 


1273 


3 


317 


S S YETKG VMI AS FSSREADNHTAFI RI K 
TNASDSTEFI ILPVEVEVTTG+ WKTDE I 
ES CFVFS IAYSGKVKNNC*LFFFP \APG 
I YS STEMLD FGTLRTQGKK I F 


1266 


15167 


A 


1274 


92 


368 


LCPPGGRAGQGQNLN*GVPSPPGPKGFW 
GPTYGGGGGKRTTRPGGGKPEKSFD/GP 
PGPPPGGTKGNPAPKNFFAQKERGPFFQ 
NGGPGNKLF 


1267 


15168 


A 


1275 


401 


3 


DMCDWFKKEFSDTTPKA*SIKEKQLTSF 
I KIKS S CS PS \ KNT I KGLTRQEKLWGK I 
FANHMSDKGLVSRIYEELSKLN\K* KLN 
F * KWAKQ * DHLLKKTH+ * QICKERYSLG 
KGKITMRYHSTSI IMCI IKTIHS j 


1268 


15169 


A 


1276 


298 


426 


GATMLVLLF*LAFDLRQSL/NCFPQAGV | 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QWRDLGSLQPSPPWFK 














VSFKDQKL* INKENHPIKKWKKICNQPL 
PQIRYMDANKHIKIFSTSLAARETNMKI 
TT/RI L WKTVWQFFKKVS IYLPLEPAIS 
SYIiFTQEK*KHIDGSCSPSRVATAKLLA 
LQWRRPSWNCMHQRAGSWKQIGAPPFSK 
L 


1270 


15171 


A 


1278 


1 


218 


TRSGVQDQPNQHGETLSLLKIQKLASYG 
GVCV+ SQLLRELRQKNCL\NGPRSHHCT 
PAWAT\EQNS I * KKKKG 


1271 


15172 


A 


1279 


88 


396 


TFLFSSSFFFFGGGVPPCPPGLRPG\PN 
FASLHPPPPGFGGFPSLTSPRA*NWGPR 
PPPQATLGAFGLEGE * I FPPGRPPEP *L 
WGPPPPPPQRGGMAWPTPL 


1272 


15173 


A 


1280 


94 


399 


TDFLFL* TDFLFLCLCSLKNKIWVNEFR 
YGGFSLGVSNTQALPPSQEVNDAIKQMK 
KHLKLAK/DKISIVRCITKMGM*LLGYR 
S S WQ VKR I T WQL I VLNMLA YRA VFG I 


1273 


15174 


A 


1281 


236 


2 


TQS RLVFFYMTG PAVYLNHHAIiRTHQGS 
HLCFFFCFEMESHSVTQAGW\VWWRDLS 
SSQPPPPRLKRFSCLSFPSS*DY 


1274 


15175 


A 


1282 


37 


410 


KKKTLKKIKKLCPPGVKGEFNPPFYVLS 
KVFPKKERGLFFKVTFVSLTPFFKKKNF 
KIGGGSGFPPPLWFFFFFY/RDRVLLCH 
PGWNAVTRS*LTTSVDSSDPS/CLRLPS 


1275 


15176 


A 


1283 


404 


232 


LSSWDYRCLPPHPANPLYF* *RRGFTVli 
ARMV/GPRDSPASSSQSAGITGMSHCAQ 
PG 












402 


MQIKITMRNYCTPVRMP*KKK\SRTKPR 

c* ggggttki l i pcwgdykige / plw/k 
svwqflik*nnhllydpailplifytre 
mktyvht i tkycnhnek/ cmqg ftealf 
/tlpkte/sqpkhsst 


1277 


15178 


A 


1285 


1 


253 


LRGKFMTLHS /SI L.KRG*SQ INN* TATL 
IK*KKKGKIKPK/RSRRKVIIMNRNK*D 
*KQNNNKFNETRSWFFKNMKKIYKSLVR 


1278 


15179 


A 


1286 


400 


105 


FNFKKKKKKKNREVSE*QY*QAKYLKRN 
FTKNI * VAEKHMKNAQYVIKEMLI*LTM 
RYYYTPIK\MAKIKLTDNHKCQQLjKPSY 
MVGKEFGKFL I KLNM 


1279 


15180 


A 


1287 


400 


119 


TVFHHTGQAGLELLTSSDPPVSASQSAG 
ITGLSHW\PASSSFHSNHSCPVPLEPHQ 
VAQLDS FG YEKVCS FSN* VPGAQDKNDIj 
SEN 


1280 


15181 


A 


1288 


157 


386 


NELQFHPFYCLF/DLFCQSLKKAGMQWH 
DIiGSLQPLiLPRFRRLiLCLSIjPSSWDCGH 
VS/PMPG*FCIYTHIYFLVEMGF 














FDAPAKGQGGDFGSPPPPPPG/AQKIFP 
PHFSQ* IG * KEGAPMGQPI F VS FS KTG A 
PPPGQGGFQIPTPCGGPPQNFGIQGGAP 
APGPGGGFF 


1282 


15183 


A 


1290 


419 


3 


KNFFFLERGFFFFPPGGRAGGGF* FPPP ' 
FFFQGKKI SGPPP P KKRGPPGAPQKL /R 
EIFLVLKKKGGPPLWPGGFLNPAPKNFA 
RANFPKRGGSRVGPPSGGPNLFNLGGGW 
FFRFFFFFFFFFLVFETEFRFCCPGWSA 
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NO: of 
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sequence 


SEQ ID 
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peptide 
sequence 


M 
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Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alaninc 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoteucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophnn, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














M 


L283 


15184 


A 


1291 


397 


39 


WGQVWAKDWIHGLGPKTQGCPPPLGVFG 
CP/SLI PPNLFFYGI PGGLKI FVHKNPP 
FPIKKEPANWIATFFPPGVFLFFL/CFF 
ETRSHSVTQAGVQ*RDLVSL*PTSGFKR 
FSCLSLPSSR 


1284 


15185 


A 


1292 


399 


1 


LNFFFLHRQKLCNI LGS DDKVPAFS I YP 
EPRICYVFSLLFEI IFS FDFNKKIR IYK 
KHVS R I YKDLLKFNNKDNPVKKWAKDLN 
RHFSKDGIQKVKWHIRNC*TLAIK/EMQ. 
NTTTMR*HLMPNRMAKIRKTI 


1285 


15186 


A 


1293 


412 


1 


ARMVS ISRPLGTPI WRS QKGGNKR VS P P 
TRPNPLFLKNPKKCPKSRD*NKPPLLKK 
KKKWSGPCPAQ/SCPGQNPHP/SPPKQL 
PPS PSTPPSRPS PTEAFPMAS * LAI SPA 
HTHCCWRELSGTHTLSIPCLKPLRGP 


1286 


15187 


A 


1294 


229 




FFFETESHSVTQAGVQWCNPGFKRFSCF 
GLSSSWDYRYAPPRP\ANF\ *FLtVETGF 
YYVAQAGLKLLS PGDLPALAS 


1287 


15188 


A 


1295 


401 


2 


STLPVH* KWTTKAWMTAHb/FTAWFTEY 
FKPT VQNYCSEE KIPFKI LVL I DNAPGH 
PQALMGMNKEMS WFI PV/NTTS I LQP \ 
ADQGVI FTLKSYC.IRNLFQHQCTPAWVT 
QLDSVSKK* INIFQ PGMVAHACN 


1288 


15189 


A 


1296 


1 


336 


KKTPRRKTHKEHHNGEK\LRAFPLRSGI 
KQ/GCAF*P/YILNIWA/VLAKEYGKE 
I KAS FRKEELKLCIiFIDM 1 1 FVE I PENS 
PQKL\ IN*FSKVAG* * VSTQNSVAFLFF 
FFFF 


1289 


15190 


A 


1297 


3 


384 


HTNMQKVIiV/VKTEDQPSHNIPLNQILI 
QNKS LTL FNS I KAERSE EAA * EKCEDSR 
GWFMRFKKKK/RISITKMQGEAASAWO 
AGTS Y P KDIAK /DEGS YTKQQ I DNVNET 
VLY * KM I / PTEDF WRKDKSM 


1290 


15191 


A 


1298 


3 


395 


SRS VTRAGVQWHDLGSLQS P P PGFKQFC 
LSLLRSWDYRLSPPHQQMGGIFL/VFLF 
FFFPGKGVFLGGPQAGGKGPNLG*WNPP 
PRGLGEFFGLNPPRGWGFQ/HPPPHPQL 
F/CCFFRGKGGSPRRAGRAPNP 


1291 


15192 


A 


1299 


260 


4 


GFIVSSAVFPLLKCLLDILVSSSVLLYC 
LWNFKI QKSFRFYWLS PLVSS FKNSH * N 
SNI / WPGAVAKACNPSTLGGRGGRITRS 
GD 


1292 


15193 


A 


1300 


91 


219 


LSRLSAWDY*HVPTHVFFVEMGFHRWQ 
ASLELL-S * GDPPAXiA 


1293 


15194 


A 


1301 


277 


441 


TVEF*GVQSMKEEI \ELCLFEDDMSVYE 
ENLKELTKN\LLK* ISNYSKVAGYKVNI 
+KAIAVLHTSNEQ*NFEIGNTILF/TIS 
TPKYLG INLTKYAQGLYEENYSNL.MNE I 
KELNTW/RDILCS 


1294 


15195 


A 


1302 


256 


482 


YNVYFKI CIGPGTVAHACNPSTL.G \G*G 
GRI/TLRSGVRDQQVQHG 


1295 


15196 


A 


1303 


182 


460 


VGREFLDMTGKEFIYKFLYIRKIN*SSS 
KLKTFVXiP KTLLRD * KAKLQTGRKYL* I 
TYPVKGL VS RLCKKLS KLNS KKTTQL.KK 
WAKDMNRHF 


1296 


15197 


A 


1304 


1 


468 1 FKQFS S XjS llss wd yrhptgn fcelap y 
1 KQNAPSCSNVFTDNVPVIST*QGLQAIiS 



609 



0164S3SA2_I_> 



WO 01/64835 



PCT/TJS01/04927 



SEQ ID 
NO: of 
nucleotide 


1 SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cysteinc, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G^GIycine, H— Histidine, I = Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparngine, P=Proline, 
Q=Giutamine, R=Arginine, S=Serine, 

W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PNVNEQSPWRETKKTAVDLCGVIWKPCQ 
VFLRKKPKTKTGQ/HEQQIFGF/CFFSE 
TESHSVTQAGVQ*HHLGSL*PLPPPFKR 
NSTTRTSDYDGP/HVPP/HPA 


1297 


15198 


A 


1305 


880 


1026 


EEHGAGLSGSQDAAGGVPAG*GGWAQLW 
VTRRAS LFLDKTHW P VDEQNLG S LYTI E 
ATAYGLMQKIELGRYNETHAIAKWLLEK 
QELGGGFRSTQ / S DGD PRETT WALEAL 
TRFRE AVP FKG I QDLiHVQ I RAP KTALNV 
NWYIDHSNAYQQRSAKFLAQDDLEIKAS 
GNGRGTI S ILTMYHKS PESREDNCNLYH 
LNATLHSALEENKKGGETFRLRMETRFQ 
NN*EATMTIMEVSliLTGFYPNQDDLKQL 
TSDVERYAFQYKTKTSTSDSTWL.YLEK 
LS HEKNTELGFR VHRMLQAE FLQAALVT 
IYDYYEPSRRCSTFYNLPTEQSSL 


1298 


15199 




1306 


3 


726 


RTDHYQFQSLKHCLTGGEALNPDVREKW 
KRQTGVELYEGYGQSETWTYKWELSYE 
DANTYI VK* KTL* TQKEEG\ I I CANPKG 
MKIKSVSMVKESLPYYVHIVDDEGNVLP 
PVEERNVTVRIKPTQLLCILNCYLDKCE 
KTAVS*QGDFYITGDRARMDKDGYFWFM 
GRNDDVINSSSYRIGPVEEESALVEUTA 
VLESTWSS PYP IMGEVGKAYI VLTRAY 
S5HDT*ALTRVIjQEHVIK 


1299 


15200 


A 


1307 


230 


486 


NAICPNGSKGSLGLGSCSVTQTGVRW*D 
HSSLQPRPPGLK\YPPTSAS * VA\GPQA 
GVQWHGLDSLQTPPPG/FKRFSCLSLPS 
SWDY 


1300 


15201 


A 


1308 


463 


440 


S AI PVHT * RRRKQT S LNADMEKV* WW I 
EDQISHNIPLNKSLIQGKALTLFNGMKA 
KRSEEAGEGKFEASRGQVMKFKERSHLY 
NVKVQGEAAGADGEAAASYPEDLAQSTD 
EGGYTKQQNFSIDV/TTFYWKKMSSRIF 
IAEFHHTD*GDNGSVP 


1301 


15202 


A 


1309 


463 


59 


EVISTLTKMGKILKTKNAK/CWQDCGTK 
RTFIHCWWGYKLVQPLWKRV\YTFPYNIj 
AIPLIGIYP* /NMKI YSHKRTCTKMF/ I 
NSLFIISKNWKHLRYTFTREWIK/M* YS 
HTKDYYSAIKRNKLDMFSNMHESQKHYG 














PGGKGGGGPTPAPFPFPPPPPKVGVFW* 
GFPRPFLFKPPPPNKAPFPGAPHFF/SL 
PPFPRVFKKTFPTKKKFFPPGGGGKKKP 
P P PLQGG KKMGS ALKF P SGGKKP WG ISP 
QI 


1303 


15204 




1311 


2 


310 


HNQKNEFGPYVTSYVKVKEAK* IAKTRK 
LLEEN IGVN I HD I ELGSGFLAMMP \ EA* 
AIRLKIEKLDFIKIKNSCASKDTINKVK 
KTGRPGMVAHAYNPSTLGG 














RGQQDVLPG * RGQQDVljPGRGTYSEYGL 
IFSARNPSMEV*SVNHRPFHHHGVP/HQ 
WCSDSGSYFISKECGN/WVSACGIH*CC 
LVPCLPKAADLGE * WNGDFSLSGKEKKK 
EFYLIMTFSQ 


1305 


15206 


A 


1313 


2 


424 


ENKLTNHGKTGNGGAQSQ/PPECEPRTH 
LQRGLEGRGGGEPWGQGQPD\ PPSNS SL 
KNPQAGVPPFSSLKGKVKRDRSVSVDSG 
EQREAGTQ* TFVNGKKKKKKKKKKAKKA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














AYSPDSKA/PSSKSSPKSS 


1306 


15207 


A 


1314 


375 


3 


KKRTLLI PLMNMDAI /MNKI LAN* IQQH 
IKK I IHNDQVQFVPGMQGWFNIQKNINV 
I PY Y* LF Y* CYINRI KDQTLI 1 1 SIDTE 
KNDKIQNSFIVKTL>RK\IKENFLTLIKR 
IY+KSSNNKKNID 


1307 


15208 


A 


1315 


403 


3 


LEG VS F F F PR VAS QG S I LGS CN P PLP R F 
HHFS /CPHLLSKWGYRFPPSPPA\FFFF 
FLKTGFFFFCKNFA1.CSFP*KQASPPPP 
LTVFFFS/HHPFFFFFFPLFFFLRWS FA 
LVAQAGVQWHDLGSLQPPSLGFKRF 


1308 


15209 


A 


1316 


38 


427 


PEXPARPTRPRPSAWQPPRLRSEEPLRP 
AAAPSEK*GASPPGSHPIWEVRSVSARQ 
PPRPGGRWWGVNPPPGQPPRPGGEGRLC 
PAAPTGK*GAPLPGQPPRPGGRWGGYPP 
ARSAAS YGR*GAPLS SRP 


1309 


15210 


A 


1317 


43 


399 


LTFFFFLGEGARPPPPGWGPGAHQGITA 
PLFWGGQGNPPP*PPGEGEPS/QGPPPP 
GGNVFFWKKKGSPRAPGGPWTGGPKGLP 
RPPPPKGCE* RGNP P PPT P KF LGFTT FQ 
KKGKNSGP 


1310 


15211 


A 


1318 


437 


54 


MNEQKGERGLSSLGLPAR *D * GKGNVKE 
GR / EG VTFS PRE P KE ESLR WS TPQKE I V 
GVINHPRGQRVDRQGAGSGCEGFALRAQ 
TGG P PA * E I TRAG EQ KAGGGS KGAQTLQ 
RETRPPRGQRGGGRSASL 


1311 


15212 


A 


1319 


351 


133 


GGGWAEVPLTSQVGRSGRGTPQLPDGAA 
RQRRPPPPRRGSWAEAPTSQTGRPGRGA 
PHLPDDGRPGRDAPHLP 


1312 


15213 


A 


1320 


474 


11 


KINSPSQKKKKKKQCSSIRKLA* /DQNR 
HFIEEETQ I PNKHMKR SSIS /LAI KEMQ 
IQITKSYCIIIRLCK*LIKNSDSIKCWQ 
GCREI/GSLIHCWRKLKMNSTALQMVYK 


1313 


15214 


A 


1321 


485 


33 


CPASRVAGITGAHHHAWLIFVFLVEMGF 
HHVGQAGL.E /L P VS WGNGGHL /RLCL.L F 
GYCGQCCYKPS*RRFFCVSFYIRA*K*G 
FVCLFV*MVSCCVTRLEFWTHRCNHSK 
LQP* IPGLKCS\PASGF*VARTTGLYHG 
VWHVSNSTALTSGLQ 


1314 


15215 


A 


1322 


451 


145 


THPFGRPRGGVFKVRKLNPPWLKKENPL 
FIKKKKKKPGPGGGPFIPTPSEG* /GKK 
KGFNPEKEASNKPKFAPSNPQTGLGKKK 
KPPS LKKKKKKKKVYS LI F 


1315 


15216 


A 


1323 


470 


151 


GKAEERHDPGSRACSEPRSCHWTPAWAT 
\SETPFKKKKKKLCISWPGTVESCFSVR 
T* * I * LRQ YVLYLFLFQPKNVI YLFI CL 
HDKANMVMFLDFKYYFCFLFLD 


1316 


15217 


A 


1324 


64 


367 


TWEGEGS WLTSQDRTTAPLH PSLDNRVK 
LRLKKKKS VGFLS PS I LLAKNQI KKPAP 
FTMA/S K/RI ECYLG INLTKDVKDLYNEN 
Y * TLMKNLKGGAMCSFTSL 


1317 


15218 


A 


1325 


193 


381 


AQLFKTSMGNSETLSLQKKKKEKS*LTL 
PSSWDYRHPPPRPVNF\*FLGEMGFRRV 
AQAGLEL 


1318 


15219 


A 


1326 


410 


7 


GFFPYP+LPK/SLRC*GKDFYNQGSGQA 
RWLPPVIPPPLEAQAGGSP*GRSLRPTW 
PAWSNPSFLKKPTPLFLKFS*GRRIALT 
PKAKVS VNWDS P PALQPGGPS KTFFP KK | 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


1 Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProItne, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, ' -Stop codon, /=possibtc 
nucleotide deletion, \-possible 
nucleotide insertion 














KKKLR I I G PGWAHACN PNTLGD 


1319 


15220 


A 


1327 


295 


3 


VCHFG I YFCVCS LYFTCLYFP FLI FL* V 
TLTFLVFYFDFTVLFI ISL* YFNDCSRD 
CNIHM* LLSLP VL I FHCCC \ FVFRDRVS 
LCHPGWSALASSLL 


1320 


15221 


A 


1328 


2 


392 


RQASRPP*SAPPAPAGKEGSGE*PPSPK 
/PPPPPPKWPPRGFPFPPAGPPPPPPP 
KKSPPPKTNPPPPPP/IKPPPPPPPPPP 
PPFFFSPPFP 


1321 


15222 


A 


1329 


378 


7 


TDDLILKFI*ERYKPNQF*KRTKLKDSL 
PDFENYYKAKWKTRW* +YKE\KHIDQW 
NRIESR*QIFNIASII/HIEKEYLCNKW 
CYRTTKMEEKNMNLKPLP*TIKFNSK* I 
TDLNIKTITIWLKL 


1322 


15223 


A 


1330 


403 


2 


VETGFRHVGQAGLELLTSGDLPTLASQC 
WDYKHEPLRLGELRSLTAAWAMQQDPVS 
PNK* I KI K* I / P * SEQFSG I * S IHNWQ 
NRHFYPDPTHFHHSKVKPLTHYWSPHS 
FLSPTPGNHQCLLSVSMDFSIL 


1323 


15224 


A 


1331 


1 


389 


KMKSQAIDLEKIFTKHISDKGLGYGLYK 
ELL * FN / R/RCQTTQLKMGE R * TNS S TM 
GMSFEWMVSKHMKRCSCAFPSLVTREMQ 
VH/TTSLFVERYHYTPTRVAVIKQSDQV 
\W*ECGIRTLIHC+WECKMIAT 


1324 


15225 


A 


1332 


392 


3 


NNFMPSSA/PPPPPPFFFGGPRVFSPPP 
FFKPPPPFFFFGPQKKKIFSPPPPLKFF 
FFLR PPPPFF FFFFFFFFFQKNLGKGFF 
F* PRPEKKKKPPPGGVFFFSLFFLRCS V 
ALSPRLECSGAI SAHCNLC I 


1325 


15226 


A 


1333 


394 


1 


SQWGPTAIHYCAWLVFKFLGHFSKTTI 
SAPLLK+ TSTRAETS KSFS \CPRKMERA 
EGRRMFLGKSLLKQ I VLLF/L ISGSCVT 
AS*FL/CFFQTESHSVAQARVQ*HDLGS 
LQPLSSRLKRFSCFSLPSRTRG 


1326 


15227 


A 


1334 


416 


1 


LTLLPKPTPYNTRKENYRPISLINIDAK 
IL/NILAGKIPQYIKVHN\NMGLrPEMQ 
GLFNI *K/RKSVNVICHINRKEEKKILL 
INA* IVFDKNPTMI *KNSWQ\GEYKEIY 
SNILLNGKMLKACHL\KTRIN*GCQLSP 
DAW 


1327 


15228 


A 


1335 


391 


57 


WQRVCNWDKNLYQKEKRQDNVREK/WAT 
DLI INFTHEEMQSVNNLQKDTHPHSLVI 
REMQ I KTTS Y/HLLAKILNSDNSSYW * G 
CGGMGTL ♦ CCWWVS KE IHALWE I VWF I Y 
SK 


1328 


15229 


A 


1336 


285 




I FFFPWRGE 1 1 * HLS LINKGSRQNERAH 
\KDNGDFSQIISKLKLMYKYKILNQTTI 
KYMFFPKKHRTSIKIEHILGSKGSLNNC 
QRISVLQTGRV 


1329 


15230 




1337 


34 


391 


AKIASLYSSLGNRANSVSKQNKKTKNQN 
LKN/IKTAGGLGWAHSCSPSTLGGLH\ 
DPGV*GCREL*WCRCTPAWASG\ETLSQ 
K*INKGKERKKKMKNCWWMNNSLIQLTS 
FSFTKTDWSS '■ 


1330 


15231 


A 


1338 


292 


1 


GGGISKTPGGEGAINPKKPPALPPGGKK 
EAFSPPQKKKKGERA*RDISSEDI*MAN 
KHMKSCPPLLMIREMQIKSTMRYRFIPI 
RME\ILKKQKNS . - | 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteiue, D=Aspartic Acid, 
E=Ghitamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Wpossible 
nucleotide insertion 


1331 


15232 


A 


1339 


369 


16 


DPWQNEQLiSRVTCPGPRRASMFRIT*YK 
VIIAAliMAYSVGPRAVSCIRAliWTTYG\ 
IMDNYKRHERRFLKPL>*WPGMVAHACNP 
STLGGHGRHERVGRVRSIPDCWRVAAG 
VKASI 


1332 


15233 


A 


1340 


394 


1 


TQSGVQRHDAGSLQPQPPRLKQSPHPSIi 
PSiBQDHRRVPPCPARPCSSHDSYVLREG 
PCGRW/HESRGRVFLVLFS* **LSLMRS 
GGFKKGEFPCTSSL/SCLPPST*DVTCS 
SLPSAMNVGPLQPHI FMHGF 


1333 


15234 


A 


1341 


400 


161 


KIGPNLPCFSTKRPWKIDKIQFWPPGKV 
F*/SPPPPKKILFFPPPPSGPFF\PPLP 
PLPKSFFPPKNPNLPPSFGFDIPPP 


1334 


15235 


A 


1342 


377 


3 


NRHFPKAETLKAS KHLKRHLPLLVI REV 
QIKTTNI CHNI ATRLAKM* KADNTKCWE 
SCEPVLTFY\C*WEYKLVQPL*KMVWQK 
PKVCVPYYLAILLLQHIPEGTCAS/CY* 
DIHKRLLTAMLFAM 


1335 


15236 


A 


1343 


170 


369 


RCNENKI TKEVYFGEI FVRLDNKEKNTS 
FFYFYFLFY/CWR*GriAIiLPRliVLNS*A 
QWLLPWPPKML 


1336 


15237 


A 


1344 


385 


2 


SDLRRSTHLGLPKCWD YRC / RATVPGRL 
FIFLMVSLKHKSFKF* *RPI YLFFLSSL 
ML.LVS YLRNH* LI QGHKDLLPMFSS KS F 
MILVLTSRSILSLLFCFLFFVLRQGL/N 
SVTQAGVQRHNHGSLQPQP 


1337 


15238 


A 


1345 


1 


281 


HMATKHMKRC* / SEMCTLKPQLNTTTPD 
HQNG*N* K* LVNFKGHQ* FGAVWNLIHC 
W * DCKLAQP FWK I DLS IKAK I SMSYSHV 
VRKIFKLKNK 


1338 


15239 


A 


1346 


176 


1 


SVIWNIKEP*IAKIILKKKTIVGGLTVP 
DFKT YYKAM VI /KTV * S WLKDRQRNQWY 
RED 


1339 


15240 


A 


1347 


398 


2 


PRPPGPVRRRCPS/LTATSGSSSPPSSP 
FYLGYPQGFPLPAVLNRGPG ILFWGPHK 
KNITLPARGR* IGSS*TAPPFFF/ SLFS 
FLLFSFLLFDTGSYSVPQAAVQWFNHGS 
LKPQPPGLKLSSQLLRRLGW 


1340 


15241 


A 


1348 


9 


395 


GLQNPCVGFLVSGFFFFFFFPPKKGLGV 
PPKKK/RGPNPNGPVGEFLATGPFFWTG 
PLKKNPAPPRALFWGPPPPPPWGG*NPP 
PFLARGS PFFKKFFGGPDPFFYRTNPRG 
PNKRGPPWPLKWGNPPPK 


1341 


15242 


A 


1349 


119 


1 


ENVFRNM / W PGMMAHTCNTSTLGGQGEW 
ITRSGV*DQPGQ 


1342 


15243 


A 


1350 


58 


396 


GIRVGKVCFIIFFFGFFFFFGKGVSLiPP 
GRETGGPPlALIKPPPFGVKGIFLPPPP 
GGGG\CGPPPPPRVIFVF*GKGFPP*GP 
GGF*PPAPKGSAPPPPPLWKNWGNRIFG 
GR 


1343 


15244 


A 


1351 


396 


1 


GPTKGPGPFIjDGQGFPPPTLiKQNFFFPA 
LFFPPLGKRLILGGF\ PKPGGLLQPPT* 
KPLRFKPGGESDCFPKFF*GRIVCFQQS 
LC F FP P PKKKKTAFGKRFKKKTLL FFFF 
SDGVSLCHPGWSTVAQCRL 


1344 


15245 


A 


1352 


1 


250 


RRL*SQLLGRLRRQNGVNIiGSGACSERS 
\CSERRSRHCTPAWATKRDSVSKKTKCR 
RHQHVGS LTVRLRPGY W I RHCGCHWWP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalnnine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1345 


15246 


A 


1353 


249 


3 


CGATLCPKKYMKITPSFFVEIDN/AILK 
FIWKFKGPRIAKTTLK*KQG*RTHTFHF 
HNI LQKAI I VKTMW Y* YKHRQS PGRVAH 


1346 


15247 


A 


1354 


15 


416 


LLTLSSSLKITIHTNKGRT*SFIRENII 
IFIATTNLLGLLPH* FTPTTQLFINLAM 
AIPL* AGAVI IGFRSKIKNALAHSLPQG 
TPTPLIPILVIIETISLLIQPIALA\AR 
LTANITAGHLLMPLIGSATLTI 


1347 


15248 


A 


1355 


2 


416 


IKYLNVRPETLKLL* /ENIE*NPHNIGL 
GSDFFNLTSNSQGIKEKIDESDYFKLKS 
CCTE S DTTNR I RQLKNERKCLQ I TCDKG 
LIF*KKLK*LY/KQKTNNH*K/WSKRLK 
YFSRKDIHMAKRY/MKKCSTS I IRENKP 
K 


1348 


15249 


A 


1356 


426 


85 


HARLVL* FVF/ LFETAYLSVAQA/GGAM 
AQSAHFSLELPGSSDTATSTSQVCYHRN 
T/RLLF* I FCGYG / RLCCLG WSRTPRLK 
QSSHLSLPKCWDYRWEPLYARPYLSCFP 
ENARLG 


1349 


15250 


A 


1357 


2 


301 


GGLLEVKVQDQLGQHNKTPSL/ IKIYIL 
PIYTQKN* KNLTRHGGMCL * SQLLARLR 
*ED*SSPGI*GCREP*WRHWTPVWTIQQ 
DSKSRGEKKQVI FTHYT 


1350 


15251 


A 


1358 


2 


389 


FLHVGQSGCELPTSVDLPASASQNAGIT 
GVSHRSWLKFSLLDVPSPLTP/VSSAVP 
LIS YLATGWRQAAI AASPI FLHQLARPA 
QPAREAAADS*LPADSAFPPKQWTC* *M 
FTGALFKKKKKP CFVAKK 


1351 


15252 


A 


1359 


309 


3 


KWDH IKLKNFCTAK * TINKVQRQPME W* 
K/IFVNYPVDKGLITRIYKELKQLYRKK 
KSNNLIFKMSKS\SSLAIREMQIKTTMR 
YHLTPVR I LVYVLP KRARS 


1352 


15253 




1360 


300 


2 


KRAFH* KKMPSRTFTAREGKSMPCFK/ A 
SGWATPVI PAL * EAKVGESLEPRS SRPA 
WATRRDSCLF*K* * INK* KLTLLLGANA 
AGNLTLKLMLICHSKS 


1353 


15254 


A 


1361 


396 


60 


HKVTRKSDGMLCHSGFCS INQLAFCCLF 
FQPGGPRGTPLP/PYKPPPQKTPKKTGP 
GGGGLYSPPFGGGGRGNPFFPGAQGFF* 
PPSPPPPPPPGGRKKIFLPKKKKKSLLF 
FF 


1354 


15255 


A 


1362 


37 


385 


ALFSFSFFFGFFWGKKIFFFGQGGRAGG 
HSNLPEPPPPGGGAFPRPNLSGGGE*RG 
PPIS\GEIFGFLKKTGVPPGGRGWFKPP 
APKEPPPPAPPRGGISGQDPLPPPVLPG 
WGKKN 


1355 


15256 


A 


1363 


12 


421 


EPWEPQTLGICTHLQTLFHDYQVLMKML 
WTVRYHLTPVKMPF I RKTFDTAGM* KK 
KKGCYQGS KKGGML I /HPPWGAL * MS I V 
KKKTWGTFFKKIQKELPWDPAI PWLGMY 
PKEGNSVFQRGMGLPSVIGTLFPIAG 


1356 


15257 


A ' 


1364 


278 


3 


CCTGEKLETFLLR*ATRQGCPHLFDNVM 
EVLANAVRYEKE I KG IQNGKKKVKT\SL 
FPGDEIV/YAENPHKS*PKNSL/KLKSD 
CSKVNIQKPIAF 


1357 


15258 


A - 


1365 


265 


330 


WPGAXAHACNPSTLGGRGERIT 


1358 


15259 


A 


1366 


379 


1 


KRKLMYC H WECK * IQPGWKTVWHFLKKL 
KLELP YDS VILLLYTFLKES / PVYARDI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 

NO: in 
L'SSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
E=GIutaniic Acid, ^Phenylalanine, 

K=Lysine, L=Leucine, MHMethionine, 
N=Asparagine, P=Proline, 

T=Threonine, V=VaHne, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /^possible 
nucleotide deletion, \=possible 














CTPKFIATLFI IA\ RCSLTHD * TKKMWY 
LS I ME YYSAIKNE / YLPFATTQMNLEDN 
I FS EE S Q A * KHG VF I FP 


1359 


15260 


A 


1367 


1 


376 


HLPGAESQPPPV/DNSWDRPAGRTQLLW 

VQSCS * APSEAQS LGAADS \ G PAATL PA 
RQL I TKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKDDSGKG 


1360 


15261 


A 


1368 


2 


409 


EEAMPKAKMG*RPFAPNSQILNAKKKLL 
KE I KS VTPVN I * M I RK* NSL I ADMKE V* 

SMKAERREEASEEKLEVSR/GWKSAGFM 
RFKERSNLHNIKVQGEATSTDGEGA 


1361 


15262 


A 


1369 


220 


416 


PQPLFDWQMYT YTLHND I LVNDGRLS PG 
GQGCSELSLiCHCTPI *MT\SETLjSQKKK 
KKMGQARGLS 


1362 


15263 


A 


1370 


310 


289 


AFR*AFHSKGSK**H*EKKVN/WNFTKT 
*NFCTAKX>IIKRMKRQPTKWEVI FANHI 
YLTGILIS\KIYKELRTTQ*PETQSLK 














HASERTHRRGKS TE TTTPAWATERGS VS 
KQKTNKKTNK/RNTGSSIHNMVSDFEKQ 
VTQTF*SSMQMSNKPLKRYLTSVIIEMN 
I KPEYHFTPIRMAI I KKTDNTKC 


1364 


15265 


A 


1372 


3 


332 


G KE VS * E D I MMLHVY APNNRAS KHM KQN 
LIEMQGEI /DE/HPVT IGDFS/TPLSII 
DRSGQKI IKDIV/ELE/STVKQIiDLI /D 
I YRTLYLKRVEYVFN FTSLS KTFATINC 
ILiGHKV 


1365 


15266 


A 


1373 


381 




CS VAQAGVKWCNHS SLQP* TPGVK* SS C 
FSLPSHS DYRHE \ PAL FFKFFVQMGS \ T 
ML*FS*ESCIKGSRQPCSYTYLHSPPIF 
SSLFPSIiAPESIKRQEPSFFFEMESCSV 
TQAG VQ WRHLS S LQA 


1366 


15267 


A 


1374 


1 


385 


YKKGINAKIFNKI * *V\ESAISHDQVGI 
VLVXi QG F LS I / RNKS I S VTHHVNR IjKN / 
HMLI * IDAEKAFDKIQQSSHGKIGIKGN 
FFDLLKS I YRKPMANVTLNSE KMKCLP C 
KCMCTSQGCQLSPLFFLFFF 


1367 


15268 


A 


1375 




357 


CSGVI S PHCGLKFLGSNDLPAIASRVAG 
I IGMTP\HAQLIKWFCCCWWR * CLAFGG 
*RFKTMCRPIVK*NKSINK*NRTVCMSG 
CGDS / RQLLGRLRWS PGLRNVQGCSEP* 
WHHCTSACL 


1368 


15269 


A 


1376 


1 


375 


HRPKFKSISLLEENMGESFHDLGL/GSD 
LLDMAP I VQS IRDKTSDFLE I KKS CSS K 
gtv*imkkqatdwe/rtfvkhtsnkgs i 
MYEEL*NIj*K1j*NNPI*KWAREFSRHL/ 
EDTQMATKHMK/KCSTAL 


1369 


15270 


A 


1377 


260 


400 


KRGLGKTVFPWPKKKNGPFFFFPKTTSK 
GRKTIiMVIsTP E * PKY YS VKKKKKKJCG GGL 
PFPCKIFFPNLTIQTTFFWPKTNPLTLK 
KRKKPPKKTPTFWGHFFFERGTKKTQGG 
KKNPFKKGGWEKLFFLGQKKKMAPFFFF 
QKPPQR/WRKTLMVNPEP*NLLKKNM 


1370 


15271 


A 


1378 


403 


1 


TGVSQALSLFFFKYFKPRMVEFLVEPSR 
*KGPSVYGLQIFFSIPKVPFPCPFWAF/ 
SLV*SSPPCLSLHLLPAFGVI.PKRSLPR 
PMIiQSFSPMFFARIG*FQILLLWFQSLL 



615 



JSDOCID: <WO 0164835A2_I_> 



WO 01/64835 



PCT/USOl/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspoh 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Asparric Acid, 
E=Gluramic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














S* FFFFFGDRVSLCRPGWSAVAP 


1371 


15272 


A 


1379 


195 


381 


AERYQTSRSTADRISCLFI *SAEI/VFP 
DSGKKTGGNNNNNNNNtWTITOKDNNNNK 
ISLSLKP 


1372 


15273 


A 


1380 


361 


39 


PPVKETAKDVNRRFPKQPMTNST*SDA* 
VDEWCL*FQLIGRLK*SRRIA*AQEFK 
AS/HGQHSETLSLKNYLKKKRFSTLLFL 
REMQIKTTTRYHFTPIRMAKK/SHHTKC 
* Q GIAML PRLALNS WAQ AI LLLHFSLPI 
SWNYRHTTSSTQASLHVLFVIGCLGNRL 
LTSFAVSFTGG 


1373 


15274 


A 


1381 


400 


90 


LPSSWDYRCAPPLLANFCVF/M*SQGFT 
RL VS WS * PQD P P TSPiSQ SAG I TG VSHHA 
WPAFFIMWFSSSYSR/TMH**QHPHV*K 
LHICPN**KS\LSCPQVPTIIW 


1374 


15275 


A 


1382 


1 


431 


DNI PKKSAVYYWI TLFKKG* DNVEDEAY 
SGTLAI S I C/ EEN I HLVCS LSDE EQHS T 
AQTIANTIDI TVG LiAYT I LTAKS KLN KXi 
CT* *MPKLSYPLF*KIL*KNKTKNIV\P 
NQLQ I KAELPVE I LHNCDQDPET WLGG V 
AHTYN 


1375 


15276 


A 


1383 


2 


432 


ELSADVS FFF I TTPLSGVL * QNWG I AAF 
IPIELRSPTEVTFSFDVGNGPFEISLQS 
P THFSDHQWHHVRVERNMKEAS L /Q VDQ 
L.TPNTQPAPADGHVL1LQI1NSHLFVGGTA 
TTQRGFLGCIRAL/RMNGMTLDLEERAQ 
VTPEVQ 


1376 


15277 


A 


1384 


1 


421 


NPPALASQRAG I AD I CHCAWPLLRLS KP 
QFSNL*SKKFALGDV*SSNIL* YIYIFF 
EMESRSVTQAGVQRHDLGSLQP\LPPKIj 
SLLPPKLSLLPPK\SASCLRLAGSWNYM 
HVPPRPANFC I FSGDGVS PCWPGRTRTP 
G 


1377 


15278 


A 


1385 


2 


416 


IFSVDETALYWKMMPSRTWYL/RKEKSM 
SGFRDSEERLTLFL\GLNAPGELELEIL, 
L I * HPE I LGPLNYVKFTLS VFY * WINEA 
LITAHMFTAWLTECFKPSVETYCLGEEM 
PFNILLLIGNAPGYPRAPMEIQEINI IV 


1378 


15279 


A 


1386 


170 


1 


DGVFLIFGGQNEKLNKNKDG/DLTKLPR 
IA*KSWARQSSCLAFSKCWDYQREPPCL 
A 


1379 


15280 


A 


1387 


47 


418 


FWGFFFFFLKKKKAALGPWTPPPPGQGG 
PPPPPPRGPGKGGPNPPRQTIYGFWGQR 
GPPPGGGGEPGPPPPGEP\PAGPPQTGG 
PQNWPPAPGP*KGF* GAS KGAP * KGVDP 
GGERS Q VPNRG A 


1380 


15281 


A 


1388 


58 


503 


RPTRPGNYIMIKEFIFQKDRTI *NVYAP 
XNIALKYIKQKLINLKGKRDKLTLTVAN 
ISTCP*VTDITSR*KICKGTEE1>F*QPA 
\DLIDIYRTLYPTAA*YTFFSSAYNYSK 
IGNS IGHKTFSN/ CKRNDNIQWLFSDHN 
GIK*EINIII 


1381 


15282 


A 


1389 


2 


401 


LVSQWNS KNRI LKKI KNS TP VHT / RQM 
I RKQNNL I ADMET VL WW I EDQT S HN I \ 
S QSMTQS KALTL FNS VKAERGE EAA \ E A 
GRGWFTRFKERRQIj*NLQVESEPPS\AD 
VEAVASYPEDVAKI IDKDGYTKQR 


1382 


15283 


A 


1390 


294 


3 


KIVI FDFNDVKNCSS * KIVI PDFNDVKN 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


I Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M-Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














CS S KT V IEGKVSDCKKIFSKHISDKYLI 
S IIYKELSNLNNNQQPN * KWAKDL/NNM 
SSKHRKICSISFGKMQIKTQIRNHYIPT 
DAW 


1383 


15284 


A 


1391 


3 


422 


PLFKKEEGTNFPFPPFWGRFFPGGKSFS 
PFFFPPPPFIKGGPKVSQKGPRFFFFFF 
F*DGVS1iCRPGWILFPPQVTEALLiRGFIj 
ALSSLCFRLDKFFFFF*R*GHLiLjSRWDY 
RHE P PHLR I S * R KT KE \ KGL I M L TRL VL 


1384 


15285 


A 


1392 


3 


400 


FLYIYKDIYR*YKTFQINNKLKIIYFK* 
EKDLSKLFTKDVQMTNMHI *MGSTSLI I 
KQKQI KTTMKHHSSLPECLHFNAANITC 
WKRLGPAGTLILRW*ECKSVKPL/WETF 
*QCLIQLSMQENYH/DPATPLIG 


1385 


15286 


A 


1393 


409 


2 


I VRHFS KED IHSTNEHMKKG FLSLVKE V 
Q I E TTMGHHH I TVRRVK I KTD I FAS A\ N 
TKC* * GYGKTRTLINS /YWECKT VQPL/ 
WKNILAVAYKIKHTFFPPRIHLSGSQKS 
HFSIYSKEMKTLRQHKDIFMATLSQN 














KSPFFFFEKGFPSPPPGGAQWGGFPPPE 
PPPPWK*FSPPTPPNKWGNRPPPSW/P 
G*LFFFCSPGGFPPLPNWFLTPPLR*SY 
PPGLPKRWGFKQKPMGRG*RAFFCPPKI 
SKKLWSGKKKKKGRPLiAI 


1387 


15288 


A 


1395 


83 


411 


qrdsvsktkqkkinnksrtdlnrhftke 
drwm/ankhlkryst*lvirevqikttm 
ryyyisikkskinkpdhtncwqgcgrpg 

TLIYC*WE/CKMVPSLK*TFH*FLKRLN 


1388 


15289 


A 


1396 


308 


14 


NFFFFYSWSNI IYCQKGTAFTPISDKID 
FN*KIISRQK*QYMLIKGSIKN/DITII 
NT YT S NNS S P KY I KQ TLTGLKG E I AP S T 
IWGVFNTTLSIIK 










321 




KQ*CSIEYS/FKYTR***SKL/WPGAVA 
HAYSPSTLGGRGGRIT*GQEFKTSLA 


1390 


15291 


A 


1398 


183 


48 


KWPGAVNHACN PSTLGGQGGR I / TL.RSG 
V * DQHGQHGES PS L VGRVR 














KAGMAILIDKVDFRTKNIITE* /EKHFI 
I TKGS VREENI SVLNV/ WPHYRAS KHMK 
Q/ILIELHVELGEHTIIVG/DFNISFSV 
IjNRIDKESARRR*NT * NNTNHQIiNIjVD I 
Y * KTKNTTNGRTTFFP SAHKLFT 1 1 IH I 














YSCDHNSVQPQTPGLKQSLCQPPE*IAG 
ITDARHHI WPI FF\LKRSFVFIAQAGTQ 
WRGLL*KKKKVGGFKKGSCLTLPGRGDY 
RRFREEKILIPGRGFCNELI 














FGLGKHVLHMTSKAQS IKEKTD* LDFLK 
IKNFNFSKDTIKGMKRQARNGEKIFPRH 
QS /DDF F * WAIDIiTFPKEDKVMANKHI K 
RCLRLLLL\SMRYHYH*-NKKTDHTKC+R 
ERG \ ELELLCCW * E Y KMVQ 


1394 


15295 


A 


1402 


149 


402 


RFWIHLYRKRVFPPLLNPVITWKEKRGF 
VFVFVF*DRVLLCHPGWSAVPWS *LTAA 
STSWA\K*SSHLSLPSY*DHQHASPCLC 
N 


1395 


15296 


A 


1403 


1 


403 


KRKSVPELIC/APRC*APCSYRPNLPSS 
PGPHLDPVGPTPTPSPPRPGPARGPPAA 
RPKEPAEPGKEERRGlAPAPGG . 


1396 


15297 


A 


1404 


342 


3 


KEDNLIDKGGSPKQQI FKVW* TALY/ YT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ALYSKKIPPRTLIAGKEKLMPGFKASKN 
R/TLLLGANAAG\DFKLRPMLIDHSEL* 
/RPKSTLPVL*KKA*L.RAHLFIALFTKH 
FKPTVE 


1397 


15298 


A 


1405 


498 


66 


KNYAKSTLPML*KWNSKALMTAHLFTAW 
FTK\ FFMPTVETYCSEKKRF/S FKMLLL 
IDNAHSHPRALME I YKEINI AFMSAKTA 
STLKPMDQ \ KV I LAYKS S YLRN I \ FHKT 
TAALSSDSSDASGQSKFK* I WKGFTRTR 
GRTRGSAR 


1398 


15299 


A 


1406 


3 


411 


VSPLCLGRSQTPEIK*SSPLGLPKCVDY 
RHEPLYIjASMGVLNCPISNLGQCIPP*A 
AWWSLLLGC\PILMLNLVQTPDLPSVLQ 
PRAPGPR\HPPVSAS*VAGTLGTHHYAW 
LSMLLGSTL*DVFPFLSPSLIPSEE 


1399 


15300 


A 


1407 


361 




I RVPQKKK I KS PGRGH KIFSF* RAGP P F 
F FS F F FLRQ KE SHS VTQ ARE Q WHDHGS L> 
QSRLPGPK*FSH\PPPPHPPVTGTTPLF 
I YYY Y YFLRQS L / DS VALAG VQWHDLG S 
LHPRVRPRV 


1400 


15301 


A 


1408 


393 




PGFNISVLKKASGGLFFFSPLGKKGFFS 
QPFFFGSPRVFPPPPFFNPPPFFFFCCP 
L*KIFFSPPPGTjKFFFFKRAPPFFFFFF 
FFFPDRVSL\CCPGWSAMVQSRFSATST 
LRAQAIL 


1401 


15302 


A 


1409 


393 


60 


LNVNPNL* SYLDKHRGMLYHI LQSKEFL 
S KTS KAQA KE VTL C K WD YNKL KM FCTAK 
KTL/IKVKR*STE*KKIFVKY/STTK/G 
*TYRI FNKIiNNKNWTQFKNRAETSRS KM 
AE 


1402 


15303 


A 


1410 


422 


2 


VNDRKMDGWMNGWKEGR+ TQKKKGGRRK 
RE KERS M FLEGRKEGREKEREERE KE SQ 
KERQKRGKARKEQRQEARKQGGR/EREK 
ERERERGREREGERGRKSSS * TGSKWK 
I ILiLiAVKMliVHLKSLHLQKVCVFSFSRS 


1403 


15304 


A 


1411 


398 


64 


PGFFFKKIFVENFFGPHFQFFFPPPPKG 
KNPL*PPQRGPG*\RFPGVFPPLKKGSK 
KKFPPPPRGTQPRWGKKPLLPKRPGFPF 
GPRVS FFCLI E IG WFFWAQVPKKKKKK 


1404 


15305 


A 


1412 


391 


125 


EPAPFFFFPPPGKRGFFP\PP+ FGYPPG 
FSPPPFLKTPPGNFFLGPKKKKFFFPPP 
GEKICFL*RAPPHFFFFFFFFFWPPRQA 
ICFIIE 


1405 


15306 


A 


1413 


3 


387 


TPDLK*LTRLiSLPK*WDYRRATT?G\LF 
FFKP FAS LG S I SM * YHS P VCLAS KLR I L 
FDSS FS PSATAKHPLTSAFAAAWLS PFA 
LIQPHSSESSLYSSYCPVPILLF/IYLF 
I FLRQSL/DSVAQAE VQWRD 


1406 


15307 


A 


1414 


409 


1 


KRSGPFFYF/ SPPFKKGSFAKPFFLGSP 
VFFPPPSSPP\GFFFFSPPPKRGFFPNP 
FFWGAPGFFPPPLF*NPPPDFFFWGPKK 
KKNFPPPRGKKFFFFKGPPPLFFFFFFF 
FFFFFFFFFYKI I S I KGEIGRSSWR 


1407 


15308 


A 


1415 


3 


385 


HRPPPVMGLSPRPAAAEGVPGPPAVLAH 
QHHAQFLARP*LPPHGAGLRTCSPPCLS 
LPNPPPWAPVWPEPP* *ALPPAP/AVPG 
PIN 


1408 


15309 


A 


1416 


2 


238 


PGGRGCSEL>LPRHCTPAWVT\SKTPSQK | 
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PCT/USO 1/0492 7 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histi(line, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














RKKREKPGSL*EKTIiFFCQTLVFGPKKK 
NGPPLLGRVWAKRKGGFHPPLKN 


1409 


15310 


A 


1417 




394 


NKHLKRCSTSSVIREMQIKTTNRCHFSE 
* KHW/ 1 INMCW* ECGQTELSALLIRC * \ 
WDSEMVQPLWERKEYLKEITAESSWDPA 
IPLLDI \YPKRNEASC 


1410 


15311 


A 


1418 


307 


357 


LETRKTAEKIKRRAGFEK*N1jETNKNGN 
TISQNLRDVAKEVRREKFIAMNA\ *SKK 
KEKP* INNLMIiSL.KELEKDEQTKPKVRR 
KKKI LN I RAE I IRD * KNSGKD * KKS WF 


1411 


15312 


A 


1419 


341 


3 


IFVFLVGRGFPPLTRMVLIFWPHDPPPL 
PP*VIEGLFH/LPAFRPGAFLRKMVPPY 
*KNKIPPPRKNTFFFLKPPPFFFFF*DR 
VLLCHPG WRAVAQS WLATAS TS QAPAIL 
PPR 


1412 


15313 


A 


1420 


405 


1 


WGEELVPSGRVHELFSSALRSERDTDSV 
CSCGGQCHCPGAI PGAALKAACRGD PAS 
RVGVETGSTMGNNGFFSFFIFKPYFFRD 
RV1.PCHA\AMAQS*I.TAASTS* \VKSSF 
CLSLRS S WDYR CVS PHLANFKTF 


1413 


15314 


A 


1421 


1 


399 


QPTLLTELGLRPVPPHPATMNILiY**IN 
TILHYSYKNYFLFLFLRQGLQPLQAGVQ 
WQNHGSLQL+TPGVSFLSSWDHPNPANF 
F 1 1 FFL ERQGL TLL PRQVLNS * AQ / CNT 
P AS AFQ SAG I IGMSHCSWPYKN 


1414 


15315 


A 


1422 


168 


414 


EWREKAGESLEPG\GGGCSELRLCHCTP 
AWVTEQDSSYKKKKKTPGGGKGVF/C* K 
LGGGIKF* KRVFCFGKGPTKKKTGGEA 


1415 


15316 


A 


1423 


369 


1 


EHFCKSQDRDSVPKKKYIYIYPMSVFMP 
I NLTNW I KFLERQKLS *Q I EEAI ENLNS 
IY*/PNESEFII*SLPITKAPGPYGFTE 
FYYS FKGEAMQ I LHKLFQKRG * RHS FSN 
VIKTLKSKPEK 


1416 


15317 


A 


1424 


403 


1 


KNTEHSKRYSIPQRYCGFSSREFFHFLV 
HMKVMFPLFKCAIALCUKQVYTVFFETE 
SCSVTR* /EVQQQDHGSQQPQPPRFQ\H 
P PTSASQ I AETTAI LENFRKI C I YFLRQ 
SL/H£ATQA*VQWRNHGSPQAPPPG 


1417 


15318 


A 


1425 


401 


3 


FFFFFKKNFPFFFQVEGKGRKLG*QPFP 
PG\LKNFSAPPLPGGGN*R\RPPFARLF 
FFFL * KRG FS PLNQGGLGS PPFFS PP\R 
APKNVGFQGITPPPGNFFFFFFFSEAES 
RS VTQAGVQWRE PGS \QPLP PGF 


1418 


15319 


A 


1426 


386 


27 


CRINGVLILC*CECKMAQTL* I * FGNFL 
QKKLDLTCDSTIPLLGIYPRDMKTYVH\ 
KKTCT * MFTAALL 1 1 TKS WEKLLR PSEG 
EWINSRHPYDRIVLSNYETHNVNKIQRH 
YAEAKKTV 


1419 


15320 


A 


1427 


398 


2 


GLQVHATTKFFKFFVETGSCYIAQAGLE 

LLVSSNPPG/SS fpkcwdyrcetrrwpf 
LFKFSLPIFVF*LNHLFCLHLLPLFILK 
LNLPCFM YYLFI PPALYSLSLIALFRND 
CDLSP IHL.EVKKYFSKKDRL 


1420 


15321 


A 


1428 


401 


70 


HHAQLRCFFNSFVEVGVSLCCPGWSQTP 
GLKRSSHLNFSKCWDYKR/AAITLPSFF 
* I ALF+ ENPPNCHSLPTPLSPPKLAALG 
NSCLHPWJbPQPPVKGLSDSFVCMHPGS | 


1421 


15322 


A 


1429 


388 


131 


RGGGCLQRH\RGFKQENWFNPGGRGSS* | 



619 



*JSDOCID: <WO 0164835A2J_:> 



WO 01/64835 



PCT7US0J/04927 



SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenytalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=JLeucine, M=Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PRSMYCPPAWGAQGGSLSQKKKKKKKKS 
KNYQKILSTQKTQKSRVFIVWSKHCTRH 


1422 


15323 


A 


1430 


184 


413 


VS FLFD FY FL FF * RW / RS HS VTQAAVQ * 
CDCGSLQPFILFYFLRQSL/NSVAQAG/ 
VGSLQPLPPGFKQFSCLSLL.SIWD 


1423 


15324 


A 


1431 


76 


533 


SCKRTTGRQFPDLPTRPPTRPQGFALVA 
HAGG*WRDLRSLQPPPPGFKRFSCISSV 
LWCTKAFNFDFQEIYSFSS* I*GFDSFE 
VKF FFLEMGVSL \ FARAGLKLLGS GKL P 
ASASE/SGGI IGVCHWAQLL. 




15325 




1432 


373 


29 


RQGFPPMGRWLDSLKKKKFQIPNSKIF 
PGGGAPLVIPPSREGEAGKFF*P\GGKG 
AIKQNYC/HCPPTWGKEGNFVSKKKKKK 
EKKMHQK *RC VKDQO * T * MLLFPSLKS I 
YYFK 








1433 


389 


73 


TDCPAIGRNHRDPVRPLLSSPHRKI/YA 
NLY Y Y 1 1 VR I LTL I Q S TDLM * I S PWLV 
LTRVCVCVCVCVCVCI ILPS /CYPVCRF 
MYPTTTVKVQNIS IGQEKQKTKT 










402 


39 


QAGLQLUASSDLPALASQSAGITGVSHC 
ASPSISL*APLGLDTFSDFPCFNDLDSF 
EEHWSGML/SECP*TGI*DLSFFS*VYW 
GYGFLiEDHRGKVPFFITSYQSYTPPTQL 
I I ADVKFEQLV 


1427 


15328 


A 


1435 


105 


387 


EFS KVAGYKINTYKSIVI LYNSNE * FEN 
\ KK I I P F I I AS KR I KYLK I / QFAKAVQD 
FYAENYKTW * R I IKEDLNK*KNVL,C* WI 
GRL/NI/LKMVLLP 


1428 


15329 


A 


1436 


242 


2 


FGGGFFWVGTPNKMGPFPQIIKNLF/TR 
PPENF*KPP/LPFIAFFSLGFLGVFFFF 
*DIVSLCCPGWSA*AQSLLAAALTSPG 










41 1 


0 


LLFVAQECLKWFSALGGPGVPCS\LPAN 
QQPSVWVP*QS \EHVTVLQRFCLWPV1jG 
PAYGQ I WEPVPS TI.K/PPPPGFK*FS * V 
SLLSAWDCRCTP/PTDPANFCIFSRDAV 


1430 


15331 


A 


1438 


41 


412 


FFCTD * GSLCCS S WTQTSGLKRS FHLS F 
pncwnygh/ras vligl i fffflrkkt / c 
P*KKFL*NEDIiLCCQLGLEIjIiAPRDSPA 
LGPQSVGTANVTPRPW/LLFL*TQNYLN 
SSRVRFSPAPLKYKEV 


1431 


15332 


A 


1439 


1 


406 


HAYN I V\NPS PVSLTGALS S LLMTYGLT 
MGCHFLSITLLILGLLTNTLTIYH*WRD 
VTRESTYQGHHTPPGQ*GPRYGIILFIT 
SQVIFFTRFF*AFYHSSLSPTPQLRRHW 
PPTG I TPLNPLEVPLLNTCVIjLR 


1432 


15333 


A 


1440 


3 


422 


mekvswwidqtshniplsqsliqnsal 
slfssmkte*geeaaeekfeasrawlmr . 
/feerchhnvkvqg\eaasgdreatgsy 

divdetalc*kkpstyflffflkqslal 
PP 


1433 


15334 


A 


1441 


375 


2 


KVSS PRAENVI LYNCPPLFFFLCQD/CH 
SVTQAARQWRHHSSLQP*TGL\ASQVAG 
TAGAHLYPSQNLIGRGIRS LA/ SVAQTG 
VQWRNIGSWQPLPVRIKGFSSLSLKSSW 
EYRCPPPCPD\LFIFL 


1434 


15335 


A 


1442 | 397 


122 


GQASLELPTSGDPPASASQSVRITGMSH 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Asp.irtic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G— Glycine, H=Histidine, I=Iso)eucine, 
K=Lysine, L=Leucine, M=Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














CARPILGSCISLFAYPFCFLKCFLF/CC 
REVILL * CSGWS * TPGL.KQSSGL.SLPKC 
WDYRCEPP 


1435 


15336 


A 


1443 


3 


299 


WE KM FAMXX.S D +ATVSRPYR * LLQLNKT 
KYKLKLSQS/MNRHFTRGDVAMANKHTK 
RKSTSL.VIRE I KTMRYHYLLKRMAEVQK 
/SVKCWQECG * GGGKCG 


1436 


15337 


A 


1444 


10 


410 


CTLFLSISPDSAGGICTMLLFQNTLNFK 
I SLALGVS DKCFS NFL I MTI VRNVF Y I H 
GKKCILYCSPG CAHTHMHTLMHTH I HL K 
QFFLRRGLTLLPRLECSGAITAHCSLDL 
LGSDSPP**EMYFIYMV\EMYFILFPRV 
CTHTYAHTDAHTHTPETI FFETGSHSVA 
QAGVQWRNYCS LQ PRP PGLRFS S YLSL P 
GS*NHRHVPP 


1437 


15338 


A 


1445 


431 


9 


GPAQSLDSSVSPLGTISSTVKWRHNAYL 
AGLSGGI/PCDCFLGPRVPWGSGICQWH 
WAVSPTLS\PTVRPPLFRET:LYFT*V+V 
PPLETCPQ*HISHCRCVINWCEDTHTPK 
KHLVCVHACVCVCVCVCVCVCLGRSRQS 
EH 


1438 


15339 


A 


1446 


85 


416 


PGCLSLPKIWDYRCEPPRPPHFFIFSSI 
QGPLTDFFLTPLEQVQISPTAKGFLKKI 
PFFFF* *RRGVPIIiARWFKS* PHDLAA 
SASQKFGITGVSPHVHPIFLY/CSSIQG 
PLTDFFLTPLEOVQISPTAKGFLKKIPF 
FFFLRQGLT\RQWSTHLGLPKCWD*GCE 
P/PMPGLHCEFFKG 


1439 


15340 


A 


1447 


376 


3 


IKSEIIETESRKPIEKIS/RAQSCFCGM 
VYE I DKLIiARLDRKKMKTQLNN I RN * RD 
E INTDFTDX KR 1 1 RENYKQD YGTKFNNI, 
DETDI FLERHQLPKX.TQEEKDNLNSP I T 
IKDIEIIV*NLSTG 


1440 


15341 


A 


1448 


2 


192 


SKWIKNLNKELKP*DS*KKPEGNLHDIR 
FGNDFLDVTPKN/MATKGKIDN*TLPKF 
KMLMRIWRN 


1441 


15342 


A 


1449 


411 


3 


VFLPPLPPFCFLNFF*GKGGFFFKIFF* 
KKKGVEGPQLGPPPPPGPFKKFFFF* IF 
LKAPLFVPPTLKPLF/CPPKKKNWENNP 
PPFFFCFFVKKTQFYFFFFFFLRNR VLL 
CHPRWSVMV* S CLAVASTSWAQAI CP 


1442 


15343 


A 


1450 


413 


1 


EA*TGEWCNPGGGACHEPR*PPCPPPWA 
TERGFVSKKKRKQEK/ SRIMECFQDNLP 
GFFQI FSVMKNKQEGR* FWTK/ VKMTKY 
NA*NVNGS * YWKKRENEEECYDWKKLF 
QSlltS* YVKKAR * KPGCWLMAA I PAIiW 


1443 


15344 


A 


1451 


4 


384 


DPAI P LIiGT YPREM KT YVH I KT CTQM FA 
AVLFTVAETSKRPKCPTDE / VNKI * CTY 
I/MTYYSAMKRDE 


1444 


15345 


A 


1452 


3 


390 


LPDHPGSSVSTPRGVIITGRGFWLLFPW 
VSFFFFFFFJjKRISIjIjPPNWRGGGQNIjL 
WKSPPPRGF*NFIiA*PPQGGRKKGPPHP 
PGAPGDKNP/Q pff f fgekkki fnp ptg 
gekkkppspfkwggggpn 


1445 


15346 


A 


1453 


37 


381 


lilyinvcvcvcvyiythtqmgshivae 
agvqwcnlgmw*p*tpglk+/fllssgd 
yrltlphlanlyi fffffeqrg \ fffll 
klv*nsgpkatlprvgitglthtprpyr 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
corrcspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Aspnragine, P=ProJine, 
Q=Glutnmine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibte 
nucleotide deletion, \=possible 
nucleotide insertion 














GDLFF 










3 


400 


TEMVLHTEMVL.HISVWDPLLDRPGRGVT 
WLMLTAELFLYGGTEAFSSSFDV+GPCS 
ETVECFSD/L*A*GPP*NT*SPIYRFAS 
FLLAFCIYLLETRSP\SVTQSGGK/WQD 
LTSLQPLPPRLK*SSCLSLPSSWE 


1447 


15348 


A 


1455 


77 . 


398 


RAEIVPLYSSLGNRVRPPSQKMYIYYEQ 
LHAHTFGNLAEMDKLVERYKVLSKFT*E 
E I \ LNS P VS I KE RE F \ AVIS LPQ KETLG 
PDSFTGGFYIFKEELMPILQRLFW 








1456 


397 


69 


CPPGLSGLPWVA*AVPP/RRPAPPPPPP 
AG IDAGLGDDPLQQTHHA/ P PAAAGSAP 
AGCAAGPARGAPRGRSPPRRGSAGAAPP 
PWPPAAAAASSAGGSSAPCASSPAAPPT 




15350 




1457 


400 


2 


ALFFWAPKKKKNFSPPPGKKFFFFKGPP 
PLFFFFFFFFFFFFFFFFFFFFFFFFFF 
FF*RRPIFILFYFLNQKNIPRDFFFFFF 
FLRDRVSFCQPGWTAMV*SQLTAA\ *TP 
GLKQS S HLS LPSS * DYRWVP P C 


1450 


15351 


A 


1458 


343 


11 


QG VHS RKYKE I S KLNS * KKNPTRKWAKD 
TNRH FT KDN I QMENKNMKD VQ AR CSGS * 
F * KN * WFKS YAL * PKCS EI / KKS VTGTN 
WRNSQIYDKLSTLQNN*WIFSRSLYNFE 


1451 


.15352 


A 


1459 


109 


401 


QNYRQLGQWDRIESTVIDPCKYGKX.IFD 
QS AKATE WRKDS * NHQTS / WWKES FTRV 
KALEEITKRJCVNSIaAS ISVKTDVHVIKV 
* LKKKKKKRGGRFKE 


1452 


15353 


A 


1460 


42 


424 


CPAN*NSFSRDR/SLPMLPRLILNSWPQ 
WLL/PWPPKVMGLQ 








1461 


419 


69 


IFPLKKKKRGGGVGPPLiYPPPLGAKRGG 
S P* KQNLKPPRPHKENSPFFYKKKKKGG 
RGGAPFFPPPLGGKTKKFFLPPK/IKVS 
FNPNLFPPPPPPGGKKKFFFPKKKKKKK 
SRNTVWF 




15355 




1462 


1 


277 


CHTNFLCMSFPPAISHHPLMPPHPVSRS 
VAQAGV* WCDLGRG/CSELRSCHCTPAW 
MTERDS ISQLIRKKKKLIKI KKKKNKNK 
SVIFLGGGS 


1455 


15356 


A 


1463 


382 


3 


KAGGS INQNS P P P P P PGEKKKKP PPKKK 
KKKKNPFWGGPRILPHNPPPFER*R*KI 
FLGPKFLTPIjGPKIKPLFFFFLKKKKKK 
RMTY/HSVAQGPLLNKDTLQAG\LSKA* 
RS PPKS KS MGQS FHRN 






— 




561 


86 


NDPILSLiKAEKTFGKIQDSFLIVSSSlA 
NKPGIEGNLLS * * KAS TKQNP I MNLR V\ 
LNVFPLRS * TRQVCLLSPLLFNI VLEIL 
AHIISQEKEIKKIQI*YKEEKLPIjFTSS 
SLFI*VKNLMEFAKKLLELINEYNKVER 
YKINIKNILLAKNTWTLKF 


-J457 


15358 




1465 


.. ... 




R FHRVS QDGLDLLTS * STRIX3LP KC f WD 
YRLEPPCPAKTCI.ILNGSC*VFLCYLSC 
FKAQERPSQNSWGAFVTL 


1458 


15359 


A 


1466 


2 


396 


WWPAWTHTPGLKQTSRFSLPECWDYRRE 
PP\PGLVKFLL/IQYRWTQHVGLGDWV . 
QDHHPHI E I CLHKSVI *NGI VFAYNLRT 
S SLTLFYLRRGLF/ S APQAGVQRHKLS L 
LEPLPPSFRRSSNCSSLliSSWDYR 


1459 


15360 


A 


1467 ' 


349 


413 


RLGL* PRKDITD/HVS LRKDTG / WPGAV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalnnine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=JVletbionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AHACNHS TMAGHRRQ I PRSG V* GQPGQ 


1460 


15361 


A 


1468 


403 


2 


LPVEWKGGIFGSFHPPPLRLKKFFSPTP 
PKS * D/ YRGGPPPPGYFFFFFLKRGFS P 
FGRVIFKFPPPGVPPPPPPQNFGFKGRS 
/HPSRALFFFFKRVFF*IYPFFFFFEME 
PRSVPQAG VQWHDLGS LQALP PRFM 


1461 


15362 


A 


1469 


423 


3 


IEFSFFAPSKDKGAIIjGPSTLPLiPRFNN 

ffcppfpinkdirrgppprpnffffrkr 
ggfphwagmflsfnlqegpppsppq/sv 
gf*grn/hppraqffffkkfff*cypff 
fffemeprsvpqagvqwhdlgslqalpp 

RFM 


1462 


15363 


A 


1470 


13 


427 


RTRGLVFDKTEFKPPKIKKKKKA\GPFL 
RGNRS ILKKKATFPKNLFAPNPGPPKF I 
KKVLSDLQKNLNPPPRMGGNFTPSLLK* 
NKSSKQQITRDFRDIjTSPLDQGDL*KIY 
KTFYPKTTE YTFS SAPHGFY/S * FDH K I 


1463 


15364 


A 


1471 


378 


1 


FVRPPFFFFSSSRPFFKWGGPLPPAPQ 
FFFKTPRGNPLLRG*KPPPQTPVGGAKG 
SPPW/VPPGFPKKGGGVFFFQQGL*KIS 
PPGPAPP*LVWGGESPPFKKKKSRPGW 
AHACNLAXjWEAKAG 


1464 


15365 


A 


1472 


412 




SRLSFFLSSEPHGPPSPLGPFATNKVAF 
C*/PPPPF*PSPPPKFFFF*GPKSVIYF 
YQR*PPTSRIFPPKGGAGPP/PPFWGFV 
NPKKK\LKPFSKPSPPAAKFKNPEGTKL 
GFFKKKKKKL 


1465 


15366 


A 


1473 


116 


34 


D WNIKPVL I SNVMTGL.MSMI I TLLLQLF 
LL* LFFFK/ WELTFFGGKFHQRVDPS FQ 
GSLGRGRQFFLLTKTEYHI FFLI KVFNS 
KTEGAGLAT* IK • 


1466 


15367 


A 


1474 


42 


428 


EIIMESINRFDVITF*NFCL* CYLMSKI 
NRQASSW/ENV*NQYVGEKILIFLIYKE 
CIQINKK\KLRPKIDTQVKDKQISEEGM 
QMANQHMIQC\QPSLVI,NKMQ IEIAEGH 
HLPYQINKDSKMMVEERRQ 


1467 


15368 


A 


1475 


87 


433 


PQS S P FS DH CS YQRL FLHL VKVS TYRJL Q 
KIRKIHKSPGNNEYFGGFFLFFSFLFFF 
ETGFNFVPQAGVQGQDLG*LQPLPLGFK 
GFS/CPQPPRTFFFFFFLRVWGGFIQKK 
KFLFV 


1468 


15369 


A 


1476 


212 


430 


SLILMTSNGIHFIYFLFLLLLFFFF*DR 
VSACHPGWRAMS*SYFT*ALTSQVK\QS 
YLSLSSS *NYRHTAIMP 


1469 


15370 


A 


1477 


493 


2 


PGAAAHACNPSTLGSQGGWITRSKI PEH 
PG\TLWNPRSY*KTKMCGLEAP 


1470 


15371 


A 


1478 


454 


492 


HRVGEERF/ CLFETESHSLTQDRVQWHD 
LGSLQPPPPRFKQFSWYHI*PHAW 


1471 


15372 


A 


1479 


2 


213 


IDQERERLMEGDRERDTETDAEKDMGRE 
/ RNRYRERERLRG \RRERRKRRDRMT * M 
PRERERENLNSLYRETYRDFETEWVMDR 
ERQRLKRRLL*AVIVPSHSSLGSRRKTP 
FQKKR*REGERKSEFLIQRDI 


1472 


15373 


A 


1480 


77 


453 


SFGDSLTLSPRLAVQWVYLGSL*PPPPE 
LK\YSPTSASQVIHYLLFFFFFFGKKVS 
FCPQGGGEG PPFGLLE I FAPGLMPFFCL 
NPPKGWVWR/RPPTMPKLFFVFFIKRGF 
SPGEPRGVSFPEPGT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
bd 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


1 Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1473 


15374 


A 


1481 


398 


96 


KRPWCFFSPPPKKGFFPPLIFVGPRFF 
SPPPVFK/SPPPFLFFFPP*KKKLFSPP 
PRKI IFF*KPPPPFFFFFFFFFFFFFFF 
FFFFFFFFFFFFVFILLLS 


1474 


15375 


A 


1482 


484 


64 


QKRQMANKYMKICSPLFVIREMQTNTTI 
RYDDIPTRVAKNKTNK*KSNSTKYCQEC 
ESARP * I YCWWKYKMVQPLWK/ S / VWQY 
LLKLNI*LPYDPVIPLLSMYPSQIKTCI 
PTKTCAQMYLVTLFI ITKNQKLAHSARV 
G 


1475 


15376 


A 


1483 




400 


YNKKGQGVAPRYDVDTAANFPKELAKI I 
DEGGYTKQQS FN I DYTALYWKKTP CKS Y 
SWR* IH/SLASKDRLT\LLVGNVVGDFK 
LKPVLIHCSKNPRVFENYTKSTLPVLCK 
WNSKAWITAHLFTAWVAEYFEPT 


1476 


15377 


A 


1484 


459 


65 


GGPPPPHKTIDFFFFFFFRGESPPPPKK 
KKKKKKG*DNAEDEAQS*RPPPSI*EKI 
MHLICALTEEY* *LTAEIIASNIHISTG 
SAY 1 1 LTEMLELS KLST* WVPKLFYPNQ 
LRT/RAEL*MVILNK*DQQF 


1477 


15378 


A 


1485 


2 


518 


PPPPPQRFGLRGGAPPPPKKKG*DNAED 
EAQS * RPHPS I * EKIMHLI CALTEE Y* * 
LTAE I I ANNI H I STGSAYI I LTEMLELS 
KLS T * vm> KL F YPNQLRT / RAE LSMVI L 
NK*TSDP 


1478 


15379 


A 


1486 


122 


501 


PRDPPALALQNAGITGLKQSYFLSLLSS 
WDYKHKPPCAPGYLK\VFFLEN* YLALY 
VF**SYQGTSVLTQPPSLIPRHSSPGVS 
VQLFQKK/DLQHGCLLTPS ISGYSVTWD 
GVQWRDHGSL * P * P PRF 


1479 


15380 . 


A 


1487 


491 


425 


I P KND I * AEP * R I NKR / CAKRKMVRKG F 
EAESTICKALRYEEIVCSRNTEFVHGWL 
*ENRGQ IMKG FGWS VEKLEHN* KFYEG * 
LL YRFE * LAAG F I VFQQ * VS KE I VKAW P 
GT VAHACNPSTLGGRGGW I TRS GDRDHP 
G*HA . ... 


1480 


15381 


A 


I48S 


441 


17 


KKTNIYDQLI FNKGAKSTKLRKNSLFNK 
WCQDKIS/IIQKMKVDPYLISNI\NLKW 
LKDLNVTAKTI KLLKENTGAI LHDLGFC 
NAFLDGKPKAQTTKKKQVK*TSSKFKSR 
CQWLIPIISTLWETEAGGSPELRSSKPA 
WPT 


1481 


15382 


A 


1489 


413 


1 


LFWKEMVFLFWPGRVGNPGPK\GWLRLR 
PPPF*FSFQPFWGPPVGLLFPKNFYY*P 
PGGAHGPNPPPPGGKG*GPPRVGGGKFN 
* PKFGPCPPG WATKQKPVFQKKKKPNKT 
KNRTVKVPKVTKF I YYVPYLNGVSN 


1482 


15383 


A 


1490 


359 


407 


RHRM I TFFT * I VTQN FP YL * KKMDIQ I Q . 
ED*RTPN/RDQKKHTPRHI I IKLSNSQ\ 
NSKRILKTRKK*LWY*GTHIRLSTYFW 
PKS 


1483 


15384 


A 


1491 


397 


39 


RKFRGRS FAKTLFSGPGQG/ PKGNPGL I 
PEGPPPFFFWPAEDGGFFFPPGQLPPQG 
PIRGPGPFLGFPVPPPEGEGPPPTGP*K 
PGQKRPKKRGWSPQKKGPKKKTLFFFFF 
CQMESRSVT 


1484 


15385 


A 


1492 


378 


1 


FGVFWFLAPRRKGGFFPSHLIWVPPGFS 
PPPGV/SNEGPGIKFGGPIKKIFPCPPR _ 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nude- 
location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GGKFGPFKGAPPFFFFFFPQDGVCSVAQ 
AGVLQ*AEIVP\FPPT*ATEQDPVSKKK. 
PTRPPTRPPTRPPTRP 


1485 


15386 


A 


1493 


3 


428 


IKSP+PDRFIAEFYQTFKELVPILLKRF 
HKI /EEGTLPS * FYE /ACVTL I PKPG KD 
TTK 


1486 


15387 


A 


1494 




417 


ILNNARLKPFLLR*TTRQGILLLSLLFN 

SKTLFSQTT*S*GRNPFIKIPLPKMSW 
N/NF I KVTEYKVS IH 


1487 


15388 


A 


1495 


429 


4 


PFCLGSKRFPPFFNPPRGGKF*KKKKIF 
GPPRGGPPLFPPLWAPKGGGPPRAGGSG 
PPPPKGGNPLFINPKKN/PPPPGGAIFQ 
SRFLGGVNPKXFLFPGGKFLiTDPGFPPS 
LPPGG KKKKPRFQKKKKRKKRKVKEKKY 
FL 


I4S8 


15389 • 


A 


1496 


3 


433 


FS KEDTPM VNKHMKD / CCTS L VI REVQ I 
RTTTRYHLMPTRMAMCIYIFNYILFLK/ 
SKNNKC* *RYREIGTLIH/AQWKYKMLQ 
L.LW/KTIWQLLRMIiNTKLSKDQE\IPLL 
GI * KKKKKKKKKRGGRFKGINFTDPGVE 
RINFYNSAPK 


1489 


15390 


A 


1497 


3 


326 


WPACL * AVAAVALLVPEATRLTMGNNI/T 
VCTPHSIAELLSSKG/DLWLTDNR/LLK 
YQAIiLiLE\DLQLRTFTCLNPATF /VPEE 
TGE PEHDCEW VWQTG KRNNKDHCL YSL 


1490 


15391 


A 


1498 


345 


22 


KSKWLIDIiHGKCKTIiKR* YRKPTK* \LG 
HGNDFLDTTSNAWFI /RKIDKLDFIINK 
ffVCSGKVTVKRMKR * TTDRDKI S S KG I S 
DKRLLCKIYIKILTTQQ 














PVVCVCVCVYVCMCVFETQS /HVARAGM 
Q*HNHSSLQIi*TPGLKQVSCLNLSSSWD 
YRHTPN* FFFF/CNFYLERGGVS 


1492 


15393 


A 


1500 


2 


417 


RD*FMRFK\EKSYSCNIKIQREAASANV 
ETMASY*EDLiAKI INEGSYTK\SQIFNV 
DEVAL\KKMPTRTF I VRERKS I PGFKAS 
KDRLILFLG/ANAPGNIKWKPVLIYHST 
NPRAFKNDAKS ILLVL YKLNS KAWRIAH 
LF 


1493 


15394 


A 


1501 


414 


0 


SSSSSSSSSSSPKWTTGALQP/LLSRTP 
LKDSSEEESSQ*AEIi*AVHLWHFAWKE 
KWPDM *LYTDSLAVASGLAGWSGT \ WKK 
HDWKI CDK/ DWGRGMWMNLS 


1494 


15395 


A 


1502 


3 


125 


RLGLPKC*DYR/RAATTPGIiH*F*SWKE 
QRPQ* IiALGRKPVNRNKNTQKYI H I YTY 
TH/PIPTNVYICIYTHIHTPV*YTHIIY 
I IYLFKVLVFGFIiRRSLT/S VAQAGVPW 
RDHSSMQPRPPYKQLTCH* *LASQSARI 

TSK 


1495 


15396 


A 


1503 


406 


172 


DIILDR*RQKQRLRLRQKQRETETE/RR 
DRGRERQRYRQRQVQRQGRRQRQRLRHR 
Q* QRQIDRLRQRQ/RAERDRGRGRDRDR 
GRDRGRDRERQRLRQRQR*RERQRHRQR 
/RERDRQRQRQRDRGRRRDRETDRGLDR 
G * DRG I DRGRCSDRGGDRDRGLD IGSDR 
DR 


1496 


15397 


A 


1504 


3 


420 


ITGVSHRAWLPS*FI.KFFWVEMESHSV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=C)ycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methioninc, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknow», *=Stop codon, /=possib!e 
nucleotide deletion, \— possible 
nucleotide insertion 














AQ AGLK / P S GtiKRS S H LiDL PKCWDYRHE 
P \ PHLAFFQ FF 


1497 


15398 


A 


1505 


407 


1 


PISIYHSKDPGSLKSYTKSTLPVF*KCN 
NKTWLTAYLFTAWFPKFFKPTVETYC/S 
RKKSPFKMLLLIANAPSHPRALMEMHKE 
I P WFMSAVTS I LQPMDQG WLTFKS YY 
LRNTFCKAIAAIDNDSSGGSGQIQ 


1498 


15399 


A 


1506 


1 


408 


PPGFKPFCLSLjPSSWNYRHPPP/RPG*F 
F VFLGGTRFHHVGYTAS QHLT SRE THAY 
ALQ 


1499 


15400 




1507 


2 


416 


EPRSHHCTPAWQLSKTLSQKKKKKKRKR 
MVFTGRKKGLFFGN/LKNLGTLSREYPL 
GPGIKNRLAQKRKPLFYKKRF * n r NPGG 
GAHPGGPKSWERGGGRKI *TPVGKYASN 
PEYHICIPPGKRNQNPFLQKKKKRRAD 


1500 


15401 


A 


1508 


276 . 


14 


SPPPYFLLIR*GRKGRGQF* *LRLFFI I 
NLRQCL/DSVSQAGFQWHNHSSLQPRTP 
GPK * PSFLSJjPSSWDYSHAPQIiLAFYGA 
NCFN 


1501 


15402 


A 


1509 




391 


NIFKEIMSENFPSLMSENFPSLGKEIES 
Q I QEAQRT PNKMNP KRS T PKQVI I KF * V 
RE/MLKTAREK*IVICKGTTLRPAVDFT 
AETLQIRRE*DDTFKTLKGKKKKKKTLP 
AKLTFPSILVFKKTRGGSR 


1502 


15403 


A 


1510 


2 


419 


PRVRSRATNVISKYKQHKKKNTSK* IKD 
LNVKPEPIKLL>\EKTTGEKI>IjDIELGND 

flds/tpktrapkanltp*nyfklksfl 
taketfn\ktptkrganhisdrgliski 
ykeltiq*qknnlisk*akdlkrhfske 


1503 


15404 


A 


1511 


392 


122 


SVSLCVSLQSSGMFRC/LPTMPLRR*RQ 

SVSKK*INK*IK*/Q*LRQNVKIKPSPI 
FCEQGKRRK 


1504 


15405 


A 


1512 


2 


281 


GGCSELRSCHCTPAWTT\SETLSQKKKK 
RKKIC I ECNTLKSH/ 1 LS FVCGGKYHSP 
VS CTCL * WS LGRS LKRS LERHQ KNLNDP 
YPIFQNETNIH 


1505 


15406 


A 


1513 


242 


382 


QGPNLG*LHPPPGGLKGFSPIiTLLRSWK 
NRLPPQHPFYFCFFRKNKV 




15407 




1514 


1 


388 


RTRGTERDT/RFRERRRERDRGREGYRW 
RYRERERHRERES*V*RDRVRDGDRVID 
RERERQSERERLR/RERDWERDTYIVRE 
TETERETEGERDI ERERD I VRDRYM/RD 
RDRDILRE * ERDRETALDSEK 










4 


285 


TRXQICNGDKTALY*KEMPSRMFPARVE 
LMPGFKAS KDTLTLLLRPNAVGDVNLKP 
MMIYYSENPRALKNYDKTQLCLYSTNGN 
KAWMTDYRFT 










415 


2 


MGDRESLPPSRVFLFFLiAPCPKGILFHP 
V*FGSVRSFSIERCYR*GQRNEFWGPVI 
RVKASSSRAG*WFSSTAPPLFF/CFFE 
TESCSVAQAGGQ*CDFSSLRPLPPKFKG 
FJLCPSLPDAW 


1509 


15410 


A 


1517 




324 


PTRPEMGFRHAAPAGLELLSSSDLPTSA 
SQSGGI TGVSHRTWP I LASNNY/ SMDECL 
CAI/CERFILFFTiKNILbWILILFIFIY 
FCLIK*CITINFSQNSLLPHSIHLYI 



626 



JSDOCID: <WO 0164835A2_L> 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


JV1 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/5 J 5,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, . 
G=GIycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutnmine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1510 


15411 


A 


1518 


2 


163 


ACRY P WL.NFVLLVEMGL \ TMLASDLERS 
G* PQ / CDLPASASQS AG I TAE SHHGWP 


1511 


15412 


A 


1519 


425 


52 


VLRGAQFLFQEARLKWDFGPGGMAPPCN 
PPPLGV+KGPLPKG\GGPNPPGPPRGNP 
VFSKKPKITPPGGGAPEVPPPWGGGAEK 
FFYPGGPRVQGTE/RKTPPPPPGRQKGT 
PLPQKQKKKKPWSL 


1512 


15413 


A 


1520 


3 


404 


THASGKSNIRGLTLPVFKTYYKAAVIR/ 
TVWP / WLRANTNRQNRL /DG PEVDPYKC 
SQMIFDKGAKAIQ*RTDNNLLNKW*/ES 
*TSTCQKKKKKNQFR/DPAFLIYTKFNS 
NGRKNLRVIGKT*KVLKKNIKQNWGDLG 


1513 


15414 


A 


1521 


332 


39 


TSRETGNFQLDLTAFLNEVGVLFLWRKG 
KGKIDIGIGTKEMP*FPI*LLMFSYIVI 
ERERKPE/LSLSPSLECSGMILAHKKNS 
LPGSSDSHDPASRVAG 


1514 


15415 


A 


1522 


482 


0 


RARTSGVLLCSPGWS *TPDLR * SLCLS F 
PKRWDYRT*ATVPGLLYSLYSRFSP/DE 
LKGCEKSRTSPA 


1515 


15416 


A 


1523 


1 


417 


NKC* *RCEEKGMPVYYWYECK*GQPFWK 
TKERFFKKNL/NIEIjPYNTAIALLNMRP 
Q*I*SQ*RKVCSCMLCAT\PTPNK*IKT 
MW * VCLVE YS S PLKKN\LLFSTTWINLE 
DISSNK/ LRHRKS S I S RFHLLL 


1516 


15417 


A 


1524 


1 


397 


RDSTYQGHHTPPVQ*GLRYGVILFITSE 
VFFFAGFL/WSAFTRSSLAPTPQLGGHW 
PRTGITRLNRLEVPLLNTFVLLASGVSI 
T * AHHSL I ESNRNR 1 1 QALL I T I LLGL Y 
FTLLQASEYFESPCTISDGIY 


1517 


15418 


A 


1525 


2 


376 


LKAKTGQKLGLLHQTVSKFVNAKEKFWK 
ELL\KS ATPVD I * M I RKRTSL I TDMETV 
* W* KEDQTSHP I PLS *SLTQS KALNLF 
KAMKTDRGKGAVE*KSEANRGWFMQFKE 
RSC/RFCNIKVQG 


1518 


15419 


A 


1526 


3 


386 


ESMLKAKTGQKLGLLHQPVSKFVNAKEK 
FWKELL\KSATPVDI*MIRKRTSLITDM 
ETV* W* KEDQTSHHI PLS * SLTQSKAL 
NLFKAMKTDRGKGAVE*KSEANRGWFMQ 
FKERSC/RFCNIKVQG 


1519 


15420 


A 


1527 


127 


388 


krksqinnll.lqfkel.ek/qeetkpkas 
rrnkkkki rvdllki kkgkplegvkkkg 

GFF*RTNKRDKPLLKLPKKKGGGRIKTF 
HKTS 


1520 


15421 


A 


1528 


3 


402 


HENHMKI CS TS YVI RE LQ I KTTMK * Y YT 
PVRTTAIQNTDHTKCWQG*REQ/GSLIY 
CW*QCKMVQ/PLWKRVWQFPTKLKHSLN 
I * SAWLLG I YP \ KSGKFNVHTKTCT * M 
FLAK* PQCSSVDEERKKM/WYMYGFE 


1521 


15422 


A 


1529 


2 


365 


IEKLYRSGAKFFCRDRVSLCCPGWSRTI 
GLKQSSCFGFPKCWDYR/R*AALSGRLL 
LTISSH/REQIEQEFTHYPEKSAKLFMQ 
DGPPRRK/ HPPTRPQASNIQNKFS FFLR 
QSL/NSLAQAG 


1522 


15423 


A 


1530 


1 


418 


GTDTE * ACDKIQKPFPDKRLNKLG/ IRK 
KHLQLDKYL/WKNPIASIILKSRRLKSV 
RLRPGDGRQGCSFPAWLFNI ILEGLARA 
IR*EKQKQKQGQ\QIGKKEVNLSLFIED 
IMLYIENLKESTKKPIIIINEFSRKEDF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 

residue of 
peptide 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=LysiniTL^ 
N=Asparagine, P=Proline, 
Q=Clutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X = Unknown, * = Stop codon, / = possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RP 


1523 


15424 


A 


1531 


381 


3 


ILQGFVIRKKSKIFQKACHTSDLWTLTN 
LSCLHMKGLPPHRLYSSCTVTjLFLRKPV 
MRKTTLSCCFIjT*DFFIjPI>IjPPCFFR+F 
ILLIF*LYFCKDKVSLCCPSWSKLLP\Q 
QSS CLS LP KYQDYKA 


1524 


15425 


A 


1532 


38 


479 


DEACGPQDPYLTP YVKTQW I KD * TRNKG 
IQFLEENGKN\ FD IGFSSDLLDMTPKTR 
ATKVKL\NDIRLRNFCASKDTINLSLLC 
R\KR*PVEWEKISANHISDKGLISGIYR 
QPPLNSKTSHLI*K*ARDFNRHFSEEDI 
QSALYRWVL 














LIFCRVFEYLHSLiHLPQEICLSLALFSR 
FTFCVI ICEVDVWSVIFKVPFCSKRNKV 
AVHTMLYIQIFVSIjFI*PQNWKQPKCPA 
TVERINKMWYIHIV/EYYSANKR 














VWVCLLS LEGS QS KFGNS I E FG VTjLS SG 
GFSAWRLFFFVYFLRQSL/NSVAQAGVQ 
QWRDLSSLQPLPPGFK*VLKQRGVCLFV 
CFETESHSIAQAGTQWCDLGSLQPLSPE 
FKRFSCLi\SLSPPWPSG 


1527 


15428 


A 


1535 


45 


338 


SNNEPFIiDWIVM*RKVDFIRQPAMTSSV 
VGPRRRSKALPKAKLAPKKVMVTIWWSS 
ARLIHCSFLNASETIASEECTQQIDDMH 
* KLQRLQAAL INRK 


1528 


15429 


A 


1536 


425 


1 


FFNITFHSVSFSPLSQKCILTL+ILLKS 
rVKNNMRNFQRSLVRK*AKDMNRHFTDD 
HVQMAS / KHMKRCS LVIGEMQ I KN\TVS 
YHY P P I RM I KVRNS S NTKCW * GCGQTG S 
LMRCWW/NK*NQLLWKTGI PPHGLVSTR 
SRVF 


1529 


15430 


A 


1537 


4 


443 


ETFVDHYQCGG I RP FS DLQLHAGRTT Ai, 
FKAVRQGHLSLQRL,L,LSF\VCLCPAPRG 
GAYRGRQVS LS CGGLHPVRAS WLLCLPK 
*AWTMEGTSTPASLiPPCSLISDCCASNQ 
RDSVGIGPSEPGAEYNLLVPRFLSPSEK 


1530 


15431 


A 


1538 


487 


3 


TQNGG VLL S APRS VFS PTTLR / CTLQAQ 
C*AFWWGGTQQAASSTAAMAAMKPLGIW 
AGGAAG I L PKLG FQD / LPLS AEADPAGK 

GYPPSPQPLSMRRGPGANPRLARPLRGP 
VRVRIiRRASSERQKRSRGGSGPLG 


1531 


15432 


A 


1539 


394 


489 


IYLFIFETESRSVAHAGMQWRDLDSLQP 
SPTG 


1532 


15433 


A 


1540 


475 


202 


PGGGWFS PPDNFSLKENSQGGP PVSHP P 
P PPGNGEGGKT PGAGHSGI * NP P P PLK I 
N/LEKGGRGGPPGPLLNPPFPKEKPRGG 
KKKKKKKR 














LKEIKSVIPMNTLMVRERNSLIVAVEKV 
LVA* VKEQTSHNI PLCQSLVQS KALTLL 
HSVKAKRG*EAAEEKLEAGKHWFLRFKG 
KSRLHNIKVQGEATSTDGELQQVTYPDL 
VKIVGEGDYPKQ 


1534 


15435 


A 


1542 


479 


141 


RKTDSWDLIK\SFSTAK*AINEADGQTT 
EWEKTFANYASD KGL I S RKELKQ INKKK 
ANSPIKK*AKDTKKQPTNMKKCRTSLII 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

Jocation 
correspon 
ding to 
Inst amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, (^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














REMQ I KTTMRYHLTTESE * LLI KSQGRA 
Zi 


1535 


15436 


A 


1543 


17 


474 


NPKKKKKKKIDLLDTYYTLYPK/AEYTF 
SSEIHKNWLYIN*LRANLNKFIN*KIKT 
I LSDHNRRQPE INKCNKTKR / RVT/TQK 
FKNL I LSN YWVKGE LQTQ I AE FLKRNVN 
KNT I YLNT*NTI KVLNSGGST/ALNTYI 
DKNFKKRVPS 














ICLKVISLRQENATSVACIVSCLSEGSR 
ASENLVLHNASPNSCGLGIAFTCLCNW 
SSGNKITDCYD*LSSS/WLGFFFFFFLK 
KSFLFVAQAGGQGGNLG*PKPLPPGLKQ 
FL.PPPRANFCFLEKTGFFLF 


1537 . 


15438 


A 


1545 


1 


300 


PPPPAPXXCRPY*XPWXXPVYHSSWRHG 
SGAAQGAVXiAGFGGVGRQGPGAASVS I P 
LCPENQGCREPGPSHAVPAPSAliPSLRS 
LTGGQ I GGTRAAQAVG 


1538 


15439 


A 


1546 


2 


436 


GAPE IKS I IKGYND/RLCTTKFYNLDEM 
DKFLVRHKLPKLI * E * IDNLNR WITSQE 
TDW* I*QQSSSSSSSSSS/PSSSSSRPN 
GFTTESYQSFEDKL.IPIICKLLKKIDKE 
\GHFPLQL * G I TQ I P KPD IYH/I ENYRP 
ISLM 


• 


15440 


A 


1547 


54 


419 


PATWEDHLSQGGRDCSEPRLHHCTPAWV 
RE * DPTS KKKKE K/NDQ * LL I P * S * RRL. 
* RS SHQTSHLMDWNS EAGR * EATFLiRSY 
SKL\KEELELLLI1.DNNDDHNNSD*YLL 
SIYSIPGTVLS 










37 


339 


KRWKCLRA* LLMRPRHAD* LNPDGVGYS 
EL*SRHCTPAWVTEQDLVSISTNRKNER 
HTLE YSHQHYSG * P * TG/ EESEYPSAL* 
QRE I IDYS FIQGMTDQL 


1541 


15442 


A 


1549 


477 


1 


PGSHDLGSYT*PQ/WSSPEVTSRDAPS 
HPS APKFCSNPCRGRWLTSS KQ P KLRLT 
SAVPGAPGARGTFHALGAGAAEAGGHSA 
SRPEAALCRPLPPLPMTLTSHPLLSGPG 
RLAWGCNGRR * IKGGG 


1542 


15443 


A 


1550 


430 


8 


CWPGSSGTPDLK* STRPGLPTCWDYRHE 
PLCPASKTFLSPQIETPYPLNNNAHSPR 
RPALVNYSLLSVSMDLPIL\AFHTNGIT 
GYVASPAPPVRWGSHSVAQAGVPWPNLG 
SL*PPPPRLK*SSRLSLLSSWDYSHMTP 
L.N 


1543 


15444 


A 


1551 


2 


419 


ETSPSLQGWLGVLFPKRGAKTSRFLI IR 
PQGGS FKDGDF FNPPGEI KTPPAKKKKK 
KDSARSPPARLQA+GAGLWDARASFRRP 
FQAPVPLSAQPRAQPIiAVGTSRDGGSSG 
PTPGQESAV\PWREKHPQQPPPQPG 


1544 


15445 


A 


1552 


2 


387 


FRHVAQAGLELLG \ S SDPS VS AS QS TG I 
RGMTYRAQ PGL * LFNS KNS ILSGPKV/L 
QDYMW 


1545 


15446 


A 


1553 


393 


2 


KNPIF FFLKQG FPFFP* LEGRGKFLGPC 
HLCFPGSKKG\LPHPPQLIGAPGPPPSP 
GFFFLKRGFS/LFCPGGSFSLRRKGPPP 
PALPKF\GFLRVTPLAGPGNPSFFFFFF 
RDGVSLCHPGWSAVAASRLTA 


1546 


15447 


A 


1554 


1 


427 


LLLLILYAIVEAVT* TEGAGYPPLPGSY 
SHRGASGHRAIFALHLTGGCCI -GAMHC 



629 



MSDOC1D: <WO 0164835A2 I > 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ingto first 
amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

acid ' 
residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glntamic Acid, F=Phenylalanine, 
G=GJycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T = Threonine V=Valine 
W=Tryptophnn, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ITTMIDIEPPAITQYHTPLLA*SGLIT* 
GL \LVLS LPVLDAG I T I LLTDRNLITTL 
FNPDGGGDP I LYQHLL * VFGHPEVY I RI 
LPR 


1547 


15448 


A 


1555 


3 


382 


TL YLLF AA* AGGL / GSALS LL I QAELGQ 
PGNLLGNDHI YNVIGTAHALVI ILFI VT 
P 1 1 IGGFGN*liDTL I FAAPDMALSRLNN 
ISL*LIiHTSVL,I J LLASAIVEAVA*TG*T 
G YLCLAGNY S H P * AS V 


1548 


15449 


A 


1556 


380 


3 


EVSPCCPGWS*TPGFKQFACLGLPRCWS 
YSREPAPPAKACFLDI*VK*WFCK*GSP 
EVNSRPGLYLFIYLIKQKKGVK+*PARY 
LVLFFCFFVFVFVFVFF*HRIS\SVAQG 
GVQWCDLGSLQPPP 


1549 


15450 


A 


1557 


100 


254 


IPTVPTY*TPIKSFHRARIESSGPGYSW 
PVDSAK*VPIjAWSLiDSR\RDSGNLVHP 
LiMRVTN* MKRHL VTLTQS \C YS 


1550 


15451 


A 


1558 


2 


289 


APGVSSTMEDEMGGGLEPQRRGCSKQRS 
HHC I PAWATE +DCLKFNNNKKN YLL.L I S 
IjRTQSYIiYF/ C*VNSY*LKLSIKIjAGGT 
gekeh * sqkrks k 


1551 


15452 


A 


1559 


24 


354 


PLiPSAS PGPEGATPVPTS/ACPNK.IKLY 
HLKKKKKKKKKKKKKKKKKKKKKKPOCKK 
KTKKKNKKRKQQLNE* ETG 


1552 


15453 


A 


1560 


376 


2 


AARGSGVRDPLEEAVCLFSDLQLRAGRT 
TALLiKAIjFK\RQGHLfSIjQRIiLIjSF\ VCZj 
CPAPRGGAYRRRQASLSCGGLHPVRASR 
LLCLP KQAWAMAGAPP PASLP PCSW I SD 
CCASNQ * DS VGVG 


1553 


15454 


A 


1561 


3 


408 


AASTVFLPFLERKGIDFGFLFFFFGEKK 
FP FliAPGGAPGGL FS FPEAS S PGLNPFF 
WPNPPEK*KKGGPPPPPGFFFFF*KKRG 
FPGG\PGGAPFPDPKIGPPGPPKGGEFR 
GGPPPPGPNFFFFFXjKGKGGGGPP 


1554 


15455 


A 


1562 


355 


161 


FKPGDGGCSEPRWCHCTPIWVIMRDSVS 
KPKQSKTKKRNVFT/C*D*VF*NNNINY 
INCFS * RALYIL YEKQFM 


1555 


15456 


A 


1563 


410 


1 


TPPPPPKNFLGTPLFPKNKAGKGLFPPP 
LGFPKGGQWPIPPQRFPPLFSPKKRAD* 
KPKP PPALKIRGPNPG VFPKGFCGFS FF 
PPPPFPRGGGLIFFLPP/TK*SGG*KKP 
KKKKKEKCPKKQRGG I DQLTSNLGV 












374 


ADRNLNTTFFDPAG/GVGDPILYQHLF* 
FFGHPEGYILILPGFGIISHIVTYYSGK 
KEPFGYIGMV*AMISrGFLGFIV*AHHI 
FT VG IDVDTRAYFTSATI I I AI PTGVKV 
FS * LtATliHGSNMK 


1557 


15458 


A 


1565 


396 


0 


IIFLI FLRHGFAWAQAGMQWCGLGSLQ 
PLFPGFRLFS /CLLSS WDYR* RQGFTML 
ARLVLNS * PQ AI \S AS PSQS PG I TDVSH 
CA 


1558 


15459 


A 


1566 


186 


452 


KQKCNFKTLNTKYQPL.PFFFLERNFCFC 
PPGGGEGADFTFLEPIiPSGAKGFF\ CLT 
LQRM/ WE * RVSPPTPLNFGFLVKKGFSL 
CGSTGF 


1559 


15460 


A 


1567 


453 


37 


KTALYWKKTPSSTS I AREEKS VPG FKGQ 
AHFLFRGKHKFKLVS KLI YHFENP /R/A 
LKNYAKS IL/ PYKWNKAWMTAHLFS P \ W 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
corrcspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PIKYFRPTVEM* CSYNKILVL I DTGLDH 
PRPLIETYK\FINLHVCVLFNTTPIVQP 
LPR 


1560 


15461 


A 


1568 


453 


39 


TALYWKKTPSRTS I AREEKS VPGFKGQA 
HFLFRGKHKFKLVS KLIDHFENP / R/AL 
KNYAKS / TLAYKWNKAWMTAHLFS P \ WF 
IKYFRPTVEM*CSYNKILVLIDTGLDHP 
RTLiETYK\FINLHVGVLFNTTPILQPM 
PRAQ 


1561 


15462 


A 


1569 


1 


352 


FGTRRERERERERERERERERERGRGGA 
GHQIPSIEGQHKIVGAIIY\CREC\GEK 
ISGSTSYIKVWDIRDSAKCIRTL.TSSGQ 
G I SGDACAATFTRAI TSAQGEH* INQIA 
LRPSGT 


1562 


15463 


A 


1570 


394 


3 


TliDWPQSRGGSTGKPVYPSVCCCCCCCC 
CFNFCQEAEKNNAEGLLiHNQAGRTKDGS 
FAPSHDH*A/PRGTEV/DLLESTLQTSI 
KQVES KPR \EQARTGAGGQ KEKATQNPE 
KSVLTSMYTKSQGSEGRLPGNR 


1563 


15464 


A 


1571 


399 


2 


KHQLPVFWQYNKKAWTTRTLFLDWLHCC 
FVFEVRKYLASKGLPFKWLILDNAPGH 
PPRTP*VQY\KGIEVIYLPPNTMSLI*F 
LDQGVI RTYR/SHYTQ YSMQRTI SAMQE 
NSNKENI I KVWKDSTTDDAI VA 


1564 


15465 


A 


1572 


37 


400 


RGTITGEAASADQEIADKLSDAINKIME 
EKG\Y*LPEQVFNLDESTI«FWG\KKKPQ 
RTLLSKEKKRAPGFKTGKDRLTLLFC/A 
KTAVKLI IRTALTYKAENPQALKENVKHQ 
LiPVFCLTTRGL 


1565 


15466 


A 


1573 


84 


485 


AGHKDS PRPHQTQE PSWLHLWDPAPGLQ 
VELPAS P \GRALALLSPWWDGTGRPGA 
GGGTRRGGSGP TG AHGAGGRLMHGGLQ V 
PS PAPREGS *G PARYQAQRM WARTAGG P 
S TPS AGASR VPS PH CPGP 


1566 


15467 


A 


1574 


3 


463 


TPAQGLRDPSNMRKHAYCGCCV/ CITLC 
VGAQNKNRAVCGLYSTCPRLCVYEHEHI 
CVNE*VCEHVCERESVRVCESTH/LPLC 
A*TCGPIFGCMSEKHVFSYTPCVHRVCV 
CVCVHLGCCVC/VCVCVCVCVCVCVCV 


1567 


15468 


A 


1575 


1 


383 


FbSFGFAPQAGGQGHNHG*GPP*P/PKA 
KGIFPPHPPEKREQRVHATPPGKFIjDFF 
FFWKKGGLNLGPKKNLGPGGKKNLLVSP 
PKGGGKKKETPGPGGVFFWGGIFFFFPP 
PCSPG*PL.SLLKKPKGG 


1568 


15469 


A 


1576 


35 


469 


RI PRCHQPVGPLGCREGAKPQGPGPDAA 
DSHPPASPRAH*/P*SLHRDPIPGFRGP 
CRRNAGAG PRAHTAG I CAPQ PNS * SRHH 
PWAAAPAGDDSPTTSLGSHVPVGRKSSD 
GCRQRALCTPGS PAPS EAEVGGS PELRS 
LRPAWA 


1569 


15470 


A 


1577 


473 


62 


SL.EINP Y I SGQLI FNQHAKTNH* SMRKE 
\ S FFNKWCLDN* I S TGRRMKLDPYLKPH 
I KLNS K/ LKDLN I RDTTM INLHDLGFGN 
GF*TMTTK*AIKEKIN/WDFIKI*NFCA 
SNDIIKKVKRPNVTAYTCNPSTLEGPR 


1570 


15471 


A 


1578 


59 


426 


LERRSFGAPLSPFFAPQFEMKKGFGSP* 
KFFFSPKAUSTFGGGWPFFPPPKKRFFS 
KNPQEGFI PPPLKKKKKTSQPP/ YKFGP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 

od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutnmic Acid, F=Phenylalanine, 
G = Glycine, H = Histidine, I— Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 

nucleotide insertion 














PKESFKRAPPFFFFKGGPFFPLVSNQWT 
GKPRLPPPPPKETQNPSSFQIGGQKKGK 
GGPQSFSFPIKKKKKKGGHSRSRTSPRV 


1571 


15472 


A 


1579 


2 


419 


KANKTKNAYFEGINKMDKPQVILKKKY/ 
REKT*IPSIGEEKGIMSP\NSEDTKRII 
KECFKQTYAHRFYSLKE/MDILIL.ESHKIJ 
PKLTQ/ EETDSLNS PV 


1572 


15473 


A 


1580 


129 


403 


YMFFIPINCQDHPK*KKKKKKKKKKRGG 
/RPFKKTLRGPKLNRAEKNKFFF*KGSI 

PSREKKKF j 


1573 


15474 


A 


1581 


317 


76 


PRFFFFFPPPKKGFFSPPFFFFSPRFFP 
PPPFLKPPPRFFFFGPFKKNFF\PPPRP 
LIFFFF*APPPFFFFFFFFFFFFV 














PREMKTYLPTKKLGYEI SHYH * WWGCKM 
VQPVWKTVWQFLKGLNIKLP*DSAVPI.V 
GM * PREMKTYLPTKKLGYE /MFTLSKLI 
IANK*KQPKCPRMNKW 


1575 


15476 


A 


1583 


1 


415 


PTRPITSSICLRQSYLKALIAYSSISHI 
ALGV\TAILNPTP*SFTGAGILIIA\HG 
LTCSLLFCLANSNYERTRRRI I ILSQGH 
QTLLPLIAF* *LIaARLANIALPPTINIiL 


1576 


15477 


A 


1584 


216 


406 


LNLLPLVLGGSSCLPPPCGGN*KPPPPP 
G\LFLEKKGFSPCGPAGF*PPALRGPPP 
PPLPRGLI 














KKFGYPFYWIG* KILK* FPG* K* SLPHR 
KSPFFFFF*GRVLLCPPGWRGTTKGHGS 
QVT\LPAAAMTFQV\K*SSPLRLPSR*G 

NYWAIAIKPSGPPKVTjG 


1578 


15479 


A 


1586 


126 


413 


NPTLKK* KMKENRMKKNEQSLRDL*DTI 
KLTNR C I LG I P KEE R KKG AEG I FGE IM V 
GNSSNLIR\ENINLNIEVAQ*TLSRIN* 
KRATLRHVI M KM 


1579 


15480 


A 


1587 


242 


409 


GWMI FRFNFFFFLRGSFTLVTQAGG*GG 
DFG/SLRPPPPGliKRFSCLTLPRSWDYR 
H 


1580 


15481 


A 


1588 


2 


338 


EIEKKGKGKKRRG*RSNKKKRGGRLEGQ 
KY*SPPCPRJjSFFL>VDEKGPEVLQDSLG 
GWWKTP / SGCPEMTDSSQPY/ YRAFYVL 
KN/QRVGFSVDVGE IEKDQDVEKNQDPS 
CPRL 


1581 


15482 


A 


1589 


360 


0 


NNFLEQLKFPPKKKI TTDSYKAPRPFFF 
FPPSQKGVFPPTLFFGFPPGFPPPPFLN 
PPPGFF/CFWAPLKKFFFPYPGG*TWVS 
LKGPP/Ii/RFFFFFF*DGVSLCHPGWSS 
SAQS 


1582 


15483 




1390 


75 


412 


VEGQHCNFCAAQETINRVKRQHTELLET 
FANWS PDKGL I PRTYKELKHLNRKKHS Y 
* KWADDLDRHFSKETYTPPKTYSCTQYC 
* P/S ITI EN* KLKTSNIKTRLQGGHKRL 
ER 


1583 


15484 


A 


1591 


309 


1 


FSTQGAHMQVCYMDTFHDAEI WTS I EPV 
TQIVNMLPNR*FFNPCPHFFPPPFCSP/ 
LVS FLLLLRDRVS LCCPGQSRS VG FK* S 
TCLGLPKRWDYKCEPPHVA 


1584 


15485 


A 


1592 


2 


415 


LSISLSIFSFLP\FSF1jLPSLSLSI.FLS " 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, ]=Jso1eucine, 
KHLysine, I/=Leucine, M=Mefhionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFLFLSFFPPSFFFFFHRASLCHLGWSA 
MA*SWLTAVSTSQ\VKQSSHL 


1585 


15486 


A 


1593 


3 


396 


HSAFLFFL*DRVSLCHLSWKAVAQSQLT 
ATSTLL./VLKQSSHLNLP 


1586 


15487 


A 


1594 


300 


62 


KMTGVXjKSSCGKSPKQVGWLCVCVCVF/ 
S FKRQGLAS / VAQTGMQWGDHS SLOPPA 
PGLR*SSCISLISSWDYRRVPRRAPG 


1587 


15488 


A 


1595 


1 


412 


FDRPAADQKAASALKASGVQAQMAKGTY 
HDWSLQDWKVLWMTH*VS\QQEQDPTNL 
YISNIiPLiCMDELQIjENMLKPFGQVISTR 
ILRDYSGTFRGDGFARMESTDKCDAVIE 
HYNVLLIMTPPGVSAPTEPLLCKFAE 


1588 


15489 


A 


1596 


428 


2 


QSESREPENFLLPTSTSSLLiLVHCLSRT 
CWNSTIjSCVPHFFHLTFNSHIjLIjTSQP 
RRVHFC*LSLSSIISWKLLNTLPGVPIR 
\ASEIFGLRTIRNFPFLSHSVLFPFSLS 
ESHSVTQQGVQWHELGSLQPLPHGCQWL. 
SCL 


1589 


15490 


A 


1597 


2 


442 


QG S L * PH P PMLK * S LTS AF \ N YRHVS PH 
LAN YFLFFF* KQGLAVLLGLALN* AQ/C 
HLPALASQSAGITGVSHGAWLFFFFLNQ 
CYLI * FLILIF\ERRNSPVAHVLVNGGD 
LGLLKPPPPGLMGFSCLNLLRSGKYKPP 
ATSPGYIFC 


1590 


15491 


A 


1598 


419 


121 


NLGYPRVSPPPPFIjNPPPEFYEGPPKKK 
FY/PPPPPAQKIDPP*TPPPFFFFGTDG 
S HYVAQAG LELLAS SDLP AS ALQSTG I T 
KHEPPHLiAETI FLVFL 


1591 


15492 


A 


1599 


45 


397 


DRVSLCYPGWSAW \ * S*LTAASNSW \ V 
K*SACFSLLSRGDYMCMTLHIANIKKNF 
L.GR VRGSHL* S YHLKLCLKK * RTKEGFL 
S FFWKGKRHFLGQKNILNPRLKVFFLPG 
PPKGLGY 


1592 


15493 


A 


1600 


3 


397 


SRRPGRFSLMLTLS WHS * VCRAALAARE 
EQWSGCFKSHF* LEVNFWVSG IDREV/ C 
GY/LKTVLGEDLNDYVSTQI *D* L.MKPR 
CPE KQD E S LLKE FGGGA * RLNWHR PGA 
VAHACNPNALGAKGRR IPRSG 


1593 


15494 


A 


1601 


244 


2 


KTKTS FHSL, IDS CG YLLS YSN FQ I * KKM 
I FK/ YLHL.RAVPRHVIVRFTNLERQEKV 
LRAAREKG*VWHKGKPIRLTADLSA 


1594 


15495 


A 


1602 


435 


2 


PQEAIYTPTQHPTYGAICRIARIHGSRD 
Q/SVEMKHHPQLTITPSGP/LGKSLLPI 
P VALCFADLEVFI PKGGMLSPGVTI ISL 
NWKLRLPWVILQ*RITPLARVIPPDYQR 
EIRLLLNNERKKSYVWNTRDPLGHLWL 
PCPMVIK 


1595 


15496 


A 


1603 


288 


8 


EFPLIREMQIKPTMRYRLTPV*MD\R*W 
*GCGKRGTLVHCWW*CKLVQSVCERSSK 
NITIELPCDPAVLILGMHTKERKLRPCG 
WTRGVGPGYC 


1596 


15497 


A 


1604 


411 


236 


LIPSEHFFFSG*F*AFDRSSLAPTPQLR 
RHRPPTGLLPLSLLQGP\LLTPSVLLS * 
GGRFKGPNFTPPGGQGKIFFMGPPKSNP 
GPGV 


1597 


15498 


A 


1605 


2 


437 


KCLJR I S P CAGPRRP WC P SFE PRVCVWPL 
GVPQVG/PEGP*GEG\GFEGGDVRQLWQ 
GKKKKKKNTLGKRGPWFPKEKRGFSL.TP 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

last amino 

residue of 
peptide 


Amino acid sequence (A— Alanine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G = Giycine, H—Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro!ine, 

T=Threonine,V=Vi,]7ne, 

X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WFGPQGLWEPCLNPGGPLFLIGGFQKGP 
GKDDFPL 


1598 


15499 


A 


1606 


31 


467 


EFPGRRFRGGGCSELRLRVCTPAWTTEG 
DSISNKI *LVNEN/HTVNSWKCSDS 


1599 


15500 


A 


1607 


418 


32 


VGQVGLELLTSSDPPASAYPK*WDYKR\ 
DPRAQPNPNILKAQSNRSPPRQR*PASS 
FSNQNSCMHRCISLP*VIjFLIFFKNRP* 
NNS FGYNS SGWWSGLNI LNS C * FWFLFG 
NSDTE IR I CRS FHRGREV 


1600 


15501 


A 


1608 


379 


3 


FPILSPSYNPSSTELP***SLQSPLFEP 
YD F FS VNM FS FL FFNLKSHT VAQ AG VQ W 
RNHGSLQPRSFQLK\HPPASASAGTTGM 
CHHAW/LIYIjCVYLFLEMRSHCVSQD* V 
QWHNHSSLQPETPTK 


1601 


15502 


A 


1609 


456 


99 


PFPTPFPPPHSNVIGLGLGCGL/EPTMTP 
SGLGLKG S PS * S P P AHRLSGAQLCPVLR 
APVLGPSQMPPGRKKKPTARGSPWRKGV 
FVMSGQSDPLGPSSCQELGPRQSTQGTF 
PGA 


1602 


15503 


A 


1610 


1 


420 


FRFSDGAAGQKCSS PPRPGRGRAEVLLT 
S QTGRQGRGAPHI S DNGQPGRDAP\ PS * 
M*WRPGRGAP\PS*VGWRPGGDAPHF\Q 
TGQPGRGAPHI PDDGRPGRDAPHFPDGV 
GAGQRLQTRHFGRPTAAAWKVKWTSLR 


1603 


15504 


A 


1611 


426 


3 


KNPFLLEAKVSFNPKWPPALPPGEQRDS 
VSQKKKKNLLIHKKAHSKH*FFICRE\C 
ESAL£iLHQNIHAGGKSYVCNK*GRGFRN 
KSHFTYQRTHSGKKAFL*KECG*DFL*K 
AILTAYQKTHSGKKSFVCKECR*DFTQK 
TK 














HHA*LIFKIF/ CVKTRVSLIjCPGWS*TP 
ELK*SSLLGCPKCWDYRREPPIRPTT 














ANKFKNLNEIKFPEAHNLPKFTQEGGLN 
NPVSLY*KN*TYSLiRIiFP\KKKKSGPDD 
FTGEFN*TSKEEVPSLHKLLQKI*ERNT 
LPNLFHKARVTQVSKSEM 


1606 


15507 


A 


1614 


338 


88 


PNPPPPSKGK\GFPPPTPGRKKKTRPPP 
GPPPPNPPKGGGTKGEPPSPEGGVFFFF 


1607 


15508 


A 


1615 


2 


162 


KHGGTCLYSLLLGRLRHESCLNLGGGGC 
RE/PE/SHCI PAWATE * DS VS EKKKYL 


1608 


15509 


A 


1616 


3 


399 


PEVREYLTS RGLP F KVLL I L DNAHGHS E 
PQRFNTEGINVFYIjLPNPRSIjIQTLDQG 
VTRTFKSHYTWYSTERIANAMEENPDRT 
S*KSRIMTPL/IDAIVMTEKAMEAIMPK 
TIISCWRKLCPDWHDFTRFT 


1609 


15510 


A 


1617 . 


390 


1 


KRNCFGPFLTPPPGQRLGGFKFLK\HFF 
Y YRGERGGFFFS PFNKG FPPP PPFWGFF 
LGALKF*RGVPPSKPPPPGPGKKFKL»KP 
FPPRKSYGGCFKCFLGGPFPFEGPPPQK 
KKKKNFTAARDLE PNAW 


1610 


15511 


A 


1618 


468 


0 


MKXi\miWL.LLLiVALL,*GKKHLGDRLEKK 
SFEKAPCPGCSHLTLKVEFSSTVAEYEY 
I VAFNG YFT AKARNS F I S RAL KS S EVHN 
WRIIPRSNPSSDYPT*WP/VALKKKKKK 
AGV/LPLENYS I I TRV 


1611 


15512 


A 


1619 


421 


2 


SSRLSLPKCWDYRREPPRPAQPR I LKKI 
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SEQ ID 
NO: of 
nucleotide 
sequence 
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C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
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N=Asparagine, P=Pro!ine, 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possiblc 
nucleotide insertion 














S**EPFFLILKCI*LRTQGLGRGHKGKL 
NSTYVLHKFMD* *LGQLMNGLNMLDESS 
FCNVLKHTKQ\WRDYYFVCLFVCFVRQS 
L/NPVSQAGVQRRDHGSLRAPPPGFAPF 
S 


1612 


15513 


A 


1620 


406 


2 


GTKKAS SNGFING PGAAKKKS QTFAFLA 
KPGAFPSILLDEKNRPQIKPPACFP/SI 
RESQTSYPVFFFWSPSCGFFFFWGKPEP 
LLLKEAGKVPLFLFLFGFIj*DGVSIjCQP 
GWNAVLRSQLTAESNPHASAHAS 


1613 


15514 


A 


1621 


411 


0 


PPPPKKKPSGPPPP/PSSSSSPSPPRKF 
LGGPRVFFPPPFFKPPPPKNFWGPPKKK 
KFPPPPGGKKFFF*RAPPP 


1614 


15515 


A 


1622 


2 


403 


TARCGLNFPCSSSLPFIAS *VAGTTGTH 
HHAQL I LiL>I FCGDELSLM\ CPGWS / PNS 
SHLGLPKCWDYR 


1615 


15516 


A 


1623 


298 


411 


LIINVCWPGLVAHACNPSTLGG*GGRI/ 
TMRSGVRDQP 


1616 


15517 


A 


1624 


263 


2 


DSVSKKKKNFKE*LIPVLLKLFQNIEEE 
GILPNSLYKAGV\LIPKPDKDT*RKEIY 
RPISLINIDAKIVSKILANKIQ*FIKKI 
TDAW 


1617 


15518 


A 


1625 


3 


281 


PFSCLSLPSCWDYRRPPPRPANFF/VYF 
YKKNTRTQCFTVKHGFTVLTRLVLIS * P 
CDPPSLASQS AG I TGVSHRTQPHTVFFL 
NNPALPKLQT 


1618 


15519 


A 


1626 


300 


20 


NPGPRGFPPPGPPKRLDFRGGAPRPGF* 
YFIiKNFLGFFWFQKNFLVFFLGNKTPFP 
PQFFFFF/RDRMSliCHPGWSEVAQAWLK 
AALTSQTPAI 


1619 


15520 


A 


1627 


394 


40 


PQPAAASLiFS PGLFFFFPPVFSPPPFFK 
TPPR I FFFWP P * KNFFF\ P PPAFFFFFF 
LGAPPFFFFFFFFFFFFFF*DRVSLCRP 
G WS AVAQ S * LT AALNS QT 


1620 


15521 


A 


1628 


386 


3 


IFPTRCTHLHLGTNKLPTCSFFDQAKKN 
PFCSXHHSRGVGLRARLFCERLTIEGAG 
TPACPAP * FPGEPTRP/PGR VRWLTPAI 
PALWEAETGS * YVARTGLELLVS SNP PL 
SASQSARI TGVSHRTWP 


1621 


15522 


A 


1629 


401 


93 


ARGVLPLNPPPWGGRGGGSP*GKNSKPP 
RPRGENPPPFLKPQKLPPPGGGPPPPLF 
LGG*SKKTPPPPKGGAPINQKPPPGLPP 
PGKKGAPFPKKKKKKDKNIRTKKKARRS 
GSLLQSQHPGRPR/RGGPPLTKNPPLAS 
PPQEKRGPPFQKKKKKKTKI 


1622 


15523 


A 


1630 


417 


47 


PPPGTI S S PNP /QKNTjKKGPGPGGNPRN 
P P PLGGQRG/RGLWAKKSR PPG p prgnp 

plfkkkkinggggpppwppprgaragk 

SLYPGGG PPQ * PQMGPPPP PPGAXKG FF 
PECKKKKKPKKPRKT 


1623 


15524 


A 


1631 


416 


54 


E YWCKGR * I DQWNRM * FKMDQHLHSQFI 
FN* AT KAI QWGKE SLFNRLCLKWWLT I R 
DK\IYLDACIjTTY\*KINSSWHSGAFL* 
SQILERLRQEASLSPGI*VQPRQHSKTP 
SLKKFFFKFFS 


1624 


15525 


A 


1632 


2 


373 


LVFLDNMIiKIiLRHSALASACFPEDLAKI 
MDEGGYTKGQ I FKVHGTAFCWKKM PSRT 
FWREQSVPGFK/ATD*IiLLGANAAGN\ 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, . . 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
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KLKPMP I YHS ENPRVFKNYVKP I L PALY 
TKNS KA/QMAAPL F 


1625 


15526 


A 


1633 


3 


383 


NI WN* KAWMTVHL/ FIT* FTEYFKSTVE 
NC* EKKKILFKI /LtXIDNTFGHPKALM 
EIYKEINVIFMPANTISIL*PMDQGVIA 
TLNSYYLRNTFHKTIAFINYDSSDRCGQ 
SQLKNFWKGFS ILDAIR 


1626 


15527 


A 


1634 


2 


182 


GACTQL I GRLRQENRLN / SGDRGCSEP I 
LHLCTPPWATE * DPVSKKKKKNPCPKLK 
KGPPP 


1627 


15528 


A 


1635 


331 


1 


LFPPPPVLKSGPGPN*N*PPI,KGKSFGP 
LKKKFFFFFF\FRDWVLLYCLGWSQTSG 
LKRSSCWDYRCEPPHLTPNF/SYF/CRD 
SVSL* PRLEGSGSI IALCSLKLLDSSSP 
H 


1628 


15529 


A, 


1636 


80 


381 


KL S KCHAHHS RG F YKYS L FQLG AS Q F PQ 
VLRITHPHKGSWAAPRS*G*SQC/SHFF 
S F F FET KSHF WQ AGG YGRN FT * LiQ P P P 
PGLKRFSHLSLPSSWDYS 


1629 


15530 




1637 


381 


38 


SKRQGFHHVGQAGIiELLSSSDLPTMALH 
PTCPLQKCWDYRC/DATAPNLSSTFFMQ 
EKCLAFL\PLFSSAPHSL* * PKAVIEKD 
CPGL* IW* * VARSQRMNVFNFIK 


1630 


15531 




1638 


3 


295 


PGPDGFTAEFNQTFKE*LIPILVKLF*K 
IQEVR\ FFKFS ITLTPKSNKD 


1631 


15532 


A 


1639 


48 


380 


ILGKAI S FT IE * KGLKYLGI YLTKEAKG 
LHTENYKMLLKELKKDTNIWKGIbCQ/W 
TRRLNMVKISV 


1632 


15533 




1640 


343 


23 


S WLTAT / FC FLGS S DS PAS AG VELLTSG 
DL PALAS * TAG I TG VS HRTRPALS INTS 
TLLCS S PYCPPHLQSLQGTTQPVPFIiKA 
QRTGCLLQGVI LTSQAENFCNKH 


1633 


15534 


A 


1641 


3 


397 


LE LFS S AHCC P S IjTVMQ Y YPP \ RPTSHD 
CQREKSPHRTKKKKPLEGVFLG*KIKTN 
LENPPPLPFFGGGPPPKGGGPFKIV*GG 
WPWPWGLKIPTLPKGPTPRPPWGPLGTF 
GGEGKPLRAFPLQRFFRGPE 


1634 


15535 


A 


1642 




308 


NKWR*GNWISVKIDNFNFYLMPYTNLS* 
IRDDLNAKATTIKLVGENIGENLGIGKN 
F * ERTLKA\L,RGKKMDKLD F I T IGNFCF 
SKDRI KNKNKARRGGSRL 


1635 


15536 




1643 


16 


386 


EKKKLSIjFTENGIPYL*NPKESAKRIjLS 
LINDFSKV*GYKNW\DEKSVAFLNTNK 


1636 


15537 


A 


1644 


533 


3 


PLSLSLS SLFLS FFb* DRACFVAQAGVQ 
WLDLGSLQPPPPGIK*FS/CGGNVAVTP 
RLS PLTLPAMTEVRLPSSKIQTNKEKNT 
VMSE I YQSLM* MGWG /M*VRMCVKFKKP 
WVDN* ♦VSVMGCL.LPSFLPSFLSFLFLS 
PS PLS FFPYF FETES CFVAEAGAQ WLDL 
G*LQPPPPAVP 


1637 


15538 


A 


1645 


341 


19 


GI GGR P P * FQLLGRIiKQEK/ HLERGKGF 
NEPKSRPCISAWATKGDCFQKKKKIHMT 
LQQIQPKQSWKTDISQFRTDYKTTIIKT 
VGHGIWIDIFINGIELSENLNQCS 


1638 


15539 


A 


1646 


279 


3 


TFYHNEKDNNKCWLTI \GKI *MLTGCYW 
ECKMVQbPWKTAWQFL/R/DVNI*LPYD 
LAITLLS IYTRKRKTYVYTKTCIQMFLA 
VLFTTAKRWKQP - . 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla!anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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W=Tryptophan, Y=Tyrosine, 
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nucleotide deletion, \=possible 
nucleotide insertion 


1639 


15540 


A 


1647 


392 




QKLE IMLREEGMLKAKTGQKLGLLCQAV 
THLVNAKEKFLKLNVLLQGTHKW*ESET 
AKSLILFNSMKAEKGE *AAEEKFAASRG 
WFM/RFKERNCLHHLKVQGEAASAWEA 
VANY I EDLVKKIDKGGCIN 


1640 


15541 


A 


1648 


425 


148 


SQSLI * SKALTLCNSMKT/ERAEE VAGK 
KLEASNF\ LKFKKRS CV\RNVNMQGK VA 
SLDGEAAASSPEDLVNFIDEGGYTKQQI 
FN*DKTSFFF 


1641 


15542 


A 


1649 


271 


462 


RRQKKRHKRCILSPLLFDIVLEVPARTI 
*QEKKI KGI QIGKKE V/KI I SLFADD W 
LYL 


1642 


15543 


A 


1650 


70 


398 


RPEASLRHMCLNAGQLLSKRAKLGALSL 
SFFFWKKSLAFAPQPGGQGGNLG**KPP 
LPGLRG FSGLTLLRN / WE * RW P VP PPT\ 
NFGPLI KTGFPLVGQAGFDLRTLGALR 


1643 


15544 


A 


1651 


425 


3 


FEGFPKVGFPLGPRFPSRVPPFGTLPPP 
PKRGPRCFPQPGAPPPKIWTPPGALPQG 
GVGPALPG/ALQKFGPKNPGGFFSGPPQ 
MAP\GGFPGGP*RPPRGGAPFFFFLRQG 
FPVAQARVHI.PGSSDPPISAPQVAGTTD 
VC 


1644 


15545 


A 


1652 


385 


2 


KGNNPSPPEIKFFFFFF*KGUjPL.PQGG 
GQWGYFRSLQPPPSRLKLFSCPNLPSNW 
EYRGP* \RLL*LITGRGTSGSKTKVPTP 
CGPFNLNQLGLGTQARNFS/RLVFFFFE 
TESPFVAQAGIQLRDLHS 


1645 


15546 


A 


1653 


242 


3 


KNKNFGINRGFFFTFKVPGFFFFLGKVK 
LFFFFRNLKFFLKAKPP*\VFFPIGPSF 
FFFFFFLDRVLLCCPGWNVWQSL 


1646 


15547 


A 


1654 


3 


285 


HFI IYTKDLNRRFS KEDI * 1 1 1 KHV/ KK 
CS PSLAVREMQ I KTTVR PGTVAGTCNPN 
TLGGQGRRIVQDQLKQQSKTSSLQKKIL 
FRLARHGGTCP 


1647 


15548 


A 


1655 


1 


373 


KVSLFFFFEQGLLCCPGWSAWSSL.QPQ 
CPRVKQFSHVS/LPSN*EYSCTPNTFSL 
QVCVSIHKYI*YIYIYIFKFF/CR/DRT 
LARLHRLVS NAWSQAILP PWP P KVLGLQ 


164S 


15549 


A 


1656 


189 


2 


VQPGQQERNSI S KTKNKTKQKLPTKKSP 
GPD*FKEELIPILHKL/F/HKIEDKGTL 
HNSFYWTI 


1649 


15550 


A 


1657 


385 


13 


GGPPPWGARS PPKL *NPPPPRKKPPGPP 
PPPGGAPPLGGFFLFFPPPGGPPPGGGK 
/S FSPRFFFFFFFLGGGGQI FLFS PGGG 
PRGVFSPPAPPLVPKPAFFSKKKKRGSS 
GGEKPEADGYFIK 


1650 


15551 


A 


1658 


352 


2 


HLSLLSNWDS IRAPP / RPCVI FKNVPLN 
I FFL*RGVTMLPRLVDNDPPI SASQVAR 
1 1 DVSHWAKLRRS V/ CYVFETGS GS LSQ 
AGVQRYNHGSVQPQPS RVS * S SHHS LWK 
YRYPPPRR 


1651 


15552 


A 


1659 


265 


3 


HSGQRDEGRMRCGE WLESHGWRARS CM 
TLKTSFLFTMAKI*NQLKCLFMDEWIKK 
MWHIHTMEYYSAIKR/DEIPSFVATWME 
LEVIM 


1652 


15553 


A 


1660 


1 


163 


NQQNENRKT I EKI IGTKT * LFKKI KKFD 
KPIiARWT/RGKKNIQITNIRN*RRDVI 


1653 


15554 


A 


1661 


56 


320 


KFFMYSAGESTKIRCLF/SCLFLFIjR/Q 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny|alanine, 
G=Glycine, H=Histidine, I—Isoleucine, 
K=i-,ysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutfimine, R=Arginine, S— Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
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E S RS VS QD G G * SE P RS C YCT P AWATE * D 
SVSKKQNKNKPKKNLNKCFASSIiIjITLP 
PARHKA 


1654 


15555 


A 


1662 


3 


421 


GIITDTFPNLEKGINIQVKKV/RRPPSR 
FNPKKTTSRDLIIKLPKIKDKGS*KQKE 
KTS KS ARL PQPHGLLGLGLG KSASSPIK 
KERKQ \ Q I TYSGAP I FLVTD F SVETLQV 
RRE WHDVFKVLKEK/DFYPR I VYLVKI S 
F 














V I FADE AQ I LKKEDKLDFI KM KTS VHQ K 
TLSTEYKDNHREKI FVSQISDKEL.ISRV 
YKGLLKLWWTDKNLILFYF+RLGHSVTQ 
GRVQWCNHSS LHPQTPGLK\NP PTS AS * 
AAGNTGVHL 










79 


355 


IHLPSLIGDFNLFTGISL.LICLVLFVFI 
FETGSCSVAQ/S/GVQWHNHGLLQPRPS 
GLRQSSHLSPLSSWNHRHGPG*FIYF/C 
VEIRAHYHP 


1657 


15558 


A 


1665 


47 


384 


KEKASGPLINFFFFPKLCKLAPFFFLPF 
FLWGGGGGKFSRNPQKHFPR* KRVFVNF 
FFFFFFLRHCLT VS LSHCHPG WS AVA * S 
QLTAGSNFW \ VKQ S S HLS /L PS S WDHRH 
APP 


1658 


15559 


A 


1666 


163 


601 


I FCKGGVLPCCPGLADLHFSTSNS I SFY 
YSSGLLRMTNKTETPMSTI PKGVGVAWR 
FGNSECI FQELPLTLHHLLSTMLAS FIH 
SHEASANALVGRSLTVGWGCRGVGVSVD 
PAAWLWRDLKGCF*DKSHSVTQSGGQWC 
NLSSLQP* APRLKRS CI LSLPRS WDHWH 
VPPFLANF* IFCKGGVLP\FAQGW 


1659 


15560 


A 


1667 


418 


3 


SVCLGLPKCWDTGRKPLCPAPSFFY/EG 
SITLIPKSEMHLPRNENYRSGFLLN/M/ 
DAKI LNR I LANCISN * I * NH * \ * KVKFT 
PGKKDWFNNRKPTD I I YPH*QNREEKSL 
VS S TDMAKVFNKI Q PVLR ELT I E I KG NF 
LNL 


1660 


15561 


A 


1668 


411 


1 


LRLHVGRTTT \ LFKAVSQGHLSLQR FLL 
PSVEI CPAPRGGVYRGRQASLS CGGLHR 
VRASQLLWFLTQASAMAGPPPPVLLPPC 
SLI *DCCANNKGGFIGVGPFEPCVGYNL 
LVS HLLR P S E KPS I RVG VT * FSRC 


1661 


15562 


A 


1669 


151 


1 


PLEKEAEITGP*PHAWLIF/ CFF* TESR 
Y I AQARMKWHNLGS LQPTPPGFK 


1662 


15563 


A 


1670 


1 


389 


TFF * KLKM INLSEEGMLKAKISQKLGLC 
\TVSQWNSKEKFLKEIKSATPEST*MI 
RK*NSSVANTENV*ERSRTSHNILFS*S 
SIQNKDLTFFNSLKAWRGQEVAEEKSEA 
SS GW FMRFKERSHLHT I K 


1663 


15564 


A 


1671 


1 


363 


ECTGPKIAKIILEKKNWVGGLPI/PNFK 
I * YKAP VI * F WLKVPV IPS SSAI LMKT V 
*Y\YFKDRNQDEWYRLRVWKINSHIYGQ 
LI \FSKGTKTIQW* KSLFNK* CWNNWLF 
TCKRMKLD 


1664 


15565 


A 


1672 


203 


2 


ALNKRTDMPS * * IRRCNI I / KCLFS PKM 
N*VFNVIT/IQCPSGHFFTETDKSILKF 
IWKSK+PRLAKRTL 


1665 


15566 


A 


1673 


15 


378 


NYHHNQNNEHMYHSLPNFFFFFFGKGAP 
PG\PKGGGRG*REP*IPGGKGNPPL*PP 
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peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KEGGPPGGTPPPGKPGKIWDFWEKGGFP 
LSPGGAGTPGPKGDGPPWPPKGGGKPGG 
TKQPGPKKFF 


1666 


15567 


A 


1674 


2 


349 


LFLTVLARLVSNS*PQ/ CDPPALASQSA 
E I TG MS HRAQPRG I R * I * CGS YTTCHPI. 
I&LT \ YLFI YFS TESHS VI QAGVQWHDL 
GSLiQSQLRRIjT/ CILTLPSSWD*PHVTLi 
PG+FCLF 


1667 


15568 


A 


1675 


348 


78 


LiGPWVP P V I PAP LGG * GGRS PRPGFLTN 
PGPKGKPRVFLKIQNLPGV/RGRPPLFP 
/GSPGSGGPKSP*PLGPSFPLT*NFSPP 
FWGPPGGPK 


1668 


15569 


A 


1676 


2 


385 


TSRRDYRP\GHHNQLIF* / SFCRDGASL 
CCPCWSQTPGIKKSSCLVFPRCWDYRCE 
PGIi*I*MGKNPT\LFSNGL*CDCIPL.IH 
S IAD I RKKPHS * LQGL * LCHQQNSQTES 
CS VTQAGVQ* CDLGSLQP 


1669 


15570 


A 


1677 


386 


1 


KSTRPVLYKWNNKA*MTEHLFTA*FTEY 
FKATIETFCSEK* IPLKILIjLICNVPSH 
PRALMGMYKEINVVSMPTDITCILQPMD 
QGVISTFKSYYLRNTFDKAIAT/DSDSS 
DGSGKNLLiKTFWKGFTI 


1670 


15571 


A 


1678 


2 


193 


EGGRIFFNSFFEVILTLI/PKPKKWER 
K*SYHPISIMNGDVKILAQTLLNQIQQY 
LKRIIHYDS 


1671 


15572 


A 


1679 


561 


830 


TLLLGT \NAWDFPCIjKPMLTY \HS \ ENS 
RGL * KS WINLGLT VFYK WGTNNAWGDDR 
HHLVYRHGFTGIF*GSQLKTYCSENIPF 
KILLFIDN 


1672 


15573 


A 


1680 


415 


2 


TSCAWLSLYPVYLYRSSSLPRFIFCHFK 
HLWYKNITKKAAEINTNFCCYINRVLLC 
HLGWS / AT WS S + LiTVTS KLLGSRDPHT 
LSLPSS * EGRCI TPRLGDLLNLFSRDGG 
GIRGTSTLPRQVLNAWPQAILL.VRIT 


1673 


15574 


A 


1681 


1 


78 


RPR I RHE VGQ AGL.KLLTSGQT PAS VP * C 
WDYRPEPPCPALHISYK*NHAMCGIiKCL 
AVSA*RHVLGFIRG\WHVECCFPFCS *A 
GLKXLTSGQTPASVP 


1674 


15575 


A 


1682 


414 


162 


GGPGGPIPGA/AGLRPPPPPLGNPPPP* 
KAKICPGGGAPPVFPGS*KGGGESP*PP 
RGRGPFIGVQPLPFGLGHKRGLFPKKKK 
KI 


1675 


15576 


A 


1683 


378 


129 


QPFGPRNFYQIFNFPG/PPV*HSPLFGP 
KFPPFSPGGGQWGPLGNPRPPGAKGGST 
LRGPRTGGSRGGPPGPGKFFFFL>KQSLA 
L 


1676 


15577 


A 


1684 


3 


374 


GISVLPGIGAPGNKPELFEEVKLYNNAR 
ERE KYDNMAE L F / AWKTMQ AIjE KAY I K 
DCVSPSEYTAACSRLLVQYKAVFRRVQG 
SEISSIDEFCRKFRLDCPLAMERIKEDR 
PITIKDD*GNSLS 


1677 


15578 


A 


1685 


2 


373 


PFIRPETIKLLEEIPGGKLLDLGLGNDL 
LART PNAKINT WDH I KL KS FCT I KET I K 
TMKKQPTDCEKI FSRLiISDKGLiM/SQMC 
KELVQLN* KK/TDDSVKGWAEDLDRHFS 
KKDI KMANRPGKVL 


1678 


15579 


A 


1686 


378 


201 


HATCLANF \CS YG * DRVS PF CPGWS * TP 
ELKRSTRLGI.PKCWDYRCLGRSI.IiFPGA 



639 



WO 01/64835 



PCT/U SO 1/04927 



SEQ1D 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Ph'enylalnnine, 
G—Glycine, H— Histidine, l=Isoleucine, 

N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaJine, 
W=Tryptophan, Y=Tyrosine, 
X^TJnknown, "=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PPDH 














MCVNPGGGACS E PRS CHCTP AWVTERDS 
V* KTNKKKVKRRKK I FHANS / KLKRVGM 
AVLISDKIYYKLKKVRRDKEYI IKGSTY 
QEDISITNTYTLNT*APKYMKDTLTELK 
EEIDIYAIIVGDFNTLLLI ICRGT 


1680 


15581 


A 


1688 


307 


33 


DEGSCHDAHAGLKLLiDSSGLPASASQSA 
GILGVSHGARPLISSYNGTSHGGLGPAL 
VTS FNLSHLFKDLLS LQ I /HVTF*GMGL 
GFAGAKLiSL 


16SI 


15582 


A 


1689 


11 


394 


IFILEARTRISSRTFFKS * INIHAPYNS 
AI*LIGIFPREKK/ STCPYIYTQMFIAS 
IiFVTAQTRKQPKCPSTGE \ WS KNLWN 








1690 


20 


391 


SEGKGWYSCTKWQSMKLGGITFFFFFPP 
PKNPPPXKSGP/QKGPFF*GKGPPWPPK 
KRGHKN\RGFPKQRARPPKPVFFLIFGK 
RGFIjLGPKGG* MPGEKRNP PP WP* KGGK 
NPGNPKGGPHLTL • 


1683 


15584 


A 


1691 


72 


392 


IKMIGSLFFGFAFFFFFGKKTPFFPQPK 
R\GGKP* IT*TPPPGN*RNSGPSPPQKV 
GIKAPPPLPK\NF*FFGKNGVTPFPPGG 
FEPPTPKEPSPPVS PKGGKTNPAP 


1684 


155S5 


A 


1692. 


389 


161 


HGGACLRSQLLGRLRREDCLNQGG*GCS 
EPCTPAWVTE+D/SSQKNQKSKIKKSGL 
DNS FS I G * G I LGLS TCDRYS 


1685 


15586 


A 


1693 


286 


363 


DGISL*PRPPGLRQPSQPS\LLSNWGCR 
STPLCLAGFFVFVETGFLHVAQACLRQG 
FTMVAQS/ASQKIHIT*GAFETIQVLFY 
WGGVGFGQWRFFFFFETESHSVTQAGVR 
GCSLSSLQVPPPG 












356 


ELLEPRGRGCSERRPCHCPPVRVAEQDS 
VSKKKRERKYLYFLRFNWRSLRIFIYFC 
* /HPLQHNIQNI S FTLQNSFGFFSRQYC 
PSPLE I I FLTPLTENLLGLFMKGI IQNW 
FFGVGLF 








1695 


3 


298 


KYFE TNENKN I QYQNL \AVKL VFRENLT 
WNACVKKEERFQVNNLALYPKN* EKSM 
LNPKGKI IKVRSEKNDIE*KNDEENQ*N 
* SWYFEKI TYWQTLD 


1688 


15589 


A 


1696 


3 


405 


RLWCGWRNRHLGS *NRVENPETGLHRYA 
QLIFLTKVQKQVGEGQPFNK'* CGGTWAP 
TGKT/MEQPPKASS9S3SSSSSSSSSSC 
KM*NIVFFKMGENLWDH*AKSYEVRTKA 
* TIKGKVDKLDFI KIKHFCYGKN 


1689 


15590 


A 


1697 


6 


392 


LQRTLLVGLFNAAGNLKLKPML I CHS EN 
PRALKNYAKSTLPVFYKWIKIAWMTV* T 
VAAWLTEYFKPIFENYCS/EKKIPFKLF 
LLTNNAPGHPTGLMEMYKTTNVGSLPAN 
TTSVLQPMGQRVISTFKS 


}69Q 


"75591 





1698 


390 


"1 


AIIESDFIjTTSREVAKKIjRVHPFMVIjWV 
LEP IGKVKKVNKWGP * KLNK/NKKNHCI 
EVSSSLFLCNNEPFLDGIVTCDEKWILY 
HNW* * SAQWLNREVAPK\HFLQPNLHQK 
KVTVIVWWSAAGLIQNEH 


1691 


15592 


A 


1699 


1 


245 


GGGGEYS KI IAI KTALKNTTYLG IYLIK 
\DVPDLYTKNYGTMLREIKYLEK*RARP 
CS*TERFKIVKMSI/LPNLIYRFNTI 


1692 


15593 


A 


1700 | 2 


324 


GTSGTSGTSGTSQTCRISRVRSTSSWTS 



640 



VSDOCID: <WO 01 64835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 

Inst amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H~Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GRHLQCHVSKAESSWFVLSASSCSTGSP 
ISLSGS/GSSLSNSSKNLTLPCLFSSLL 
S * P*ANPVDSAFKI YPGLSSVGSCL 


1693 


15594 


A 


1701 


183 


372 


PGVLLCWPGWSAVTVHQCDHSALQPRTP 
RLK* S P \ S T WDYRY TP LCPA 


1694 


15595 


A 


1702 


29 


382 


GKRFPCSWEPKFPLTQGGFPPIjHPGQKK 
KPGS KKKKDKS KCW * G / CP. EA / GTLTH Y 
RQELEMVOSFWK/TVWQVLRKLKTVELPY 
DPAFLLGLHPTSTQKR\DTMFMA/AITL 
/ I 9 ER WKQ PR CTS 


1695 












KKVKI IGEAAVEFPDTIKKI IEEKEYLP 
L* VY\NADESGLFWKKKLQRTFI SKEEK 
\SMDRLTLI ILCKCS WVYEQDGP * AliKE 
KGEHQI>PVF*L*NKKAWTVRTLFIiD* FH 
QCFVPEVRKYVASKRii 


1696 


15597 


A 


1704 


2 


330 


KLNNLLLNNS *VNTEI KAEVSSSLLE IN 
EYEDTTYQNLWDAAKAVLKGKHVAPRHF 
LQEVKK/RL.KRPQINNLTLYLKEL/EKE 
HIN1iKASGRK*MTKIGDLFGLYFV1jNG 


1697 


15598 


A 


1705 


100 


342 


APKWSI VCPELVGSWSH* /PSRMKPWTL 
TRQGFTMLARiVLNS * RRDLPALAS QS P 
G ITGMSHRTQPLL I NLME IFTEILS 


1698 


15599 


A 


1706 


600 


211 


SCSVARLEFSDVIKAHCHL/RTPGLKQS 
SHLSHLSS\WD*GRVPHDLANF* IFCRD 
RVLPRLL>QAGLEL\LASSDPPS*ASEKC 
WNYRHEPTVPRQNLGLLKTTYGWVFLKK 
YILTVSVFSLMCLDHCLLM 


1699 


15600 


A 


1707 


409 


1 


RGPFFFSPGKAFLKjLGSNSFFPIKKTRA 
PKKPIFSPVSP/LNFPPKTGFPPVFPPQ 
MGGFFF/CFPSFLFFFLPPPPPFFFSFP 
PPFFFFFFFFFFFPLFFF*DRVSLCHPG 
WSAVALSQLiTAALTSMDSSNSPTCV 


1700 


15601 


A 


1708 


154 


2 


IGKPLAGLTK/RKRENT* INKIRNEKGD 
MTADNTE IQSIIRDFVS* rt ahq 


1701 


15602 


A 


1709 


263 


37 


SAQLHPLN I QNHRQS I LLHDFFLKKQD/ 
G / WPGAG AHACNPS TLGGQGG W ITRSGD 
QDHPG*HISV 


1702 


15603 


A 


1710 


390 


42 


YAGGFRAIFFFPLPREGAKNPNFPVGPP 
PFGGPFFFFFAPSQKKEPGFFGKKGVF* 
GGAKGFPNARGPFFF/G* KKKKGPKNKT 
PGFFL/MGPPNPGGPPPRGEGGKIGAKK 
KKKSIRL 


1703 


15604 


A 


1711 


3 


167 


YTCVFVCLCL.CDCMC/CVCACMYICVCV 
CTRVCM*VCVCMCVRVCVQALTVLCKSV 


1704 


15605 


A 


1712 


116 


391 


KRNFFFGPQGGGEGPKFN*RGPPPPGVK 
GIFPPSPPEG/GKKKGAPPPPGIIFWFF 
KKKGVPPCGPGGV*TPDPGGPPPPGPPK 
GGAPRQGPL 


1705 


15606 


A 


1713 


401 


47 


HHYATKPFTHAHTCTCIQDTCNTCMQHT 
QVHT/HTDTHTHTRKVSSVCVLIMAEQE 
RPCPHC * GGEAGAECGVCAWLGLSS * TW 
RNRKGPAHTVRWGQEQSAGCAHGLVFP 
RSYLWT 


1706 


15607 


A 


1714 


1 


400 


CVESCEVD I EMVS CCV/ CSG*SAVCSGT 
ASAHCSLP I PGSRDS PAS ACQVAGTTGM 
PH/LYPGVPLKPREGLQFTELPSGQLiEI 
QPTCENKR *HVPCAL * VQLTD I RPN * RY 
QFRVAAVNVHGTRRFTAPS KHFCS 
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WO 01/64835. PCT/DS01/04927 * 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
' sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 
peptide 


Predict- 
ed end 
nucie- 

location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G = Glycine, H— Histidine, 1— Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


1707 


15608 


A 


1715 


413 


3 


LNTFEPQRSFFI VNSS EM I LLVE CLFVT 

PVTQDGVQWSNHGSIj*PQTPRLK\HPTT 
SVFPVA/RQGL/NSVAQAGVQWHDLGSL 
* CRLPSLKGSSHPRLPSS WNYRYAPPPC 
I 


1708 


15609 










VAETTGLLHWKKKI VETGSHFI VQTAFK 
FIiDSSDLPAIiAC/ SWDYRSELLCPACFY 
NFCLFINIPC 


]709 


15610 


A 


1717 




384 


YSACVCVCVWVDVSVHRCMSISGH/ARl" 
IRVGPAHKAQSSCRPVPAGCCTSAQTPP 
WCSAATADPPPR*GESLPGAYPGSHSTC 
CPGCCLDW*HSLPSTEPAACRAGAPGGG 
H * AWLGCGGRAGGR PG 


1710 


15611 


A 


1719 


3 


615 


PVGSWARSSGAGWPPGSPQSVSDGEAGH 
SIPAPRGQCSRHRAEGRTARVCLSHCSF 
SGPRPGLVPIR*SLGRPDAVQAIVPDSQ 
EGRKTGIHAEAVMFPLGPGKGVKGCAGG 
RLCPSSPPSRCLDLGRRGMPPSSGPAGP 
RPSGVGSDL.GRPGAGAATSSSSSSSSSS 
SSSSSSSSSSRDGEGP\TGSVEAPGSLL 
GPWXjPSQPS 


1 71 1 


15612 


A 


1720 


320 


3 


GLKWUJLKDNSLYL IIiAKWDDCLDEK* 
YADKMLQYTKSVWVSRSRGCQGQVGEKK 
QATEAAQEWELRK \ RL YWRKE CDALRAA 


1712 


15613 


A 


1721 


44 


373 


KAMGQTLWKTVWQFLTKEE IGPM * QS C P 
T / DLALVFLG I CT I DLKAY I HTE TCTQ M 
I I ITLLI I AKNRKKASCSS VGE / WNKKL 


1713 


15614 


A 


1722 


135 


396 


AQGLFCTSVKLASEQPUJILFQLDKRNK 
\FEIYGTSG*L*SIIICQNNLQSKFQMY 
HHKIMSSLGAVAHACNPSTLGGQGGQIT 
RSGD 














LiNLRAKA I KIjL E E N I G I HL P D LG IjD D L F 
LDITPKSQATK/AKIGN*GFIKL.KHFCA 
AKNI IKKMKRQYKEWKKIFGNHVSDKKL 
VFRIYK*HLPLIIKNSSS 


1715 












* KLGLLQQK\ ISKIANAKEKLLKEVKSA 
TSMNM*MMRK*NWFIPEMQKVLWW/I* 
N I /PLCQSLI QNKALTLFNS I KAERGEE 
A\KLEATKRWFMRFKESCLHNVKVQDEG 














SQLLRRIiKQNSLNPRESSSKTTTRTKVS 
HWHKNRHVNQ/YNKIENSGINLHIYG*L 
TLNKGDEASQYSS /DILFNKWCWQKKKK 
\YLDPYLTPCTKISSTWTTSGFL 


1717 


15618 


A 


1726 


390 


1 


TFFPKI \EKKIL*FIWGPRRPKIANVFP 
*QNKPKIEGIPLPGFKIYYRALVTKTAW 
F*HKNPPIGQRNKVENSETNFHPPSELN 
FFFF\YKGAKNIHWGEDSLFNKWCWENW 
I S I *RRMKLGPNLTPYTK 


1718 


15619 


A 


1727 


3 


365 


HASAKSNLRWI IKIiWLRAKT I KliLEENM 
GENL WDLELSRG FLDRT/ PKLYS I KLKL 
RK/WNFIKIKNFCFSRDT*KS*MAGKNI 
LSGKDLYSEYLKK*CNLIIRQSKTST*M 
FIAALFIIAKH 
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ISDOCID: <WO 01 64835A2_I_> 



WO 01/64835 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glut.imic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophnn, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1719 


15620 


A 


1728 


368 


2 


RNCLNSEGGGCSEPRSRHCTPAWATEPD 
SVSKMK*NKKINK*IMFLY\EQQSETEY 
FKLPLTLA/ SRNMKFLGINLSKGVQDLE 
TENYKI LLQE I EEDLNK* RNKTY* W I KL 
NTVKMKSILSKR 


1720 


15621 


A 


1729 


326 


30 


NPFGGPKKGGSQGREIKPPLPPMGKPLF 
F*KTKNKWVGGAPPVIPPS/SGVLSQKK 
TFTLEGGGPNKLNSPPALP/ARGPKKNF 
FQKKKKKTKQNPSQREE 


1721 


15622 


A 


1730 


1 


374 


IFNADKIA\FWKKF*KPQGTSVGREEKQ 
APGFKAGRNRLTT I /L/GAS AVGFM I RA 
ALICKAANPQVLKGRDKHQLPVFRLLYN 
KKAWTTRTLFLDCFHQCFVPEVRKYLAS 
KGLVFKVLLILDNGPC 


1722 


15623 


A 


1731 


389 


I 


FPPKIFFFSTLFFFFPRFFPPPPFL*P/ 
SPPIYFF/CAPKKKNFFPPPPGKNFFFF 
KTPPPFFFFFFF*D* VCLCCPGWSAVAQ 
SWLTTTSVFRVPVI 


1723 


15624 


A 


1732 


118 


422 


DITTHLFKWLKFKK*EKGLNILFTKEDM 
QMEKNLKRCLT* F WKELQ I KMRYHYPP 
IQMAKI * KN/STIS I AWQG YRTIGTLF/ 
HC* *EQPFW*FLSKLNMILPYNPA\IML 
LSI YPNALKKHVHTKTCM * MF I AALF 1 1 
TKNWKEPRCPSICEW 


1724 


15625 


A 


1733 


407 


1 


NIKGPFRGPLIQWGLLIWPKDSFPILGY 
PPFPSPKISFFFLARSCSGAPNHFPLPN 
QSPCFPQPSFFLGEEKKEFLPGYSLAFP 
FN/RFGGS * RVRKGNGGPI PMGESFLFF 
F*DRVLLCCPGWSAVA*SRLTATC 


1725 


15626 


A 


1734 


322 


362 


TAIYIYI I *LFTDNI PSHPRTWIEIYKE 
INIFVPANTIPIVQHRNQGVICTFRKTI 
TVTDCD PANGSGQS KLKTWKG FT I LDAV 
KN/IRDSWEKVKIG/riN/GVRKLIPSL 
KNDFKRPKT 


1726 - 


35627 


A 


1735 


49 


395 


RGGPGFFFFFFFFSKKSQILPPGWKGRG 
E PRVNGT P P PRG KGN P PAQ P PQ EGGKTG 
PPHKPG*FLCFLKKKGVQKGG*GGPQ/A 
PGPKGAPRPGPPKGGEKREGPPGPTRPN 
LYYAH 


1727 


15628 


A 


1736 


417 


2 


flffffflffffsfasgpeilftcl*ht 
hihflffyskst**ppvfaggmfqdpqw 
lpetkmvpnfkkkrttlt/yi p* * klce 
cdlsnffcff*drlllcrpgwsava*sr 
ltatstfqaqanrtrg 


1728 


15629 


A 


1737 


316 


338 


FFFLSFYFETESYLHHP*GFIVKLSKVK 
D I EN \ LKKTARGNYQVTYKGAS I RLAAD 
FSAE I SQAWREWDNMFKVLKEKTNWQPR 
IYKTLFVPHF 


1729 


15630 


A 


1738 


197 


379 


QKRAQ IDKAI FI CRD I AL/ P* FQ / M YYW 
ATVTKTAW* WYKNRHI EQWWRSW \PPEI 
KSQSYSHL 


1730 


15631 


A 


1739 


4 


401 


RGYRHAPPSLANFCIFSKD/MGFTWLN 
S * PQ / CNLPAS TS * SAG I TG I SH CTRPQ 
MATFL I GPHKI I P WS VLWPNL 


1731 


15632 


A 


1740 


94 


117 


KDRPM VP PVGAGEDQADEPCRGHAS LWS 
QLVSAPTTPIPLPGRDVPSRATPFPAAL 
AAQQP P+ASPYPLP PGLG AGHAS AS P VT 
VPFSP I SESTGS * ES AL/ PAPRPGGSG 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 

nucle- 

locntion 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=C»lycine, H=Histidine, 1= Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
"W = Tryptophan, Y = Tyrosine, 
X=Unknown, *=Srop codon, /=possible 
nucleotide deletion, Y=possible 
nucleotide insertion 


1732 


15633 


A 


1741 


3 


390 


VDQFLISHDLSK1/TEDEVHNLNSSTTIG 
\EVEFKVKKL\*KKKSSGPDGITGKFY* 
TVREEVTPHi/SYLF/HEIEKEETIjVNS 
FYEARIIL/IPKPDKNKP*TNIDAKTTS 
KVLPNRIWQYVKIIIQHN* VGFV 


1733 


15634 


A 


1742 


3 


442 


DLKSRVE PRVRPSVRKQ* WLLKVLICA 
SKDTLKRAKRQP I GWEKI FVNHMPDKDL 
IPEYKINMQKSGVFLiFTNNSYSSTTDND 
INKWAKDLSRHFSEEDIQMANKHM\KR* 
SVSLVIREIKIKTTGR*LFTPKCWQG*G 
EIITLVH 


1734 


15635 


A 


1743 


411 


2 


LPPFKHPPPE I I LGAPKKKI TLPPPRPK 
KC I SLKGPPFFFFCRYRVLLCCPGWFST 
PGLKQSSHLGLPKWWDYRHEPYCTQSSF 
SLS FFLKQTGR* WFDLSNFFFY/CYCFI 
YFSRDRGLTLLPELVLNSWPQAIIi 


1735 


15636 


A 


1744 


1 


393 


RPGGPPKGRSREQGREGERSRRRRPRAP 
WARSHMWGARVFSVPRSFDQRPQEKCVQ 
SSYKQQLPARARDGTGNLIRGAPLFFF* 
DGVLLCRPGWSAGFKQFSRLSLPSR*DY 
RRTPPHPANF*MIj/CLRRSLT 


1736 


15637 


A 


1745 


395 


0 


PSAPSFFSTRL*LGEPPGFPPPPFLKPP 
PRN/SIFGAPKKKFFLPPPRGKKFVSLK 

gpplfff*dgvllccpgwsagfk:qfshl 
slpss*dyrrtpphpdnf*mf/clrrsl 

T 


1737 


15638 . 


A 


1746 


397 


1 


CGNFLKRE KNFE ARRFLQK* AAR FRN I R 
*VTPEITAFCPL*HVASFLVFLTPNFPT 
IPQLYCLEPLGEMGGSGSKLPPFPTSKT 
PNPLISVNLCFP\AIKWENFFF*DGVLL 
LHPGWSAVAQSRPTATSTS 














FTKKGGRGGNLS P / LPPQKI F* KKKTLK 
KPFFWQRVWFFSPLFFEQKGQGPFFFKK 
PFF*KRPPDTPQPPPPSFIFFFLLFFFR 
/HLVAQAGMQWRHLGSPQP PPPGLTQS S 
QLELPHTPPHPDNFCIFGRDR 














IKGFKKKRGKGGAPQKKGGTGAKPPPPP 
PQGF* K*QKRKNGS PPNVFFPNPGGPPP 
PPP FWGDKRGG P PRGGGAP P PRGKRENP 
FFPTLA/HGKKEKKK 


1740 


15641 


A 


1750 


3 


396 


KRQTTNWEKVFA*KN\ADGLISLIRRKC 
IjKVKK/W*R/DMNTQFTDKGILMTNKHM 
KRSSTSLIMEMQHKAGVIFHPSDWQKC* 
STDNTQS * QGHEEMYTIKHSW * TFDYQQ 
PF/SESNL . 


1741 


15642 


A 


1752 


30 


419 


NEG I GAGHE VS FAANKCR S MRKNVEVLA 
LNRRLDGELLSGXiTSTQALPGWAYLHLL 
SHHAVRPLFLCFKRGWVLLCHPGWSAVA 
QSQ \ FELLGQV I LRPHLPSS * DYRS I PP 


1742 


15643 


A 


1753 


16 


410 


VGPKKSLQQVWAAVQATLPLES YDLAHP 
IILKVSLADRDAI *NLWQIPIVAS*YIP 
LGF/YSKAMPSSVDIYSSFEKKKTHFFL 
TGGGGPPIjYSNYLGRLGGANHLTPGVKN 
QPGQFGKPPPLQKVQTLAGRG 


1743 


15644 


A 


1754 


2 


17 


NSSLIKHRRIHTGERPYQCSECGRVFNQ 
NSHLIQHQKVHTR*RMYI*SR/CGKDFT 
QKSTLI*H 



644 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lvsine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glu tamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1744 


15645 


A 


1755 


138 


380 


XPXXXSPXPPPCPXPLSSLFLTKCSAY* 
TPAHRPPPPGPFVPPPKPCCPPPPPLRP 
WPPLPL.L.TNTPIPPPTLSFHPGPALS 


1745 


15646 


A 


1756 


194 


3 


WLCIPIRHTTEQQPGFPF/LIIWSQPYS 
PLFCLFVF * DTVS L.CGPGWNAGVQSGLT 
AASTSQAPSL 


1746 


15647 


A 


1757 


2 


403 


RVLFSPTLAYTYLLFYLGATILMGVNIW 
KQPNC PTK / GQQ K I KLQ Y I YRME Y YS AL. 
KKK* ILLFAIR* VNLGDIMLSEVSQ\SR 
KKNIVLSHMW 














FFFFFFFFFP*TKGLGCIHRCDHGSPQP 
RIH\GSSNAPSIiAS * VGGTTGAFHHARF 
TLIQSSSVHVSTHTLHPYPSSLP 


1748 


15649 


A 


1759 


456 


31 


FAKR I TDKKLLSL I YLAT * KGE I TKAL I 
KQ*\LKAQAKNSLKRHTIVLNHMKL/CL 
ILiLIIKEMQIK/STLRYHFFFITLAK\I 
+ KLGNTFCWQGL*GTLIHCWWECK*HNS 
YVGG I WQ / FSNKL YVQ I * YNS AI SNLDG 
RVGRPG 














QFQYFYYNGSVI KAVWYWSKNRK\ IDQ* 
NRTESPDKNLHKHMQLIFDKGTNTPQWH 
KDDLFKKWCWNN*TSTCQKKKKKKGGGP 
* KEQNLTP PGWBD 1 1 FLFGAP KNM PGAG 
VKTRWGGKNPGFPQ 














YTSASWGGARYTASAAGWKTLLIiLFljFI 
*DRVLLCHPRWSALTQPRIIAASTFQ\V 
KQS FWLMH I GD WDYRRCMP P CRANFFNF 
LCKKKKTLRRQEVKQTPALVRV 


1751 


15652 


A 


1762 


390 


1 


KFSTPGNKNL.FFLKAPPFFFFCRGRVLL 
CCPDWYSTFGLKQSP/CFSLPKCWDYRR 
ESPPQAFFVFFGLPSLPFSPPPAPSLSQ 
SSSFFF*MEFHSFAQAGVKWLN\LFGSL 
QPPPPGLK* FSCLSLPPTRP 














PRVRGFFMRKFVDS YLVPYTKSNLKWTI 
DL,HVRAKAINLLN*IJVRIHLYDRELCNG 
FLEM I KKTKAAATTTKLDFI KI KNFCAQ 
QMSS /MKVKRQSTKWKKKCSYHISDKGL 
VSRKYVKKAYNSS IRTQSH 


1753 


15654 


A 


.1764 


334 


3 


WSKRSGPPLiSKNQTKKKNATKPQT*KNW 
INEIGPI I /NTS PS KE :<TGSNGFTD * FY 
*TL.KEELLS ILLKLFQ KTEKS VI FPKS F 
YVKDHSSCLSGIHPKDANMEQHMQINQC 


1754 


15655 


A 


1765 


259 


1 


KSTLFLMKKYWKGSFLKKHFFIjETRSLS 
D/SPG/GVQWRDHSSLQPRTPGLKQSPH 
LSLLSSWDHR*APPCPANFRFRLKTRIG 
RDV 


1755 


15656 


A 


1766 


402 


386 


FKKKSRLRKI / KVQDEAASTPDLAKTI / 
DYEGHYTKQQIFHVDERAFYWKKMPP /R 
TFIARKKKSMPGFRASKDKLTVLSGANA 
AGDLKLKPVL.TYHS /ENPRALK/HYAKS 
TLPELYKWNTKAWMKIQKFPS*FT* IFM 


1756 


15657 - 


A 


1767 


2 


406 


PRVRPRVRKLITLIiNVSQRWSSEKKKKK 
KKTKKKKKKKKRGGGRL* KKKKKKPRRG 
RVNLFFWGPKKSTPPRVFKHRGGEK/PP 
PPPPKKPRERKPSLGVGSTWHGISPIKH 
SKKTKSTK 


1757 


15658 


A 


1768 


14 


409 


IASGFLFFI YFGVIGRPPKRGGVFFPGG 
GGAPHPPQGGC*KKKRGGGGGPPPFFPP 
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I SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
corrcspon 

last Amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Prolinc, 
Q— Glutamine, R~Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TJnknown, *=Stop codon, /=possible 

nucleotide insertion 














SLGGLGRPRPMSKIKTPPSAGGKPP/PF 
LRNQN* TGGGGPPQ * PPLFGGPGGKNFS 
PPRGGGGGGPKTPPPPPPGGE 


1758 


15659 


A 


1769 


305 


1 


TKTG P F PSNRAKKKKI VFQKKKKEKK I N 
CFVTAP FKG I KTEATD* EK I FAKHLS VK 

PIKK*AKDI>SRHFTR¥TD 


1759 


15660 


A 


1770 


148 


402 


FSPVSLG*GRGNIYAGMSNV*EVPPEID 
FQHEVKRALQTS FQVKLVKI I FFKST I * 
KSLAK/ WLAWAHACNPSTLGG * GGW I T 
RSGVKSEPGQHGE 


1760 


15661 


A 


1771 


250 


2 


KKKKEKKIICFVTDTFKGI KTEATD* EK 
IFAKHLSVKG/LVYLi*YI* ISKIYKELL 
KLNNEKTTI P I KK* AKDHSRHFTRYTD 


1761 


15662 


A 


1772 


407 


1 


KKIRRGGGQPPLFPLIGGGREKKFFFPG 
KRGFY*TKAGPFPSSWAKKKKFVFKKKK 
KEKKIICFVTATFKGIKTEASD*EKIFA 
KHLSVKG/L.VYL*YI*ISKIYKELLKLN 
NEKTTIP I KK* AKDLSRHFTR YTD 










1 


406 


KKKKKKPTTFPGLPFFFSLSLPPRVPGA 
RGPPSSFPSKPGLGC*IKKSGVSPAPKK 
KKPRKTNPKNFPRPNPKGGGEKFFNLPT 
HPPGGTPPFF*RRRENPPGFAPQKG/EV 
FSPGGRNSRGEPRGGPKKKKKKGG 


1763 


15664 


A 


1774 


2 


378 


AAGEWLHQGSLQSLPPGLKQSAPLGVSK 
W*NPWHDPPPPAPRFVWWGGWIjRWS 
FFLGAQAGIFFFF* IEMGSHYVAQAGLE 
/PPRLQGSS*LNIiPSSWDYRPVP 






— ^ — 








Q Q M R D KRN L F E HN K * G I R G I Y 1*N 1 1 KAR 
HEKPTVDTILSGESFSSKIKTMLISPFL 
FNT VLG VLA* ARKRKD I * VGK* E VKS YM 
FTNDMI \LDNPKDSTPKKKKTGYFMGGP 
GSKPPPQRGGAFLSLTRDPLEREFPKTA 
LFTLGQKKIKGPKFFS 


1765 


15666 


A 


1776 


334 


402 


KGGGGVGG*QGPWRLAHCTDK\KEERKR 
ERERKRQRKKERKKERKKEKKE*MKNNK 
KKNDK 


1766 


15667 


A 


1777 


406 


3 


SPSSSSSSLFSPPPFFWGGPRF/SPPPP 
VFKPPPPFFFLGPPKKKFFPPPPAV* FF 
FF\LGPPPPFFFFFFFWEAGFPFFSPG* 
GP/SGPMAGFRSLPPPGNSLSKKKKSEG 
liGEGGNSVLTRVLLISSYQIPGNPR 


1767 


15668 


A 


1778 


70 


409 


LISFLVSSLIVRLYRPLLCLFPDPSKGH 
C I PCL LPIjFWI LLH I FLFMHS F I YCL I N 
DRVLLCLPGWCAWRPRLTAAS ATQ\ I K 
RSGSHLSL.PSSK* \WDHRRCPGFFFFFF 
FF 


1768 


15669 


A 


1779 


390 


31 


SHLSLPKCWDY/RL*ATTPSQKILVFTH 
G*VLSLLSL.SFLIPPD*TF*KMSLMRPS 
LjTQKSS I IQHDLIjDKVPITI flrqsl/d 
S/VTQARMQWHDLGSLQPQPPGLKLSSQ 
PQAL\SSWDYR 


1769 


15670 


A 


1780 


357 


1 


LTLFWGAQYL.PKKGKGPFFLLSLPSPSV 
WGPLPQKKKSPPLCFFYFLNRVLLCHPG 
WIAWQSWHSSAHFSL./VLTRFK*SSCL 
S LLS S WNYRCTLLHPPNFLNFWYRQGNA 
VLPKLVL 


1770 


15671 


A 


1781 


122 


254 


RKNE \ WSGAVAHACKP S TLPG PGGQ IMR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ JD 
NO: of 
peptide 
sequence 


M 
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od 
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NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \= possible 
nucleotide insertion 














SGDQDEPG*HEPPRPTDSFFLYPSH 


1771 


15672 


A 


1782 


45 


387 


TQTP KLRRF FHFS LLKS WH YRCS PL \ PQ 
HNGYFLTNLD I ATSMFFFKDRVSL, \ LAQ 
AGEQWCDPS SLHPQTPGLRR\ PPASASQ 
EGETTGAHHHTWRNL I YF FYYTYKF * GT 
YPMA 


1772 


15673 


A 


1783 


1 


401 


FATLARLjVSNS*PE/ cdlpasasqsagi 
TVSPAHLAISPSFPAMPSSGLGLSYPAY 
HPGLGLRCHLPVLTSPWTSDTGPSSVLP 
DAGALHCPPEPQHICPL\LSGWLQTP 


1773 


15674 


A 


1784 


432 


1 


FLFLFFFIKKKTPFFGAKPKNRGF*KPP 
EFFQIF/ S/CPVFLiGRFPKPQKEFFFPE 
REMGFFFLFSPQGGGFARGFPPPPLNQG 
EAPRAGHKKKGPNLGGRTFFFFFFEMES 
\FSVTQAGVQWHDLGSLQPLPPPFKRFS 
CLTHAS 


1774 


15675 


A 


1785 


15 


434 


RLSLSCCGREEHSTLiPGAPWRCTEIAWA 
DSPDPAPSPPSALPSLLPFHVYRDVCPV 
LCLRGWP * + MVERGRLG I SPTWLLGW P F 
PGGA/PHIKPE*YFLFAYTILRSVPNKL 
GGVLALLL.S IL I LAI I P I LH I S KQQS 1 1 
F 


1775 


15676 


A 


1786 


1 


258 


CWPETPVLK* SHLDLTKRGEDRREPVWA 
ASTTIFFFETGL/NSGAQAGVRWVHLGS 
LEPLPPIPSLFMTP*GPCPGLGSSWPLR 
EF 


1776 


15677 


A 


1787 


399 


63 


SLHNQWKSTPTI/KTSKKISHI *STWPG 
WAHACNSNTLGHHGGRTA* /RSGVQDQ 
PGHHSETSSQRLRNPYIKRCIKYLAHSK 
CCINDS FTVSVTSRKLIGKREVS PNNI T 
FR 


1777 


15678 


A 


1788 


3 


474 


MSISPVRWNSKEAGRAANRQFPSFSPWK 
DDSRDAS PPEPAS PTIG\ P IRRLAES S W 
TWGS PCAEHPRARAGRRKAATDCPWAAG 
SQ WRGPAGQGAPRS CLF PGS RTAARAQH 
PRVAPPPPPAPLNTRASALRSQLPRNPL 
*VMTPRPPAAAPRSPVGP 


1778 


15679 


A 


1789 


66 


395 


LVQPLFDFI WHSRLS LWSGRDLLWSG KE 
TMNPCLNHSIGVLQEWQGSDVKKRRRLM 
ESLTG PAADVI R I LKSNNPA ITTAECLK 
ALEQVFGSVDSSRDAQIKFLNTYQNP 


1779 


15680 


A 


1790 


413 


1 


PS PRA1. 1 TOYS * E EG PRF W QV \ E KKGQ P 
LKPHPGLGSPHQESPRVGPPKRGGYNPF 
*KKRAKFFAPGENKGPPFWTGRDPTF*G 
KTKKKTNPPKKGGKKGPPPKPGQFFFFF 
L.RRSLAWSPRLECSAAISAHCKLRL 


1780 


15681 


A 


1791 


314 




KTKP FFLKKT PPKKKNKKKG S C * DKARY 
QTRKG/IVNLGH/HPSFLFYFILFFETE 
SHS VGKAGVRWLRANS LQTPS PG FQQ FS 
RHS KPSRKDYRHP P KRPQECVQ 


1781 


15682 


A 


1792 


104 


409 


EKQS FAFDWHFDL.YFDNVK * KEGSESKA 
GEFNASTGWFGNFRKRL/RFKNVRVTGE 
PAS VTQEAADE FPDNFKK I TE KKGYLHG 
KFLMYHEAAKYLNFWCPTK 


1782 


15683 


A 


1793 


392 


2 


GERERDRL*REREREREREREGERERAR 
ERERARQGTSTVES RF/HS YCRRDQD VA 
RPKAKGEVAAGRGS PDGLQVGRGQ * PGP 
SLRPGPWREWGFATYLACGGP I PTGEVE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
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peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, 1^ Leucine, M=Methioninc, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argininc, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














SGEEWAQQGEWGQALGPP 


17S3 


15684 


A 


1794 


405 


119 


IIKMAIVPKAIYRINTISIKVPMAFFTK 
\LKT I I KFI *N * KRARIAKAI LS KKNKA 
GSIPLPDFKLYYKAIVSKPTWCWYKNRP 
IVLVHSRTARKK 


1784 


15685 


A 


1795 


455 


3 


CSVTQAGEQWCNHSSLQPQPPGLKWS/C 
PSQLPK\WNYRHVPPRPALTAHPALTAD 
F* RRKEYICLMRHRGKKWHDFTLRS KKMK 
AMRHEYHSFLTYPNGHLYLHWQFFFFDT 
ESHSVSQAGVQWHDLGSLQPPPPGFKRF 
ERVGPGGNSGAD 


17S5 


15686 


A 


1796 


2 


134 


PRLQHCTPAWATPQDSVSKRKRDMTTDP 
AI*KG* * ATTTN I Y I HKF YNLDE I DQ FL 
KKHKL PQLTWYE I / DNLNS PITR\lEFV 
I LNSKKKYPGSDGFTGEF* D* FCLETKK 
R YDYRP CNLKRI ISDYNKHLHT 


1786 


15687 


A 


1797 


1 


404 


PTRPLTGSSASGMMVEIFPKTYLISSAC 
W VK I S K I DLKLFS P F FFFG F LEGG FY F C 
PPNSRGGSPGGKFGLIEPLPPGLKGIPP 
PPPKRGGEFGPPPPTPAYFFFFLWGGGV 
/PP/AVGGGGKPPI*GNPPPWPPQG 


1787 


15688 


A 


1798 


2 


383 


SG WLWACRS PDS E PLACPAG P RQ * GAYH 
PGQQLWMVR* VSQQCWPTPPALHS IS \P 
RP*LPPRRAGLGTCSPP/VPESPLLPPW 
APAWPKPPRRALPPA/PPVPGPIDRPRA 
EECRRMVRDWQAAPPAAP 


1788 


15689 


A 


1799 


35 


410 


ATGPSLGKVCS * AFS FL* PKLDFWHL YF 
QASGFLL/CHNSPSQTNSCSSFGKHYGM 
VXiRVNS L * PDHPWNE F YFLNFF I FYRDS 
/SLTILPRLVSNS *AQTILLPLP/PKVL 
GLiE 


1789 


15690 


A 


1800 


65 


415 


KKGVLKGGPPPQPLGV*GPPPEKGGPGP 
F\GAPKEKPPLALGEPKGKPF*RGPKGF 
F FKKGLEPGAP P P P KPRGE KP P V FKGPH 
PFCKKGGGDKREKNGGL*RKKPTELGNG 
PPPPG 


1790 


15691 


A 


1801 


417 


3 


NLGPNI FPPRAPQNWGPAPPLFFFLKKN 
FTWGGGSTPLFP/LNLGGLGGPFPGLEV 
YAPPS PHG * PRFFFKNQKLP P PVWGALY 
SPFFGGWEN\RKAPPGQTLFFFFFFLK* 
GQDLPMLLQLVPNYWAQVILPPWPPKV 


1791 


15692 


A 


1802 


1 


431 


QPCTPGLK*SSCLSLPSIGDYRC/RTTV 
PS*FFFFFIjEKGVGFIPRGGIKG1jDNC* 
LGPHPPELK\NPPPQNSQEVGTTGPPPR 
PG*LFFFFETFFFF*KGGFFILAQP*MK 
WGALKKTPPAFSKRGGYRGGTPMEPTHC 
FLKEN 


1792 


15693 


A 


1803 


256 


399 


AI KN I HDS * EEVKI S TLAG I WKKL I TTL 
I DDFDG FKTSVE EVTVDVME 


1793 


15694 


A 


1804 


407 


2 


FEKANLFPFLFKNSGPPNVPGRGMAGVP 
KFCCPS KRKVPGP I FFLRRLFLFYPKCR 
RPPLFWPAFGP* KFFLPQVFAPAF* KPS 
PKKKRAPVFFFFFFF/RDKVWLCHPPWS 
AMA* SQLTVTSVS WAQAILLPQR 


1794 


15695 


A 


1805 


429 


116 


LLTKKKRKKKPLACGESGLGGP*GTVTG 
V * QAEDTHVI W \VLSSAPSLSSE EMTDS 
MPGHLPS KDS RYGMEMLTDKKWT WDGGA 
WDSSPQGANGKRGARQASGFS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutnmic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, lM=JVlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1795 


15696 


A 


1806 


2 


408 


FVXFVFLVETGF\Sir J ARALACAISNS* 
PRDLPTIASQGAGITGVSHRTWPRRSCV 
F * E AFLTMLMVLHI P PSLL WHS VFTHLL 
ALSS * CFLFFLRGPWAVTQAEGQGHDLG 
SIjQPP1jPGLKWFSCLSLPSS*NYR 


1796 


15697 


A 


1S07 


1 


196 


FRLGASLDLSGGCSGVSWTPAPPLPPPP 
PPSAASGIS/SGSTSAAAGL*SCRTAFF 
SFLSSFFFFFLKKINPFPFLGGI*PFKG 
GPRLV* GNI * PPPQG \ DFLGGTRGGKKP 
WGGGKFGQGGNFPLPLKKPPGPQKNPPP 
\ PP PP PPSAASG ISMAAHLQLPACDRAA 
QhS FLS SHLFFFFF 


1797 


15698 


A 


1808 


395 


3 


LGKKMNNPKFWQGCEATGTL/M/HCWWV 
CKFVQS FWNTDS I Y * G* AAATHDS AMLL 
LGMHS MPACTFVHQKT* TKMF I AALF I L 
PLNWKQV/RCPSVI \DGYIPTMDQSTAM 
KMNKLHAKTWMNLRNLMLNEKPR 


1798 


15699 


A 


1809 


7 


454 


IPGSTISIiQPPPPGFG*FFCLSLLSSWD 
RQPAPPR/LANLRR*T*LQSA*L,WR\RG 
PILDEMKSFMCNCNSL.TEGVKGRVEMMS 
QNGRLLTKFCHVGQACLKTPDLK* SARL 
GLLKCWDYSCEPPCLTQMPFSFFLFF*D 
RVS/PLSPGWS 


1799 


15700 


A 


1810 


20 


355 


PQCAHGCRAVAPVCVCFCF CVC VCSH CV 
SMCMCGEVSAGWTLLCLCLST\CSGAYA 
CDWGCG/ CYSACVCARVCAC VCAQLLTC 
I GMWE * GQAGQGEVLLDLPLCLCSCWAP 
Q 


1800 


15701 


A 


1811 


3 


414 


SSKNDNWSLQEFMDKIAGMKKNLGNLTE 
LNNTV* EFHRAITS INNRINHAEERISD 
LENWLSEI \NRQT* KIVTRNEQKLREVW 
DYVKRLiN1j*IIG\VFEREGGKAYYLiQNI 
FED I VHENFPRFARDANSQI QEMQRT 


1801 


15702 


A 


1812 


1 


443 


AGKS PS PKKFKKGPGGGRWS * FLGGAGG 
G VP FRPGI QGSNYRFF / S PP P PPLGEKK 
KPPFKKKKKTPQKLQMGNPHPYFNPRVH 
GCREQQPWEKAPDATRQPHYADKHVEAG 
EPREPPKPHSEPLFSSPRKRPL.FTRSST 
/SGASPP 


1802 


15703 


A 


1813 


411 


66 


WKNNVFNKECLER * I F/ 1 1 QKKKLDP \ F 
LTRY I KI KS K* I KDLNIRLE I KKTPGKE 
SVTLAKWLIRSPYLSFP*QIQSKQ*INS 
YVLIKITKEERWSTSKEYQKPWRERKAM 
YKIK 


1803 


15704 


A 


1814 


369 


20 


QEVRPS I YLSSSNRKYVKDSNARFTKEA 
IQIANIHMKNCPTSL.IVGETHIKTSKGY 
HYVP IRMAKILKD \ CltfRRCGETGTLIHF 
* WEW KMVQPFQKP F * WFP KKRKI H * QFD 
LATRS 


1804 


15705 


A 


1815 


2 


675 


GLAI LGRRLRGEACTRSP FS S I L I FMVS 
MGEWPGVP/ GIKGTRTGAEAVPTRRRKS 
S I W PQTGGAGES SG / PLRGLPRQGKPGS 
PRGPSGPPQNCARWWHPQAAPLGACCFS 
GPEPSRLPPWRQGNWCIiPSTPSSSA*EG 
WR/PVLQPGFL*SPSSIASICPGAERGP 
PGSSRP/ GLRGAPGG 


1805 


15706 


A 


1816 


273 


2 


MESHIMWPFTIGFFHLVQCLHGPSMLQC 
I AVLHS S YLFI YLFI Y * DRVLL CHPG WS 
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SEQ ID 
NO: of 

sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


1 Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Asparfic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AWQS*L.TGASGSW\LRRSFHLSLP*C* 
DFRHEPR 


1806 


15707 


A 


1817 


40 


404 


GLHE\PGVQGCSEL*SHDCTPAWVTSET 
LTLKKLKIKKSKKKXRANPRGFERPTFG 
EAGPGGLLKAIISKPRVIQGTQKSGALL 
GFNSNGGENHPPQKQPYLGGLYAKAHSP 
RGGGPPSPCG 


1807 


15708 


A 


1818 


392 


3 


EKYNMSYDIKSTNHRKNCKLDFI KI / RN 
CCSLKDT INKMKMQASNLE KI FAI HMPD 
RGLIFKRKNSCNLVRR*QPPFFKEAKDL 
NTQKS Q * TNG / HGS KETS L 1 1 REMQ I KT 
T/MNYTTSIPT*MLKIKKMNI 


1808 


15709 


A 


1819 


311 


431 


EWGRAWWLTPVTPSLWEAEAGESRDQE 
* fdiCTTVEK * KTI SN/HDVP I RSSWTGM 
VAHACNPFTLGGRGG*ITRSGVGDQPD* 
HGEGL 


1809 


15710 


A 


1820 


68 


410 


AKKNQGPGPMVFGFGGIKP P PKQKKVRG 
GFFAICPKEQVFFFFFFWRAVVQSLNHC 
SLQPQPPGLKQFKQSSHLSLLSS*GYKH 
VLPCPANFILFSbVETGS\SIYFPGWSQ 
TP 


1810 


15711 


A 


1821 


408 


1 


TPFFFLRVLRLTPPLLGNFFGPGFPPWG 
GFSPGPLLKGPRP/CFPIFKPIFQPGKW 
GLVFFFFPPFFFYPRGSR*NLKKIFPNF 
PFFFPVFF*ILNPPFFFFFFFFFFFL*D 
RVSIiCRPGWSAVARYRLTASSTSQ 


1811 


15712 


A 


1822 


362 


76 


SEIAPLHFNVGDRVSLHLETNKQSNKQT 
KKTL I FRDRVLLCR PGKNA/ VEVQ * *LP 
AASN/FLRLKQSSCISLSSNWIYRHAPP 
HLATVFNFLI I FE 


1812 


15713 


A 


1823 


314 


2 


VISKPCPRELTCITYGVSLTQCSMFGRM 
KGLLLIWPPVCEVRRASGRPPLMGSEEP 
LCPAAT PSGRCTQQ / LH* ERAMMTMAVL. 
SNRKGGNVGKR* RNQI VAVS 


1813 


15714 


A 


1824 


57 


389 


NLHLLQLPTYTDADS TGPTLS GMNVKNL 
HWSYEYKYRSITGVQWLILGSLQPLPPR 
FKCSCLS FLRR WDYRCAPPRRATF\ * FL 
VETAFLERLTSCDLPTSASQSADITGV 


1814 


15715 


A 


1825 


410 


70 


VPIMSATQDYRHEPPRPAGRFL.KKLKME 
PPHNPALLiLVG I * PKNMKSLHKDVCTPM 
FSGTLFAIAKIQKKPNCPSSMDEWINCR 
NY\MHIYDGICYSALKKNEILARRSGTR 
L 


1815 


15716 


A 


1826 


2 


411 


FLVEMG F \ SMLV I AGLKLPTSGG APAS A 
SESAGITDVSHRAWPVFFFLKRCLVLVG 
RS * *A\ WPHTNL I PPLPSG IKGDLCPNS 
AGGWEKGAPPPSPGKF/CEF*GRTGTTN 
FARG/WTKTPD 


1816 


15717 


A 


1827 


276 


3 


GRPGPADFRVRPQLLQRFLFIYLFTEME 
SCSVTQAGVQWCNLGSLQPLPPGLQ*FS 
CV\K*FSCLGLLSSWDYRHMPPHLDNKS 
IFSRNGVS 


1817 


15718 


A 


1828 


1 


391 


IjEPRRRFLQCVQDCATALQPGQQSKTIjS 
Q KKKKKKGG P P / S * YQKGQG CP SG KKGR 
GVAGKGAFGPGFGGENKTPPGGGPTGEG 
PFPQKGWGPSQGPTKGPNLWGPGGPKL 
GGKGGPPGPTKGGGGPSSF 


1818 


15719 


A " 


1829 


2 


134 


DHLSPVVWNQPGQHSEAPSL/LINIWKL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence . 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nncle- 

location 
correspon 

last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Gltitamic Acid, F=Phenylalanine, 
G=Glycine, H = Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AGCGGAHLSSHL/LREDHLNLEVQGCSE " 
P*LHACTPAWATE*DPVSQTKLN*NIWK 
IiAG CGGAHLSSHLLGR I T 


1819 


15720 


A 


1830 


427 


107 


VQHQPGQQRESPS I I I I QKLARGGARCL 
*SQLPGKLRLENRFN/SGSERSHQCTPA 
WVTDRD CLKRTG I TRAS SQRLSVG I KQS 
CLN P RTAQLQ VS AQ SPSTVSTNL 


1820 


15721 


A 


1831 


3 


540 


VQFPNFKIYYKATAIKTVIAQHKQR* ID 
E/MNKI * TPE INS YVYGYLNFNKDAKAI 
QWGNDSF\FKKWC*DNWISACKYSQTSV 
SASSASSSSS 


1821 


15722 


A 


1832 


385 


2 


AGRQSETPYHNSTIMKVSLQVCVDLSCL 
LGPGS LVQDS PNI PAQRKRFLRLiGS FWL 
PLSRFSRVGWPLPHWGQRSSGFSLPR\P 
P*SQIPAPRSPPPAGPVPARSWVCGPRP 
QTRPLPAERPSPRPRRL 


1822 


1 5723 


A 


1833 






RISRSYLSEYGGSGKEHPTLGASYARIM 
VFG /VTI I YF * RQCLA*AQW YSHSSLLP 
QTPGL,KHPP\AQAS*GAGTIGTHHHT*L 
TFAF/ 1 FVLGCFFL * NKI SVTQAGGQGC 
NFGSLQPPPPGLKRVSCIiTLPR 


1823 


15724 


A 


1834 


2 


306 


kARL.VSDS*PQ/CDPPASASQSAGI ICV 
SHRAQPAEEILKVFGTLCLEPQRPTPDI 
FIIPYLFC\LFEMESCSVAQAGVQWRIL 
GSLQPLAPGVKRVSCLSPP 


1824 


15725 


A 


1835 


12 


400 


KKGMVPKQLKVGKKPPFSCWGPNMKKRD 
SPVFHHQDP IFPIP FFFFGNGFLFFPP / 
LAGGQGGNLN*PNPLPWGLKEFPPPTPR 
GRGEKGGAPPPPINFVFLKKGGFPLGGR 
GGLEPPPLGDPPPLPPKRG 


1825 


15726 


A 


1836 


220 


401 


KGSFVFIPQPEGEGPFLG*LKPRFPGLK 
QFSCLTLIiRSGNYGPLPPPPVIF/CGFL 
R 


1826 


15727 


A 


1837 


12 


357 


GLLGGQMNGSLGTQTSYEDLMSSS\FKP 
NSPPPTPS/VRTGHLjPK*P1>ESSNGPPP 
PQVSHSFQGQWARGHPSPPPQWNTPFSP 
PQQYTQCSKTDP*PPPSPPYLGQEG£NA 
PSLA 


1827 


15728 


A 


1838 


8 


380 


LMKTHAKLiIjNKRVAN* IQQYKK/HH/NQ 
MGFILGVQIYFNF*KINLIQLINSVKKK 
KN/HSSSSSSSSSSSSSSSPPVLIKSLC 
NLGKNKRNFLCLTKGIYKNKTE\NSMKI 
I LNGEQLNAFPLRLGTK 


1828 


15729 


A 


1839 


2 


444 


VPGDAKWFSVLHIiKDAFFFIPIiVPESQY 
PFAFEWENPNTREK\TWAVLP*GFWDSP 
HFFAQPL.ERDLRGLQLEDGS I LQYVDHL 
LVYS PTQEASDQNT I KTLHFPADRS YKV 
S KKKAQ I TLQQVHCLGYILTPGTCK/LS 
PERVQAI 


1829 


15730 


A 


1840 


1 


642 


EIKGIQIGKEEVT/SLiFADDIDYLRIDS 
TKKLLEVT CELNKVG * KINM * T * I VFLY 
IGNEHLELEI/R/ELMPCIKTSSTMKYL 
EINLKKDV*DIjYTENYKVFPREIKIT*A 
NQ*EILCLiCSRRIiNTKM/STFPQVFCVY 
YAI P VK/ 1 PSRNFLVLVDKLI LKFI * KY 
RGPRTAKTTLKKKKKWRLTLLI FKSYY 
KTIVITIGWYGFQDRQVD*WNRIE 


1830 


15731 


A . | 1841 | 3 


423 


HRITSE* DbHIARELI W\SMYGSLDHKN 
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SEQ ID 
NO: of 
nucleotide 


SEQID 
NO: of 
peptide 


M 
eth 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 
SeqUe " Ce 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalflnine, 
G— Glycine, H = Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W~Tryptophan, Y—Xyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














YDAELDRMAMPCLCAIAGALPPGYVDAS 
SSSKAEKKATVDAEGNFDPRPVETLSVI 
IPEKLDSFINKFAEYTHEKWAFVKIQNN 
WSYGENIDEEIjKTRPMKR\PY \ KTYSEK 
D 


1831 


15732 


A 


1842 


33 


179 


YMLGGRG CSDLRSHHCT PAW VT/ TA* LS 
KKQYQRQQNRLQYHLQEYERKK 


1832 


15733 


A 


1843 


349 


10 


LAGGLNSMEGER\ LR* EREECTQQQM VH 
DKYCKDLMGFGTKPRHI TPFSS FQAVQP 
QQSNALVGLLG YSSHQGLMG FGAS PS PA 
KSTLVESRCCRDLMEEKFDQRK\QWVLK 
CR 


1833 


15734 


A 


1844 


15 


856 


AAEQLSFIYKLPQNPSPSTPGSSLSGTH 
GMQTMLGSTHLPNLTDMLGPELGVQGIP 
S PGCACQRGRG \GGGRE CCS PPGVS PQG 
\SAVGRGAEGPGGLTRSGSGAASALVRP 
GEKGCWCRTASGAGPQRRQRTRGPGSWG 
LSFSQTSEEKCPSPAGSAGAGPVCQRRQ 
SSFAGGCGTGAGAPGST\GDAHPAQGGS 
GGPLRSIiPAVGGPRPGPSFLKTSGSGSV 
PQG VP ILWSLiT* RALAAPGSQGPAGLAV 
SCTGGRGYRDPQAPGTGAG * HGNSTRLR 
GP 


1834 


15735 


A 


1845 


402 


2 


S KARRQGRPLRQG * APG / AAR I PEQKR I 

GQTAMAGGGHDPLPLPPARSRSQESIGA 
RSRGSGHSQEQPAPQPSGGDPSPPQERJJ 
LPEGTERPPKLCSTLPGQGPPPNV 


1835 


15736 


A 


1846 


446 


32 


TSRKIS * KTGNQFLMKECSCNSHHKAAF 
TKKDVLNI \LAWKHVNTKASETFHFFQ 
SGQAKVQQGFVKEGCELINEAJjNLFNWV 
YGAMHVKTCTCMRLLDRLQYIMGDYAEA 
LSNQQKAVLM\TERVMGTEHPCIRPL 


1836 


15737 


A 


1847 


440 


4 


VDGRHVE VS KKGGQVNYQAG \ KTVE I WA 
DKLGCNMLGTADMVECLKSTRYKELIQQ 
AITAGGAPIAFGPVIDGNVI pgdrqilm 

EQGEFLNYDIMLGVNQGEGLKFVDGIVH 
NEDGVTPND*KFSVSNFVDCMRPRRGPN 
YSRFQ 


1837 


15738 


A 


1848 


526 


0 


PRRDPPPKRQTPIPTHVSPVWEKGPWGP 
APLRPDHPSLSPCPAMG*K\PGLPRGCP 
QTQISPLFNRSPASPLICHHHPPSEP\K 
PGPEPPPTPSSSIPSLARFTRPGESSPL 
PPPQTPSGPP 














RLPGGTGIPWGAGILCPS*LPGPGSLSP 
AAGRG/ SGPSAGPGAAWFSSP* /PACPS 
S S RS AVPGGAG S FRRAGPGL F YTL PAP P 
WCGRGASNKI IQMPGLVC 










3 




YTVCECVCVCIjCVCLPVSL\SIjCIjSVSV 
CFFPSLCGFVCVCPCACVSIiAECALCAT 
KRFV/CMAACLW * AS FCVSTWVMRPAVN 
RFRRGGSALGA 


1840 


15741 


A 


1852 


128 


524 


KIPGLGQRSEGVGQGKDDLHVTAPVPTH 
GWGEGAASKPTVLPPPPP\PDAPTVFFF 
FFFFWEKKYFFGPPPKGAGPQIYLLGPK 
PPGFKPFFPPHPQAAGI*YIKPPPWYKC 
PLLKKRGVSTLAP\GFPKPPP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Fredict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysleine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1841 


15742 


A 


1853 


1 


1648 


MTVPLHSGLGKGVKPERKTTVRRPFVSA 
GKKYACPFKKAETPQWRRLRMRPKAPAA 
SLLATFPISQRLIPI/rPARKHCPSESET 
STWWSKAPATPRTRGAGPTSPPRPTRR 
R\*ACSLQKLFAVEEEFEDEDFLSAVED 
AENR FTG S LPVN AG RLR P VS S R PQET VQ 
AQSSR / PAAVTPHCS LRG FG PARLG P/P 
ASLPPARPVLTA/GPSCIGAAPLRPVST 
SSSWIGNQRRVTVTEVLREPARPQSSAL 
HPLLTFESQQQQVGGFEGPEQDEFDKVL 
ASMELEEPGMELECGVSSEAIPILPAQQ 
REGSVIAKKARVVDLSGSCQKGPVPAIH 
KAGIMS AQDESLDPVI QCRTP \DPP * DL 
VLWVTFLFQ/ PALTVPTQQLHWEVCPQR 
SPVQALQPLQAARGTIQSSPQNRFPCQP 
FQSPSSWLSGKAHLPRPRTPNSSCSTPS 
RTSSGLFPRIPLQPQAPVSSIGSPVGTP 
KGPQGALQTPIVTNHLVQLVTAASRTPQ 
QPTHPSTRAKTRRFPGPAGILPHQQSGR 
SLEDIMVSAPQTPTHGALAKFQTE 


1842 


15743 


A 


1854 


235 


223 


IHKFIHQIWLAKITCQRTKV*KERSVLL 
PTSFSPVPSQGHYTCKQ\LCSLASDLSQ 
PDLOTKFMNLAVLHAMWNSRKVS CYPWT 
MIYFLHANRT 


1843 


15744 


A 


1855 


373 


3 


IKDGI YRYFYKAIiDSFCIiCCEFMNQFFS 
CMDRELSQRCFLNS AY * FPSNL+ CYLCY 
LiFFFF*LKKYFLTFFLiFL.RDRVLLCCPE 
*SAMVHS *LTVPLDFW\VKGSSCHSLLS 
SWDYRHALPHLY 


1844 


15745 


A 


1856 


378 


I 


RQRHSPAGNTGRPQVrPCG* ISWPSITK 
DRTSMSSVTSGALGHTAASPHARLLPLA 
LPSVRTQHGSPPPGQEQPTIICPSNLPT 
HP SLPLGMH PSVRAS PPLCK/P/SPPSI 
PASVHASKHPSPPVY 


1845 


15746 


A 


1857 


3 


379 


YMVRKVI EVWFLIjLLLLFFFFRGGFLGQ 
GWGPPAPGFSFGKKNPQGLGSSNliRGPW 
* TNP * PVPGGALFLVGPPTPAEFPFKNF 
SRGFLLV/ALADF* TRPRL VS PHGG/RG 
AKGTPAFLESMPPWMP 


1846 


15747 


A 


1858 


452 


1 


GTHGLLLGSGPF\RQVFKPDNFVFGQSG 
AGNNWAKGHYIEGAKLiVDSWDWQEE* 
ESCDCLQGLiQLTHSMGSGMGTliFISKIR 
EECPDCIMNTLSWPPPKVSDTWEPYN 
ITLSI HQS VENTDETYC I DNE ALYDI CS 
RTLKLTTRCI 


1847 


15748 


A 


1859 


1 


385 


NTSSDYIFPFFFLFRNSIHSVTQAGGQW 
HNQGSLQPWPSRLK\ CPTASAS I CLHMP 
PFLANFLIFFVEIGSPYVAQAGSRDPPA 
LASQ/SAGITGMGHCTQP*VFLFFFFFF 
S FRKKCS P WP PGGGP P I F 


1848 


15749 


A 


1860 


470 


17 


I EMDSRRVPRDKLACI TKCSKHI FDAIK 
IT*NEIASAD/DFFPTLIYIVLKG/NPP 
CLQYNIQYITRFCWPSRLMTGEDGYYFT 
NLRLGTHCSWLMMTMWTCFRAFR I FSSQ 
WMSYVLRFRNGITGVSHRAHP 


1849 


15750 


A 


1861 


3 


790 


CSRPEFPGRRFVEAVRSKPYLSIiPF*SR 
* S FFNVPAE + TS AKD I LAS SE F I KQNND 
VSS\LQKFMPQEVK*LDHIHTA*ADGSW 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phehylalnnine, 
G=Glycine, H— Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KAELERGPDGSAPSPFPQKPAASRQPEL 
GELATFLGRVDPWYQSNVNTLCPAIHKIj 
AEMPPSLDTSRTVDPFILDVITYYIRMG 
TQPIYFQIYTVKIFFSDLSQDPTEDIFL 
IELKVKIQDSKFPKDGFSPRRRGVAEGP 
GAELSLCYQKALLSHRPREVTVSLRATG 
LILKAIPASMY 


1S50 


15751 


A 


1862 


192 


3 


SSGSHSVTQA\GVQWHEHSSLLP/LLT* 
PPGRK * ASHLTLAS S * DYRRAP PHPANF 
* IFCREGV 


1851 


15752 


A 


1863 


82 


370 


SLCQKRAFVGEKLVHGLLVSPSGGRVPS 
CPDPWGCRPRFHAIAVYSFLKLRWIPE 
VSILPEDLEELYDLFKVRSSGKMRGSRP 
*AGLSRG/DPACP 




15753 




1864 


2 


325 


IQVYSISHLSIYLF\IYHLSRGSMHVSM 
SLS I * S I YP CMDV* MYLSNLCMHV Y I YL 
FYGS I YRFYLS I CLSVYLS I YLS IYLSI 
YL I CHP S I FK.T V I D EHA I FAT WRH 


1S53 


15754 


A 


1865 


3 


377 


YSPWCKLFRELCKINVFD\LDSPLLSGK 
EFNDTTHNTFDHMWRTKEHNEAGWLLIjS 
SVDKVMKENDELRDSNSWLQKQV\ * PLK 
SAKTALiSGSLNSCREKAEIVEKQTQSIiT 
M*VADIjQRKMHVQP 


1854 


15755 


A 


1866 


10 


378 


GWKNGEFIDAL* KVYGHKAPNKSAVYKW 
IT/ * FKKGQDDIEDEDHSGRASTLRKKI 
HLVYALIDKD/ *RLTAEAIANTIDIS IS 
LAYRI L.TEKLKLS KLSTQWVP KQLCPDQ 
LQRRAELPMEILK 


1855 


15756 


A 


1867 


346 


1 


DILWKLQKPQPNGKMLKAAKERKKFAF 
KGVPVRMNADFS I AAMKA/RRRWNS I FS 
F*KENNCHLRLLYSAKNIFP/EIKTFSD 
RE FVT I RSAVKE I LKDVLWAEERLSHVK 
SRNV 




15757 




1868 


1 


377 


GTFFQRTQCKGIKQ\YWGL.IIKASSDP 
TCVEKEKVYIGKLNMILAQMLKQEWTEH 
WPAFISDIVGASRTSKSLCQNNMVILKL 
VSEE VFDFSSGQ I TQVKSKHVKDSMCNE 
FSQI*Q\LCQF 


1857 


15758 


A 


1869 


90 


384 


QWLLFTEYSSLYHPVLFFFFFFPGGRTG 
PNP P AGG EGNE TG / PNG P S T PGGGGNP P 
PLPPGGLGLLSMPPPPRQILLMETKKRP 
PP*TNKCCSPGYSPP 


1858 


15759 


A 


1870 


2 


578 


FWKHALLLGLDFLPGKMAPWSGQGCSL 
GHTEGGTSWDFAVGGASWRLKWCVKGD 
SHKGPATPIASCKGPLGRPCPLLAQSKA 
*GS *KRG/VAPGSP*IiALGMGGG\DRIiT 
LIS Q VHGNQVTQ I IPFSTEGETKAQRSP 
S LP PRDL I RGRHS WNLDSTQLLG YCPLL 
PPPIiHPAGPLPVPFTNGEIQKENSRE 




15760 






1 


382 


SGQDAGS CLLYGAGSGAMVSGAYNPY I E 
I IEQ PRQRGMRF I YKCEGRS AGS I PWEH 
STDNNRT YP S IQ I MNYYGKGKV \ RI TLV 
TKNDPYKPYPHDLVGKDCRDGYYEA* FG 
QERRPliFFPNLGIRCA 


1860 


15761 


A 


1872 


490 


1 


ADSLSKDPGRPLHPPNIWGDRGRQ*PGE 
TPQ*HRRSCASQDPGRSQ*PGKILPSPK 
PGRPPPMTGEDAGPPK\HGNPSPNDQKK 
NPPPK* /PQGNRIHRDPGRFPPQ*PRKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
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peptide 
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C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=Phenylafanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparaginej P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \ = possible 
nucleotide insertion 














SFPNNLGDSNDQRSRFPVA*KNLPQSSS 
GTRAIGRELYLPPQPGSSAFPNSTT 


1861 


15762 


A 


1873 


373 


1 


GGGVPGALFSHKKKSLFFFPPPPFFFRK 
GL*NF\KIKGCGSPFLPLLFFLEKESCF 
VP\RVECGGVILGPCKVCLPGSPPFSAS 
AS * VSGATGAC/RPRPGKFFS PFFFFYF 
LVEMRFHRVSQDV 


1862 


15763 


A 


1874 


3 


374 


YMLG KE I VS KT KIGQE LGL LNQ / TS Q W 

TETEKAL.LVW I EDQTSHN I LLTQNV I * N 
MALTLFNS I KAERGEEDTEEKLEGSRY W 
FMRFKKKKAI SIT 


1863 


15764 


A 


1875 


2 


364 


IHSGKGESLWDLGIiDTEFLDLPSKA*HI 
KAKNDKtiDL I KM * NFCSAK/ ET * / 1 RMK 

N\KKNTIRR*AIDMDRHFTK/EKMPMTM 
♦YTKGCSISLVIL 














GFLYLCSSEIDPYPSPSKKI KSEWIRTY 
WMKLLSENATIMLKDSDLSQVXFCVNIP 
FVQAAKAKIEE+DYIKLKWFCSAKGAI * 
KAKRQPTE/ W/ DR I FANY P CVYGL I TTT 
CMEFTQLTSLITTPTITLPWNPRQSYHH 
NPT 


1865 












VRPTKIjDPLERTQYTLPLL Y KWNNKAWM 

tahl/fta* fteyfks \s vkpde*vrei 

DYRML.YL 


1866 


15767 


A 


1878 


h 


491 


IHRLLWPPLSAAR PPSRESGLRCRAPRR 
PAS AAAATAAS P S \ PTAPQG P P R /RRRL 
LIQPPLYPRGLFTPGVPPL/APGGSREP 

AVQAEEGVDPVGHLSEVLAAPCKQP* TP 
PPACHCLDGEGRPSGVQAPLHKAKLYP 














LQPPI/TWALiIjIjQPPQPKSRAAFFFFFFF 
FGKKSQFLFGPPGGGEGEKLG * REPPPP 
GTKGMPPPPPRERGKKRGGPTGRENLGI 
*RKGGVPPGGRGGGQTPNPGGGGAK\PP 
KGGK*GGGRLPPPQIKGPKRGRPKKRKG 


1868 


15769 


A 


1880 


190 


2 


PLYCHKVGQVGLELLTSGDPPPLSLPKC 
WDYRY \ DHHAQ PS F* LFLS VQ I SG I KFN 
HSWQPCI 


1869 


15770 


A 


1881 


1 


458 


FAIRAGRNLPDKEFRYLRTVIVTAAVYW 
GLNSKLiRCLTS \LLTFQHRAGVSPYTSP 
FGFA*TCVFAKQLLEPILC/RPCFHRAP 
LLPKIjRGHFAE/FP*QCFFR/QALGFSP 
/RST CVGLRHG 


1870 


15771 


A 


1882 


3 


392 


YMMRYHYILIRMAKVKI \ S S TNDDTQ * L 
KXIHC*QECRMVQPLWKMV*QFLIKLNI 
*LP*NLiAILLWGIYLIEMSTYEFR/RKI 
CIQMFMTDL I VI AK YWTQ/ PQ CPS VAGW 
IKQ/IRSIHTVEYYSAVKRNQL 


1871 


15772 


A 


1883 


2 


473 


IQGGIAAYRVDRQERSNGRQGNRRNLSG 
TNMRRNKQKGLRRCKPRW/RSRGMAWKV 
DKPRDAR * KSDRDRQ/RGETDG * RVSGG 
LTA* EAYRH\HRGQ\EAASQSGRSRQAG 
RQTP*QEQAGNEAEIRREEEKSRGQERE 
RE \ KDRDADRHKG 


1872 


15773 1 


A 


1884 


138 


444 


CYLT1.IKCRIIYINDKVIVLTIVWY*HK 



655 



_0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 

beginning 
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Predict- 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Mcthionine, 
N=Asparagine, P=Proline, 
Q— Glutaininc, R— Arginine, S = Serine, 
T=Tlireonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon, /^possible 
nucleotide deletion, Wpossible 
nucleotide insertion 














DRH 1 D * WN RLiQ S P * LN P S 1 WV Q L I XjLNN 
AKNTQWVKDTLLDKWYW/ENW I YQCMRM 
PL.GSYTIPMNQFKMNYGLS 


1873 


15774 


A 


1885 


27 


470 


TICCYIFLFFSFFSL,FFFFFGKGVSF*S 
PGGENTG/ANLG* WNPPPPGKGISPA*P 
PKEPGMEGAGHPPGTKS*PGKFGPGPGG 
G PS FPYRG PNKT WGKPGPKWF PGG IG YP 
ATIPQKKQPEGKRGR\PGGPMSPYNGPP 
KTPTHGKGPG 


1874 


15775 


A 


1886 


478 


1 


KNVQENSALQAAMCRKMLIVCQTQCVI I 
SGESGAGKSVAAKYIMGYISKVSGGGEK 
VQVRRGKQKDKLGRPHLHGS AAP PI CTP 
ASV*LLPTPPASAP\HVKDI ILQSNPLL 
EAFGNAKTVRKNNS SRFVSLCRPAWSS C 
SLRADLSTHTPTHTHTRV 


1875 


15776 


A 


1887 


1 


402 


HSLERPHYIG*LFKNIFSRD/RVFAMLA 
HWSRTPGLKQSTPLSLPKCWDYRCEPQH 
PAGS FFFFFFLKNGFWGCSLGGRAGGQQ 
*LKS*WRPNPLG*GNPPC*PSKEVGTTG 
AHKKIANREIRTRACGGTNFSL 


1876 


15777 


A 


1888 


511 


124 


GTRRQHFAGAHVPPEGP*S\MLDPKLK\ 
DDRPARDMW I RE PGLLLPRAPAQDAG KY 
YCHRGNLTMSFHLKITARPVLWHCLLRT 
GGWKVS AVTLAYL I FCLCSLVG I LHLQR 
GESCPQWVCPNPTPSSPG 


1877 


15778 


A 


1889 


667 


310 


QbKP * ATKSV* KDTAFG IDVGNDFLAMT 
PKAQAMKGKIDKWGFIYR*SICTAKETI 
NRVKR * PRK* EKI FAKPTWQKGQ I S /RI 
HKEFQQLN/KQKSNNLIERQTKDLNRFL 
SKECSKDli 


1878 


15779 


A 


1890 


462 


3 


KWFFPLGPPFLPPP/PPPI*NPSFQKNK 
KLTRGGCARYFPPLKSPRPRIPFPFFEK 
GEGSPNSKHSPAPPFWGPKETFFFRRTP 
PPPPPPPLCSFLKNNLFSYIPKGFLGGK 
GQNFHS FLPSFFFS I KKKLLGLGVHVRF 
YCRVNSCTGFFVQMY 


1879 


15780 


A 


1891 


1 


455 


NTCLGFGNGFLDATPKA*SMKKI INKLD 
FNETENFCSVKDTVKGMKRQATHWEK/V 
RKTHILYKDLILKIYNQLLKHHNKKTST 
I KQ * AKDLNRED I QMTNKHMKRCSGWGR 
WFTPII RALWACKVEG SLELESS LGNIV 
RPHRKREREIHCY 


1S80 


15781 


A 


1892 


1 


537 


RGGIQAPKEVSPEGRQEPARKSLI *TA* 
ETPP*SQ*/PIPEEP/TGVFMKKPVSVS 
LETGKHAVWPKVNGKELPDNPTIKWFK 
GKWLE LGS KSGAR FS FKESHNS ASNVYP 
VELHIGKWLGDRGYYRLEVKAKDTCDS 
CGFNIDVEAPRQDAYGQSLESFQRTSKR 
SLYALALEDPGM 


1881 


15782 




1893 


2 


514 


VRCQRR CH E * RACGS S LVNAKKL YEDAL 
MARKVKQSLFSLDVETDEDKFQMMSLQ\ 
CSLAYGTLTKILSEKRSAKSYGMSSVRM 
R S AGQTS KAHLHQ P RR VS QVLQV P AVNL 
LPFRKKGQTKDPALNTSLPQKVLGTTEE 
ISGKKHTEDTISVASSLHYSPPASPQGS 


1882 


15783 


A 


1894 


473 


2 


VMGESRGFSPPPTFTGGNFVFFWKQGVS 
LFCPEGFKTLCSSSPPPPPPQKAGVLGG 
SFHARPPPFFFFQI *IPFFFGIKKP\LC 
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C=Cysteine, D=Asparric Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparngine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














S PPP * PL + KKRG I FP VPKGI FCEMESCS 
VAQAGVQ* YDHGSLQPWPPGF* *SSYLN 
LV*LSS*DYKCVPPHPMY 


1883 


15784 


A 


1895 


613 


11 


FRPRSPACGLHAVSSPKLPGAQALSSPG 
LNLVFTAGSWDAG/LLRLSPRANNPRVA 
LPRVHTGPSST/DLSPSCPLGLACGFGT 
S QPACL * S P FLLLAPARPSGCAWPGLP P 
VCSLHLRGGWGGADPTGCLDAWAMALAS 
LRPCLCPALPLSREPPSPLLNDLVLPRM 
PRRAPVLPLGRTPHSLLLCHVQSPSEQL 
PSKGPERL 


1884 


15785 


A 


1896 


20 


449 


KFGYSSAAARRQQLGWEAWL*YSFPLQL 
E PS AQTWG PGTLRLPNRALLVNVKFEGS 
EVSPCVACGIQAALSMGSTSSVKLLSHP 
QAPLPQWHQMVFARCLCMCGAQLNVPP\ 
ESFTFQVS TKDVPLALMACALRK/ KAT V 
FRQPL 


1885 


15786 


A 


1897 


393 


3 


RPTAQSKGNI*VRVAS\EALSPKLLDFL 
PG KVLNGE K VDVR PATRQNLS QFEAQAR 
KRECVRVPRGG I PPRAHSRDSSDSADGR 
ATPSENLVPSSARVDKPPSVLPYFNRPP 
SALPVMGLPPPPIPPPPCI 


1886 


15787 


A 


1898 


395 


217 


RER/CKSFR/PPA/HLiQAKIKGAQ*QVN 
QAAAAQAAAPAAAMVSRDISSLLVSSQK 
SKVSNYM 


1887 


15788 


A 


1899 


1 


375 


NTVLVQ * NNKAWMTVHPF/TAWFS EYFK 
ATVEIYCS/EKIPFKILLVFARVHSHPR 
TLI E I * KE I YAVF I PANTPS I LQPMDHG 
IILSSKPYYLRKASRAQRLTPVIPALWE 
AEAAGS PEVGSSGLA 


1888 


15789 


A 


1900 


47 


326 


VKSIIQFNSKNEPGKHDKTPSPEKTQIV 
WWLGSELRS PS YSGAAGGAEMGRS LESW 
RSRPQRAETAPLHSSPGSGSEMLSL*RQ 
HLTPTAWAGVQWRGLCSLRPRPPGFK*/ 
FCPSQHHPQPPSSWDYATRCQATKQFVF 
FLEMGF\VMFARLILTVELNN 


1889 


15790 


A 


1901 


181 


837 


AGRVDRRE PGMGTG I CKE LETGSRE TRS 
ASRWGRGRWRLGQACRVPQGLPLSTFHL 
GAQAKARGGTPLACS SHLPNSHVGSLKA 
QRDEAMVQ S GLAP AVS S TCTRWT * GS E W 
* *GLHIVAARRQQGREEEPPRTTAAPET 
LCFQQTSWASSCSLEHSAQPSEVQVRAL 
SVPSHSPMWV\PQLSLPRDHRKPPGE 


1890 


15791 


A 


1902 


1 


385 


YTWGFRGKKPLIHCL* EYKXVQPLWRAA 
WRFSK*LRVEL*FNAAISPLGVYPEENK 
LFYQNSTCTCWFITALFITGKT/WNQP 


1891 


15792 


A 


1903 


207 


3 


FREMEFLHLGQAGLELSTSGDPPTLASQ 
WVG/HYRRE PLRPANTLQS * PLGLKQPS 
CLS LPS S WEYRHM Y 


1892 


15793 


A 


1904 


2 


39! 


IQPLISQRKYKTLGQNSLS/CCAHP/PP 
HFFFVLDSYFHSLLSQLKNAPFT*G* CP 
YWYSKSYSLIHSWGSSS ILPCPLTSSGF 
PSLPPSYQPLPCPSLFLL*NSLSTLCLL 
FFSLLSQPSFSNRWPSQVYL 


1893 


15794 


A 


1905 


3 


424 


YRAGCLQSLLPPPLLLFLLPCDVPFPPS 
S \ LP * VKAS CGLI RS QQNVGT I P CLQNR 
KSNKPLHKLPSLRHSLRAMQNRILPILR . 
IGKFF / 1 FFS * DG VS LCHPGWS AG VQLQ 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
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Q=Glutamine, R=Arginine, S=Serine, 
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W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














FTALHLL/ VLKRFS CLSL 


1894 


15795 


A 


1906 


2 


389 


IQRGLDKSCLITDNIPSSQSLIQKKGVN 
PSFKS MKADRG KE AAEEKS E ASRSWFMR 

IDEGGYTKQ*IFNEDY/MWKKM*FRSFL 
TREKLTPGFKASKDRLTPLV 


1895 


15796 


A 


1907 


458 


13 


AKEETQSVWDFPM/VPQGVYVNFPVSR 
NANLSTIKQLLWHRAQYEPLFHMLSGPE 
AYVFT CINQTAEQQEIjEDE QRRLCDVQP 
FbP WRM VDCEGARVNKLLS SQ I SLF I G 
KGVRELDSLSDPEVSDFSTKMCQFCEKS 
AAL*DQATS 














EKTDGLYRASQRGKDLRRLCAQSFHASW 
KDGMALCALIHRRRQ\DLIGYAKLRKDD 
P IGNLNT AFE VAE KY LD I P KMLDAED I V 
TTPKRDEKAIMTYVSWL* I VIAGAEQAE 
TAASRI CKVIiAVNREKKKLMEERV 














HSGPRREGALLLPKCLPHAKRCLiLiIjFKM 
CSDGATALCCPGWSSAAPS * LTQS P /AS 
TSQAK * P S HLGL PS C WD YRC I P PHPANC 
LDYYY 


1898 


15799 


A 


1910 


418 


3 


QD*YATANRWFICMLSQACFLPSL*PAH 
LL*L*QLMLFSFPGTPVFSYGDEIGLDA 
SALPGQHMEAPVMLWDESSFPDIPGAVS 
ALMIVKGQSEAPGSLLSLFRRXLSVQRS 
KERSLLHGDFLAFSAGPKLFSYIRPMY 














ILEAYPEVKDPAVKGASSKKEMYGH* / A 
AEQALPVASEQEQQRHERSEKKQPQVKE 
GNNTNKSEKIQLSENICDSTSSAAAGRL, 
TCX2RKIGKTYPQQFPKKLKEEHDRCTI.K 
QENEEKTNVNTMSKKNREDV 


1900 


15801 


A 


1912 


499 


141 


PGLGERDWTSKYGQGAGEGSTREWASRC 
G/IRPGGDAGQQQPRPE+SVCPRGAHSP 
GPGSWKASPAWHSAEPGGRCSGLGVQKE 
GFPGHLLQPGCRTPGPGIRKERFSGYLQ 














QHLCNLARLATS AYDGCSNS E V/ CDLDL 
LY*AALFLKLDYQIRFPNYFSTKYRIYY 
LCLY 


1902 


15803 


A 


1914 


504 


0 


PGPGQRKHSTAPMCLLNIYAQTLRFVLA 
NQ I YKC I KRI I HHDQVGFI P I MQS WFN I 
QNQ*\INIiIHHINRLN*KNHMIISFDKT 
HHLFI IKTFIKLGIEGNLLNIiIKNTF/ S 
KNPAANI ILNSEKV 


1903 


15804 


A 


1915 


46 


415 


YTSNKQLQIQILKITYNSTKKYKILINW 

TSCSW/NIVQMSMFS/KLIYRFSEPPNK 
NLSTL* I LTTS F * FAW KYTGTG I YKVTM 
KNRVGRLS LPNFQNY 














ENADCVERARKSPDSIP* \RGGQISVTM 
VSPNEQEKAGQLAIGVRAVRYNGV\LLA 
KMWRKKLHLTLSANLEKI IAIGLFFSNF 
E RKPP ENT F LKLTAMATHS E S NLS CFAQ 
EDIAICRPHPAIKMPEKAEQYKPIiTASV 


1905 


15806 


•A 


1917 


384 


3 


TRTITSGQYSTHVIRASRVPTRS* VPVF 
RSCTSNRRFSQAIEPRVMH*KVHIRAST 
VRYDSGGHVAVYPANDSALVYQLGKILG 
ANLYWMSLNNLDEESNKKHPFPCHTS/ | 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Linknovvn, *=Stop codon, /=possible 
nucleotide deletion, Y=possible 
nucleotide insertion 














RRTALT* YLD ITNPPMY 


1906 


15807 


A 


1918 


405 


105 


KAEAQRRREICSGPHSQADSAWHPLVTW 
WH*PPPVARSSQEPGVHSPVP/HVCRKI 
LLIRPKMALANEGNYRELRWFTPWSRSR 
*VGCLTTPGMCVKHLRCV 


1907 


15808 


A 


1919 


409 


3 


GGQEEGVFRVliNSKEFETERGKP* IPPV 
KDQE I I H \ PTKFNHVAHMG PGDGKQ VLM 
DXiPliSAVPPSQEESPGPAPTNLARQPPC 
RNKPYISWPSSGGSEPSVTVPLRSMSDP 
AQDFDKEPDSDSTKHSTPSNSSNP 


1908 


15809 


A 


1920 


9 


470 


APARNPLPRPCTWPTGP*CL,RPARPPVA 
SACLCGGTWNliALWLCS PGT / PVPFLTP 
PCS / S CEVQQPASHS VASNQSKEPAKSA 
AVAHECPPGGTGSADPGWPPGATCPESP 
GPAT PHTLG WE PGKSSPPTMEEE PWAP 
QGSPCWTVRQRMTMMM 


1909 


15810 


A 


1921 


556 


1 


IQLWAALGGILRRVGRDPPSHRIGKEPS 
AMAGQAGGNGDGEEGSS \GGLAAVPHI,P 
* SQHAAHPLL IGP PGQQNLGDSKV* G FP 
SPRIjEENTLENGGWGSKQIjHGSPGSQHA 
GGSWKNGETSLKG/PH*ADGAGRHTMPQ 
SPPSPPFKPHSV*HNPPAS/PPPHGSPA 
PSGTSPLPMSAVSLLPPGSL 


1910 


15811 


A 


1922 


567 


41 


GGWGET FSRLGNDLQAHR * SRFNAQAQE 
ETS R \ VLAVS L INE ALDKG S L EKT LS AL 
LLPAAGLDDVSIiPVAPRYHLLLVAAKRQ 
KAQVTGDPGAVLWLEEIRQGWRATQDA 
NTAQRMALGVAAINQAI KEGKAAQTERV 
LRNAAVALRGWPDCAKGYQRDLESAMA 
KIQRPAAV 


1911 


15812 


A 


1923 


2 


405 


IQGCGI TSS S VLHGMVFKKETEGD / VTS 
VKDAKIAEYSCPFDGMITETKGTVLIKT 
DEELMNLSKGEENLMDA*VKAIADTGAN 
WVTGGKVADMALHYANKYNMMLVKLNS 
QWDVRRLCKTVGATALPKLTPPCL 


1912 


15813 


A 


1924 


510 


37 


LLGHAFHVQSSSGRPQLAEASGHSHLKK 
GECVQQRTGNVGLSPNTARWGTPLGPSI 
SSSAPPWSFSAFPGPPGSGK*AAKDGPG 
CSLKSGLKR\RSQGGLR*TRGSGPSPMP 
PSPSPS/ERPPPGDEGLLPCTPRGGLPG 
PKINTACVCAADI S PGLEPV 


1913 


15814 


A 


1925 


74 


429 


ATIPGHELLLLFFFFFFFFFLGKGAWPL 
G*GKG\GGPIRG*GNRAPRG*REFPPPT 
PGKRGNTGGGQPGQPIFGFLKKKGAPPG 
GPGVPKTRGQIEPPPWPSKRAGVTGWTL 
GPGPKV 


1914 


15815 


A 


1926 


515 


304 


ALAASIALALNGVFTNTI K\ * I VGRPRP 
DFFYRCFP/GMG* PHSDLMCTGDKD\W 
NEGPKETSPSGHSSPV 


1915 


15816 


A 


1927 


I 


433 


NTVGSNKKSKKL YFAE I EKS YLNF I WNL 
KGPQ*VKIILKKKSSVTKCTLIHSKA*Y 
KV/LKI VWYWQRDTHM/DHWNS I KQHMV 
KQFFDGDTKTVQWAKDCLYNNWCWEK\W 
ISTSNRMKFN I YPTP *ANINCHQDPNSK 
ELYRYKN 


1916 


15817 


A 


1928 


316 


356 


GGT*PPQAATPIS*LYLPITSSL/TLPP 
APSLPLQIAPIPSDSPSPRYS/TPTPPI 
SLGLAPPTTLLIPVPSLPVSPRPLNSTA 
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nucleotide insertion 














PP/TFLTGSGAAVTWSLAVSLSPAPAS 
AMDKAQNM 


1917 


15818 


A 


1929 


490 


11 


PTRLTCPGSHVPGPVTFPPYETHSMP \ P 
PCHVHPLVQPPGSSHVPGPVTSSPSPPS 
ATHIHPLVQPPGSS QLPNTWVH PS QRMR 
TWLQQHPPL.PAPPQP/ PSPGS I CSSMTH 
LTEYLASFPAPQ/PREQAPQGGVPTSLP 
QW*APQPTAPCRATQSQQLLDG 


1918 


15819 


A 


1930 


1 


552 


RNPRKRAFPSPPIPAPPS*NQENERSRH 
PQSLL\PFVKSRKRAFPSP/LSPCSSFV 
KSRKRAFPSPPVPAPPS*NQGNERSRHP 
QSLLLLREIKEMSVPVTPSPCSSFVKSR 
K*AFPSPPVPVLLRDHTFLPLPEPRQPS 
T VPVG CFGFS R I PRR WNHTG CAFL L I CH 
S ARLLCDPRVS S CWRHS 


1919 


15820 


A 


1931 


414 


3 


RVPGESRK* ERVLDS RHKD * EGRRRGN I 
\MGKRVDYTT*TVITDVTNLS IDQVGVH 
RS I AANMTFAE IVTP FNI DRLQELVRRG 
NSQYPGAKYIIRDNGDRIDLRFHPKHSD 
LHLQTGYKVKRHMCDGDIVI FNRQCI 


1920 


15821 


A 


1932 


521 


103 


ATE PAG VRLKEGGN I TES F VTVGNV ISA 
LiADFSQDA/SRYS*KKKQVLVPYRDSVM 
TW1XKDS LGGNS KA I MMAT I S PADVN YG 
ETLSTLRYANRAKNI INKPTINEDANVK 
LIRELRAEI ARLKTLLAQGNQ I ALLDS P 


1921 


15822 


A 


1933 


1 


490 


NTGWWRKGEGNGTLMVRRLPCVPLCSS 
AGGLTEEDEGELCAAGFPLLAEDFGQAL 
QQLQTAHSQAVGAP/KGGDQVLGGGTGA 
PPTAYPNHALSIS* IPSVSWHDEGALPA 
VKIEHjGIIHLLPERHALLSLVQARSGL 
LLHGPPATGKILLNKAGTTECCLT 


1922 


15823 


A 


1934 


383 


151 


EVAFLiENL I KDD I ERGRLP LLL VANAGT 
AAVGHS \S * LGRMKEL.CVQCVNRPKMEG 
YMHVSQHPVPESHKMRAKAI F 


1923 


15824 


A 


1935 


379 


1 


YVPWSKDKEYFFNSQ*GLTTHQILPYI 
DGFRHVQKISAEADVKLNLVRIAIQILL 
* VGLQSYLGQGHQPGKS/CRGPGCEGWE 
GMVLRS*AQLSLSGTTAL*HWCPSSRKL 
IQFGIIKNLIRRLCI 


1924 


15825 


A 


1936 


376 


2 


GCLFI YMKPTAMS SS QVARSGE VS P FTA 
VPA*S*K/QGHGAVLGCHITSEACFQAC 
FPFLRPGRSTCFA*SGC*DVERSSSHSH 
GTAHSP\HGTAHTPMEQHTHSHEQHTHS 
HEQHTHSHGTAHMY 


1925 


15826 


A 


1937 


426 


1 


KHGEI IDDLLKVYRDNASKKSAI / YSKW 
ITHWKKRRDDVGEEVHSSRPATSVCEET 
IHLVCALI *ED*LIAAETIANAVGIPRC 
SAYTILT/E/KLKLSKLSTRWVPK/P/L 
LPACLQ IREKRSMAILNK WNQDHEAFLH 
IIIAGLY 


1926 


15827 


A 


1938 


2 


469 


KRRKLVSS IAAAIHPAIjSS *TRQRFIJL.L 
LFLiFHIVREDLVQLRIKKKHTIQIGNVE 
VKIjIj * PTDN 1 1 LCG KLYR FHRSTQKVLi Q 
L I /N FS I VAA YK I NMH I S L VFL YS NDE * 
LENKTKQTS S FT IALKR I KYVGSNKKS K 
KFYFAEIEKSYLDFIWN 


1927 


15828 


A 


1939 


468 


920 


IPLSTHSPGMGTSTHLSTRHHLSFWTYH 
ERIiLNPGRETDWA/ IMLFRWGLLMLPRL 
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nucleotide 
sequence 


SEQ ID 
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NO: in 
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Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /—possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














VSNS * AQVI PPTSAS QSAG I TGMS \ HAS 
GHLIFNY\CSM*WL>HljRNSQ* krctgqe 
TGPCPVTQDGVQWSNYGSL* PQTPRLKQ 
SYHVSLPSSWDYS 


1928 


15829 


A 


1940 


448 


474 


GIPGG*GSTTRNALYHVMNGEDWILTT 
CKHGKDWQKHKDSRCDRDNTEYEK\YDF 
GEMLHNATFCLVHRGRRHGSFRFLEAMQ 
AACVPVMLSNGWELPFSEVINWSQAAVI 
GDERLLLQI PSTIRS IHRDKI IALRQRS 
QFLWESCIATAIEDPNLIGWR 


1929 


15830 


A 


1941 


1 


423 


NT TL I FAG GM * NVC PG PLC ARLLHRS LT 
HATPDPPLLTLP* IPTPTPSSRCAPKAP 
PHLICPTSCPLCSKPHPWPP/CSSCQ 


1930 


15831 


A 


1942 


48 


417 


RLTMYQVLYRLHCTAliQPLP PRVK* FS C 
LSLPSTWNYMHMPQCPTNF\CIFTRDEV 
PPMLP\SWWRTP 


193) 


15832 


A 


2943 


450 


2 


VHRRSNRQNMDTSR * RGAMQS VE S V \ G V 
PY\EQ*TIVDGI\NSGVWEGIAYA*IEE 
RYPEEFALRDQEKYLYRYPGGE/AYKVE 
TIKLNVEAVNTRRDKPTNIFAKNQAPVR 
MRRNSFTPLSSSNTIRRPRNYSVGSRPI 
KPLSPLRAQECI 


1932 


15833 


A 


1944 


451 


3 


GRITRHLPRRAEDDREREREPSPL.PSRH 
PMFPPSVTPKASSDWP/PASSIPCQACH 
G / PPPVSLPRKPAHRS C\ P VFPP \ ASGG 
DTS IHSGKTVYVKRKSQPAWPVLPPGGL 
* APSHGAPS PSPPDQHRHCPE I \ R * DLIi 
PAPAPSPFSIPPLY 


1933 


15834 


A 


1945 


402 


3 


VRLLVS WKVMYP * SRS KASL/HPQITAS 
LTGSCVNCIVILILNFFYEKISAWIAKM 
EIPRTYQEYESSLTLKMFLFQFVNLYSS 
WFYVAFFKGKFVGYPGKYTYLFNEWRSE 
E CDPGGCL I E LTNQMT I I MAGDV 


1934 


15835 


A 


1947 


■ 


405 


NTG WRVFA I LC S LKGR P RG I E R V / GG KK 
KKKKKPGGPLGPAQGKTPKTQKGGGAQG 
NRGKPPLFSGGNFGNGRIFGNPLPAPGP 
GGGGG PR * KTR \ KRKNGNWDLLKGGGKL 
VFCPMVGKKLSGVPGV+QKNFKGGW 


1935 


15836 


A 


1948 


443 


1 


LTPGAANASIiLG\CCMEDL.SVNG*RQGL 
WEALLTHNMVAGCRLEEVDNAYGHYEAF 
S TLAPKAWLS VELAEPCVPEPGLP PVFA 
NF I QLiLS A/ PWVTEGGTAWLE WWHVQP 
MLALMEAELRKSQV:LNRVT*GAHYSDCI 
AAALRIKIT 


1936 


15837 


A 


1949 


396 


2 


GNGRGGSVPPNSLNEDGISCAI * / RHIN 
WLNG*TPTIYCL*ETHLICK/DH/HRXR 
VKGKKKI LHPNGNQKPAGVALLT * GQTD 
I K* KAI KSNKEGHYE I KGS VKQEN I TVG 
NIYAPSTRAPRYKKQLLDLKGVN 


1937 


15838 


A 


1950 


419 


1 


RWQ/PSARPPTPSGK*GASLPARPSSGT 
+GALLPGCPVWKVRSASSWPPSRLGSEE 
PLCPA\PSHLGSEER\PSRPPSHIGSKE 
R \ PARPP I A *D VGS ASAPP/L PSGM * ER 
PAAPPSGR*GASLPGRPSPEMWGVPLPR 
RPVWDVRAPSRAP VWEVRS VS AWP PHLR 
RE KTLRLATAP V 


1938 


15839 


A 


1951 


422 


3 


QNHVIMSE /DAKIAFGKIQYPFKMESLN 
TL * MKVNFLNL, I KI TQKNP IANTMFTGE 
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beginning 
nucleotide 
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amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MLE/AFLIjRQGKRSC/CSLSPIIjFIILIj 
EVLPS * VWQEK* / IQG*EREE*NCDFVD 
DMI VYVEN* KKPLYNLL* I I S * VARFKV 
ILDTCI 


1939 


15840 


A 


1952 


2 


409 


CAHYSKNPPQRPLRTDARLPAV/DFWSL 
GAI LFMVGCGQPPFQEANDSETL/TM I MD 
CKYTE PSHVS KEG KDL I TRTLQIDPRRR 
ASLQ* IENHPWLKGVDPSPATKNNIPLV 
*YK\NLSEEEHNSIIHRMALGDITDR 


1940 


15841 


A 


1953 


' 2 


367 


IQ CVRL * VHVCL CASS CVC I CL. YVHATL 
CVSTCL*CVAVCM/CLLCACATVSACLC 
/V/CGCVSTCVCVCPPVCTCDCVCMSAC 
VCT/CLCVCVTVSTCVCERLCVPTCVCR 
CPCICSSLNGNEWMG 


1941 


15842 


A 


1954 


374 


2 


EAPWLLRAGRGALPPCVJSKTPPSPLLFX 
PPLLPGTL.VYQPWVPLPPWNRLACAVTL 
ST*ARAGTSNPSWHLPPVSTAPQHPSTW 
QSPGAGTMGDPSLSPWTSHLLPGCCHNP 
CHCPFPHHKPTRV 


1942 


15843 


A 


1955 


1 


411 


NTPSPELHPC*PGL\PPLPSPPQQPPTW 
APPRTSTQQKLPILCLLKPSAHTDAPCT 
QPGS TL PLHT PHTQQAQGT A YQ I HTT * A 
APPPGAKPG*RCPPPPPSRQPQRMQTPA 
PGNPQPCPRLTTLTRVLVPAVPPLPI 


1943 


15844 


A 


1956 


33 


451 


RGRNTFGPLQSPPPRFK*FSCIjSLLRSW 
EE \ RDYRCMP VHPANTMIAS LVLNS \ CD 
UP ALAS QS PG I TGVS HPTRPHLS * FDMP 
LE*GHHHIjI j LVI 1 LSQPPNMPP * FQRFST 
GQLECSSFRTNLIVPSPSLNFHSDFPLT 


1944 


15845 


A 


1957 


3 


399 


YMQVRTTMSDHSSHSISKINTDNTKSCK 
G*GSTESLFHDRWEYKFVQLLWETVWHY 
VR*TFTILYNPEI ILKR/ IFRHTYKNVC 


1945 


15846 


A 


1958 


47 


399 


AANPTLPAVFFFFFLETEPPFGPPGGRA 
GPQSRLTEPPPSGVKPIPR/PPPPGEPE 
KWPD\QGGGGGGDPKPPGGPPP*ATGQN 
S AGKKKKKKI PRPGAHGADPLLPGKAGG 
EEWLDPA 


1946 


15847 


A 


1959 


407 


238 


TQAFALI/KDGGVIGGICFRMFRTQGIR 
EIVF*AVPSNEQVKVSGSPRRQPQCTAH 
S 


1947 


15848 


A 


1960 


106 


1854 


NEAVKSKPNQTRNHKEKNNYVHLPDAGP 
SQPASAGGSSSACRRSTKATLHKRWVSS 
PAGPGVQPLSWQHPPMMA*GLQGNPSPQ 
AAAPPARPGLVSGNCLT*EMAQAGA\GT 
GGSLSASLGKRCPHIPVPCE\PVL*GLC 
VPGRSSLGELGGNPVTVQSFPGPGAESD 
PW*GDAECCSLLQAS FGDRAGWSIR/RG 
SVGRPAGVPRGKGRKPTLSGA/ SGPGS V 
LGGFCCPEPLSREAESGMVQE* *GRFWT 
GQERTPTGR*GCKVAGYLKSSATVGHSG 
AGAGGSPGKTSATLDVGQGLGGT/PSGP 
AWDRTNGYMEKAALPLiCNGKVTGNTQCG 
AQP PASG PS PWVWQ PLLLRQRLRQSTGL 
♦LLFPGVGGGLQPGEGXGQPFPLSPWGS 
LTGRKNSNKARPGGNEGTEG * GARNG AS 
LTWVWTVPNGG YC P QAGRD F WLVDS R FK 
PSLGLRAMV*NSRKAPLSFEDGRMGTV* | 
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NO: of 
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SEQ JD 
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09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=ALinine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histirtine, I=Isolei<cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PQGQEGRASHRKGDSGLSPHSGKG*DP* 
G/PGGKERSISPPLPRGL*GP/VGERGR 
NGDWPW 


1948 


15849 


A 


1961 


76 


427 


QRQEPTPGHTTSGTRHPPGPGGRGPIQT 
HSHSHLFPWLETSAGSSPPNFLCSLPPY 
TPLLRLLPSPFLCPSQTTQRMPIPAPHP 
SPASPCKVP*DARGAPAGGAPSTL/RGH 
HPPHP 


1949 


15850 


A 


1962 


375 


131 


FIiVGR I RLLYCLRELFVYLLREQKAKRN 
SQWVHTLPISSRHVDAVPCSPS XNRNRM 
GRDKKRPFPL\CVDA*LHSPSWIQP 


1950 


15851 


A 


1963 


89 


538 


GLILLQKWHPGAVSSMVERG* LLHSGLF 
FSFPDKVSSWQAGVQWHDL.SSL.QAPAS 
GSKRFSSLGDHRHAPPHLAHFCI F/M* T 
QDFAMLARLGTS F I YLFGCCC\ FETGSH 
SVAQAEVQWHNHSSLQPHSPRLRRSFHL 
SIPGSWDHRNAP 


1951 


15852 


A 


1964 


402 


42 


CCHYPGTPGLQGRLSASHQAS/SGPQLP 
APLPLQVLPFLALG I GVDDVFLLAHAFT 
EALPGTPLQVGPCP\QGSSEAAQLTG*E 
PLGSSDLGLLMNLGASCPHL*TGEIIVIi 
CPKGYCLDQ 


1952 


15853 


A 


1965 


8 


14 


SSRAAACPPRPALSLPRPPVFTRRMGPQ 
GSGLPKLPVSAPSSRMGSFPAPSPLLLA 
SGDRHLCPDAAALPLLAIETGCDS\PPG 
H*PPPSPGGLHPCSPPS*PLIL 


1953 


15854 


A 


1966 


521 


2 


YELYG I I KRHRRE VSNNQQ VR YKEHS I P 
PDYVSSVPTDPTWGPERREEESSGHFMV 
DHTGTAAGGGGGMIIASPKLGATPLPPE 
♦APA/PPPPPPPPPPPGVGSGHLNIPLI 
LEELRV1.QQRQIHQMQMTEQICRQVLLL 
GSLGQT VGAPAS PSE I HGTGTAS STKPL 
LPLTV 


1954 


15855 


A 


1968 


111 


395 


YliMRGFLLHHNMVKGKEDERKTKGARLI 
LFFFFFFKKESPSPPRVEGRDP\NLGT* 
NPLPPKVKKLSRPTPPEKWEPGAVPHCP 
NNFLFFR KNGGS 


1955 


15856 


A 


1969 


413 


101 


D VNRHFS KKNKHCQ * L VI KEMQ I KTKRR 
YHFS PTRMGKIKN\ KKENKFWQGYGETG 
I FTHC WWECKMVQ P I KKTVWQ F FL.KVK * 
LIiVFGPACKELGSYYSLVTKS 


1956 


15857 


A 


1970 


411 


1 


NLTPALCTKVHFKWVMDIKMKGKTIRLL 
ENNTRKYliHNIKIWKGLNRTE\KP*TIK 
KKTDKFDYDTIKNLSSSKNKTRQTIDWE 
NIL\ YLQYITDK* I ISGIHKAFLKV/ YK 
/ KKTNNP VGKWARDLNGHLI KND I HMAC 
I 


1957 


15858 


A 


1971 


3 


828 


GQACHFI FRSAQAGGSRLRI WRAKQLGL 
WRVCALIMPALEHSFPTRLLQGNSVPGP 
S I SLDTS CL/ CNRCVWEGGRG * TGPGL P 
SLGKVLVEGIPSESPGPTASHPCCSPRP 
DPDQ/LS C I S APS ATPTVH* SHliTLiPMG 
WPGPQ*GLHLSDQPGPRKP*PGCSGLGG 
GDAAPRGMEKP\PPPQLP 


1958 


15859 


A 


1972 


398 


3 


GPGCYFSVRLQCP/RKI PAWKRAVCSAT 
LI \SLQGPSLSAPHVLGLAALAVHLGES 
RSALPEVDVGPPAPGAGLPVPALFDSLL 
TCRTRDSLFFCLK*ALLPQSRLLSSLSK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 
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od 


SEQ ID 
NO: in 
USSN : 
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Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylhlanine, 
G=Glycine, U=Htstidine, J=Isoleucine, 

N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \ = possible 
nucleotide insertion 














SDTCSFDKIRRDRHADIPPIjY 


1959 


15860 


A 


1973 


2 


433 


QDPTKMTKEELNALKSTGDGTLG/RASE 
VEVKNE I VANVGKRE I IjHNTE KEQHTE D 
TVKDCVDI EVFTAGENTEDQKSSEDTAP 
FLGTIAGATYEEQVQSQILESASLPENT 
aqvesnevmgapddrtrtplepsncwsd 
LiNGGS 


1960 


15861 


A 


1974 


373 


580 


TIFSRQVLRIQNALSDKPNVSTVYSNNG 
SELHGTSEASV/YHWKILISTEYHKTL 


1961 


15862 


A 


1975 


47 


413 


KWKHLRGS EHWRPQENQVHQR I AELRKA 
GLWSQRR/PAEAAGGPRPKSHWDYLLEE 
MQWMATDFAQERWKVASVKKMVRAVARQ 
LQDRTRREAGARREEPSRLRQTS PVLPE 
KSSVPGLVLRR 


1962 


15863 


A 


1976 


444 


3 


GYERS RGTTSGTHS S RDYDG I FGQATVK 
WAMLDQFRMLSPCSKEVMHQPFYLKRVE 
IMAQCEEWIADIQQYSSDKRVGRIMSRH 
SAAI KRRTGQLREELLKLPCPEGLDPDS 
GDAPEVCRAATGAEETIMHDQV\QPSSS 
KVLPSDFQ 


1963 


15864 




1977 


281 


567 


PRSSLQGQELSRPWGRKGNFPGPPVPPP 
QRMFYDSEL.FSGISDPSCCF/RSAPTAD 
QUYGDQDMHEVVRKHCMDYLVRKWRPLG 
KGQGGKISSPHRP 


1964 


15865 


A 


1978 


357 


2 


KQILGPPHPQAQPGRWPPPHGPKDAPL 
WSSRAAPPGRGLGRAGPAAGVEAGATLR 
DSSPSTWTREGLHVQAQRKRPSHVHKG/ 
SGPGCLEDGEFPTS L.RLQAQLAE I GRGN 
GLSVRRQ 


1965 


1 5866 . 


. A 


1979. 


29 


434 


VQAEAEGLPGDTEHPQPQLMSRSLEGQS 
DVTIKHVACGDFFTACLTDRGI IMTFGS 
GSNGCLGHGSLTDISQPTIVEALLGYEK 
AQVACGASNVIAXiATERELFAIiGRGDSG 
RTGARTKESH/YLPQQVPMP 


1966 . 


15867 


A 


1980 


3 


2082 


SS EG YLRGNMSENEEEEISQQEGSGDYE 
VEEIPFGLEPQSPGFEPQSPEFEPQSPR 
FEPESPGFESRSPGLVPPSPEFAPRSPE 
SDSQSPEFESQS PRYEPQSPGYE PRS PG 
YEPRSPGYESESSRYESQNTELKTQSPE 
FEAQSSKFQEGAEMLLNPEEKSPLNISV 
GVHPLDSFTQGFGEQPTGDLPIGPPFEM 
PTGALLSTPQFEMLONPLGLTGALRGPG 
RRGGRARGGQGPRPNI CGICGKSFGRGS 
TL I QHQR IHTGE KP YK CE VCS KAFS QS S 
DLI KHQRTHTGERP YKC P RCG KAFADSS 
YLLRHQRTHSGQKPYKCPHCGKAFGDSS 
YLLRHQRTHSHERPYSCTECGKCYSQNS 
S LRSHQRVHTGQRPFS CG I CGKS FSQRS 
ALI PHARS HARE KP FKCPECGKR FGQS S 
VIAIHARTHLPGRTYSCPDCGKTFNRSS 
TLIQHQRSHTGERPYRCAVCGKGFCRSS 
TLLQHHRVH\SGERPYKCDDCGKAFS\R 
ASDLIRHQRTH 


1967 


15868 


A 


1981 


2 


188 


LPETNFAELFLPYISQHNLIRKYKK/WP 
GAVAYACNPSTLGGQGGWITRSGDQDHP 
GLHEEWP 


1968 


15869 


A 


1982 V 


3 


424 


EGQAIVERMNL.CLKQQLQKQKGENRYYR 
TPHKQLN\ALLTLNFLSLPKGRILSAAE 
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SEQ ID 
NO: of 
nucleotide 

sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

location 
correspon 
ding to 
Inst amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QHLQKPAAKTEAEQLVWWRDIilTESWEI 
GKI ITWGRGYAYVS PGL/NP S RHLKPYH 
ERMLRKRFREDPEDPPSCSHVKTDAEED 
PN 


1969 


15870 


A 


1983 


3 


399 


YSKLS FKGTLTKFRR I YSSS FYKEFQGC 
ISDLCHP /R/TLTNCLRGELL.KENLVWG 
AFGCHPHFPCYYNKSQERNLLQALRHPT 
TVAFGETGFNYSYKCTMPVPEHHKVFKR 
QLELAASLEK/ PLVI LCQQADE 


1970 


15871 


A 


1984 


1 


405 


RRHIGGGVRLYYIGGEVFAKSLSDSAIF 
AQTPNCNQRYGWHPGTVCKIPPGCNLKI 
FNNQEYADLLDQSVNQGLEADY /QLTRM 
CTILMSLLKGWGAEYRRQTATRTPCWIE 
LHLNGPLQRVDKVLTQMGYPS I LM 


1971 


15872 


A 


1985 


27 




QGREHAQGGQSPGAGHLGPTPEPQPEPQ 
PRPS S QAVPAGRWE PAQE / PTRHPHPRL 
SPASRPLGSPPAPLVRSSPGRCRLHEHT 
rWSSTVGTSVEPAPSLGRPQAPLEPGTV 
TSSVRLQQPHMHTPGKIMPDPSKRNGKF 
TFT 


1972 


15873 


A 


1986 


414 


220 


GAEQEELiLSP/GSGGCSELRSCHCTPAW 
ATRAKLRLTKQNKTKRESYRQGNQNWGR 
ILGELLGRL 


1973 


15874 


A 


1987 


52 


412 


TRERKLFFACDNVWKHLKRYLRKNSFGE 
NLHSSRNIICKKKKKKKKKKKKKTKNFST 
PKNFFFFKACQCMGKKNTKHDFHEQNFI 
EHALLQQLNINFSTNYIEGRVF\HPGAP 
IESLLLMCH 


1974 


1S875 


A 


1988 




143 


EKRRGLDKRTP/AQAAFEKMQEKRQMER 
ILKKASKIPPPFVCMWSVDS 


1975 


15876 


A 


1989 


3 


163 


TEFQPSEK/WGEDLGDNTWEYIFAIDLL 
CCHQKWICHPLFLVGWRAGAEVSGVF 


1976 


15877 


A 


1990 


1 


439 


DKTAAEDAIRNLHHYKLHGVNINEEAST 
NKSKTSTKLHVGNISPTCTNKELRAKFE 
EYGPG\IECDILKDYAFVHMERAEDAEE 
A I KGLDNTE FQGKRMHEQLSTSRLRTAP 
GMGDQNGCYTGGKDGHWSKECSIDRSGR 
VADLTEQ 


1977 


15878 


A 


1991 


1 


145 


WAASKAMK/MGDWKTCHSFI INEKMNG 
KVWDLFPEADPVLLKRLRESR 


1978 


15879 


A 


1992 


2 


425 


N I STLKKTLiESD CT \ KLFSQG I GGE QAQ 
AKVDRCLS DLAAETNKFRDLLQEGLTEL 
NSTAIKPQVQPWINSFFSVSHNIVEEEF 
NDYEANDPWVQQLILNLEQQMAEFKASL 
SPVIYDSLTGLMTSLDAVELEKWLKST 
FN 


1979 


15880 


A 


1993 


3 


449 


VAGPAPGAGARPGLDLQFLQRFLQILKV 
LFPSWS S QNALMFLTLLCLTLL \ LKSFD 
QFTCNLLYVSWRKDLTEHLHRLYFRGRA 
YYTLNVLRDDIDNPDQRISQDLERFCRQ 
LSSMASKIiI ISPFTLVYYTYQCFLSTGL 
LGPVSIFGY 


1980 


15881 


A 


1995 


1 


410 


SSRRPFIALKTKSMRDLNPEDIDQLITI " 
SGMVI RTYQL I PEMQE AFFQCQVCAHTT 
RVEMDRGRIAEPRVCGRCHTTHSMVLIH 
NRYLFSDKQMIQLPESPEDMPAGQTPHT 
VI LVAHNDLDDR / VQPVDRVNDSGFF 


1981 


15882 


A . 


1996 | 1 


154 


LFFFRLLVRYTKKVAQVSTPTLGKVSRN 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 
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Predict- 
ed end 
nucle- 
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locfl tion 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, JL=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LGKVGKKSWKHPEAK\KRPCGKN 


1982 


15883 


A 


1997 


2 


475 


KKLRRQTRREAQKE\LPDKV\RWGLVTP 
SETPVGISNLSPFLGPRCL/QNPAELKP 
PWRFPWAQRPKAH\ ERANAGPKTPAEQR 
KVKKI KKLKED I S QGVH I S VYR VRNLSN 
PAKKFKIEANAGQLYLTGVVV.LHKDVNV 
VWEGGPNAPKTITRLMPLRI 


1983 


15884 


A 


1998 


3 


431 


QLRTRDRGWPSRRPEREKRTSQSARRPT 
CTES R WKS EEEVES DDE YLALPARLTQ V 
SSLVSYLGSISTLVTLPTGDIKGQ/SPL 
E VSDSDGPAS FPS S SSQSQLPPGAALQG 
SGDPEGQNPCFLRSFVRAHDSAGESSLG 
SSQA 










1 


400 


ALDLRGLQILVGFPKRRVTTCSYPTALQ 
SPIEYQRKERSTAVMRTEPDSAYQASPR 
PYSAGPADSKKPTKGYCYNPTI.P\RLEI 
MTLEGTTG 


1985 


15886 


A 


2000 




372 


QNIDLVISFFSS RL.LQAG AELS VERVLE 
1 1 KQGWAL PKDRL/RGSCA P T LS AAGR 
SSGGQSPCMPGLCVCSFKVLTVSWLVCQ 
KFPELKFKYVEEEQPEEFFIPYVWSLVY 
NSAVGLY WNPQDI 


1986 


15887 


A 


2001 


393 


1 


GG I GRGGGAGGG VGAAGSASGGVGRRGA 
GGVI ADSGAPGGG VEGGVGASGG WRE /G 
RGTSGGVGGSGGACGSV/GGSGGAGGGV 
GACGSTSDGVGRS RGT IGGLGGS GSAGG 
GVGACGGASGYVGIRGAGGG 


1987 


15888 


A 


2002 


2 


362 


WVTFISL.LFLFSSAYSPGVFRRDAHNSE 
VAHRLKDtiGEENDKALLL I AFAQYLQQC 
P FEDHVKLLNEVTE FAKTCVADESAED C 
DK\SL\HTLFGDKLCTVATLRETYGEMA 
DCCAQQEPE 


1988 


15889 


A 


2003 


2 


358 


E ANRG W F I R L KEGS Q LYN I KVQGE AAS A 
D VE AAAS YP E DLAK I TDEGG CAKQQ I FN 
VDK\QTAFSWMKRPCRTLIAREEKSVPG 

CHSENSRTF 


1989 


15890 


A 


2004 


190 


1 


DQTCFLSFTVKAVTFNGWVWWLTPVIPA 
LWDYRH\RPANFFVFLVETGFHHVAQAG 
LiKDLGS 


1990 


15891 


A 


2005 


1 


132 


GMCHHAQLIFVF/ CSRDRVL>HGCSQTPC 
L.KQ FS CLGLPKCWD YR 












134 


PMTFFTELEKTTLKFIWNQRRIQIAKAI 
LTNQK\NKARGI TNI C 


1992 


15893 


A 


2007 


315 


127 


SEIAFFFCSFLKIILDTR/FSFFARAGL 
KLLASNDLPSSSSQGAGITGVSYGTQPV 
CFEYNVG 










3 


325 


RCSMLAVREM/QKATKRCHFIiPTRIALI 
K/ND/GSNKCWK\HCHWECKMVQLLWKI 
VWQFLKKIjNI EXjP FD PE I PLRD I YPKQL 
KTYVHTKTCGQMFI AVLF I LNVPHLMNV 


1994 


15895 


A 


2009 


369 


3 


VGOAGLEFLTSGDPPASASQSAVITGMS 
HHTQPIFCIFGFA\GCPDWSQTPELKQS 
AHLSLPSSWDYRCMPPHLANFYFCRH/R 
VCCPGWSQTPGLK\YPPALVSQSVATTG 
TSPRAWQDTILPV 


1995 


15896 


A 


2010 


114 


287 


APLCLCLRHLRLL I RRLTVLGTEAHTL \ 
NPITTRGRGGQITWGREFETSLANMVKP 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylnian ine, 
G=Glycine, H-Histidine, I=Isoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \= possible 
nucleotide insertion 














CF 


1996 


15897 


A 


2011 


218 


346 


IMKFNLIiKFYFIYLFSRQSL/SSVAQAG 
VHWRNLGSLQPLPPKF 


1997 


15898 


A 


2012 


310 


1 


SSSSSNTHFGIPKYLINPDTCFLA/KVN 
NSSLTGL.EYTETLKPG I K 


1998 


15899 


A 


2013 


3 


301 


SDCCASNQRDSGGVGPSEPARKHTLCV / 
CNSLDLIE ' 


1999 


15900 


A 


2014 


332 


3 


LjRDPLiEEAVCPFSDLQLHAGRTTAIjFKA 
VRQGHLS LQRLLLS F\VCLCPAPRGGAY 
RGRQAG\SLSCGGLHPVRASRLLCLPKQ 
AWAMAGAPPPAWPR PCSL I SD CCASNQR 


2000 


15901 


A 


2015 


2 


325 


DLLHHAPPVNLFLRDRVSLCGPLCGPGC 
SQTPGLKQSSCLSLPKC/WDYR\RATAP 
Gli 


2001 


15902 


A 


2016 


373 


3 


WAHGliQPGKPKVDSLSKKRKKKKGKKLN 
TFPLRSRTRQACLLSPLLSNIVLEVLPN 
E IRQ / QKKEI KGTHTGREELTLS LFTDN 
MI I YVDI PKQSTKKNQGSYS VARPGAVA 
HACNP STLGGLGG 


2002 


15903 


A 


2017 


343 


1 


EFPFVSGSRATGKSSDIRATKYI WRVXiE 
YLRAWPRGQRRLKSS\HTSLLGSYHPGA 
FRGDKWSCCHQKDETGGRGEDEVXiLCCP 
GVLGCSALCRSGVRTKFGIRGRPWKERE 
R 


2003 


15904 


A 


2018 


139 


2 


NSISTKNTKNWSGMVAHAFNPSTLRGRG 
GQI \RGQDFKTSIiANMVKP 


2004 


15905 


A 


2019 


3 


322 


ARELVFFFGAYRKGFFHRDMKPKNLLCM 
GPKLGKIADFGLARELRSHPPYTDYGST 
KWR/YRGNPALLRPTPMAKFPYEGWLHI 
SGSSIWNIKTSDSNFIFFKESKG 


2005 


15906 


A 


2020 


205 


377 


NIVENIVFCWPGVCFLQTCTVCINPETS 
DE/ WPGAVAHACNPSTLGGQDGQ I TRS G 
DRE 


2006 


15907 


A 


2021 


3 


324 


KKWGKR\LNRAPNGRRYPETRWALEEDQ 
CHICKELHIKTVRFHCTPIRMAKIHTTD 
NPQSWPRCGTGTLIHCRRGCKTV/R/PL 
WKTVRQ/FL/RKLNIPLPCDPAVLSLCI 
Y 


2007 


15908 


A 


2022 


2 


382 


RVSQDGLNLLTSRS TRLGLPK/ CWDYRC 
EPPSP 


2008 


15909 


A 


2023 


1 


421 


RWNPGGRGCSELRLHYCTPTWVTERDS I 
SKIAKNK/NNKRPRNNCR 


2009 


15910 


A 


2024 


339 


3 


SWDHRRVLIFVFLVQTGFCHVGHAGLEL 
LTSGSQSAGIAGVSHRA\GQKHQFRPEH 
RFLKFGFVFRDRVSLCCPGWPQTPRLKQ 
S S RLS LP KS WG PATALDPDS S Y LE E MVL 
S 


2010 


15911 


A 


2025 


2 


146 


NTFGRSRQEDHLGPGGQACSELRSHHCT 
PAW\VIEQDPVSKKKKPPKP 


2011 


15912 


A 


2026 


3 


380 


RliECSGGISIHCNLH/LPGFKRFSCLSL 
PSRWDYTRLP/PFVFLVETGFHHLGQAG 
LELLTSGDPKCWDYGC\DHCTWP 


2012 


15913 


A 


2027 


32 


296 


DYMNSLMYFHSLVLASTDEGFLPKTVST 
QSAGITGISHCARPWIPF/ CFFFFKNRK 
TRF VAQAEGQGGNFGS LNPL P PG FRGFP 
CLSLT 


2013 


15914 


A 


2028 . 


2 


187 


FTLLPRLECSG\MILVHCSLNLPGLRWS 
S CLSLLSS WNYTCVAPFS IFYYFI FLTW 
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E=Glutamic Acid, F=Phenylalanine, 
G~Glycine, H^Histidine, I=Isoleucine, 
K^Lysine, L=Leucine, M=IVlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GGTLNC 














AAGIiSQAGMQWGDFGSREHLPSRFQRFF 
CLKVPNNWDYRHGPPRPVGIFFFSF\LV 
ETGFHHPGQ 


2015 


15916 


A 


2030 


88 


362 


KFGINLKKKEKKGPQKNQNFFPTFLFFF 
FFETESHS ITQAGVQWRDLS PLQPLPPG 
FKSIiPSSWDYRCLPPCPANFCIF\IPDS 
VSLCWQGW 


2016 


15917 


A 


2031 


7 


131 


GTIMAWAPGEGGCSEPRSCHCTPAWVT\ 


2017 


15918 


A 


2032 


1 


343 


LECSEPRLCHCTPAWV/TGDSVSKKKKK 
KKKK 


2018 


15919 


A 


2033 


243 


14 


QKLAPWPPQSAGNNRRVPPHPASMAFLT 
KIEKTALKFIQNHKRPQIAK\TILSKNR 
VGSITLFNSKIHYMTTVTKTI 


2019 


15920 


A 


2034 


238 


3 


FLALPKVLGIIGCVPPHPASMAFLTTIE 
DTALKFIQNHKRPQIA\KPII#S KNRAES 
I TLFNSKIH YMPTVI KPLRVTST 














IjVKIQKEMNVFMTANP ISIPQFMDQGVI 
SSCKSYYFRNKFLYV\ IAAMDTD/SSNG 
SWQSEWKTFWKGFIILDGIKNTDDSWEE 
VKISTLTGIKKKLIPTSTDDFKEFKTLV 
EDVT 














GRCC \HELRSRHCTPAWATKAKLKKKKK 
KKREKKTQKGKNGLGFWAF 














RSSNEGGRDCVFCFGDRVSLiCLPGWSAV 
AQLWLTATS TSQ \ VKS LS HLGL 


2023 


15924 


A 


2038 


3 


193 


NGLNAP I KRLRLANW I KS QDP S VCCIQE 
AHLTCRDTH / RC YLKG / W YKAF 


2024 


15925 


A 


2039 


116 


337 


SKLiKILPRLCWGWWQAPVI PATQEVEA 
EDHLNPG / RSRPAWATQRTP VS I KKKI I 
LRPANGK/CHGPSW 














VFFFFALFYLFFFFFFFFFFFFFFFFFF 
QFLGFYFRFKF 


2026 


15927 


A 


2041 


204 


399 


VSSHKINGLTVCSTSPFFLSLLPPSEES 
ACFPFAFCHDCKFPEAS/SVMLPVKPVE 
L 














WHFSPPQPliPPP / PLPNPPPPPTPPPP\ 
PPPPPPPPPPSPPPPPPPPSPPPPPPPL 
PPPPPPLSPPPPPSTPP/PPPPPPPPPP 
PLPP 


2028 


15929 


A 


2043 


3 


344 


LYKWNNKASLAAHLFAAWFTVYFKPTVE 
TYC /SG KKKI PFKI LLL I DNTPCYPRAL 
LEMCEEIS I VFAPATTTSS /LKPMDQGV 
I VTFKSNYLRNT FQAGGG E KKE KH ERKK 


2029 


15930 


A 


2044 


2 


349 


PR VRKS PGPNGFTANF YQ TFKELISILL 
KLFQKKKK I KKGENPPNS FYGAI IPRIP 
NPNMDLSKK/ETYGPVSGRNMEAKI FTK 
FLiAGHFKQ S FGRE I HHDQRE F I PG I QGG 
FNIGN 


2030 


15931 


A 


2045 


280 


462 


CXPFLWVLVWCWLLFXVWLWLCFFG 
F WCCWFCX CGVW F FVFF WCVLCGVL 
LGCWC 


2031 


15932 


A 


2046 


3 


284 


PSPSFSLLLPPSFSLLFPPSF/SPPPPS 
FSIjLlLPPSASIjLLPPCTSLiIjIHPPTSLQ 
LPPLPSFYLLLPTSISHHLPTHNLPTTS 
IQDPSTPCSIK 
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NO: of 
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beginning 

nucleotide 
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Predict- 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2032 


.15933 


A 


2047 


95 


1 


GRVDAILAHCKLR/LPGFTLFSCLSLPS 
TWDC 


2033 


15934 


A 


2048 


211 


1 


EPTTMTGAKWGSSQTTTNYHMLQSLRIN 
WVDFFFFFCTDR/ SLSMLPRLVLNS W I 
PAI ILPHPPQVLGLQ 


2034 


15935 


A 


2049 


35 


266 


EVRRPSTWQPPRLRSEEPLRIAQPPRLG 
NIGFCHVGHAGLKLLTSSDLPTLASQSA 
GIT\GHSYRRASEEDKKESSMSS 


2035 


15936 


A 


2050 


281 


3 


QSNFKRALLS I LIjKAI YRSNI I PI K/ VP 
MTFFTE I/EKVI PKFIRNLKRLRIAKVI 
LS KKNKTRRI TLSD FKL Y YY \ IAVFVAA 
WYWKKNRLSDQWN 






A 


2051 


251 


12 


ILRFFL.PKFLGFQKPPSLEKTHFFLPAF 
GNSLFWPPEKLGQKKVFFFFFFFFLRQG 
L / NS VS QAG VQWHNLGIiLQS PLiPRLR 




15938 


A 


2052 


2 


325 


ADHLRPGVQDQPQPGQNGKTP SPLKIEK 
\LAGCGGGHPRLREENCLNPGGRGCSEP 
RSRRCTPAW/VNDSKTLARKKKKKKGVE 
KNECGRKVMRV/LQGPKAKVKPWGENL 


2038 


15939 


A 


2053 


3 


166 


SLLLPRVECNGAXSAHCNIjHL\HNLHLP 
GSSDSPASASQVAEVRGSLEPRSSSIA 




15940 


A 


2054 


227 


3 


LNENIGRITGMSHHARLILliFCE/YRVS 
ITQAGV/QHDPGSLQPLPRVFKQFSHFS 
LQS SWDHRCAP IRtiAI FWVFC 


2040 


15941 


A 


2055 


301 


0 


QRKSHMFLTINQKLEMIKLSEEGMSRVE 
TGQ KLD LMC / Q VS QAVNAKE KF LEK I KG 
DTSVHTQMIRKQSSINVDMEKVGIVWIE 
DQAP/HNHIPLSHLUMRAR 


2041 


15942 


A 


2056 


1 


109 


RPLRRLRQENRLNRGSRGYSEPKLC/HL 
CTPAWAT 


2042 


15943 


A 


2057 


313 


250 


PQPPPPSPPPQPSPPPPSPPPPSPSSP/ 
PPP 


2043 


15944 


A 


2058 


134 


2 


EKESRSVA\RLKGSGAISA\HCNL\CLP 
GSSNSPPSASRVAGSSGA 




15945 


A 


2059 


120 


326 


NSLVADME/KVLWWIEDQTSHNIPLSQ 
S/LI QS KALTLFNS IKAKRS EEAAEKI F 
EASP.DWFMRFKIKQK 


2045 


15946 


A 


2060 


102 


413 


ERTGFRHVGKSGLEFMTSGDPPTLASQS 
VGIT/VHEPRTRPG 


2046 


15947 


A ' 


2061 


311 


1 


FFKKQRFCKSGCKTPGKPRGPKKLGANQ 
FSGPTPIjKNGVGFSPGPKGGFFPPPPGG 
FPAGRPKWLTRILGKGGSLLRG/RYPKK 
GFLKP I LGDNS PQRAPKRGP 


2047 


15948 


A 


2062 


229 


380 


WYDLGSLQPMPLRFKQASRVNIjPRSWDY 
RHPPLSRLN\IVFIVDTGILHVG 


2048 


15949 


A 


2063 


142 


383 


PQSCFSTHWQLLQKQEETAGAVSVCVCT 
S\VCVCVCVCVCVCAGAMCVCAGA/CFC 
VCVCAGA/ CLC VCVGA/CLCVC 


2049 


15950 


A 


2064 


408 


200 


NLIQI KALTHFS S I KAERGDKSTE /EKF 
EGSRGWFVRFKERGHLCN I KVKHEAANA 
YAEAACSQLSRRSS 


2050 


15951 


A 


2065 


348 


3 


WVSPYSPCVCVCVCVCVCVCVCRDWVSP 
CCPGVCMCVCVCVCRDW/CFTM/CAQVC 
VCVCRDWVSPCCPGVCVCKDWVS PCCTS 
WS ILKLLS SGNP PTTVSQSAW I TGRSHS 
AW PARA 


2051 


15952 


A 


2066 


73 


286 


NLIRGLLESHILISMRYGYCKSYTLSMR 
D I PEALNKWKS / I PMFCSWS RRLIVSMA 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucinc, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro!ine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VLSKL INRFNAKDVQH 


2052 


15953 


A 


2067 


329 


0 


PSSTFLLPFPRPPPSPPPPPPPPPPSPP 
/PSSSSPAPSSPHP 


2053 


15954 


A 


2068 


221 


3 


ELKVILHCLRDFSLQSSIMKVLILKILI 
LSVCCVC/VCVCVCAVCSCVC/VCSCVC 
/VCSCVCLCVLCVCVCVLSC 




15955 




2069 


I 


167 


GTRENPLNPGGGGCSEPR\SCHCTPAWA 
TKSETLSQKKKKKKKISNYKTPFKSYRI 


2055 


15956 


A 


2070 


2 


354 


ARACLGLPSSWDF/ SVENRFHRVGQAGL 
ELATSGDPPTSASPECWDCRH\DHHTWP 
LL 


2056 


15957 


A 


2071 


20 


341 


CIVTVNTRGENINICWSPDGQTIAVGNK 
DDWTFIDAKTHRF/LKQNSSSS/SEVN 
EISWNNDNNMLYPDTGNGCINILSYPIT 
ESRAIYQRPI FHVHVNQDLPHGDVLS 


2057 


15958 


A 


2072 


353 


58 


LQLLTTSD P PAS AS RGAG I ADGVW FTQ / 
SLNGAQAGVQWRDLGSLQPHPPSRL/LL 
ASQSAE I AASARPPPRLGS EERLCLAAH 
RLGCEEPLCLAAQSGK 


2058 


15959 


A 


2073 


1 


338 


GSRLQRVCINYRLSFFPFLSQGWINFTW 
LFCLCVCFLRDRVSLCCPGWPSTSGFKR 
SSCPSLLSRWDYRHMPQHLASHTLFKKL 
/TILPR 


2059 


15960 


A 


2074 


1 


322 


GGGREAGEARGGGEGGQGSGRRRRGRGG 
PRTGGAEGGRGAGETPGGGARPEREQGR 
GRHSERQG PT/RQTKRP KTKTKQLSQNK 
NT 


2060 . 


15961 


A 


2075 


450 


225 


TPVGRGCSE LRS CNCTPAW VTD \ E TLSQ 
KEKRGGVKIGWKKRRTRI I SLHLPGSHE 
KFNNLETVKNCVNHFCRLNT 


2061 


15962 


A 


2076 


2 


470 


TPQNKPHPTTKNTQPQPTQKK/PTQTQP 
/TTPKPKTKKNTTPNPAPPNTQKNHTTK 
PTTQNPHKTKQKNKTKPNTTQQQNKPTN 
QNPKTQTQQKTTTQTKPPKKKKKKTADT 
TS PNP I S TKK I KKIAE P 


2062 


15963 


A 


2077 


304 


1 


NSATPPCSPTA/KPHPTPPTPPPTSFIP 
TCQHSPPTKICPQARPRTPPYAHPPRCP 
RKI I PKGRRHPLiAPPQAARRDLNYYHPI 
IWRGRVCVCVCVCVCARA 


2063 


15964 


A 


2078 


167 


1 


T I LQTNS TWSNVLLWQGAVAQAC \NSST 
LGGQGGRITRSGDRDHPGQHGETLSRA 


2064 


15965 


A 


2079 


3 


364 


HETRSRHSCAWCSAALWRAAVASRCPSS 
/IPVTPPQCLYWP/WKVPLQCPPDL 


2065 


15966 


A 


2080 


303 


2 


SKRGRPSGHECPFLGTSSSCRHVASCTI 
RTPRRLCKSQLDRCSPLKERHKFGLSRS 
VLSAMTQSGIY/WQPPPPEFKRCFSCLN 
LLS S WD YRHAP PRRARA 


2066 


15967 




2081 


4 


326 


AGITGMSHRAWLFLYFLNKFAFT\YGLV 
LNFFLHKIQEPSLG\SGSGP/LSCNS 




15968 


A 






343 


PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPSPPPPPPPPPPP/PP 
PSSPSSPPPPPHTPHP 


2068 


15969 


A 


2083 


330 


88 


ACDRSIISRIHKEIAEQLCKQK\SNNPI 
KK W PKDM I R YFS KQDVQT VKTMNKCS TS 
LIIIEMQIRSTMRYYLTTVRTPHPS 


2069 


15970 A 


2084 


1 


340 


RVRSHGTTHLAQLI FVLLVQTGFHHDGQ 
DGPDLL/NLVIRPPQPPKVL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, MNMcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \= possible 
nucleotide insertion 


2070 


15971 


A 


2085 


2 


348 


ARALSLGANAAGNFKLRPLLRYHSQNPR 
ALKNYAKYTLPVLHRWHSESLETAFTEC 
FTEYFKPTVETYCSEKD/I/SYKALIiLI 
EKALGHPRALMEMYKDI S I VFMPVNTTS 
ILQPMD 


207] 


15972 


A 


2086 


369 


2 


KGLPPS PANLFFLSVY/MGFPMFTRL I L 
ISCPCYPPPSASPSAGIKGLNPPGWPPF 
S FFYQKFVRFFVLARNRTFGFVDYFS 1 1> 
YLIYGCSNLYRFLLSALYFFVSVFVFVL 
TQGIEAMVPRA 


2072 


15973 


A 


2087 


314 


3 


LLESVGPTRNSRPFKGLIGRILLDPEFY 
PSLVSDDLPASASPNAGRILPFFFFETG 
SHS\TQAGVQWRHHGSLQ\LKRSSYLSL 
TSSWVYRHTPPHPANICIFSRA 


2073 


15974 


A 


2088 


330 


16 


CPCFFLSAL.SVLVGWCFAFWCGCV/WW 
CVCFWVFVCVCFLGCWLCLCFV/VCL 
VGVCCWFFGGLCCVCC 














GTRTFLPPSYKDPCEY/IWAHPDNPGSS 
SNCNMLNFSTSD/PPVHSSGNWKVLSSP 
NRPYYYSYTATPHTDPTPHLPSPNPSSP 
SPSYPLLSDSTICQTTPTTTPITSSHTL 
LTS 






A 


2090 


350 


84 


QRKENKRTRKRATERRDESREEKAGRKE 
GENQEKRNKEQQGRQRRSRDRT/EEEKE 
EAKRREHKNPKKKKTKPPPQKKKKTEKK 
KTDNLSN 


2076 


15977 


A 


2091 


1 


355 


SDPPTSASQSAGITYVSHRTWPLLEFSG 
TS IRLAGKPAGVLVEVTGK/SVCGGGVT 
KTHWNE CHTGYP KCC WSSQAGES S LQPP 
PPGFKRFSCLSLPSSWDYRLLPP\QNFC 
IFSRDGL 


2077 


15978 


A 


2092 


27 


345 


AS I PCLKKRKKKKJKKTGGKPPP \GGPKP 
KRGGGGGPPPTKRTLFGPPQGNQTPGGV 
S/GPPKGDPPFSPIPPCQRKSPPFEKGG 
KKGPPPWRGVKREKGHFLKNFQQK 


2078 


15979 


A 


2093 


3 


301 


HEHVAQAGLKLLGSSDLPTLAS PKCWDC 
KR\DYCAWPHI FP I SGPL.YVFFPLPRMS 
PHTHTH KYTRTF FH YAHKH CACT LNLLLi 
RAQFQCHFFKGWPDP 


2079 


15980 


A 


2094 


76 


342 


WFXXFFFLLFVFFVFWFVCFCVFCGFFF 
FCFFCCVWFVFGFCLFLCFLFWFCFFVF 
FGFFGFWFFVFFLVCFLVFVCFFGCFFF 
LFCFF 


2080 


15981 


A 


2095 


115 


325 


MDERKKI RGGGRQG KE CKIH CKKKLS PG 
IRSYPVEN/F/VDTMYDYLQPAYYKLND 
LTNADPCAVRYLLFDQN 


2081 


15982 


A 


2096 


80 


227 


SCLGN/CIHLYSHSPTLSFTHTHTHTHT 
HRERERERERERE I C I CMS VYA 


2082 


15983 


A 


2097 


308 


1 


NSTVTMENSVNIHYRTRVFTEAQFTIAK 
SWNQPKCPSILEWIKKlAWIYIYVCVYI 
C VCVC I CVCVCVCVC VCVC I Y I YMMEY Y 
SAIKRNELAAFAVTWTRA 


2083 


15984 


A 


2098 


2 


361 


ARACLGLPSSWDF/ S VETRFHR VGQAGL 
E LATSRDPPTSAS PECRDCRH \ DHHTW P 
LL 


2084 


15985 


A 


2099 


1 


221 


LLWRIjRHENHIjNLGGRGCSEPRLHHCTP 
SWMTR/ GKTPSQKKKTKQPMEWQKXiCIjK 
KVFGNTGVRE I FNG I KLiS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 

NO: of 
peptide 

sequence' 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G— Glycine, ti—Histidine I = lsoleucine 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

T=TIireonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 

nucleotide insertion 


2085 


15986 


A 


2100 


186 




ERVYVCVCVCVCVCVYQCYLPQIRSLi/V 
ASSLKANARVCVCVCVCVCVCVCVCVCV 
SLWHPLTA 


2086 


15987 


A 


2101 


305 


2 


HNILLSQSLIQSKALTLFNSMKA/E/RG 
EEAAEEKLEASRGWLMRFKE\RSHLCN\ 
TKVQGEVASADTEAAPSYTEDHSKITDE 
GGYTKQQ I FNVDKKAS YWNMS 


2087 


15988 


A 


2102 


6 


166 


EQTALE I LARAI RQE KEIKGIQI RKEEV 
KL CFFVND \ MVL YLENS KDFLKVDAA 


2088 


15989 


A 


2103 


53 


176 


E I KNNRPGM VAHAY/ NPSTLGGRGRQ I S 
WGQEFETS I/VNMVK 


2089 


15990 


A 


2104 


3 


267 


FRHVGQGGLKVLTSGDPPALVSQSAGIT 
gvshcaqpivgdfntplsil/d /rstrq 
KINKDIQDIjNSALDEADLLDIYRTLHPK 
/STEYTFF 


2090 


15991 


A 


2105 


268 


3 


PPKEHGSSPATEQSWMENDFDELREEGF 
RRSNYSE/LREDIQTKGKEVENFEKNLE 
ECZTRITNTEKCLKELMELKIKARELRP 
ECRSLR 


2091 


15992 


A 


2106 


3 


170 


G FHHVDQAGLELLT PQ VI H P \ LGL PKC W 
DYRRE PP CLAS PHFHQ I AIS QKRHREAK 


2092 


15993 


A 


2107 


1 


398 


SARGPDG FTAE FYQT FKEELVQ ILLKQF 
QRIKGEILL/KNHYVKPSITLI PK\ PG\ 
RD I T KKLL TRS F CVS LCPVLSPLQSLQS 
RPSSLSMISLHSVCFSVLSASAHPHVHL 
CPTCPTLVLSGSHCVCCVSLFFF 


2093 


15994 


A 


2108 


3 


370 


HENWNNKGWGDSTI YSMKYLY FKPMLRP 
Y/C/SQKKIPFKILLLFDNAPGHPRVLM 
EIMYKDMEVF\MPVWTTF/ILQPMDQRV 
ILTFKSYYLRNTFHKTIAAINSDYSDGS 
GQSQLKTFWKGFIVL 


2094 


15995 


A 


2109 


1 


213 


HFPVENESAPG/FKAAGDLLTLLLGGNA 
AGDFKLKALLVYPSENPCFLKGS FKPNL 
PLVWCSHKKAWVQLG 


2095 


15996 


A 


2110 


391 


3 


KKKKKNPHRKKI KKRWKKI FNAERNKKR 
AGVI I H I LDRLD FKQQTI RRD KR\G Y YT 
MINGTIQQEVITILNIHAANTKAVRYIK 
QVLLKLKTELiGPNTI I TGDTNTLLLSTL 
NRSSRQKNQQTLDLICNI 


2096 


15997 


A 


2111 


188 


365 


FQNTIHICVCVCVCVCVCVCL/'CVCFCV 
FFFCLLCGGFMCGCWCDFCILFCFYGVG 
FFFL 


2097 


15998 


A 








KPHDATTNPSLILAVAQMPAYQELEEEA 
IAYGRKLGGSQEDQI INAIDKLSVLLGA 
E I LKMITGR VS TE VDARLSFD/ SDAMVA 
TA 


2098 




A 








FLF'l'Ui'CJjFMTHILGHKINYITN/ CKRN 
VIIT/SYFSPHNRIKLKISIRKISRKSS 
NTWKLNNRLLHYPQ I KDEVSRE I RKYLE 
LNINENTNF/QNLWDIHK 


2099 


16000 


A 


2114 


3 


387 


QTNH/NI PLSQNL IQSKAI TAy NCMKAE 
RS EEAAGKFEASRGWFMRFKKSS Y\ I KV 
QGEAASAGVEAVAIYPDLAELIDE/GCY 
TT/QIFSVFQTAFFWKKKPSRTFM/REE 
KLIPGLKASKDSSSLLLRVHAAGD . 


2100 


16001 


A 


2115 


152 


393 


VYCPICWLVFFFCFCSVLILFVMFV/CL 
CFSFFCFLGFVWFSSFYLFICVFFFIV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyl.ilanine, 
G=Glycine, H=Histiriine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LGCLFLCWFDLFLFVLFVLCLFFVS 


2101 


16002 


A 


2116 


2 


373 


ARACLGLPSS WDF / S VENRFHRVGQAGG 
ELATSGDPPTSASPECWDCRH\DHHTWP 
LL 


2102 


16003 


A 


2117 


3 


449 


HEFDHAMLQAHRAHHLAIDAYHEFEETY 
IPKDQKYSFLHDSQTSFCFSDSIPTPFN 
MEETQQKSNLELLRrSLLLIESWLEPVR 
VTjRSMFANNLVYDTSDSDDYHFLjKDLEE 
G I QTLMGRAEKRHCRTVQNL KQTYRR/F 
DTNS/HNHDALL 


2103 


16004 


A 


2118 


146 


15 


FFFF YFFFLFFFFF I PFLFFFFFLFFFF 
FFYFIFILFIYSLLVF 


2104 


16005 


A 


2119 


406 


3 


LFSWETGFYSKMLSRTFTATEETSIPG 
FKVS KDKLTlA LGANAAGDFRLKPMLI Y 
HS /ENPRALKNYAKFTPPVLYKWKNKAL 
MTTHLFTARYTEYFKPTVETY/ILLLID 
NAPTHQRALMEM YKE I NVFM PANTTS I 


2105 


16006 


A 


2120 


109 


396 


YYFYFYLVL.FYFFIFLFFYFFFFFILLL 
FYFFFSFIFFFFLFVYFIFLCYFFLFFF 
FFFFYLFFYSFFFFFIFLFFFFFFFFFF 
SYYLIFFLSYII 


2106 


16007 


A 


2121 


15 


413 


I VLARNTN FWLSFliF P VALG I L I VL.KGV 
KY I FWP LE YCQRLKM FVSYS FH Y \ FFLG 
SLLFLKYGFHMYLILLL/CIFI IIMCFF 
IKYSFFFCCLYHFFFSFYLFFIjYFLIF\ 
CYLVILFFSFLLFLFLSSYCFFF 


2107 


16008 


A 


2122 


1306 


429 


SSSSSSSSHVLRIIKDEDFKILEQRQW 
LSEKEAQAL.CKEYENEDYFNKLIENMTS 
GPSLALVLLRDNGLQYWKQLLGPRTVEE 
AIEYFPESLCAQFAMDSLPVNQLYGSDS 
LETAERE I QHFFPLQSTLGL I KPHATSE 
/HKRGPSMVMILTKWNAVAEWRRLMGPT 
DPEEAKLLSPDSIRAQFGISKLKNIVHG 
ASNAYEAKEWNRLFEDPEEN 


2108 


16009 


A 


2123 


3 


206 


LRRLRQENHLNWRGGGCSE PRSQHCAAA 
W/VSNSETPQKKKKKXEKKKKKKNLPNS 
ALKKTYSQRGKLF 


2109 


16010 


A 


2124 


23 


401 


IASGRPFFFFFFFFFFWPPPPGGYPHFS 
FLKKKKKRGGGGGKSLLPPGKGPNPPK\ 
WGFPI.FPPPLFPQKTPPPPFFFYKTPPI 
PPQGPRGGPLKFTPPGGGGGNPPTIL.LD 
KKRGVLGPPPFFWTW 


2110 


16011 


A 


2125 


3 


439 


MFDVSLLTFTHSLFHFSPQFHRKCELST 
LCDGGELRDHI LLPTS I CP I TR / DKCS C 
PGEGC 


2111 


16012 


A . 


2126 


1 


213 


HFPVENESAPG/ FKAAGDLLTLLLGGNA 
AGDFKLKALLVYPSENPCFLKGSFKPNL 
PLVWCSHKKAWVQLG 


2112 


16013 


A 


2127 


104 


419 


NSFFFEEELYNPFGGLGKKTFFGGGEEF 
GHTPPENEALGGKNKFFTGEGGQTFSNN 
GEEKS VSFWISIEKI LHRALL /AHALCK 
NCWELNFGQKEEPFFPPPEEF 


2113 


16014 


A 


2128 


399 


140 


PPPPKNFFFPPKGKFFG/RG/VGPKFPP 
PKKRVFSQKPPRGFFYPPL.KKKNNPFPP 
PGKTFGPPRGFFKRPPPFFFFFFFFFFFF 
FFFFF 


2114 


16015 


A 


2129 


10 


457 


KTSWTWCPVPWPATQKAE/AGGSPEPG 
RSRLL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in. 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 

acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutan»ine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2115 


16016 


A 


2130 


409 


0 


PPPPSPPSPPPPPPS/PSPSPPPPPPSP 
PSSPPPSPPPPSPPPP/PPPPPPSPPPS 
PPPSPSPPPPPSSPPPSPPPSPPPSSSS 
SSPPPL . . 


2116 


16017 


A 


2131 


317 


177 


FFFFFFFFFFFFFFFFFFFFLRLQIFFF 
FLPILIFLFLFFLFFFLFF 


21 17 


160 IS 




2132 


3 


542 


epwsvaqagvqwrdIjSPlqplpprferf 
srlslps s wdyrrlpahpanf \ qflvet 
vfrhvgqaglelltsghlpaltsqsagi 
tgmsrts\rpgflfkv 




16019 




2133 


146 


439 


LKSVTS IAKTWIQPK/ CPTNSEVDKCPS 
TVKW I KKMW Y I / Y FTME Y YAAI K 


2119 


16020 


A 


2134 


95 


1 


GRVDAILAHCKLR/LPGFTLFSCLSLPS 
SWDC 


2120 


16021 


A 


2135 


2 


380 


THTHHTH/ THTHTHTHTHTSLS 


2121 


16022 


A 


2136 


4 


337 


KRNNKAWMTVHLFTAWFPEYFRPTVETY 
CSEKKIPFKIL/LLVDNAPGQPRVLVEM 
HKEMNWFRPANTAS ILQPMNQGG I STF 
NSYYLRNTFHKAIVAIDSNSSDGFGQNK 


2122 


16023 


A 


2137 


2 


356 


PVSS SQVRAS VYLKKKKKKGP P PEGITN 
TAGEKPPQSFRGKG\PPFPLISPKKEPV 
I\SFLKNFGPCTIGKKKPPHPPAKNGGP 
LQ/RP PTQGGGRGKKKGKNNQRPLS GLG 
GNRSPKPPF 


2123 


16024 


A 


2138 


2 


357 


FLGSS DPPPS AS PVGRATG / R VFFFFFL 
VEVVSHYVAQNGLELLDTSNPPAVASQS 
VRITCVSHRTWLLSPLYKIIQVCVSK/S 
PELEQSEDKSLKK 


2124 


16025 . 


A 


2139 


379 


16 


LLQVRCFVSTVNRGSSCQKT I QVYYVQE 
AIPPSFLLSPFLM/ PYTKINSRWIKDSN 
VKPKTIK\TLEENLGMPLNIFFTYQLLW 
LYLHPES QLE I CNS FRALQEGNLI I F F I 
GRVGRPGTTGL 


2125 


16026 


A 


2140 


73 


411 


NYLLNNLFFFFFLERGLTFAPRAGGWGG 
NLTSWNLGPPGPNKPPPFPPKRP\GTPK 
PTLKEGIjFGFFKTTGFPPGAQKGPELPG 
LRGPPGLAPPRGGNKGGNPPPGPLKTFW 
G 


2126 


16027 


A 


2141 


46 


421 


AG VS WRDHS SLQPCLTS RARA V \ SHLS F . 
LSS RDYGPMP P P P KKKKKKKKKBOCKKKK 
KKKKKKKKKKKKKKKKKKQ 


2127 


16028 


A 


2142 


26 


479 


LAYLLPSPVMYKPQNWRTILQSISNKGL 
I CR I YKETLQLNHKKTNNP I FFKWVKKS 
KHFTRGAIQVANK\HIKMKLNIISVYGN 
AQKNTMRYHHTP I KMKKMGCALWLTPVM 
CQEAGYILKMYPGWTPYPGRWGRPTAP 
ALPPRGTFPRD 


2128 


16029 


A 


2143 


9 


166 


QNRLLARLTKKKRGKNQAHP/ IKNDKGD 
ITADPTEIQASIREYHKH . ' 


2129 


16030 




2144 




328 


lekesrpppppppppppppspp/phpgs 
LPKPLGLQPLPRPEVSFLTEYLPKIjKAC 
EGGGVE IAAAS FPR I YLMGMCSRQDR I Q 
KDIDWIQKSRAEDCLFAGVKAILKA 


2130 


16031 


A 


2145 


1 


440 


KTFLRSLWQLVE/CYISSGLIDDHRRPM 
ALTPRHPRSGITAPRPRPQPPGRVGIPE 
PTALSPSPGPPPPPCSTPGRCQVPSLER 
RRKEGREPPSVGRGCGGHGISPSSDIFF 
HLNFCLAHPPDLFVFVCS I SNQLYIFHE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutnmic Acid, F=Phenylalanine, 
G— Glycine, H = Histidine, I=Isolcucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKGGR 


2131 


16032 


A 


2146 


23 


422 


IASGRPFFFFWGAPGGEEGKKNFFWGPR 
GGKKKIRGAPTPRGKGNPPPKPWGKKGG 
APPKNLGLFGKKGAQRGPPGWFKTPGKK 
KSPPPGPKKGKKNRGRP\GPPPQIFFKP 
FFSTGEKNPLGKKTPNWRLLPW 


2132 


16033 


A 


2147 


3 


372 


KKNTPFFLTPRGPPPPPRGPLSPPPKRA 
TPP P P P FKKKKPG FP PKKKNFFP PPGG G 
PPPP/LPGEKKGGYF/WW 


2133 


16034 


A 


2148 


363 


1 


FNCRWFETKSHSVPQAGEQWHSFGPLQP 
LPPGFK\EFSCLSL 


2134 


16035 


A 


2149 


2 


361 


FFFLVETGFHQVGQGGLEHLTSDDPPTF 
GL>P \ GGWDYR 


2135 


16036 


A 


2150 


387 


2 


KKSFFLVSPARVQWGDPFNPPIPGSNNF 
P FST P PKTGV I RG P P PARK I FVF FI KTG 
\ FPQLGRGVLKS LPQ / CDS P P PAP PKGG 
VSGGNPPAPPFFFFFFEMESHSVSQVGV 
QWRNL.GSLQPPSPGFKQF 


2136 


16037 


A 


2151 


23 


421 


IASGRPFFFFFFFFFFFFLGGRGVFFYP 
PGGGGGPNFFKKTPFPPGKKKFFPPPSP 
KGGFLTPPPPPPFFFFFLKKKGVINGGG 
GGVKISPPGGTPPFLPQKGG/NKKGGPP 
PPRKKKIFLFFFPGAIKQRPPP 


2137 


16038 


A 


2152 


3 


366 


VIWAHCIFH1.LGYTDRPCLKKKKKGFPG 
GEKMGLKNFFPPLGKKPPPPQIPNFWSV 
IEKPPRGFYbGGAAPPKKFFIjLKP\GPP 
FFKEP P PPKKKGGAS PPLKKKFFKGKKS 
PPFFFSDPTS 


2138 


16039 


A 


2153 


423 


253 


FFWGGGAPIFPPPKKGFFPKNPPGVFFS 
PPKKKKIFFFP/HPVNFGPPKDFFKRPP 
P 


2139 


1604O 


A 


2154 


2S7 


1 


KEMNKKKNPKKKKVSKKISTAFYNKTLN 
NINFF\FAFFFFFFFFFFIFFFFFFFFF 
FFFFFFFFFFFFFFFFRFLFLFNYKQTA 
QLMQHSAPTRP 


2140 


16041 


A 


2155 


433 


2 


RRVLFIFPPL/HPFSPPPLPFSPFPPPP 
PMKFFSPPPTFFYFYKSPPHPP/HPPQV 
VFSPPTPPMPFITPPPPPHTSPSFPSFF 
FFFPFFFFFFFPSFFFFFFFFFFFFFFF 
FFFFFFFFFFFFHI ISIKGEIGRSSWR 
VMSRTRG 


2141 


16042 


A 


2156 


69 


606 


LWWPPLSRHAAYRQWPGPTAPRGLGHKV 
KGPGAS PA \ PCGAAAGSTAQG \ GGGAAC 
LPGPAAGAVTWPAGPGGGRATGPVLQR 
PAGAGQPTQQGQHDHAGRVLWQHGIGRP 
GAHRPGPCAP/DSAPRAPGGRCG ~ 


2142 


16043 


A 


2157 


449 


0 


SSPGSSSSCSPPPPGGGVGFFL/YKPPQ 
KKPPPPPVGGPGFFPQKKLKSFFPLPPP 
FFLGGGGPPPPPPPKIKSFYPPPKKVSF 
PPPPKKAFLPPPPPPLPPPPPSPPQPPP 
LF 


2143 


16044 


A 


2158 


241 


5 


KKFSFFPPGGSQGGFFGSLQNPPPGFTP 
FFCLNLGKKWGQRGPPPRPNNF / S FFFF 
FFFFLVE TG FHH I SQDGLDLLTS 


2144 


16045 


A 


2159 


434 


2 


LGSASQLGCSGVRDPLEEAVCPFSDLQL 
CARRTTALFKAVRQGHLSLQRLLLSF\V 
CLCP VPRSGAYRGRQASLS CGGIiH PVQA 
SRLLCLPKQAWAMAGAP/PPASLQPCSL 
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I S DC CASNQRDS VG VG PS E PRAG YNLL V 
RCFLSP 


2145 


16046 


A 


2160 


448 


84 


FLGGFFFFFFWFPPPSPPQKEAL\QKKI 
FFVFFSPESNQKNFFFSFSERGGPPNFI, 
FKGGGR\FSPWFLFFFFFFWFFFLWFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFLGKCPLKYT 














RGCSEPRLRHCTPAWVT\KETVSQKKKK 
KFLTLGGKTFKNFFFFNRAPGTFP 














FFHSPPPPPPPAGGAVFPPKKKKKSPPP 
PTPLLL/RGGGGPPPPPPPKRGPPPQKP 
KRGFFSPPKKKKKFFPGPGGPPGPPQKT 
PPPPPPPFFFFFFFFFFFFFFFFSL.FFF 
FLRTLWRKGRGRGI 


2148 


16049 


A 


2163 


5 


370 


QSSCVQWLVPWPTIQEAEAGGFLEPRS 
ERLiQ/CSHCTPAWA 


2149 


16050 


A 


2164 


437 


36 


FFFFFSEAESCSVAQAGVQKHARSLPQP 
LPPG\SSDSCLSLLSSYMCLSPRTRGS 








2165 


3 


623 


RQGFTLVAWAGVQWYDLGSLQPPPTGFK 

rfsclslpsswnyrhappcpanfvfl.ee 
tgflhvgqaglelptsgdlpasasqsag 
itgmshca/rpkvcsyhlff 


2151 


16052 


A 


2166 


337 


0 


PPPPPPSPPPPS/PSPPPPPPSPPS\PS 


2152 


16053 


A 


2167 


350 


156 


AEATS KIRCQKCY I M I AGHS GTRLQFQL 
LQRLRQENHLWPRA/RGCNEPRSHHRTW 
PTWYISKSFLA 


2153 


16054 


A 


2168 




337 


DERSLSQRSRSWSYNGYYSDLSTARHSG 
HHKKRTKKKK/IKKKKKKKKRGPFKKKG 
PLKTRKKPRGGLLKAHPFWGGPPPGFFL 
TGEGAPPPVFFKKKKKKPPLGGKGGFLW 
G 


2154 


16055 


A 


2169 


2 


606 


RVXiRAVAAHEEPDKEGKEKPHAGVSPRG 
VKRQRRSSSGGSQEKRGRPSQEPPLAPP 
HRRRSQPP\HPGPLPPTNAAPTVPGPVE 
PLLLPPPPPPSLAPAGPAVAAPLPAPST 
SALiFTFSPLTVSAAGPKHKGHKERHKHH 
KH\PAPMVIPAPAEPI 


2155 


16056 


A 


2170 


311 


2 


GLQPLGLGSVKQCMDLACVPETVCVCVC 
VCVCARAHTPVCTQGCVPESTQCVCMHV 
C/VCVWACTCVPVCMHTCVGVGASVCMQ 
RNELGRWAWENDAI RQQRC 














NTPNSIYKAMSLKGPITGTFLPNYPGHK 
VC VC DT YL CVY QHTHTHTHTHTHT QAFP 
HI/HIYTHT 








2172 


2 


278 


KNRLNPGGGGCSKLRSCHCTLiAWAT\SE 
TPSQKKKE KRKRKGRKRKRKKKI VYTKI 
EKGNAKMTGESFIHFCKKSSSVPQVALS 
AEYRCSST 














fyhvgqaglklltssdppalglpkcwdY 
rre \d caq pnvks 


2159 


16060 


A 


2174 


195 


389 


FRVFS KLKY YYDFFRGRI SLS CPGWSTT 
PGFKRLSCLSLPSSWDYRRPPPCPAN\F 
CIFSRDGVSP 


2160 


16061 


A 


2175 


23 


374 


IASGRPLIFFFFFFFFFFLFCRGGGPPF 
NPRGKERGGFLIKGPPPPGGKKNFWAPP 
PGGGEIKRTPPPPGVFFFFLKKKGFSFG 
GGGGKK/PPPPGEPPPPPPQKGGKKKKK | 
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nucleotide insertion 














TRGPLF 


2161 


16062 


A 


2176 


2 


158 


FFFLKQSLSVTQAGVQW\PVDSLQPLPP 
RFKRFSCLSLPSSWDYRCVPQCPAN 


2162 


16063 


A 


2177 


340 


120 


PRFHFLASPSEMSQMTVKAKTTVPASEC 
AYPKIEPFFPF\DPRGSES\FDKLGVHH 
HPL FPLSGEPLMI LiHKNTH 


2163 


16064 


A 


2178 


3 


256 


YLKKDLNVNQKTLTLLE/EKQGK/HLHD 
MRLCKEFLNKTSKAQYILKKISQY/LIK 
VQNFNAVKDPVKRMKRQASD/ WENIFT 


2164 


16065 


A 


2179 


95 


1 


GR VDA I LAH CKLR/LPGFTLFS CLS LPS 
SWDC 


2165 


16066 


A 


2180 


1 


143 


ARGERERERERERERERERERERERERE 
RERVFV/RERAALSRERGDKCERSGLF 


2166 


16067 


A 


2181 


1 


304 


ARGQGHLSLQRLLLSF WCLCPGPTGGA 
YRGRQASLSCGGLHPVRASWLLCLPNQA 
WAIAGAPPAALLPPCSLISDCCASNQRD 
S VGVGP S KPCVGYNLLVW 


2167 


16068 


A 


2182 


3 


163 


FQRRSTKSCGWDKDARSQSRSPPRQQAH 
GHHS / HTHTHTHAHTHTHAHTHAWTRP 


2168 


16069 


A 


2183 


3 


205 


NFKLFCKGFFFLRQSL/DSVSQAGVQRH 
NLGSLQPLPAGFQLFLQPPPPGSKDTHP 
GVQRHDHNSLQP 


2169 


16070 


A 


2184 


228 


88 


WFFRIYDSFFLVFLFVFFFFFFFFFFFF 
FFCVFTP FPVYF I QYWY I 


2170 


16071 


A 


2185 


287 


3 


SLFLAIPQWEFQRKIYLRKQKIVLLACQ 
PSKGLiSIVQWRGQLNSPVLKSQDPTP\ 
PRFKRFLCINFLSRWSYRHVPPRPDSFV 
FLVETGFLHLV 


2171 


16072 


A 


2186 


252 


105 


VSISCHPQ\PCLIYTCY/KIDGYSNIHL 
CTHTHTHTHTHTHTRVELQGPA 


2172 


16073 


A 


2187 


267 


459 


KHVPGSS YTWL/ I FFFRGNFLKKGANFV 
PQGVIiKGGNliSSL.YPPPPRIiKQYSCIiTL. 
LRIGNYRHA 


2173 


16074 


A 


2188 


170 


15 


RSVMKDLNSHFSKENIQMANKRTKECSA 
LWFRE IK/ TTMRCHLTP IRMAT I KK 


2174 


16075 


A 


2189 


2 


504 


DVT I S TCHASAKVGTRLVFDHYGKI IQK 
TPYPHPRGMTVSVKQLFSTLPVRHKEFQ 
RNIKKGLG\RRSCFCFDF 


2175 


16076 


A 


2190 


1 


565 


FFETESL.SPRLECSGAISAHCNLHLPGS 
SDSPASASRVAGTTGACHHTWV/IFFVF 
PVETRFHHVSQDGLDFL/NLVIRPPRPP 
KVLG 


2176 


16077 


A ' 


2191 


142 


382 


NTPPLLFFFVIRDRVSLCCPRWSGVAQF 
WLS AT YAS R V \ KR FS CLS LP / SN WD YRC 
VP 


2177 


16078 


A 


2192 


138 


365 


KHQYHH CC / LLKKKKKKKKKKKKKKKKK 
KKKKIKNKKHGGPFKKNFW/EGQPRNWA 
GGV 


2178 


16079 


A 


2193 


385 


21 


RGEIFFFKTRRKKFSPQGGRGGVFPPSP 
PKNFFFPQGGKFFGG/EGGPKVPPPKKG 
GFPKKPQGGFKVPPKKKKKIISPPGVIG 
GPPGNFLKGAPPFFFFFFFFFFFFFFFF 
FFFLVRAVKLS 


2179 


16080 


A 


2194 


415 


56 


PPPPTTAPVFSPPPPPRPFFSPPPPVFF 
FRSFPPAPPPPPLFFPPPSPP\PSFPPP 
PPPTRPCPPPPPTI PPPPPPFSPPPPPF 
FFFFFFFFFSLLPFFFFFFFFFFFFSFL 
FLFFPSRLW 
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MKRPSPPPPTPPPPPPPPHPLLPPFPEK 
KTGPP/RGFFKGGERGPPPPKKKKDPPP 
QKQKKKKKWGGGGGGGGQRKKKHPQKNP 
PPPPGRGGGKNFLWGRPRGPPPQGGGGK 
K 


2181 


16082 


A 


2196 


2 


266 


SKPRSCHCTPAWATQ\ETPSYKSCKKKKK 
KGLPLGGPGEKTIFFSLKPGQKAQPKNK 
IPFKKKIPFFSTPKGKKKKNFQGGPKKK 
KKEI 


2182 


16083 


A 


2197 


1 


277 


ARGERERERERERERERERERERERERE 
RERERRHP/RPRERE 








2198 


1 


285 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERGGAPP 
PRLYRERVLKKQRGRFLPPVRERKK/RS 
PVQTLPGY 


2184 


16085 


A 


2199 


346 


1 


VFRHGLPCFCFPHQLQPPKRKKFNNRKR 
ALTSHTQFFVFETESCSVA/RECSGAIS 
AHCHL\CIMRSSNSPASGGAGSSELRFR 
HCTPVGQQTEIWSPKNKPIKKEEGCPYC 
QFLSC 


2185 


16086 


A 


2200 


3 


389 


HEGMILAHCSLNLPGSGD/ASHVARTTC 
VHHH VQLILFI FVETK/ FSHYAVQAG / L 
ELLSSSDP/PTSQSSRITGMSHH 


2186 


16087 


A 


2201 


3 


204 


HECHCTPAWAT \ SETLSQKKKKKKKKKK 
IFPGEKTWGGWNKKKFLFLNRKFFLGKG 
LFHLKRGPLiKLF 


2187 


16088 


A 


2202 


69 


379 


KKKRVFFWGPGGSKPPT/SGNPPPWPPK 
GGGLRGIjPPPPGPRGVIFKNFGKRGPPP 
PPGLKS WGPRDFFGLALQRGGI SGLNNG 
APP FF FGVLKKiP P LFFVS KG 












379 


KKKKKPPPPKKKNPGPKKQEGGPLRGPP 
TFSGGGGGKKTCPQKLKAAGG/SKKAPG 
P P PG AE KKN P ALGG E KKTLGPNGRGP KR 
LRGLAKKGNPPSSFGGK 


2189 


16090 


A 


2204 


' 


341 


GQSLPVS PRLQLSNGNRTLTLLS VTRND 
VGPYECEIQNPASANFSDPVTLNVLCEY 
LLF\PVAQATSPNPRSQRPGLSVPLSSK 
NADS PPTPRNPARHDFXiPQANMGRPSLD 
QE 


2190 


16091 


A 


2205 


2 


370 


DAPPRPANFVFLVKTGFPPVGQAGFICLP 
PPGDPPPLASQITGKG/HCAQPPF 


2191 


16092 


A 


2206 


362 


24 


PRGSSSASSSSSSSSSPPSSSSSSSSSS 
S2SPF\PPPPVIiKPPPPPPKKSPPPKKK 
NPPPPXKKTFFFFFFFFFLRRGLCPFGR 

M 


2192 


16093 


A 


2207 


357 


3 


AGQALWIiARVIPVLWEARSYYVRISWYR 
NIVAMDSDSLDGSGHSKliKSFRKGFPVL 
YAIKKTHDSWEGVS \ MSAL I VWKSLI PS 
CVDAFE E LTS SAEE VAAQWG I ATDLE L 
LVGCARA 


2193 


16094 


A . 


2208 


280 


411 


GNDVYFLVFLF/ CLFEKESCSVAQAGVQ 
WHDLS SLQALP PRRMA 


2194 


16095 


A 


2209 


3 


392 


P I I YE KYTQQ I NEM PR KLQHQQ LALVS R 
NGPIIiLHDNAQLHITQC/LQKLNELGYR 
VLPHLPYSLDLS PTDYYFFKHLDNF\LQ 
GKHFHNQEDAENVSQKFVKSQSMDFYAT 
/GNKLIS HWPNC VD CN 
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last amino 
acid 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PJienylaJanine ? 

K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2195 


16096 


A 


2210 


248 


417 


GILANCASIKDLISRIYKKLKQIN/KQK 
TND P L KKWAKDMNR YFSQE E I QGVNKHM 
K.K 


2196 


16097 


A 


2211 


213 


493 


GKRFYPACPGKIGALQGFFKGGGPFFFF 
FFFLTPPPNTNYLTNKLKI KS PLCTFYF 
SNILGFYPSITHRTIPDAWVTA/SQSGP 
VLMRSIS 


2197 


16098 


A 


2212 


159 


413 


LAPRVIFGPPKKPPERPPLFFFFFETES 
SSVTQAGVQWHNLSS\PPRFKRFY/CLS 
LPSNRDYKRPPPHPANFCTFSRDGVS 


2198 


16099 


A 


2213 


455 


60 


NPRREVGPICPPPKTRVPPQNPQVGFYS 
PPLKEKTFTSPAPVNLGPPRDPFKRPPP 
FFFFFFLDFFFFYFTIYKTSAACPRSPP 
AQPRVARRPLVPSS \PPLPSLCLAPAPR 
GPGSLCPRGSLEGDNGSSPG 


2199 


16100 


A 


2214 


2 


243 


LTLS PRLKCNGMVSAHCNFCLLSSSDS / 
RLR/QENCLNLGGGGCSELRLRHCTPAW 
TTERDSVSKTNKKNSLKIFTCFFVDAA 


2200 


16101 


A 


2215 


1 


286 


FSQLLRLFIC/SQGGRLLCCESCPASFH 
PECLS I EMPEGCWNCNDCKAGKKLiHYKQ 
I VW VKLGNYRQ VF PRTKRKYS I IVQTSF 
ILWIQSDLEIDR 


2201 












ITI FF FVRQGLTQAGVQ WR \ DLSGETLV 
IL.PPLEAPCSLQSSWDYRRVLPHLPNFC 
IV 


2202 


16103 


A 


2217 


216 


4 


VYPPSFMVFSQVFLSSTHISL.SFSFFWN 
YLFIYLFIYIjFIYIjRWSL/DSVTQAGVK 
WHNLGSLQPLLPGFK 


2203 


16104 


A 


2218 


14 


228 


KRSSHLFTDDI ILYMENTKHSTK/NLLE 
LIKEISKVTG /YQKSVAFLYVNNKQAIK 
KTIPLTIASKRIKNSGQA 


2204 


16105 


A 


2219 


244 


3 


EVLNQNSGL.PRWPNILRAKASLRVPRQC 
S RG WFS ANG AGTTR YLYAKEWGGLGGG 
GCS KLRS CHCTP VWAT\SETLSQKK 


2205 


16106 


A 


2220 


162 


2 


INM VGNL FFFG DR VS LiCHAG WS A VAP S W 
PTVASTSL\VKQSSFLSLPSSWNHRH 


2206 


16107 


A 


2221 


146 


3 


GRVDGVPWRNPGSL.QPPSP\GSSDPPTS 
ASQESGTTGAHHHTRLIFVF 


2207 


16108 


A 


2222 


239 


2 


SYISKPDKFPHDSSPEIKPVTVNWRNSF 
SFIFPFFFLFPEKESHSVTQAGVQWRNI, 
GVSSYWP\AGLKLL,TSGDPPALAS 


2208 


16109 


A 


2223 


2 


159 


LNRDLGGGGCSELRSCHCTPAWAT\SES 
PSQKKKRKKKKKKKKGGRGRNSKI 


2209 


16110 


A 


2224 


3 


345 


RFKLFSCLSLPSSWDYRRVPPRPA/NFF 
VFLiIET\GFAILTSSKTERQSRliECIFG 
FYGLPCREKRASERRSVEG/HERKILFS 
FDFFFLGGTESCFVTQAGVQGCYLGSPQ 
PPPPG 


2210 


16111 


A 


2225 


103 


319 


FSEE YRNVTNFLMLTMSCS CLTLVE / C / 
WSEGYMATPCTILLLLFFQRHCLTLSPG 
GVQWCSHSSLQPQTPGIK 


2211 


16112 


A 


2226 


2 


110 


FHHVGQAGIiELLTQWS IHLS LPKCWDF \ 
RHEPPHPA 


2212 


16113 


A 


2227 


2 


178 


IFLIFT FLEMS SH YVAQAGLEF PG / FKL 
TSRLSLLSSWDYRRPPPRLANFFAFLAK • 
GDAA 


2213 


16114 


A 


2228 


173 


3 


FIFIFISLFFIFFJLRQSIVLSPSAVQSR 
LQPPPPRFKQFS\CLSLLSSWDYKRVPP | 
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C 


2214 


16115 


A 


2229 


333 


1 


HTPPPPPTFHSLCTEGAPPPPQNT/PNP 
PRNPFIIPLSRSHTRANEPSLSTPRTHP 
HS PRP PLLTHPHPNPRAS AP PGSRYPPR 
ARHRERPRERPTQRERERERESVCVCV 


2215 


16116 


A 


2230 


350 


3 


KKKD I P PVNI YAPNTG AHKY I KQILLDL 
NRDGP\HTITAGDFTTPLSVLNSSAESR 
CSRPFI I S PSLWGLRE 


2216 


16117 


A 


2231 


277 


1 


QMHPARGHLPQALI PVQKPALISQGMSA 
SGSTQVS P FLS PCFWVEVNCSNTKVLT \ 
PFCGAGTGSHSFAQAGAHWCNRGSLQPH 
PPGFKRL 


2217 


16118 


A 


2232 


3 


335 


ETRFHHVGQAGLEPLTSDDPPASASQSV 
/GITGVSLRARPGES/GSWKTVCCNNMS 
EPT 


2218 


16119 


A 


2233 


29 


448 


CPSLRQAWHEAAIDEVRTGTYRQLFHPE 
QLITGKEDAANNYARGHYTIGKEI IDLV 
L.DRIRKLADQCTGLQ\GFLVFHSFGGGT 
GSGFTSLLMERLSVDYGKKSKLEFSIYP 
A PQ VFTAWEP YNF ILTTHTTLE HSD C A 


2219 


16120 


A 


2234 


1 


365 


GARLILVFLEETGFHYVGQAGLELLTSS 
DPPASA/FPKCW 


2220 


16121 


A 


2235 


28 


460 


DRLIDHISKLGTRGLQGFLVFHSCGNLG 
\TGFGFTSLiLMERLiSLDYGKKAKLEFSI 
YPAPQVS TAGVE P YNS ILTTHTTLEHS D 
CAFMVDNEAI YDI CRKNLD I ERPTYTNL 
NRVI SQI VSS ITASLiRFDGALNVDLTE F 
QTNLV 


2221 


16122 


A _ 


2236 


614 


84 


LAASLAACAQLSALAASHRMWALQRLRK 
LLTTEFGQS ININRIXGENDGETRALS F 
TGSALAAIjVKGLPEALQRQFEYEDP\ IV 
RGGKQ LLH / S PH F FKVLVAS RL.VT LE AG 
HFCPCCAETHKW\AWFRRYCMASRVAVA 
LDKRTPLPRVFLDEVAAVRVCGHIIjQLG 
DTELQQH I CHL 


2222 


16123 


A 


2237 


1 


393 


GPMLAQLSVFRCGSTSAPNDLWYHFIEL 
PYHGES ITMLIALPTESSTPLSAI IPHM 
STKTIDRWMSIMVPKKVQVILPKFTAVA 
QTDLKEPLKDLG ITD I VDS SQGHFCQ I T 
KAENLLV/ SHILQKQK 


2223 


16124 


A 


2238 


3 


402 


HVGQAGVQI GKACWEL YWLEHG I QPEGQ 
MPSDKTIGGGNDS FNTFFSETGAGKHVP 
RAVFVDLEPTVIDEGRTGTYRQLFHPEQ 
LI TGKEDAANNYARGHYTIGKE I I DLVL 
\DRIRKLA\DQCTGSQGFLGFP 


2224 


16125 


A 


2239 


2 


478 


GRGGLHRI PWTPLTPSFARGLVPSLAR 
GVEARNGAGPIKSYPRPGSRLKMQNGS K 
GSGLQNKTFHWEICDAHVNSKIQLKQ\H 
ISSRRHKDRVAGKPLKPKYRPYNKLQRS 
PSIIAAKLiAFQKDMMNPLAPAFLSSPLA 
AAEAVSSALTLP PRP FCFV 


2225 


16126 


A 


2240 


255 


2 


FLFVKPHQISCPTKKGIKSFLVLCPLNF 
FLFWRQDLAHPGWSAVTQSWLTAASN\Y 
GLKQSSYFSIiLSSWDYRCIPPHLiGGKRP 
L 


2226 


16127 


A 


2241 


365 


11 


EPPPPGGEKKKRGKPPPKKTPKKDGPQK 
KS AAF FGG \ GK I KKK KGAFR KKKKKGGG 
PPPKKKPPPRKKKKKKNFSPLPPPKHTP 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
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peptide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G-Glycinc, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoph»n, Y=Tyrosine, 
X=Unknown, *=Stopcodon, /=possible 
nucleotide deletion, \= possible 
nucleotide insertion 














SLFQKPPTKIFLFSPP1>I»F1iNILFFFPIj 
SPFIF 


2227 


16128 


A 


2242 


3 


143 


PNFLSPRLDCRGTIRGHCGLYiiPGSCDP 
S/CLSPPSSWNYRSTPP 


2228 


16129 


A 


2243 


132 


2 


MSAHCNLHLPGSSDSPASVSQVAGIMGA 
CH I F\ VFLVQTKPHHI 


2229 


16130 


A 


2244 


334 


43 


TS KHMKRCS I S IiATREMQNKTT I GGKFV 
PTRLiAN I Q KPENAKYRQGCLMC / WCWVC 
NIASPLWKTIWHYL.VKCLPYSSAIFTLG 
\IYPEEVLASSVPGT 


2230 


16131 


A 


2245 


312 


2 


PPSKLPEKNFFFFFFFLRQSL/SSVAHS 
WQWRDIjGPIjKPPPPESKQFSCPTLIjS I 
WETQTALSQDCATALHPGGQRETLSHKI 
CVCVCVCVCVCVCAVCIYIN 


2231 


16132 


A 


2246 


3 


230 


RAQAMVETSRERCLLRPPQIETR\HVAQ 
AGLKLLASSDPPTS ASQS AG I TGVSHHT 
WPQPLTFCPHAKSLPFINQI 


2232 


16133 


A 


2247 


2 


214 


GRVDLVIQAGVQWHDLGSLHPPLP\GSS 
DLP/ASDSQVAGTAGRFHYAWLIFFWF 
FVETGS /HTQVSNSYDVL 


2233 


16134 


A 


2248 


3 


139 


EG VQGCHHNS LQ P P T PG \ S S D P P TSAS I 
EAGTAGSHYHVWLI FLLF 


2234 


16135 


A 


2249 


3 


160 


EGVQGCHHSSLQPPTPG\SSDPPTSASI 
VAGTAGSHYHVW1, 1 FLLFCFEGDAA 














GQKRYKIFIjCFFIjFGWLVFLRWSLTLSP 
RliERESVSKKPKSQKPPIKKPKFTPREF 
KEVLAKYGTKFFFVFFGLFVWFFLRWSIj 
/NSVAQAGVQWRDPGSLQAS PRP 


2236 


16137 


A 


2251 


15 


394 


FVSFSFFPSFPSHLFFSSSSFPPFLPSF 
HFSFLPSDRPSWPS /FLPS FL.P 


2237 


16138 


A 


2252 


60 


306 


GRER ILEE IMAEDFLNLMKOLNI S IQVA 
QQIPSKMNSKRPHRYQHFRSQSQRILKA 
TREKQLAT YKGS S VT/ VS P S PGPQTVNS 


2238 


16139 


A 


2253 


3 


351 


G FHHVDQANLKLL.TS S DL PAST S QSTG I 
TGI\DHCTQPKFPI 


2239 


16140 


A 


2254 


150 


2 


RPRRPDHSRLGAGDQPGQHSKTPS\HQK 
KTKTSQAWRHAPAIAGTRQAEA 


2240 


16141 


A 


2255 


357 


1 


LNLNLSLTLYAKINLKWITGTNVKHKTT 
KFLGGKNGANIiMDTRLDNAFLDLTSKAQ 
LTKEKIDKL.NFI KIQTF/CS I KDLLGNIi 
KRQAPEEKKILRNHISNKELVSRIHKEV 
PKLNNKK 


2241 












FSLFPPVGGQGGFFSSCKSPPPRFRAFF 
CPNPFRKGGNRGPPPHPGKGTLGFFFFF 
FRQSFALSPRVEYNLCLPGSSDPPASAS 
QVGGAPGLPPPAWVNFGIFFF/CFFLRQ " 
SRSVAQARMQWRHLGS LQAMP 


2242 


16143 


A 


2257 


2 


132 


TLTLTPRLECSGT I S / AAHCKLHLPGSR 
HSPASAPRVAGCGGGHL 


2243 


16144 


A 


2258 


214 


347 


KI SLI LGVHKI CCEF/ CFFEMKSRSVTQ 
AGVQGHDLS SLQP P PPGL 


2244 


16145 


A 


2259 


347 


2 


FFSFFFFFSEAESRSVA\R1jECSDTVSA 
HCTLHLPGS 


2245 


16146 


A 


2260 


333 


1 


S DQR W TENAFVE LRDE G FAC PSFSEL/R 
STPSTSGEEVENFEKKIjDECITRITNTE 
KCLKELMELKAKAREPREECRSLRSRHN 
QliEERVSVMEDQMNEMKREGKFREKRT 


2246 


16147 


A 


2261 


2 


357 


SPRSCSVYGIAILLFLYFLYKLAFALLY 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R~ Arginine, S— Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














GLALNS PLHE I QEP / SIASGSGPLSRNS 


2247 


16148 




2262 


355 


0 


PPLPSPLSPPLSLSFFSLPPFPFFFPFF 
PLF/CPSFSFFFPSPFFP 


2248 


16149 


A 


2263 


348 


35 


YFLKKGFSFFPRGEGRGKDSRSLKQLTF 
GFKKP/SCPSFLRKW\ETRLCPPAQKIF 
FF F F FL LEMG F C Y I AQGDLKG S S QS S G I 
TGVSYHIWPTFIGHLTMCIAKC 


2249 


16150 


A 


2264 


270 


92 


DRPARRKMFFTYSSRLFFFFFFFFFFFF 
FFFFR\FFFFFFFFFKFFFFI,FSRWSLiM 
IFIS 


2250 


16151 




2265 


1 


417 


FR P PAGVQWRNL I S LQSLPPG F KRFLYL 
SLPSS/WDYRCMPPQLA/NFFVFWFDV 
WWWVLVEMRLRHVIiARLD I WL I C I 
SLMANGMEYLFLCSFAIHIiPSLLKCRFT 
YFAHCLFLYYFLETGSHSVTETGWQWCI 
I 


2251 


16152 


A 


2266 


344 


427 


LIGLCEDTNLGAIHAKRVTLIAQDLQLA 


2252 


16153 


A 


2267 


310 


417 


RVPYGTLGSGPLATMAGFDDKLKPHMEG 
AAAPIIiVR 


2253 


16154 


A 


2268 


186 


464 


NSCLSHNQRQLLFRIiEKGMGPISAQEKL 
VLHTLSGFSGLV\VGWLVFEMESCSWQ 
AGVQWRVHPPPRFKQFSWLSFPSSWDYG 
HVPPCPANF 


2254 


16155 


A 


2269 


. 


427 


PEPPSLiP PDGAKKQKTQKS KKWR I CFLR 
KKKKKKKR KKQKKKKKKRGGGSQ KKSRR 
GPKLTTETN 1 1 1 la I KGGLKKMNYREI E K 
KLLFGGGGVIGPTPTQDIKGREEINYLE 
AVGREKQRFISLVK/TNVA/HEATRDTI 
FRGYL ■ 


2255 


16156 


A 


2270 


399 


44 


ISFQLLLPITVLPGHVRVLMEMYKEVKV 
VFVP\STTTFMLQSMGQGVTLTFKSY\Y 
LRNTFNKAVAALTSDSSEESGQSQLKPV 
WAAFSSLNASKNI /RDSWEAVKIPALKG 
VWKKWSLKM 


2256 


16157 


A 


2271 


19 


420 


AAGIRHEERERERERERE/QRERERERE 
RERERERARRNIYHTYRPRPPRVFFFFF 
FFFFKKKRRGFLVPTPPGGGGAQKKKTS 
LEGGKGVFFKGGG KKTPLKKPGGWGE PP 
QKKNGGGGPPPRDPPPPPRPLF 


2257 


16158 


A 


2272 


463 


20 


SYNIPLSQSLIQ/SRALTLFDSTKAERN 
RRGK\RSGKLQWEGSRGWLMRFKERSHL 
HNIKVQDEAVSYPEDLDKMDALNTKQQI 
FSVHKIALYSKKMPSRIFIAVERESMPG 
FQASKDRL/liLLG/ATAAGDLKLKPMLI 
YNSKNPRVPRAEF 


2258 


16159 


A 


2273 


474 


82 


VGVWADFLKNTSQAQATKAKMDKWDP I K 
L.KSFCTAKETISKAKRQPTEGEKIFANY 
TSDKGLI / SRIDKELKQPYRKNPNNPVL 
KWAKGWAQWLTPVI LALWEAKAGRTQGQ 
E I EATLALiFSGIjFCQVFLC 


2259 


16160 


A 


2274 


152 


3 


AEGRNGDGI IQKSS I \RTLLSNDKNPQN 
IHRRPTRFLrMLYQQNLCHLGL 


2260 


16161 


A 


2275 


32 


361 


LGAS AR YE K PT VNL I LNGERLN I F P I R / 
SKTRLGYLLSLLLFNIGLAILASAINQK 
KEIQI IQI VKKKI KTQRKKKKNKTKKKK 
ALiFKFKGGPEKEKGPPKNPFKTPPWF 


2261 


16162 


A 


2276 


329 


487 


EFVNITIKIATSLHYKAIVIKA/AMVIW 
YWHKNRHIDQWNRIESLEINPHICNK 
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correspond 

ing to first 

amino ncid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=G)ycine, H=Histidine, I=Isofeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutnminc, R=Arginine, S=Serine, 
T=Thi eonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


2262 


16163 


A 


2277 


2 


466 


AHHS CSHLL.TTAAHYS CSPQLLTTTAAH 
TYSP PQLLTPAHHHS CS HIiLTTTAAHHS 
CSPPQVLTAAHH\GAHTCSPQLLTPAHH 
SSPQLLTPAHHSCPHLIjTTTAAHHSCSP 
PQLANSYRG\SPYCSSWSQTPGFRKSSH 
VGLPEPWIFHGLQW 


2263 


16164 


A 


2278 


4 


467 


KNVTQQGKIHIRDKLDEMWGNTSVFCTN 
HMKHQTNFNAKKCNVFKECGK\TACNFQ 
LTQYQISHANQKPYECQICGKPFRKRAH 
LTQHNRIHTGGKPYECKECGKVFICCST 
LIQHKRTHTSEKPYECLECRKTFRRSAH 
LIRHQRIHTGEKPYK 


2264 


16165 


A 


2279 


383 


3 


FLCVCACVCVYVCMCMCACAC\CMCMCG 
CVCVCVCVCVGLRGLGWGAWCRSWGPP 
LCFLLLG I LPLKSRLLWLPRRT I S I CTL 
PSAQGPliPAPGFGKYASNTTGVKGSSSV 
FSLLSRITALSHLHW 


2265 


16166 


A 


2280 


47 


219 


VCSELKSCHCTPAWAT\SVTLSQKQNKT 
KQRRTLGS I FFQHTFMHLKKEKSLILQK 
W 


2266 


16167 


A 


2281 


294 


160 


NKTFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFLFLF 


2267 


16168 


A 


2282 


68 


490 


RTFAYPKPPVFGSHSGKTTIPKNARGSF 
PS/PPLPFPCPSPSPGPSPVSCLPPRLP 
SPSPSPSLSPISRLPSP/SRSPSPSLS 


2268 


16169 


A 


2283 


3 


454 


CQSAPLGGASQLGYSGVRDTLEEAVCPF 
S DIWL CAGRTTTLFKAVRQGHLS LQRFR 
LPFVWLCPAPRGGVYRGRQASLSCGGLH 
QVRASRPLCLPTQASAMAGAPPPASLPP 
CSIiISDCCASNERGSTGMGP/SEPGTG 


2269 


16170 


A 


2285 


1 


452 


LKDSGRDYVSQFEGCALGKQLNLKLLDN 
WDSETSTFSKLREQLGPVTQEFWDNLEK 
DTEGLRQEMSKDLEEEKAKVQPYLDDFQ 
KKWQEEMEbYRQKEEPLRAELQEGARQK 
LHEL/QEELSP/LGQEMLDRARAHVDAL 
RTHMAPYSDELRQ 


2270 


16171 


A 


2286 


3 


266 


WSSPPSSGHSTPRLAPPSPAREGTDKAV 
SALKS PQPNRGMGRGQR/ PGLPS / DTAT 
ITPHTSGFPKQPQLSLKVQTQRAKGRLS 
HWDLEP 


2271 


16172 


A 


2287 


274 


453 


IYTFVKSSSKTLRPRHIDQWNRIENPEI 
KP / EYSQL I FDKANKN I KWEKDTLLNKW 
CWDN 


2272 


16173 


A 


2288 


81 


487 


TVYFKPTVETY\CWDKKKI PFKGLLRRD 
NTPCYPKSLLEMCEKINIVFAPATTTSS 
/LKPMDQGVIVTFKSNYLKNTF\RLGEG 
RKKKKKKGRKKSFHKALAAINS DSSDGA 
GQSKQKT/FWKGFSIPNAIKNIQDPWE 


2273 


16174 


A 


2289 


75 


469 


SRGVAGAPPKPSPSTPSPGPLDVTPGPH 
SSHHAASPGP\PPPPEPTASSMASAP\P 
PAPQPTPL\PPATLGPPSAGPE/PSPGS 
CTSTGWGYSFCCPRCRRNMRWQPPQQGP 
AWHWWPREFP/ PPPRVSGS 


2274 


16175 


A 


2290 


256 


55 


PTQPLRVELTTVTLRCDINKWDYIKLKS 
FCTA\ KKRQPTEREKI FPNHVSNKRL IS 
KIYKELIHRI 


2275 


16176 


A 


2291 


497 


29 


SLTHRVAGGAAVTPLAHAGARQIFFLGD 
PHPTSSLLGWGPAWDPCAFQVSDHPASS 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RVGKTLLSCPGGGRL.PWVTCPGGGQLSC 
LTQMHLTPAGMCSPCVCVCVCVCVCVPN 
GMCPRCMCVCVCPCRDVY/ ICVCVCVCV 
CVCMCLCPCWDVFTEL 


2276 


16177 


A 


2292 


,141 


392 


SCSTEVKQPKIGVREVDFVAAPGIAPSS 
P QKKQTTS TS S L KS LLR LS KVR FLFN I V 
LDVliAREIRQEKEIKGIQ/LGKEE 


2277 


16178 


A . 


2293 


14 


291 


YVGTEG X S FIS FMRVTNYMTRHLATIjRE 
S\CYSR\VYPRFIEFLRFDIQSTGQ/RI 
TSRQH P PR/ DLRDALLYLNRR I TLVRTR 
CKSVAKRPPGSP 


2278 


16179 




2294 


1 10 


293 


LLSNRS LAI ASLCGGCNELR SCH WTPA \ 
WRQRET VKKKKKKKKKPGWG FLNPPPQR 
GKLCF 


2279 


16180 


A 


2295 


96 


313 


WNGCYLLSNRSIAIASLCGGCNELRSCH 
WTPA\ WRQRETVKKKKKKRKKPG WG FLN 
PTPQREKLGFLKRGPGF ■ 


2280 


16181 


A 


2296 


234 


2 


CCLETTRSLFDKGTKENTQWGKDSPSNK 
RCWKNWISTCKRMKLDPYLI P / YTNINV 
KY/ I KDLNLRPE I TKLLEENLREK 


2281 


16182 


A 


2297 


334 


6 


KLFSPPGGGGGPPPPPPKKGWVPPETP\ 
KRGGGAPNPPPPKGGGAPKPPPQKKNSP 
PKKKKKIFCPPPKKKGPKGVFLRGPRGP 
YRVFLKGPPLFFFFLKKSWRPLAMYA 


2282 


16183 


A 


2298 


467 


8 


LiPGFKASTDRDN\VAGDFKLKQMIjIYHS 
ESSSALKNDTKSTLPVLYRNKEAW/VTA 
HLLIPWCTEYFKLIAETCCSERKISFKI 
IjLL I \ DNAPS YPRALMKM FK INVFMS DN 
TTSIVYSTDQGVILTCNSYYLRNTFYKA 
ITAIDSDSCRMPQEGN j 


2283 


16184 




2299 


1 


449 


SIYLSIP/FNLSINLSIYLSIPIYLSIY 
/HI S VY I YL/ S I YLS S IHLS AI I CLS I F 
QSISLSLSINLSTYLSTIHLSLSSIIYS 
S 1 1 YHLS IH/LS YYLiS 


2284 


16185 


A 


2300 


1 


445 


QAGLQLLTSGDPRTSGLPQCWDYRC\DH 
RSWQT 


2285 


16186 




2301 


241 


22 


KWVLGTCACVHVFI CVYMCLYVCAWV/ C 
VSMCLYVCACVCMNQC/VTVCMCVCVCV 
CRFVCVGIPPHSKWISIG 




16187 




2302 


3 


478 


GGQTETLIjTSQRKGGWPEALLTSQMGRP 
GRGAPHI PDDEQPGRDAPHLPDGAAGCA 
S APH L PDG E \ PGRGAPHI P DG AAG QRR S 
PLPRWGG / ELGRGAPH IPDGAAGQRRS S 
HPR/PGRPGRGAPHFPDGATG/Q/DGAP 
HFPGIPPDGT 


2287 


16188 


A 


2303 


440 


41 


KSHHLSFLSFLLFFFP/TKSHSVAQ/CW 
SAISAHCNLCLPGSSHSSALASRVAVTT 
GVYRI 


2288 


16189 


A 


2304 


2 


395 


FFLVKTRFLHVGQAGLKLPTSGGPPALA 
SQ/SL/RITGMSHRTQPE 


2289 


16190 


A 


2305 


184 


2 


sikkpkplvnlrnh/nt/wqgavtpacn 
sstlggrggritrsgvqdqsgqrresef 
lrlgygld 


2290 


16191 


A 


2306 


86 


472 


I I KLCKWQNK/RFEAGSLVPEMGFYYVT 
QAGLELL FSRDPHTS ASQSAG I TGAAFH 
QRWSVGTVLLQVDRGTPPVGDCGSRTPQ 
WPGQAFLRTALKS EAHP PHSSTDVTPVL 
WSEGSPCLLSPPSLSFTG • 
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nucleotide 

correspond 
ing to first 

residue of 

sequence 


Predict- 
ed end 
nucle- 

location 

ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E-Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N = Asparagine 5 P—Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y = Tyrosine, 
X=Unknown, *=Stop codon, /^possible 

nucleotide insertion 


2291 


16192 


A 


2307 


133 


2 


RVIHVWRSHPAIP\TTREAEAGESPEP 
GGGGCNEPRSCHCTPAW 


2292 


16193 


A 


2308 


3 


369 


LTTAFPDFPGSTHPLTSASQVAWATGAH 
HHS WX,I FVFFVETGFHL/SELLSSSS PL 
ASASQSAGITG/REPPCLTSPFFF 


2293 




A 




253 


365 


TDYFYIjFIYLRQSIj/SSVAQAGVQWQDLi 
GSLQPPPSGFK 


2294 


16195 


A 


2310 


362 


0 


TKAQQCVHENHFKLKDANTIjNIKVWRNI 
CHS S PNQ KKYGLAI LNLDKS G FRSRKDT 
GDEE /HFIKI KKS VIQEAI I I INI YA 


2295 


16196 


A 


2311 


167 


2 


LiCYCVIFVFFI F wrqs IiTLS QGGMQWCD 
HGSLQPLPP\GIiKRSSHLSLPNSWDYRR 














DHLIPGGGGCSELiQSHPCTPAWVTA/ET 
LSQKKKKKREK 


2297 


16198 


A 


2313 


188 


437 


AGFFPPENQLTNMKVRKANESDPWGVKP 
ESDESETHGSLSIjSLjSLFFIjFIFIjRQSLi 
/NSVIQAGVQWRNLSSIiQRISP 


2298 


16199 


A 


2314 


41 


325 


TTTLFXRVRQGYSSLLMFLLPFVCLCPA 

LPTQASAIAGAPPPALLPPCSLISVCCA 
SNEQGSVG*DP 


2299 


16200 


A 


2315 


417 


0 


SP PRNWDYRCVP PRPANF VFIiVEMGF/ Y 
HVARPEFGLELPTSGEPPTLA/FSKCWD 


2300 . 


16201 


A 


2316 


98 


288 


LMA WPATWEAE AQES LE PGG WGGG E EG 
CSELRSCHCTPAWVT\SETLSHTHTKKK 
KKNGAAL 


2301 


16202 


A 


2317 


1 


410 


LNHIPNLSLTKRKPSPHSLNLKKKKKKK 
KKKKKKKKKKKKRGGGVKKKPRGGQKKK 
GGEKKNFPSKKGGKKKKRGEFGKKNFFG 
GGKKREKTPQKKKSPKGKKKNLR/EERG 
EKNPKRGGEKKRSSSPRNNNL.RGEKK 


2302 


16203 


A 


2318 


249 


3 


PLKASSPPKAFNFCREVGPICPPPKKKV 
P\PKIPKLVFIPPPIRKKLLPCPPPLTL 
APPRVPLKRPP 


2303 


16204 


A 


2319 


2 


393 


AHLGLPKCWDYRHEPFRPAPLFLLNSYF 
GLDLLT / S /GDPP AIiASQSAE I TGVSHC 
AQ/PEYVY 


2304 


16205 


A 


2320 


389 


2 


RGNSNIGGPGPLRGKKFSPPPPLKNWGT 
KLGPPPPPPFFFFFF\GKPGFSIFFKPI 
LKFKGPKKPFPRAPKNGGSPGPGWFFFF 
FT1SNFTQLPGIKL1SKGGKKNFASPPPPF 
FFFVHYKTGYPSATPDAW 


2305 


16206 


A 


2321 


389 


206 


FQWRWGF/NKVGQANLELLTSNDPPASA 
SRSAGTTGMSHHAQLKNYFLMVRMWRNW 
IAVGM 


2306 


16207 


A 


2322 


371 


42 


SFPPQSGFFSPPPPHEFFFPPPPSFFSW 
VGVRQIPPPPKIFSSPFPPGGFFSPPPE 
R/VDFFFSPPPFFFPPPSFFLiSPPPPFF 
FFFFFFFFFFFFFFFFFFFSFFGVKKRK 


2307 


16208 . 


A 


2323 


82 


386 


PFLTQKYFFTPPEEGFLKKPNRREGPPS 
P ITDPTLWPNMM KG I VPKAPP I I FMGGG 
INMTFSGFVTTKAPFPRPLRFNPMLRQG 
FDLLTLKAS \WGS SASWY 


2308 


16209 


A 


2324 


1 


413 


RGSGDNRHGPPCRVNFVFLVETGFLHVG 
RSGLELPTSPALASQSVGITGVSC/RPP 
PQASY 


2309 


16210 


A 


2325 


58 


400 | 


SETLVSKKKKKKKKTPPPPKNPKKKKNP 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine T 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














PPPPPGAKKGPGGKF/PPPPIFQKKRGG 
FKK/SPPKKPPKKKKTPPPLGKKGPPPP 
RGGPLGG FPPPPPQBCKNGPPPPKKKPPF 
FFPPK 


2310 


16211 


A 


2326 


462 


306 


AHHNLRLPGSSDSPASASGVAGTT/GMC 
HHARPILYLSGDASALLHCFSSAQLF 


2311 


16212 


A 


2327 


1 


393 


SRPPSRTEKIRNFFFVETGSCYVAQAAL 
KLLDSSDPPTSASQSAG/ITGMSHRAQP 
TS 


2312 


16213 


A 


2328 


391 


2 


AWFKETKAGWI IPRSWDRGSWVSQPYSA 
LTSSPESGFHSVTNGDSMTCLEDGKHSL 
VAPHGIPQRGSSLQDGVSFCPAPTMKPY 
LASTKM\RLHEARELGFFSFLSLRQGLA 
LSPQLECSGAILAHCSLNL 














GCSELRSRFCTPAWAT\SKTASQKKKRG 

kkkkggvfyf 


2314 


16215 


A 


2330 


2 


406 


AAAPSALALRDGWAVRPELDLLPPCGEE 
VAPGAHCLGCGPSPCLFLSPSHTRQSPP 
APTSSPGLSTSPPLVPTHVSAPHSSKGP 
PS I PGAQALRG CGLGGWDR \ PS S PSLP/ 
PDVSPKPLNFAP 


2315 


16216 


A 


2332 


226 


377 


KRKSKHITFLFKTL.SWPDTVAHACNPST 
LGGQGGR I/TPRS GVRDQPDQHG 


2316 


16217 


A 


2333 


3 


191 


CLSPGGGGCGGLRLCYCMPAWVT\NETV 
SQERKKERRKEREKEIKKERKKRKKERK 
KEKKKKG 


2317 


16218 


A 


2334 


295 


81 


FFKPFFPPKKTCPEFS\FFWEKRGFPPP 
PLRSFFQNPPPKGGPPILLKGPPPPPSL 
GWPPPPPLFFFFFFLR 












8 


TQIVPLPSNLGNKTRLRLKKKKRNEQGN 
IPTDTIDNRIKQIIQ/TYYEQLCANK\N 
IjDKMDKALESHNFPKIjKORESL.NI/HSA 
KEIHFI/ILN I ST KKTPDPTG FTGKFLQ 
I FKEKKMAGHSGSHL 


2319 


16220 


A 


2336 


399 


97 


FFFFFFFKQNLA/SVTQVGVQGQYFRSL 
QPLPPRVKPFSFPNPLSNRGYRGPPLGR 
VRQENCPKLKS KRFQLNKI PRLGKKKKL 
RFPKKKKKEKEKKI VKT 


2320 


16221 


A 


2337 


411 


57 


KKPRSFSSCSSSPPFFFFFSPPPKKKIF 
PPPQIFWGPPFFPPPPFFKPPPPPFFFS 
PQKKKKI \SPPPPKKKFFFKTPPPPFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFF 














akapslslvlfsfstfflgiqglallpm 
LECRGAITAYCSLNLiPDSSGPPTSAPSP 
t p y / r i ag atgthhnalll f kf fs rdg l 
PL 


2322 


16223 


A 


2339 


391 


45 


LMFFHLSHKHRSGEAPSIHWSIYLSIHP 
SIHPSLYHSSIHLSIHSTIYPSTCLSIH 
/CISIHPSIYPSICPSVHLLAHSFIHST 
I CP SMHLP IHVS I QHFLS AQ I LP VS VFG 
EVSDI 


2323 


16224 


A 


2340 


506 


0 


R'bHEQLGIVRADKKKKKKKKKKIKKKKK 
KKKS SS SSSS PGTFFRG/ VPLKDPVG 


2324 


16225 


A 


2341 


203 


1 


VELRVRATEPG FNFLKGI YCTSMVDWI K 
KMWYI YTMEYHAAI KG/DRIMS FVATWM 
ELEAIFLSKLMQ 


2325 


16226 


A 


2342 


402 


40 ! 


PYPPKKKASPTDA\FSSSSSSPPPFFFF 
SPPPKKGFFSKPFFFFSPRFFSFPPFLK J 
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PPPQFFFFGPLKKNFFFPPPAIiKFFFFK 
S PPPLFFFFFFFFFVEAGVX/LCYSG 


2326 


16227 


A 


2343 


181 


1 


QLKDQASDLLGKNGDEVKET I PS FLPSS 
LPPF\L.PLFLHQFLPSFLPSFFPPSLLP 
FLS 


2327 


16228 


A 


2344 


2 


407 


FVASQIiGCSGV\RVRDPLEEAVCPFSDL 
QliRAGRTTALFKAVRQGHLSLQRLjLiLS F 
\VFLCPAPRGGAYRGRQAFLSCGGLHPV 
GASRLLiCLPKQAWAMAGAPPPASbPPCS 
L I S DC CAS NQRDS VGARPS E PG AGH 


2328 


16229 


A 


2345 


405 


178 


I IKFI YEKATA/ LLNG/EKLKAFPLKLG 
TRQGCLHSLLVCNTVLEVLTGTLRQEKE 
IKGIIIEKKEVELSLFADRIL 


2329 


16230 


A 


2346 


2 


412 


FKASKASLSPLLGANTAGDFKLKPGLIY 
HSLH\IiKNYADS ILLVLCQWWNKAWMIA 
HLFTAWFTEYFSPPLRPAQKKISFKMLL 
FIDSAPSQPGVLMEMYKEINWFMPANT 
TSILQPMDQGVI LTLKSYWLRDTFH 


2330 


16231 


A 


2347 


2 


397 


ESLEPGRRRLQGARIMPLHSSLDNRVRL 
CLKKERXKEIGVLIRCWQECKIVQPLWE 
IVWYFLKKLN\ESPYEPAVPLLNIYPRE 
MKIHVYTNTCTQI FI VALFTIAKSGKWW 
GTVACACKPSSSE / WLRWVDYL 


2331 


16232 


A 


2348 




423 


EGCSELGSHHCTPAWAT/AESVSQKKKK 
PKKK 


2332 


16233 


A 


2349 


49 


262 


QMCKG SNRRRGKRVGS RQIS KKKTNAP I 
KKWAKDMNREF/D I QMANKHMEKCSTS L 
I IREMQIKSTMRYHLY 


2333 


16234 


A 


2350 


356 


2 


FVTAPLHSSLGNRARSYCKE KKKVQVAA 
KAVLRGKFI I AYTVFKKRKISN INLS I S 
LKTLEKEEHTETKADGAQYVTKI \RAKI 
NKIETANETKSRSliEKTSKTVEGKCLRD 
ILLSAQI 


2334 


16235 


A 


2351 


360 


19 


LDAQFLEVGliGELL.FRSTVPTbQPPGCG 
ASFPVLNTLPF\SLSPSQSSSSPASLVR 
PWVAPPFLCPHGEPDGGPDSTTSL.PFPG 
PAATGPAARLIQHPASRQPRPASHTHCG 
V 


2335 


16236 


A 


2352 


360 


0 


NTFLAAFRIiVFCQMTSYSLALLSHKLAL 
MPLNLSDLLTRWTHCMGELFFLDILAIQ 
NPFHTVFFLGHPEWGMESRFWQAWQW 
PDLRQLQPS\PPGSMRFSCLSLPNSWNS 
PSYGRL 


2336 


16237 


A 


2353 


34 


474 


EGWRPCKEIAARQVGCPHSCFSPHWQLL 
QKQEKTAGAVSVCVCTSTLCVCVCVCVC 
VCAQAMCVCAGA/ CFCVCVCAGA/ CLCV 
CVGA/CLCVCAEAVSVCVQ/VAVSMCVC 
RS/VSLCVCMQGQSLCVCAGA/CLCVCV 
CGIPPPVLCLN 


2337 


16238 


A 


2354 


297 


16 


KFFFL.KSFFFFSFFFLTPPRFFFFFPKK 
KKIFFFPPRKIFFFFLIPPPPXFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFXRHGG 
NFEEITILTV 


2338 


16239 


A 


2355 


3 


315 


P VTPATRETEAGE TL \ HDLGE PGGRGCG 
E LRS CHCTPAWVTEQDS VSKKKKKKRGA 
RFKESNFTTPGLQRNI FFLGALKLISGA 
GVLKRRDGKTLGFPQFNRPWG 


2339 


16240 


A 


2356 


399 


154 


PGQRGE I PSLPKI Q/ELAGCG/GHLNPG 
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RSGLWCYKSSSVYLSTRGVWVRGIASV 


2340 


J 624] 


A 


2357 


416 


2 


FFFFFSETESRSVAQAGLRT\QWRNLGS 
LiQAPPPGFTPF 


2341 


16242 


A 


2358 


209 


2 


KKNLVPWPRGGYFKSLQPPPPGV/SCPN 
PPKKLEYRVLFPQPSNPFFFFCIFSRDG 
VSPCCPGWSRTPD 


2342 


16243 




2359 


279 


380 


RGYNP / WPGAVAHTCNPSTLGGRGRQIT 
RSGDQDH 








2360 


415 


224 


FFFVFFFFL.LI TF I LMLNLLPCHKFLFL 
Q FLLG YL FL.L YL \ CAFFLLVALN I F I TS 
FQQLDYYTF 


2344 


16245 


A 


2361 


278 


2 


ISDPFRFWNIYRIHTGLISLIQISENCV 
SCQKFQILEHFRFQIRDAQSVLRK\RKA 
WTGAVAHACNPS PLGGRGGWI SRSGDLV 
HPGQHSETP 










139 


282 


KKKKGGRGGGGGRGGGGXXWGGTKKKKK 
GGEKKNLWGGGKGGGKGGGS 


2346 


16247 


A 


2363 


19 


409 


PKPPSVLGGGGPARYPSPLGGPNRPVPI, 
GPGVGAPPGPPGKTPFFLKIKKTYPARG 
GPPVIPASPGGEGKKSPLPPRPRVPLTQ 
IFPP/PPPPGGPNQG 


2347 


16248 


A 


2364 


159 


383 


HSHFKNLSS I IKKLHRNNTFTEHFSLSS 
SLiNQCFLNLTVFYSHLGNFKNSN/ SWPG 
AVAHTCNPNTLGGRGGQ ITR 


234S 


16249 


A 


2365 


400 


221 


GRL.RQENCLNSGGRECSEPRS /HUHCAP 
AWAT\EQDS VSKNKKQNKQKQTH I YTVL 
LCARH 


2349 


16250 




2366 


383 


14 


GGRGCNAPRSCHCTPAWVTERASPQKTK 
KQKNTHTKKRISSCCYKMEDPLRQA\TF 
LQCPRAEG P SQKAARMELME KQEKNQGP 
ARHRRQEQPLTSRPCPDHLCWLSQVSS 
T PAQGLS LL I CK 


2350 


16251 


A 


2367 


274 


1 


PRKI LKARGKEHIiAS RGTM I RMTS DFYL 
QTMQARREWSKI L/NVLEEKI HQHR I L/ 
PVKSSFKSEEEIKTFSDKQKLRGLVTSR 
SDLGKDVK 


2351 


16252 


A 


2368 


161 


2 


PFFFFSETESCC\VTQARVQWRHLiGSLP 
GS SDS PAS ASQVAW I TGTRH YAWL I F 


2352 


16253 


A 


2369 


361 


198 


NGRLI FVFLVEMGF \TMLARIAS CDPPA 
SASQSTGIRGMSHNSQLKCFTEFDSFC 


2353 


16254 


A 


2370 


116 


300 


HLNGDAVEERDFMKCTVSGI IVAHCNLE 
FLG/SSDPSASAPRVAGTTGMCHHIWLI 
FVILVEM 


2354 


16255 


A 


2371 


2 


192 


MKLDPHLS PYTKWSRW I KDLNLRPKT I 
KILEPNIR/ITLLGIGLGKDSMTRNPKA 
IAIKTKliAR 


2355 


16256 


A 


2372 
... 


1 


133 


AGELLEPG\GRGCSKPRSCHCTTAWATE 
QDSS PE KKKKKKKGGF 


~2_ 




-ft 




43 


403 


LHDS PAIiAS QGAGTTGVSHHAR PAAGIN 
SRIGQAEDRISELEDWLSEIR\RQSGRN 
MDKRMKMNKQMLQEIRDYVETKSM/NTR 
L.IGVPERVGENGSNLENIFQDLIHENYK 
GKPIRLMVDL 


2357 


16258 


A 


2374 


404 


215 


GQGGRITRAQKFETSLGNWRPGSEAQE " 

LLE\PDDRGCSEPRSCHSTPAWTTEQDS 

VSKKRRKC 


2358 


16259 


A 


2375 


3 


397 


SKQLEFTQLYTKLNQLNKTKISDLKKEK 
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K=Lysine, L=Leucine, M=Methionine, 
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YPIRLFSKENTQMASRYRTKYSTWLI IR 
EMQ I KTTFRHKliAALASS RCLLGLG \ AT 
SAHCCTVGAPLW 


2359 


16260 


A 


2376 


413 


1 


PKKGLFPKPIIWVTPGFFPPPRFKKPPP 
KKIFGAPKKKKKSPPPPAKKFFFFKGAP 
PPFFFFFFFFFFFFFFSGDSQERVREAM 
PVAGGP\PRPHSLSAPHAPGGTAWTPMH 
PVQTHKAQSPKLPASECPPPTTPLS 


2360 


16261 


A 


2377 


398 


247 


RRFHHAGQAGLELLTSSDLPALASQ \SA 
GITGMSHHAQPSATHFQKHLVS 


2361 


16262 


A 


2378 


2 


143 


QENRLNPGGGG CS ELRSCHCTS \AWVTT 
AKLCLKKKKKKKFFGKGGG 


2362 


16263 


A 


2379 


417 


90 


FFFFFFFPGGGRLQVCPPPGILLFCFLY 
KGGSPPGGSTIFPPPPPGEVGPPGPPPP 
GGFFFFFLE/QGGVSPDGPGFFVLPTPK 
KPPPPAPQKGGEPKFKPQVWGPPWPTF 


2363 


16264 


A 


2380 


36 


427 


VHPLNHHDQKGQASSTQKKKKKKKKKKK 
KKKKKKKKG/WKGGGAF 


2364 


16265 


A 


2381 


402 


2 


NFFLKGRGWGPLPPFLPRGPPPRGGPQK 
/RGGLGGPPPFFFGSKTPPAGGSRTPMG 
QKKKGPPLPEGPLFGGAGQAPFLPPPVP 
RGGVPSPKKKKG/APPPFFPPPFFPPPP 
PPFFFFFFLRYNLALLPRLGCSGT 


2365 


16266 


A 


2382 


166 


5 


THGH V I Y VDQEMQM M I ENM / W PGTVAHA 
CNRSTLGSRGGWITRSGDIiMVKPRLY 


2366 


16267 


A 


2383 


2 


457 


TSQPSLLSSWDYRSTSPRIiANF\ILFYF 
IYFFFAFSVETGFHRVSQDGL.NLLTS/S 
/ IPS I PRI PKHWDYRHDP \RTWP 


2367 


16268 


A 


2384 


2 


417 


GRVGFSQSNGNPSSLSFTLLKVDFEVTI 
PGEGKDRI FKVS I KWLAIVS WRMLHE AL 
VSGQI PVPLES V/ QALDVAMRHLASMR Y 


2368 


16269 


A 


2385 


318 


32 


TME I MLDKKQ IQ VI F/FE FKMGREAVET 
THNINYTSGPETVQWWFKKCCKGDESIiE 
DEECSGRPEVGNDQLRAI IAHASADAWV 
DRDSGRCCCACP 


2369 


16270 


A 


2386 


409 


3 


ISQAPSTPPRKRGFR/PPQKTSFLGPPF 
YAAFYQEKKFLFFSSDPPRETGDKGKQK 
GFPPPKVAPKKKGFFKKGPPPGKKKDPP 
SFFKHKSQPTRPPRAPALEGPRSR/SAA 
LQPGDRRR/PPSQKKPTRPPTRPPTRP 














KKGFFPFGPFIFLFPPPGFLPPPPPPIF 
WFSGFCPPPPPLFYFFFRGGPX/HILVF 
PLFFLPPPFFFFFPIFAKESPPKKGPGP 
LNFFFFGPKKN/S PPPFFFFFFCEMEFH 
SCRPGWSAKWHDLGSLQPPPPVFK 


2371 


16272 


A 


2388 


410 


95 


KATMDKSDHIKLKS FYPAKETPTKVKTQ 
PPEWEKVFANYPSDKGLIPI IYKELKQP 
YG \ KKSNNS I KKWAKDLNR YFSKDIQMA 
NRCMKRCSRPGAVAHTYNPS T 


2372 


1 6273 


A 


2389 


362 


92 


RFLFFFSPPPKKGFFPKPFFFFSPRVFP 
PPFFLNPPPKLIF\GPPKKKIFFPAPGG 
KKIFFLKGPPPFFFFFFFFFFFFFFFFF 
PVENTFY 


2373 


16274 


A 


2390 


131 


487 


ATEHEKTEKSSLSFFSISKRKKKMEKLH 
DIGFSSNFLG\RPKAQATKGKTDWTSAK 
LKICSSRDT I SRMKRQPKEWAKTFANKS 
CNNKKPEKIDKNKKKKKKKFLGGALLKK 
TNIiKPRG 
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2374 


16275 


A 


2391 


408 


145 


RWSLALSPRLECSGVSSAPCKVPPPGVT 
PFSCLSLPSNWEDRCESPPPAQ/IVFIG 
EGFYILHGFF/RRGPKIRCFISGCPPPV 
LSFPT 












203 


LFFFAFSVETGFLHVGQAGLEPPTSGDP 
PVSA/FPMCWDYRHD 












16 


TPSAGGRHIELSLiSTCPSCAQHOGKEHL 
EGGEGGGAQSLTTAPSSATSSQDPISAH 
AVEDKLSIRTjETDILKTKS/WPGAVGHT 
CNPSTLGRRRQANHLRPGVRDQPGLVI C 
R P R P P KVMG XjQ A 


2377 


16278 


A 


2394 


3 


396 


QLLERLKQEDRLSTGSQGCSELRSHHCT 
PAWATEP\DSVSK 




16279 




2395 


273 


416 


FFYSFLI KIRWKKQPGMVAQACNPSTLG 
GQGGRI /TLRTGVRDQPGQHG 


2379 


16280 


A 


2396 


329 


76 


F I P I E S ACSQEC L KPNLRQ E W / Y I FGT L 
KLI FFETESHPVTQ/DWSAGELL\DPRG 
RGCSELRSCHCTPAWATRAKLCLKHTHK 
IKK 


2380 


16281 


A 


2397 


210 


7 


GGKKMYCRKPGGGGFFAAVWS PKGLFS / 
RIWKEAPILS PQKKKKKTNNPIKKWAND 
ANRHFSKEGTQQ 


238) 


16282 


A 


2398 


48 


393 


SILKTRKPPLKKGRGRRKKEKESVRTHV 
FFSYQSNAFPSKPLNNSMTLGRLLSFSF 
FLVHFFFYTDGILLCCPGWSRTPGLKGS 
SHLSFPKCWDYIHEPPHPAYHS/L.FCR 


2382 


16283 


A 


2399 


128 


383 


EEAPKHFPKPNLH/ QKKVLVTAWWAAAG 
LIHCSFLNPWETITPEKYAQQTNEKHQK 
LQCLQLALVNRKGPILLHHNVRLHITTH 
AS 




16284 




2400 


54 


384 


LFTFILNSVFHTYMCLYFWTLFFSVNPF 
VSMPIPQCLDDSSFIISLDSEINPNIYS 
QL I FDKKTQRGKNS LPDRWCWENCI FTY 
KRMKWDPYL/SPYTKIISNWMKDLNIK 














HRGENTHQQGGGLSRWRRHSRQRGTSRW 
IRHTRQWGPSRWITPSRQRGSSRWRKRS 
RAGDI E VEKTQQSEGALEVR IRSRQRGT 
SRWRSFSRQRVSFRWRICSRQRCSS/KV 
RRRSRQRGTSRWRRHSRQR 


2385 


16286 




2402 


1 


330 


RPPPPPHCWDYR/HEPPR\QPTLWVIFK 
LS VETRLCYVAQ IGLELLGSS KESSRLD 
LPKCWDYRHELLCWMVIFQEKLV\SGFL 
FKIPRFFKAGMEIFKQIEGFWSSHPLAT 


2386 


16287 


A 


2403 


3 


407 


ADAWGLRGTHGPWEQAGISGISPSNSFL 
FVCFRQS VALVS QAGEQWRRLGS LQPPP 
PGCGRESC\PSFQGGITPSCFFVFLEKM 
GLHRVGQAGLQLLMSGEPPALCSQRRGI 
TGVSHHARPPSKGFIWHTGAPAM 


2387 


16288 


A 


2404 


239 


573 


VCFGVFSLSHFFESEFSSITQAGVQWLN 
TPPLVGGGSAGLRGG 


2388 


16289 


A 


2405 


1 


122 


PTRPGRGCGELRSCHSTLAW/VNSETTS 
QKKQRKE KKLPLL 


2389 


16290 


A 


2406 


417 


130 


QALRVKHVQLVPSSDFLAKTRAQSVNNL 
LRHSSSSSSSSSSS/SSSSSSSSSSSSS 

sssssssssssssssss\ssssssssss 

SSSSSSSSSSSSSSSSSSSFLFLFFFFG 
LRGAFLGRFCPIRR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


1 Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 

nucleotide insertion 


2390 


16291 


A 


2407 


327 


3 


RQSQSLRYMAHQPVCCFTCSILIRAELN 
EI/ETKNKKIEKINET/RKSWFFEKINK 
MGAITPDATERQNI IQGYYEQLYMQNLE 
NLEYMDKFLGRFKP P S LNQE ELDTLNR 


2391 


16292 


A 


2408 


178 


404 


AAYRINSLQEVQHPTKRHSQPRKQSKLR 
EGNAQRNNIRRGPGMVAHACNPSTLGGQ 
GGQV/TLRSGVRDQPGQHG 














TSFALVALGGRSCSEPRSCHCFPAWVS\ 
SETLSQWKRKTPTLNNAKYWGGRFLRFF 
FLTRGGGGRFFFFGGTKTKSLGAGFKKG 
GGGKPGGPPNKGLG 


2393 


16294 


A 


2410 


1 


384 


PTRQVRIKLFPRIYVHTRKRLKNIYGNT 
ETF I TQ P FKNHS VWQL I RNNRNAKKQEN 
VAHNEEKNKKQS I ETN\QKYTHNKNLDT 
TKKKKKKKKKKGGG P LKKT I FKARGGEN 
NFFFLGPQKLNSGAGF 


2394 


16295 


A 


2411 


423 


2 


FFFFPPPQKKSLFPPFFFCGPKIFSSPP 
VFLTPPQKIFFCPPKKKKYFPPPRGKIF 
FFLK\PPPPFFFFFFFFFFFFFFFFFFF 
FIFF FFFFLGFLLS S LLIGVP FLLLNAN 
IMILLQKKILiKDEVQI ISSFPSTKNRKV 
L 


2395 


16296 


A 


2412 


218 


112 


LKKENNTKCYYGWGTTGTLIHC\WW\KL 
IQVLWKTD 


2396 


16297 


A 


2413 


64 


403 


LKNFFFFFFFKKKGPPGAPGGSKPRGLG 
EPPPPPPQRGGNTGGGPGARPKKNGGGF 
FFFWRGEPP/PPPPAKGGGGDLGPGPFG 
REETNNFPAPAPPGPKNKNLGPRAQKIL, 
CF 


2397 


16298 


A 


2414 


217 


409 


VKYLSYSVLLTIICTVQACYQELRPGAM 
AHTCNPSTLGGHGGRI /TLRSGVRDQPD 
QHG 


2398 


16299 


A 


2415 


416 


0 


ERPPPSSFFPPLAPGQVGQFFYGKGPWP 
EKSKKKNRFWGTKPFPPEGGGPPPPPNS 
PPGGGEKSPLFFFGGKGNPTWSPGPKFL 
KKRVIKGPLHPGGKGYKKTREFGLKPGP 
PPPKFFLGGG/PGFQNWEKPPQP 


2399 


16300 


A 


2416 


277 


0 


PFVLLSSSSSSSSSSSSSSSSSSSSSCS 
LLVRYLCKVKHKSSLCAHPGTVPFLELI 
HDTDSFAH/VFPADLL 


2400 


16301 


A 




205 


486 


RHGYGFGFWKNTCMQHRHSYRRVYIHRH 
IHLTFLAYVHRES PEAMS ALLTQILLFF 
KIjLSFIiRRCIA/EVTQAWQWCSI.GSrjR 
PRPPGFKRFWC 


2401 


16302 


A 


2418 


364 


1 


KESLGDPVKDTV I LFATRNPMQMGSNYQ 
FFIYLCLTHLLrSRSYCWFGCKMLQ/PLW 
KTVWQFLKKLNAEFPYDPAIPL\DICIP 
VFTASLFI IAQRWEQPKCASTDEWINQM 
WHMHTI EHYPR 


2402 


16303 


A 


2419 


281 


492 


LFPPSLPASHPKLSTQQPERSHSVTHCG 
/VDRRGSDLGLLQPRPPRLKPSSRLSLW 
SSWDYRPAPPRPANSC 


2403 


16304 


A 


2420 


63 


465 


RLQRFLLS F \ VCLC PAP I GGAYRGRQAS 
LSCGGLRPVRASIiLLCLPKQAWAMAGAP 
PPASLPPCSLISDCCASNQRDSVGVGPS 
NPGAGYYLVARRFLSPLEKRSIQVGVTL 
FSRCRLSPLSLTRKGNSLTPCAS . 


2404 


16305 


A 


2421 


61 


482 | QRARITGVSHHAQ/LRFCLFDMRSHSVA 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: Of 
peptide 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 

ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 

Q=G!utamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QAGVQWCDNSSLQPPPSGLKML/STSAF 
QSL/WDYRCEPQHPIHY 


2405 


16306 


A 


2422 


90 


444 


YCFSSECSEKCRCPGHDLQLSPGLCHHG 
VSGLAKPLLCFWPWIWKPQPLCSPTTSL 
PFLPLPPFSQCPQGPAWLELEGRPSLWK 
QGGLQSLAIKR/REGSRAQWLTPVIPAL 
WEAEAGG 


2406 


16307 


A 


2423 


1 


444 


PGPGPGCGPVATLGSPSRPARTDSPSL.P 
PHSQLREAEARNRDI EAHVRQLQERMEL 
LQAEGATGESLMCPLPRRT/WEEVGRLL 


2407 


16308 


A 


2424 


189 


418 


SRRAEPGS FRGCLRVGVPS CTCV\S LWV 
CVCVCWG / VCVCVCVCVCARA/ CLCLCA 
NFSL/CSHVSLCLSLSLSL 


2408 


16309 


A 


2425 


30 


895 


LDEQCTSEIHRRGEATARPRAPEHPAPP 
ATAVRGRDAAS QNLKRR PGSGTDGLRLQ 
GAEPSRLLRTYAGGAVIPRGTPERAQPP 
PPQDPLGRRRWLSRNTWGPWPGTTQPPS 
PQLIiRN D WGS CG FM VPEAARG KVFQDS Q 
EGAHIRRETVS KS VCAE PWRHQRARD PA 
PTNFPL1KCQKQRGASTSSGQHGDRVNL.V 
FFIDDDYSPPSKR/PKTNEPPQPPVPEP 
ANAGERKMREFNSGPHNPVEETKLICLC 
PSGHASCQVHLWTGAMLLGFQSWRKT.PG 
SGLKARILQ 








2426 


494 


154 


SSRVRCQCA\LLGGASQLGCSGVRDPLE 
EAVCPFSDLQPRAGRTTALFKAQMEMQK 
SPVFCVAHAGSCRLELFLFGHLGSSLHG 
LWN 


2410 


16311 


A 


2427 


416 


1 


PQRGPLIiALEPGRQGAAPVEDLQPQGPD 
KPPPPPLPQPFRARTWTAAVPRHPPPV 
ACHPPQPLAASKPWP/SVAGGDLLPLPG 
PERPVHAFFIGFIFVHLGLGGVSGRGAV 
APARSGPVPPRPPSSSTSRFSLFFLHE 


2411 


16312 


A 


2428 


84 


409 


DYKHAPTMPRIQNFVYHSPQQPRCENRI 
NFTPKKKKKKKKKKNKKKKKKKKK\KI F 


24J2 


16313 


A 


2429 


1 


389 


LRDLSSDRSNPGRFLSTSNSSLY/EKDK 
RNKAYFTK/RPSPVNDIIST 


2413 


16314 


A 


2430 


456 


0 


PGWPGRGAP \ PS QTG WPGRGAPH I PDDG 
QPGRGAP \ PSQTGWPGRGAP \ PSQTG WP 
GRGAPHLPDNGWPGRGAPYI PDDGQPGR 
GTP \ PSQTGRPGRGAPHI PDDGRPGRD / 

P 














CRERRSCHCTPAWAT\SETPSQKKKKKI 
FFGNGPPGGPQAGLKIjRAWGFFQKRGTM 
GPGTKNHPGQRGEP PLLQKKQKITRPGG 
GGPGAPPPQGGGAGKSFNPGGGMFQGGE 
IP 














RPLRRLRQENRLNRGSRGCSEPKL>C/HIj 
CTPAWAT 


2416 


16317 


A 


2433 


1 


239 


QSFAVXPRLVSNSWAXVICL/PSVSQKA 
EIT 


2417 


16318 


A 


2434 


3 


464 


DWLQLEMQGEIVALVHSHHGGLPWLSES 
ARRLQVQSDLPWWL.VCRGTIHKFRCVPH 
LTGRRFEHGVTDCYTLFRDAYHLAGIEM 
PDFHRENDWWRNGQNLYLDILQAPGLYP 
VPLSAAQPGDVIMCCFGSSVPNHAAI / Y 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

btide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, JL=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CGDGELLPHIPK 


2418 


16319 


A 


2435 


3 


427 


EGARTSSERHPCNKYLLLHSLAHLFINA 
LNLVTiKGLS PS PFPALP IS FP I FFFS PH 
FLGTPTLEGGRADLPFLQPPGAPG/QPA 
PLNYGPGPYRNPCP/RLPQLKPAGPGHG 
LLKSPPPNPGRNWPLLGSLFDFFKKRTS 
IPLP 


2419 


16320 


A 


2436 


238 


400 


QFRWKRGKAIFFFFFLRQSPSVTHAGMQ 
WSNLSSVQPPPPGFKQFLC\ PSVPSS 


2420 


16321 


A 


2437 


11 


442 


LGTR P RATD WGVRL ELS RAC P VLGK P AK 
HPRPQRWCKVNFSYSPEQADELKLQAGE 
IVEMIKEA/CGNPDMPSVSPGPQRPPKT 
TEDKGWWEGE CQGRRGVFPDNFVLP PPP 
IKDAQPLLLFGIDQEAGPTESGISGIRF 
RRLSC 


2421 


16322 


A 


2438 


411 


80 


PQAEEGAPTPGSENFNPPPPRGGGAPPP 
PQKNFFPPRGVNPGGGGGQKRPPPKKGG 
/SPKKNPGGDKNPPPKKKKNIGEGGGFI 
GAPRGTPKKTPPPRDGYFQVFFFIVSLK 


2422 


16323 


A 


2439 


290 


1 


QLNKIKKTPLLFLPWANEKVPKIPPNYP 
PGPKKKGYPPSIFFFFFLRQGL/NSVTR 
AGVQWRDLSSLQPPPPGFKRFSCQKPNS 
AFPHASADAWVDP 


2423 


16324 


A 


2440 


440 


213 


PFSRPLFFFSPPPKKRASPPPFFFCFPR 
VFFPPPFF/SKPPPKFFFFPPPKKKKIS 
PPPPKKIFFFFSPPP 


2424 


16325 


A 


2441 


131 


408 


GCVP PEPAFL CFVLEI VS S VAQAGVQ WR 
NLSSLQPSPPGLKRFYRPRLPSS/QDYR 
RAPPLA 


2425 


16326 


A 


2442 


102 


351 


QAS S S VLKLC VCVRARLC V/ CACVCAC V 
CVC/VCVCECVCVC 


2426 


16327 


A 


2443 


190 


3 


PQGAREKSHRPGPIGRRLKL/DPPFIiSP 
HVKINPRW I KDLNVKPTT I KTLEGNLGN 
TLLDTCPG ' 


2427 


16328 


A 


2444 


410 


30 


VCVAPPLCVCIGTLCVRTHTQFCVCVHT 
HSLCVCR / CQ FLC VCVQ/ VAVS VCVCRA 
SLCVCR\ALCVCVCR\TVCVCVHGTVCV 
CVCRDS FCMCVCAGAVCVCVCVCVCV\ C 
LCVCVASAQLLGRGFCSSVNI KGAGP 


2428 


16329 


A 


2445 


243 


1 


KVMVQNKGPFSNLFFFGPTINFFTPQFK 
QGGGQNPNPFIiFFFFFMRQGIi/DSVTQA 
GMQ WCHAGSLQHLP PGLKQ FS CLLP 


2429 


16330 


A 


2446 


276 


408 


MLKNCAFWPGTVAHACNPSTLGGQGGRI 
/ TLRSGVRDQPDQHG 


2430 


16331 


A 


2447 


24 


405 


LGDVCAFFFFFYLKNQLNPGGKL.ILPVG 
PAGENQRWEQFDKLQKGGIKMNPLRGGI 
SVPLTKKKNQ/WARGE 


2431 


16332 


A 


2448 


239 


3 


SPLCGNNVYKPSTVEKTNQVEKMPPSKQ 
IWELCVELLWQ/SNRGIAGSGAHACNPG 
T LGGQG WR I MRS GDRDHPGQHGE T P 


2432 


16333 


A 


2449 


406 


144 


GCSEPRSRPCTPAWVTSETLS/RKKRKR 
RKKKSCLLRAILSTSPELTHFTLTTPLF 
SRYNDYPHFRDEKTEARRVYATCSGSHS 
WKRLG 


2433 


16334 


A 


2450 


349 


1 


GGAI FLTPRPKVPLTPITPIiAFNPGHQE 
EIPLPKKKKKIVRAIYDKPTANI ILHGQ 
KVEVFPLKTSTRQGCPLSSLIiYQHKTRM 
PSL\ PLLFNIVLDVLARAI KQEKEIKG I 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 
beginning 
nucleotide 
locution 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, ., . 
E=Glutamic Acidj F=Phehylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QIR 


2434 


16335 


A 


2451 


382 


19 


DLGRMTAGSGDQRCAVGVKLliS FSLGAA 
GKGPKHSEAEASLPRNPGSYNRQAQWPQ 
SSLHLKGTIPDTTSLNTPWKMLSSLKVP 
SWQGAVAHDCNHSTLGG\RGGWITRSRV 
RDQPGQHG 












396 


FKLKLML I YHSENPRAIiKNYTKSTLSML 
YKWNNKAWMMAHVFTTWFTEYFKPIAET 
YFWGKKKI TLKI LPLVDNAPGHPRALME 
MYKE I NV/VQI C I LQPMDQGVI STLRSY 
R\KNTFCKDIAAIDGDSSDRT 


2436 


16337 


A 


2453 


115 


411 


KGGNFPPPGGGGEKKIW/PNGGPPPRGK 
RKPPPPPPGGGGKGGHHPPPGPIFFFEK 
KKKGLFGG/PGGAPNSHPKRNPPPWFPE 
GGELTNPPTFFFEGGAF 


2437 


16338 


A 


2454 


3 


114 


HKV/GQAGLELLTSDDPPASVFQSAGIT 
GVSHRIRSVS 


2438 


16339 


A 


2455 


397 


1 


WSWQK\NRRIDQRSNIESPEMNPRMYGQ 
VIFDKIAKNTEWEKDSLFNKWCWKNWI / 
LKRMKLDHPHLTP\KQKSKWMKDLIRPE 
TEKLPEETGGNPHDIGLSNDF\LDLTPK 
AQGAKVNTDKWDNX KLKNFPTRP 


2439 


16340 


A 


2456 


424 


3 


PKKKK I FFPP PGFKI FFFLRAP P FFFFL 
CLSHFLLNRSRS LS CTTSCCVSTI PTSL 
CNKSSGV\CGLHCSLiLiAI / CSLIHLTLC 
PFCVLLVCMCDTVCVCVCVCVCDTVCVC 
PCPYGTIjDIAFKHFFSRWSLTLVAQAGV 


2440 


16341 


A 


2457 


266 


379 


HWPGAMAHTCNPSTLRGQSGRI /TLRSG 
VRDQ PDQHG 


244] 


16342 


A 


2458 


10 


409 


SRTGPNPRAQTDRPWCVAFACFELPLW 
LCHCTPAW/TSETL 


2442 


16343 


A 


2459 


184 


387 


IVHFQMHKMINVAYI IPQFYSFIbEIQS 
HSVTQAAVQWHSHSSLLPLPPGLKQSSH 
/LSLPSSWDY 












1 


KNRVSFFF/ CSFETESRSVTQAGVQWCN 
LGSLQPPTP 


2444 


16345 . 


A 


2461 


380 




FFFFFSETESRS VAQAGLRT \ Q WCNLS S 
LQAPLPG 


2445 


16346 


A 


2462 


382 


29 


NGPGHGGPPVIPGTWGGQGGGFPRF\GS ' 
KPGFTWGNPPFLKNHKNYPGGGPPPVI P 
NFLGGKPGNFFYPGGGGFQ/SGPGAVFP 
PPPGQRRSIFLPKKKKERKLILFFSLEQ 
RLQNCDA 


2446 


16347 


A 


2463 


228 


1 


KKGTLFKPPPPGGRFFFFFFFETQSRSA ' 

VTQATVQ/WMPSRLTANPTFRTQGILLP 

PGLKGSSCLSLPSSWDYRHV 


2447 


16348 


A 


2464 


395 


1 


RLRRENHLNPGGGGCSEPRS/HPAWQQS 
ETTS QKTJCTKTKQKAIiASDTVLiS PRQSS 
EERFHLSLFHSSFVTPFG I FS FLTTSPH 
PILPVWLAPQLPAIYCYGDSVIKSNFQL 
WLSKELQESLSRLLWTQAFS 


2448 


16349 


A 


2465 


29 


299 


ETPNEASPKTSWDYRHVLPCIiANFFIFF 
VKTGL/HRAGLELLTSSDLLCFPKCWDY 
RH\DRSTWPLSS VFGS I FLVYYWPLYL I 
ETNMLTTL 


2449 .. 


16350 . 


A .. 


2466 


1266 


1473 


YFVPQSTQNHAAVFRVGSLLQEGCGKIS. " 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteinc, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leuctne, M=iVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, "=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














KLYGDLKHLKT\ FDRGMVWNTDLVETLE 
LQNLM LCALQTVNG 


2450 


16351 


A 


2467 


49 


356 


VQVI LLPQDLRRLRQENCLN PGGGG CSE 
PRWSHCCP / PAWVTEQDS I SKTKK 


2451 


16352 


A 


2468 


115 


3 


LSHTKWSAWPGAVA/ STLGGRGRQITRS 
GVQDQPDQHGE 


2452 


16353 


A 


2469 


3 


404 


FVAPGGGQGTFPGSLQPWPPRGPGASGS 
PSRAPLIFGFFGGKGVSPVGPGGFLIiPG 
SRD/SGPPGPPKGWGSG/GLGPAPGP 


2453 


16354 


A 


2470 


404 


218 


FAQWIKWGALSSPKPPFPGFKLFSRPS 
PPSYWDYR/RRPRPPRLFFFFFLRQGLA 
LLPRLE 


2454 


. 16355 


A 


2471 


386 


258 


AGGIiLSPGVQGYNKXiLSCPCKPAWTT\S 
ETLS QKKKKKKLP FY 


2455 


16356 


A 


2472 


2 


364 


HHTQI I F VFLVETG FHHVGQ AGVELLTS 
GDPPASWDYGR\GHRTWPYSHI FNNL 


2456 


16357 


A 


2473 


1 


292 


DRGCSEPTSCHCTPVWATNSETVSQKKK 
KKKKKT PLLRG P E KKNGNP P PRGFFRKN 
PGFKRKKGGAPPPGLFKIGRKKSPVGGH 
RHTFILGGPLPRF 


2457 


1635S 


A 


2474 


1 


307 


SliSCSSIVRRACFPFTFYHDCKFPEA/S 
PVMLPVKPVEL 


2458 


16359 


A 


2475 


2 


596 


MKNAEDILTMEVLKSTMKQELEAA\QKK 
H3LWEL.LRIPNICKRICFLSFVSSSSSS 
SSSSSSSSSSSSSSSSSSSTKITAW/LP 
PJjEAS FYRSTCLMP ARALLFAST I PFWG 
LTLHLQHLGNNVFLLQTLFGAVTLLiANC 
VAPWALNHMSRRLSQMLLMFLLATCLLA 
IIFVPQGEKSSQVEERKCLSLFSQGLPW 
SHLS 


2459 


16360 


A 


2476 


367 


44 


YQEDITIMN/TYALNIGVPTYLANI/DL 
NREIESNI I IVE YFNTSLSKMDRYRSSR 
QNIDKETVDLKYS I /HINQMDLTDRYRT 
FHPTATERDSISKKRIKIKIKINKSLT 


2460 


16361 


A 


2477 


12 


362 


HHEPG\GGGCSELRIiCHCTPAWVTELDS 
VSKKKKKKKSQKKK 


2461 


16362 


A 


2478 


401 


1 


FFFFFSETRSHPVA\RLECSGAISAHCN 
QCI 


2462 


16363 


A 


2479 


3 


353 


YMCVCVCACVC I CVLP WMCLCVC / CYCV 
CVCWYLCVCLCVSLCGHQHLAVSGKRSQ 
PPSHESFKTSLLLWVFKNLPPLLCAWVP 
QVSQNMPLDGTRTLNNVSLPDEKVHDLL 
LPLTAA 


2463 


16364 


A 


2480 


57 


361 


MNESLLSCKRHSLLAFSHI/CVFCLICS 
TDISALCVTVCACMHVCGCRCVCMCLCV 
CLCLCVCRCMYGRVCAPMCLCVCVHCVL 
ARMCACGCGCVYKAECPI 


2464 


16365 


A 


2481 


3 


381 


YMCACGY I CMHA/ C VPVCMCVCTCLCAH 
VCL/CCMCVCTFVPVCMCVCICVCLCVC 
TP I ITLP FLSQERI S FCTDLRS FRATAK 
RSHEEVKNCVYLQTI WNQYL I I FYLQNP 
NFGWAQWIiMLVI PAVWE 


2465 


16366 


A 


2482 


2 


380 


IHVLGNCLYFSEPQFPHWQQRITRSFLE 
GYGENLMRL.HMSYIHMCLHI CAYICI /C 
/LCVYI CVLLCAYICVCVYLCVCICMCI 
FVYLCTHICVFVCVLVCIYLCV/C/VCV 
CICVCI 


2466 


16367 


A | 2483 


296 


1 


NGTTIRRMIiPIFSRYRPGMPGLVQCTIQ 
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SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=lviethioiiine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconinc, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SCTANSVPYSHGFIYFFIYLLRDRVWLC 
YSGWSAVAFSWLTITSN\YRLKPSSCLiD 
LLGSWNYRQMPPCI 


2467 


16368 


A 


2484 




319 


NTVGLCVCV / CLFVCVLSLCV/ CLSLCV 
/CLCVCWSLCVC/VSLCM/CVSHCVCL 
CVYLWLCVSVCVSLCVSLCVCVYECVIS 
STKRSLGAPSRAEAAKLPRNWAPAKQSH 


2468 


16369 


A 


2485 






NHFIISIDI/DKHFRKIHQPFVPKAITK 
U3I KGNFLiNLLRGTSVKSTGNINM\NGE 
KPNNL P LRL V I QGD VLFHL LLE VLAS A V 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KK 


2469 


16370 


A 


2486 


17 


409 


CLDNKKPRKEYCQKTHLI KKNPLLSR I C 
KELLKLYDKRMINPIKKWAEALYRCLSK 
GDV/HEMASKKMKRCSIS/L/SIRKMQM 
KMTL 


2470 


16371 


A 


2487 


410 


165 


LECNVTIMAHCNLRNLGSSDPPTSAS \Q 
TAGIHHSLQHFFFLFFVETGFCYTQAGL 
ELGSSDLPTLASQSVGVTGTGHGTWP 


2471 


16372 


A 


2488 


142 


417 


VEQLLEGYRTKSLYLiRSFFLFVCMFLRW 
SL/DSVSPGLTNCHHLGSLYPPPPGFKQ 
FSCIS\LWAWRRTP 


2472 


16373 


A 


2489 


1 


404 


KAGAQMGIKKPQHSSRLRLRRPSPRLPS 
SQKPLNTHYFPS FS FCAHRHTHTHTHTH 
THTHTHTGV / HLH PHT 


2473 


16374 


A 


2490 


373 


146 


YGLERKISFKIL/LFVGKSPSYPRAIiME 
MCKINWFMHANTTPTLQPIDQEVILGP 
GTVAHACNPGNLRGQSGRNA 


2474 


16375 


A . 


2491 


427 


85 


KTPPPP\RGPFFFYFFFFFKKAKNFFPP 
PPTPPLGKKKNP\PPPEKKISPPPGFPP 
PPFKKGPPKTLFKTPLKKKKGPGAPQKK 
PFFFKTPPPFFFFFF 


2475 


16376 


A 


2492 


443 


240 


FLFFFFFXXFXXFFFFFXFFXFFFFXFF 
FFFFXXXXXXXXXXXXXXXXFFFFFFFF 
FFXGCLFIYLFI 


2476 


16377 


A 


2493 


3 


135 


I S AH CNR \ LLL PGS SNSTAS AS PVAE I T 
GACNNNRQDFS FI I LQI 


2477 


16378 


A 


2494 


37 


420 


AHVCVCTCVCVCICLSRRWVCAHSHVC 
AG I FLHI*I NQTFTTHL/ CVCKVMALR VT 
PSPCliQGA\QTQIAGQCMCAKVCMNFMY 
IFIEGHICPQTCSCLGEPIRGGLSPLVC 
IC/MCAGFFVSFC 


2478 


16379 


A 


2495 


402 


178 


RQNKTPFGVKKQ I TGGGGRGRVS YFLRG 
VSQENPLNPGGGGCSEP\RPPGWGTKNP 
LiFQKKKKSSKQQEGRGED 


2479 


16380 


A 


2496 


138 


439 


RTFILGDHHHKPSAVHFHLPRRKLCTHE 
TGAAPPPPALGATVPPSVSVCLTGGGPQ 
ITPPPPALGARDLPSVSVRLTMGGPQIT 
PPHPDQQPL/SPPSVSVC 


2480 


1638 1 


A 


2497 


118 


382 


VKSFCASMRACVCECVYMCVSMCVCVCV 
CVSACLICVCLCQ/CDLINKCV 


2481 


16382 


A 


2498 


164 


439 


VKSFCASMRACVCECVYMCVSMCVCVCV 
CVSAClilCVCLCQ/CDLINKCV 


2482 


16383 


A 


2499 


1 


411 


ISTS FYTDTINRTALHFAVGRNHLSAVD 
FL.LKHKARVDVADKDGMNALHFATQSNH 
VRIVEYLI QDLHLKDLNQPDE PKESPLH 
LWINNHITWNSIjIjSAQHDIDIIj\IRS 
S KP PLHVAADRGNVKL VE/LLLKAGCD 
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1 SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2483 


16384 


A 


2500 


80 


404 


RTAAAVSTVSFPQDFEGQSPKCTQGVRE 
ALRQIK/RLVPTGSLRH/WPAGSLA/LC 
QPLSDEKDLTQLFMFARNAFTALAMMDY 
PYPTDFLGPLPANPVKGRRRIiPRRRERP 


2484 


16385 


A 


2501 


215 


1 


AAIQQGSLACSHSVPPATTPRAYTPVPP 
QLLVRNF\YPKTLELRSQLRCARRFPRE 
TGADCRHAGAGRQTK 


2485 


16386 


A 


2502 


423 


214 


WQAQVGGLLEP \GGRGCCELRSCHCTPA 
W\TC \ GE I LSQNNP KRQQNKTKQKMG PG V 
PGSWGS PRAGGLTV 


24S6 


16387 


A 


2503 


294 


1 


LHWNQRQRCPPPPPPPLPPPPPLPPPPL 
PLR \ HR I AGAGQTGAKIRWGRWRVGA 
GLGVAAA/DSRES CPDGAAAGGGEGGGD 
SAATAGPGGGGGGGTK 


2487 


163S8 


A 


2504 


331 


411 


PFLKTGNKGAPPPTKIFFLFFKKKGG/W 
PGAVAHACNPSTLGGRGGRITRSGDRER 


2488 


16389 


A 


2505 


407 


195 


GGRGCSELiRSCPCTPAWVT\GEVLiSQKK 
RKSSNKKSRCLEDQAEVGGERQLVQVSV 
PSTCQRRVGAWFIYL 


2489 


16390 


A 


2506 


1 


468 


RQGSMNKLETERQIKKAPARNPERERER 
ERESKKGEDRHTD I CRP/RERERQNYRD 
RQ 


2490 


16391 


A 


2507 


27 


380 


FVCLCPAPRGGAYRGRQASLSCGGLHPV 
RASWLLCLPNQAWAMAGAPPPASLLPCS 
LISDCCASNQRDSVGVGPSEPGVGYNLM 
VRRFLSRSEKRNIRVGVTRFSRCVCHPF 
L*LGKG 


2491 


16392 


A 


2508 


396 


234 


RQENCLNPGGGAS S E PRSRLCT PAWATE 
RDS I SN /QNKQTNKS HILRAAFS RPS CY 


2492 


16393 


A 


2509 


I 


421 


AKKIKPPFPLKKKKKPGRGGPPPPPPP/ 
AKFGRETPLNPGGKGS IKPKLGPPPPPP 
GGPPNLPPKKKKKKAIPK/PLVPSHVKS 
TGFPPSCS/RSALLRAFTQ 


2493 


16394 


A 


2510 


197 


3 


TGP P D FNFFF FLEMKS HS VAQAG VQWRD 
LGSliQP P S PQ FKQ F Y YWGLQ GA / LPP CL 
NKICIFSRGGV 


2494 


16395 


A 


2511 


114 


454 


QGPLEKKGAPFKNKFFKTGVLFFGSQNI 
P PWWFKKGFSRTPFE IKKKNP PG I LGGS 
PKKFFFFFFFEMVLL.HHPGWSALAQSLL 
TTASDF/LRLKQSSHLSLSS 


2495 


16396 


A 


2512 


3 


495 


FFLSRGLFIHLESAPAIFQCLLFWFLV 
FVATGSHYVPQAGLKLIASNEPPASNSP 
KCWVYRHE YPALKCLCFGLCLF/ AFLP 


2496 


.16397 


A 


2513 


25 


437 


PLLFSPPAGHAMEEDILPPAPSFLFYFF 
FYWLVWVKI I YLYVLAQAATTNYHRLSG 
LNNKHLFIiTILEPRMp/SIKVPA 


2497 


16398 


A 


2514 


2 


256 


QAPSAEGEMTSYVLLAYLTAQPAPTSED 
L.TSATNI AKWITKHS \NAQGGFYYNQDN 
LPYSLCSGRNESTAFGTNGQDI IHPVHS 
S 


2498 


16399 


A 


2515 


2 


284 


KCQCDELCSYYQSCCTDYTAECKPQVTR 
GDVFTMPEDEYTDYDDGEENNNAIRHEQ 
A\GFTSVISDLQAQI IWISEQKHAMSFV 
EEMTVLEVIL 


2499 


16400 


A 


2516 


2 


432 


RDCERESSRAAEYTASLKASCWMGDLAE 
SPSLDPQVAMGTPPAGPCGWSVGTRSQP 
FPRGPSRGPCCVSPPSWTPEPGTGQ/CG 
EGGGRWSPQVPP 



697 



\ISDOCID: -cWO 0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Jsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Clutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2500 


16401 


A 


2517 


401 


2 


GGEAAIRGQTKPQLEGKAPECSECEKKI 
VRS \ PRL I RRQRTP TGEKPYE CE E CGKS 
FSRRYRLAQHQR\TRAGEKTYECNECGR 
GFSERSDLINHYRVHTGERPYKCDECGK 
NFSQNSDLVRHHRAHTGEKPC 


2501 


16402 


A 


2518 


398 


1 


ET FG K \ S GGRS I VPG Q FLAVG P KGRAVM 
I S A I E KQ KL> VY I LNRD AAARL T I S S P VE 
AHKASALVYHWGVDVGFENPMFACLEM 
DYEEADNGSTGEAAGNTQQTLTLYEIiDL 
GLNHWRKYKEPLEEHGNCI 


2502 


16403 


A 


2519 


384 


3 


GGGID/SDASLVIAGVRLEDEGRYRCEL 
INGIEDESVALTLSLEGEALPLPPHSCV 
AAGPPRLGLPGLLPSISSAPLGTPAPSP 
RPRRPSSPSAPIRWPSPGSPPPPRCGVS 
VPTQPGPVPVQLLRGVY 


2503 


16404 


A 


2520 


1 


426 


GDRQMITALLRKLKQQSRESVEENRPRL 
LKALKELGDFYLELHWDFQSWVPLLSRI 
LPSDACKIYKQGINIRLDTTLIDFTDMK 
CQRGDLS FI FNGDAAPS ES VWLDNEPK 
VYQRLRHEES/QEKNRTQVDCLTP 


2504 


16405 


A 


2521 




425 


ALiPIGRMPIMVRSSRKCVLTGKTPAEFA 
KliNECPLDPGGYFIVKGVEKVILIQEQL 
SKNRIIVEADRKGAVGASVTSSTHEKKS 
RTNMAVKQGRFYLRHNTLSED I P I VI I S 
KPMGVESAQEKIQTVSPQEPEWR 


2505 


16406 


A 


2522 


2 


376 


I QS FS WYG YHLCS QG I LSERVSAS KFP 
IiLFFFFFKKGGPPFMP/QGGKTKGPLKP 
GDPQSPGIKGFPCLAPPNKGKKGGPPPP 
RGNFLGFKKKKRFPQVGGGGSKPPREGK 
PPPI.APQGGGNRGG 


2506 


16407 


A 


2523 


403 


3 


GAPRSLS EKERQLMGM INQLSS FREQLL 
HAHYEQKKIlAASQ I EKQRQHMKLGKQ\Q 
QEQIARQQQRLIQQQHKINLLQQQIQVQ 
GHLPPLM I PVFPRDQRQLDALAQQGFLL 
PPGFSYKAGCSDPYPVQVIPLY 










2 


369 


N AR CL I L RAAE YMAA C VYVC I S VFACMC 
FCVRVCVPVCVSGCLCVCVSMNVHLSLC 
/VEQCACLWMCVSLHVCLYICACLCVSV 
CEAVCLHLESHCRGGSRPFPWGPAFSL 
SSCLILAPPSVT 


2508 


16409 


A 


2525 


59 


380 


RNRNFNKNSSFFFFFKKGPWGGGPGGTK 
GPKQRQGEP\NHWGQKNFPGQTRKPRKP 
TPGPPPPKKIFFFKPLEKKGPGQGPQGG 
QKPGGQKKPPGGAPQKRGKTGGET 


2509 


16410 


A 


2526 


1 


371 


VMPLSRKHPKGFFSNHTLVLHVARSEMD 
KERVFQATRKSSP/CFVPLLPWPT 


2510 


16411 


A 


2527 


445 


3 


rwlglal ialkreiikkgdl v pemrwwds 
siipngNfdlteetpkredyfgianlve 
hpaqlnppvdndtpvtlgvyltkkeqkk 

LRRQTRREAQKELQEKVRLGLMPPPEPK 
VR ISNLMR VLGTEAVQD P TKVEAHVRAQ 
MAKRQKAHV 


2511 


16412 


A 


2528 




478 


RPTRPKRKLKSHRLQSRQESKKVRVI, 7T~ 
NAEDNEME E E TDDG P LLV P RVKVAEDG S 
IILDEESLTVEVLRTKGPCWEENDPIF 
ERGSTTTYSS FRKNY YS KPWSNKETDMF 
FLAI SMVGTDFSMIGQLFPHRAR I EI KN 
KFKREVYASAI EDQATS VHT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2512 


16413 


A 


2529 


385 


1 


QKKCEDLK\DQDNPIVRPPPTPGSCGHG 
GWNISKPyWQLRWRGFYYLYGAWGRD 
YS PQHPNKGL YWVAPLNTDGRLLEYYRL 
CNTLDDFLLiYINAREIRITYVQGSGIAV 
NYNNKYVSMYNTVNV 


2513 


16414 


A 


2530 


] 


1228 


FRATLRPETMFGQTNCWVRPDMKYIGFE 
TVNGD I F I CTQKAARNM S YQG FTKDNGV 
VPWKELMGEEILGASLSAPLTSYKVIY 
VLPMLTI KEDKGTGWTS VP SDSPDDI A 
ALRDLKKKQALRAKYGIRDDMVLPFEPV 
PVIEIPGFGNLSAVTICDEL.KIQSQNDR 
EKLAEAKEKIYLKGFYEGIMLVDGFKGQ 
KVQDVKKTIQKKMIDAGDALIYM\EPEK 
QVMSRSSDECVVALCDQWYL.DYGEENWK 
KQTSQCL.KNLETFCEETRRWFEATLGWL 
QEHACSRTYGLGTHLPWDEQWLIESLSD 
ST I YMAFYTVAHLLQGGNLHGQAES PLG 
IRPQQMTKEVWG \ YVFFKEAPFPKTQIA 
KEKLDQLKQEFE FWYPVDLRVSGKDLVP 
NHLSYYLiYNHVAMWPEQR 


2514 


16415 


A 


2531 


335 


2 


KKKALFFIHPPPFVNGRPHKNPKIKSLG 
SPTPFSLIFLKKKFCFVNQDGVRWLYFG 
SLQSLPSRFPPFFCLNLLSSWEYRGLPP 
RPGKYFFPFIFFYFLVEM\GFTVIiARMY 


2515 


16416 


A 


2532 


3 


380 


AINSYIRGDDPSSYPEWQSASRSSK\W 
SPLPRAliHLTDAK 


2516 


16417 


A 


2533 


3 


2083 


S S EG YLRGNMS EWE EEE I SQQEGSGDYE 
VEEIPFGLEPQSPGFEPQSPEFEPQS PR 
FEPESPGFESRSPGLVPPSPEFAPRSPE 
SDSQSPEFESQSPRYEPQSPGYEPRSPG 
YEPRSPGYESESSRYESQNTEIjKTQSPE 
FEAQSSKFQEGAEMLLNPEEKSPLNISV 
G VHPLDS FTQGFGEQPTGDLP IGPPFEM 
PTGALLSTPQFEMLQNPLGLTGAljRGPG 
RRGGRARGGQG PRPNI CG I CG KSFGRGS 
TL IQHQRIHTGEKP YKCE VCS KAFSQS S 
DLIKHQRTHTGERPYKCPRCGKAFADSS 
YLLRHQRTHSGQKPYKCPHCGKAFGDSS 
YLLRHQRTHSHERPYSCTECGKCYSQNS 
SLRSHQRVHTGQRPFSCGICGKSFSQRS 
ALIPHARSHAREKPFKCPECGKRFGQSS 
VLAIHARTHLPGRTYSCPDCGKTFNRSS 
TLIQHQRSHTGERPYRCAVCGKGFCRSS 
TLLQHHRVH\SGERPYKCDDCGKAFS\R 
ASDLIRHQRTH 


2517 


16418 


A 


2534 


434 


3 


APLHSGKRSPTKCN \ ECG \ GAWNRSSLlt 
DRHKI IHSEENPNKCEECGKAFKQASRIi 
TIHKI IHAGEKPYKYEECGKVFSQSSHL 
TTQKILHSGENLYKCKECGKACNLFSN1/ 
TNHKRIHAGEKPYKCKECGRAFNISSNL 
NKQECI 


2518 


16419 


A 


2535 


46 


454 


PSTFSSKVMDKQTLCSSQATSNTSRYAA 
ALYRQGS I YPKEMKTCVHTKPYTWHSS 
IFMIAKKWKQPRCPSA\DSWRNRMWSIH 
AMGCYLTMEKNEAVMLPRR/WSIiENIVL 
SERSH 


2519 


16420 


A 


2536 


13 


476 


ALKTYKYSHKKAFNPKKVYGKCSQKDPF 
PINHCLPTEKLHLCDKIGEGVFGKVFQT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 

Ctll 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

locution 

correspond 

ing to first 

amino acid 

peptide 


Predict- 
ed end 

location 
correspon 
ding to 

acid 

peptide 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glufcimic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine,' I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 

W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 














I ADHTP VAI K I I A I EGPDL VNGSHQKTF 
EE I LPE 1 1 1 S KELS LLSGE VCNRTEG P I 
GLNSVHCVQGSYPPLLIiKAWDHYNSTKG 
SANDRP/DFFKDDQLY 


2520 


16421 


A 


2537 


162 


296 


YVCQRYKIANPFL.PSYTKINCKWVTDIN 
VKPTSVKLLQEKRRKSIj 


2521 


16422 


A 


2538 


495 


348 


FGW\HAFIVKEPRVEKLCKASARAKPQP 
PAI I AKTFKAGGITGQYAQAAL 


2522 


16423 


A 


2539 


480 


63 


ARSE AW I S I ADAYMASPGDR I AQLKLLP 
YIKVGNSEIKRTGGFGSTDPTGKAAYWA 
SRVSENRPACKAI IQGKQFEGL.VDTGAD 
VSIIAIiNQW/PKNWPKLKAVTGLVSVGT 

R 


2523 


16424 


A 


2540 


2 


509 


NVDADDVRLAI QCRADQS FTS PP PRDFL 
LDIARQKNQTPLPLIKPYAGPRLPPDRY 
CLTAPNYRLKSLIKKGPNQGRLVPRIjSV 
GAVSSKPTTPTIATPQTVSVPNKVATPM 

QNVLINPSMIGHKNILI/TTNMVSSQNT 


2524 


16425 


A 


2541 


501 


1 


QWQWEDPGPGPRPREEASSYKCELREQ 
\LPQYVRDFFRKKAE\SGMDSSRNLEKL 
AERFIiAKTCSTKDQQFKKDQNVLiSPVNC 
CHLLL.TQVKRESRAHTTLSDIYLNNIIP 
RFVQVSEDSGSLFKKVQRYFFTEVRGWS 
NDTIFKILLDIMLITWVTQLSVHQTPV 


2525 


16426 


A 


2542 


473 


287 


EKDFNLPPKDLRL/KTSDV/TSTKENEF 
EDYCLKRELLMGICEMGWEKPSPIQVC 


2526 


16427 


A 


2543 


268 


482 


KKKKAWGLQRGGKNFPAKAELPTHG I QI 
NSCSVRL/VDIKKEKPFS ILKVEGQAQA 
RTHLNRAFDI I VLTRGG 














CKKICYLIHYWWEYKMIQPLWKTVWQFLi 
\KKLNILYDPAVILYDIYPEELY 


2528 


16429 


A 


2545 


28 


399 


FRHSSFQRSGRGSQLMVHFLSL/ SVMPK 
IGSVAGINYGLVAP PATTGETLDVQM / K 
GEADTENH 














FWNVDEVGGEALGRIjLWYPWTQRFYE 
S FGDL S T PDAVMGNHKVKAHG KKVLGAF 

SDGLALLDNLKGS fatlselhcdklhvd 
PENFRLLGNVLVCVLAHH/FDKE FAP P 


2530 


16431 


A 


2547 


375 


1 


GFACRRMQKRVRE VSHAEAE s aas kkvr 

ATYTQMQPHSLIQQQQQIHLQQKQWIQ 
QQIAIHHQQQFQHRQSQLLHTATHL.QGA 
\QKQKQQQHEWR 


2531 


16432 


A 


2548 


3 


376 


ELGSDVA\GAEALVDRRQERKGE IDAHE 
DS FKS ADESGQALLAAGQ YASDEVRE KL 
AVLSEERAALLELWEL.RRQQYEQCMDLQ 
LFFRDTEQVDNWMSKQEAFLLNEDLGDA 
VDS/IKEXLKKHE 


2532 


16433 


A 


2549 


360 


1 


RGEMLWTV\NNRFLKNFVPGKIEPFKSH 
SLYPPCYVHDVSFWIDQKKGFDELEFHT 
VGRAVSQDTIISIQFLSRFQHPKTQQVS 
IiCYRLTCQTCDKALTQQQVASMRSQIRK 
EIQQHQY 


2533 


16434 


A 


2550 


2 


403 


WAEEDTELRDLLVQTLENSGVLNRIKA 
ELRAGVFLALEEQEKVENKTPLVNESLK 
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Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylnlanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methioninc, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginitie, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














KFLNTKD \ LQGLEGRENLARDLG I I EAE 
GTVGGPLLLEVIRRCQQKEKGPTTGEGA 
LDLSDVHSPPKSPEGKTSAQTT 


2534 


16435 


A 


2551 


1 


409 


VPRNPTPLGGPGGPILRSREGGPPGLPR 
EPPSVLKKREPPTRGGGRPPGFPLPRRV 
RPEKWVNPGGQTFPEPKLRPRG/HHPGG 
QTKNPFPKKK 


2535 


J 6436 


A 


2552 


256 


1 


HEMDGTLLGPFPPPGSSKGPSLTGPPIL 
FQSPGP/APHTPSSSPANIiKT/CTPVCP 
SHLPW / CCPLCL PMRLPWS P VP PVS KSS 
PPALY 


2536 


16437 


A 


2553 


398 


3 


ARQQQQL.L.QQQHKI / NWLQQQ I QVQGQL 
PPLMIPVFPPDQRPIiAAAAQQGFLLPPG 
FSYKAGCSDPYPVQMIPTTMAGAAAATP 
GLGPLQLQQL YAAQLAAMQ VS PGGKLPG 
IPQGMLGAAVSPTS IHTDKRCI 


2537 


16438 


A 


2554 


20 


517 


DRPPSTKRRDTPQLRGAPDLSPRGPAPV 
PECPEH/SPRKKTSACRPPLPLRPSHSS 
PLP/SPQPSHSTPQASCFJjPEALiSPPAP 
FRS PQSYLRAPSWP WP PEEHSS FAPDS 
SAFPPATPRTEFPEAWGTDTPPATHRSS 
WPMPRPSPD 


2538 


16439 


A 


2555 


361 


3 


KSSQEALEEALRQRLEELKKLCLREGEL 
PGKLPVEYRLDPGEDPPIVRRRIGP\AF 
KLDEQKILPKGEEAELERLEREFAIQSQ 
ITEADRRLASDPNVSKKLKKQRKTKYIN 
AVKKLQVY 


2539 


16440 


A 


2556 


470 


2 


LKAAVTAGLEVPSDVSDRAFE\WLSAFP 
L \ DS P YS I HHPR R I Q VS SE KE AAPDAG A 
ERI TADSDLAYSS KVLLLS S PGLEELYR 
CCMLFVDDTAEPRETPEHPVKQIKFLL.G 
RKEEEPVLVGGEWSPSLDGLGPQADPQV 
KVSNAIRCAQAQTGTV 


2540 


16441 


A 


2557 


2 


315 


WAEEDTELRDLL VQTLENS GVLNR I KA 
ELRAAVFIiALEEQEKVENKTPLVNESLK 
KFLNTKDGKMFSLFLFIYLNEFLNLDTI 
HEKSGISGLFFVFNCYRSSF 


2541 


16442 


A 


2558 


372 


3 


CNISS HFANKKCQDVI VAARNVMTSQ I H 
HAV\KIIPGFNINVPGLPPPDEDTELEV 
QKVSNPQYHEVMNIjELENTLDQHSYSLP 
TCRISEYVKKLMEIxAYHSLLEAASSSDQ 
CADQLFYSVRCI 


2542 


16443 


A 


2559 


64 


435 


WGDAS CTGRAQLGI AHKSVLPTLTDKFN 
TI PAKTPMS FFKE PDKLTPKFT / WKNRT 
PRTVK 


2543 


16444 


A 


2560 


113 


375 


VPGPARDSTQRRAVKNDKKELMS IPGIP 
GPVQVPGGGLST\GMRGFPGSSLGFXiTJ 
PGGAL I P FS PAFFS RVGGDLS PRNTGPG 
QKPG 


2544 


16445 


A 


2561 


431 


51 


RKIYRVYERENFRVEIMFRFSHTSKKVC 
KGNV/DVAKFIKLHRDGWHVLNVQCACH 
QKGGIYWFRSIHVELIGYPPPRSSSHIK 
I GDKVRVKAS VTT P KYKWGS VTHQS VG V 
VKGNI IWVIKFLMLTFH 


2545 


16446 


A 


2562 




455 


PEGIALEWTESTGKEREHTFQPGDNVE 
VCEGELINL.QGKILSVDGNKITIMHKHE 
DLKDMLEFPAQELRKYFKMGDHVKVIAG 
RFEGDTGLIVRVEENFVILFSGLAMHGL 
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Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 
peptide 


Predict- 
ed end 
nucle- 

locition 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G = Glycine, H=Histidine, I=Isoleucine, 
K = L.ysine 5 L^Leucine, iV]=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














KVLPPPLQLCSETAS/SIN/VGGQHEW 


2546 


16447 


A 


2563 


407 


1 


KWDHPLSKCEVPCGGSIISSNGPVSYPG 
FPS P Y\ TS SQDCVWM I TVP SGHGVRLNL 
SLLQTEPSGDFIAIWDGPQQTAPRLGVF 
IiRSMVKKSVQSSSIQVLLKFHRDAATGG 
IFAIALSAYPNTKCPPPTIHPCI 


2547 


16448 


A 


2564 


382 


1 


TQEVEVAVSLDRAPAFQSGCQSETLSQN 
SNSSNKTWILDHFGliSDFIiDQRFSACTV 
FTPREQVS SHTRMF I AALFT I AKTWSQP 
/KYPPVIGWIKK\MWHIYTMEYYAAIKK 
DEFMSFAGTWMKLEAMY 


2548 


16449 


A 


2565 


429 


0 


PLQSSCQTSCHQNCSRKTSLHFPGDVIA 
TPEQVGGS PAQVPI PYLDDDI PLLEVEQ 
EPVSLELGDVSLVSVSREGLQPASITGS 
RGHL I VQLQELLHHWVLWS AVKSRWV IV 
GL.FVS IlilLSIiVFAIRLRNSASRAPVLL 
RP 














VIHGVI \NPFVHGDQYKKKFPLK\FYQE 
IYESPFVTETGEYYKQEASNLLQESNCS 
QYMEKVLGRLKDEEIRCQKYLHPSSYTK 
VTHECHQRMVADHLiQFLHAECHNI IRQE 
KKNDMANMPVLLRAVSKGLLHMIQEHVL 
RALGRIPTSYVRMQS 


2550 


16451 


A 


2567 


3 " 


178 


YNPGGGVCS DLIS S HCTPVWVT VSETLS 
QKKKKKKKKKKNFLPETRERALPGKKKG 
GGG 


2551 


16452 


A 


2568 


215 


411 


IHLIFIHLVFIDYLSVRHSSKILGYISE 
QIRKRNS / WLYWAHTCNPSTLGGGGGQ 
ITRSGVQDQPG 


2552 


16453 


A 


2569 


170 


430 


TSSQLAPQCLAV/VSGPLAACPELTSAT 
SPWLQVRTNAMASPLLKFSAEDLLFKAA 
LSQFCVIMLNAKLSVQKYEK\LISAFSD 
SR 








2570 


3 


463 


CRFFGYSTAAAPMTPSSGGSTLSGITAP 
AVPNI PS P IGVNG FTGL PPQANGQPAAE 
AVFANGMHPYPAQS PTVADPLQQAYAGV 
QQYAGPAAYPAAYGQI SQAFPQPPAMI P 
QQQREWPEGCNLSIYHLPQEFGDS\ELM 
QMVXjPFGNVISSKV 


2554 


16455 


A 


2571 


3 


424 


LKTMKGGTGNGLEIMLDIQQDEYIjPVWG 
ETGTSPTSGAPLHGSRPQP/PARGFLGF 
TVRPG 


2555 


16456 


A 


2572 


1667 


2046 


YIFFFTAFLWAALTFQVTTTLAPLALLV 
RSAKMMRASHDKPTANITLiN\GKTGRAS 
KQRREERNRQEVKLSLFTDDMFLYLESP 
IVLAPKLLLLINFSKVIAYKINVQKLLA 
VLYIKSSRESNEEHN 


2556 


16457 


A 


2573 


1 


399 


FAISQDHPAIjPSRPPSLHHPKPGTLTFH 
PDLPHQATCSRP I RHQRTWPEDAPLAKA 
DTVS P AE HP P AAAT KAP / TK / PAP D K PG 
GTSDPQTGPAP\PASPPCSGP\APQPVP 
RKPSRAAPSKVSVTVPRRVPRTFPP 


2557 


16458 


A 


2574 


2 


452 


AKVNEMKS PMRKGHTLLKNKEEKLNQLiE 
SSLWEEAS DEGSLGGS PTKKAVTFDLSD 
MDSLSSESSESFSPPHLDSTPSLTSRKI 
HGLSHSLRQI SSQMSSVLS ILDSLNPQS 
PTPLLASMPAQLPP\RDPKSTPTPySYC 
GRCRGFSLTS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

nmino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Asparttc Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleticine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2558 


16459 


A 


2575 


2 


369 


TLVYPAITFILLSICICYWIVTAVFLTT 
SGVPVYKVIAPGGHCIHENQTCDPEIFN 
TPEIAKACPGALCNFAFYGEKSLYHQYT 
PTFHVYNLFVFLWLINFVIALGQCALA\ 
GAFATYYWA 


2559 


16460 


A 


2576 


406 


1 


RQEGTGPSYLLLVGITPRSFWGFSPPLG 
PSGKNPIKTFGGPLQGFFFRVQMGVLLP 
LPGPSGN/CSIKVSALNSSFSPSGVNPG 
EASLPWF/CFFEMESRSVAQAGVQWQPP 
PPGFKRFSCLSLLGSWGYRRPPLV 


2560 


16461 


A 


2577 


3 


410 


YISPFYITHMRAHTNLPGPF/KLNQRAD 
ALVSAAFADAQTFHSLTHLNAAGLRKRY 
GLSWKQVKEIVKHFSAGEVLHLPHQGAG 
VNPRGLS PNS I WQMD ITHI PTFGKLS F V 
HVSVDI YSHFIWAT YQTGEATAHVK 


2561 


16462 


A 


2578 


386 


1 


TERIRQRQYQRETEKDKRYTERQRKTER 
I RES DRDRE RQNQ I E RDRRAT \ RER DR E 
KQRESDRETYRERENQIETERDRKRQSD 
RDRETQRETGRIRYRERERQHQTESDRD 
RERESDRETQRHRECI 


2562 


16463 


A 


2579 


2 


432 


LLYFAKRYGAAFGECCQAADKAACLLPK 
LDELRDEGKASSAKQRLKCASLQKFGER 
AF KAW AAARLS QR FPKA* FAEVS KLVTD 
LT* VHTECCHGDLLECADDRSDLAMY I C 
ENQDSN FSKLNECCE KPLLE KFHCI AE V 
END 


2563 


16464 


A 


2580 


3 


426 


NLLNDALAIREKTLGKDHPAGAATIjMNL 
AVLYGKRGKYKEAKPMSKRALE I RKKVTj 
GKDHPDVDKQLNNLALLCQNQGKTEQGE 
YYYPRALHIYHTKLGPDDPNAANTKNDL 
VACYLKHGTVKQA*SLNKENLTSAHEWD 
V 


2564 


16465 


A 


2581 


10 


389 


KLAG *GLWGHTLITNPLTEPLTYPFLGL 
YL+S I 1 ITSSICLLQTDL*ALI AYSS 1 1 
HIALVITAILIQTP*SFTGAVILIIAHG 
LTSSLLFCLTNSNYERTHSRI I ILFQGL 
QTLRPLIAL**LLA 


2565 


16466 


A 


2582 


1 


192 


LIPTLAI ITR*GGQPERLNAGTYFLFYT 
LVGSLPLLIALIYTHNTLGSLNILLLTL 
TAQELSNS * ANNLI * LAYTIAFI VKIPL 
YGLHL * LP KAHVEAP I AGS I VLAAVLLK 
LGGYGIIRLTLILNPLTKHIAYPFLGLS 
L*GGQPERLNAGTYFLFYTLVGSLPLLI 
Alii YTHNTLGSLNI LLLTLTAQELSNS 


2566 


16467 


A 


2583 


2 


438 


QAHGPLAGCRLRPRSPSPVLWRRRRRWW 
RQRRKWKTKTATAAAGMYASWTKACRGL 
EELINLTRLNVS YNH I DDLSGL I PLHG I 
KHKLR YIDLHSNR I DS IHHLL * CMVGLH 
FLTNL I LEKDGDDNP VCRLPG YRAS I FQ 
TFAQL 


2567 


16468 


A 


2584 


3 


415 


GRAILLLEEAIQYLSGIEIDLIDTDRGE 
WDS LT PQARR * KEAGLQMFVQLARFHNI 
VCKEAFGTLAFLTSE I KSLFGHPFLAER 
I ISMLNYFLQHLIGPKMGALEVKDFSEF 
DF * PQQHVSDSGAI YLDLGDEENFC 


2568 


16469 


A 


2585 


2 


453 


DAKMYLS YKYATVIRNLREGTCPEAPSD 
ECKPEKRCALSHHQRLKCDEWSDISVGK 
IKCVSAETTEYCIA* IMNG*ADAMSLDG. | 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

lasAmino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Asp;irtic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G— Glycine, H = Histidine, l=lso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GFVYISGKCGLMPVLAENYNKSDNCEDT 
PE AG Y FAV AWKKS AS D LT WDNXi KGKKS 
CHTAVGRNAC 


2569 


16470 


A 


2586 


3 


413 


MAVESRVTQEEIGKEP*KPIDR*KTCP1, 
MLRVFTTNNGRHHRMDEYSR *NVPSSEL 
QIYTWMDATLKEL.TSLVREVYP*ARKKG 
THFNFAIDFTDVKRPGYRVKE IGSTMSG 
RKGTDDSMTLQSQKFQIGDYLDIAI 


2570 


16471 


A 


2587 


1 


798 


LEWlLMliVKAGADQRAKNQDGMNALHFA 
TQS NHVR I V E YLi I QDLH LKD L>NQ P DE KG 
RKPFLLAAERGHVEMIEKLTFLNLHTSE 
KDKGGNTALHLAAKHGHS PAVQVLLAQW 
QD I NEMNELN I S SLQI ATRNGHAS LVNF 
TiTiSENVDIiHQKAEPKESPLHLVDINNHI 
TWNS LiLSAQHD I D ILNQKQQTPLHVAA 
DRGNVELVETLLKAGCDLKAVDKQG*TA 
LAVASRSNHSLWDMLIKAERYYAWREE 
HHESIRDPSTGFTL 


2571 


16472 


A 


2588 


2 


285 


AWSIAPSHPHSKVPPGPRRGKAEGRPGA 
AAQAAEQAE VHP P S SG PS PLPARQ P P VW 
QIPPTPSLKTTTRRGAQPQHSRKRLA* S 
RSVSVLFRKM 


2572 


16473 


A 


2589 


11 


438 


AYYGLNWHLGATbSQKKKKKKKKXKNFP 
RGGGPPRNPHFWGIGGRPGTGPPRGEKN 
GLKNQKKEKFLAPR*KKSTGKPLKPPQG 
KGWGGGPKFPKKKRPPRAGGGSLKPKGK 
NRE P FLAI.KS LPG VFLGG FGGNPGMGLK 
NQ 


2573 


16474 


A 


2590 


313 


391 


VHLVRVKLGL* SRI YKELLQLNNTT* DN 
PI * KWATDLDRHFSTKYTQMVDRLMEKC 
STS 


2574 


16475 




2591 


62 


616 


EVHQGTEVRDSEVRRRPQARGPLiMPAER 
AGRQRWLVPALQPRRGGLRR* RGAVRQH 
GAHPHGLLLQDQKI PALPGRKQAGSLHA 
PGTEGEPDHGGDPVLDAGIQHHRQCRHP 
TADHLNPGEHRRGEAHVRAAV* PAAGAE 
GAAKERRAHQANTA1.QVHRR * LGSFAEL 
RLLRKPGRTSVWPSPM 


2575 


16476 




2592 


345 


438 


HKRWLPVPILYQHLF*VFGHPEVYHjIL 
PGF 


2576 


16477 


A 


2593 


479 


313 


QDGLDLLTS*STRFSLPKCWDYRREPPL 
KLI NLQS SGVGLHVQLFP PS FC FDQLL 


2577 


16478 


A 


2594 


3 


419 


LTLHSNTLPPPEMSGLT* PPATNAYWTF 
LPS AI RLFPE IFF I AVLLS VS LFDETET 
LSDAHSWRLTFKYERDANYHLLMSAQER 
LiERLFGLHGGTI PIVPTADFHDS ISGAS 
DTDIAHSGIjAYTMERSARQIMRTAMKY 


2578 


16479 


A 


2595 


1 


419 


HKCEGRYRGKTYGGYWSLCATVNKALDE 
RIPITSASYYATVTLDHVRNILGSDTDV 
SMPLL * ERHR I LNETGKFLVDKFGGS F I 
NCVR*RDNIAHKLMHLWESFPSYRDVT 
LFEGN * VAFYTRAQ I LVTDTGTVI»E GK 


2579 


16480 


A 


2596 


199 


397 


S PTALNTDDVNP FMG S S S RGHHRQ FQTG 
I VSDHPAEAGP I S *DEQELP YAVLHFHK 
VQPLEPKVTD 


2580 


16481 


A 


2597 


154 


3 


MLSLRGFFCLFVCFETGS * YAAQAGLKL 
LGSGS P P I SSFQS AR I TGVSHL 


2581 


16482 


A 


2598 


425 - 


223 


QITGHGGKGL * SQLLGKFGQENFLiNLGS 
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SEQ ID 
NO: of 
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sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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Predicted 

beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phcnylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RSFN* PKLLPGTSGWVTMLNSLGKKKKV 
TAYLNKWMPTP 


2582 


16483 


A 


2599 


3 


441 


QGPSRDLVSELYQINAFDTPESLLMIGK 
DHSDP I HHTFDHMWRTKEHNEAGWLLLR 
SVDKVMKENDELGDSISQLQKQILSLKY 
AKIALTERLISCPV*TEIVLNYT*SLIM 
LLTDLL* KVHAQSHHAHSGQASNCNVMT 
LIGIiQH 


2583 


16484 


A 


2600 


131 


423 


GPAPMVILFHLTGFLLAFLPLSHLLTSY 
LVPW I LSGTDGHTFRSACLPRWLEAE W I 
FGGVKYQYGGNQEGK*CFFTG*SYVYNG 
SSGKVPWETFSRT 


2584 


16485 


A 


2601 


222 


447 


SGI PS FGL* WEQNRPGRLNAQ I LKDLG 
VSPGSDYGKLKNGISSGIiDNGVTTSDHD 
VLKMAI VGRKI W ILGDWSG 


2585 


16486 


A 


2602 


2 


457 


FICDPLIKAIGTEGDTDVLSEIMNSFAK 
S IEVMGDGCLNDEHLEEL.GG I LKAKLEG 
HFKNQELRQVKRQEDNYDQQVEMSLQDE 
DECDVYILTKVSDILHSLFSTYKEQILP 
WFEQLLPLIVNLI CSSRPWPY* QWGLCI 
FDDIVKHCSPSL 


2586 


16487 


A 


2603 


1 


297 


DHRQKLYAHTECGKALjLWKS VI CVHQKI 
LEEEKPCEGTKYDNI FSNRGCPPVPRWV 
HAVE I PCK*TECEKATGVHGPRGASEFL 
PERPTGMNMAKCESR 


2587 


16488 


A 


2604 


1 


405 


RFDVSDGLELRPKYNGIAHRLTTIWYL.D 
GLRGLYKGVTP I IWS AGLSWGLYFVFYN 
AIKSYITEGSSERLKASEYLASTAEPGP 
MTLCITNPLWVTKTGLMLQYDAWNSPH 
*QYKGMSDTLVKIYKHGGVQGLY 


2588 


16489 


A 


2605 


165 


3 


PEELLLVFFPVLKNTGIRPGAVAHTCNP 
STLGGRI TRTGD * DHPGQHGETLSL 


2589 


16490 


A 


2606 


3 


455 


KRYGCFSKRMNKRSATNVFFCARKGEVL 
GLLGHNGAGKS SS I KVITGDT * PTAGQV 
LLKGSGGWDALEFLGYCPHENALWPNLT 
VRQHI.* VYAAVKGLRKGDAE VAI TRLVD 
ALKLHDQLNSPVKTLSQGLKIKLGFCLS 
ILGKPSCGILA 


2590 


16491 


A 


2607 


1 i 


429 


VDYTVRKFCIQQEGDMTNRKPQRLITQF 
HFTSWPDLGVPITPIGMLKILQKEKACN 
PQYAGAIVDHCKA*VGRTGTFGDIEAML 
DMMHT * RKVDVNG FE S RNRAQS CQKVQT 
DMQYVFIYQALLEHYLYGDTELEVTCLE 
TPW 


2591 


16492 


A 


2608 


166 


435 


KFLSNNYVHFKQNFKKVLKFIKHLVLNY 
FKNIVLGQV*RATPGIPALWEAETGESL 
EPRS SR PTWAMWRNS I STKNAQ I KNI VL 
RILDPG 


2592 


16493 


A 


2609 


228 


431 


IFSKICIFNGFNYFPIVGHIGRIYLFIE 
TGSHCVTQAGVQWCDYGSLQP*TPGLK* 
SSHLRLPSSWDH 


2593 


16494 


A 


2610 


439 


175 


RNGGLHLWSQLIiRRLRWDYHFSPGSQGC 
SKS * S YHCTPAWVTQGDSSLKKQKRKI S 
TYTS ILSVLNETVYI I VYYTKKLDTSFK 
RKKL 


2594 


16495 


A 


2611 


1 


439 


LLGS I LSSMQKPRGLVDQETLRKAR* QA 
ARLNKLQEHEKQQKVE FRIRMEKEVSHV 
V* DSGHITT* VQSMNS I ERS I LHEWE V | 
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SEQ ID 
NO: of 
nucleotide' ' 


SEQ ID 
ISO: of 
peptide 
sequence 


M 
eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidinej l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \~possible 
nucleotide insertion 














DGLTSFYFGEDDDCRYGMIFTKDAAPSD 
E ELDS YRRAEE WD PHMADDKRKLTELAQ 
RPDDDA 


2595 


16496 


A 


2612 


1 


439 


VIRKVTGTEGSSSTVLDYTIPSSTGGMP 
VRKSEDQTDTKRTVI KTMEDYNNDNTAP 
AEDVI IMIQVPQS I WDQDDFESEDQDDK 
STQPISSVGKPATVIKNVRTKPSAIVKY 
PEKESEPFAKILKFTMDVSHEVIPHEVK 
SS*YSA 


2596 


16497 


A 


2613 


2 


453 


E KPEKEECNLWTEM WQENVPGS FGGIRL 
YLQELMTITQKALHSQPWKMKAQGAIAM 
ASMALQTNSLVPPYLGMILTALVEGLAG 
RTWALKEELLKAI ACVETARS AEL.* KSV 
PNQPSTHEIIQADLKECSKENGPYKI* P 
VICAADIIiHA 


2597 


16498 


A 


2614 


2 


441 


LEPALPGRWGGRSAES PPSGS VRKTRQN 
KQKTPGNGDGGSTS * APQPPRKKRARAD 
PTVESEEAFKN I MEVKVKI PEELKPWLD 
EDWDLVTRQKQLFQLPAKKNVDAIL*EY 
ANCKKSHGNVDNTE YAVNED VAG I KEYF 
NEMLGT 


2598 


16499 


A 


2615 


1 


430 


RGDRDLHCTESQSEAS TEEGHDSLS VG I 
FEEDSQLEFILDPPKS KPPAWLNGIMTC 
* DFELLNPRRVRFLLE VKDLALTRRQ I L 
LHKGLSDYEKSTTLQELVLKKSSRSGPP 
LS IEDLGLNFQL* PSSRVYGFTAEELKP 
SGE 


2599 


16500 


A 


2616 


373 


552 


ICKIKYKLFNSALLFFRQGLALLPRMEC 
SGAIKVHCSLDPLGSSNPL*RKIKESTL 
NLEKSLCTRGIFLCKYEEVPKQFLKICF 
QI FLETGS RSVS QAE VQWHDQNS LQPRT 
PGLKSFS CLNLP KCWDYRCE P PRRALCS 


2600 


16501 


A 


2617 


498 


157 


QLIGSSTHQAALRSRSLPPPAGPGTFHF 
HYQGKAWWKGGTSPDSLPSLLGRGVSV 
QLHPRGKEQRGASDT*HKCPVKLWTIGG 
KYRVSETSRIFSLPPTTLQRAGLDSGSG 
SL 


2601 


16502 


A 


2618 


255 


389 


LSEFYTYEGPSIRPPIGS*GTNLPLPLS 
Y I PRS P S AVDDENLLDE 


2602 


16503 


A 


2619 


3 


207 


QHSSL I I IRAI A IKLG I AP FHF * VP * VA 
QGTPLTSGLLLLT * HKLAP ISIIYQISP 
SLNVSLLLSGT 


2603 


16504 


A 


2620 


3 


207 


QYSSLIIIMGVGIKLGIAPFHF*VPEVA 
QGTPLTSGLLLYT*QTLTPISMIYQISA 
SLNVSLLLSGT 


2604 


16505 


A 


2621 


45 


447 


WRRIDCRLNLCVTTASLKHFFS I AHKFH 
LHNGSHLQSQHS CEAQLRRTAR F 1 1 CCL 
PYGESGQSWPTLTLERANLNWLTVYFNN 
WKDWEYDSDQYMSNFDRSSEMMNI I VW* 
RS IDNLPH IICS TL INTKNSFL 


2605 


16506 


A 


2622 


3 


142 


GNQATPKTAPATMSTPTILVATAVHAYR 
* * VAEKEHPLKFGGRACS 


2606 


16507 


A 


2623 


2 


453 


HGPGGLLDYIDKERIRDFLNGECMCEVP 
DGGLVPKSLYRTA* DLENEDLKL WTDTI 
YQSASVFKGAPHEILIQIVDASTVITWD 
YHVCKGDIVFNIYHSKRSPLPPNKDYLI 
AHS ITFPDGNNVHLNNKVWMLGPDY IMW 
ITTLITNNEN 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


I Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidinc, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 


2607 


16508 


A 


2624 


1 


315 


EEKQAPGFTTGRGKLTPLFCAHAVRFI I 
STSLIYKGAYP*ALKGKEKY*IiAVFWVY 
NKKGLNNNKTLFLDGVHLCFVPEVRKYLA 
GKGLPFSIGLYHLILLLFFFF 


2608 


16509 


A 


2625 


232 


478 


LDCGN YFTV *MYIKTSHCIP*IYIILNF 
HLYNFRYNLGDYNGE I VS EVMAQRQPMK 
PTYAIPIIPITNSSQFKHQEAMDVKE 


2609 


16510 


A 


2626 


3 


430 


TSTPNVHMGSTSLPGDSTTIEDAIQSHS 
ES AS PSALSSHPNNLS PTG WS QPKTP VP 
AQRERAPVSGTPDRDKLRPCGQRD*GYY 
WEIEASEVM1ATRIVSGSFGTVYQGKWH 
GDVALAI LQWDS T PEQFQGFRNDVAGL. 
RI 


2610 


16511 


A 


2627 


2 


341 


AIjQKHEDTDCPCVWSCPHKCSVQTLLR 
SE*SAHLSERVIAPSTCSFKRYGCVFKG 
TYQQIKAHEASFVVQHVNLLKEWSNSLK 
KKGSSFDKECVDKNKSIHCWHIQICSSV 
I 


2611 


16512 


A 


2628 


2 


179 


RHTGLWVTSLPAVFPGQVRRTLFITGLP 
RDARKETVESHFR*AHWLVGDFSPCCIS 
RTGEADPVHHRTPQRCQEGDCGEPLPVS 
RWVGSFIiIHYIQEHCL 


2612 


16513 


A 


2629 


1 


418 


GFSFCHPVPVKWRHRDSPQP*TPGLK*S 
CLGLPKCWDYRHEPLRPANISY 


2613 


16514 


A 


2630 


3 


442 


FTCGTI 1 1 AI PNG VKE FT * LATLHGSNM 
K*SAAVL*ALGYIFLFTVGGL.TGIVLSN 
SSLDIVLHDTYYMVAHFHYALSIRAVFA 
I IEGFIH* FPLFSGYTLHOAYA* IHFTI 
I F I G VNI/TL FPQHFVGIiFGMPRQ YSD YP 
DAYTTR 


2614 


16515 


A 


2631 


2 


454 


AAPMELICWSGGWGLPSVDLDSLAVLTY 
ARFTGAPLKVHKIINPW*SPSGTLPALR 
TSHGEVISVPHKI ITHLLKEKYNADYDL 
SARQGADTLAFMSliLEEKLLPGLVHTFW 
I DTKNYLEVTRKW YAEAMP FS LNFFLPG 
RMQRQYM*RIjP 


2615 


16516 


A 


2632 


133 


410 


WMWSSKAPHCFRLPS IGDADTVHQCAMS 
FQKGHSALEGVLHLVFKPDLVYQTLLQM 
PPRKCCLWPGAVTHACNSMTLGGRSR* F 
TRSGVQDQP 


2616 


16517 


A 


2633 


2 


230 


F FSETSS LLE I QG I ARHGGTWRLRQENN 
LNPGGGGCSELRS CSCAPAWVTVRLDLR 
KKRNRNP * KILKNYLXI FVI 


2617 


16518 


A 


2634 


2 


370 


GTSSSDPAQPGDDKEFIDASRLVYDGIR 
DIRKAVLMIRTPEELDDSDFETEDFDVR 
SRTSVQTEDDQIilAGQSARAIMAQLPQE 
QKAKIAEQVASFQEE* SKLDAEVSKWDD 
SGNDIIVLAKQ 


2618 


16519 


A 


2635 


2 


376 


MTDTDALY * RELFD PADKDKMDHS RRGI 
ALVFNHERFFWHLSLPERRCTCSDRDNL 
TRRFSDLGFEGKCFNDLKAEELLLKIHE 
VSTVIHSDADCF VCVFLSHGERNH I YAY 
DAKIEIQTLTALF 


2619 


16520 


A 


2636 


1 


461 


DMAFL I LTERKI LGYGQGR I G PNVACPY 
GLLQPFGDAI+LFTKEPLKPVTSTITLY 
I TAPTLALT I ALLL *T PLP I PNPLGNLN 
LGLLF I LATSSLAVYS I L* S R * ASNSNY 
ALIGALRAVAQTIS YEVTLTI ILLSTLL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaHne, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IKGSFNL.RSLITT 


2620 


16521 


A 


2637 


2 


384 


ENFKALAMIAFGQYRQKRPFEDHVKIiAN 
D * LN FAKTCVADE S AENCD KS IiHTL FGD 
KLFP I ATLRETYLE LADCCAKQ+ PD I HE 
CFLQL K YDNP YLPRL VKPNVDVMCTAFH 
DTEETFLK*YLYEIA 


2621 


16522 


A 


2638 


1 


373 


TFIYLLFLFSSAYSRGVFGRDAHKSEVA 
HRFKDLGDENFKAIjVL I AFAQYLQQCPF 
EDHVKLANEVTEFAKTCVADESPEN*DK 
SLYTLFGDKLCTVATLRETYGEMADCCA 
KQEPDRNECFLQ 


2622 


16523 




2639 


2 


375 


KGPCYRLVSTGTQRRHPGAVYLNKHLC* 
CNVGKA*GPHCEKCTLPCTFNEEPLKAIj 
TFFREHGP*VSDPEVATAPTEKEIPSLD 
QETTKLEPGQPQLSPGISTIHLHPQFPV 
VIEKTSPPVPVEI 


2623 


16524 


A 


2640 


215 


478 


KYFLAS HTSLFII CYTAHLTCT I AE P ICQ 
IESHFGKRLDADLVF*KSDDSTWDVIP 
SKPV*TSGLFSGKCL*HIVEGIIRAVDP 
RKLY 


2624 


16525 


A 


2641 


208 


376 


ILRNLIKNHFWPGWAHACDPNTL.EAM* 
GQITRSRDWDHPDRHGEAPSVLRIHRLA 


2625 


16526 


A 


2642 


424 


2 


KGEPLPPLGGWKT*GPRAFPQAQIPHAG 
FKTRGCPFPLPQGRNKAQVPNPFCPRER 
FLPGKGSGRVAPLKNGAQLGGPPFYPHP 
FGGPSPRVPLGFGVQTHLGNKPKPPFFP 
KKKKGGTGCSVGNWGPKSSQTLPSPQAS 


2626 


16527 


A 


2643 


190 


3 


ELSTRGFQGPHRRIAGLKQCLVMFPELE 
RSGGIiSEH*HLHLPGSSDSRASAS* IAG 
ITSVCH 


2627 


16528 


A 


2644 


2 


339 


CCEKPLLEKSHCIAEVENDEMPADLPSL 
AADFVES KDVRRNYAEAKDVFLGMFLYE 
YARRHPDYSVVLLLiRIAKTY*TTLEKCC 
AAADPHECYAKEFHEFKPLVEEPQNLIK 


2628 


16529 




2645 


81 


369 


VEVTGQPQNASFVKRNRWWLL1PL1IAAL1A 
SGSFWVFRTPDG * TSWR * I HKHMTQL I 
NNHLRA* YS VYRD I YFLW IA I * * SS * PA 
SVLL FTADYCPE 


2629 


16530 


A 


2646 


1 


348 


DMDMNPLRPQNYRFG*ELKADIDYHFKE 
DNDEDDHQLSLRTVSLWAGAKDD *H I VE 
SE AMN YEGCP I KVTIATLKMS VQATVTL 
GGFEITPPGDLRMKCGSGPVHIMGLHLV 
AGEE 


2630 


16531 


A 


2647 


1 


178 


GYTDT I LDVRSQRVRSLLGLSNSEPNGS 
VETKNL * Q ILNGGES PKQKGQE I KRYDL 
YLT 


2631 


16532 


A 


2648 


181 


244 


TIKRYKNVMIFY YFNFF*R*GLiNSLiAQA 
AAQWCNHGSLQNQPPGLKGSSCLSLPCS 
*GYTIFYLSIHQLIDIWIVSTFSKLFLH 
VS AYS S I KMS 


2632 


16533 


A 


2649 


2 


369 


KWITFISLLFLFRSVYGKFEFLLDAPKN 
DVAHRFIDFGEQHFKAIjAEIAFDHYFLH 
CPFEDLVTLVNE*TEFA*TCIiADESA*N 
CDKSLHTL FGDKLCTVATLR *TYGEMAD 
CCAKLEPERI 


2633 


16534 


A 


2650 


1 


349 


VTFISIiLFL.FSSGYSRGVFRIjDAHKSEV 
AHRFKDLGEENFKALVLIAFAHYLQQCP 
FEDHVKLVNEVTEFA* TCVADES AENCD 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalnnine f 
G=Gfycine, H— Histidine, J=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 

nucleotide insertion 














RSLHTLFGDTLCTVATLRETYGETADCC 
AKQD 


2634 


16535 


A 


2651 


21 


403 


CVCCVCVRESRTESIiPGDNEDFNVKNAS 
VKDVRCVHFDCDSQNDPPMEATGFTAQV 
TI YSLINLLHRLKYFETLDLH* IAI FFS 
KVS I LNHPGQI I ADY APALDCHTCH I AC 
KYAEliKEMIDRRSVK 


2635 


16536 


A 


2652 


1 


355 


ARMS ITDTYGQHL I AGGLMTQEDVS E I K 
S S YYA K S NDH LNNMAH YR P TAL NLQ AH W 
HGLAQAEAQ I TTWS SG VPFDLLLMVGMM 
TVQVP* ELQMHSHLLKTHDQSRMENMMY 
GIKLDW 














LFEHLG * YKFQNALLLRYTMKVPQVLTP 
TLVKVSRNL* KVGIKCCKHPEAKRMPCA 
QDYLSWLNQLCVXjHEKTPVSDRVAKCC 
TESLVNRRPCFSALEVDETYVPKEFNAE 
TFTFHAD 














YTCAIVPCKLFWWCCRDRVSLCCLYWSQ 
TPELK+SSCLSFPKC*NYGHEHTWPARI 
IFLNLYNKSG*TATRKCPPTKVTPSSHQ 
Q*HCTWPYFYFK 














KI FAKHTSDKGVES KYI * RTLINQ * ENW 
S IJCNWS KNLNRRFTKD IQKANKHMLS AT 
S L VH * E S KL K * Q * YHY T S I R VAKM KKTD 
YVTSW 


2639 


16540 


A 


2656 


340 


10 


GREDKSEKCYLKPGRSQPDRGAPKSSPG 
VP * P PAP PAI1PGPGRSSPFPQGSL1AGKD 
LRPRQPSQPGG PGELIFPVKTKKKKIKE 
RKTLIjAWTRSNERPEIAQREGGCIjRQ 


2640 


16541 


A 


2657 


509 


261 


I P Y FKI HCLHS ALGVAETE KETAEHLDL 
AGASSRPKDSQRNSPFQIPPPSPDSKKK 
SRG I MKLiFGK* VKQ * TSGMGP VLHVA 


2641 


16542 


A 


2658 


345 


3 


SAHLSHPKCWDYRREPLHLAR I SFLFKA 
SSPLYGRATFYLSTPPLMDGRWVAS IFW 
LL*IVLP*AWVYKYLSETPLSIF*KTGS 
LSVAQAAVQWHNHGSLQPQTPGLKRSSH 
It 


2642 


16543 


A 


2659 


271 


56 


VILYIVMPIPRPCPHPHPYIHTHTHTHT 
HTHTHTRLGKREF* LYSW * LPGYSTNGN 
KTFLEIYLLPILLKM 


2643 


16544 


A 


2660 


366 


68 


PGQQS KI MPQTERKKGRKEEREEREEKE 
EREGRKEGRKNEGTETVIEGGSSKTQEQ 
TRIKAS *LNSLYTTVKPQRRLKGGKKHP 
KDSNFKD * RNI S PHR 


2644 


16545 


A 


2661 


351 


199 


LLRRLRQENLLNPEGRGFS E PRSHYYSP 
DWATE * DFWSNKI K* QKVQYS KY 


2645 


16546 


A 


2662 


I 


160 


ERAWLHLS PGDGVCS EPRSHHCTPTWVT 
E*DSVANQPGWDRVRLCLKPTWVTE 














GRDNWIiKTVAKPESLSGKRGPSLLRKVF 
NPFPVYILFLPQAPKGKWLSPFPPFFFF 
FFFLRGSLAM* PGCSAVAQS 


2647 


16548 


A 


2664 


96 


349 


QMPSSVLTRVANAYYSGFFFWFLKQKFC 
FVTQAE VQGRS IS * PGPLVPGFKRFLGL 
TLLS T WNCRRAP PGLVT ACG FKMS LLTL 


2648 


16549 


A 


2665 


322 


74 


DTYTIiPRIiNQVESLjNRPITGSEIEAIIN 
RPPTKKSPGPDRFKAKFYQRYKEELVSF 
LLKLFQ * I EKGG IG WARWLMP VI P VL 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va»ine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2649 


16550 


A 


2666 


2 


228 


NLALSPRLECKSQGFTMLIRLVSVS*PC 
DLPAS AS Q SAG I MG VS HHAQ P FLHS YLR 
PLIL I KGDQFKYAFNLFSC 


2650 


16551 


A 


2667 


129 


301 


VRPLAGLGSPLI FFFFFEKSLAFVAQAG 
GQRHNLG*LLPLPPGLKGFSCLNLPSTW 


2651 


16552 


A 


2668 


318 


21 


FSLLVKDFQNDLAVNSSSIFPSTFLFLV 
RS*LDFVNLNFFNFYLFRDEFSLCCPGW 
TRIPRLKQSSRLRLLSSWGYKCVPLHPA 
INRFL*CDFSISGY 


2652 


16553 


A 


2669 


2 


322 


QDGLDLLTS*STRLGLPKCWDYRREPPR 
PAPINKCLLjSIYHGPGPSLGTEGEKGQD 
Y I P AI EQLTDQ WRRLT WTG C Y CAAVAE D 
SMGVWI PLLPGYEAE KEAGRFH 










29 


261 


EFNTLSKCVWIY"*LFSLSLSLSLCVCVC 
VCVCVCVCLCVSVCLFYGS P I CLHFMHP 
DTLSEESFHFLQIISKELLKC 


2654 


16555 


A 


2671 


5 


224 


GSAWWLTPVIPALWECKAGRSPEVGSLR 
PA*PRDPPAIiASESAGITGVKKNXiIAEL 
WEVKSCILLIFISLDKV 










126 


321 


WGRRGARARRDSHRALRVSPELALLELR 
PSTMAHI CNPSALGGRGGW IT* DQE FET 
NLANLEKPC 


2656 


16557 


A 


2673 


360 


74 


SRLLRKLRQEVGLNLGGRGCSGPRLHHC 
TIAWATERDPDSKNKTNQKNKQRR* * TL 
PETNSLLAWEPDCVCKTNKIiATRbEIMA 
QKSCSQRPQYH 


2657 


16558 


A 


2674 


192 


357 


RIRKWLYYYYYYLLRWSLALSPRLECRG 
T I LAQCNFHL * S S S RS PASAS PVAGI T 


2658 


16559 


A 


2675 


236 


375 


EE VLYSR KDRHVAS YS RK* W * RSGAEAH 
ACNPSTLGGRGGWITRSG 












2 


ATKVS INS IGTLGVKVGGIjLEGRRLiRPG 
PKQSSCLGLQSSWDHR*VQPHPGNNFTL 
FVETRSMLARLVSNS WPQAI FQ 


2660 


1656] 


A 


2677 


164 


389 


LTVQHGLRGLRKLTVMVESEGQASHLLiH 
KAAGRRSE CQQGNCQMLI KS S YQPCAE A 
HACNPSTVGG*GERITRSG 


2661 ■ 


16562 


A 


2678 


208 


1 


CSVYEGSCFCILRVRRVCVCVCVCVCVC 
VCARSCI YVRSENH* KQLPSSS ILT*RF 
* IQSTFRSNKRLR 










159 


382 


RWLIKNHPTQARIQVRGLLDRDCQTQTW 
LWESGVQPIiATTPRRHSEMGGWPGAVAH 
ACNPSTLGGRGG* ITPPRPESKLEDCLT 
GTARHRLGCGSQGCSLNLPPPGGIQKWG 
VGRAQWLTPVI PALW VAEAGRSLEVR 


2663 


16564 


A 


2680 


29 


367 


DCQS EQLRKLRWEHHWS QGV * GCSQP *S 
HHWTPAWVTEQDPVQNKQTKNYNTHQRA 
GEVWRE INCLSTEQLTEL YSE I TQ I LAL 
SVRN* KLL* YVKSS VI S AETQ I SGGKI F 












1 


ENI VP VR FLC I HRLFA I L I MVQ * LE R KR 
DHIVIHIiNLTLETVYLKKWQTRPNAVAH 
ACNPSTLGGQGRRITRSRARDYPGQHGE 
T 


2665 


16566 


A 


2682 


1 


395 


LLIEH I L IAMALL I LTERKILGCIQLRK 
GPNWGPYGLLRPFTDAI KLFARE PLQP 
VTS TITL YMTAPTLAVTMAVLL * TPLRI 
PNPLVNHDLGIiLCILATSSLADY* IL+S 
R*ASHSNYALIGALRACPH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first . 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


2666 


16567 


A 


2683 


3 


349 


VRIDNAAVCLLY*AWAEFIWA1iANIjI IR 
PDLDHPGNLLGNDHI YNVIVTAHAFVI I 
FFIGIPI IYGGFGN*LiVPLI IGAPDMAF 
ARITNIRF*LLPPSLLLLLASAIVEAGA 
AGT 


2667 


16568 


A 


2684 


99 


2 


LTVAHACNPSTLGGQSG* ITRSGVREQP 
GQHG 


2668 


16569 


A 


2685 


325 


4 


SSFPMSFIINSFI I IFLYTTI FLIFYDV 
VYSPEFKP*VCFCLTECWDYRREFQCSF 
LPCKRV* VLAPFIKNFFFFFFFLRWNFI 
FVAQAGVQ WRHLGS LQP PPPNFK 


2669 


16570 


A 


2686 


382 


269 


MGFHHVAQADLELLGTNDPPALASQSAG 
ITGVSHHAQPKLTF* TFS Y*ACVGGSS P 
QF I L WOHGLAGNS FCCF 


2670 


16571 


A 


2687 


3 


381 


GHLYALTFFFNS Q I CLF PR I Y KTCKKS F 
WLPFLNI.VSLCAI EGYKTKKVPSNV* KG 
IFIHHQTG FILRMQGWLN I*TSINGIYH 
ISRIEDKNHKTMFIDAEKWFHNI*HPFI 
IENIRPGWAKICN 


2671 


16572 


A 


2688 


289 


91 


IKYKELGVACSKTSRNVSYYCYHHHHHH 
RCHHYHHHS * RLAALCEE S GWRKALGS T 
SMGRVAFHRH 


2672 


16573 


A 


2689 


3 


211 


LHHVGQDGLDLLTL* SAPLGLPKCWDYR 
TEPPFPAPNEFLLLHSLLSKHYALGRPH 
TPRERKKASGILR 


2673 


16574 


A 


2690 


2 


197 


DLLTS*SAHFGLPECWDYRREPPRPASY 
AFIKYFCPALSCFWdETGPYLrFAYLR 
EESFVRSRT 


2674 


16575 


A 


2691 


253 


356 


AVP VKMAI VKKTRNNRCW * GCGE IGT FL 
HCWWEC 


2675 


16576 


A 


2692 


211 


339 


PGHPLS I ILQWIWGEDTRRGPMHEALH* 
PCCSELRSCHCTPAWPTE *NS I F* KKKK 
KWLGELKEKQKNMACANDP * PGHPLS I I 
LQW I WGEDTRRGPMHEALHGKSGDGVYA 
VRLLTAY 


2676 


16577 


A 


2693 


1 


175 


RHEGLNLGGRGCGEPSSCHCTPAWVTE * 
DSVSKKKRKGKKKSAIilLI FSNGVLFCH 
LG 


2677 


16578 


A 


2694 


234 


13 


KTSREPWSEFLSIKGEQQCSFGLLiYVFC 
FQMESHYAAQAGV*WCNLG*LQPPPTGF 
KQFS CLS PLS P SS PAS AS 


2678 


16579 


A 


2695 


149 


2 


SQGEDFTKTRALQWRSRRSVVAHACNPS 
TLGGRGWWIT*GQEFKTSLAN 


2679 


16580 


A 


2696 


2 


194 


CIGLGWAGACS SRLRQENDMNLGGGAC 
S ETRSHHCTPAWVTE * DS IS KKKRKRNR 
WGGTRFSN 


2680 


16581 


A 


2697 


338 


357 


YLIRRKKISNSKS*FSWPGWAYACNPS 
TLGGRGRRTTRSAVRNQPGQHGDPKFFL 
VSS 


2681 


16582 


A 


2698 


53 


293 


TISIKSKKWKNKQGEYNKWCMEWWIiAIC 
RRLKLCP YVLVHS C I AI KNYLRLGNL * R 
KEV*LAHS STGCTGNMAEKPHGRR 


2682 


16583 


A 


2699 


2 


335 


LYAAKVPLQKACIRTFSYISFFIFFFFF 
FFGKQSSFGPPGLKARGGTPLIGTPPLG 
VKRNFLPQFSGKPGITGCPPLPQKFWFF 
KKKRGLI F* PGGVPNSDPKG IGPPNPP 


2683 


16584 


A 


2700 


166 


3 


D YRRVP PR LVKR * GFTW*AGV* WRNHG 
SLQTQTIGLKQSSHLSLATCWDYRYE " 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 
nucle- 

Jocation 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X = Unknown, * = Stop codon, / = possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YRMTVFPARHGGSHTCNPSTLGGRGGWM 
T* CHE I ETTLGNMVKP 


2685 


16586 


A 


2702 


78 


341 


EVACNCLLPAIGCFASVTYV*SSVLL*F 
EL*VLNENSSF*NFIH*FF*NRVLICCP 
DWSVMA*S*FTEALTSQTIRSSHVSLPS 
SWDY 


2686 


16587 


A 


2703 


258 


185 


TM I APVHSSLGDRARPYLFKQKR* GLAL 
SPRLEWTGAI IVHRNLEFLCSSNPPTSA 
SGVTGITEMBVERHGWIQETFWK 


2687 


16588 


A 


2704 


361 


41 


NFGPVYKTNPGPSRPGWGPLGKPPGPQK 
FFPFGVPPSPPFPKKGSVFSPPGGQKGP 
FKGVFWGFP* PRGKKTPTQQPGPSPKVP 
NWDFPKGGFPKKKKKKRKIL.RN 












329 


SCTLQLTTPRLK* SSRPSLPNN * DYRHV 
PPHPASFP*FSEMF*DSTQIFLNSLKFS 
S * YYLVI FFFKME PCSITQDGVQWCDLS 
SLQPLPPRFKRFSCLSLPSTRP 


2689 


16590 


A 


2706 


232 


1 


KRKRKS FKTYYKKF FLNSS FKNQCP YA I 
GHIFRSIIFFFFLRQSFGLVA*AGVQWQ 
DUG S PQPP P P G FK P FSHLRLR 


2690 


16591 


A 


2707 


135 


1 


CFFLIFFFFSSDTQAGV*WPQPPGIKRS 
SQLSSPRSWDYRRISPR 


2691 


16592 


A 


2708 


1 


157 


LDLLTS*SARLHHPKCWDYRREPPRPAD 
YKYFLKEVASLIVKLYLFCKLNFE 














MQEHYYFIKRGNKMI IRKYVQLHANKL.D 
SLDKI DKFQVKNHQN * L * KK * KS WPGAV 
AHACN P S TLGG RGG W I T KS * DRDH P WL T 


2693 


16594 


A 


2710 


1 


383 


LHKVTTVLRFLSKFHILVLYFYYTCVHV 
SVSDMWCWFLY* IFTDMYNAVFQ*LDFF 
KLSIKFLRCIC*CVY*IFIPVFIYCFFL 
YLICLFLSYFLFNIYVMLLFYILCIIYF 
YVIFSSSSIF 


2694 


16595 


A 


2711 


227 


1 


IFFFSNSFLiKISLGFFLNFAFF* KPKFF 
PFLSFKINQNFFFGFPFNFLKIFFKFFP 
NKFFKF* IYLTFFKKTKPK 


2695 


16596 


A 


2712 


1 


136 


KNTKISQGWWPTPVIPATQEPEAGGSI.E 
RGRQRLQ * RNXiGS LiQ PL P P TFKR S S CLR 
LLSSWDYRRRPPPLANFCIF 


2696 


16597 


A 


2713 


3 


349 


L.I PRGQGSTWLPYNPATS I FGNDLNE I 
KM YGHAKTC I WMFMTSS F 1 1 DRTRKRQ * 
CE SVREWINKQ * CIQTMELVFGRNFFFF 
FFFFFFFFFFFFFFGGPPPPKIQIFKEI 
NPQ 


2697 


16598 


A 


2714 


1S1 


342 


TS QLLNLPLP 1 1 KA* AIRQE KE I KG I QL 
GKEEVKLFLFAHDTI VYP 


2698 


16599 


A 


2715 


3 


205 


FRHVAQAGLELLS AS SLPTLAS RSAG I T 
GVSHCVQ p * TVS SLRLGFYDTS FKI ADI 
QSCFSWVLYYCLQSAEL 


2699 


16600 


A 


2716 


"Tsl 


— 


ASNQSKNSFEKGEKNQQSVMVKTSQQAL 
NKRAFFDMIRSVYQKTYK*RNA* *ERQS 
FSLKVENKKEYPSSLLLFNIVLKVLVNT 
GHKEK* KAHRFDRNI IQFANDMIVYVEN 
PKDSTKRLS 


2700 


16601 


A 


2717 


4 


194 


FSCLSLLSSWDYRSMPPHPANFIjYF**R 
RGFTMLARLVLNSWPHSAGITGVSHHAQ 
PFNYFFI 


2701 


16602 


A 


2718 


1 


212 


LCCPGWWETARLKRSSRLSLPKCWDYRH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Asparfic Acid, 
E=Glulaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, l=possible 
nucleotide insertion 














ESLHLAFGFFLL* YFGPYLMVYVFSFNL 
YDFSLVYSL.YGTSI 


2702 


16603 


A 


2719 


53 


299 


KAVKVWIFFITKVHLTMETQIVSLSLFP 
FPSFLHPSLSLSSFL*RDRVLLCYPG*S 
A WQS * FTWPQI PGLR*AS CI SLVSS 


2703 


16604 


A 


2720 


139 


320 


QPTMRHSFSF*KQEISIFLKVSSSLHHT 
LRFLFFFFLLDRVSLCHPGWSTVALS+L 
TVAW 


2704 


16605 


A 


2721 


1 


99 


I ILAYSS ITHIG* I IAVLPYNPNITILN 
LTIYM 


2705 


16606 


A 


2722 


215 


361 


LTFFFFFFERESNFVTQAEMQGGNLV*L 
KPRPPGLKHFYYLSLQSTGNY 


2706 


16607 


A 


2723 


335 


24 


AIPLRPEF* THPGQNGKTRFPLKPQNTLT 
GVGGNARYSQFPKRLRLKIPLNPEVGPS 
I NPNFHPG I PF WAQKENFFP KKKKKKE I 
I MLNFS LI CYNLFFLYI CY 


2707 


16608 


A 


2724 


27 


314 


WKQLKYPPSDEWINKMWCIHAMEYLAIK 
RNKVQLCVT*RWRWLMDTKKK 


2708 


16609 


A 


2726 


1 


112 


GFHHVSQDGLiNLLTL*STHLSLPKCWDY 
RHEPLCQAR 


2709 


16610 


A 


2727 


271 


3 


RSPMLKNQAIKSVIMVYIGTIWRSAGPF 
AIFFIAEYTNI I I I I TLTTT I FLG TT YD 
ALSHELYTTYFVTKAVLLTSLFL* IRTA 
YPRFR 


2710 


16611 


A 


2728 


121 




ENKYRPGAVAQAYIPNTLGG*GGWir*G 
REFETSLTNVEK 


2711 


16612 


A 


2729 


2 


290 


NRHFTKENIQMPTRHKKRDPQSSLVIRE 
MQTKTKVR YYFI LTRMAKVKKI DNTTC * 
* G YEKLNS PKLQTTKCP I FEWVNKLWHS 
QIVEYFSGIQHA 


2712 


16613 


A 


2730 


98 


264 


LVSSE * * * I YQ ILFFFPEMECHSVSQ W 
VRGSNLC* IiHPLPPDFKRFSCFSYWEG 


2713 


16614 


A 


2731 


2 


286 


PFYKATVIRTVWYW * KN'QIDQWNRKES 
PEIDLHK*SLLIFAKGAKAIQWRKDSLF 
NKWC * NRW I S TCKNE S MLG WAHTCNPS 
TSGGQGGRIA 


2714 


16615 


A 


2732 


323 


181 


RDFVLtiVE ITMLARLVSNS * PCDLPASA 
AQSAGI TGVSHRARP FLDL 


2715 


16616 


A 


2733 


272 


131 


GRVDRLNPGGGGCS * LGSCQCTPAWVKE 
TPSQKKKNKIICFYAFLFI 


2716 


16617 


A 


2734 


227 


87 


AASTLTLTSLiIPP I LTTLVNPNKKNS YP 
HYVKSIVASTFIISLFPTTIFMCLDQEV 
IISN*H*ATMDFT 


2717 


16618 


A 


2736 


25 


415 


FSKYLLDHQLSAGLl.VEQHRGSLCPLSIi 
LHSDSSLSFAVSGKVSLAASIRNKLEL.P 
ET * RALMM * NHGHPLE FGGM * * PWT * E P 
G S * RC * ELGQPGQARRIiACNP S TLGGRG 
GWIMRSGVRDQPGQDVET 


2718 


16619 


A 


2737 


390 


1 


KGFFLETKRKFFFTNGFFPLLGGKGVPF 
PGKRLGPPILGFYPPQRVFPFPKPPFFP 
PGPFKRGQKPPGGGKNF*RGPPGPPKKG 
GAPTGKKKKKEEERKEKEKKTPRNISKP 
LSHPFPNFPFCATVRGFS 


2719 


16620 


A 


2738 


96 


331 


Q FTYTHGL FPFNFNRL.CVFFFS * KLVGG 
V*LCCPGWL*TPGLKQSSCLSLLSTWDY 
KGVPPCLAEHFFFFEKKSPLSL 


2720 


16621 


A 


2739 


379 


14 


LRE VFPFFS P PKI KCFS KNSPQKYFP PP 
WFKTPPTPPFFFPPLFGKNFFFPSLFN 
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SEQID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 

last amino 


Amino acid sequence (A-^ Ala nine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G— Glycine, H=Histidine, 1— Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

T=Threonine, V=VaHne, 
W— Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 














FCPPRGFF*RPPLFFFFFFFFFFFFFFF 
FFFFFFFKVFYSLLIjLYFLFILK*MLTF 
KTSISTQCY 


2721 


16622 


A 


2740 


198 


38 


ANPLFFFFFFLRDGVLVYRPGWSAWQS 
*LTADSTPWAQVILQPQRNPIEHDF 


2722 


16623 


A 


2741 


290 


39 


MSKVGPLLGVLGTSFRVLLKGPEIKGDP 
LLANPLFFLFFFLRDGVLVYRPGWSAW 
QS * LTADS TP WAQV I LQ PQRNP IEHDF 


2723 


16624 


A 


2742 


1 


399 


ILIRKHSDIGTLYLLFDA*TGTIGTDLS 

ALG I I F F I V I P I I I GGFGN * LVPLI IG A 
PDMAFAR I NN I S F * LLPTS LLIiLLAS AI 
EEAVPGTG*TVYPSLA*NYCR 


2724 


16625 


A 


2743 


398 


2 


SPPPLI FFLWGFSYIFPPPKKFFFLINP 
PPKFSPPPFFFKPPPPFFFFFPP*EEKK 
NFCSPPPFSPPPPFFLLPPPLFFFFFFS 

KKKKKKKRARSRSRTSPTRP 


2725 


16626 


A 


2744 


295 


185 


FFFFFFFFFFFFFFFFFFFFFFFFFFLF 
FFLFXFFXF 


2726 


16627 


A 


2745 


374 


4 


LKKHKSTRVP*NVKSGK*NFSPPFKIRP 
WAQKRAKKKGAREKKAD * EKGEFGKIjKK 
KKSFRPGKKV+G+PKGFQRNFEKKPVKS 
QGPVNMGVIPAFEKKPRIPPVAKIPI*G 
AQKKKGRPLAI 


2727 


16628 


A 


2746 


2 


382 


QDATA PIIDELIS FHDHAL IIICLICFL. 
ILDALFLTLTAKLTNTNISDAQEIETV* 
TILPAIILDLIALPSLRILYITDEVNDA 
SLT I KSIGHQWY * TYEYTDYGGLVFNS Y 
I LP P L FLE PRDLG LL 


2728 


16629 


A 


2747 


2 


374 


DWP PTVKRKTNPRAQSTAADR F I LL FTV 
RGLTG IGLANS SLD I VTiHDTY Y WAKFH 
YVLSLGAVFAI IGGFIH* FPLFSGYTLD 
QTYAKIHFTI I FIGVHLTFFPQHFLGLS 
GMPRRYSDYPDA 


2729 


16630 


A 


2748 


55 


230 


QIPAK* INKYLQN* KKKKKKKKKKKKKK 
GG 


2730 


16631 


A 


2749 


161 


3 


NGNYTLVKKCFNTKEKIKWVKN* IFFFF 
MRWSFTliVAQAGVQWCKLGSLQPL 






A ■ 








PPKKKNPTGTPKRVLKK*PPREEKKNSP 
QRKKKNTAKNKEKEAPQEEI * KG P PRY I 
FFFISFFFFFFFYFFFFFFSIVI"NfIjFGT 
FIATTL.EA 


2732 


16633 


A 


2751 


311 


110 


I TLPRHGG PHVQS QL, FKRLRW KDHMS LG 
DRGCSELSSCHCTPVWRQSKTFSKIKYG 
RNGTDKAATCNPGRFRSWGRKVA*GQES 
ETSLSNSRTRL* KNKIWSKRDRQGGYL 


2 


66 


A 


2 


223 


16 


KKTPQKKKNTEGKGPVKKSPPQKFLiKST 
PP*IFFYVFCFFFFFLYFIYLLFRIFSF 
FFILFVYYLFILP 


2734 


16635 


A 


2753 


367 


10 


PAP RGG VYRGRQ AS LS CSGLH P L RAS WP 
' LCSPTQA+AMAGAPPPASLPPCGLISDC 
CASNERGS VGVALS * SGAGDNLL VCRLL 
SGKCRNHPSSASLTLAAVDWSCSYSAIL 
APPUCNS 


2735 


16636 


A 


2754 


14 | 341 


PGPNF*FGGQNSPGKNQNLSIiPT*KGNG [ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


1M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PGNPKNGGPTVPGTPPETRGSLLRTLQG 
PLRPSGTPPTRRAPGTPVGNPGF*PRDF 
FAQKGPNVREGLKQGPNPKGLNPPP 


2736 


16637 


A 


2755 


15 


372 


HSWWECKLVQPL*KTVRRFLKKLKIDPP 
YYPAIPLLGIYSEERKSVKQSRRG 


2737 


16638 


A 


2756 


263 


2 


QKPIiPPRNDPVFPKGPERTPRPTWGFIP 
GGVPLGKFFTLLVSPFFFFFSPSPNLFF 
F*NRVSLCRPGWSAW*SWLTAASTSQA 
QAI 


2738 


16639 


A 


2757 


311 


106 


FVCFLRQRLALLPLECSCVILAHCNLRL 
LCSSDSPG*QSETVSKKKKKKRKKEKET 
GNL * R S I.DMKGR 


2739 


16640 


A 


2758 


23 


328 


NITYYT*LFVLYFYVPGHTWFFFXFXF 
FFFFFFFFFFFFFFFFFXFFFFFFXFFF 
FFFXFFFFFFFFFFFFFFXXFFFFFFFF 
FFFFXFFFFFFFFFFFFF 


2740 


16641 


A 


2759 


1 


338 


DR * I.FS TNHGYIGTL YLLFGA * AGALGT 
ALS LLMRAELGQ PGNLLGLDHTYNG I DT 
AHAFAI IL.FMSLAI 1 1 RS FGN * L.VTL 1 1 
GAP VLAAPR I TD I S W * L * PHS VLLLLA 


2741 


16642 


A 


2760 


120 


2 


INKYAWLiGTVAHACNPSTIjGGRWTA*GQ 
EFKTSLANMVK 


2742 


16643 


A 


2761 


251 


3 


CRGQIGTPTLHYSSSSSSKWIKDLNVRV 
KTI KLS ENTGVNLRELELGDCFLDDTKS 
KNSTRKIRK*SLKLETFFIiGGTR 


2743 


16644 


A 


2762 


1 1 


303 


ALVLKR S S CLDLLKCWD YRHE PLHLALS 
EILMYRFCL*IFDIFFYWYSREIDLIFV 
Y * F F FFFLETG FWFLLPRGDLG* WE P P P 
PGYKGSSCLGLPR 


2744 


16645 


A 


2763 


236 


356 


DWLIFVFSVERGFCHVAQVGL*LLGSGD 
P PAS AS QGAG I T 


2745 


16646 


A 


2764 


2 


332 


LTQTPGLK*SACLCLPECWNYRCEPPHP 
AGFL F * RH FDKGVS * LL I Y PGTGWS 


2746 


16647 


A 


2765 


297 


19 


KKKSWFFFKKKKCWGGATKFKN*RGFFL 
KFVFYFFFIG*GVFFFFFFLISIIIFFF 
FFFFFFFFFFFVFFYFFFFYFFFFFFLF 
FAMKINVFY 


2747 


16648 


A 


2766 


1 


147 


QILRRLRWENCVNLGGGGCSEPRLHHCT 
PAWVTV*NSVSKKKKNPLL.FF 


2748 


16649 


A 


2767 


357 


183 


NWDYRPLSPRPGKFLDF*VNRGFPLLTK 
LVSNF* PCNLPPSTSPKAGISGFNPRAQ 
P 


2749 


16650 


A 


2768 


2 


193 


RWESCLNPGGGGYSEPRSCHCTPSWVTE 
* DS VS TTTTKEKER KQNRTWS ESNSLQK 
YKDTNHF 


2750 


16651 


A 


2769 


333 


173 


VS QDGLDLLTS * STHLGLP KCWNYRRE P 
PRPAQKLLMVIWLGFMSSPKSPLEL 


2751 


16652 


A 


2770 


3 


331 


CMENCMVIPPKKVKHKNYHMIQKFYFWK 
AVQQLSTELNILi*PYDPAITLLGIYpKE 
LKTSFRTKTCT*MLTASL>FVIAKTWKQP 
RCPSVGE * LSNL * YVQTMECYS VLK 


2752 


16653 


A 


2771 


209 


55 


RPGRPQVLRRVRPQNRLNPGGGGFSEPK 
* GPCPPAWGAQRDFISKKKKTIT 


2753 


16654 


A 


2772 


213 


57 


RPGRPQtiLRRLRLQNRLNPGGGGCSEPR 
♦GPCPPAWVTQRDFISKKKKTIT 


2754 


16655 


A 


2773 


148 




KATGRKTWVKSRCWVGTVAHTCNPSTLG 
GRGR* TARAQEFKNSLRTUAK 


2755 


16656 


A 


2774 


2 


364 


WS AVRRDW I TALQ PERQRET PS QKKKKK 
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nucleotide insertion 














KKKKKKKKKKKGGVRHGVKS RKP * TSGR 
PGRKNHWRQI QGQP KTPRERAE ALKNRV 
GLKKLFKTPGHVNGAGNPKFRNGKSGKP 
PEVHLNGAW 


2756 


16657 


A 


2775 


121 


2 


HLRSGVSDQPGRHGKAPSLLKIEKLAGS 


2757 


16658 


A 


2776 


195 


3 


GRVDINTLLALLLI IITF*LPQIjIGYIE 
KSTP YECGFDP I S PARVPFS I KFFLAAI 
PFLLFDL 


2758 


16659 


A 


2777 


3 


285 


FLNIRNKVSLCCPS*A*TPGLKQSCCLG 
LPKC*DYR 


2759 


16660 


A 


2778 


135 


379 


HWSATVKLYAILLGLLKWESGTLLCSKD 
VLFFL*G*SSAYSI2SVIMQIKAT*VKG 
KARVQLGAKKLKAYWQRKSPGI PAG 


2760 


16661 


A 


2779 


364 


1 


GSFTGAV I LI I AHGLTS S LL F C1ANSNY 
ERTHSRIIILSQGLPTLLPLIAF* *LLA 
SLANLPLPPTINLLGELSVLVTTFS*SN 
ITLLLTGXJSILVTALYSLYI ftttqwgs 
LTHHINNIK 


2761 


16662 


A 


2780 


356 


1 


GCLRAHIWPQKGNHEGQVHLFIDKVCRQ 
PMTSDCINEI TTQVAQI FLVHFLLRQSIi 
TL.SEKRKKRKKKRKKRNRIESPEINTHI 
YCQLI FNKGAKKIK* G *NS1>FNKTYLDP 
WISTCK 


2762 


16663 


A 


2781 




204 


AQVGLQDASSPRIEELITFHDHAbll IF 
licflvlyalfltl>tt*ltntnisdaqd 
SETDCYSQSYIIj 


2763 


16664 


A 


2782 


1 


522 


Y KCNE CG KVFNQQAHLAQHQ R VHTG E K P 
YKCNECGKTFSQMSNLVYKHRLHSGEKP 

* KCNECGKTFSQMSKLVYHHRIiHSGEKP 

* KCNE CGNTFHHNS TLVSH KA I HTG E KL 
YKCNECGKVFNQKTTLARHHRIHTAEKIi 
YK*EECDKVFGCKSNLiETHKKMQFSKTD 
SAFSLQ 


2764 


16665 


A 


2783 


1 


327 


ENRLNSGGGGCSEPRSHHCTPAWATERD 
SVSEKEKEQKQNFHLNAQSNCQKLKS p * 
KYLKHPE KTD ILS KASQYNNQLTADQS I 
liKFKTKTFDQKKGRPKGHGMTYSNS 


2765 


16666 


A 


2784 


335 


128 


FSLIFCRAGILPCCPGWS*TPVLKQSSC 
SSLPKCSDYKQEPPYLACATLiKCYQIPN 
FYCWPHIFKRMFY 








2785 


208 


3 


RPICPLSLWGVSFLFFFFFLRDRVLICH 
TGYSAVAQYCNTAHCYSPRLK* S S CLSL 
PSSWNHNLIPPR 


2767 


16668 


A 


2786 


49 


332 


VEMGNSPINRKYVYPKSYNRCLKCNTEE 
GVLNDLG I AEFNTCS KSLLLL I FFKGR I 
LLCCPGWIEWQSQLTASSTLGLK*TFH 
LSLFGSWDHR 


2768 


16669 


A 


2787 


331 


187 


GCSEPRSPP CNAAWAT KGDS VS KE KKKN 


2769 


16670 


A 


2788 


325 


34 


RSQLRGLPSMSIS * I *NSRLR* LRPRRL 
F I FCRDR VSLCCSG WS QTPGFKQS THRS 
L1PKCWDYRREPSPI1AQIRYLGLFFSQGL 
SSAFCYVAMVSGF 


2770 


16671 


A 


2789 


3 


150 


AASTRQLI FHFTSKHHFGFEAAA* YWHF 
VDW * LFLYVS I YWLG 


2771 


16672 


A 


2790 


86 


311 


N I HPLND I FTRLKNG FKKKE I SLVKNEP 
NKGT*SLISIKVLFHT*K*FIRPGVAAH 
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T*NPSTLGGQGGWIT*SQE 


2772 


16673 


A 


2791 


2 


123 


GGGACS EPRSHLCTPAWETE * DS I SEKK 
EEKK*LQIYTNI 


2773 


16674 


A 


2792 


302 


162 


PLLRRLRPENCFNLLGGGFN* PRSRNCP 
P AWATKRDS LL KKNLKNF 


2774 


16675 


A 


2793 


1 


258 


GGCSGLRLCHCTPAWETD * DP VS KKKKK 
KNFTRAKLLI FGTLVLGKVKRGGP FKPR 
ILKLPWETWQNLVFTKKKKI FFLKNRGG 
GE 


2775 


16676 


A 


2794 


199 


3 


VGPIFRHDKPPHSKQLCTYSPAFTQLFS 
SGKILKNIHLWPGTVAHACNPSTLGG *A 
RWITRSGDR 


2776 


16677 


A 


2795 


1 


302 


GGCSEPRSCHCIPAWATEPDS t *KINK* 
INK* K* * KINRNKKGKNHFLSSST* * PQ 
VASSFLTGQHRYTKL*SSQKVLLDSATL 
QSKSNAEVKRVNRTTD 


2777 


16678 


A 


2796 


228 


326 


NDNGQSGWAHACNPSILGGQGGWII*G 
REFKT 


2778 


16679 


A 


2797 


270 


1 


KFGISAPFAPSPKV*KRGFF*NFLGNGP 
PVKIPPF*TPL*NFSKPGKNFPPFF*KT 
RFSKIFPNGFFFFFFFFEGVLLCCPGWS 
ALTRSW 


2779 


16680 


A 


2798 


307 


36 


FKNFCCG*EVSLCCPGWSQAPGVK*PSC 
LGFPKCWDYRCELPCLASCSL* CDRNQT 
RISAWLWIEFKLSPILPVLPLFLKKPQ 
AGLLCF 


2780 


166S1 




2799 


132 


5 


IFEVTV*CTK*HKRWMQPGWGHACNPS 
TLGGQGGWIMRSGV 


2781 


16682 


A 


2800 


32 


314 


KQHPGNGPNPSGKGPGRASRFLKXKNFF 
F*KTPPKKPHTPCKPSPKGKGPERLPWH 
FFNQKFGPLPMGPNPDQRAGFCLRDP*G 
GGKNRSPPTIi 


2782 


16683 


A 


2801 


139 


3 


AASTFYI FFETTLI PTLAI ITR*GNQPE 
RLNAGTYFLFYTLVGEG 


2783 


16684 


A 


2802 


238 


3 


AASTSHVISSMYNIYI IIQFKTFPVFFF 
RDKVSLCHTGWSAVAQS*LTAASAALTS 
CVQGILPTQPPEYHTRPLRLFL 


2784 


16685 


A 


2803 


110 


291 


KKLGVFGFGFGFKTKSNFIMQAKGQLPN 
LG * LKPLPRGFPQFSGLSLSGTWDYKHT 
PATR 


2785 


16686 


A 


2804 


97 


2 


S YNNQDS WLEKEKTNRS VKQN * AP I PE 
KKKKKNKVGGLTLPNCKTYYKATIIKTV 
WYWRKKRQIGQ*NRIESPEIDPHKY 


2786 


16687 


A 


2805 


268 


1 


ATFLLFYYVFRPQIPYYL*IFIIiSL.F*Y 
RHSGPPYVGPPQQYPVQPPGPGPFYPGP 
GPGDFPCRLPIRKWWLVWLMPQLNPTVW 
PTIIT 


2787 


16688 


A 


2806 


153 


3 


IPLPKGLLVPLFGVFLKVFFFFFFFF*D 
RVSPCRPDWSAVAQSRLTASS 


2788 


16689 


A 


2807 


3 


216 


NAARDHI VKPSP* PLTGAL.SALVMTSGL 
AM * CHSHS I S LL I LS LLTS TLT I YQ * RR 
NVTR*TSYQGHHTPTDQKGLRYGI ILFI 
TSEAFFLS*FF*AFYHSSLSPTPQLRGH 
WPPTGLSPLNPLEVPLLNTSELLAS*VM 
PLSLHIASNTKPTNQHTNHIPMKAQCNT 
INLIPRPPHTN 


2789 


16690 


A 


2808 


168 


354 


FLERESGFVAQAEGQGQNLS*WKPFFPI 
LKHFSCLSLLRSWNYGPTSPVPAKVGGF 
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WRNKIY 


2790 


16691 


A 


2809 


61 


360 


YVSNS KCSNHRK+ S LFFFFFFERESS FV 

LRNGDDGPRPQPPANLGLLVKTGFSPVA 
HLG VNLGT LGD CP ALP 








2810 


236 


3 


KMFFQMRYTMLKK* QATFGF I LALKNVL 
KTFLAP I FFFLRRNSALVAQAGVQWRDL 
GPLPPLPSGLKRFSCLSLPSC 


2792 


16693 


A 


2811 


140 


210 


NAKITKC* KGYGKRGTLMHCCWQFKMGQ 
LLWKTGYQFLKKLKLELLSNVPIPYL+ I 
WKKGNPYALLLAI 


2793 


16694 


A 


2812 


230 


1 


VKLCPVSLKTGVRPLLTSSEVIFILNCK 
FNIGV* LLPGSLLLLLASAI VEAGAGSG 
*TVYPPLAGNYSHAEPYALV 


2794 


16695 


A 


2813 


3 


345 


HEVRI DVHTRTCFTFGTI I IAI PTGVKV 
FS * LATLHGSNMK* SAAVL * ALRFI FLF 
TVGGLTG I VLSNS S LDI VLHDTYYWAH 
FHYVLS IGTVFAI IGGFIH* FPLFSGYT 
LD 


2795 


16696 




2814 


2 


184 


ARVG FHH I DQ AGL KL LTL * SAY FGF P KC 
WDYRHEPPRLALKFNNIKKKKKKKKKTP 


2796 


16697 


A 


2815 


276 


3 


ARVHRIDHGLM*HQPLGLK*SLCLSLPS 
S WD* * SMQPCPDDYGTRSDQKKPTFKSP 
YVSQTGLEHLGSSDLPASASQRAKITGM 
SHHALV 














HENQYSSLIIIMAIAIKLGIAPFHF*VP 
EV 


2798 


16699 


A 


2817 


93 


1 


GGGGCSEPRLHHCTPVWVTE+NPVSXKK 
TRA 


2799 


16700 


A 


2818 


1 


197 


GTRAQSLLLGRLRQKNQLNLGGRGCSEP 
KS GHLTPAWAT * PDL VS TKS CTL I YLVT 
QALHITLLP 


2800 


16701 


A 


2819 


109 


2 


GVF*GVFLiAQGLT1jVAQAEVQWHDIjGSL 
QPCPPRLKRLV 


2801 


16702 


A 


2820 


2 


347 


ARAPG FAENEVFVFS SCFVWMVPGGKGE 
NAEIiMQPSSY*ESQHFFPAPPRCSSCV* 
LCSLGPSLLGTLIFCQFTLSELPRFRKG 
* FSSSLKKVFCFGDRVLLCKPGWSAWQ 
TQL 


2802 


16703 


A 


2821 


1 


353 


CTRRDVTRESTYGGHHRPPVQKGLPYGI 
ILFITSQVFFFT* FF*TFYHSTLTPTPQ 
L * GHWP PTGITPLNPLKVPLLNTSVLLP 

LiLGLY 


2803 


16704 


A 


2822 


261 


400 


VEHSNSNKENFLGQGTGCHACNLNTLGG 
RGGRITWRSGV*DQLDQH 


2804 


16705 




2823 


273 


97 


LNTPRMQRLTQLATVI LRFEKDQG FAML 
PRQVSNS * AQAI CHLGLPKC* DYRREP P 


2805 


16706 


A 


2824 


159 


355 


KKFFS I FFFFFFFF* DRISLCHLGWS AV 
VQSWLTAAST 


2806 


16707 


A 


2825 


67 


338 


LS PEL * LY * I F* KP FS * QLSLTSHS PSL 
FSALLLLS ISPPTWLHSQSDKNLHLFSF 
LFRDSVLLCCLGYSAVT*SWFTVASNSW 
AQWLS 


2807 


16708 


A 


2826 


3 1 359 


HEQKYS FLHDSQTLFCF*DS IPTPSNMD 
ETQQKSRLELVR I S LLLIEPWLEPERLL 
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RS MVANNLVY DTS DSDDYHLL KDLQEG I 
QTLMGRLEYGSRRTGQMLKQTYSKFDTN 
SHNHDAL 


2808 


16709 


A 


2827 


3 


338 


LERNLDTTFLDPAGGGDPILYQHLF*FF 
GHPEAYILILPRFGIVSHIVSYYFGRLE 
PS G YLRMG * P I I S VGLLGF I V* SHH I FS 
VGVDI HARAYS YFGSLVMAI PCTLEVFT 


2809 


16710 


A 


2828 


1 


406 


RHEGEKLRRPTFGPRHRGAGTAKMSASL 
VRATVRAVS KRKLQPTRAALTLTPS AVN 
KI KQLLKDKPEHVGVKVGVRTRGCNGLS 
YTLEYTKTKGDSDEEVIQDGVRVFIEKK 
AQLTLLGTEMDYVED*LSSEFVF 


2810 


16711 


A 


2829 


332 


3 


GIIVMQATIATALIGYVLPLGQISL*GA 
TVIAYILAAIPFIGTDLVQ*I*VGCSVD 
SPTVTGFLAFPFVLAFIIAAVAALLLRF 
LQGTGSPNPLGGASQSDKIAFQFSC 


2811 


16712 


A 


2830 


186 


349 


YSEREVAFFFFGNGFCF*TQAEWNGGNL 
G*LNLLPPR* KEFSCKI FPRTWNYSP 


2812 


16713 


A 


2831 


303 


590 


NILTTLLNKQSKSNQLQNKIIHTVYIKI 
KIFFFFLRQSLSVTQADVQWHCLCSLQP 
PPPGFK*FSCLCLLSSWDYKHAPLHPAN 
FCIFSRDGISPS 


2813 


16714 


A 


2832 


1 


370 


EEL I T FRDHAV 1 1 S FL I CFLVLYAL FLT 
LTT KLTNTN I LD AQD I QTV * T I LP AI I L 
VLIGLPFLRILYITDEVDDPSLTIK*NG 
HQWD * TY* YTDYGGL I FNS Y I LRPLLLE 
PGELRLRDVDN 


2814 


16715 


A 


2833 


207 


1 


QFFIFLRHSFTLFAQAGMQWRDLGSLQP 
SPPGFK* FSYLSTLRGLARKITLAQEFQ 
TSLGNMGGPHPRA 


2815 


16716 


A 


2834 


223 


1 


DNLAHKGKTRVYLKSRNKLGKGGGAGNL 
VSLDSIGSRRDHRRAPPCPANFVFLV*M 
GFPHVGQDGPELLTSCSC 


2816 


16717 


A 


2S35 


1 


321 


GTRKPSP * PVTGALSALLMTSGLTM * LH 
FHSITLLI LGLLTNTLT I YQ * WRDVARE 
ST YQGHHTPP VQKGLRYG I ILFITSEGG 
LFAGFF*AFYHSSLDRTPQLGGQ 


2817 


16718 


A 


2836 


57 


370 


IWMGRVLWKDRLYGVFCRAFNRAVTSRV 
WAEDPWRVPKTLSVDPRKLPPFS*ISVR 
GQIYFT* FFFFFETESHTVAQAGVQWWG 
DLG*LQPSPPGFKRFCLSLP 


2818 


16719 


A 


2837 


2 


345 


ARAHRQLDEP*L*RRPGERHPSW*SEET 
VERQRTKT* S ESS QTGTS I TSSRNARRR 
ESEKSLHLETLNKEEDCKSPTFKPSTPD 
HPLKVMPAPSPKENAWVKRSSNPPARSQ 
SS 


2819 


16720 


A 


2838 


15 


376 


AKIEPLYSSLGNKSETPSQKGKKKKKKK 
KGKGGGE KKKKKKTEGGTRGLTQETQ FF 
GNPKGPKHLGARN* KHRAKQAKRGNQ PR 
E YR FTEE KKGE KL FNPGGGR S QKPKTGQ 
WNSAGGKK 


2820 


16721 


A 


2839 


3 


336 


HELLASILLIY* *CRDVTRERIIPLAHH 
TPPVQKGLRYGIILFITS*VCFLAGLF* 
YLYHS S LPSTAQLRGHW PPTGIT PLNPL 
E VPLL I T S VLLAS LCS F I * AHHS LI EH 


2821 


16722 


A 


2840 


338 


3 


KI I FYLFFLRS I AFLAQ I EGHHAVFSS F 
KLWPPGFKHFSASTFLVNRVFKGGPRAR 
VNFGFFIKSGFFPIF*VGF*LFFFFFFE 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylatanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 
















2822 


16723 


A 


2841 


2 


149 


ARGCSEPTLHPCTPAWVTE*DSISKKKK 


2823 


16724 


A 


2842 


348 


175 


SACKWS ITGMHCHA* VI VSVLLV+TGS 
LHVGQVGLELRTSDDMHTLASVSGGITG 


2824 


16725 


A 


2843 


338 


118 


DPGGGACSEPRLRHCTPAWVTKRDFVSK '" 
KKKK* NIKC * HAKSGTVbYS FL.YLFYS * 
YFLMRWSVHIPGPSTK 


2825 


16726 


A 


2844 


189 


47 


KCVALLLSLFLS CSVM * RWACFRFAFCH 
DCKFLEAS PAMLP IQSVEL 


2826 


16727 


A 


2845 


335 


139 


EVSPSWPG*SQTPDLKRSACLSLSKCWD 
YRPGVSHRARLNGSFLTQLYEMLTYFPI 
IWVTLQVFR 


'2827 


16728 




2846 


3 


352 


HEQRLTPEWKKAATALGDVVKVGAVDAD 
KHHSLGGQYGVQGFPTI KI FGSNKNRP * 
DYQGGITGEAIVDAALSALRQLVKDRLG 
GRSGGYRSGLQGTSYRSSKKDVIELTND 


2828 


16729 


A 


2847 


2 


269 


ARGLFSTNHKDIGTLYGLFGA*AGVLGT 
ALSLLI RAELGQ PGHLLGRDD ISRIHVK 
RIIPGCGTLDGISVGRECACLWTPACSR 
KCARG 


2829 


16730 


A 


2848 


317 


28 


SFLPN*ECLAGHGGRPL*FQVLRRVRLEG 
CFSLEG*GCSEPWSCPCPLTWATETDPF 
SREKKKRRRREEKAEKRRENRRVPSLSV 
FLEEYVELTLIL 


2830 


16731 


A 


2849 


326 


107 


FHHVGQDGLDLLPS*SAHLGFPKCWD*R 
CKPPCPASRNFKTLKLYHI FSNNLI E * N 
* KS I TRGTLE TVQLPGN 


2831 


16732 


A 


2850 


224 


3 


SILQAWLKVPSPPGNPLGLLQSQVDFFF 
* RQGLALS PGLDCSGM IMAHCS LKLLGS 
TDP PT1AS * VAGTPSS C 


2832 


16733 


A 


2851 


27 


329 


VRTAAINRPGFLLPCFIGQNFFFY*RFE 
TYRLG V* MII/TPH* YLLS SLSGLDCVI F 
FS F I L FF YW F I VYCHVFM VYLF FFIiF VI 
LFVLCVFWCFYFLLFV 


2833 


16734 


A 


2852 


351 


60 


NliLSSLQGGRLPRPPGLS PHPQIFTPHG 
GVPL*LPGPWKSRPEASLNPRS*PPPGQ 
PRGNPFPPKKNFFFFETVLLCRPGWSAV 
AQFQKKKNLLIC 












365 


LTQLLNIjI*YQNQIEYPQPYAQCGPRSG 
NAQ Q I GHS F PNG * Q VP AS GMHGLP WTHQ 


2835 


16736 


A 


2854 


17 


239 


HFTTLQ I SMSLLTETLARHDS THLWS Qli " 

L*RLRRNDLLKSGAGGCSELGLHHCTPT 

WTTEQDPVSRPGAVASEH 


2836 


16737 


A 


2855 


325 


140 


GFHYVSQDGLDLLTS * SACLAFPKCWDY 
RHKPPCLAAYIFIQSRVFLAVISISTHS 


2837 


16738 


A 


2856 


1 


157 


GTRGFHHVGQAGLEVLTSDNLPVSASQS 
AGITGESH*AHERSDGYANTNSPG 


2838 


16739 


A 


2857 


157 


361 


LLLRIVSLFWRKSSPLGVQAEMYRPDLP 
VTKPPPPD* DKDS CDSLL I TWDYRLAP P 
LPANLEF * I * TG 


2839 


16740 


A 


2858 


3 


346 


HEGNHKFKI INVHLR* LNQNL* SGLEGD " 
KVI WRNI *TSVTFLFC*MNGKMNPDLFR 
QFTYEVGRIFFFSLKD*VWLCCLGSSTV 
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NO: of 
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sequence 


M 
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od 
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USSN 
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beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, HKHistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argtnine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














DLGSLQPPPLGFKQFSCLSLPSSWEPPC 
PA 


2840 


16741 


A 


2859 


18 


233 


KNPTILCCKGGGAQA*KPRLLGRLRQEK 
GLTPGGEGCRDLKSPLGIPPWATRPKLL. 
LKKKKKKGGKKKTLF 


2841 


16742 


A 


2860 


356 


3 


RVNFKIFGKKGFCPGGPGGFEVliTPGDIi 
GPLFFQKGGDSPSKPLKPFFFCFALKST 
TLNYFQ FAFCVLRPT PN* S VTQAGMQ * H 
DHSSLQPQPPSLK* SSQLGFPSS WVYRH 
APPSC 


2842 


16743 


A 


2861 


115 


374 


I VCGKI LNV * YTQ * HNKLQNKNKVD * PT 
IKLMLISQSLGILIFIQKRDFEVNYGMI 
LKKRGW PGP VAYVCNS KTLGGQGGQ I T 


2843 


16744 


A 


2862 


163 


360 


LKRIWLLIIiHSHPHTSPHDRGVTQLEAI 
GPRWY* TYEYTDYGGLI FNS YI LPPLFL 
EPGDLRIiLDA 


2844 


16745 


A 


2863 


1 


236 


GTS PFFSFFFLSFFPSFLP FLS FFNPTT 
LJjLSI)HSNIS*KKVLiFSFEMESHSVNRIi 
ECSGTI SAHCS PAWATGTDS I S 


2845 


16746 


A 


2864 


67 


364 


VCVC VCVCVS VS VCVC VCVY I CTYICMY 
VCA*YRIRRYIMLPTLVYNSCPEIHDSK 
SCALHTGAITRATR * PCLC I H I P VCLT V 
CLTVWLHI CLSVCMD 


2846 


16747 


A 


2865 


2 


155 


ARGLTQENRLNLGGGGCSELRSHHCIPA 
WATE * DALSQKRKKKKGTYRG I I 


2847 


16748 


A 


2866 


344 


81 


IQPNS FI FLKIFI SRDGALLCCPGWS PN 
SWAQQSSRLNLPKCWDYRHEPPHLWQF 
L*KHYMYKWHKVHCVFSHYNKI FTGQTK 
TEN 


2848 


16749 


A 


2867 


217 


340 


RVFSDFYMNLKWSLRKRRTWPGLVAHAC 
NPSTLAG*RGQIT 


2849 


16750 


A 


2868 


3 


349 


HEATS P 1 1 QEL I T FHDHAL 1 1 IILICYL 
LitiYALYLTLTT KLTITI MSDAHE I HTA* 
TILPAI ILDLIDLPCLRILYVTDEVNDP 
YLT I TS IGHQWY *TYEDTDYGGLI FNS Y 
ILT 


2850 


16751 


A 


2869 


1 


352 


GTRIDVYTRAYFTSATIGIAIPTGVKVF 
S * LATLHGSNMK* S AAAL* ALGFI FLFT 
VSGLTGIVLANSSLDIVLHDTYYWAHF 
HYVLS IGAVFAI IRGFIH* FPLFSGYTL 
DQTYA 


2851 


16752 


A 


2870 


2 


342 


ARARFRTSLLIiAFALL*LPWLQEAGAAQ 
TVPLTTLFDHTMLQDHRAHQLAIDTYQE 
FEETYI PKDQKDSFLHDSQTS FCYSDSI 
PTPSNMEETQQKSNLELLRIFLLLIESR 
b 


2852 


16753 


A 


2871 


254 


2 


YPGYQAISQSDMQSYAYRHL*LQPPGVK 
* SS I VSLLSS +DHRPVPPCLANEKNFFF 
* RQGLAMLQSRLVLNS WLLLYSHAEPRA 


2853 


16754 


A . 


2872 


282 


3 


HTHILHYIYIHHGHIST*HLSPPTNIYS 
IjCNIKIJj I YTPYDH* I IRI PLHQEHAVH 
RRRLIHIHIHTSVCVCVCVSVCVACVCV 
CVCVWVLV 


2854 


16755 


A 


2873 


278 


3 


LiWPLKNSGPQPWPQMDLGKPQVKTPWY 
YKKGP * KKFFRKF * KWPGFFFFFFLRQS 
HSIAEAGVQWRDLGSL.QPL.PPGFKRFSC 
LNQISSC 


2855 


16756 


A 


2874 


2 


348 


ARA*SLILVSLI IPNATSNLLGLLPYSF 
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NO: of 
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sequence 


M 
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SEQ ID 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TPTTQLSINLAMDIPL*AGTMVIGYSSK 
IRHALAH FLPRGTPTHLI P I LAM I ES MS 
ZjL I QP I AMAVRVTDNITAGHLTMPLiTGS 
PTV 


2856 


16757 


A 


2875 


1 


260 


GTRENHLDPAGRGCSKSRSHHCI PAWVT 
D*DSLKKKKKKKPQI ILGNGGTEGNQK* 
RGNPFRGPWGPSRQKIKELGPGGKHGKS 
KF 


2857 


16758 


A 


2876 


176 


1 


EHLFCKKLLGEEKGSLSKEFGKCWVNPR 
♦KWNSRPGWAHTCNPSTLGGRGGQITR 
SG 


2858 


16759 


A 


2877 


168 


1 


GVRVFGFS SVPE FS FDTCAGAQWRNLGS 
LQPLPPGFK*FSCLSLLSSWDYKRTPRA 


2859 


16760 


A 


2878 


67 


333 


WQGLGRAVARVPRSYEITWAGNLKGAA* 
FQQLFFFFFFHLLFFFFFFFFFFFFFFF 
FFFFFFFFFFLFSFFFLFFFFFFFFFFF 
FFFFL 


2860 


16761 


A 


2879 


335 


1 


GRKEGRREKEKERKKKKERKKERKKTI F 
KKHI PSPLAFPRKAWKLKEIQI KX.VAH * 
KINRQAIRGCPPNKRPHPFQNMIKPGRE 
EGEGGLETSRDGERNLWIFGPSVGPLR 


2861 


16762 


A 


2880 


3 


345 


HERHETASIILLIAILFNNILSGQ*TIT 
NTTNQYSSLIIIMAIAIKLGIAPFHF*V 
PEVAQGTPLTSGLLLLT*QKLAPISIIY 
Q I S PS LNVSLLLTLS I LS 1 1 AGS * GGLN 
QT 


2862 


16763 


A 


2881 


347 


156 


WLIFVLLVEMGFFHSGQAVLKLLASSDP 
ASMTSQSSGITGLRHCAQPR*AFLIALC 
LPPSAKI 


2863 


16764 


A 


2882 


338 


2 


IKKAL*HE*LREKKKS PIRTSQSSKAP * 
NLPRSKKKKESSNNYQLL*AHKLQKVKE 
MGKLLETPNLPLLS QKVAE PLNQP ITSS 
KIETLIKKPYHPEKSYGHDEVTAKFSRA 


2864 


16765 


A 


2883 


206 


1 


IRTRPFTPMFIFYMFVYLATRSCPFTLA 
GVQWHNHGSLQL* PPGLKESSCLSLLSS 
WDYYACLHAELV 


2865 


16766 


A 


2884 


341 


140 


DGGCREPRSRHCSPAWVTE*DSKQNKTK 
QKQKRYTNI SSPQTSPLCY S FSAYTPLH 
NMPGNIMFSSL 


2866 


16767 


A 


2885 


360 


247 


FHR VSQDGLDLLTS * SAHLSLPKCWDYR 
REPPRRPKAL 


2867 


16768 


A ■ 


2886 


229 


351 


KLNNRPGVVAHAFNPSTL*KLNNRPGW 
AHAFNPS TLGG *GGWIMRSGVRDQPNQH 
GETP 


2868 


16769 


A 


2887 


359 


2 


FYSSSSSSSELVPCRQDVQVPHYLEGLF 
LRSCFME I KYDT * KRK I KYTHVHTHTHT 
HTHTHTQREKERDFPRFTAKWKANL.EAG 
S GYATAS S I RAVLWTT I TLV 


2869 


16770 


A 


2888 


4 


305 


ADSHMWKYKAPGITKILLKKSKI *GYHQ 
DWISSRYQDLFHKDMIFKIMSYWYKKRQ 
VDHWDR I E S PETS S QMYEHL I YWKDN I A 
EW*EKDSFLSKWCWDN 


2870 


16771 


A 


2889 


1 


436 


IAILT*YDYTLL*RVGPRGQEFGTRATV 
ITNLLSAIPYIGADWK*G*GGYSL.DSP 
TLTRFFTFHFILPFI IASLATUHLLFLH 
♦TLSNIPLRITSHSHQITFHPYYTDQYS 
IiRLL.PFLLSFTTLLLILPNFLFYPP*TT 
I/TFPF . 1 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
cth 
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1 SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino ncid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pnenylalanine, 
G=Glycine, H=Histidine, l=Isofeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2871 


16772 


A 


2890 


1 


218 


RKISPLIKLINHSFIDLPTPSNISA* *N 
FGS LLG ACL I LQ I TTGLF LAMH YS P DAS 
TAFS S I AHI TRD VNYG * 1 1 RYLHANGAS 
I FF I CLFLH I GRGLYYGS FLY SET * N IG 
IILLLATIATAFIGYVLP*GSLLGACLI 
LQITTGLFLAMHYS PDASTAFSS IAHIT 
RDVNYG 


2872 


16773 


A 


2891 


3 


368 


LSNS * ANNL I * LAYTIAF I VK I PLYGLH 
L*LPKAHVEAPIAGS I VLAAVLLKLGGY 
GIIRLTLILNPLTKHIAYPFLVLSL*GI 
I ITSS I CLRQTDLKSLIAYS S I SHIALV 
VTAILIQTP 


2873 


16774 


A 


2892 


3 


342 


HENLI * LAYTIAF I VKIPLYGLHL*LPK 
AHVEAP TAGS I VLAAVLLRLRGYGIIRL 
TLILNPLTKHIAYPFLELSL*GIIITSS 
ICLRQTDLKSLIAYSSISHIALVVTAIL 
I 


2874 


16775 


A 


2893 


2 


361 


ARVCLRQTDLKSLIAYSSISHIALWTA 
I L I QT P + S FTGAV I LI I AHGLTS SLL FC 
LANSNYERTHSRI I ILSQGLQTLLPLIA 
F* *LLASLANLALPPTINLLGELSVLVT 
TFS*SNIT 


2875 


16776 


A 


2894 


229 


3 


YTFVQSF I FS WL F I CKVCLL * PAHSMV 
CACECMCVCVCLCVCLSSLKIIPLLECL 
IYLQLILVLSSAI YNSRFE 


2876 


16777 


A 


2895 


3 


391 


GFLITNNISPASPFQTTIPLYLKLTALA 
DTFLGLLTALDLNYLTNKLK IKSPLCTF 
YFSNILGFYPS I THRTIP YLGLLTSQNL 
PLLLLDLT* LEKLL PKTIS QHQ ISTSII 
TSTQKG I IKLYFLS FFF 


2877 


16778 


A 


2896 


381 


2 


GRCRVSGS I IDHRMAPQKWRYTKGGPHQ 
WHNFCFF* KKGVFPYGPKVFLFRAPVFS 
PPGPPKRWEIKV* TPPPALFFFFFFFFV 
EMGSHCVAQAGLKLLGSSTPPTSASQGA 
E I AG VSHRAQPHAS 


2878 


16779 


A 


2897 


324 


1 


LYTNNTKHINPQQNNTK*YQRNTKNRRK 
KEQQTKNNIKRHITESARKNS* *HRNIQ 
KANKSQQSNKHVRARSRHNNNITONNNNN 
KKKKI ENSHAASD * ITSSGGRSRA 


2879 


16780 


A 


2898 


120 


345 


PPAPSS*YTTSLIQDRLFLMMAVLSSAS 
LMRGNVGSNIMNALSHFLPQGTATLF I P 
VLDIMEAISLLIQPIALAV 


2880 


16781 


A 


2899 


381 


219 


CVSQDGLDLLTS*STRLSLPKSWDYRCE 
PPRPAGSGHLYSSFKKQTGDTHHNF 


2881 


16782 


A 


2900 


1 


224 


LSWCLTLYFAYLLALFYFYFLKILDLAI 
LPRLVPYS WP * AS S PASAS QSSGITGMS 
HYTWPLPQTLNPFLFFVP 


2882 


16783 


A 


2901 


1 


228 


KYLINNRLITTQQ*LIKLTSKQMITIHN 
T*GY*YNRSGSSFNGVYDYHLLRSESEH 
P WM I VDNTE YDE I YTRGG I E 


2883 


16784 


A 


2902 


2 


332 


LPPS FSLLLAPS PS S SLPPS PS SSLPPS 
PSSSLPPSPSSSLHTSHSSTLTPSASCT 
LTITESFTQRLAQSYITILP*ASRTTMQ 
LLKYRTLQTTRNNTHSMTRPKALTKI 


2884 


16785 


A 


2903 


3 


243 


DIGFGTDFSDMTPKAQTKQVDQLDFIKI 
♦NFWPGTVAHTCNPSTLGGRGM* ITKLG 
V* DQPGQRGETTQKLAGHGGVDAV 


2885 


16786 


A 


2904 


179 


2 


CGYFNAYICYSLLCFLYLSLCNQPFWKK 
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Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














I FLPLFGLVFFFr FF*NRV* FCCPGWSA 
VSQ 


2886 


16787 


A 


2905 


108 


2 


GRVD I KLTSKQMI TIHNTKGRT* SLI LV 
SLIIFIA 


2887 


16788 


A 


2906 


318 


2 


YKESNSKNNEKKKITRQRKRQKKNRPSM 
MVHTCNPSPMAGRDGQI PSILFNQPHSP 
RIIRLTLILNPLTKHIAYPFLVLSL*GI 
IITSSIGLRQTDIjQSLIAYSS 


2888 


16789 


A 


2907 


17 


308 


KLAGYGGMCLWSKLLERPTRMNHLSPRS 
EGLME P * SHNC I PDWTP * QNP VSVQI TG 
FCFLTPNIiAMPPRLNLSGPICCQLSLRL 
LGPRLFAASVSGV 


2889 


16790 


A 


2908 


378 


1 


RSLHSTETAAKTHGHQTHPGLSTFSK*T 
PTP IAS I P I PNPHNCLPAFNS TTSAS YF 
SECVLTSSFKEIIADFR*KYSSTN*HPY 
HliFIiLKFVFFCSRDGSLTMLP I RPGLNS 
RSQAILPPWPPKVL 


2890 


16791 


A 


2909 


1 


129 


HLFGTNHRDIGTLCLLFGA*AGVLGTAL 
SLLIRAELGQPSTRP 


2891 


16792 


A 


2910 


1 


323 


CVCWGYRIPRCNIGHHLKFFFFSLFFFF 
FGNKVWFCSEG*RAGDQLLIMEPLASGF 
KGIFFLNLPQNWE*RVSPK1jPGKFWIF* 
LKTGFPLVAQWFELRTSGDQMA 


2892 


16793 


A 


2911 


367 


136 


etpgnhlspgv*gcsel*sclcxpawat 
eqdpis kqkrtkks ktllkntkadltr w 

KHIHKSACVCICLCMYMCLRE 


2893 


16794 


A 


2912 


258 


140 


FLIFFCLDCFLNIKVLGVPGFFFSLKKK 
QT*RQGLALSPRLECSGLI lAHCTLEPL 
GSRNPLTLAPQIAGATGMSH*VQGYNEL> 


2894 


16795 


A 


2913 


48 


-242 


YSMIFCVLILPVSFTSSDGIFL.LINLFL 
FY*STYPLAFFKTESHSVTQAGVRWHHL 
GSLQPPPPR 


2895 


16796 


A 


2914 


309 


76 


GF 1 1 CVCVCVCVC VCVC WF I FTEYS FH 
SLFC+ IRCCVFYFIALSLL JGFYIiFWTJR 
YLEVYNISFEVWASLFNRFLLL 


2S96 


16797 


A 


2915 


1 


122 


FSVETGFHHVGQPGL*PPTSGDPPSLAS 
QSAGITGMSKHA 


2897 


16798 


A 


2916 


167 


1 


GNMCS KESVSGTNINRKPD * * YNPRLGT 
VAHAYNPSTLGGQGRWITGGQEFKTSL 


2898 


16799 


A 


2917 


2 


356 


ARGILLLRIMLTTLTRYP* *RDVTREST 
YQGHHTPPVQKGLRYGI ILFITSEAFFL 
PRFLPPFLSSSFPPTPSLLKPFPPSSFP 
SLPSFFPPFLPSSLFL 


2899 


16800 


A 


2918 


2 


351 


AREYTSLQLILQMTFIMAFTCTDRTLYE 
LAFEC*LMRTLDI ITR*GNQP*RPNAGS 
YWLFYTLbGTLPLLiIALIYTHNTLiGSLN 
ILLLTLTAQELSNS * ANNL I * LAYT I AF 
IVKI 


2900 


16801 


A 


2919 


32 


321 


ALMG ITFFFFLGKGVLAPRGGGRGGNPG 
.LWGGPPPGLGPFSG*SLQGGGA*GPPPQ 
AGANFGFLILRKTRGSPGCPGGFWIPGP 
GDRPAGAPQGGG 


2901 


16802 


A 


2920 




361 


GTSTRLGVLiLLSLHHAGS INPLGITLHS 
DKVTLHPYYTIiCDAI.GLLLFI.LSI.MTI.T 
LFSPDLLGDPDNYTLANPLNTPPHIKPE 
* Y FL FAYT I LRS VPNKLGG VLALLLS I L 
ILAIIPIL 


2902 


16803 


A | 2921 . | 3 


152 


HERLYSVPLRIIiRRLPDPLSIPQGWRHV 
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SEQ ID 
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M 
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od 
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1 Predicted 
beginning 
nucleotide 
location 
correspond . 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, HHHistidine, l=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














I RESAYKSHI/rLPARRGLQDG I ILF ITS 
EWFFAGFF*AFYHSSLCPTPQL*GHWP 
PTGITPLNPPESPLLNTSVLLASGWIT 
*AHTKAISHYLREEAFKTG 


2903 


16804 


A 


2922 


410 


1 


RAG YRQKWGSIAT V I G QLGL P VE RG W YG 
PSGEGGTPQRHCSELVLDDLHVITQNRA 
NHRPRCGGSLLSELKFAPIjALQPGRQSK 
TLSQKKNKQTTTKKNTKKQPGSVAHACN 
PSTLGG*GRQITLPSGVRDQPGQH 


2904 


16805 


A 


2923 


2 


145 


ARAARGDVI * s FHVFAQRDLNLLSSRDL 
PALASQSTGITGMSHQCPG 


2905 


16806 


A 


2924 


29 


340 


EMNEVERRGRK*LFHDSIQSR*EYRCAP 
QCPANS CVLM * RWG FAMLP KL VS S DLP A 
LASQSAGITGVSHCAWLVFTjPLLSTSFI 
sqrhlylpra 


2906 


16807 


A 


2925 


2 


337 


ARVL I LPG FG 1 1 SHI VTYYSG KKEPFG Y 
IGMD*AMISIGFLGFIV*AHHIFTVGID 
VDTRAYFTCAT IIIAI PTGVKVFN+ LAR 
LHG CNM I * SAAVTj * S LG F I F VF I VSGQA 


2907 


16808 


A 


2926 


76 


388 


RYCTPAWATETPSQKKEKKKKCQTFEPE 
SSFPVCLENGGGGNFISLFFLFLIILSL 
FFYFY* FFFI IFVYSYYYFYFCLFS YFF 
FYFFINLIIFLLLNYFFFFI 


2908 


16809 


A 


2927 


48 


374 


KDQEPTDMGSAHFQVFKGWRQVGGANED 
RINSHGGPVMFSGQYYGQLVRFLVY* I I 
IFFMLLFFSLFYSYFL.FISIFFVLVFYC 
LFIFLFYFSFFFIFVFFFYYFIVIF 


2909 


16810 


A 


2928 




294 


HEGFHYISQDGLDLLTS *SARLGL.PKCW 
DYRRE PPRPARFPTLRI WL Y I LKALYTV 
LGCYREYEAYRPWCWTHRMDGGQEVEGR 
REGTGGGHILGF 


2910 


16811 


A 


2929 


7 


420 


I RRRDVLLLTLTSLS P TAAR I CYNGGRR 
GNRLNSLAYLSKELLAAWSLRKPSHGLL 
TP IRCVLY I RGHHF * LLPPSHLLLLAS A 
IMGAAGGTG*TV*PT1ARNYSQPGVCVN 
LAMVSLHLSGVS S I LGAI TF I TAI MNME 
APAITQYQTPLFV* YGLITADLLVLSLP 
DLTAGITILiLTDRKLSTTFFDPAGGGDP 
ILY 














GLLHKAPSP*KFFFSPKPFNFFWKFSPI 
FSPPKKKFLiSKKPHIVFKFPPFKGKIFT 
FPPPLKFGPPRVFFKAPPPFFFFFFFFF 
FFFFF 














DSSFFS * LATLiHGSNMK*SAAVl.* ALGF 
I FLFTVCGL.TDIALPNSSLDI ALLDTYY 
WAHFHYVLSLGAVFAIIRGLIH* FPLF 
SGYTLDQS*AKIKCAI IFIGVYITFFPH 
HSLGL 


2913 


16814 


A 


2932 


3 


340 


NYSHPGASVNLI I FTVHLAGVSCILRAI 
NFITTI INI KPPAI TQ YQTPLFV* S VL I 
TGVLLVLSIPVLSAGITILLTDRNLNTT 
FFDPAGGGDHILYQLLF*LFGHPEAYIL 


2914 


16815 


A 


2933 


2 


364 


RVQKGLRY W I ILS I TS E VLLFAGYF * AL 
YHSSLAPTPQLGGHWPPSGITPLNPLEV 
PLLYT YVLIiASRVS FT * AHHRL IQNNRN 
HI IQALL ITILLGLYCTLLQAS *NFEPP 
FTISDGIYG 


2915 


16816 


A 


2934 


2 


367 


PRVRPRVRYLLFGA*AGVLGTALSLLIR 
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SEQID 
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26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspoh 
ding to 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutnmic Acid, F=Phenylalanine, 
G— Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Ar ginine, S=Serine, 

W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AELGQPGNIjIjGNDHIYNVIGTAHAFVI I 
FFI VI P 1 1 1 GGFGN * LGPLI I GAPDMAF 
PRINNISF* LLiPPSLiLLLLtASAIVEAGA 
RTG*TVYPPL 


2916 


16817 


A 


2935 


3 


365 


YHIV*PNP*PLTWALSALLMTSGLTM*F 
HFHS I TLV I LGLLTNTLT I YQ * WRDVS R 
* ST YQGHHTP PVQ KGLRYG I MLF IT* DR 

GITPLNPLE 


2917 


16818 


A 


2936 


3 


144 


DSHGRHVING*TCVTICFIRQLIGHFTS 
KHHFGFEAAA* YWHKKKKK 


2918 


16819 


A 


2937 


3 


422 


QRLLATNHHDIGTLYLSFGA* PGALGTD 
LSLLIRAELGQPRDLiLGNDHI YNEI*TG 
HALGILFFIGIiPIIIEGFGN*bHALIIG 
APDMALPRINNISF*LLPPYLLLLLASA 


2919 


16820 


A 


2938 


3 


382 


RTRGLFS TNH I DI LTLYLLFGA* AGVLG 
AALSLLIRAELGQPCNLLCNDHIYNVIV 
TAHAFVI I FF I VI P I I I * CFGN *LLPL I 
I GAPDMAFPR I NN I SL* LLPT S LLLLLA 
SAILEA* SGTG* TV 


2920 


16821 


A 


2939 


225 


3 


NISWRMKQIVPKGEPYNGVPVFVSLWFH 
KMFI FETGSHS VYQAGVQ*RHLGSLHPH 


2921 


16822 


A 


2940 


1 


247 


PTRPPVIYSTIFAGTLITALSSH*FFT* 
VGLEINMLAFIPVLTKKINPRSTEAAIK 
YFLTQ ATAS 1 I LL I AI LFNNI KKKKK 


2922 


16823 


A 


2941 


3 


369 


TRDSTYU3HHTPPVHTGLRYWRILFITS 
DAFFFAGFF*AFYHSSLAPTPRLGGHWP 
PTGITPLNPLEVPLLYTSVLLASGVSVT 
*AHHSL>VEDDRYQI IQALLITILLGIiYF 
TLLQASEDFE 


2923 


16824 


A 


2942 


3 


401 


LTVCCVIVLRLKTLFFFFNKPFLTQKGY 
FNTPEEGFFKKPNRRVGPPSPMTDPTML 
TNL I KGKVPKAP PRI LMGGGINMTF* GF 

SGGS PASGNFLKELGLRS INF 


2924 


16825 


A 


2943 


189 


348 


RNARGWVTDKEKRLRLGMVAHA.CNSSTL 
GGDGR* IA*GQEFETSLANMVKPCL 














ANSNYERTHSRI I ILSQGLQTLLPLIAF 
* *LVASIiANLALPPTINLLGELSVLVTT 
FS * SNI TLLLTGL 1 1 LVTALYS LYIFTT 
SQWGSLTHHINNIKPSFTRENTLMFILL 
SPIL 


2926 


16827 


A 


2945 


621 


929 


GCSSGTGCCP I LCDLPRPWS CRGVGGS P 
SSTAHLCPRGWRSGRCFLPPLSAS*VDS 
AMSL.IQAAKNLMNAWQTVKASYVASTK 
YQKSQGMASLNLPAVSWKM 


2927 


16828 


A 


2946 


379 


395 


S QHFGRLRREDHEVRS S * PRDP PALTS Q 

DRS*NINIGTIFINLLIFYLSIYLSIYL 
S I YLS I MYS FTVAQAG VQWRDYGS LQPE 
SPGPK*SSCISLSSSRNHSHTPPHGWVD 
PKIP 


2928 


16829 


A 


2947 


2 


357 


HTYD I RKPRR* TLACAIiSALLTTS GLAM 
TDCFHYITLLILCLLTNTLTI YQ * WRDE 
TRESTYQGHHTPPVQKGLRYGI IIiFITS 
EAYFFAGFF * AF YHS SLAPTPQLGGHWP 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylnlanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














PTGITP 


2929 


16830 


A 


2948 


2 


357 


PRVRYSTDHSDIGTLYLLFGA*AGVLGT 
AI SLL I RAELGHPGNLLGNDHI YNVIGT 
AHAFVI IFFI VI PI I IGGCGN* LGPLI I 
GAPDMAFPR INNI S F* LLPAS VLLLLAS 
AIVEAG 


2930 


16831 


A 


2949 


2 


217 


KNLKIVQYGDMCLWSQLLRELRWEDRIjS 
SGG*GCSELCSCHCTPAWTTQQDCLKKK 
KNFWPVGEVKGRHML 


2931 


16832 


A 


2950 


2 


354 


AIPMTTARLTIHEAYLIILERTTTTTKD 
VKNPRR IAAAITAS CLGGGLEDAI SCQY 
GIATKDRPTGLGTPEVLLGALPGAGGTQ 
RLPKMVWPAALDMRLTGRS I RAHRALK 
M*LVD 


2932 


16833 


A 


2951 


3 


150 


LFSCSPTFSSDPLTTPLLILTT*LLPLT 
IMASQRHLSSEPLSRKKKKKK 


2933 


16834 


A 


2952 


283 


344 


EKKKKKKKKKKKKKKKKKKK* KLRKKKK 
KKKKKKKKKKKKKKEK 


2934 


16835 


A 


2953 


3 


378 


DAWADAWNQTPLFN * SVLI TAVLLLLS L> 
RVLAAGITILLTDRNLNTTLFDPAGGGH 
PILiYQHIjF*CFGHPEVYILI1)PGLGIIY 
RIVTDYYGTKEPFGYIGMV*AMRSIGFIj 

rfivrahhivtvg 


2935 


16836 


A 


2954 


3 


387 


ilyqhly*ffghlegyililpgfgiish 
mvtyysgkeepfgyigmg*amisigflg 
figrahhiftvgidvdtrayftyati I I 

AI PTGVKVFS + LiATLHGSNMK * SAAVL * 
ALGFI FLFTVCGLTGI 


2936 


16837 


A 


2955 


2 


432 


PRVRPRVRKHIAYPFLVLSL*GIIITSS 
ICLRQTDLKSLIAYSSISHIALWTAIL 
IQTP* SFTGAVILI IAHGLTSSLLFCIJV. 
NSNYERTHSRI I IL.SQGLQTLLPLKKKK 
KKKKKKKfCKKKKGGGLLKESLGGPNLTG 
EGK 


2937 


16838 


A 


2956 


12 


391 


SYFISSSKPHLSPPWLSSPDEATSKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KGGGFK*KRIGGSFKKGGGEKKIFFFKG 
GKKKKTGGFFEKKFFLGGEKY* dnlpkk 
NKTVGEKKNFLGGG 


2938 


16839 


A 


2957 


1 


352 


PTRPYFPVDAGEAQHHPRTCRRPLRAI.W 
S SHHERWKVTL CTH CSLGVFFLYCCTYY 
IFVLFIP*SSCGLTLIFITCIII J FGSIS 
FFLFFT I VFS I I I VTTFKFRLLYS I I FL 
SYLLC 


2939 


16840 


A 


2958 


1 


243 


NLPRLNPKE I E I LNRS IICNKIKAIIKS 
LLSKKSSGHNDVTAEFY*TFKEELITIL 
LKFYLTPKKKKKKKKKKKKKKGGPF 


2940 


16841 


A 


2959 


373 


3 


FSSLKKRVTPPPPPKTGFSLEGLHLLKK 
NFPQKPPPPKKSFSQKNPPPPKKKPPF* 
KKKPPPPPPI*HPPPKILQPPPPPFFFF 
FFFPFFFFFFFFFFFFFFFFFFFFTDMR 
GFIVRKFRTRG 


2941 


16842 


A 


2960 


2 


136 


PRVRSTLPISYKWNNRALMKAHLIMK* F 
TEYFKTTDELYLHDNT 


2942 


16843 


A 


2962 




358 


HTPPVRKGL* YGI ILFITSKVFFVPRFI 
♦AVYHSSLCPTLQLGGLWPATGMTWLNP 
L*VPLCTTFVIiIiA* * VSIT* AHHNLIDL 
NRCHVIYALVITILLRIYFTLLPASEYF 
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sequence 


M 
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od 
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Predicted 

beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G = Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














EAPLTIS 


2943 


16844 


A 


2963 


2 


373 


RAYD I VRPRP * PLLGALS AL.LMTYGLAM 
*VHWDCITLLILGLLTNTLAIYQ*WRDV 
SRESTYQGHHTPPVQKGLRYGIILFITS 
EDVFFAGIV*AFYHYSLAPTPQLGGHWP 
PTGITPLNPLEV 


2944 


16845 


A 


2964 


403 


61 


LFFPLEKNFLPPGFSAFFSPFSP*KFFF 
SPKAliIFWGNFSPFFPPPKIRFLPKIPP 
WVFFSPPFHEKLFSSPPPLNFGPPRVXiF 
KGPPL.FFFFFFFFFFFFFFFFRFEGECW 
RL 














LIPNLAMVTR*GNHPQRLNAGTYLLFDT 
LDGS LALLI GLT YTRNTLGSLNVILLT I 
TAQELSNS * ANNLV * LAYTLAF IEKIPL 
YGLHL* L PKAHRE AP I AGS I VLAAVLS K 
LGGSGKN 


2946 


16847 


A 


2966 


321 


.2 


STGMHFPHINMAINPPPPPRPPPPLFPP 
NPKTKPNPTQ *KGGFPPLGPF * KKLSPF 
LTLFFFFFFFFFGHHPGPWQKKKKKKVF 
FFCVCFS FQRVHNI YKNTHQQQ 


2947 


16848 


A 


2967 


200 


1 


RRTYTSHLLACLRQGIiAFSPRLECGGKI 
RAHCSLQLYGSSDPPTtAPQTAGTKQHN 
QRIAQCNADN 














LLTASSSEIAPLQSSLjGDRARLCIjIK* k 
EGVLNSI*SGNQGKSYANVYRLLYLDPI 
PKIYAEAYTP*NVNSTNLETKSPKTIQK 
FPEDREFKNDFFEKTKNG0RETGAFHPF 
LL.FLV 














ALiGIHFIFTVSGLjTGIGLiANSSLHIVLH 
DTYYWAHFHYVLSLGAAFAI IGGFIH* 
I PLYS CYTLDQTYAKI HFTIIFI GVNLT 
FFPQHFLGLSGMPRRY 


2950 


16851 


A 


2970 


2 


377 


NILLLTLTAQELSNS*ANNLI *LAYTIA 
FI VKIPIjYGLHL*LPKAHVEAP IAGS I V 
LAAVLLKLGGYGI IRLTLILNPLTKHIA 
YPFLV1,SL*GIIITSSICLKKKKKKKKK 
KKKKKKKKRGGPF 


2951 


16852 


A 


2971 


1 


407 


GTRSYTHLYRVFLELIKMSVYDLNHTVI 
MVI SGHVRLAFYG I VHLTLI LNLLTDH I 
LYPFLVLSLSGVI IISSIFLRRTDLESL 
IAYSS I SHIALWTAI L.IQTP * S FTGAD 
IIiI I VHGLTSSLLCCLANSNYER 


2952 


16853 


A 


2972 


10 


259 


SRSVAI YFKGMASA* * RMFSSKKKKKKK 
KKKKKKKKKKKKKKTAI TKKTTTAKWKN 
* RTE K I RHTRL P L I S S RDAKCVD FL YT 


2953 


16854 


A 


2973 


12 


400 


IiNCRTPSLYLQRAGELLSVENPHIWCQK 
CVRKNNFFLFFFFFFKTDLYCPHI CIAL 
S ITCFLiAI S ISGLFAFFLIiNS FNYHFI I 
VP*NFSLS IYLCLHSLFYVGFFFSFDMM 
IFSILT* TLFFCLLFHL 


2954 • 


16855 


A 


2974 


125 


3 


RPRRPAAQVGVQ * KNLSSLQPVPPGFKL 
FFPLSLLSSWAS 


2955 


16856 


A 


2975 


2 


332 


RECTNYPAQPPPPPLEAEEGFQPLPPTD 
TG*LECLPPDGGGQHRALGI,GATWHIjEN 
.ACALVLADIjGTGPKPSGPFSGQACGASG 
RLPLLSGP I LPPLGQGHTLSAWGHHA 


2956 


16857 


A 


2976 


7 


329 


SRDIIjVMTEDGEFF*GHV*P*DRSRRAD 
TPRLGGHWPPTG I TPLNPL * VPLLNTS V 
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nucleotide 
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ing to first 
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residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, HNHistidine, I=Isoleucine, 
K=Lysine, L>=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LLASGGSIT*THHRLIEYIRHQI I HALL 
ITILLGLYFTLLQASEYFESSFT 


2957 


16858 


A 


2977 


2 


339 


AHHIFTARIDVDTRAYFTSATII IAIPT 
GDKAFS * LATLHGSNMK * SAAVL * ALGF 
I FLFT VRGLTG I VLADS S LD I VLHDT YY 
VGAHFHYGLSIGAVFAIIGGFIH*CPLF 


2958 


16859 


A 


2978 


394 


0 


QGCSEL*SCHCTPAWVTE*FPVSKKKKQ 
RK 


2959 


16860 


A 


2979 


149 


383 


PLCFPSQHWGKSSNCLSSLKYFPSSGLP 
LRCKAELDDVKQKADKELDVVAHACNPS 
TLVGQGGRITLRSGV*DQPGQH 


2960 


16861 


A 


2980 


2 


378 


ARVS IGFLGLI VGAHDMFTGRMDLYTRT 
YCTFATILIAIHTGVPVFI * LATL.HGCN 
MK*SAAVI J *AIiRFI FISTVCGLTGIVLP 
N*SLDIVLHYMYYAVAHLHYVLS IGAVF 
AILGRFIHRLPVF 


2961 


16862 


A 


2982 


2 


409 


PAVAEAYLKPWDDS KGS FLWGKPDLDG 
I REC CQRNFGWNRTRTDES L FP VLKQLD 
AQQTQIjR I ds F ftlaqqekd d akr I ks q 
RIjNRAVT CMLR I E I E AAS S E I E AVS VD * 
QKELELIDKAVACHLEYMYETDP 


2962 


16863 


A 


2983 


3 


453 


HASAHASAHASGQRKGAAPAEKKCGAEA 
QHEGLELRVENLQAVQTDFSSDPLQKW 
CFNHDWTLIiATGGTDGYVRVWKVPNIjEK 
VLES KAHDGE I EDLVLVPEGML * IVCPD 
HYSPVGINDFMWLL* LNWGNRLYFPHI s 

vylmfhfrtf 


2963 


16864 


A 


2984 


3 


417 


LILPGFGI I SHI VTYYSGKKEGFGYICM 
V* AMI S IGFLGLI VRAHHI FTVG I DVYS 
RAYFTS ATI I IAI PTGVKVFS * LSTLHG 
SNMK* SAAVL +ALRFI FLFTVSGLTG IV 
LANSSLDIALHDTYYV*THFHYVLSI 


2964 


16865 


A 


2985 


273 


408 


MKIFIF*VSPPLFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFLAVFTFI 


2965 


16866 


A 


2986 


176 


335 


TFQPSEIjWRTALLPRLiECSGLIIAHCSL 
ELLGSSSPLASAS*IAGTTGSILRY 


2966 


16867 


A 


2987 


65 


398 


KKEFKIGRKAAEKMTRNINNAFGPGTAN 
ECTVQWWFKKFCKGNKSLEDEECSGRPL 
EDDNDQWRAIIEADPLTTTREVAEELSV 
NHSKAVQHL K* VG KVKKLNKWVPHE LS 


2967 


16868 


A 


2988 


2 


158 


PGWSLTPDLR* STCLSLPKCWD YRH KR P 
S PTS RQT FDHAPNKN S S HS P P 1 YM 


2968 


16869 


A 


2989 


435 


23 


GWPPNPQKSFYFPQRLKIWGGGGRKRP 
PPKKKGFSQKTPEGF*KPPPKRRKKIFH 
DPGKKGPPKGIFKRGPPLFFFFYFFFFF 
FFFFFFFIITIECLHSRFPHRHHNKKFP 
PNPPS PR FWPQHLNTS LPNP KNKE P 


2969 


16870 


A 


2990 


410 


3 


GGRGHFFSGAFFIKFPWNKKGISQPLCF 
PRGGGSPPPLGPVRGGGPPCWGPLCHKG 
PVKKTGAPRGKNGISPFFCFPPLGSFNR 
SGFFWALliGSPPLFFFFFF*DRVSIiHHP 
GWNAVAQS * LTKTSTS WVQVI LL 


2970 


16871 


A 


2991 


371 


132 


KQSLPFKVKGFFFPPEVENAIYPDSHTA 
FYS W * KRSTFSKKKKKKKKKERKRKKEE 
RKEGRKEGRKERKKEGKKEIENA 


2971 


16872 


A 


2992 


336 


1 


CPRWSRTAGLK*STCFGLPKCWDNRHES 
PCPASSVLND*QLSTVQRAT*WQTKSLV 
LKAPPARGAbPVSRAVSPPPSSHHFAYF 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
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X = Unknown, *=Stop codon, /=possible 
nucleotide deletion, Impossible 














LHMI * QVSCLFLR * SLALVPQAGVQ * RD 


2972 


16873 


A 


2993 


317 


440 


HLGRLRQEHHLSPG * LRKFFFFFGRDGV 
LMLSMLVFNSWTQVMKLPQPPQVLGLR 


2973 


16874 


A 


2994 


2 


181 


FHHVGLDI/LTL*SACLGLPKCWDYRREP 
LCPAXjVILLjTAKFTNFRYRVKNVCTFHV 
ESN 


2974 


16875 


A 


2995 


1 


165 


GFHCI SRDGLDLLTS * YARLGLSNCWDY 
RHEPPSPAPLFISYSICLFLSKLIQFL 


2975 


16876 


A 


2996 


405 


2 


KKGFPIFSRMFFLYKHIEKPPLASQKFG 
DPKWS PHPRP I FFFLL.KKGVL.YVWREGF 
KFHPP*FPPPGPPKRWE*RVKPPHPPPF 
FFFFFLYVEMGSHCVAQAGLKLLGSSTP 
PTSASQGAEIAGVSHRAQPHAS 


2976 


16877 


A 


2997 


2 


415 


CLFTGGGLTGIGS AGSSLD I VliHDTYYV 
VAHFHYDLSIGAGFAIIGGLIH*FPLFS 
GYALDQTYARIHFTIIFIGVNLTFLPQH 
FLGLSGMPRRCSD* PDAYTT*SMLSSVG 
SVITLTAVI VLI IMIREAFDSKRTVL 


2977 


16878 


A 


2998 


349 


370 


HHFY * S I TAFRP FQHMELRNF FFFFLF F 
FFFFFFFFVFLFIFLiFLiFYFLFLYEIYL 
ILiFFL*HHFYSAV 


2978 


16879 


A 


2999 


36 


139 


LCHCTPAWETFSKEKKKKEKRKKKKKKK 
KKKKKKKPGGGS * KTALSHDCATVLQPG 
RLFQKKKKKRKKGKKKKRKKKKKKKK 












373 


SADREXXSKTDNLLGH*TNVWKCKVPRV 
IQSVFSSHSGLKIiEINNRKMKEKSLNT* 
KINN I S YWVQCSLYNLKSNS YTKSSDNT 
TTQYMY 








3001 


1 


396 


LDCSKISSYLQKSSSHVLFFSFSFFFFF 
GGGGGFALENPYPPAGLGPQKKKTLS PP 
WLVGGPPFPEKAPP*GGHFLGKKPQKNP 
GGQNPKRPSGEGNPFFQPPGGEKKQIGP 
PTGFGGKPFFSF* PREPPGP 








3002 


412 


104 


FFFFFLPVRQTFYPQPFSVFFPLFPFKF 
FFFP*AFNFFWGFFPIFSPPKFGFFSKI 
SRLVFFSLPFWEKFFFFLPPFFFAPliRF 
FFKGPPXFFFFFFFFFXXY 


2982 


16883 


A 


3003 


400 


46 


LFFFFFYKLFPPPAFGGFFPPFPL*NFF 
FPPGPFFFLGGFPPFFPPPK*VFFPKIP 
PGFFFPPPF*KKFFFSPPPFFLPPPGFF 
FNPPPPFFFFFFFFFFFFFFFFFFLCW 
VQVEAY 


2983 


16884 


A 


3004 


411 


69 


YSPLPPLFFSSPPSKFPWPPFSLFFLTR 
VYKGFFFFPFFPFELFGPPGLSLGSKPP 
PVFLGGDPAFFSIPHPRVGSLPPPPNWP 
FIGPSFW*ALFPAPPLFFFFFLSSFFFF 
F 










3 


200 


DAWG * LFS TNH KD IGTLYLLFGA* AGVL 
GTALSLLIRAELGQPGNLLGNDHIYNVI 
VTAHAFVI I 


2985 


16886 


A 


3006 


259 


146 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFGLLNDILLi ... 


2986 


16887 


A 


3007 


118 


414 


QNQPPQNKATHTVKIEKKEKPETKTVAK 
EHNKAKTAEKSEE * TKKEVKGGKQEKVN 
HTAAKVKEVQKTPSKPKEKEDNKKAAVS 
KHEQKDQYAFLRYM I 


2987 


16888 


A 


300S 


427 


111 


FFFFSPVGNFSPPQQFPFFFPPFPPKIF" 
FFPPPL*FFWGGFPPFPPPPKKVFFPKS 



730 



•ISDOCID: <WO__ 0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalnnine, 
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PPGFFFPPP*GKKFIFPPPP*FCPPPGF 
FLSPPPPFFFFFFFFFFFFFS 


2988 


16889 


A 


3009 


3 


302 


S LAS CLi S YLVC V I F LGQP K P T I * L.QNS T 
PHKK*NPTERYVKTCTQIFIALLFKKEK 
QPRCPSAGEWINKMLYACTIEYWLAIKR 
YE I L I YATV * MYLEKI 


2989 


16890 


A 


3010 


1 


409 


RLHDATFPIIEELITFHDHALVITFLIW 
LLVLCALFLTLTTKLTNTNISHAQEIET 
V*TILAAIILALMVLPSLRILYITDEDN 
DPSLTI KS IGHQWY* TYEYTDYGGLMFN 
SYILAPLFLEPGDLRLLDVDNRW 


2990 


16891 


A 


3011 


157 


2 


GRVDLKIQKLARCGGACLQSQHTQQNHL 
NPGEKGCSES*LPPCPPD*VTKQ 


2991 


16892 


A 


3012 


2 


423 


ARAARAHIVTYYSGKKEPFGY I GMG * AM 
IS1GFLGFIGRAHHIFT* * I DGHTRAYF 
TSAT 1 1 1 AI PTGVKVFI *LATL,HGSNMK 
*SAAVL*ALGFIFLFTECGLTGIVLANS 
SLYI VLHDTYYWAHFHYVLS IGAVFAI 


2992 


16893 


A 


3013 


2 


140 


ARANI LLLTLTAQELWDPRANNLI *LAY 
TLAFIVKKPLYGLHL*LPKAHVETPMDG 
PILLAAKLLKLGGSGIIRLTLILNPMTK 
HIAYPLLGLSL*GI IITRSICLRQTELK 
WLIAYS*ISHIALVVTDILIQTP*SKHF 
TTNSHCPRTMGPQSQQLNMTSLHTSFYS 
KETSLRTPLMTP 


2993 


16894 


A 


3014 


2 


420 


PVLAAG I S I LLTDRDLLTTLFDPGGGGD 
PILYCHLF*FFGHPDDYILILPGFGIIS 
HIVTDYYGRKEPSGYIGMV*AMTPVGFL 
GF I E * AHH I FTVG I DAHTRA YLTS VS 1 1. 
IVI PTWRQVFS * LATLHGSNMT* YAALF 


2994 


16895 


A 


3015 


6 


292 


AHH I FT V * I DVYTRA YFTS AT I V I AI P T 
GVKVFS * LATLHGSNMK+S AAVIi*ALGF 
I*LFTVGGLNGIV*SY*LLDIELHDTY* 
FVMGCPRKVYF 


2995 


16896 


A 


3016 


3 


422 


TPIIIGGFGN*LFPl>IISAPDMEFPRIS 
NISL*LLPPSILLLLASAIVEAGT*TD* 
TVYPPIiAGNYSHPGAS VDLTI FSLHLTG 
VSSILGAINFITPIINIRPPAITQYQTP 
LCA* CDLMTAVLLLLSLPDLAAGI TILL 


2996 


16897 


A 


3017 


21 


490 


TPFPGRHLTMFSLHLAGGCSILGAINYI 
TTLINIRPPAITQYQTPLFV*SVLITAV 
LHLLS L PGLTAGVT t LLADQNLNTTFFD 
PAGGGDPILYQHLF* FFGHPEVYILILP 
GFGIISHIVTYYCGEKEPFGYIGMC*AM 
ISIGFLGIIV*AHHIF 


2997 


16898 


A 


3018 


98 


402 


LRS QHS KS FQ I SGKPSQEE W PQIS PDS T 
DYIINT*H FNAQMLKN I Y * HQ PHGLHDF 
FPKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKASSSSSKKDPPGGGPT 


2998 


16899 


A 


3019 


399 


229 


PPPPGGGGPQGPPPPRGGFLPKSPGGVF 
YPPPRGGKIFSPPPGFFGPPRGFF*GAP 
P 


2999 


16900 


A 


3020 


2 


401 


S D A VL *ALiGF IYLVLEGGLTGI VLADS F 
LDLELHDTYYVGAHFHYVLS IGAVFAI I 
GGFVD*FPLFSGYTLDQTYAENHFTIIL 
IGANVTFLPQHFLGLSGMPRRYSDYPDA 
YTT*NILSSVGSFITLAAALL 


3000 


16901 


A 


3021 


413 


67 


PPPPGKIFFKKTPKKKIFPPPQF* IFFP 
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PLPPKKFFFS PNP*FFLGGFSPFFPPQK 
KIFFPKIPPNFFFSPPLKKKIFFFPPP* 
IFPPPRFFLKPPPPFFFFFFFFFFFFFF 
FFF 


3001 


16902 


A 


3022 


2 


332 


LTLS I LS I IAGS * GGLNQTQLRKILAYS 
SITHIG*IIAVLPYNPNITILNLTIYII 
LTTTAFLLLNLNS S TTTLLIj S RT * NKLT 
*LTPLIPSTLLSLGGLPPLTGFLPKW 


3002 


16903 


A 


3023 






RGPPFFFFFCVFFFFFFFFFFFFNKGCR 
HSKRFFFFFKKKKKSSRPTRDRV*FYPK 
GWRSPLFLFFSPGGRGFFPLSHQVGFSN 
EVL WFKKNLEL FRGS PARKKKKKKKHL 
E 


3003 


16904 


A 


3024 


279 


1 


LGRNTELWKSGKGMD I LKTNCGKIANE P 
FRQPRVTjGIGGEAPRAGSGPPSRAPPA* 
TPGPSSAGSWP* PPGTGRAPRGPAPSAP 
GARSPGRPG 


3004 


16905 


A 


3025 


151 


401 


KKPLGGPNLTGEGKKKFFS LKGGKKKP P 
GKFLKKTFFLGGEKMGKTPPKKLKP * GK 
KKI FKGKRG KKN P KTIAVKKFS KKKKK 


3005 


16906 


A 


3026 


416 




YIjSPLKKFFTPPPLiRMFLPPNPLKNI FF 
PPQLKIFWGGWAQNSPPPKKGFFSKNPK 
RVFLPPPIRKRYNFPPHGKILAPPKNL* 
SAPPPIFF 


3006 


16907 


A 


3027 


379 


31 


PPPRRAGVFFFKNPKPKPPPPREGGRF* 
PFPPIiKF*FFPKPQNFLGGGGAIPTPPP 
KRGLGQIPTERFNL.SPPTQKRINFPPPG 
KGGPPPPLLKPPPPPPFFFFFFFFFFSL. 
NSFI 


3007 


16908 


A 


3028 


28 


420 


MQQTTMAHI FLCNKFANCAHVPRT* S ET 
KPMSTPLQFD * TYKGEKS CKYAEHERTW 
KQ * C VFS LYQ 1 1 P T * EKT WKCNQ CGTNF ' 
NQFFKQTTHL*NHTRDNQICFSKIGLEY 
YYRITTRQHLLKLRTVCIL 


3008 


16909 


A 


3029 


1 


401 


LGNNGEAVSEKRKEKSQKE KSHNVWGF 
FFFFWGKPPFVPQAEGQGRNFC*PKPWP 
PGWGE F PGPAPRGGGNGRQKQ PGRGNFG 
V*REKGGSMGGPRGA*NPGPKNPHPWPP 
QGPEITGGTTRPHPSGIFKKT 


3009 


16910 


A 


3030 


285 


397 


MF I KGDGLNKXRPGAAAYACNPS TLGG * 
AGRITRSGD 


3010 


16911 


A 


3031 


265 




KKBSSCIKATNSNSLFFLF*SVFFFFVF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFF*HFKSALLWVTDLGFLNSD 
PRV 


3011 


16912 


A 


3032 


410 


49 


GFSPPPP*KFFFSPKPLNFWGGGGPNFP 
PPKKRFFPKNPPGVFYSPPKKKKKFFSP 
PP*NLAPPKIFLKSPPPFFFFFFFFFFF 
FFLSNVSNGLTNMYILPCKDPSCPTTFP 
ILGSLISL 


3012 


16913 


A 


3033 


177 


2 


vtppfffffetrshsvartgv* *sdrcs 
lophppsik*sshfslpsswdyrsmpph 
as' 


3013 


16914 


A 


3034 


?5 


395 


MS YKHXXKKKKKKKKKKGGAP * KKPWGG 
PKLTRDGKKKFFSLiKGAKKKPTWKFWKK 
TLILGGRKMGTTPPKKLKPLRKKKI FKG 
* RGKKH P KS L P VE KF ASRGR I KK 


3014 


16915 [ 


A 


3035 


400 


S3 


KKEGPARWSPVTPPLLEGPVGRSPQTR 
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NF*PTPPTHRKSLPS*KSKIDWRGIPLY 
PPPPRVKPKKSFNPGNRRFC*TKIFPCP 
SPWAPKTHPPFQKKKKKRVYK 


3015 


16916 


A 


3036 


3 


196 


DSMPQT * NKS FAKARKKKKKKKKKKKKK 
KKKKKKKRGGP PLKKTLGGP KLPGGKKK 
NFFFFRGG 


3016 


16917 


A 


3037 


1 


411 


FCYDVCVESGCADYS I VI IMKKKKEK* K 
KKKKKKGGGPLKKNPGGAQNYPGVEKKI 
FSLKGGLKKTPRGNFEKKPYFGGGKNGA 
PPPQKNKPLGEKKKFKREKGGKKLQFPW 
GKKISLPGFYLKKIYPPGRGFFNFS 


3017 


16918 


A 


3038 


3 


429 


NFFFKKPRGGNFFPPPKKGFFSPPSPLK 
FFFFPPPFFFFGGGGPHFPPPPKRFFFQ 
KPPRGFFFPPL*EKNFFFPPPVFFAPPP 
VFFLTPPPP 


3018 


16919 


A 


3040 


1 


111 


I GL S GMPRRFS D Y P DA YAT * N I LS S VG S 
FMYLQQDNK 


3019 


16920 


A 


3041 


76 


967 


QLLKGGVSGVCPLLMFRCVRSFFLLVGS 
WSSLASGVKPQTFAVSVTVLKAARLELF 
I PPRGLWSLiASGVKXiQTFAVSVTAHKS 
S VDP KNSGAQLAS P SGS RTRAAGGAACQ 
SRCRVPALLSPWWDGTGRRGAGGGARR 
GGSGRTGAHGVGGRLRHGGLHVPS PAPW 
KGS * GLARNRAQRRWAGTAGG PS TPS AA 
AGPGAKSLTALCEQGWPAAPSAGPTKPT 
PTRNSSWPASVARSPGSRSCLSLHTSLQ 
AEGVGSSLGQPSKGLPQCSGGAEGLLKC 
RQSGS PGRGGTESERGLi*GIjPQCSGGAE 
GLDKCRQSGSPGRGGTESERGL 


3020 


16921 


A 


3042 


39 


141 


LSIRGLNIIIKRQRL*DWIKQQDSTLCC 
P*EIH 


3021 


16922 


A 


3043 


2 


405 


LFSTNHIYIGTLYLLFSTRAGVLGTALS 
LL I P AE LG QPGNLL.GN DH I YNA I VT AHA 
FVIIFFIERPIIIGGYGN*LGPI.IIGAP 
DMAVPRINNITF*LLPPSLLLLLASAIE 
EAGAGTG* TVYPPLAGNYSHPG 


3022 


16923 


A 


3044 


3 


134 


HLN PGGRG CSEPRLHHCTPS WAT E * DSV 
SKKKKSRKGWTGLFI 


3023 


16924 


A 


3045 


44 


187 


DPRVRQYQTPLFV*SGLIFAGLL1jIjSLP 
TLGAGITILLTDRNLHTAVFDPDGGGDP 
ILYQHLF+FFGHPEVYILILPGFGIICH 
IVTYYSGKKEPFGYIGMV*AMISIGFLG 
FIAGAHHIFTGGIDVDTRAYFTSATI I I 
AI PTGVKVFS * LRL IR PNLCRLTS PISP 
NPRGRHHYTTNRPQPPHRRV 


3024 


16925 


A 


3046 


434 


40 


. GAPPPPPGRFFFFJjNPREDTFPPPPQKG 
GFSPPPPPKFFFSPPALFFFGGGWPNSP 
PPQKNFFF*KPPEFFFFPPFLKKKIFFP 
PPL,FFPPPQIFF*TPPPL.FFFFFFFFFF 
FFFFFFFRGCKINF I VRGF 


3025 


16926 


A 


3047 


1 


399 


LFTGGGLTGIVLTNSSLDIVLHDTY* W 
PHFHYGLS IGAGFAI IGGSIH* FPLFSG 
YTLDQTYAKIHFTI I FIG VKLTFFPQHF 
LGLSGMPRRYSDYPDAYTT*NILSSVGS 
LISIiTAAILI IFMS *EAFASK 


3026 


16927 


A 


3048 


116 


367 


GASM ILSSMI FLE CIVGG FPVFS VYLFK 
LQ I LRQS STMCFVLFC FFEIRS CSVTQA 
GV*RRGHGSL*PQPPGLSHPSSRDHGHV 
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3027 


J 6928 


A 


3049 


6 


345 


SQLLRRLRKENCLNLGGGG YSHQR * HQC 

P^K^PLGTTLS^raWLPLSpSK^T 
GPWNWERKNFGKKKGGVIGLQGGKTI PK 


3028 


16929 


A 


3050 


261 


2 


NKKSPPVNLWWKMGFFFKFAKRWLSWK 
GGGGFFFFSWRRSFTFVAQAGMQWRNLS 
SLQ PRL PDLR * S ACLGPPDC* DYRREPQ 


3029 


16930 


A 


3051 


160 


2 


ICVDEQAGVQWRYLGSLQAPPPGLATLS 
CLSLMSSWECRQPPPIjG*FFVCPR 


3030 


16931 


A 


3052 


89 


3 


pltsgllllt*qklapisiiyqispsln 














HVEAPIAGSIVLAAVLLKLGGYGIIRLT 
LILNPLTKHIAYPFLVLSL*GIIITSSI 
CLRQTDLKSLIAYSSISHIALWTAILI 
QTP * S FTGAVI L 1 I AHGLTSSLLFCLAN 


3032 


16933 


A 


3054 


266 


2 


FQPPISAYTKI SPSLNVS LLLTLS I LSI 
I AGS * GGLNQTQLR KI LAYS S I THVG * I 
IAVLPYNPNITILNLTIYIILTTTAFLL 
LNLN 














KYNSLIMPTMIATITLLNLYFYLSPLLY 
**SSCPP 


3034 


16935 


A 


3056 


3 


33 


KNNSL I I PTI I AT I TLLNLYFYLSPLLY 
**SSSPPS 














ANVWAPHGPAKLTNKDN YH I WKS KRLKI 
ANMTIKKLNEVIGLTLPDFKTYVELVQ* 
RQNAID* RKHKQPVKQSPEA* PHS YSQL 
I FHWGAKANHGRKDSLFYK* CW * NWT I S 
SQKLNLHTDLTNFTKIN 


3036 


16937 


A 


3058 


311 


1 


RVGLLLKLNKISWPPPPFYGPS * EKEPL 
CFSQIGL FLTRTMVLNNLHS P PVKTRPY 
NKIAPFRELFFFFKDRVS I CLPGWSAW 
* SQL TATS TS QAS S DPGR V 


3037 


16938 


A 


3059 


. 138 


411 


WERPWKAQEAVFWI * VSAFWAPPPLMEK 
QIPPDLEQHYRNVPGVNRNQPFVSFFLR 
W S LTVAQAG VW WRDLGSL* PLLPGFKRF 


3038 


16939 


A 


3060 


1 


189 


FCRVGQAGLKLLTS SDPPAS AS * SAE I T 
GVSQRAWSKITILKSSSFSYFPNSCKMC 
FWLICLN 


3039 


16940 


A 


3061 


3 


406 


DAWADAWVLILPRFGRTSHIVTYYSGKK 
EPFAYIGMV*AMISMGFLGFIV*AHHIF 
TVGIDVNTRAYFTSATIIIAIPTGVKGF 
S * LATLHGS NMK * SAAVL * AL.RF I FLLK 
KKKKRGAVLKVPWGGPSLPGCG 


3040 


16941 


A 


3062 


451 


82 


PPPTNYFSPPPAFLPGGGGPPRPPPPKK 
WSPPTPPPWIMPPPKKKKKFFPPPRGW 
GPPPKIF*KPPPFFFF*KKNPPFSPPGE 
NRGVFFSTKPPPPWGKKNFAAPGAPPPP 
P FFFFFPRGGG 


3041 


16942 


A 


3063 


430 




FFPPKQLIFWGGGGPKSPPPKKKFFPKK - 

PPGVFFSPPKKKKKFFFPPPLNLAPPKI 

FFKSPPPLSFLFFPFFFFFFFFFFFFFF 

FFFFPLSVQTLLKRTRAPPQPPLLD*EK 

APAPRVP*TGEGMPAVNVAFAPPPFYKE 

RPS 


3042 


16943 


A 


3064 


1 


409 


PTRPRESTYQGHHTPPVQKGLRYGIILF ~ 
ITSEVFFFAGFF*AFYHSSLAPTPQLGG 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glut:imine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HWPPTGITPLiNPLEVPLLTT 


3043 


16944 


A 


3065 


1 


137 


HTFNFSIYQKATVIKTVWYWYNNRHTVE 
SPEINPYIYS*LIFFLF 


3044 


16945 


A 


3066 


4 


426 


KLEN * KMVLKE I KEDLNKQTD I LFS * LQ 
RLITVRMSILPKLI YKFSAIPIQIPA* F 
L* IKI IIKCMRKGK*TRIAETIFFFFFL 
SQS FIL.SPKLDHRGGITANCTPPWAI KG 
KLlili KKKKKP P KT KPE KFF I Q KATGAEG 
GVH 


3045 


16946 


A 


3067 


411 


187 


RNLPNVPPRPTHFVLLVKTGVSQVGQGG 
GKLLASKNPPSPAPPKSWDYRGEPPRPA 
PRKFFF*LNKFKIYGGPKN 


3046 


16947 


A 


3068 


193 


471 


QCTCI KVHSGQKTGSTPL VI GELQI KI T 
LGCYYTPTLMAQIKKTDRTKCW*GYGAI 
GMLILCWRECKIVQSL*KRVWQFII*LN 
IYLAIKLNI 


3047 


16948 


A 


3069 


323 


478 


FFXFXFLFFFILXYFYYF*KLFYLYIjFX 
YI FKI YFI YNFI Y I ILFYI I FI I F 


3048 


16949 


A 


3070 


1 


378 


GTRRFFFHS I TLL I LGLLSNTLTI YQ * W 
RDVTRESTYQGHHTPPCQKGLLNGI ILF 
ITSEVFFFAGFF*AFYHSSLAPTPQLGG 
HWPPTG ITPLNPLE I PLLNTCVLLASG V 
SIT*AHHSLIENNR 


3049 


16950 


A 


3071 


201 


2 


TTPIQLiFIjKHYHT*NFNYNFFFEIGSCS 
IAQAKVQWCDLCLLQPQTLGLKHSSHIjS 
LQSSWDYRHA ■ 


3050 


16951 


A 


3072 


261 


I 


EKAMGGGPFKVKKSPGEGPTTKGWPLKG 
PLEGGQRGLTGPFKSNGRLFFFFFGSNE 
VSLCCPG*SRTPDFK*SACLSLPKCWDY 
RHG 


3051 


16952 


A 


3073 


333 


52 


EIFKKKKKGGGRFFNKRVFPRPRVSNGR 
PRAQFFLETFFFFPERGFFFFFFFETES 
YSVTQAEV*WNYLGSQQPPRFKRFSHLS 
LPSSWNYRCK 


3052 


16953 


A 


3074 


7 


214 


SQLQENRLNPGGGGCGEPRSCHCTPVWA 
TE * DS VS KKKKKKKKRGGWVP PSLGGGP 
KKKNP FFNQEGGIj 


3053 


16954 


A 


3075 


180 


419 


QKHLS I LH YL FKRDVFF FFL KGS FWS Q 
VGGQGHNLG* LKAPPPRLTHFSCLTLRE 
TWKKRPRYFFCFFI KTGFHHVTRE 


3054 


16955 


A 


3076 


371 


208 


QKLSGHGGSRL*S*PF*EAKAG*QLVSG 
GRGCSQL*SHQCTPAWVTE*RLVCKR 


3055 


16956 


A 


3077 


349 


3 


TFCWQKYTMCRSLCHPHPPRTWSTKKKR 
PPFQKGQGDAPPYKKVQRGNPPPPLKGR 
PSRGPPKKCKVFKAPVFLFPP.FSPPPFF 
PPPLFFFFFF*DRVLFCCPGWSAVAQLT 
ATS 


3056 


16957 


A 


3078 


124 


1 


ISSETPAVKSS FWPGWVHACNPSTLGR 
*GKE I TRSRDRDH 


3057 


16958 


A 


3079 


205 


1 


CLVQNIWACVSHWRYIMSCKGC*RI J GI J M 
CVCVCVCVCVCVCVYFCCCCCCCYC*DE 
LSLCHSDCSAVA 


3058 


16959 


A 


3080 


3 


413 


GHKTQPVQKGLLYGI I LFIT * EGFFFSG 
FF*AFYHSSLAPTPQLGGHWPPTGITPL 
NPL*VPIjLNTSVL1iASGVSIT*AHHRIjI 
♦NNRNQI IQALLITILLiGLYFTLLQASE 
YFETPFTISDGIYGSTFFVATGFHR 


3059 


16960 


A 


3081 


1 J 


321 


NSLNPGDGGCS * PRLHYCTPAWGTELDS 
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Predicted 
beginning 
nucleotide 
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ing to first 

residue of 

peptide 
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Predict- 
ed end 
nucle- 

location 
correspon 

last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glut»mic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=]VIcthioninc, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R— Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Impossible 














ISKEKKYPYHVyHPSMKMTIYNYRQQNV 
LP I VLGEGEKNQHS I SFKLFFNF 


3060 


16961 


A 


3082 


117 


484 


VLKYFIDSEVNAVLFSISCSFVTDFVFL 
FFFFGKGVSFCPPAGIKGGGFGFLEPLA 
SGFKRIFFPNPLEKWE*RARPPPRGKFW 

KGPPWGYPHF 


3061 


16962 


A 


3083 


381 


227 


CISRDGFYHLGQAGL.ELLTSSDPPALDS 


3062 


16963 


A 


3084 


2 


391 


SHAYHIG*PSR*PVTGALSDLLMTSGLA 
M*IDFHSITLLILCLLTNTLSIYQ*WRD 
VTREST YHGHHTP P VPKGLR YG 1 1 LF I T 
SEVFFFARFF*AFYHSSLAPTPQLRGHW 
PPTCITPRNTLDVPLLNT 


3063 


16964 


A 


3085 


2 


248 


IMRSGDRDHPG*HGETPSLLKYKRLAGH 
GGMRLWSQLLGKGGTADSHHHVLLILET 
FYSLRERRHLTSVPTLGMNYWAQDIR 














EKWPD*SRAACPVLCRGNGQYSKGRCLC 
FSGWKGTECDVPTTQCIDPQCGGRGICI 
MGSCACNSGYKGESCEEAPRYI PEKE 


3065 


16966 


A 


3087 


3 


130 


GFYHVGQAGLELLTL*SACLSLPKCWDY 
RREPPRPAHTPPHS 














SSSSVFCLLVWTSSSSSSSAARLPPLTG 
FLPKWA I I EE FTKNNS LI I PT I I ATI TL 
LNLYFYLRLIYSTSITLLPISNNVKIK* 
QFEHTKP 


3067 


16968 


A 


3089 


404 


60 


FSHGKMRFFSPPSPKKIFFSPQSFYFLG 
GGGAKMPPPKKRFFFKKTPRGFYFPP*K 

FFFFFFFFFFFFFSPTFFTVFHLMLKSD 
ND 


3068 


16969 


A 


3090 


405 


186 


INKKPEAFTNTVDQMVLTNSKRTFYPTA 
KTEITLSTLSNHKLEP 


3069 


16970 


A 


3091 


371 


1 


SPPLVQKGLRYGI ILFITSEVFFFAGFF 
*AFYHSSLAPTPQLGGHWPPTGITPINP 
LEVPLLNTSVLLASGVSIT*AHHSLIEN 














NKRSPPVNLWWKMGSFFKFAKRVKISWK 
GGGGFFFFSWRRSFTFVAQAGMQWRNLS 
SLQPRLPDLR*SACLGPPDC*DYRREPQ 


3071 


16972 


A 


3093 


1 


392 


FFFADFKKMFILINHFKMELTTYFELKR 
NEATASENC*DAVKAVLGGKF I VLSTYI 

KKKKKKKKVTGPEI PKFLIVKSGKPPKV 
ILTGAWGPIKFLSFITRL 














* *GHGTSGTLIYHWQE* KMVQPLWKIV* 


3073 


16974 


A 


3095 


3 


257 


HEVSQDGLNLLTS*SARLGLPKCWDYRR 
EPP CLAWL I LPDDCVI FQKLKLLHHNLL 
NLSCIDVLMGIYSLSNFSQSNFPFFFFF 


3074 


16975 


A 


3096 


189 


414 


KGLVMVTS * CKM FFCLSIFF FF FE RG FC 
FFAQAGVQGHNLSSLEPLPPOLKQFFCL 
TLPRS * KYRPAPPCP ANFY 


3075 


16976 


A 


3097 


1 


299 


ENYRP I S FMNTDAKI LNKILANQI QQCS 
KRITHRDQVGFLPGMQGQFYI * KS I KSI 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GJycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proiine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NAIHH I SRLKKRNHMI PS I EAKKAFNTH 
S* FMKKNKKLAFQET 


3076 


16977 


A 


3098 


362 


126 


FIHTHTHTHTHTHTHIYI YIYI IASFQW 
NAIRGRM KC INKP * KDMEELKC I LPSER 
SQSAKATYLLYDSNYITLEKAKL 


3077 


16978 


A 


3099 


3 


423 


RHEHAYHIVKPTP*PLTGALSALLMTSG 
LAM * FHFHS I TLL I LGLLTNTLTI YQ * W 
RDVTRESTYQGRHTPPVQKGLRYGIILF 
ITSKVFFFAGFF*DFYHSSLAPTPQLRG 
HWAPTGITPLNPLEVPLLYTCVLLASGV 


3078 


16979 


A 


3100 


392 


83 


LRNTNCGHGA P FKNS PFFPQ FLVKNS PA 
YFQNKEKDVGKPPFFFRAPDGAPFFKKK 
KK* GLALSSRLEYGGMISLELLGSSDTF 
ASAS RVARTTGQCHHAGPT 


3079 


16980 


A 


3101 


81 


247 


GGWGGPPLKPPLF*KKTGKNFWPPFFKG 
KEKPPPNPRGGGKKGGPKPPPVIFFFF 


3080 


16981 


A 


3102 


3 


388 


HEKKEPF* YIGMG*AMISIGFLGFIV*A 
HHI FTVG I YVDTRAYFTSAS I I I AI PTG 
GKVFS * LATLHGNKMK* S AAVL * ALGFI 
FIFTESGLTGIVLSNSSLDIVLHDTCYL 
VAHLHYVLS IGAVFAI 


3081 


16982 


A 


3103 


3 


387 


HERHEELSNS * ANNLI *LAYTI AFIVKI 
PLYGLHL* LPKAHVEAPIAGS I VLAAVL 
LKLGGYGIIRLTLILNPLTKHIAYPFLV 
LSL* G 1 1 ITSS I CLRQTDLKS L IAYS S I 
SHIALVVTAIL I QTP 


3082 


16983 


A 


3104 




381 


HEQSHAYHI VKPSP * PLTGAL1AL.LMTS 
GLAMGFYFHFI TLL I LGLLT YTLTI YQ * 
WRDVSRESTYQGQHTPPVQKGLRYGI IL 
FIT*EAFFFARYF*AFYDFRLGPTPQLR 
GHWPPTGITPLNSL 


3083 


16984 


A 


3105 


412 


III 


FLGHPLFKKKMGGKKRGLPKMGV* HPPA 
PKGKPPPLKKKKKKPGGGGAPLYSPFSG 
GEGKKNPLTPEGGGPKNPNSPPPPPPGG 
KKKNPPPLSPKKKXKK 


3084 


16985 


A 


3106 


298 


98 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
F*L* I IY*AICAINNIRGAGAELYRSRD 
FVCY 


3085 


16986 


A 


3107 


2 


417 


ATHVGLQDVTS PM I EQLITFHDHALMKI 
L 1 1 C FLDLG ALFLTLATKLTKTNM * YAQ 
EI*TD*TILPAIILNLMALPSLRMLYIT 
DEGDDPS LTIKSI GHQWY * TYE YTDYGG 
LIFNSYIRPPLFLEPGELRLLDVDNR 


3086 


169S7 


A 


3108 


8 


430 


VGLVLFLSELPLNGGILTFFHQGIYSPF 
PGGRTWALMVGSWGLVMASTDLLGPLCH 
A FTPATQLLLNLAVAS PL * PAALR IGCH 
S KTTNALTHFLPRGTPTPPR P I LVTI ET 
MSLL I PP I APDQRLAAGFTARHLLLHLS 
V 


3087 


16988 


A 


3109 


3 


453 


PRAIKFYRDWPGHERKRIAWKGDPCHMV 
L I KDE KGLMCQKKKKKTPFFWAP KI PLV 
F P PAQKNQGS * PNP PGGGGNP PLRPGP* 
RKKPPALMGPPPSSPGGINPKSFFYLNP 
GPAH * P RGEN * LG P VFENLFL PLLKKIF 
LWGELRCSQT 


3088 


16989 


A 


3110 


398 


2 


SLFQKNPNPLVG* KKKGKALPQGPPPLF 
PPLGGAGPGGSQGRGWGPPRPPWGNPFF 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylalanine, 
G— Glycine, H— Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R = Arginine, S— Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 














F*NPKIWPAKGGAPFSPPSEKG*AGGFF 
liPRGGGFPLTQ IPPPP PHLGGKS KPPFQ 
KKKKKNRQGTSWMMQGWGPQ 


3089 


16990 


A 


3111 


3 


371 


SSDPPTSAFQGSGTTDMCHHHHAQLIFN 
FFVETGS C YVAQAGVQWHDHGS LLSQTP 

SLVLMKPRGSLIRSACPDCFLVFWFSFF 
HE AEGCAS EC 


3090 


16991 


A 


3112 


233 


449 


FAL FS * LATLHGSNMK* SAAGL * ALG F I 
FLFTAGGLTGIVLANSSLDIGLHDTYYV 
EAHFHYGLSIGAEFAIIGGFIH*FALFS 
GYTLDQTYPKIHFTHFIGGNITFLPQH 
FLGL.SGMPRRYSDYPDAYTT*NILSSGG 
S FNALNAGS IT 


3091 


16992 


A 


3113 


458 


2 


RGPPPPPPPKFFFF*TPGKNSPPPPPEG 
VFFPPS PPPNFFFS PPPLFFFGGVPPI S 

PPPLFLPPPPFFFYPPPPFFFFFFFFFF 
FFFFFFFFFFCGDLEGELPGTGMLACVI 
LLRANRKARTRG 


3092 


16993 


A 


3114 


I 


418 


LNTTFFYPDGGGDPNLYQHLY* F*GHPD 
PDIHILPRPGIRSHIDTDYSGKKEPYAY 
VGM G W AM T S I G F Jj W LMVRAR P L FT VG VG 
VDAQAYSSFASITIAIPTGAEVFS*LCP 
LPLSGMK*TGAAVWALGLRFIFTCSGR 


3093 


16994 


A 


3115 


1 


425 


PRINNI S F * LLP PS LL LLLAYA I VEAG A 
GTG*TAYPPLAGNYSHPGASGNLTIFSL 
HLAGGSSILGAINFITTIINIKPPAITQ 
DQTPLFV*SVLITEDLLFLSLPGLAAGI 
TILLTDLNLSTTFFDPAGGGDPILYQHL 
F * FFD PAGGGD P I L YQHLF 


3094 












S LLIRAELGQPGYLLGNDH I YND I VTAH 
AFVIIFFIVIPTVLGGFGN*LGPIiIIGA 
PDTAVPRINNISI*LLPPCLLLRLACAI 
EE AGAGTG * TVYP PL 


3095 


16996 




3117 




259 


P TR PAL WTA I L I Q TP * S FTG A VI L 1 1 A 
HGLTS SLLFCLANSNYERTHSR IIILSQ 
GLQTLLPLIAF**LLASLANLPAPTPTP 
HQ 


3096 


16997 


A 


3118 


3 




Li I RGGRGCHELRS RHCTPAWATRARTLS 
QKK*KTTNPKKKLCLIFFGGKKKKLKKG 


3097 


16998 


A 


3119 


155 


1 


PDFFNKSMDKKKTARGWEDSSSSFCFFK 
RDRVLLCCPGWSAVAQS*LTAAS 


3098 












FLKKPPGVFFFPPIiKKKKFFFSPP*FLA 
PPKIFFKRPPPFFFFFFFFFFFFFFFFF 
FFFFFFFFFLVFIVLFQV'KVHFLKKCFN 
IQFPIASDNS * PSM 1 HEKFYCESN I EF 


3099 


17000 


A 


3121 


48 


387 


RDPVLQKKEKKKKKKPKNQKKKKKKGGP 
F*KIP*GAKIKPGKEKKNFSPKRGAKKK 
NPGNFEKKTNFGGGKKWGKPPPKN*RFK 
GKKKFLKGKGGKKTQI PWGLKI FFNGFD 


3100 


17001 


A 


3122 


237 


2 


P FP WLPP FPLKASSS P * SL * FLLGGWP 
NLPSSPNKGSFPKFPSWFLFRPP*GKNF 
YLALPR * P WPPQGFL * TAPP 


3101 


17002 


A 


3123 


448 


130 


PPRFFPEFFYSGPPKPPFFKTPVFIjGVK 
PGVFFFSPYQKKPTNFGPKMGAL*RrPL 
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ed end 
nucle- 
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ding to 
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acid 

residue of 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pi-oline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, >-=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FETPIWVFPIKGFHKKKPPVLN*PPTRK 
PPPDKILKKKKKDDCISR1ARN 


3102 


17003 


A 


3124 


453 


3 


SSAREGGGVPPPPPKIIFFPPPPIFSWG 
GGGTKSPPPEREVFPKNHGGCFFSPP*K 
RGKYFSPPPRMGPPPGVFFKGPPPNFFF 
FFFFFFFFFFI FFFFFFFFFFTKKKNWF 
FFKAFRMSPKPVPPFFFFCNYRWFRPR 
VRPRVRPRV 


3103 


17004 


A 


3125 


443 


2 


YFPPFGRVI WGDSLiGAGVLNPPRPHKGT 
PFFPKKIFSIRPGWGGGPPLPPPQRQMW 
GGP PP PLFGLFRPQGKI PFP PKKKI KPP 
GGVCVGVGVCV+KKKKNPFPPFLWAGL.K 
EPCFFFFFFFEMEFCSCCPGWSAMARSQ 
LNCNLHL. 


3104 


17005 


A 


3126 


3 


186 


PVI YSTIFAGTLITALSSH* FFT* VGLE 
INMLKKKKKKKKKKKKKKKKKKKKKKKR 
GGGL 


3105 


17006 


A 


3127 


171 


38 


KKKKLFFPPREKWGPPKNFLKRAPPFFF 
FFFFFFFFFFFFFWSERSS+VA 


3106 


17007 


A 


3128 


16 


189 


ILGEVIWV* *FF*FIKKKKKKKKKKKKK 
KKKKKRGAPFKKTPGGPQITPGEKKKIF 
SL 


3107 


17008 


A 


3129 


401 


85 


LVNFFSPQEKRGFFPPPPPKNFFFSPRG 
FF FLGGGGP IFPPPKKSFFS KNPPGVF F 
SPP*KKKIFFFPPPLFWAPPRFFFKGPP 
PFFFFFFFFFFFFFFFFFSQF 


3108 


17009 


A 


3130 


2 


312 


ANNL I * LAYTI AF I VKI PLYGHjHLi * LPK 
AHVEAP I AGS I VLiAAVLLKLGGYG 1 1 RL 
TL I LNPLTKHI AYPFLVLSL * GGGVF 


3109 


17010 


A 


3131 


449 


3 


FFFFFFFLGGPPEIFFFFFCPPKKPKPP 
LGGGKKKPPFF*NFPQKPLGVLGAPPPP 
LCFFIKKKKGGGKKFFSPPLF*KGGPFK 
KFFFFPPPPKGPPPFLKNLRGWVFFKPP 
PKKKALSFFKKKKKKKKKKKKKKRAAAR 
DLELADAW 


3110 


17011 


A 


3132 


95 


448 


VINRE * KCVC * MKKKKKKKKKKKKKKKK 
KKKERAGKGGG 


3111 


17012 


A 


3133 


60 


442 


LGGFFFFGGKKGFCLWCPRWGAKAG I PV 
NGTPPRGV*RNFIoAQPP*EGGITGPPPIj 
PQ*FGFLRENGVPLRGPGGFEPPILGEP 
PPLPPQKGGKNGRNPPPPLKGFLVLFKK 
EFSSLVPSWKARGDP 


3112 


17013 


A 


3134 


236 


45 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFGAAD* VILFRYFQ 
FEATIMGV 


3113 


17014 


A 


3135 


441 


88 


KQQTPPGLI FF* KAPRREI FLPPPIMVF 
FSPPSPFKFFFFLSPFIFFGGVLPFFPP 
PKKGFFFKNPRRVFFCPPLKKKNFFFLP 
PFFFGPPRVFFYPPPPIFFFFFFFFFFF 
FFFFFF 


3114 


17015 


A 


3136 


2 


328 


TMLSPKPQQLNQQNCSPEHYEPQLKTQR 
TWR* KKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKRGGGQKKKMVGGEKKKPG*KIFF 
FF I KKVKKKTALGDKKKTQFWGGS 


3115 


17016 


A 


3137 


281 


2 


KATKSGTPIPSQGQQSLAWSWAGIGSAQ 
PPALLHS * P IGKI FKNCMP VGRKS PQLP 
RNTSWQLGAVAHPSNSSTLGGRGGRI TR 
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C=Cysteine, D=Aspartic Acid, 
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K = Eystne, E=Ecucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














SGVRDRTRG 


3116 


17017 


A 


3138 


448 


3 


FFFLPPSPFFPY*KRGFSLFGRWFNFP 
PPVFPPPWPSQ I FGFQALI FLPPPPPLF 
FSFPVGFFQTALFPFNGFFPGFFHGLFP 
LiLRFCPRK*VFWGGFFFFFFFFIjRDKVS 
LCHPGWNAVAQSEFTTALTSKAQAPTRP 
PTRPPTRP 


3117 


17018 


A 


3139 


2 


436 


DR*LFSTNHKEIGTLYLLFGA*AGVLST 
ALSLL I LAELGQPGNLiLGNDH I YNVI VT 
AHALGKIFFIAI P I I IGGFGN* LAPLI I 
GAPEMAI PRINNIS *GLLPPS ILLLLAS 
AIEEAGAGTG*TDYPPLAGNYCHPGASG 
DLIIF 










362 


2 


KPRRGKFFPPREGGGGFPPPPPKNFFFP 
KGGKFLGGGGGKNS PPQKKGFFQKNPGG 
VFFPPPKKKKNIFFPPGKMGAPPGFF*R 
GPPPFFFFFFFFFFFFFFFFFGQSGQVK 
LKSPKCKL 














PTPPPCCKFSFKRPPKKTLFFPTTNLVF 
FSPIPP*NFFFSPQALIFVGFLAPIFPP 
QKKFFFSKFPPLFFITPPLIKKFFVPPP 


3120 


17021 


A 


3142 


1 


79 


FKLDYFSI IFIPVALFVTWSI IEFSL* Y 
INSDPNINQF* KLDYFS 1 1 FI PVALFVT 
WSIIEFSL 


3121 


17022 


A 


3143 


3 


441 


FFFP P PLKKKKFFP PPPNIG PP P KSL * K 


3122 


17023 


A 


3144 


223 


3 


LiPYW KL P Y tiKH * * *IjQDTNQESRG*HFL 
RPRPFKNQMKSGTVAHACNPSTLGGRGG 
RITRSGV* DHPGQHGEI 


3123 


17024 


A 


3145 


65 


414 


KKKKKKKKKKKKKKKKKKKKKKKKRGGP 
P KKKTRGG PQNP P P KKKKFP PQKGGKKK 
PPLGF*KKTPPLGGEKIPPPPPKXNTPP 
KKKKKF*GGGGQTPPPPPPGKKFSPPRN 
KKKK 




17025 




3146 


410 


31 


RANQKAFRGKPLCDIiAVGKNLSSRTQIA 
LTI * KWINWTILKLRTSGH* KTPIKTIK 
RYPIEGEKISDEEIjYIjQy*KVL* IGKKK 

QR*LGSMAHICSLS 


3125 


17026 


A 


3147 


182 


241 


SHPSHHSTINITNKGLL*TPLPIPNPLV 
NLNLGLLF I LATS S LAVYS I L * SGGASN 
SWYALIGALRAVAQTISYEVTIAIIIiLS 
TLLIRGSFNL 














KTKKGLKIKDPIjTRF* isvc* ITKSI*F 
FKLLPFFPLiLKEH I PLKYLFFFFFLFET 
E FRS CC PGWSTMV* S LLTAT S TSQI OA I 


3127 


17028 


A 


3149 


499 


2 


NTPPAAGGGCFFFFFFGEKNFPPPHPTP 
RFFPPPPLKNFFFFSRVFFFWGGGAQKA 

SPGFFGAPPGFFLSGPPSFFFFFFFFFF 
FFFFFFFFFFFFFFFFLINFLLSLQGFF 
LVSKELFLFGLTVKFTRGFRGFCGQ 


3128 


17029 


A 


3150 


256 


146 


KNAKVTQVGPEFNKGPG * HTHTHTHSHT 
HTHTHTLQ 


3129 


17030 


A 


3151 


333 


1 


TI SCLCTRGEHPLS PRRAGPYTGS PLHC 
CVDWDKVFS SWKDLTDWPLGDLD IEYF 
TDGSS FILRGVCRAGYAAVTLDSAVEVL 



740 



JSDOCIO: <WO 016483SA2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A)anine 
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SVSAETSA*KAELIALTRALWLGKEQK 


3130 


17031 


A 


3152 


43 


379 


NKTSLFGWDYIWEWGAPEPETPPKRAAG 
ALHSLAQPFSVALPPCFFDPCAPSLSPG 
♦HALRPLPLPGLASEIQTAPSWHVPPKS 
LSPAPQPCPIPTLVPVGYKTPP 


3131 


17032 


A 


3153 


398 


12 


NTTPGGGKFFLKKTREEKFFPPKKKRGF 
FPPPPPKNFFFPQGGNFFGGGGGQISPP 
QKKG F FQ KNPRG V F FT P PKK KK I FFS P P 
GKMGAPPGFF*RGPPPFFFFFFFFFFFF 
FFFFFFLKKSWRPLAI 


3132 


17033 


A 


3154 


371 


105 


SPSPQVNFIKGPKPPPPK*IL*RAPNPP 
LPKKKFSNPPTWGPGPQPPPS KFS KFAR 
FPFLPPPFPFKKEPPKKKIFFPTKEGTV 
I * KNP PFSGFQS PDS I K 


3133 


17034 


A 


3155 


3 


371 


DVGAD P I LHTS TGL * LAMQY * PEA* TAF 
S S IAH I T+ DVY YG * VI RYLHANGA* I F F 
ICLLLHIGRGLYYRPFLYSKT*KIGLII. 
LLiATITTAFIGYVLP*GPI*F*GATVMT 
NbLSAIPYIGT 


3134 


17035 


A 


3156 


1 


398 


IAIPTGVKVFS * IiATIjHGSNM K * S AA VIj 
•ALRYIFLFTEGGLTGIVLPNSSLNIVL 
HDPYYWAHFHYVLS IGAGFAI IGGFIH 
* FPLFSGYTLDQT YAI IHFTI FIGVNLT 
FFPQHFLGLSGMPRRYSDYP 


3135 


17036 


A 


3157 


401 


46 


PPGGRNF FKKTPGKK I FSTKKKKGFF P P 
LPPKNFFFSPGGFFFGGGGGPNFPPPKK 
GFFSKNPRGVFFSPP * KKKI FFFPPG* I 
WAPPRVFLKGPPPFFFFFFFFFFFFFFG 
KKSVRA 


3136 


17037 


A 


3158 


404 


18 


FFSPPPPPPGGGVFPPNPNKNPFPPPPP 
PFFLGGGPPPPPPPPP*FPPPPTPPNVF 
FFSPPKKKNFFPPPPGPPPPPPKPPPPP 
PPPFFFFFFFFFFFFFFFFFFFLFFFEP 
MEKGPRPGDIGSNPPS 


3137 


17038 


A 


3159 


3 


192 


SLVIAGCPR*NLSSTLNLPTEPSKSPCK 
FNC * KKKKKKKKKKKKKKKKRGGALKKN 
PWGGKK 


3138 


17039 


A 


3160 


205 


80 


VQRDNFGFLiQPSPSGVKLFFCLSLPNKW 
DFRCGPPNPG* FFS 


3139 


17040 


A 


3161 


3 


384 


LIVPTIILLPLT*L,SKKHII*IINTTTHS 
LIISIIPLLFFNQINNNLFSCSPTFSSD 
HLSHP I LKKKKKKKKKKKKKKKKKGGGA 
FKRTPGGAAHWGGGGRETFFPKGGEKKN 
RPGVFWKQTFFWGGK 


3140 


17041 


A 


3162 


348 


70 


GPPPKKRVFSKTPKWLNKPPQKKKKFI 
FPPPVNLGPPKNFLKGPPPFFFFFFFFF 
FFFFFYFFWMGCDR*CSWRHSSPPRLSG 
TPKCSPSVT 


3141 


17042 


A 


3163 


2 


353 


LKTI PLTSTCLTIGSLAIiAGI PFLTGFY 
SEDHI I * TANVLYTNA* ALS ITLIATSL 
TSAYSTRI ILLTLTGQPRFPTLTNIETK 
KKKGG P FNR YPLG AQ VYGGGQN E KF FL I 
GREII 


3142 


17043 


A 


3164 


1 


221 


PTRPRDCSELRSCHCTPAWATEGDSISR 
KKKKLSTRTAF*YTEAINSLIYSLNIGT 
FKTYFPKIKTYDRHFDF 


3143 


17044 


A 


3165 


26 


383 


IP FYQ*SLI * YTRKKKKKKKKKKKKKKK 
KKKKRGG 
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E=Glutamtc Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
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nucleotide deletion, \=possible 
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3144 


17045 


A 


3166 


251 


381 


GNLCAGWAR WLTP WPAL WEAEAD * S RG 
QE IGAI LiANTVKPHIj 


3145 


17046 


A 


3167 


391 


115 


LFKKISPHAGIWGFFSPLTP*NFFFSLE 
PFIFGRGLAPIFPPPK*RFLSKNPPVVF 
IPPPLMGKPFPPPPPVRLGPPIYSFKGA 
PPFFFFFF 


3146 


17047 


A 


3168 


94 


389 


SPG I LGQKGQ IGP IGNHVPGGLiAAP VTP 
*FRFKPRLP*GFGPKASPPLALKPERAQ 
VGGTPPPGPRGPNGQPPPFKENQPGI.GF 
RFRGKLiAEKRGFHL 


3147 


17048 


A 


3169 


3 


363 


WATALQ PGQQS ETQSQ KKKKKKKKKKRG 
GPPFFFFFKKKI FFSPPGAKNKRGKFF * 
KPPKKKKKIFFFPPPPLKKKKKKKKKKK 
KKKKKKMGGAFLKKPRGAPP P PGKRKL I 
FFFLKGV 


3148 


17049 


A 


3170 


2 


171 


KEPLGYIRMV*AMISIGFLGGIV*AHHI 

FS * LATLHGSNMK * S AAVL * ALGFI FL F 
TVGGLTGIVLTNSSLDIVLHDTYYVVAH 
FHYVlil)IGG*FYIiRYHNHCYPHRRQSI 


3149 


17050 


A 


3171 


390 


27 


QSLTVKS PYP WILI KTKGHH * VMNAGL 

VSESPVKHSCVOVLDSVYSSGPNL*DHP 
* TS VDWE LYVDG I S FANPCKVS LKKMTS 
PAPVTPRS 


3150 


17051 


A 


3172 


127 


310 


KNPGGAKI LRGGERKNFFLKRGGKKKHL 
GIFGKKTFFWGGKKWAKPPK*WRFFSPF 
SP*KFFFSLKALIFFGGFCPFFSPFKKS 
FFSKNSQVFFFSPPFKEKIFPFPPP*NF 
GPPRVFLKGPPPFFFLGVFFFFFFF 


3151 


17052 


A 


3173 


376 


3 


FFFFFLRRQSLAVTLAGV*RCNLSSI,QS 
PPPGIK 


3152 


17053 


A 


3174 


365 


14 


RENFFSPEGGAPKNKPPPPPPPPGGKKK 
IFFQKKKKKKIFYPWKNFPPPPKKKKKN 
P*KPGPLKSQGFFFFKKNQNLPWGPPQI 
SFPKKKKKKKAKTVQERKYNSNTQLVSA 
ETQLL 


3153 


17054 


A 


3175 


1 


377 


VPLHS SLGGKARLHLRKKKKKNPGFLKN 
FGPLALLGMGVGNI KGLKGQKGKNPAFG 
AHTGGGCFSLRGRNTPFPKRAEGIICYN 
SPH* KET* KALE PRGYKGL* QALALPNL 
KSGKMEH I LRGAP 


3154 


17055 


A 


3176 


352 


125 


GHEVLDSSDLPASASQSAGITGVSHHAL 
* E I LL S M FE TT WACDLLFQN I S F I KS S I 
PCFIGLDFIMPHRYCRFFF 


3155 


17056 


A 


3177 


139 


366 


TAHTS * GY * VKNY INLSFCFFFFFLERN 
LSLPRSWDNGLVPQHLVNFF 


3156 


17057 


A 


3178 


206 


1 


KGTLFKKDFFFKNFPKKVFLGVPKNSLF 
*KILTLPPVLNPYPFFFFFFFETVSLCH 
PGWNAWPRLEP 


3157 


17058 


A 


3179 


330 


1 


IVSTIiETCYIAYNEEEKDTFITLRIYVI 
GGNGKFLG 1*1 KQH I KKI I HHDQVGF I P 
GMQGWFNICRSINVIHHISRIKGKSRAQ 
WLTPVIPALWEGDAGGSPEVRSSRPA 


3158 


17059 


A 


3180 


3 


399 


HASAPIiQSSLGYRARPCFKKKKKKKKGG 
GGGGFFFPXGGFGPLPKKGFFSGKGGLG 
KWGLGGAGKTPGIKKPLGKGPPKKRGGK 
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T=Threon«ne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KGQKGPHF* KKKKKKGG KKLKKGP F * KP 
FKKPLLGRGVQP*NPHLLGG 


3159 


17060 


A 


3181 


2 


204 


CPTACPFW*NKELLMPKKKKKKKKKKKK 
KKKKKKKKKRGGAPFKNSLGGPHFPGAG 
KKNFFFFWGGY 


3160 


17061 


A 


3182 


71 


377 


PKRGGQPKQKKIWGPPPPGAPPQKGMGF 
FNPRGFKKQITFFPPPPPPPEKKPPFF* 
KKKKKKKJCSNCSD 


3161 


17062 


A 


3183 


151 


2 


FFFLEELVPLLLKIjFQKIEEEGCSPNSF 
NEAFIILIPKPFR*TTKKENF 


3162 


17063 


A 


3184 


152 


387 


YFQGFIYWACICTSFLFFFFFLKREFC 
FVTQVEVQGPNFT* LNPPLFGLKKFFCL, 
TLQIGWNNRPLPPPQVIFCFFK 


3163 


17064 


A 


3185 


22 


336 


YEKCTALLQMVSSFIWMEREGTHQYSFY 
RKDFSLASKVNIVSYYLSPIVE*FFFFL 
RGSPFAPQAGGQGPNIjGSWKPIjPPGLMP 
FSCLTLPGGWNCRPPPPGPVN 


3164 


17065 


A 


3186 


344 


1 


WVLKKIFFYPGRGGPPFIPPPLGGQGGP 
IPWARGF*PPRGPPPKNGF* KKKKKRGG 
GGPPPGFPPPGGPRGGVPFFIiGGGGPRK 
PKKI TKKKNPGEKKKTS FKNQKRKTKI K 
TT 


3165 


17066 


A 


3187 


296 


1 


NPKKI LTLPKKT KVYKCEGENQVP I I FQ 
GIKNI FWKGI F* PKKEREVCV* SMRHVI 
PVFPKKRGSKRSNKSCCYKDTCTRMFIV 
ALFTIGKTWKQPKY 


3166 


17067 


A 


3188 


2134 


1 


GVAAHACNASILGGOGGRII *GWEFETS 
LANMVKPC 


3167 


17068 




3189 


1 


159 


LQDHPG*HGEPPSIVKIQKLARHGSLRL 
* S * LLGRLRQRMRQETCLNPGAR 


3168 


17069 


A 


3190 


119 


340 


Q I K KNRL VS ARG KNNKRK * I YKP * VD I F 
FKEDIQMAGKQMKRDLISLI IREIQIKT 
TITCYLIHARMGTITRD 


3169 


17070 


A 


3191 


75 


1 


LS VNNF WPGTVAHACNPSTLGG *GG • 


3170 


17071 


A 


3192 


343 


1 


IFILGGGPCCSPVFFQFFGGGGGGFFLP 
QNFFPPRGKIFRPFFF*KKKLKRPNWGF 
F*NFFNPPIjGFFNIjFFFFFKKKPKNFFF 
LGGFFFFFFFFFFFFLAGGDSLALSPRL 
EC 


3171 


17072 


A 


3193 


105 


368 


KFKDPPFPPPFFWPPPKQLPPPPGKIGA 
PHF+ TPKGPPP PKKKKC+ KKKILKGGRG 
KKKKKTPPKRPQKIWGPSKKKNPWGGGK 
TPPL 


3172 


17073 


A 


3194 


1 


365 


FCRDEVLjIjFCPDWS PTPGLKQFSHL.CLP 
KCWSYRC* PPYPAQGVFLK*HLTKSLSC 
LKLFMASLCLQDKVPAPQPCVKGLSKFF 
LCHLLSTLIPFTHSLFFLFLGWFLRQHR 
SVTQAGVQWR 


3173 


17074 


A 


3195 


1 


362 


GNQPKRLNAGTYLLFYTIoEGSVPLIilAL 
MYTHKTLWSLNILLLTLTGQELSNC*AN 
NL I * LAYTRAL I VM I PLYGLHL * LPKAH 
VEAPIAGS I *LAAVl.IiKLGGYGI IRL.TL 
ILNPLTNT 


3174 


17075 


A 


3196 


137 


3 


KGQPRFPPISLKKGSQGKKGFFFFFF*Y 
RIPLCHPGWCPWQSW 


3175 


17076 


A 


3197 


1 


266 


EGRGCSEL, * S CH CTPA* VTVRYPVS KKK 
KKKKKKG GG KGKKKGG EKNTL FG PKKG K 
LRGPQKRGKKIGPEKKVGNKLKKGIFFR | 
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EKTF 














SIRRIIFYSLIKPSIND*GERELiEPITT 
SQALQIAGRAGRFSSRFKEG 


3177 


17078 


A 


3199 


345 


314 


QPGPEGKIRFFLKIPNLTPSGGKSLKFP 
LFKRVKPENCLSLiRG*GCN* PI 


3178 


17079 


A 


3200 


2 


330 


SRHYTPAW*QSKTLSPKKKKKKKKKKKK 
PPFPKNTQKKPKKKKRGGLNWGVKTPPP 
liKTQKMGISPGKKLECKNLFPLKKKKPEG 
NPLF* FRGF*KKKKAPFKKKKNPPK 


3179 


17080 


A 


3201 


2 


385 


FPPFLGGPQIPKFLKFFLRAN*NLFGLF 
FLGGVLKKIFLRKPFSLWIKPFPTPFKG 
KKI FFKTFFQKPLFFF* KKKFFFCFPPF 
FFLSRGFFVFFSPKKPFFFFFFFFFFL 


3180 


17081 


A 


3202 


355 


2 


FFFFFSETESCSVAQAGVQWHYliGSG*A 


3181 


17082 


A 


3203 


.156 


1 


LKSLLWEAKVGGLLEVRSSRPA*ATWRD 
FISTKN* KISQVWWYILVWTTWE 


3182 


17083 


A 


3204 


352 


175 


QPRGRPAP AHPP * CPLRIiAL PC * CPCPA 
CCPPWAEASPSGVQASPARAPACPARAIj 


3183 


17084 


A 


3205 


280 


373 


QRGTRIFSDLQT*KKKNKSPFKILLLID 
NALGHSRVLTERYKDI FRPANTTSFLRP 
MVQELI PTFQS 














DLCKKNYKTLLKEIIDDTEKYDML.MD*N 
NIVKMAI LPKAIYRFDTISTKLLMSFFM 
ELEKI F*NLY* KA*MAETTI SKKNKAGG 
ITLLDFKLYY 


3 1 85 • 


17086 




3207 


368 


12 


faqkkkkkkkkkkkkkkkkkkrappqn* 
RAPQKPLKPPPRVFIjIPPPLGSPPPPAF 
FHRGGG P PPGP FS KKKKKMRL.TGGGRL F 


3186 


17087 


A 


3208 


2 


356. 


KYLFSSIPEGKEKMKGIANLFNEIISEN 
CPSLARDLD I QKTRHANP YNLKKS SPQH 
IIVKLSKVKDKERILjKTARKECFITYKR 
TP I RPGMVAYACNP STLGG * GGW I MRSG 
DQDHPG 














INIKQLPRCGGTCLiWSQIjSRRLRRQDCY 
SLGGQGCSEL*SSQCSPAWATERDSVSK 
YKINNFL 


3188 


17089 


A 


3210 


295 


122 


VIKTV*NWNKDRHIDQCSRIESPKIHLH 
N 


3189 


17090 


A 


3211 


218 


466 


ATFDTTLPSCYQNAPNNRFDDLSD*EQE 
I DTMTVN I ILPLRSXjNI VI TWPYNI *HH 
QHDLDYTYPDTTGQLVNCAYFINLIiQP 


3190 . 


17091 


A 


3212 


340 


355 


LGLQG*REGNFGSLQTSPPGFKRFSGLS 
LLSSWDYRV 














WPGTMAHACNLSTLGG*GGCITRPGDRD 
HPG*HNNNN 


3192 . 


17093 


A 


3214 


198 


25 


YNEKEKPVKMLKI I ANSLGAVAHAYNPS 
TLGGQGGR I MKSGDRDHPG * HADAWVET 
VL 


3193 


17094 


A. 


3215 . 


376 


3 


PPGVF*RGPPGFRTLPPKSSSSSPPPKI 
LGGPPFLAFFGGAPPQKKPPSSSSSASS 
PPSSSSRAPLKKGGPFNPAGGFPLFGGP 
. PRGGPPFSSSSSSSSSSSSSSSSGRRSR 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SRTSRTRGRTRG 


3194 


17095 


A 


3216 


393 


260 


VSQDGLDLLTS * STRLGLPKCWDYRREP 
PRPACTLCISSTYNDP 


3195 


17096 


A 


3217 


3 


387 


HASASSGSRAPPFFFFFNKKTGS RELET 
TRGTVLKTCLFFKKKKKVLGGPKPPLGN 
PKGVGKPLKGFPGPNPWPPP*KGPGCHF 
*KNGF*DPGPKKSHIPSWVGGKGSSLLP 
GPLGGPSEVYFSPLRA 


3196 


17097 


A 


3218 


77 


406 


RMARPELGLPGNLLSNDHIYNGIVTAHA 
FVIIFLIVIPIINGGFGN*LIPLIIGAP 
DMAFPRINNISF*LLPPSLLLLLASAIV 
EAGAGTG * TVYP PLAGNYSHPGAS VK 


3197 


17098 


A 


3219 


394 


2 


KRRYFPDGLNFFWGPGILKIFVKKKVSS 
LKKKKKKNFPPVFLQGWGGNKNFKGGGL 
KFSKPNLI *FFFPQKEAR*KVFFFFFRS 
PQKDPLREFFFFFLRQVSLCHPDWSAVA 
RSQLTSAS XSRAQGAGRV 


3198 


17099 


A 


3220 


376 


157 


KFFFS P * KFFFS PKPLKFGGG VGPI I PP 
PKKRFFFKNPQGVFKKPPQKKKKIPFQP 
PVNFGPPRDFLKGPPPFFFFFFFF 


3199 


17100 


A 


3221 


394 


1 


VPPPQKFKTPGPPPPPREFFFF*KKKGF 
P PLGG FLN PAP KNPPPGP PKKVG FPGG P 
PPPPGGFFFSPPLSF*NPGERVFFGPKI 
PKRKFLLKWGKGGKFFPKKPIFPPPPKK 
KKKKKKRAAARDLELADAW 


3200 


17101 


A 


3222 


207 


402 


SILM*LCCLFPLPGVTPIDGAPHRSYRE 
CYPVLLDGVMVGWVDKDLAPGIADSLRH 
FKVMREKRI 


3201 


17102 


A 


3223 


309 


3 


YPFFHLIDLAIHPCVCFTKFYKATVTQT 
AWSWYQIRYIDQ*NGTEISEIPPHIYNH 
VICDKHDKNKQWGKDSLFNKWCWEN*LA 
ICRKLKliDPFPTPYAKI 


3202 


17103 


A 


3224 


3 


382 


LDRERPP FFFWGARHMDI PQLVNLS INK 
GHWANFNFLGYKKKGGWEKKKKKKKKKK 
RGGPP*KKPPGGPK*PPGGKR* IFPYMG 
GKKKPPGGFLEKPPPLGGAHLGNPPPQK 
YTPPGKKKNLKRET 


3203 


17104 


A 


3225 


133 


2 


FFFETESHSVTRLECSGTISAYCNLCLP 
GSSDSPASAS*AAGI 


3204 


17105 


A 


3226 


349 


2 


AGVPPGNPPLWGGEGGGSPRGGGLKPGF 
PQRGNPFFFKKSQPTPPGGGPPLIPPPW 
GGGAGGSPLPQGQRFQ*TKIGPFPSPRG 
KKKK P P F P KKKKKKKKRKEKS EM PG FMV 
LNA 


3205 


17106 


A 


3227 


3 


239 


LNKVGRGCSEPRSRHCTPAWATE*DSIS 
EKKKKKPESRGILKVKGLTQALFYlAIiW 
LYYLHPTAKQQIWGFFIYFSKP 


3206 


17107 


A 


3228 


205 


1 


IGLKIQNSCPLKDSLKKIKRQATDWRKY 
LQNTSDKASVFI I YKEHLQLSNYKAVDP 
IK* WAKEMNKLH 


3207 


17108 


A 


3229 


174 


1 


VQMLiEDKSFEETP* FSSETLVtiKIPHVQ 
PGAVAHTCNPSTLGGRGGRITRSGDRDH 
PG 


3208 


17109 


A 


3230 


2 


340 


KNHSVYLLCVF-SIPFPTFCFF* F*VFNT 
F*FKLN* IN*FTYIDRVLLCHPGWSAVA 
* S * LTAALNSWAQAVPCLSLLLAHHHAW 
LI I FKKTAYI IHIWYVWYDTTYPFKVYW 
S 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 

risLof 0 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G— Glvcine, H— Histidine, I=Isoleucine 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, K=Arginine, S=Serine, 
T— Threonine V=Valine 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \— possible 
nucleotide insertion 


3209 


17110 


A 


3231 


1 


313 


KKSTPYQRGFDPISPARVPFSIKFFLEA 
ITFLLFDLEDALLLPLP*ALRFIFLFTV 
GGLTGWLANSSLDIVLHDTYYVAPHFH 
YVLSIGAVLAIIGGFIH*FP 


3210 


17111 


A 


3232 


207 


355 


ELS PFRLKKTLY * LGMVAHACNPSTLGA 
RGGQ I A* AQE FKTS LGNMAKP 


3211 


17112 


A 


3233 


3 


355 


TTOQ * LI KLTCKQTI AI HNTKGRT * AL I 
LISLI I FI ATTNLLGLLPYSLTPTTQLC 
INLAMAI PL*ADAEVIGFRSKI KNALAH 
LLPQGTPTPLIPILVTMETINLLIEPIA 
LARRXi 


3212 


17113 


A 


3234 






KYLINNRLITNQQ*LIKLTSKQMITIHN 
TKGRT*SLILISLIIFIATTNHLGLLPY 
SFTPTTQLSINLAMDIPL*SGAMVIGFR 
S KI KNALAHFLPQGTPTPL I P I LAI I ET 
ISLLI 














QGCVKGWVLEEQVRRGWILDSSEGKTDL 
KQRGSPGSWPEHVGGWSGVMG* SEAWTG 
QARWLTP I ITALWEAEVGGSLRPGVQWH 
NLGSLQPLPPRFKRFSYFSLPSSWD 


3214 


17115 


A 


3236 


15 


356 


LIQPSLKLMIS IHNTKGRT*SVILLSLI 
IFIATTNLLGLRPYLFTPTTQLSINLSM 
VIHL * AGAMVI G FRSN I KNALAHLLPQG 
TPTPLMPILVIIETIRPLILPIALAVRL 
TA 


3215 


17116 


A 


3237 


1 


376 


GTRTNTLTIYQ* WGDGTRESTYQGHHTP 
PVQKGLRYGI ILFITSEVFFFAGFF* AF 
YHSSLAPTPQL*GHWPPTGITPLNPLEV 
PLLNTCVLLASGVSIT*AHHSLIENNRD 
QIIQALLITIVLG 


3216 


17117 


A 


3238 


1 


358 


GTRNG* YTNA* ALS ITVIGAS LTSAQGS 
RI ILLTLTGQPRFPTLTNINENNPTLLN 
PIKRLAAGSRFAGFLITNNISPGCPFQT 
TIPLYLKITDLGVTFLGLLTGLDLNYLT 
NKLIIKA 


3217 


17118 


A 


3239 


258 


390 


RQGLLMLAGLVLNSWPLQSSHLGFPKCW 
DYGREPPCLGN*LIL 


3218 


17119 


A 








ARARFHHVSRDGLDLLT* *NTHLRLPKC 
WDYRREPLRPGKTFFLKKKKKNSIFLFF 
REG FKE KS I LG I KFFRPTGG VL I LTGNH 
GWGCKTGTELLVP S RFPGLAFK I CGLWA 
HDTPHRVRNWL 


3219 


17120 


A 


3241 


3 


283 


HERLWGGWKTGAAGLGRTSSRPTASLTQ 
T*TTMTH*SRTTGCSTGSGRTWTRSRHS 

WLCWAWKVP 


3220 


17121 


A 


3242 






I KSQAGLVGFLGP FS FQDSLNLVFVFGF 
FCF*DRVSFCSPGWSAWQSEFTAALV 


3221 


17122 


A 


3243 


2 


385 


ARADVDTPSYLTSDTIMRDIPTGGQVFS 
* LSTLHGSNMK* TAAPLLTLGF I FLFTA 
GGLTGLELTNSWLDI VLHDTY YVGPDFH 
YVLSIGAVFAIIGGLLH* YPLFSGYTLY 
RTYAQIHFAIIFIGEN 


3222 


17123 


A 


3244 


1 


106 


GTRYVGQAHLKCLTSSDSPASTSQSAGI 
TGVSHSA* PASTSQSAGITGVSHSA, 


3223 


17124 


A 


3245 


2 


360 


ARANTLTIYQ*WSDVSLENTYQGHHTPP 
VQKGLRYRIILFITSEDYFFSGFL*AF* 
HSSLSPTPQL*GHWPPTGITELNPLKMP 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LLNTS VLLASG VS IT* AHH S L I ENKRNQ 
IIQALIiI ■ 


3224 


17125 


A 


3246 


17 


160 


GG*GCSCSEL*SCHCSPAWVTEQDFDSK 
KKPAILiASCLKHLNPLSSH 


3225 


17126 


A 


3247 


236 


2 


WAHI YMTPLSPPSFLKPVQ*KNFYI YST 
YSLDN*NPSSPKAKRAPKKSYTLPYIiHLi 
CVCVCECV*VCVCVCVCVCVWV 


3226 


17127 


A 


3248 


2 


2220 


FFGGGRPSPPQGYFLLNNHSSPSPPVKL. 
NPGPA*FYPPTKGKNFPPPQR*PSPPKN 
IKTPPPSFFFSS 


3227 


17128 


A 


3249 


2 


372 


AYTISFIGKISFYGLH**LPKAHVEAPI 
VGS I VLAAELLKLGG FG I IHLTL.I FNPL 
TKH I AYPFLGLSL * G I S I TS S M WLRQTD 
LKSLIAYSSISHIALWTAIL.IQTP + SF 
TGAFFLIIDHG 


3228 


17129 


A 


3250 


293 


56 


EGSPKVIFNKSPPHHLFLFLFFFFIFFF 
*FFFFFFFFFFFFFFFFFFFFFFFFLYL 
liAMFYLS FFFKQDNQRYRQYS I K 


3229 


17130 


A 


3251 


169 


370 


LKMTELRGAPASKPRGQEPPPHYPCHHH 
HHHHHHFIj*VTKGQGPHHWPSPTRDPGW 
L*SPS*EDQRR 


3230 


17131 


A 


3252 


22 


156 


GERIGLGLGGQGCSEP*LCHCTLAWVGD 
TVRPCLKKKKKKKGPPF 


3231 


17132 


A 


3253 


3 


400 


QNQTPLLD*GGLITAVLLLLYIiAVLTGG 
I T I LLADRSLDS Tli F YPAGGGD P I L Y QH 
LF*FFGHPEVYILILiPGFGIVSHIETNY 
WGGKEPFGFVGMV*AMIAMGFLGFIG+A 
HHI FTVGVDVHTRA* FTSAT 


3232 


17133 


A 


3254 


373 


31 


REVGPPTP* KI FFFP KGLNFWGGGGP KF 
PPPKKKGFFKKS PVGVFPPPGG * KSGPG 
PGFKKPPQKGKNISFPAGGKIGPPRGTL 
KRAPPFFFFFFFFFFFFLLWWVQVERP 
TL 


3233 


17134 


A 


3255 


1 


379 


LNL I QRQ * R* V * KF L» * LP PQT * KKKKKK 
KKKKKGGGAFKKNLGGAKFNGGRKKKI F 
FLKGGVKKKKAGGGFKKRGKGKKCYLGI 
FEKKPFFGGGKNWENPPKKIKGLREKKK 
F*GEKGEKKPEKAG 


3234 


17135 


A 


3256 


42 


376 


FCYISLVHHCIYNDLSFERKKNIFVPGQ 
INSISSIA*EAHCKNKSLLHAVKKKKKK 
KKKKKKKKKKKKKKKKEKKRGEKKKKKK 
KEGRSSLKKEK 


3235 


17136 


A 


3257 


353 


67 


CYPLSPIiKFFFS PRSLKFWKGVGP I IS P 
PKKKVPSQNSQEAGFPSPNVLKRRPGPN 
FKTTP*KEKKNPFPPPVKFGPPKESLKR 
PPLFFFFCGLQ 


3236 


17137 


A 


3258 


2 


129 


FHR I SQDGLDLLS S * S ARLGL PKCWDYR 
CEPPRPAKNKILLS 


3237 


17138 


A 


3259 


205 


415 


QQKNRRliIiHFKGARTHNSYNRGQPTTPS 
I TAHMYPRLRQSHT I YVLRVHHP * VPSA 
I EGP VSV * ALLHS S TI WAGNLLL I RFH 
PLAEKSPLIQTVTLCLGANTTLVAGGWA 
LTQNDS KRI AAFSTSRGLGL 1 1 VTIGVN 
QPHLALLHICTHAFVKAILFMCSGS I IH 
NLSKEQDIRKIGGLLITIP 


3238 


17139 


A 


3260 


3 


393 


S FNLSTL I TTQEHL * LLLPS * P1AI I * F 
ISTLAETNRTPFDLAEGEAELVSGFNIE 
YAAGPFALFLIAEYTNI I I INTLTTTI L 
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peptide 
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Predict- 
ed end 
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ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I— Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N— Asparagine, P— Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possiblc 
nucleotide deletion, \=possible 
nucleotide insertion 














LGTTYDALSPELYTTYFVTKTLLLTSLL 
L*IRTAYPRFRYDQLIHL 


3239 


17140 


A 


3261 


2 


400 


ISDLSEK* FKRLWKLIMEAPEKGKAQC 
KEIQKMTQEVKGE I FKE* IA* RKKKSKF 
QETLDTLIEMQSALESFSNRIKQVEERN 
SELKDKIFELTQSNKDKGKRIRKYEQSL 
* VAWD YI Q * PNLG I I G I PEEE 


3240 


17141 


A 


3262 


450 


129 


NNLAFN*V*EFCVLAIKLLKVKRYPIEW 
EKMFANHISDKGLVSGIYKELFRLSNKQ 
AIDLTF * KWAAGHGGS PL 


3241 


17142 


A 


3263 


238 


3 


KEKKIGLKKCLQGSHFS IHTAWS IIIYM 
FSPLTI ISRKRMGQPGIVAHTCNPST*G 
G * GRW I TRSGVRDQTGQHGKTP 


3242 


17143 


A 


3264 


350 


3 


SPTLLGSKDPNLLGFRFPLWKKGKIIRA 
PLSLGLN*RFFSEWLIP*KPPKNWPGG 
TFLWCFLKRGFPFLSQKKKKKNPGAVA 
HACNPSTLGGRGGRSQGQELETSLANTV 


3243 


17144 


A 


3265 


69 


200 


RLiECSGVISAHCNLiNLiPGSGDSPASAS* 
IiAG I TVMVKLP V I AK 


3244 


17145 


A 


3266 


223 


408 


GGFPFFPPGGGEGGNFGELEPLPPGLRK 
FFCLGPPRRGD*GPRSSSPGSFWVFKKN 
GVSP 


3245 


17146 


A 


3267 


185 


420 


DQGLWGFIIYFYRQSLA*VHWNNPSSL* 
PRTPGLKHTPVPSLLISWDYGRTPPHLT 
NFCIFFDRGSFFF*DRVSATHA 


3246 


17147 


A 


3268 


3 


392 


TGCHS I PQAGVQWHNHGLLQPQPPGLR* 
S S CLS LPS S WHY 


3247 


17148 


A 


3269 


1 


398 


KFSCISSKHQKLKLTPKPPKPPPKKSPL 
VLPIGKKIRETFWGAFKSPPPNQPKGAQ 
TliPLKIWDKMGGGGGLALVV* KAPPGNF 
KGPPGGKPMEQP*LGPGPPIjKWKGGLPH 
QKGGFSKAPGEKKKGGEGRL 


3248 


17149 


A 


3270 


422 


183 


ETEWSLFKVIITEKSPNLEKDTNIQVQ 
ESYRTPSRFILMKTTSRHLIIILPKVNN 


3249 


17150 


A 


3271 


3 


35 


KNNSLIIPTIIATITLIiNLYFYLSPLbY 
**SSSPPS 


3250 


17151 


A 


3272 


155 


1 


KDFFFFFLRQSFTLVAQAAAEWCDLCSL 
QPKPPGFK* FSYNSLSSSWDYIG 


3251 


17152 


A 


3273 


413 


71 


PPSTGLFi.TEEYEVSFPPFPL*KFFFPP 
SGLFFGGGVPPFFPPPKKGFFPKYPRLV 
FKGPLLGGGGLPP PPP * I LP PLGS FLPA 
PPLFFFFLFFL 


3252 


17153 


A 


3274 


336 


45 


DRVLS CS PAWRAMARS HDFG * LQL P P PR 
VKVESCLSNPSSWDPRHVPPRKGNFVFL 
VKTGN P WNLG GQG CR E RRLCP C I PAWGT 
DKDSVSKKKKKS 


3253 


17154 


A 


3275 


206 


366 


SVFFFVLFGFFYGALLCFPNPWLECSGT 
I LVHCNLHFPGSKDSSAS PS * VAGT 


3254 


17155 


A 


3276 


334 


75 


ENTRRVERRRRTHI IYYNFF*EFMNRFF 
FFIFFDGRFSCVFFFFFLFYFFFFFFFF 
FFFFFFFFFFFFFWFSARSFIYFLFPRH 
VT 


3255 


17156 


A 


3277 


144 


17 


KAPPLFFFFFFFFFFFFFFFFW*FRG*T 
HWNGDACMCNLTKS 


3256 


17157 


A 


3278 


2 


109 


YHIVKPSP* PLTGALSALLMTSGLAM* F 
HFHSITL 
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NO: of 
nucleotide 


SEQ ID 
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sequence 
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Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, 1— Isolcucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, ' —Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


3257 


17158 


A 


3279 


404 


65 


NL VSS PKRSVFFIS PSQKI FFFFFNWKB 
SKISLPKEGGGPLLLLIRQRTPIFFFFF 
FYFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFL*TACLF+TFKFTKTK 
R 


3258 


17159 


A 


3280 


226 


383 


GI*KNSMDGCCLGPFWFETGSHSHPGW 
SIVWRNLGSLQLLPPKFKRFSCLS 






A 




91 


26 


LFFFFFFFFFFFFFFFWSERSS 


3260 


I7I61 


A 


3282 


3 


321 


HTVIYYFGKNEPFGYIGVA*AMISVGSL 
GFMA* AHH I FTGGI DVYS PAYLTCAT 1 1 
VAIPTGVKVFI * LATLLGSGMKRSAAVL 
* TLGFI FLFTARGL TGI VLSNS 


3261 


17162 


A 


3283 


393 


112 


PPLRVFFPPFPLKNFFFSSRRFFFWGGV 
APFFPPPKKGFFSKI PPGFFFPPPLKKK 
IFFSSPP*FWAPPGFFLKGPPPFFFFFF 
FFFFFFFFF 


3262 


17163 


A 


3284 


360 


47 


PPLHRWVPPPPP*KFFFSPQTKFFGKVF 
SQNPPPPKKSFFPKNPPGVFYSPPKKKK 
KFFLPPPIFFPPPRFFFTTPPPFFFFFF 
FFFFFFFFFFLPiFFWSVSF 


3263 


17164 


A 


3285 


386 


157 


FFSPNPLI FWGGGGPKFPPPKKRFFSKN 
PPGVFFSPPLKKKKFFFPPP*NLAPPRD 
FLKGPPPFFFFFFFFFFFFF 


3264 


17165 


A 


3286 


3 


385 


DAWVPCKLYLGVFFCRFVKISAILiILKI 
LI FNI ENKS I * YLLKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKPTPKKKKK 
KDSR 


3265 


17166 


A 


3288 


81 


270 


1 1 KLEKKKKKKKKKKKKKKKKKS * KGNE 
KTRGPKKKVHRDVTKKILCYKI CPFNKG 
ISLIGT 


3266 


17167 


A 


3289 


3 


145 


LGRLRQENHLNLGG * GCRE PKSHHR I LA 
WATEQDS ISKKKRKIGGPV 






A 




404 


47 


GPGGKI FFKKNPEEKI FSTQKKKGFFPP 
FPPKNFFFPPGGFFFWGGGGPIFPPPKK 
GFFPKIPRGVFFTPPKKKKIYFFPPREI 
WAPPGIF*KGPPLFFFFFFFFFFFFFLV 
LNDILLA 


3268 


17169 


A 


3291 


106 


362 


KKKKKKKKKKKKKKKKKKKKKKKRMKNK 
KKKKKGRGGGKKKKPRGAQIIRGKKKII 
FFFKKVFK*KGGGDL+KKKSFCGENNLE 
NTHKE 


3269 


17170 


A 


3292 


3 


226 


NPL.VNLNLGLIiFILATSSLAVYSIL*SG 
GASNSNYALIGALRAVAQKKKKKKKKKK 
KKKKKKKKKKKKKGGGVL 


3270 


17171 


A 


3293 


337 


2 


FFNRELLKSKNKETEKNHDYKKERIWHF 
DRPYKFYMLFFCFGVLIFFLYLPFFIYF 
FFFFFFFFFFFFFFFFFFFFLNSIiNI*F 
FSINTHKCGNRGQGGGAIGPRLSNGRV 


3271 


17172 


A 


3294 


327 


21 


KFFFSPPNKYFFFVFPHKFPTPTKMFL* 
KIPPPP1.ISPPPKENHQCPPPPHIFAPP 
SHPFFGPPHFFFFFFFLFFFLWRTGSRY 
IAQAGLELLGSSYPPAS 


3272 


17173 


A 


3295 


367 


101 


FFFSFF*KKKNFFPPPPLWPPPPNFL*N 
PPPPFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFLVNSLCRRYRG*SLLY 
YAWL 


3273 


17174 


A 


3296 


178 


2 


KGPPFFFFFFFFFFPDSITLFIITKNEK 
KTRCTS IGE * I TWYTHTGEYYS VMKRNE 
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sequence 


Predict- 
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nucle- 
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correspon 
ding to 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K-Ly ^ine 6 L-LeucVne' "iVl-Methion'ne' 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T = Threonine V—Valine 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-possib)e 
nucleotide insertion 
















3274 


17175 


A 


3297 


147 


1 


RGPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFTW*FALTFLV*NF*AI 
ITF 


3275 


17176 


A 


3298 


3 


347 


PFGEGETNGRFDLDTKKALVSKKKKKKK 
KKKKKKKKKKKKJCKKKKKKRGGALKKKN 
RGGQKKTGEKKKKFFFKKGEKKKTPGNF 
♦KKKKFLGGDNLAQTPPKKKNLWGKKKN 
FWG 


3276 


17177 


A 


3299 


405 


2 


WRPKPENSPSPGGPNPPGQKN*TPFPPK 
TPKLGRGCGGAPQFPLPVRARGENPPNP 
RFLGFN*PKFPFPPFAL*GGGNNETFFQ 
KKKKGPRFWAG VL PTGGAEGE SIPHLSP 
ASGSSRHPWCFMLCRPSRliCLL 


3277 


17178 


A 


3300 


125 


376 


DQPVQNSETSSLQKKKKKKKL.FPGGGGP 
PLFPHFLGGWGGGGTRPPG*RKRGKSGV 
PQPPPPGGRVPPPPHPPKKWGNKGGPPP 
PGKSFFFFFFFWRDEVSLFCTGWSQTQT 
PGLKQSIHLSLPRWLDYGREPPHLAIHF 
FPTRPRTRG 


3278 


17179 


A 


3301 


348 




PQRliKILGGGGPPNSPPPKKGFFPKTPR 

RAPPLFFFFFFGPWFFFFFFFFLFLENG 
VSLYCPGRSRTPGLKLSSRLLASLRAGI 
TGVS 


3279 












yPKJCKKKKKKKKKKKKKKKKKECKKKKKK 
KKKKKKKRKRKKKKKKT * GGGAY I KKKG 
GGS PRTEEENQ YCFLLGGKKNI LRGS CL 
KKTPH 


3280 












KKNFGGGKNKGGEKKKIFFFKGGGKKNP 
GGNFKKKNFFGGGKIGENPPKKK*RFRK 
KKKFLRGKGGKKS 


3281 


17182 


A 


3304 


371 


3 


NQKKPPPPVFFFFFFKKKKNPPPATPPP 
GRP * FFPPQKKFFPEGGP PP FFKKGAFF 
PPPKKNPFEPFFPGGVFFF*KKKNIKVF 

PARDLEPHAS 


3282 


17183 


A 


3305 


2 


196 


DR * L.FS TNH KD I GTLYLLFGA* AGVLGT 
ALSLLI RAELGQPGNLLGNDH IYNVIVT 
AHAFVI IF 














L» F S TNH iGD I G T L YL L FG A * AG VXjS TAL S 
IjIj I RAELGQPGNLVGNDH I YNVI ATAHA 

DRAFPR INNI S F * L.LPRS ILLVRAS AIA 


3284 


17185 


A 


3307 


2 


367 


KPPAITQYQTPLFV*SGLITGGLLLLFL 
PDLAAGITILLTDRNLNTTLFDPAGGGD 
PILYQHLF*CFGHPEGYILILPGFGIIS 
HIGTYYSGKKEPLGYIGMV*AMRSIGFL 
GFIE*AHHIL 


3285 


17186 


A 


3308 


243 


370 


KGHRPGMVTPVCNPSTLGGQGGWIRRSG 
I *DQPGQQGVTPSL. 


3286 


17187 


A- 


3309 


2 


363 


LVPLIIGAPDMVFPRINNIGF*LLPPCL 
IjLLIAYAIMEAGAGTG*TAYPPLiAGNYS 
HPG AS VDLTI FS LHLAGVS S ILGAINFM 
TTIINI* PPAI TQYQTPLFV* SVLITAG 
LiLLLiCLiPG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


j SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Aianine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3287 


17188 


A 


3310 


1 


364 


LYITGPTLiALTIALLL*TPLPI PNPLVN 
LNLGLl.FILATSSLAGySIL*SGGASNS 
NYALIGALRAVPQTISYEVTLAI ILLST 
IilRGSFNLSTLI TTQEHL*LLLPS * PL 
AII*FISTL 


3288 


17189 


A 


3311 


215 




GMAKLKARFLPKKGGKRSLSPLLWGSGF 
PTSGTGETFFFFF*DRVLLCQPGWSAVA 
QS * LTTASTFWAQAI 


3289 


17190 


A 


3312 


153 


3 


MNKCI CMCVCMCVCFRSHKHTVCAWVCV 
y* V* TYTVCVCVCVCVCVCVC 


3290 


17191 


A 


3313 


149 


1 


KTPPGFFPGFKGPF*FKGPPLFFFFFFF 
ETEDGVLL.CRPGWSAVSRDRA 


3291 


17192 


A 


3314 


282 


69 


KIGKTAPFFSWPRRVPFLKKKKKKK*GL 
TLSSRLEYGGMISLELLGSSDTLASASR 
VARI IGQCPHAWLT 


3292 


17193 


A 


3315 


479 


60 


MMQHS I KHALVE ITGWVRWLRP VI PALW 
EAEVGEFNSISKTNNRKRSNIVKMSILF 
NL I YR I NAP FFKFL.I ALLNKNS KSKSIW 
NLKS* *STQ*SSRNNVGGITISDF*THQ 
QATELiRQEFLHTGIiVHRSTVSRFRFHA 


3293 


17194 


A 


3316 


129 


3 


SFHQKVWPDAVAHACSPSTLGG * GGRIT 
GSGDQDHPGQHGE 


3294 


17195 


A 


3317 


3 


156 


EFHRVSQDGLDRbTS * STLLGLPKCWD Y 
RCEPPRPASFPSFIiTVRISPFL 


3295 


17196 


A 


3318 


36 


340 


TGLVIAEHLLFFFFKKGFWFCPPGGRAG 
PQGNliMEFPVSGGKGIFCPGPPKSGE*R 
APPPSRGKFWFFKKKGGLPFWPGGV*TS 
DPRGTPPPLGPKGGEYR 


3296 


17197 


A 


3319 


326 


2 


KAFFLNFKPHPGFA*GLKFKPNSLLPI * 
SFIRKT*AFLICL* IQLNRADGKIPLKP 
GVI P FKKCFPNFTP P LFFKKNCS S TPNL 
FFFFPFFFFGDGISLCRPGWSAVA 


3297 


17198 


A 


3320 


175 


312 


QAGVQWCDLSSPXPPPSGLT* FY*CSLP 
RTWEYRCPPSRPCNFSIF 


3298 


17199 


A 


3321 


363 


51 


RWGPRLPVAWLPSVSRRAEECS PGRGAH 
GQGGGGGGQAR + REG PDEWTGGAP VGMG 
SPSLNRGAGRGPPQPGPSSKSRRGRVRC 
TPHSCL.IGCGLSFFICKMG 


3299 


17200 


A 


3322 


344 


187 


LiRQETCFNPGGKNFG* QKL P P C P PAWAT 
RG WVS KKKKKR KKRKE KIRI CVLT 


3300 


17201 


A 


3323 


3 


313 


TRRERERERERERERERERERERERERE 
REREFFFFFF*KKNLSPPPGGKKRGGGG 
FHSPPFWGEKDFLPPRPPKGGERGGRPP 
PPVFFFFFKKEGAPPGGGV 


3301 


17202 


A 


3324 


365 


1 


SLGWWRVTPPPFPLGPRGGSPPRPDKRG 
SPGRVKFGGVKPFPKVKP*PPPKRLDPS 
GA*FFFSPKKICQGPPPRKKKKKEAELS 
ILCTKFPLQEEEVMQVPPPPCTCSSEPG 
INCICKRHC 


3302 


17203 


A 


3325 


1 


269 


ARALSL.SLSLSLSLSLSLEFFFFFLAPT 
QILGGGliFYPPKGGAHLSNRLGGGRVFF 
GPPGKKTAPLGGGRVI KKKTPPGTPL* L 
SPPPGGPFFFQGGQKKLSPPPAGLTGGP 
P PWGGKKAP PPKFGSGPKKKKKTLERER 
ERERERERERELV 


3303 


17204 


A 


3326 


1 


316 


ARGERERERERERERERERERERERFFV 
SPPGPDTLQDRGGVSLFASDPTHEIYLP 
CPLRGPLSERDPEELHGVCFFSEPSHRW | 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

location 
correspon 
ding to 
Inst amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidinc, I=Isolcucine, 
K— Lysine, L=Leucine, IVI— Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, **~Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FPEEGVGD ICQC* HRGECGS P 














ARGDVTRESTYQGHHTPPVQKGLRYGI I 
LFITSEVFFFAGIF*AFYHSSLSPTPQL 
RGHWPPTGITPLNTLEVPLLNTSVLIiAS 
GV* IT* AHHSLIENNRNQI IRALLITI I 
LGLYFTLLQA 














Cj'l'KKKHSTlLIIREIDIKFMPRFHISPI 
RLVKI * ALANI RC * QQCGKVGNLLHC * F 
S KLVQ STGRAI WHALVLS LARS RALSLS 
LH 


3306 


17207 


A 


3329 


2 


352 


ARGTLLLLCLLFFALTIYPRHCDVTRKS 
T YQGHHT P F VQ KE P IjVIjGGKIj F I TS * V I 
FFAGFF*AFYHSSL>TPTPLLGGHWPPTS 
ISLFNPLKKPLLNTSGLLASGVS IT* AH 
HSLLQ 


3307 


17208 


A 


3330 


3 


36] 


HEEPLGY I RMA* AM I S IGFLGLI VRAHH 

VFR *LATLHGSNMK* SGAGL * ALGFMFL 
FTVGGLTGIVLTNS *LDIGLHDTYYWA 
HFHYVLS 


3308 


17209 


A 


3331 


1 


340 


GTSGDTRAYFTSAT 1 1 1 AI PTG VQVFS * 
IiATLHGSNMK*SAAEL*ALRFIFLFTVS 
GLTG I VLANSSLDI ELHDTYYWAHFHY 
VLS IGAEFAI IRGS I H* FPLL* GYTLDQ 
T 














jm\ju vjj i foil t 1 otj ill ifti F Xij V rojr b 
* LATLHGSNMK* SAAVL* ALGF I FLFTV 
RGLTG I AL.ANSSLD I VLHDTYYGGAHFH 
YVLSIGAVFAIIGGFIH*FPLFSGYTLD 
QTYAK 














QVSHRVRPCFERERERDRYRHTQRECER 
AKEIYSGFFLSASEMESCSLAQAGGCKE 
LRMCHCTPAWVTQGDCVS *NKK* HWEKK 
GLWFLGVNI S T 


3311 


17212 


A 


3334 


362 


3 


QEVKVSMNRDHTHSLQRGEQKRNSVSKK 

AKNGKQFKFPSTCEWINNM*YIHTMEYY 
SATKNQLIHVTT * MNLNNI YTKFLKARQ 
KMITYSC . 


3312 












KTFHPYYTIKDALGLLLFLLSLMTLTLL 
* PDLLGDPDNYTLAHPLNTPAH I KPE * Y 
FLFAYT I LRSGPNKLGGG PALLLS I L I L 
AIIP 


3313 


17214 


A 


3336 


17 


350 


FIFTLTETNRTPFDLAEGESELFSGLKI 
KYAAGPLALLFITEYTNI IIIDTLSTTI 
VLGTTYDALS PELYATYFVTKTLLVTSL 
FL* IRTAYPRLRYEQLIHLIi+KNFLPL 


3314 


17215 


A 


3337 


2 


349 


ARDSL I S PI LRLHPR PNLRQNAFHYHIH 
RGKSNFLPTTLiSRPIRNAPTLLGIjPRCI 
HHMKHPIICRLIH*FPL,FSGYTIjDQTYA 
KMHFTI 1 FI GVNLTFFPQHFLGL-SGMPR 
RYSDYPDAYTT*NILSYVGSFISLTAAI 
LIIFMM*EAFASKRKVLIVEEPSINLE* 
LYGCPPPYHTFE 


3315 


17216 


A 


3338 


2 


386 


ARAILNAMAFLILTERKILGYIQliHRGP 
NWGPYGLLQPFADAI KLFTKEPLKPAT 
STITLYISGPTLAIiNIGLLIi*TPLPIPN 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proltne, 
Q^GIutnmine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PLRNLNLGLLFILATSSLADYSIL*SG* 
ASNSNYALIGALRAW 


3316 


17217 


A 


3339 


175 


16 


ILGDLFPPAWLHPFLFLLLLFSRPSLAV 
TEAAVQWRNLGLLQPLPPGY * SS WC 


3317 


17218 


A 


3340 


1 


369 


GTSNLSLLFIVATCGLiAAN*ML«**GGKI 
N * I YTLRVALAA I I HTILYE I TLS I ILL 
* TLL I SGS FNLSTLI TTQEHL * LLLPS * 
PLAI I * FI STLAETNRTPFDIiAEGESEL 
F*GLNIEYSAR 


3318 


17219 


A 


3341 


1 


363 


gtrggtilpamvlmlidvpslrilyitn 
kvnepsltiksnghqwy*tyqytdyggl 
ilnsyilpplflqpgdlrildvdnrvvl 
pie tp i r i int * qdglhs * ag ptlglkt 

DAIAGRVNQ 


3319 


17220 


A 


3342 


181 


315 


PKSPPTPPPPTXXXXXPGGNPPGPTL*G 
PFYWGGERGPPF*TGGP 


3320 


17221 


A 


3343 


3 


183 


HEVSQSCRELLTSGDPPASASQSAGISG 
MHHRAWPVS *TFHI FNI YCGPGTVLKP Y 
LNL 


3321 


17222 


A 


3344 


83 


354 


VLGARCPCAGVSRVSMEPWFLWQQACLL 
PWWRCCLSTLYANRADRDVP* TSRTGPV 
MVAHACSPSTLGGQGGWVTRSGV* DQPG 
QDGETP 


3322 


17223 


A 


3345 


124 


379 


GQG I CFMWSLGLSHRLAHCRYTVGVNN 
LLCSSSLPFLLPLLPLPPPEGPGWDDIN 
1 1 1 F 1 1 FRDKVLLCCLGWSAVT * SQLTT 
A 


3323 


17224 


A 


3346 


3 


368 


YEPI P*PLTGALSALLMTCGLAM* IHLH 
SITLIiILGLL.SNTLTIYQ*WRDVTREST 
YHGHH T P P VQKGLRNG I I LL I TS EAFF F 
AGVF*AIYHSSLAPTPQLGGHWPPTGMT 
PLNPVQGPLL 


3324 


17225 


A 


3347 


1 


352 


GTSAGDVNYG-* I IRYIiHADGAKI FLRCR 
FLHSGRGLYYGSYVYSKT*NIAINLLIA 
T I ATAF I G YVLA+ GQ I LF *GAT VSTNLL 
SDIPYIGTDLSQRI +GGYSVDRPTLTRF 
FTFHF 


3325 


17226 


A 


3348 


2 


360 


ARAE I S PLHSNLGNESETPSQKKKKKKP 
PGGGGKNWNPYPSGTPPPNPPKGGE*RE 
EPPPLTKRVF*KKKVFFEGPRVKKNLGG 
KGKKKRGFPAR* KNPVFKAKGKGRNFGP 
QAFFFKK 


3326 


17227 


A 


3349 


226 


366 


PQSFCSVASWPAGDLMEI*AWQKFEYKP 
GKVPMDPEGCLLPLAKKKEEEEEEEEEE 
EEEEEEEEEEEEEEEEEEEEEGEEEELV 


3327 


17228 


A 


3350 


2 


374 


ARGGGYSPHRATLTRTVTFHSILPFIIA 
GLAAIHLLFLiHETGAYTPL* I TSHSEKT 
TFHPYYTMKDALGVLLFPLSLTTLTLCS 
PNLLGEPIIDYTLTNPLNTRPHIKPE*DC 
LFAYTNMRSGPN 


3328 


17229 


A 


3351 


1 


266 


LGLVEHFLTLTTKLTSSNISDAQE I QSV 
*NILTAI ILNLMVLPSLRMLYITNEVFD 
PSLTI KS IGHQWY* AYEYTDFGGL I FNC 
YILP 


3329 


17230 


A 


3352 


207 


362 


ILFLLKQKIYVLFGFSN+GSSIPELAHS 
DAYQTKE I CSSGKVYKYLQCWEKW 


3330 


17231 


A 


3353 


178 


365 


NHG I D KGLR YR I YTVQQEDN SVKR * L KD 
FNQHLS KDNIRSADKHMKRCPTSLVINI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


1 SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 
nil de- 
tention 
corrcspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine,D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Eysine, L=Leucine, lVl=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, w== Stop codon,/— possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MPIKPP 


3331 


17232 


A 


3354 


359 


3 


LQVRS LMPLGQFHPF*NFFCSFFPKPKI 
PFFFLFFLKGVFVFFPGWRARGQFWVFG 
PPPPKFKKFFFFTLLRNWGFRDFPPSRG 
KFFFFFFFWYF* *RLRFTMLTRLVFNS* 
PQVISC 


3332 


17233 


A 


3355 


2 


354 


AREPS P* PLTGALSALLMTCGLSM * IHL 
HS ITLLILGLLTNTLTI YQ * WRDVTPES 
T FQGHHTP P VQKG LG YG I I LF I TSEVFF 
FAGFL*AFYHSSLAPTPHLGGDWPPTGV 
TPLNAL 


3333 


17234 


A 


3356 


1 


364 


GTREM I CCSALS PRIHLS FHRRWPDWHC 
ISKLITRHRTTRHVLRWSSLPLCPINRS 
WICHHRRLHSLISPILRLHPKPNLRQNP 
FHYHIHGRK*SAAVTj*ALGFIFLFTVGG 
LTGIVLANSSLDIGLHDTYYVGAKFHYV 
LS IGAGFAI IGGFIH* FPLFSGYTLNQT 
YAKIHFTIIFMGVNLTFFPQHFLGLSGM 
PRRYSDYPDAYTT 


3334 


17235 


A 


3357 




376 


ARGGQPEDYILILPSFGI IFHIVTYYSG " 
KKEPFGYICMV*AMISIGFLGFIA*AHP 
IFSVGIDVNTRAYFTSATI IIAIPTGVK 
VFS WLATLHGSNMK* SAAAL* ALRF I FL 
FTVSGLTGIVLCN 


3335 


17236 


A 


3358 


357 


139 


AQPWFFFLETVFCHAAQAGLKSIGSSDL 
PTSASQSVGTTGMSHHAWPERSC*QNTF 
TYYVQSIYSDDIRKSMF 


3336 


17237 


A 


3359 


. 277 


1 


SCQKDISIGFLFVPGFRVKGVAQEHLVG 
VRMYVLVPTVGGDGKERER*RGIRHTPP 
PSERVRQRERERGRDRDREKERASQRKP 
EMDRETSC 


3337 


17238 


A 


3360 


261 


2 


TICSKCAWLCFSKTLLTNTCASRSDLAH 
GP * CPNLiCSG I FS SAWTD ISRLLLFFCL 
FVLRQSLTVAQAGVQWCNLCSPQPLPPG 
SC 


3338 


17239 


A 


3361 


199 


354 


HLPYWPMFMATFNFFFFFETEFHFFPQA 
GGQWGDFG*LKPPPARLKQFSCLN . 


3339 


17240 


A 


3362 


3 


364 


HE DRDT P A YFTCAN 1 1 1 A I P TG VKVFN * 

GLTGIVLSYSSLDI VLHDTDYVVAHFHY 
VLSIRAVFAI IGGFIH* FPLFSGCTLNQ 
TYAEIHFT 














HEETYI PKDQKYSFLHDSQTSFCFSNSI 
PTPSNMEETQQKS VSGCRLP * AGMGETC 
GQGSRAAQPLPSLPLQNLKLLRISLLLI 

HLLKDLE 


3341 


17242 


A 


3364 


352 


27 


VYLS S QRNSAASVNRMT WTDRS LS P YT 
LGWHRPDRLFHFVFSLFVYTFVBRGSQS 
IAQAGVQWCDHSSL* P*TPGLK* *TCSC 
LP*VAGTTGTHHYTGPLILLNLFN 


3342 


17243 


A 


3365 


347 


99 


GGPPLPRGFSPPPGARPLAPPWGPPQES 
PPPPPPKTAPQKKKKAPS I PAGQWLVEQ 
NPRPQPFKKPPGVGF*NPPPPLNMGF 


3343 


17244 


A 


3366 


3 


319 


HEECLAHTLCFFNLMS*RTTVILHRMFS 
IVTAI *YLLSSQYVIGYYIYVCIHTHTH 
MYIHTHRHRTFICVCVCVCIYIYTSIC 


3344 


17245 


A 


3367 


3 . 


474 


YELLLLNGPILNARAFLILTERKILGYI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
ncid 

residue of 

peptide 

sequence 


Amino ncid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














QLHKGPNAVS P YGLLQ P FADA I KLFTKE 
PL KPATS TI TL Y I TAP TLAL T I DLL L. * T 
RLPIPNPLDNLNLGLLFILATSSLPVYS 
IL*SG*A*NSWYALIGALRAVAQTI* *E 
VTPTIILLSHYYKWLRN 


3345 


17246 


A 


3368 


2 


348 


ARGLQDATFP I I EELITFHDHALI I I VL 
ICFLGLYALFLTLTTKLTNTNISHAQEI 
ET V*TI LPAI I LVLIALPSLRMLY I TDE 
VNDPSLTIKSIGHQWY*TYQYTDYGGLI 
FNS 


3346 


17247 


A 


3369 


64 


363 


KKWGFVPQQKKGGGRISVNGTPIFKGKG 
NFPP*PPGEGGTKGPGPKPK* IWIFKKR 
GGLIFWPKQFPTPGPKGTHPPGPPKGGK 
*RGG P PS PGGFNFKRG 


3347 


17248 


A 


3370 


34 


355 


AKGKEYIETVAEKKKKKKKAPPPI*KAP 
•NPGKKPPLALKFPPPSQKAPPPALFCP 
*KGPPPRVFPKKKKKFPPFGRKGFFFTG 
GKPP 


3348 


17249 


A 


3371 


1 


374 


FLHSLHLSLRLLYLP*KFSMFIILLKAY 
YTLCVCVCVCVCVCVCV* PLNKIHINHF 
TCSTVRGRSNPI FFPPLLN* RPFYTT 


3349 


17250 


A 


3372 


429 


3 


FFFFLKKQKISRVFPRGPEFFGQRKPFW 
AKGPPRAPQKGGE KGKNPS PPPPL I FFK 
KGEKPQGGPLFKKGLGFFSPPKVFFKKG 
VFKSGPGGP * NFGFRGFSP PS PPKKGGK 
KRKPQSPGFFFFFFFFEILGCSGRSRSR 
T 


3350 


17251 


A 


3373 


2 


404 


LGTDLSLI IRADLGQTGDLLGNDH I YNA 
IVTAHAFEIIFFIEIPIIIRGFGN*LIP 
LIIGAPDMAFPRINNISF*LLPPSILLL 
LASAIVEAGAGTSGTDYPPLSGNYSHPG 
AC VDLT I LSLHLAGVSS ILGAI 


3351 


17252 


A 


3374 


2 


417 


AFV * IAHITRDVYYG * 1 1 R YLf IAKGAR I 
FFMCLFLHIGRGL YYGT FLYS ET* N I G I 
ILLLATIATAF IGYVLL *GQ I S F* GATE 
ITNLLSAIPYIGTDLIQ*I*GGYSVDSP 
TLTRFFTFHFILPFIIAGPNPLHLTT 


3352 


17253 


A 


3375 


85 


1 


FWPGAVAYACYPSTLGG*GGWIMRSGVR 


3353 


17254 


A 


3376 


396 


1 


KEGYIKGPRGENFF*KNPERKIFSRQRK 
GGVFSPLPPKKFFFFPKGLIFLGGGCPI 
FPPPKKRVFSKNPPGVFINPP*KKKKIF 
SPPPENWGPPRVFLKGPPPFFFFFFFFF 
FFFFFQTLFLGGCGYNTKLR 


3354 


17255 


A 


3377 


3 


118 


LLP PSLLLLLAYAI LVAGAGTG * TDY P P 
LTGNY * KTKA 


3355 


17256 


A 


3378 


301 


82 


ERKGPPQRGGKKGPKAQKPEKKKPKKRP 
PPPQN*FFSNPPKKKKGGKDPPSKKRGG 
GKGVPPPRKKKKKKKKE 


3356 


17257 


A 


3379 


2 


195 


DR*LFSTNHKDIGTLYLLFGA*AGVLST 
GLSLLIRAELGQPGNLLCNDHIYNVIVT 
AHAFVIIF 


3357 


17258 


A 


3380 


21 


411 


VFCTNH*DIGTLYLLFGA*AGVLRSALS 
L L I RAELGQPG LLLGNDH I YNV I VTG HA 
FVMNLFIVIPIIVGGSGN*LGPLEIGAP 
DMAYPRINNIGF*LLPPSLLLLLAYAIV 
EAGAGTG * TVY P PLAGNY 


3358 


17259 


A 


3381 


2 


421 


GRVGGRVGSTNHRDIGTLYLLFGA*TGV 
LGTALSLLIRAELGQPGNLLGNDHMYNV 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylatanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \— possible 
nucleotide insertion 














IVTAHALAKIFFILVPIIIGGFGNGLVP 
LIIGAPDMAFPRISNISL*LLPPSVLLL 
LESVIAEAEAGTGRTVYPFLAGTYSNPG 


3359 


17260 


A 


3382 


3 


185 


V I Y S T I F AG TL I T AL S S H * F FT * VGL E I 
NMLAFIPVLTKKINPRSTEAAIKYFLTQ 
ATAS 1 1 LL IAILFNNILSGQ * T I TNTTN 
QYSSLI I IMAIAI KLGIAPFHF* VPEVT 
QGTPLTSGLLLLT*QKLPSSISSRKQPH 
P 


3360 


17261 


A 


3383 


5 


398 


FFGHPEVYILILPGFGIISHIGAYYSGK 
KEPFGYLGMA*AMIS IGFLGFIA* AHHI 
FTVG IDVHTRAYFTS ATI I IAI PTGVKV 
FN*LATLHGSNMK* SAAAL* ALGFI FLF 
TVGGLTGIVLANSSLDIVL 


3361 


17262 


A 


3384 


1 


250 


LGNTGETLSLQKNV*KLARCGGTOLRSQ. 
LLRRLK * EHCLTPGGRGCSE PRS CHSS P 
AWATGQDLSQQQQQKYPLRIFSAIIKS 


3362 


17263 


A 


3385 


456 


0 


I FFFSQVETGIHHIGQAGLQLLTSGDAP 
CVGLQKSWDYRPEPLHLAWLIHSLLSNF 
YYC*FGFCKLSVFILINVFCNKTCNSSM 
FLFSYS*NPCAQ*KNEGPLLNKNPCRIT 
DAWADAWVAPVF 


3363 


17264 


A 


3386 


1 


225 


PSRNFLVLVDKLILKFI*KYRGPRTAKT 
TLKKKKKVVRLTLLIFKSYYKTIVITIG 
W YG FQDRQ VD * WNR I E S LE 


3364 


17265 


A 


3387 


3 


400 


LLILGLLTNTLFIYP* WGDATRESTYQG 
HHTPPVQKGLRYGIILFITSQVFFFAGF 
F* AF YHS SLSPTPQLGGHWP PTGI TPLN 
PLEAPLLNTSVLLASGVSIT*AHPGLIE 
NNPNP I FRALLI TN I LGLYF 


3365 


17266 


A 


3388 


24 


407 


I ASGRPFFFFFFFPKGKFFI SRGGGPFS 
PPKI*KNWGKGKSFFPPPQREPKKRGVP 
KNPSLSFYPFGGKPPLGPPPKLVPPSS* 
PGKKAF F FF * TP P PGFLTPLGGGLS PLK 
GGYLLAPHFVPTEGPL. 


3366 


17267 


A 


3389 


148 


413 


LNLNNNNNG KNHL LNT YS CT * LEE KK KK 
KKKKKKKKKKKKKKKKKKKAPGGGFF 


3367 


17268 


A 


3390 


157 


14 


RVSACCQVWW*MPWPATREAEARESLE 
PGRHRLRS CHCDRDPVS KS 


3368 


17269 


A 


3391 


2 


398 


L F S TNHKD I GTL YLL FGARAGVRGTAL S 
LL I RA E LGQ PGNLLGNDH I YNV I VTAHA 
FVINFFIVIPIIIGGFGN+LVPLIIGAP 
DMAFPRINNISF*LLPPSLLLLVAYAIV 
EAGAGTG* TVYPPLAGNYSH 


3369 


17270 


A 


3392 


2 


395 


DQ KD I GT L YLLFGA * AGVLGT AVS L L I G 

FFIVIPIIIGGFGN*LVPLIIGAPDMAF 
PRINNISW*LLPPSLLLLLACAIVEAGA 
RTG*TVYPPLAGNYSHPGA 


3370 


17271 


A 


3393 


396 


54 


GYPPNLTFVTSSLTSVPSPSKPPSHVCS 
PTPAIPQTPRSPPSESTPPTEVSLTTPT 
liS LG AI S S LAS AG P QGAS PAS PG F WGT S 
PP*PYS*NRPQTAACGVFSQLFFIRPEV 
P 


3371 


17272 


A 


3394 


417 


60 


KGWVFQRILWPGPGGSPRYSRPFGGPRG 
LIPLGSGV*TPPGPHWETPFFFNNQPIS 
GQGGGGPL I PLLGGVKPEKI FYPGNQS F 
H*PKIPPHPPPLGAKQNFPPKKKK?QKN 
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TKGFWV 


3372 


17273 


A 


3395 


417 


3 


FFFFFFFQHHFGFEAAA*YWHFVDW*L 
FLYVSIY**GSYSFSINS 


3373 


17274 


A 


3396 


229 


1 


RSFWTKTMGFS CYKDFF + TNGNKDTTYQ 
NLWDTAKAVLRG KF I ALNVP INKLERSQ 
I SNLI S QVKEGQAW WLNP VI 


3374 


17275 


A 


3397 


344 


165 


ELKSPQPRKRNGCFFLAVDPDEGF*NLN 
FCGNGQGLNAPGGYPCEI PMGFVPNAYG 
KTF 


3375 


17276 


A 


3398 


1 


409 


HKD I GTL YLL FCARTGILGTAL.SLL I RA 
ELGQPGNLLGNDHIYNGIGTTQALVRIL 
FIVIPIITGGFGN+LIPLIIGAPNMAVP 
R I NNI S F * L * P P S LLLLVAYG I VEAG AR 
XG* TGYPPLAGNYSHPGACVDLTM 


3376 


17277 


A 


3399 


3 


396 


KDIGTLYLSFtS * AGVLGAALRLLIRAK 
LGQPRNLLRNDHI YNVIVTAHAFVI I FF 
I VI P 1 1 IGGFGN * L.LTL 1 1 G APDMAL PR 
INNISF*LLPPSLL,LLLAYGIVEAGART 
G*TAYPPLSGNYSHPGASV 


3377 


17278 


A 


3400 


49 


350 


KSNSHTEVQAGQEQKSFFRSKHKAGRAG 
SDPAPYVLAGCLCRSISSFLFFFFSLFF 
FFFFKRDGV*QGYSSGSNTPGVKQSSCL 
GLPKYWDYRREPLHPA 


3378 


17279 


A 


3401 


_ 


398 


KP P AI TQYQT PL, F V * S VL I TAVLLVLS L> 
PGLGAGITILLTDRSLNATFFDPAGGGD 
PILYQHLF*FFGHPEAYILILPGFGIIS 
HIETYYSGKKEPFGYIGMG*AMISIGLL 
GFIA*AHHIFTG» IDVDTRS 


3379 


17280 


A 


3402 


388 


42 


TPLFNPPGAKCVNPFFPKKLKKKNPSRA 
MGGPPLFPPLWGGKPKGFP*NRKSGAPV 
THFAPPAPPPGGKREPPPPKKKKKKKRK 
EEKESNRAVQI KKRGNKKGRLPVLDPAE 
ESQ 


3380 


17281 


A 


3403 


413 


2 


SPHPLKNFFLPPKPPNSGGGLiAPFVPPQ 
K*GSLPKIPRGRYKSPPFKEKTNALPPR 
GKPGPPRASSKRPPLFFFFFRGRVPLCH 
PG * S TVQCVQAQCTLELLGS SNFLTSSS 
PVTWGHRQGPPRP INFFI FFRGRVP 


3381 


17282 


A 


3404 


433 


274 


L.GWAHACNPS TLEG * GGRTAWS PGVQD 
QPGQHSETPISTNFFWYFRISYFHI 


3382 


17283 


A 


3405 


131 


427 


GPPPQPFFFWGKRGGPNPFFFFFSEKRD 
VF+QFFLRRRGPQKFFKPPFFCLQKTSL 
FSEKKKKKGFGPPLFPQKKKGCGGGPHY 
KKEGFTLNSFLFQGDGVSSVAQAEVQWR 
NISSLQALPPGFT • 


3383 


17284 


A 


3406 


413 


137 


SPPPPPQGGIFFFFKPR*ETFSPPPNI,G 
FFS P PS PLKFFFFPKGFI FLGGGGPKFP 
PPKKRFFFQNSPGGFFFPPF*KKNFFFL 
PPVILGPPRVFF*GPPPPKK 


3384 


17285 


A 


3407 


319 


540 


IVFKDQCEESFTTLNVDIRNHQNIiLDSL 
EQHGKGDLL * GANAYHCEKCNKKVRAVQ 
LCLILITFNGFRKNRFFW 


3385 


17286 


A 


3408 


2 


403 


SLHLGGGSCIVGAIYFITTIINIKPAGI 
TQYQTPLFV*SGI»ITGILjIjIjIjSLiPVLAA 
GITILLTDRNLFTTFLNPGGGGDPILYQ 
HLL*LLSRPEVYILILPGFGIISHIGTY 
YSGKKEAFVYIGW*AIISIGC 


3386 


17287 


A 


3409 


2 


142 


KKENYRPVSLNRDAKSLNQIQLHI *KII 
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HCNQVGFIPGMQGWFNIL 


3387 


17288 


A 


3410 


407 


48 


PPGGKIFFYKNPEKK.IFPTQQKKGVFPP 
FPLKNFFFPPRGKIFGGGGGQNAPPPKK 
GFFPKIPQGVF*SPPKKKKKNFFPPGKI 
WAPQGVFLKGPPPFFFFFFFFFFFFFFF 
FFFLGVR 


3388 


17289 


A 


3411 


3 


402 


SYRLE*QLiEDKAAPPQDFTQQLRWPEKQ 
HRMVWPDKEDMHKQLVEASETLKSQAKE 
L*DAHQQQKLALQEFLEFNELI4AELYSH 
NHKVWDKEEEMEVAMHKADMMWQEI * RS 
KKLTKRMLFTQMRR ITPVI PS 


3389 


17290 


A 


3412 


1 


224 


L.LFQLLRRLRQENHLNLEGRS CSELKMH 
HWTP P WGTE* DCL* LiKKKQS AYLPNNT I 
KKLTG S RT I ED I CTE YN K 


3390 


17291 


A 


3413 


202 


1 


IVMSWPGWTHA* I PSTLGGQGGWITRS 
GVRDPLASAFQSAGITSVSHCSQLIFVT 
VFYLS P LSHLY 


3391 


17292 


A 


3414 


407 


309 


PS FKAS KDRLTLLLGANAAGDLKFTPMF 
IYHSENPRALKNYTKSTLPVPVLCIRNI 
KAWMIAHLFTAWLNEHFKP I VETHCSGK 
KI S FK I LLL INKAPGHRRAQMERHKMNV 
VFM LANTTP I h KLMDQGVF + TF I LCL S I 


3392 


17293 


A 


3415 


404 


46 


LKKPPFQKFLKILKKKKKGGGGGPPP*T 
PPFGGGRGGVPPRAGVSKNPVPQGETLF 
FFKNPKFPGVGGPAP* FPLLRGVNKKNF 
FNLGGGGFKKRKFSPPPPPRGKKFFFPK 
KKKKKKE 


3393 


17294 


A 


3416 


23 


404 


S VL I TGAL.LLLS M PVLGAG I T I LLTDRN 
LKTTFFDPAGGGDPILYQHLL* IFGHPE 
DNILILPGFGIGSHIVTYYSGKKEPFGY 
IGMA* AMKPIGFLGDIAGAHHVFTG* ID 
VNTRSYFTSDTINMG 


3394 


17295 


A 


3417 


38 


469 


SGTTHASALFDPAGG*DFILYQHIiF* IF 
GDPEVYILILPGFRI ISHIVTDYSGKKE 
PFGYIGMA*AMRSIGFLGFIG*AHHIFT 
V * I DVDTRAYFTS AT 1 1 IAI PTGVKVFT 
* LATLHGSNMK* SAAVL*ALRFI FLFTV 
GGLT 




17296 




3418 




419 


FSTNHKDIGTLYIjLFGA*AGVIiGTGLSL 
IjIRAELGQPGNLLGNDHIYNGIGTAHAF 
VIIFFIVIPIIIGGFGN*LDPLIIGAPD 
MAFPRINNISF*LLPPSLLLLLASAIGE 
AGAGTG * T VYP PLGGNYSH PGASVDLT 


3396 


17297 


A 


3419 


397 


107 


KNSAPPFSPIFLP*GPKFAFFPPGKKVG 
KK I F * K F P APG PQGVYLREGGGS P S P PQ 
TPNWPNKKAAFVLP PPPFFVFGNPKKKK 
KKAVAFS S PNRR 


3397 


17298 


A 


3420 


3 


253 


RGCREMRLCHCSSAWAIAGVSEEKKKKK 
KNWQ * CLLG I CS YLDLWLPGLFKWLSQ I 
GRIVELQLLELKQYLRGITKQTLKILiH 


3398 


17299 


A 


3421 


1 


393 


RTRTIKTPAITQYQTPLFVGCALITAAL 
LLLSLPVLAAGITILLTNRYL.NTTFFDP 
VGGGDPILYQHLF*LFGYPGDYIVILPG 
FGIISHMVAYYSGKKEPFGYIRMD*AMI 
S I DFLRI I V * AHH I FTVEN 


3399 


17300 


A 


3422 


142 


377 


LS KNCYKPS YVGFVCQCKFFNQLS KFLE 
VKIiLYCMVKLC* * EG* MAQPL* KQFVLV 
ALKLNIVLPYNKAISLPGIYSTD* KTYT 
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DTQNLHTTVYS S S L 1 1 1 KN *KQQRCPSV 
GE* INKCFI YTTEYYSLTKRNKLSSHMK 
A 


3400 


17301 


A 


3423 


2 


435 


I.ILPGFGIISHIVTYYSGKKEPFGYIGM 
G*AMISIGFLGFIV*AHHIFTVGIDVYT 
RAY FTSAT 1 1 1 AI PTGVKVFR * LATLHG 
SNMK* SAAVL* ALRFIFLFTVGGLTGI V 
LANS * VD I VLHDTYYVGAHLH YVLS IRA 
VFAI 


3401 


17302 


A 


3424 


2 


436 


LNTTFFDPAGGGDPILYQHIjF* ffghpe 
VYILILPGFGI ISHIVTYYSGKKEPFGY 
IGMG*AMISIGFLGFIV*AHHIFTVGID 
VDTRAY FTSAT 1 1 1 AI PTGVKVFS * LAT 
LHGSNMK* SAAVL* ALRFI FLFTVGGLT 
GIVLS 


3402 


17303 


A 


3425 


1 


193 


PTRPPTRPACSEIjRSRHCTPAWVTE*DS 
VSKKKKKKKFSPARGAHVCTPTFLGGQR 
GKKFCFAP 


3403 


17304 


A 


3426 


3 


446 


HKKFLFPKSFFNRVFPPTKKKNPRGPGF 
FFFFLKKIFFFPPGGI+SGGFGSLQLFP 
PRVKKFFSPPPPEKGGFKAPPPGPGNFF 
FFLKKRGFSFFGGGFLK*NPGFPPPPPF 
KKWGVPKGGAPPPPPFF 


3404 


17305 


A 


3427 


202 


1 


FS PGRMLGNT IKLPYTHSFI LMNS QNNL 
EKYKVGGI PLS NF KTNYKATAI KTMWY W 
HMMRYMD*WNR 


3405 


17306 


A 


3428 


159 


1 


NPPHEKNFCSFFQMESHSADQAGVQWCD 
L,SSLQPSLRGFKRFS*VSLVSSWDH 


3406 


17307 


A 


3429 


22 


298 


ESYASTAKATERDMSDRMLSASGMPRRY 
SDYPDAYTT*NILSSVGSFISLAEGILI 
IFMI *EAFASKRKVLiIVEEPSINLE*LY 
GCPRLRGR 


3407 


17308 


A 


3430 


267 


26 


GITFFFFFWLEI * LYY WLFLANKLYFHE 
QIWFGIiVFFFGLLLFDFSFYSDYFPLiLIj 
CLGSFSNF*CLIHCFFSVIIVLLF 


3408 


17309 


A 


3431 


274 


376 


NKG* ISCSISPPQHTVRRGLTL.SPRLEC 
SGVISAHCRLCLPGSSDPPASAS 


3409 


17310 


A 


3432 


177 


47 


FYFFFFFFFFFFFFFFFSLIiFFFFFFFF 
FFFFFFFFFFFFQINRSFIASFKYHK*V 
LIFKKKKN 


3410 


1731 1 


A 


3433 


379 


83 


FKRH FTKEDLRMVTEHIKS CS S SLATRA 
MQIETTMAHCYTFMRLAKIKKYQYTKCC 
*R*RVTGTLTYCWWECKMTQPFWKTIVR 
RFLKQYIYHVIGYV 


3411 


17312 


A 


3434 


214 


3 


AASTFFFFTRSTYYVCIFLKSKRMINK* 
NYI KLKS FCTTTTKINKMKRQPTDWEKM 
FANHINHHEGLILK 


3412 


17313 


A 


3435 


1 


353 


GGQGQ PGQ * S ETLS Q * KVSWAW * LQLHR 
RMRQDNLIiSPGDQG*SEPC*CYYTPAWA 
RVRL CQNRS VNEKSLHDI CNDKK* QNI * 
I ISVPEGKQRMKGLENLFNEILDENYPS 
IiARDL 


3413 


17314 


A 


3436 


29 


394 


VFMEVDFFFFPFFFFFFFGKRNFFFPPR 
GGGGGPF* IFGPPPPRGKGNFPPPPPGK 
GGKKGPPPPPGFFFFF*KKGGSPPGGGG 
FL.TPPLWDPPPWPPQRGGFGV*TPWPGP 
GSDFLNFFFF 


3414 


17315 


A 


3437 


108 


2 | PVAVAHSCNPNTSGGRGGWIS*GHEFQT 
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SLANTVK 


3415 


17316 


A 


3438 


215 


3 


LLLLREFFLPKHREVFIFVFYFMRQGLT 
LiTEAGVQWRSHGSIjWPQTPGLK*SSCI>S 
LPSIWEYSAYHHIR 


3416 


17317 


A 


3439 






QTSS FS FRRS VITNFFKLKDDV* TH* KE 
AKNLEKRLDEWL I R I KSVE KTLNDLMEL 
KTSTRP 


3417 


17318 


A 


3440 


290 


3 


PGLGGGPRYSRFSGGLGPKIPLNPEGEG 
S I KLN PP PAP P PGG PN*DS FLPP P PKKK 
KKKTHVLFAGSGNLLQPLQPGAFTTEVN 
GGLAQQSPKVL 


3418 


17319 


A 


3441 


163 


2 


PAGL P VKP PRFQDKLNFPFGP FFF FF FF 
* I *DKVL.IjCHPGESATVHSSLGDRA 


3419 


17320 


A 


3442 


131 


1 


RPRRPLTIKSIGHQWY*TYEYTDYGGLI 
FNSYILPPLFLEPGS 


3420 


17321 










PGAAHSTGHLPLRFHLRLGAVTHAGNPR 
TLGGQGRWIT*GQEFKTSLAKRVRP 


3421 


17322 


A 








WEKIFANHVSDKGLIYRIYNELQQLiYK* 
KTNGPIKKRPKSTRP 


3422 


17323 


A 


3445 




117 


IiALLGIjKPGIjK*ATCLGLPRYWDYRHQP 
PHPACFSKGI 














FGFFWTLELKKGIPFFLNFFFFPFLRVL 
KGSKNPF*KRGPFFFFFFFFDNFYIWN 


3424 


17325 


A . 


3447 


106 




LLQPNGQLGPGTVAHGCNPSTLGG * GER 
ITTSGVR 


3425 


17326 


A 


3448 


28 


333 


GQQEQNSISXKKKKKKKKKTPFKKGKRK 

KRAPEKI FKKKPRGNWETFKRMEKGFLK 
GQ*PGNP* KVKPLWGGKK 


3426 


17327 


A 


3449 


81 


339 


P I VSS CPGVAGALKQAMTLEFKVYQHHV 
VANCRALS EAXjTEXjG YKIGTGRDTDGVQ 
QAJLLWGLRLiAS * P I VSSCPGVAGALKQ 
AMTLEFKVYQHHWANCRALSEALTELG 
YKI GTGRDTDGVQQALLWGLRLASQFV 
QPGCGPGSAAAAAMM 


3427 


17328 


A 


3450 


166 


322 


RDFSTTVLITDRTSRQKIPTDVSQLNLI 
DI YKTL* PTIGKYTFFSSTYRSLR 


3428 


17329 


A 


3451 


257 


354 


KFLS FFRDRVLLCCPGWSAW* S * FTAA 
LSSW 


3429 


17330 










TDQQKTKEL * EEQLYAHRVDNLDEMDKF 
LEGYK * PSMTHGKIENLCRP I TSKDIES 
VIKNLLTDPP 


3430 






3453 






GQ PGT VAHACNP S TLGG * GGRIT 


3431 


17332 


A 


3454 


108 


353 


THLGGNP VRLYAWLLLHYVAS YR I FS VS 
HCKDQTYLPILL*NFLMELGTVAHTYNP 
SILGGQGRRIT*VQEFETSLiANMVKC 


3432 


17333 


A 


3455 






GPVTPNIPPVPPLGGQKELFPPKKKKF* 
KFWGSPPPLFFKKPKNPRETPPESCPKP 
KKTPIPVKV*GKKTP+TFSKKKKTPPIP 
ELFIiLKKKKKLIN 


3433 


17334 


A 


3456 


343 


2 


KAPFFFFFKKFWSQKKRGLFFFFFKNFF 
IPGFPGPTFFFWGFFFFKKGPPVF*LKK 
FFGGPLPPLKDPPGVFPFSPF* FWDFPP 
PPPFFFFFFFFFETVSLCRPGWSAVSRD 

C ■ 
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SEQ ID 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny!alanine, 
G=Glycine, H=Histidinc, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutainine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3434 


17335 


A 


3457 


2 


259 


LTKPTESGTEDKKGRDSKESEGKTERTE 
SLRSQNGGENSV*SCPSTSSTAALNTAA 
AANPLALNPQVSLSSLSCCPISIKGSFV 
P 


3435 


17336 


A 


3458 


186 


334 


AGGNKKMEKRNM *NKE I F * WFKKNWPGA 
GAHACNPTTLGGRGRR IMRSG 


3436 


17337 


A 


3459 


207 


1 


YFKPTLKINDKGYLNETIDKFDPRDINR 
TSNLTL*EHALFPSTCGIFMRRDYIEGH 
EVNLNQFQRIRTQ 


3437 


17338 


A 


3460 


268 


3 


PLNPGPKQSSPPGLPKPWPFRGEPLPLG 
PWFFFVKNRGRP*AFFGTNPLAFFLFFii 
*DRVLFCHPGWSVWQSQLTVQPPPLRL 
KQFS 


3438 


17339 


A 


3461 


350 


2 


AETAHNINNAFGPGSANKCTVRWWFK* I. 
CKGVKRLEDKALGGWPSKVAINQLRAI I 
KGAPLKPTRKVAKELNIEPSAVIQHLKR 
I GKVKKLHKW VPHELHENKRNRFEWS S 
LIIP 


3439 


17340 


A 


3462 


312 


1 


HIVTYYSGKKEPFGYIGMV*AMISIGFL 
GFIV* AHHIFTVGIDVDTRAYFTSVPI I 
IAIPPGVKVFS*LATLHGSNMK*SAAVL 
* ALG F I F LLP VGG LTR I VLA 


3440 


17341 


A 


3463 


181 


3 


FLPLLAAFSGLLLFSGFLGVFLFCFFET 
ELPRLECCGT I SAHCSL* LLGS SDS CAS 
GTH 


3441 


17342 


A 


3464 


176 


263 


VGHDAHN PSILGG PGGW I T * GQE FKVDA 
A 


3442 


17343 


A 


3465 


239 


336 


EPGEVAYSCNPSTLGGRSGWII *GQEFE 
TSLT 


3443 


17344 


A 


3466 


88 


326 


QTSLLNTI**PLH*KNIPLPPPPLPPTL 
TPPFSPPPPPPPPKKKKRKRK 


3444 


17345 


A 


3467 




345 


EEERRRERKKKRRKXEEKKKEEREREEK 
ERKKKKKKKEKKKRRGRQEKRKGTG* IE 
RKQ 


3445 


17346 


A 


3468 


265 


335 


QMNPDRHLQF *DKVLLCHPGWGTVARS * 
LTATSRSSCLSPPSGWDCRRVPPCLASF 


3446 


17347 


A 


3469 


115 


342 


INKQINKIKDPTSYNILLSQSLI*FKAL 
TLLNS VKAQRGEKATEEKFEGS IS* FMR 
FKQINHLYNIKVQGEKPSTD 


3447 


17348 


A 


3470 


56 


328 


RTYPTKPQKMRDPEDPTSRPRYI PRHLY 
ITDEVNDPSLTIKGIGHQWD*TYEYTDY 
GGLT FNS YILPPLFLE PSDLRLLD VDNR 
VEPATRA 


3448 


17349 


A 


3471 


3 


329 


RIGARDESSLAVTHKPRKIHRKLTLWVF 
VNDSLVMPRPDKMHQWAFNKNCFPLVDV 
GIDPYLVYHLRPHQAQGIIFLYECVMGM 
RMNGR CG A I LAD * NGLGKTLQ C I S L 


3449 


17350 


A 


3472 


154 


2 


PNLLYPKSYWKKKDIWTPMFIVALVTIV 
KIQKQPRCPLMDKSIR*L*YIH 


3450 


17351 


A 


3473 


132 


340 


ALKTENI TTLCLD F*LIETVR* * MCVFK 
KKKKKKKKKKKKKKKKKKKKDPPKKKKK 
IK 


3451 


17352 


A 


3474 


3 


345 


VIYSSIFAGTLITALSSH*FFT*VGLEI 
NMLAFIPVLTKKIWPRSTEADIKYFLTQ 
ATASIFLLIAILFNNILSGL*TITNTTN 
QYSS L 1 1 1 MAIAI KLG I APFRRRGRE FQ 
IY 


3452 


17353 


A 


3475 


1 


254 


FHHVGQAGFTPDLR*STRLGLPKCWDYR 
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ing to first 
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peptide 

sequence 


Predict- 
ed end 

locution 
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last amino 
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residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phcnylalanine, 
C— Glvcine, H— HistiUine, I— Isoleucine 
K=Lysine, L=Leucine, M=Methionine, 
N = Asparagine, P = Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown; *=Stop codon,/=possible 
nucleotide deletion, \=possible 














REALCLVHILLYLPMIGPFRIQKLLKAA 
VFVFLQIjHQAFIQSLIjTIjFYFISFPCNS 


3453 


17354 


A 


3476 


351 


186 


SVLSADNTEMGFHHVGQAGLELLSSSDP 
LAPAS * SAG I RG VRHNTWP VLT ITLN 


3454 


17355 


A 


3477 


330 


3 


GEKVFEIPLTRDNVSEIIiRCFLMAYGVE 
PAIiCDRLRTQPFQAQPHQQKAAVLAFLV 
HELNGSTLIINEMDKTLESMSRYRKYEW 
NVKERLRRLKTARA* RLGRSQ VKK 


3455 


17356 


A 








NRLNPGGGGWSEPRSQHSTPTWVTEYDS 
VSKK* INKNKNWK 


3456 


17357 


A 


3479 


3 


92 


EGQGCSEP*SRYCSPAWVTE*DCVSKNF 


3457 


17358 


A 


3480 


380 


2 


VEVPLLNTSVFFASGVSIT*AHHSLIEN 
NRNQI IQALLITNLGGLYFTLLQASEYL 
ESPFTISASI YGSTFFVATGFHGLHVI I 
GSTFLTMCFIRQI I FHFTSKHHFGLEAA 
A*CWHFVDWCIFL 


3458 


17359 


A 


3481 


225 


-] 


ERAERWEQVMIALFSPNYIRLSFPISP 
TPWRHKFRFHLKGYIWPGTVAHTCNPS 
TLGSQGGQI TRSGD * DHPG 


3459 


17360 


A 


3482 


332 


1 


IKYFLTQATAS I ILLIAILFNNILSGQ* 
TITNTTNQYSSLI I IMAXAIN*GIAPFH 
F*VPEVTQGTPLTS 


3460 


17361 


A 


3483 


277 


1 


LSMSSFSISSIPSASINS*ALLF5SAES 
NSLECTFSSSFSPNSSESFSSSFPSCSC 
AS*FSNSQSLIKASSSTEASPLNSSASC 
FLPLPLVG 


3461 


17362 


A 


3484 






ALSVQPLWRPRQENGVNPGGKASR*PRF 
PPCG PAWATKGELVPKKKKKKI S FLGHD 
QSLELLLQR 


3462 


17363 


A 


3485 


31 


347 


FCILERKAFLYVV*CWKFFFFFFFFFL,E 
KGFFFIjPLLVFFWCYYHLKGASLP* vfl 
FFYLHCYSSLFTLFFIFLQSLTPHFFLV 
*TFFYYDPLMFFFLTLFLELV 


3463 


17364 


A 


3486 






QKKVRDPYEQL*ANKFKN*EEMPRFLEK 
HKLPKLTQEEIKNQ 


3464 


17365 


A 


3487 


311 


3 


PRRGRFQLTQNFALPFNLGNKTKI PFKK 
KKKKTE PLCPCTCVCAAFVCLCFHLCT * 
P CDCAHDCDN I CVCVNI CVYVCWTHLH 
TGLCSCVDCDHI CVGTCC 


3465 


17366 


A 


3488 


245 


1 


RPRRLLALLLI 1 1 TF * LPQLNGYIEKS T 
PYECGFDPISPARVPLSIKFFLVAITFL 
IjFDLE IALiIjIjPLP*ALiQATNIjPLIV 


3466 


17367 


A 


3489 


2 


400 


AAP I LGMLGLRQRGKYKRQRQLLCLWS A 
ILLFLYFLNKLL+LTKKKKKKKKKKKKK 
KKKKKKKKKKKKGG PRGGN PQILKGKIG 
KPPGGFFKGGGGPKKIFFFKNRGALFFK 
KKPPQKKNLGPGPPGLKNFLKEKKKA 




17368 




3490 


57 


366 


INVFAGKDQLQPLCACPSSPI ISVESDR 
PDMYQCGLNHFI YLI I YLRWS FALVAQA 
GV*WHHLGSLHPLPPEPKCFSC 


3468 


17369 


A 


3491 


25 


384 


IRGTF*FFWGGLGGLSKNNPL*KKNFSP 
PKX.GVLPPPLFFPGGGFFFFFPGAPPGF 
KKKKPPPKKFFF* KKGAPPFFFFFFFFF 
FF 


3469 


17370 


A 


3492 


170 


41 


AQPDQHGETTFLLKIQKLTRHGGI CL* S 
QLIGGLS FI TQKI TC 


3470 


17371 


A 


3493 


1 


240 


VLSPADKTNVKAAWGKVGAHAGEYGAEA J 
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Amino acid sequence (A=Alanine 1 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 














LERMFLS FPTTPS * FLDFFFFLEKIGFH 
HVGQSSLGFLTSRDPSAYNSLLGS 


3471 


17372 


A 


3494 


359 


204 


TQWLMP VI LAPWEAE VGR * PEVRSLRPA 
LNSPFFKRLWIMMYIMNI IILS 


3472 - 


17373 


A 


3495 


1 


178 


DRLS PGV * GS S EL * LCHCTPGWATE PDH 
VSRKKKKRKKVGLGKNKAKQKHPRGYYR 
AEL 


3473 


17374 


A 


3496 


361 


1 


LGPKLLKKPFWISIGIKGVAGLPGFFPT 
PPQ*G+NFGLKRGGFFFYFFEAKGWFNT 
PGGKQRGPFRSLEPFPFRVKLFFPPNLP 
NNWGHKGAPP KP P FFFFFLRWS FALVAQ 
AGVQWCDL 


3474 


17375 


A 


3497 


276 


79 


AEVYKI KCELGAVAHACNPSTLGGQGGR 
ITRS*VSASFTKKYWQLALMKELLNE 


3475 


17376 


A 


3498 


3 


476 


FGTSMSIITLITPRLARGNNNLFSCFPT 
FTFDHLATPLLILTT*LLPLTSMASQRH 
LFREPLSRKKLYLSILNSLQSSLIITFT 
GTELI IFYIFFETTLIPTLAI ITR*GNQ 
PERLNAGTYLLFYTLVGSLPLLIALIYT 
HNTLGSLNILLLTVSAQE 


3476 


17377 


A 


3499 


581 


2 


AHACNLNTU3GRGRWIT+GQEFDTSLAN 
K 


3477 


17378 


A 


3500 


380 


135 


RHQSGQIGKTPSLVKIPKLTPPKGGVL* 
SHLTKRARQEKGLNPKSKGFN* PKLRPC 
PPTIVAKKQNPVSKKKKQTRQEPSA 


3478 


17379 


A 


3501 


1 


386 


GSR * GNHPERLNAGA YFLFYTL * GSLPL 
LIRLIYTDNTLCSLNILLLTLTGQELSN 
S*AHNLI*LAYTIAFIVKIPLYRLHL*L 
PKAHAQAPIAGS I VLAAVLLKLCGYG I I 
RLTL I LNPLKKH I DY P 


3479 


17380 


A 


3502 


1 


389 


EKQL * KDKQVYRATHRLLVLGAGESGKS 
TIAKHMRILHVNGFNGEGGEEDPHAARS 
YIDGEKATKVQDILNNLKEGIETIGAVM 
SNLVPPVELANPENQFIVDYILSAMNVP 
DFDFPSEFYEHAKALWE 


3480 


17381 


A 


3503 


2 


379 


PFGYIGMV*AMISIGFLGFIG* AHHIFT 
VGIDVYTRAYFTCAT I I IAI PTGVKVFS 
* LATLHGSNMK* SAA£L*ALGFI FLFTV 
SGLTGIGLTNSSLDIRLHDTYYWAHFH 
YVLSIGAVFAIIGG 


3481 


17382 


A 


3504 


281 


68 


FKMRFGWGHSQTVSENYKPLLKEIRENP 
NK*KNIPCSWIGRISIVKMPTLPKVIYR 
FNAFPIKLPSPFFTL 


3482 


17383 


A 


3505 


139 


263 


ETGSHYVPQ AGL * LLAS SD P PATAS QN I 
GITHVSYYVQPL* PIFWEAVAGGSLEAR 
S 


3483 


17384 


A 


3506 


2 


427 


LKTSMTM ISTTLLT* LRPAWH * AQ YQTP 
FFV* SAL I TAGLLFL FLPVLAAG I TVLL 
TDRNLNTTFFDPAGGGDPILYQHLF* FF 
GHPEG Y I L I LPG FG I I S HIGTYY S GKKE 
PFGYIGMV*AMISTGFLGVIG*AHHVFT 
GG 


3484 


17385 


A 


3507 


343 


3 


GWHSRSSGGASSPSSPSHRSRLRVSSGS 
LGRL * WS KS S KTSVS CGWARDTE KPPL 
S KIEASS S DS CES VMAKANAVFFTS TTG 
EKPASSASLSLSVWASGSSPPGAANSSS 
S 


3485 


17386 


A 


3508 


3 


363 


G P GD KAARR I ALENLD IT* KWGMRTHKC 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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nucleotide deletion, \=possibJe 
nucleotide insertion 














AKQLLS KQPF * SSRTGDLKLELQIELVR 
ETKRKC*RVIj* IDRALTSHliYSrjLQTQD 
ALGDAS ADL I Q KTS * LQEE FG YYAKTQK 
LLCRNGET 


3486 


17387 


A 


3509 


3 


434 


GPGDHVARGISGKKYDIGQKWGMGTYKC 
TKELLSQRFGLGS PTVDLEL * LQIELLR 

ALGDAFADLRQKS PQLQEE FGCYAQTQK 

EDTL 


3487 


17388 


A 


3510 


1 


401 


GTRKNS FLHDSQTS FCFSDS I PTPSNME 
ETQHKSNLELLRISLLLIEPWLEPVRFL 
KSMFANWLVYDTSDSDDYHLLKDL*EGI 
PTLMGRLEDGNRRTGQILKQTYSKFDTN 
SHNHDALLKNYGLLYCFRKDMN 


3488 


17389 


A 


3511 


117 


2 


FKFIFSEMESHSVA*AGVQWRNLGSLQP 
LPLWFKQFSC 


3489 


17390 


A 


3512 


491 


90 


KKVKIV1AGFFFKPPPPKN*KKKGLK*R 
KGVGPRGYTVNPHPCGYKPFGGPGVQNP 
PGQKGETPFFQKKNKKPGGGGGALIPPP 
KGGGRGVPLSPEKFGLKFPPGPPP*GKK 
KNRFQKKKKKKRQN* PQKSQC 


3490 


17391 


A 


3513 


2 


31 


GKGAPTTSLINVAGTKIIAKWEDNKLP 
GAICSLTCGGAHIGTPMARDERVNLLSF 
TGSTQGGKQVGLMVHERFGRRLVELGGN 
NAI I AFEDAHLNLYVPSTLFAAVGTAGP 
KCTTARRQLIHES IHDEA*NHFPH 


3491 


17392 


A 


3514 


2 


390 


KEFHKTHPLiENITIjAPDPEGPDGIjPPVA 
ynpwmdxrqredgqtlnirepygpipvd 
F*RKIRQS YFASGA* LDTQDGRLLS ALD 
DLQLANS T I IAFTSDHGWALG EHGEWA * 
YNNFDDDTHDPL I FYDT 


3492 


17393 


A 


3515 


3 


410 


RGTGNNWAQGHYTKGAELVDSVLDWRK 
ES * S CDCLQGFQLTHSLGGGTGFGMGTL 
LITKIREEYPNRIMNTFRVMPLPKV*DT 
EGEPYNATLLVHQLVGNTDETYS I DNEA 
LYDICFRTLKLTTPTYGDRHHLVS " 


3493 


17394 


A 


3516 


1 


396 


GE DAAN I YARGH YT I GMEN I DLG LDR I R 
KIiADPCTGHjQGFrjVFHSFGGGTGSGFTS 
LLMERIS CDYGKKNKLEVS I Y PAPQVCT 
AVFEPYNYI LSTHTTLEHSD * AFMVDNE 
AIYDICRINLHIERSTYTNIi 


3494 


17395 


A 


3517 


2 


399 


EKIGYNPDTVAFVPISGWNGDNML.EPNA 
NMPWFKGWKVTRKDGNASGTTXjLEALiEC 
I LPPTRS TD KPLRL PLQDVYKNGG IGT V 
PAGRVETGVLKPGMEVTFAFWGTTEV* 
SAERHHEPFREALPGDNVGF 


3495 


17396 


A. 


3518 


1 


402 


GEDAANNYARGHYTIGTEIIDLLLDRIR 
KLADQRTGLQGFLVFHSFGGGTGSGITS 
LIiMERLS IDYGTKSKLEFSI YPAPQVST 
ACSEPYNSILTTHTTLEHSDCAFMVHNE 
AIYDI*RINLDIERSTYTNLNR 


3496 


17397 


A 


3519 


1 


408 


GSCLGQKLYNSTTKTAT* WSSNHTKKNP 
FSKFPKXQNVWTHPESHRDWTAPTRLYW 
ICGHRPYVKLPNQWAGSCIIGTIKPSFF 
LLPIKTGELLGFPVYASHAKRS IAIGNW 
KDNEWPPERIIQYYEPATWAQDGL 


3497 


17398 


A 


3520 


3 


162 


ETVLLCCPGWL*TLGbRQSSCLSLPKCS ' 
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DYRHATVPGQKYRFFNRYYYGSYS 


3498 


17399 


A 


3521 


184 


354 


ETGSSIjLiP*GSSELjPVSHPYFFNTHSIjS 
LSHTHTHTHTHTNSSISNYHCPLYTNMS 
CQRLVAVYRQKGSFKDQCTDI 


3499 


17400 


A 


3522 


2 


313 


YYLSPPMGDQPGQHGETLPLQKLAG*GG 
ARLRSQLLRRLRQENCLTPGIQACRELL 
GWYKSNCAFCQQKPQLL.CVNLTLYSSLD 
DKARDSVSTKTKREGRGEK 


3500 


17401 


A 


3523 


13 


127 


LGL.QARTTS M PS S * G YRRAP P P C P ALL, Y 
F*VETGFHHVGQAGLELLTSTRP 


3501 


17402 


A 


3524 


148 


1 


VCMFLEES CSVAQAGVQWYNHSSLQP * T 
PGLKRSF*LSLPSSRDHRLMT 


3502 


17403 


A 


3525 


I 


98 


VNPGGEGCSEPRSHHCTTAWVTE * DS IS 
KINK 


3503 


17404 


A 


3526 


138 




AASTHDLG*LQPPPPGLTPFSCLRLLSS 
WDYRPVPLRPANFLYF 


3504 


17405 


A 


3527 


10 


206 


PTTSliISFTSYSPCLKCSNVYEQVNRMR 
VLFYKSKQRLGMVAHAYDPSTLGGRGSK 
IT*GQGRRG 


3505 


17406 


A 


3528 


345 


3 


VTEL I QSNKAKLKKKS KNKNEQSLQKI * 
DYAKQPNLRISGVLEEEEKSKSLENIFE 
KI I EENFPGLTRDLiEIQ I QKAQRPGKFI 
AKRLSPRHTVIRLSKVKTKGRILRAERQ 
K 


3506 


17407 


A 


3529 


3 


90 


AVL * ALGF I FL FTVGGLTG I VLANS VDA 
A 


3507 


17408 


A 


3530 


3 


90 


AVL*ALGFI FLFTVGGLTGIVLANSVDA 
A 


3508 


17409 


A 


3531 


354 




AVINSLPTKSPGPDGFTAKSYQRYKEEL 
VLFLLKPFQRIEKEGILPNSFYESSVIL 
I PKPHRGTP KES FRP I SLMNLNAKI LNK 
RLAS *NHQKIKKVIHHGQVGFIPRTQGW 
FNKQ 


3509 


17410 


A 


3532 


2 


357 


FTRPR FS FT F FVL I RKYALS CGKISIQH 
S Y * KKKSKGKLGGL *G INFCFFERKFHS 
VTQVGGQKHDPG*LKFPPPGFRGFSRLS 
LPSSWKHKRGTPRLANFCTPNKNGGRRG 
RKFQIY 


3510 


1741 1 


A 


3533 


76 


365 


KKKKKKKKKKKKKKKKKGGAGPE I PKLK 
MGKSEKPPGVFLRGVGEKIKFFFLKKGG 
PFF* KKKPLKNKNWAREPSFKKIFKKKK 
KKAHFFFGPKI LKK 


3511 


17412 


A 


3534 


7 


353 


FFGFGGVFLVFLFFFF*KYPFFFPPPPL 
FFFPFFKWAPRRPPPPFFSSFFKNFFFF 
FPPPPPFFFFFPQKNFPLFFSPLFFFFS 
PKFFFFFFPLTKKKKKKNKKKKTKTPKI 
HPQTQRK 


3512 


17413 


A 


3535 


354 


41 


AGGVPPGIPPPWGGRAGGSPKGRGFGPP 
WPKRGTPPFPF* KKKKKKKKPGGPAI PA 
TREGEGGKS P * PWKPRVQ PT* NAPP P S P 
PGEGAKPPFQKKKKSMFHKR 


3513 


17414 


A 


3536 


174 


364 


TFNSLAFTFFFFF*RETGSHFFLEAGVQ 
GNHSSLNP*PPGLRGSSCLTLPKC*DYR 
REPLPPA 


3514 


17415 


A 


3537 


38 


368 


PGRLRLQEAVNH CTLE PGQQS V* KKKKK 
KRGGAF F KAKLTIiAG W KGE KKKKTGV FF 
* TLKIRPKE I RNQGGSQQR KKAELGKLQ 
QTYPVLNFKATFYGEQGDYY * TFFKP 



765 



JSDOCID: <WO 0164835A2_I_> 



WO 01/64835 



PCT/U SO 1/04927 



SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutan»ic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I— Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

T=Threonine, V=Valfne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 


3515 


17416 


A 


3538 


107 


349 


S VTFP FFFFFLKQTGSHF W * AGGQGGN 
FD*LEPKPPWLRESSLLTLPSSLDYGPP 
PTGPSNFGIFFEKRGFCLVPQASFE 








3539 


274 


3 


PRRWSLPLTQMGPLPSRMGPKTKLCPKK 
KKKKI P PRKI LE PDVFTGQFYLMFKE IT 
SIFLKLFQQCEEEGTFSNSF*EARIALK 
TK*DKN 


3517 


17418 


A 


3540 


1 


374 


ACALVRNDH I YYWGPTYALGELFFTET 
PI 1 1 *G*DN*LGPI«VIGAHWALTLINY 
I S L * L * PPS VLLLIACAI PEARAGTG CT 
A Y P PIiAGD Y S HRAA YVNL T I IjS LHIaAR V 
SSILWDTDSITT 


3518 


17419 


A 


3541 


427 


76 


RGSLSPGV*K*PGGQNQTFPKAPKLRPG 
APRCFPAGKGSIKPTPGSPPGGRRGTPC 
SKKKKNFRETPPSFFWGGPPFLKNRGGF 
MFWASKTKNGLNKTLGPNPGPPLGGLRS 
PFWVG 


3519 


17420 


A 


3542 


234 


88 


FFFFFFFFFFFFFFFFFFSLFFFFFFFF 
FFFFFFFFFFFFFW* * ILYFV 


3520 


17421 


A 


3543 




419 


TFSFFPFLFFFGFLVLGGEKQGKWDPKP 
PGAEKPPTFSPPKI*KKGAPPPPRVNFK 
KF * KKG PLPFS PGG KNLKS FWGIWSIGN 
PLW VTP P KNPQKKP PFFKGP KKNFF FKG 
PKPGGKKTQIFFFLWEKK 


3521 


17422 


A 


3544 


159 


411 


VSRDISPVRLSPSSCFH*PFLHISSSVE 
GRFSSFAQFLLSCLNFCTFPLLFLLLRF 
* DRVS LCHPGWS AVA* S QLTATSTFWVK 


3522 


17423 


A 


3545 


2 


351 


CLAHTSGVES*ASLPCGQNSYIHYLKLiL 
CME E LS F P P YLL I YS I S Y L Y * HT IMVI Y 
F I LVL * S I LLNL I YLFVTQ I F P ALAS GS 
SFNWLLCLFNTT1AVHVCVCVCVCVCVR 
VCAC 














KPPPPCSYGKKKKIFLLVFFRTLPTKPP 
GAGRKIFPRGPGFFLKNPKAL*TTFAPP 
PFIQKKGPFLLQQA*IPFPVFFLFKKKP 

AGVQSCDLGSLQPLPPGLKR 














FLVFLiKI FFFFFGGFFLPFFFFLKKFFF 
FLGFFFGKISQKIFFFFVN* I FLLXXGA 
PPLFFFFFXj 


3525 


17426 


A 


3548 


197 


1 


INKMKQQRMD * EKI FAKHVSHKGLISQ I 
CK* LIHLNSEKTNS PVRKWAEDLNRHFS 
KRDTKMVNM 


3526 


17427 


A 


3549 


175 


309 


KSTLKVMLISISIFNFYFF*DRVLLCHP 
GWSAVA* S LLTAASTRP 


3527 


17428 


A 


3550 


3 


196 


GFHRVSQDGLDLLTS * SACVS FPKCWDY 
SHEPPRPAHKTSLKKNKKRVRIIIMMTI 
TYPAPWC 














RAVE I FC YDVCVESGCADI QLLIjLCPTS 
IKVCFSSLSLDIVTVIPDIYICIYIYII 
FFFFFKQARS * CQCLNDKE * YCSLWPTP 
PGFKKLSCVSLPSIWDLRCAPSHPPNFC 
VFGKNGV 


3529 


17430 


A 


3552 


326 




KDNKR I L KLMLAKS G KVRH YVFDFGKQ I 
SKMSEKNIGVKPPIGKEKTKKPKKKPSR 
TEDPKICTYVL*PDKKKWRRFLNT*GIQ 
LG*KEIKL,SLFADDMIVYLENPIV 



766 



NSDOCID: <WO 0164835A2J_> 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threom"ne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=l)nkno\vn, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3530 


17431 


A 


3553 


186 


1 


RPRRLPIKLPMTFFTELEKTTFKFI*NQ 
KRVRIAKSIL.SQKNEVGGITPPDFKL*Y 
KDTVTN 


3531 


17432 


A 


3554 


3 


366 


KTSLGNMVKPYNFFV*TFYI * *KKEIKK 
NS IGVL* PGTEQS KHE FKKI LPSKRNKI 
ASKRIKYL * INLT * EGQDMYTENYKTLL 
KEIIEDLNKYKDIPCS*IIRLDLVMMAV 
LTKLIYRFN . 


3532 


17433 


A 


3555 


239 


1 


GRGKML.PVSLLNITRVTFAPCSGSIILV 
LIALPSLRILYITDEVNDPSLTIKSIGH 
QWY*TYEYTDYGGLIFNSYILPP 


3533 


17434 


A 


3556 


3 


123 


QAGLEV*TSSDLPTSVSQSAGITGGSHR 
ARPANIINSVP 


3534 


17435 


A 


3557 


237 


2 


KGFSPPTQKFGFLGVFPPQKKKTKKNPR 
PPKGLTPVS*KGML.TEPLEPFFFFFLRQ 
SRSVAQAGVQWGNLGSLQALPP 


3535 


17436 


A 


3558 


87 


379 


KKKKKKKFTLQKPVGFLNPPNGEAKSQI 
INELLXjK IATKKKKNIiKLRliKKKGKDLF 
RGNYNPLFKKIKDDPTNGKNFPCS*MGK 
ISFEKMTLLPKAI 


3536 


17437 


A 


3559 


390 


181 


KKKKKNPGGGGPPFFPTFRGG*GGGPPQ 
PGGGGAPKV* ISPPPPPPGGKKSPPQKK 
KKKKKKKGPFLPHL 


3537 


17438 


A 


3560 


2 


101 


VIVTAHAFVI IFFI VIPI I IGGFGN-LV 
PVDAA 


3538 


17439 


A 


3561 


2 


126 


I NTLIiALL L 1 1 1 TF * LPQLNGY IEKSTP 
YECGFDPISRRRG 


3539 


17440 


A 


3562 


16 


349 


FAFQKITIAMEKATEAQTGGAKKACQGT 
NNIMKFLKKKC I PTT I FW I S F FS FLFFF 
FETKPHFFNWPEGKGPNLG* LKPLPLGL 
KQFCLTLPRKGNYRHVL.PPPVNGFFFF 


3540 


17441 


A 


3563 


6 


130 


TLRRILEDHFSL.RS *GCGEPCLRHCMPA 
WL,TE*DPVSKNIY 


3541 


17442 


A 


3564 


3 


152 


G FHC VRQDLDLL TS * SAHLS LPKCWDYR 
RKPPHPAGKGTYFYIDYSTIE 


3542 


17443 


A 


3565 


241 


3 


NliVSHVISMRPCVINLVFQNVTSSNRSNH 
TWLSFI KLYFI HIS I VFGEQWFGYVA 
KFFSDDF*DFGVPITQTVYTAPN 


3543 


17444 


A 


3566 


71 


256 


SVFLLTVFLYPLATSFFFLNRVSLCCPG 
WL*TPGLKQFSCLSLPKRWNYRHEPLTP 
GSNFF 


3544 


17445 


A 


3567 


1 


392 


AG AGT S GTD YP PLS GN Y S HPGAC VDLT I 
FSLHIAGVASMLGAISVMTTVIWIEPPA 
ITQYQTP1iFV*SGRNTAGLiIjLLCVPVIjA 
AG ITILLTDRNLNTTFFDPAGGGDP I LY 
QHIiKGKRG 1 1 1 CS HRDR I 


3545 


17446 


A 


3568 


334 


409 


LGTVAHACNPSTLGG*GGHITRSGV 


3546 


17447 


A 


3569 


3 


467 


PKNPPGGFFSPL*EKKYILPPPP*IWPP 
PGFF*KAPPPFFFFFFFFFFFFFPTFQI 
YFITF 


3547 


17448 


A 


3570 


3 


420 


FFDPAGGGDPILYQHLF*FFGHPEEYML 
ILPGFG I LSHI ATYYSGKKEPFGYIGMV 
*AMISIGFLGIIV*AHHIFTG* IDVDTR 
AYFTSATI I XAIPTGVEVFS *LATLHGS 
NMK* SAAVLSALGFI FL.FTRRGLTS I V 


3548 


17449 


A 


3571 


1 


418 


NLINHSFIDLPTPSNIFA**NFGSLLGA 
CLILQITTGLFLAMHY* PDASTAFSS IG 
HrTRDVHYG* IXRYLHAKGASIFFICLF 
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JSDOCID: <WO 016483SA2_L> 



WO 01/64835 



PCT/LSO 1/04927 



SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LH IGRGL Y YGAFL Y * ET * NI G I I LLLAT 
IATAFIGYVLPGGQISF*GATVTTOLL 


3549 


17450 . 


A 


3572 


3 


414 


S SGLAM * FHFHS I TLL I LGLLTNTLTIY - " 
Q*WRDVTRESTYQGHHTPPVQKGLRYGI 
ILFITSEVFFFAGFF*AFYHSSLAPTPQ 
LGGHWPPTGI TPLNPLE VPLLNTSVLLA 
SGVS IT*AHHSLIENNRNQI IQALL 




17451 




3573 


2 


423 


GTLYLLFGA*AGVLGTALILLIRAELGQ 
PGNL VGNDHI YNGI VTAHAFVI I FFIVI 
PIIIGGFGN*LVPLIIGAPEMAFPRINN 
ISL*LLPPSLLLLLASAIVEAGAGTG*T 
VYPPLAGNYSHPGAYVDLTIFCLHLAGV 




17452 




3574 


411 


117 


DGGPLLQLLRRARQENCFNPRGGGCS * P 
RLCPCPPAWGAKLNSLSGKKKKKKRHVE 
KYAQHDYI INVLCWCVHIKWLSLYMYS 
I TKS I DKI CKKAS 


3552 


17453 


A 


3575 


2 


394 


ALANMWEQIRSLHQYAVHRI ISLFSLLS 
KKHDRVLEQATQYLRGSLNTNDVPLPDY 
AQDLTVIEELIPMMLEIINS *LTNSLHH 
I PNLVYALLY * RDLFEQFRTH PS FQD I M 
QNIDLVS VNED I YYDSFLS 


3553 


17454 


A 


3576 


262 


418 


G WRE WTEL I I FY I FFETTL I PTLAI I TR 
+GNQPERLNAGTYFLFYTLVGSLP 


3554 


17455 


A 


3577 


232 


443 


PSVQTFPCCPLSEEGPWLLCVAMTPSPG 
VPVPTERALYSMECAFHPLFSLTSGACR 
LDYRTPDNR*VQLP 


3555 


17456 


A 


3578 


48 


410 


GGLLLHRAGCWCGHADRRADQPDSSD PH 
S LHAACLP PPG AARAAGAP SPA* TPAAA 
CPAPVPAPCSEHRC*PPGLSQRPCLPGL 
PDARPPGAAAHGGWSLPTRLVLHDRSHP 
EEAPRRHED 


3556 


17457 


A 


3579 


422 


186 


VYSKFL*SQLLRRLRQENRLNPGSRGCR 
EPRS *HCI PAWVTQQDSLS ISSSTSHEI 
CGSHCSPGYLTVHICDQPVFSL 


3557 


17458 


A 


3580 


442 


3 


GSAISSRCTQESTYQGPHTPPVKKGLRY 
G I I LF I TS E VFFLAG FL* AFS HSTLAP T 
SQLGGHWPPTGITPLNPLEVPLLNTSVL 
LASGVS I T * AHHSL I ENNRNQ 1 1 QALL I 
TILLGLYFTLLQASE*LESPFTISDGMY 
CGRSRG 


3558 


17459 


A 


3581 


428 


1 


IRDIFMASSLGM*FHFTPITLLILGLLT 
TPLT I YQ* WRDVSRESTYQGHLTPPVQK 
GLRYGIILFITSEVFFFAGFF*AFYHSS 
LAPTPQLGGHWP PTG I TPLNPLE VPLLN 
TSALLASGVS IT* AHHSLIENKRCI AAA 
LE 




17460 




3582 


160 


415 


RCLSHCLAFPMLSILFIPSQLLSLPFHV 
KWIFFFFFLNKKFNCVPQAGGQGKNLI * 
LHPPPRGLKKFSCLRITSS*NYGREPQP 


3560 


17461 


A 


3583 


399 


2 


STHLGLPKCWDYRCEPPRPALLAIVKLL 
CKTIHVRINNVCCLPLLTGAASVIGLSA 
SLLFSVSPGTSLQLEAR*TFPWPHMSST 
CVWRFIFTSLGRAQPQHSDLQTCPRRDV 
DLAEPGLLFNGQMLVPPGRS 


3561 


17462 


A 


3584 


215 


1 


KS PGL LNAFSRGSQ I YQASRAMGA * LNQ 
RFWPGTVAHACNPSTLGG*GARIMRSGD 
GDRNS KDGVSPCGRV 
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SDOCID: <WO 0164835A2_I_> 



WO 01/64835 



PCT/US01/O4927 



SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

locntion 
correspon 

last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibie 
nucleotide insertion 


3562 


17463 


A 


3585 


138 


19 


LNQRFWPGTVAHACNPRPLGG * GAR IMR 
SGDVYRNSKDEV 


3563 


17464 


A 


3586 


262 


3 


GRPEVFYKPPPRGEFFFFFFFLFFFFFF 
FFFFFFVXjFFFFFFFFFFFFFFFFFF*Q 
VRLYLLEKYELFFFFLFYPQE I SLCCPG 
WS 


3564 


17465 


A 


3587 


381 


1 


KKFTPP?PRKMGPPPGFF*RAPPFFFFL 
GGGGGPLFFLGGPPPPGGGFFFSPPPPV 
FF*KSPPLFSFF*KKRGPPFFKRGGSPK 
TPPCFFFKKKNPPKKKKKKKKKKKKKKK 
GRPSSKSDAWVRTRG 


3565 


17466 


A 


3588 


198 


1 


PKS PPRP1ARGKE PNFFS KKKKKKS I * N 
LNTNSWLDMVAHI CNPSTSGGRGG* PHL 
RSGVRDQPGQ 


3566 


17467 


A 


3589 


373 


1 


GIFFPPPKRGGFPPPPPKKFFFFPGGFI 
FGGGGG?FWPPPRKGFFLKKKKGVF*SP 
P*KGKKFFFFPGGGVGPPRGFIjKGGPPI 
IFFFFFFFFFFFFFFFFFPATIFLITYK 
DLTCHPGTHFTH 


3567 


17468 


A 


3590 


506 


98 


HTSRGPPPPPKKKIPPTRPFFFCCFWW 
FFPPKKKFFFFCSPRSFIHSPPKKKKKN 
FFFPKGGGPPLSPFLTPPPPLFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFF*LFVSLFSI 
FVLL I FFLFYNAITTI 














LVRGQITNKKNLFNFLGEELREWRWFF 
LPQQRLF FFKL * LQ * KE I YL F I * K3V1TCS 
RGGAGFKSSLYFFFRSPPQWLFFFFFF 
IFFYFF*FFIFFFFFFFFFFFFCNDFMI 
YLYISPPLPILEKKKGGRSRSRS 








3592 


380 


68 


FRRGVGPQWPPPQKKGFSQKPPGGFKRP 
PLKGKKITFPPPGKFGPPKGPFKRPPPF 
FFLEGKTLFLGPDP* *NLGWGG*RGHGK 
*GPANPRVPPVFFFFEEYSY 


3570 


17471 


A 


3593 


394 


45 


ALMFSTGQEGRQEKHPPPHQKKKASRGE 
EPRGGVRKNYPPPKKKSFPKKNPPGVTP 
PKKKKKAKLPPRPL*GGPQRRPQKTAPP 
PLNFYPPKEPTKVLS IFFS FFFFFSAND 
VELY 


3571 


17472 


A 


3594 


402 


62 


AMIVLLHSSLGNKARPCLFKKKWAKDLN 
KHFSKEDSQMANKYMKRCLISLVIRNMQ 
MKITVRCHFILTRMAKIKKMVNSKCRKE 
VQKI PTLKHCWWECKI V* ILWTNLAARV 
G 


3572 


17473 


A 


3595 


220 


425 


FI SGFLNLQEERE I AFFFKVNI RLGAW 
HTCNPS TLGGRGRR I T * S PGV* DQPGQH 
GEPCLYLNLQKN 






A 


3596 




412 


AE IAPL.RS I HRARNCLKKKGRKGRKNPG 
GDGGNPQIGPAFFGGGNGKTPGQKPHQP 
QTWGQGRHLKGGGGGP KRKGVP P KKKT P 
TIWGGTPLiAFGVEPNWWGHTGNPGGADP 
KPPGRE K * S GGAQ ITGPHTGP 


3574 


17475 


A 


3597 


2 


293 


CQLS P * ELGTPRERERERERERERERER 
ERERLPPRDLREKRAQDVFATHPNYEFV 
CVTRHYRGCWYTPLTTAHS IDAHKMCV 
CVKKRPLYKKKKK 


3575 


17476 


A 


3598 


243 


9 


II J LRQEKQCWSKGTRFQLQDEQVLNT*D 
RT WE VM W PGAVTHACNP S TLGGRGG RIM 
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NSDOCID: <WO 0164B35A2_I_= 



WO 01/64835 



! 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystei«e, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
\V~Tryptophan, Y= Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














RSRDRDSCRIHQGGSKPKSPR 


3576 


17477 


A 


3599 


1 


329 


PSARRDWKNIPKLPPPRGKNPTGFFFLK 
PGGPQRGPGKKGGKNF*SFGKPF*KGGG 
PPQFCPPPPRGGFFFFFPGGPPGEKKGG 
VKGPPPPFFFFWPPPPPNLFFPPGA 


3577 


17478 


A 


3600 


206 


1 


LRALG I MERAAHTE VRLVF IQ I LGS P L* 
YIF*RQDLSPSPRLECSGT IAARCSLDL 
LSASNSPASASQ 


3578 


17479 


A 


3601 


322 


99 


KRTGAFGFFF PPLiFNKKKKVFLiGPEGPK 
IVKKKKKKEKKEKQNKNRKQTKKK*KTN 
NKQNKKRETDIFGVEDET 


3579 


17480 


A 


3602 


410 


25 


LFFFKKKTGGGGGGPPPFFHPFGGVNGG 
V I R VGE LSPPCPPGGTRF FFKKTKN Y PG 
G*GGPFFPPPSGGGLGPKKTLSPRGGAS 
L*PNFGPSLPPKGKGGPPFSKKKKKKKK 
QSRPRDS+SLRRGIRK 


3580 


17481 


A 








RSS I QCGGILLFYFLYFMVFND I FWL * 
GFSAITSILISMLFSCI* YYFFKMYCLL 
FNFRCHWIFYILF*MKTIAQIGN*IFFR 
DAYLGYFFLiFIjFFFFFFGGFQjYL 


3581 


17482 


A 


3604 


88 


435 


RAISTCLQNEQYKWI/TTITSVEKSLNDL 
TEI»KTMV*ELHDKCTSFSSRFIQLEERI 
SVTEDQMNEMNNPPLIHGFIFYGSSYSW 
STMVRKQDKLRVKDAGIKHRNQNKGFTA 
EKSA 


3582 


17483 


A 


3605 


331 


428 


YGHYGQIEATFLPAITTEKETLKTRKIjT 
* SS INLLK*T *NVRPGTVGHACNLSTLG 
GRGGQIMRSGIRDQPNST 


3583 


17484 


A 


3606 


3 


294 


GG*GCSEL*SCHCTPSSLG*NTVRKKER 
KKGKREREKERKRE I I PGFGPCAHHKGA 
VTLGNLFPRVQVSQLVQRRKI IITVI PD 
RLGEIVLVSQSL 


3584 


17485 


A 


3607 


263 


426 


RKLAGTFPFLWKSGPFQTILAAAQRMGG 
NDPKETRVFHAGLSSSLGENK*VWKI 


3585 


17486 


A 


3608 


213 


439 


liSWACAAEWTVTQPANFPLDYILPFFLL 
IRES S KF * RHMATQLETAAS RPG WVHA 
CNP I TLGGRGGR I TRSGVH 


3586 


17487 


A 


3609 


2 




MTDLESFPLESELSANSPVQCSGLCKPV 
LTIFQVI SHKGHCHS KYYLLNAN YNVFL 
HLSNSVXiPRLLSGQRTETQKHFYTAVLS 
VCLSIYLSRDRVLLCYPGWSAVM*FYSL 
* PQ I HGLKQS S CLS L PKC * DYQHG ISPP 
WCN 


3587 


17488 


A 


3610 


102 


354 


EPGAWADRCWLGRVLLTHLLWPCCLYFF 
YFIIFILNSGDRVSLCCPRWSCLK*SSC 
LSLPECWDYRCQPLHPACAVFIENSKR 


3588 


17489 


A 


3611 


462 


371 


GLNFLTL*SARLGLPKCWDYRHEPPRPA 


3589 


17490 


A 


3612 


463 


319 


RNSHDGLELLTL*SAHLGLPKCWDYRCE 
PPRPVLC 


3590 


17491 


A 


3613 


49 


476 


PRNSITLLILGLLTNTLTLYQGWFDVTR 
ESTYQGHHTPPVQKGLRYGIILFITSEV 
FFFAGLF * SFYHSRLAPTPQLGGHWPPT 
GITPLNPLEVPLLNTSVLLASGVSIT*A 
HHS LI ENNRNQLMQ ALLIT I LLGLYFTL 
LQ 


3591 


17492 


A 


3614 


223 


467 


TPS * PLAI I * FI STLAETNRTPFDLAER 
ESELVSGFNIEYAAGPFALFFIAEYTNI 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MIINTLTTTIFLGTTYDALSPELYTTYF 
VTKTLLLTCLFL* IRTAYPRFRYDQLIH 
LiL * ENFLPLTLALL I * YVS IPITISSIP 
P 


3592 


17493 


A 


3615 


2 


405 


ARGLQEAGAVQNDPVSRLFDHAMLQGHR 
AHQMAIVTYQEFGETYIPKDQKY*FLHE 
SQTYFCFLNS I PTPS SMEETQHRSSLEIj 

LiRISLLLIES wle p vrflrs mnannl vy 

DTSDRDDYHLLKDLEEGIQTLM 


3593 


17494 


A 


3616 


328 


63 


PSKF*KPCLISPPPL,GPGPPN*TPGFLK 
KPKF P PRGGAHP * S PLLFRVRPGDPLN P 
RGQGFP*PKLPPGPPTRATKPNFVSQKK 
KKKE 


3594 


17495 


A 


3617 


1 


379 


GTS VG I PTGGQE FS * FAALHGSNVK* SA 
AAL* ALR I M I LFT VRGLTG I GIANS S LD 
I VLHDTDYVEAHFHYVLSI GAVFAI IGG 
FIH* FPLFLGYTLDQTYAKSHFTIIFIG 
VSVTLFPQHFLGLC 


3595 


17496 


A 


3618 


494 


17 


GSANPRNPRRSGSKTRETSPRNVMSLMS 
PMS PTLPMRPTS PGTPMHLTNSMGPMQP 
PNSSRPWPASSATPVSPASPARPRKPS 
CFPAPQ* * SPPL* S PEPRHGLSCLSWVK 
KECETWHWSMHIVEDRHSKPYHGPGMV 
AHACNPSTLGG 


3596 


17497 


A 


3619 


350 


480 


LGS WDYVCKP PHLAVtiliL *RQS F VLL PR 
LVTNS + AQ VIHLP W P 


3597 


17498 


A 


3620 


2 


400 


AAAEPHGCYAKVLDEFKPL,GEEP*NLIK 
QNCELFEQLGEYKFQNALLGRYTKKVPQ 
VSTPTLVEVS*NLGKVGSKCCKHPEAKR 
MP*AEDYLSWLNQLCVMHEKTPVSDRV 
TKLLTESXiVNRRPCFSALEVK 


3598 


17499 


A 


3621 


3 


476 


LGYSVHPPQPLAQVLSPPLPGASRAGWL 
LiRMRGPPSPHPPRTPAGLQAPHAALiVPA 
RASPSTPPCKLREWAPALASPERGSHSA 
VGG*RAPQMPPKWEPRQGRCQEQARALR 
TASMLS PLSFTKP VTLS LLN F PTSLPPR 
NSPSWPWSVR 


3599 


17500 


A 


3622 


40 


390 


MKGALTQPGRAGGLGSQSQMDMASTPKP 
SPDDQGCSIHLWGFPFLVSLLPPSLSPS 
PPSPPRHSVGTHCAP * VLPRPLGALRVP 
HSSVPRPHAVPWGTTYRYSRGAWPLSL 
DFRGL 


3600 


17501 


A 


3623 


52 


475 


AVEFHLLFLQPLETTNLLSVCMDLPILA 
NSPK* NQI ICDLFVFFHLA* CF * DLFML 
EHVSVLHFMAE * FFFFFF* KGVSLFSPG 
WRAGAQFWLTKTPPHSPARLKQFSCLTL 
PGSWNYRRAPPSPANFFVFSGKRGFTIL 
P 


3601 


17502 


A ■ 


3624 


333 


466 


SIIGKIRLWKMRPWPGAVAQACDPGTLR 
G+GKR I TRSGVRDRPG 


3602 


17503 


A 


3625 


2 


138 


LRRGNRIiNPGGGGCGEP * SRHCTPASST 
E * DS VS KNQKTKNS QWH 


3603 


17504 


A 


3626 


3 


337 


LSLRPPAIjPPCPPRRLPTPGQGHPGSAG 
FPGRPTL*RLPPLPDPPPSLGLSPGDDP 
GLAREELiKS PCP WGPHPLCHLPFP * CWA 
S SQPS INAYEWRKKKKKKKKKKKKKKK 


3604 


17505 


A 


3627 


2 


393 


LGTDLSJjI, I RFELGQPGNLLGNDHI YNV 
I VTAHAFV I I FF I V I pi I IGG FGN * LFP 
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nucleotide 
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USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Atanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methioninc, 
N=Asparagine, P=Proline, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibie 
nucleotide deletion, \-possible 
nucleotide insertion 














LIIGAPDMALPRINNISL*LLPPSLLLL 
IAS AI VE AGAGTG * TGYPPLAGNYSHPG 
ALVDLT I FYLHLAR VCS I 


3605 


17506 


A 


3628 


2 


379 


QDATYPIIEELITFHEHALIIICLICFL 
ALDAL FLTLTTKLTNTN I S DAQE I ET V * 
TMLPAIILILIALPCLRILYITDEGNDP 
SLTIK*IGHQWY*TYEYTDYGGLIFNSY 
I LP PLFLKPGDLRL 


3606 


17507 


A 


3629 


1 


400 


PTHPPTRPPTRPMTNTLNS IHPPDPR * K 
KKKKKKKKKKKKKKKKKKKKKTAAGGGA 
* KKKNRGAKTHRGS KKLNF FFKKRNKKN 
PPVKIEKKTFFLGGKKKNKPPQKNNALK 
KKKNYFEGEGESLLLL 


3607 


17508 


A 


3630 • 


416 


1 


SQLGYSGVRDPLEEATCLFSDLKLHAGR 
TTPLFQAVRQGHL I LQRFLQP FA*LCPA 
PRGGVYRGRQASLS CGGLHPVRASWPLC 
LPTQFSAMEGAPPLALLPRSSSiSDYCA 
SNEQGSVGIGPSEPCTGYNLLVCHLL 














RWGSCYWLAGLQLLGSSDSPALASQSA 
GITGVSHCTQPKNVI *KI SHLTVAGPAA 
DKTPQTPR 


3609 


17510 


A 


3632 


186 


3 


FFFFFFLFFFFFFFFFFFFFFFFLFFFF 
F FFPF F P FFFLVCQ VPLLCNL + HLVYL F 
YMVL 












235 


GIISFLLIR* * Y ARADANTAAI RAI LYN 
RIGDIGFILALA*FILHSNS*DPQQIAI 
LPHPSDSKKYLYLSHIQLKKA 


3611 


17512 


A 


3634 


1 


428 


RTLKTTLFNPDGG*DPILYQHLF*FFGH 
PEVYIIIIjPRFR*ISHIVTYYSGKNEPF 

gyismv*amislgclgvia*ahhiftv* 

I DVDTRAYLTFAT imiai PTAVRALS * L 
APLHRSNIK*SAAKL*ALRLIFLFTVGG 
QT 


3612 


17513 


A 


3635 


2 


129 


PVIYSTIFAGTLITALSSH* FFT*VGLE 
I NMLiAF I PVLG FWD 


3613 


17514 


A 


3636 


2 


422 


DR * LFS TNHKD I GT L YLL YG A * AWLG T 
ALNLLIRAELGQPGNLLGNDHIYNVIVT 
AHAFG I ILFIVIP 1 I IGGFGN+ LDPLI I 
GAPDMAF PR INN I S C* LLP PS LLLLLAS 
AI ME AG ARTG * TA Y P P LAGNYS H PG AS V 


3614 


17515 


A 


3637 


14 


479 


QNCKINNFPMYFLPHQSEFPRKSCKQQN 
TICFRLSNQNTMKKTTENHALLWIRKWQ 
T*KQQCKSNSGYTPSTPSRAKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKPGGGV 
LKKL 








3638 


2 


449 


I I I ICFVGLCALVLTLTTKLTNANILDA 
QEVETG*AILPAIMLVIIALPCLRILYI 
TDEVNDPSLTIK* IGHQWY*AYEYTDYG 
GLMFNAYILPPLFLKPGDLRLLDGDNRV 
VLPNEAPIRIVMTSQDVMHS *AVRTLSL 
RTDAIPGRL 


3616 


17517 


A 


3639 


347 


98 


HFFLGGGKGGGFSPFPPQKFFFFLRGFF 
FGGGGGPNFPPPKKGFFSKKPQGVF*TP 
PKKKKKFFFFPRGFWGPPGIFFKGPPP 


3617 


17518 


A 


3640 


2 


426 


DR*LFSTNHKDIVTLYLLFGA*AGGLCT ' 
ALSLLIRAELGQPGNLLGNDHI YNVI VT 
AHAFVI I FFIGI P I I IGGCGN* LVPLI I 
GAPDMAFPRINNI S F* LLPPSLLLLLAF . 
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M 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib)e 
nucleotide insertion 














AI VE AGAGTG * TVYPPLAGNYS HPGACA 
D 


3618 


17519 


A 


3641 


94 


294 


GRGGGGVAGGRGAMWSKGRQRGRNGPVR 
GREGGRKSHFRKEG+KEGRKEPPPPPKK 
KSRGEKNQREG 


3619 


17520 


A 


3642 




405 


RTRGFDLERFVLFDVAQQMVESGYVCEG 
DHKTMANAI IDRVSLIKRKREQRQLVR.E 
EQEKKKQEESSLKQQVEQSSASQTGIKQ 
LPFADTGIPTASTTSASVSTQVEPEEPE 
AD+HEQLQYQQPSLSVLSDGTVD 














HLLGRKQNKAI FLEAEKKKKKKKKKKKG 
GPLKKNPGGAKNNRGGEKKI FS P*GGVK 
KTPLGIFEKKPYFGGGKKWANPPKKIKT 
LiKEKKKF*RGKGGKKPENPWGEKDESSS 
PK 














EFAQLSPPQIKGFPSPPPNFLPPSLFGF 
FSPFSP*KFFFSLKGFIFVGGFFPFFSP 
PKKSFFSKNPQLVFISPPFKKKIFLFPP 
PLNFGPPRVFFKRPPPFFFFFFF 














AFFFTKKKKEFFPPFPFKNFFFFSRVFF 
FLGGFFHFFPPQKKIFFLKIPRGFFLTP 
PKKKKIFFFFPR* FLAPPGFFL*APPPF 
FFFFFFFFFFFFFFFFFFSPCSWSCHQV 
FPHASNRIHNPSNSYPLQQY 


3623 


17524 


A 


3646 


49 


326 


KKKKKKKKKKRGGPFKKTPGGAQKNPGG 
EKKFFFL*GGHKKPPRGIFKKKPFFGGG 
KNWTTPPQKNKAFGGKKKFLGGKGEKPP 
PKPRVKKN 


3624 


17525 


A 


3647 


327 


94 


QKKKKKFFPPPPPKNFFFSPRGFFFGGG 
GGPFSP PQKKGFFS KNPGG VFL P PPKKK 
KIFFFPPGGFGAPPGFF*RGPP 














TSSVASTFIMSLFPTTIFMCLDQEGIIW 
N*H*ATTQTTQLYL>S FKLDYFSIIFIPV 
ALFVTWPIIEVSL.*YINSDPNINQFFKY 
LiLIFLI TI LI LVTGC 


3626 


17527 


A 


3649 





406 


STNHKDIGTLYLLFGA*AGILGTALNLU 
I RAE LGQPGNL VGNDH I YNVI VTAHAFG 
IIFFIVIPIIIGGFGN*LVPLIIGAPDM 
EFARINNISF*LLPPSLLLLLGCAIVEA 
GAGTG * TVYP PLAGNYSHPGACV 


3627 


17528 


A 


3650 


3 


418 


HAYADA W VHFHS ITLLI LGLLTNTLT I Y 
Q*WRDVSRESTYQGHHTPPVQKGLRYGI 
ILFITSEVFFFAGFF*AFYHSSLAPTPQ 
LGGHWPPTGITPLNTLEVPLLNTSVLLA 
SGVSIT+AHHSLIENNRNQIIQALLI 














WGGGEKGRAGGAGEETPHQESQRQGCAL 
PQEPPLYPPCPAIHPPPLPLGGLLLFPS 
LPPLP*AAWFPHPGQPGRNLNGESPTTR 
EKRLWGPPLPTPLPWPPTQPTPLSRRTG 
CPPCPREVPTPSPPWAEPPPLPGPPSEI 
AHLTTVRRGAALANVNHENISDTDENKL 
NAFVTA 


3629 


17530 


A 


3652 


464 


297 


RNSFHRVSQDDLDLLTL*SPCLGLPKCW 
DYRCEPLLPASYIDVS 


3630 


17531 


A 


3653 


162 


467 


NSKKQQFVLLSE I TSYVYWLHKRDLDSH 
T I I MGDFNTPLS TLDRS ARQ KVNKDIQE 
L.N* ALHQADLID I YRTLHPKSTE YTFFS 
APHCTYNFLiFQS YFSSQF 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 

od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylaianine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possib)e 
nucleotide deletion,. \=possible 
nucleotide insertion 


3631 












L/MMLS I FS C VHLAFVCFLWRREMS G VS L 
CCPDYSGAAAVHRCNHTALQ PQTPGL K * 
SSCLCKRHbQLRPQVYTTMLGYLFLFIiF 
LiRQSCSVAQAGVQWHDHSSLQP*TPGLK 
RSNSPTLV*DLD 


3632 


17533 










VGQDGLLDLL/TS * S ACLGLPKCWDYRRE 
PPRLAY 


3633 


17534 










HCTQQSGCAFCYSTMI PTKPYTC 














EFDTAHAFVIIFFIVIPIIIGGFGN*LV 
PLIIGAPDMAFPRINNISF*LLPPSLLL 
LLASAIVEKKKNEKKKKTGAPLV 


3635 


17536 


A 


3658 


351 


470 


FTTLPVTSASLPLKTIAWPGMVAHACNP 
STLGG*GGRIT 


3636 


17537 


A 


3659 


40 


182 


GIRKENHLNLGGRGRIEPGSHHCTPAWA 
TK*DSVKIKKTRIQKTPLI 










160 


461 


INGKDTDTPGMVAHAYNTSTLGGGGGG I 
A*GQEFQPSLRN 






"7[ — 








YFSLS1>CLIjELMNSVILFFFLVLERGFY 
PPPLVEMQGPNFG*LNPPPPGL>KQFFCL 
TLPKSWNNRPPPPRGGNFLDF*KKRGFN 
YVGRVDLKLRTSG 


3639 


17540 


A 


3662 . 


255 


451 


L.LSFNPFFFFFFFGKGGPPSLPPGGKGG 
GLNFI *TLGPQGKRNSPPQPPGEPG*RG 
PTPHPGLFF ' 


3640 


17541 


A 


3663 


185 


66 


PLEQPLRK*LYLFRDGVLLCHPDWSAMM 
QSRLMATSAEF 


3641 


17542 


A 


3664 


42 


193 


FQLLTRLRQENHLNAGGTS CSEPRSHHC 
TPA+VTETQSQKQKVKVLGFSL 


3642 












KSKGKCSFLHSRVNRGNSKELLFLRDGV 
WRYCPAWS * TPGLKQS PHFN IPKCWDYR 
REEFHHPGLWYEFE 






A 






267 


GRL WS AMTPG KLKTL C KI DWP ALE VG WP 
LEGSLDRSLVSKVWHKVTYKPRNPDQFP 
YRDT*LELVLDPPPPTHSG 


3644 


17545 


A 


3667 


212 


2 


SQLLGRAEARGWLEPRSLRKINKTSQVW 
WYTPVLPSSCDYRHTPPRPANFCIFSRD 
GVSPC*PGWSRTPD 


3645 


17546 


A 


3668 


104 


334 


TVLFPPCFLLHIDFCMVLDFFSNFFVLN 
YF*NYRRVQKMYEEFPFFLFFFLI FSFS 
FLFFLSTFNFFLCPLCDLWYH 


3646 


17547 


A 


3669 


369 


260 


DGLDLLTL * S THLS L P KCWDYRHE P P CP 
ANMPS FKN 


3647 


17548 


A 


3670 


111 


342 


NGVSFFPPKKGGIWANGNFRFPPKPGG* 
RNFLA* PPQKFGMAGPPPPPQPVGIAGP ' 
PPHAREILEFFFFFFKKKGV 


3648 


17549 


A 


3671 


377 


3 


FAIIGGFIH*FPVFPGYALAQTYAKIHF 
TIIFIGVNLAFFPQHFLGLSGMRRRYSD 
YPDAFTT*NILSSVGSFISLTAVILIIF 
M I * EAFAS ICR FCVL I VEAPS INLE * L YGC 
PPPSHTLEDPVY 


3649 


17550 


A 


3672 


39 


348 


AKIAPLNSSLDERMRVSKKKKKGHGLTP " 
KPFLKKKLGKNFPLSQLEGVGKKNQKPF 
G F KKGLELAP FNG KGNGENPNTWR WG * T 
MS FFFFF * DSHPF I Q AGVQWRDLGS LQP 
RPPRFKRF 


3650 


17551 


A 


3673 


327 


3 


SKDPVFFKVFKKNPPTPNIGVFPFALPL 
KRGQFQPLFNPKGFFFFFFKPVQLGRGK " 
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NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny)alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=TI»reonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














ISPQFFFKKRFWG*TKSFPFFF*-DSHPF 
IQAGVQWRDLGSLQPRPPRFKRF 


3651 


17552 


A 


3674 


379 


267 


EPRLS P CTPAWATE* DS I S KQNKTKKP K 
LALNKKKMF 


3652 


17553 


A 


3675 


2 


270 


FCRDRVSLYCPVRS*TPGFKGSSCLDIiP 
SSWDYRHAPPGLAKIGGLHVFCLPCCGS 
LLVCWSSHCSIAHQELLLENKHTDVWMG 
GWMDR 


3653 


17554 


A 


3676 


3 


54 


P VYKNKNKRPDAVAHTCNPS TLGGRGGQ 
ITRSGDRDHPG * HL 


3654 


17555 


A 


3677 


291 


30 


YSLVLLIFFFYCFSISYFISLMNFCHNL 
YYFLLLDFGLVCSYFPSILWHNEIYFIG 
SMFFIFSVIV* IFFCWEMSCNIYKLHQE 
MF 


3655 


17556 


A 


3678 


1 


298 


RDFS AS ASQVAGTTGMHRHAQLI F V FRV 
CVCVCVCVCERERERERDGVLLCRPDCC 
SVAHT*VQGRDLGSLQPPPPGFKRFFCL 
KLP S S WGHRCWDAP 


3656 


17557 


A 


3679 


20 


376 


VPPPHL1ANFLNFII1L1FVDTGSYYVAWAG 
FKLLAP CNLSASNS * SVGITGVSHHALL 
LIiALKCNQLFQF VI F I LLHAVC IYDFIM 
VRFS LQC FPQF YD FP FCF I LFVS S KAQF 
CFLRLPY 


3657 


17558 


A 


3680 


110 


334 


NTTDYFKLP* ILSGTR*DMYKQPL*HI»N 
TVREYTSKYGWRPGWAHACNPS TLGGQ 
GERITRSRDQDQPCQHGET 


3658 


17559 


A 


36S1 


326 


511 


LTMMLQMKI SVI CCLLRNI FWPGALAHA 
CNPSTLGS*GGRIARSRDLENSPSLLGS 
QPSS 


3659 


17560 


A . 


3682 


448 


248 


AKDLNTFFTKE * TPLANKHMKKGSARFT 
YRNVQ I QT T * RCLGT PTRVTGV * MRNS A 
RHR*GCGGLV 


3660 


17561 


A 


3683 


486 


176 


PAPINPPQLRSLVQRSGIRFC*AGQVGF 
KLLVSSDPPTLASQSAGITGVSHQTWP* 
NYS*NSGMPTHPRQASSWRLREQWYHVS 
LLPCAYYKNGRQNSLLI 


3661 


17562 


A 


3684 


238 


27 


NKHILCYVGQEMGMGWGSHSHGWKPHFK 
SFQVCPSAVAHACNPSTIjGG*GGQITIjV 
QPGKHGKRRIPALF 


3662 


17563 


A 


3685 


] 


299 


LITQTCHVGCLiKIKCI *TSKKKKKKKKK 
KKKKKARGPLKKKGPLKTPEKPPGGPFK 
APPFWGGPPPRFFFGGGGAPPPVFLKKK 
KKNSPLWGGGGFFFR 


3663 


17564 


A 


3686 


494 


91 


PLGKRFPGGDIALTKLSRPWAWAPPGI P 
PPLGGPGGGVP*VTPFKPTPFRPGEPPL 
PLKNQTFFGVPGPPPYFPPLGGLGPEIP 
FTPGGPGS I KLAPP PS P PPGAPKRNFF P 
KKKKEVMKMLCFHFNLLVQNI F 


3664 


17565 


A 


3687 


17 


249 


YSRFGRHFGHDPGLYFQILRRLRWENHL 
NPGGRGCS Kb* SHHCTPAWATE * DPVS * 
KKNVTTIAKRWKQLTCPLADG 


3665 


17566 . 


A 


3688 


296 


461 


FWLWVSSVLKHMNWPGVVTHACNPNTLG 
GQGE*SA*ARV+DQPG*HGETCSAKEK 


3666 


17567 


A 


3689 


149 


422 


ASGRQGPDRGCWGGQAPRHPARGAKPAA 
DMGMSGQGYTKQAAP *GDPSQRAGVRLP 
REPTSRKASGGSGGHSRASGGAVPGQRV 
AQPSPAR 


3667 


17568 


A 


3690 | 1 


415 


GDPAGGGDP I LYQHLF* FFGHPEVYI LI 
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Predict- 
ed end 
nude- 
location 

acid 

.residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Gliitamic Acid, F=Phenyla!an ine, 
G— Glycine, H— Histidine l=Isoleucine 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G!utnmine, R=Arginine, S=Serine, 

W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














AMISIGFLRFIE*AHHIFTVGIDVDTRA 
YFTSATIIIAIPTGVKVFS*LATLHGSN 
MK* SAAVL*AL* FI FLFTVSGLTGIV 


3668 


17569 


A 


3691 


1 


393 


GTRKYS FLHDS QTS FCLSNS I PTPSNME 

KSMFANNLVYDTSDSDDYHIiLKDL * EGI 
QTLMGRLEDGTRRTGHILKQTYSKFDTN 
SHNHDALVKNYGIiLYCFRK 


3669 


17570 


A 


3692 


402 


2 


FLTQRWGAPLPIPPFWEAPFGPSP+APG 
FKPPLPPRGNPPFS * KTQ I YPALGARP V 
IPPSLGGGGGESFL)PPGGGFP*PRLFPC 
PPPRGAKPNFFFQKKKKKCLFCKAPQAQ 
HRSACLHECLPLGNLLKLCPR 


3670 


17571 


A 








AGTYFLFYTLGGSLTLLIALIYTHNTLG 
S LNI LLL.TLTGQE LSNS * ANNLI * LAYT 
IAFIVKIPLYGLHL*LPKAHGEAPIAGS 

IAYP 


3671 


17572 


A 


3694 


452 


208 


INKSARHGSMNL * SQLLRRLRE VCLS PG 
VQGCIEPWSHHCTPAWETE*DKTLSQKQ 
TNKKTCQHLYDHRPSALEATFRLSH 


3672 


17573 


A 


3695 


464 


146 


LKQTRATKGNLFLP * KPKLTPPGGFRPP 
FQPFWKPRPLGPPKSGVLDPPGPQGKTP 
FVLKKPNLiLGGVI PTLFFPLSGGLGRKI 
PSPRGGGGAKKRKPPRQKKKKK 


3673 


17574 


A 


3696 


2 


465 


VTRLRERPALLVSSTSWTEDEDFSIKLA 
ALE KFEQL TLDGHNL PSLVCVITGKGPIi 
REYYSRL.IHQKHFQH I QVCTPWLEAEDY 
PLLLGSAEPGVCLHTSSSGLDLPMKVVD 
MFGCRLPVCAVNFKCLHELVKHDENGLV 
FEDS*ELAAHLEKL 


3674 


17575 


A 


3697 






APITQG*SFLSI 


3675 


17576 


A 


3698 


380 


26 


KQRTNFFLFFFKGTGAFWFFFPPFFNKK 
KKVFLGPGGPKIVKKKKKKEKKENQNKN 
RKQTKKK* KTNNKQNKKRETDI FGVEDE 
TWRQRWGLGGGRVANSSAEMGKGNSTTR 
PSWNS 


3676 


17577 


A 


3699 


11 


414 


RiiWTAILIQTP*SFTGAVILIIAHGIiT 
SSLLFCLANSQQKKKKKK 


3677 












S S SLTLSNNNKSFLNQ FVTFNEMW I LYD 
NW* * PAQWLDQEAPKHFPKPNLDQKKVM 
FAVWWSAAGtilHYCFLiTPGETI TSEKYA 
QQFKEMNRKLQPLQSAMVNRKGPILPJDN 
A* SHITQLKX.QKLN-* LG YEVL 


3678 


17579 


A 


3701 


395 


140 


TPPKKKKKIKPPPPKIFPPPKKYKKPPP 
PFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFSIYIDFNLCICDNL*LQKTLVFL 
E 


3679 


17580 


A 


3702 


407 


371 


SFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFLWYKDI 1 1 KGKNEQNV* CS 


3680 


17581 


A 


3703 


36 


409 


LLKCEGIWKKEGKKKKKRREKKGGPFFR 
VLLGGPTFRGGAHKKSLLFRGVFFNLGG 
FFFEGPFFLGGALLGP P PPP I FTPLGKK 
KIFKGKRGPPPLIFLFF*IRGPPPPPSP 
PSTKPPPPPPPTP- 
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ed end 
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correspon 
ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Piienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 


3681 


17582 


A 


3704' 


56 ' 


415 


QAPWGKKKKKGKEKKKKKKKKKKKKKRG 
GPLKKNPGGGPTMGGGEKKYFS PWGGEK 
KNPGGNLGKKTPLGGGGNWAPPPPKKRG 
AGGKKKI*RGGGGKTPLIGGGKKNGGPP 
RTSPR 


3682 


17583 


A 


3705 


108 


1 


NLAK* IQ**IQTIMHHDQVSFIPGIQGW 
LNILKSIN 


3683 


17584 


A 


3706 


425 


52 


PFPH* * *L*KVDPIPLPPQNFLHPSNNI 
YFSPLSPKKFFFSLQPFFFFGE IFP I FP 
PPKKNFFS*NPNLVFISPPLKKKIFFSP 
PPPNFAPPKNLFKSPPPFFFFFFFFFFF 
FSGKNTLYWVDK 


3684 


17585 


A 


3707 


117 


421 


VLLVIQAEYLLPEVFGTRSVSDFGFFRF 
WNICIILSG*ASLI*KSEI*NALKNISS 
FKKKFIFILEPRSHSVTQAGVQWQDHGS 
LQPQTSGLKLFSHLSLP 


36S5 


17586 


A 


3708 


308 


3 


RSPPWFFFGFFFFVLFFFFFFVFQLFFS 
KYNITLY I KKKTETFTETNDNGNATCQN 
LVTOTAKAIPREKFIAISTYI * KVGKHQI 
NNLMKHLKEIEKQKQTK 


3686 


17587 


A 


3709 


94 


420 


KKKKKKKKKGGGAFKKKPGGAKI *RGKE 
KKNFFLKGGGKKKKVGNFGKKPYFGGGK 
KREKTPKKN*SLKGKKKFLKGKGGKTPP 
KPWA*KKWSS 


3687 


17588 


A 


3710 


413 


184 


LGECLSLRSKKLWSGPAGGGVCL*FQQV 
RRRKWEDHLSPG I RG* RALS SHRSI PAW 
TAEQDPVSRGLVWGIjPGGSW 


3688 


175S9 


A 


3711 


3 


104 


GGCS E LRS CHCTPAWRQS * TS SQRKKKK 
KKKMY 


3689 


17590 


A 


3712 


2 


393 


GLFS TNHKDI GTL YLLFGA * AGVLGTAL. 
SLLIRAELGQPGSLLGNDHIYNVIVTAH 
AFVI I FFIGIP 1 1 IGGCGN*LVPIiI IGA 
PDMAFPRINNISIj*IjLPPSLLIjIjLjASAI 
VE AG AG TG * T VY P P LAGN 


3690 


17591 


A 


3713 


424 


1 


FFFFFFL.RRSLTWQAGVQWRNI j GSIj*A 

rprv 


3691 


17592 


A 


3714 


2 


130 


grvggrvg*nlklysqrlkkkkkkkkkk 

KKKKKKKKKKGGGPF 


3692 


17593 


A 


3715 


3 


197 


FHHVGQDGIjNLLTL * STRLGFPKC * DYR 
CEPPRPAAFFFFFLKKKKTIiAPKKFKTL 
DLTTRERS 


3693 


17594 


A 


3716 


3 


497 


RSPPSGFAWPGPAGPWEAGAAAAGQPLG 
ARPHSPCTGGCRSAGTPP* PAGPAGPPP 
HAPAAAERKPRGAEQSQRRVQETGPPPT 
RAESNPSGLRGREVPGSAGCWSGSQESD 
FGAGCPAVT*GTYPYRIAGQVSSRAPHP 
AESRGCTAGLSIGPDGRQ 


3694 


17595 


A 


3717 


40 


379 


AITS PRS P PAVHGS CLARTHPNS PQCLP 
LSGTLLLPSSSLSLSHPATAAGE * IWDA 
E KKKKKKKKKKKKKKKKKKKKKKKKKKK 
S PGGAL KKKNPGG PNP PGGGKNI SSPPR 
G 


3695 


17596 


A 


3718 


445 


286 


KKKK*METRSCYVGQSGLELLASSDPAT 
LASQSVGITGVSHQAQPVNKLPRVQ 


3696 


17597 


A 


3719 


15 


383 


GYFLEASCLLHFLLHPMYQNCGHVACLA 
AKENGKFMQAACGQLNSPALSLKNGRLS 
AAYGGSDCVYLFPMSKMVLGTQ*ALNKY 
TLNGPGAVAHACNPS TLGG * GGRNLRSG 
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Amino acid sequence (A=AI:inine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methiontnc, 
N=Aspnrngine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
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nucleotide deletion, \=possible 
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VGDQPGQHGET 


3697 


17598 


A 


3720 


14 


472 


RSCARNPGLTHASAQYQTPIjFV* svlit 
AVLLLLSL P ALAAG I T I LLTDRNLNTTF 
FDPAGGGDPILYQHLF*FFGHPEAYILI 
LPGCAI I IHI VTYYSGKKEPFGYIGRV* 
AM I S I G YLG FI VRAHH I FTVG I DRDTRA 
YFTSATI I IAIPT 


3698 


17599 


A 


3721 


3 


29S 


EAAIKYFLTQATASIILLIAILFNNILS 
GQ*TITNTTNQYSSLIIIMAIAIKLGIA 
PFHF*VPEVTQGTPLTSGLLLLT*QKLA 
PISIIYQISPSLNV 


3699 


17600 


A 


3722 


86 


505 


AGAVTYIVTYYSL*KKDSFWIHSVWC*A 
MRVNWE S * GLS V * AHH I FTVG ID VDTRA 
YFTSATI IIAIPTGVKVFS*IiATliHGSN 
MK* SAAVL * ALiGF I FLFTAGGLTG I VLA 
NS SLD I VLHDTYYWAHFHYVLS IGAVL 


3700 


17601 


A 


3723 


432 


48 


FKTSFLPFLLPSFCPSFPSFSLFPFSFS 
CFLSCPSSLSFLFFFLLSFHPFFFPTLL 
ISF*VFLF 


3701 


17602 


A 


3724 


2 


406 


HH I FTERI DVDTRAY FTS AT I I I AIPTG 
VKVFS * LATLHGSNMK* SAAAL * ALGI I 
FL FTVSGLTG I VLANS S LD I GLHDTHY V 
VAHLHYVLS IGAAFAI IGGFIH* FPLFS 
S YTLDQTYAI IHFTM I FIGVNLT 


3702 


17603 


A 


3725 


174 


158 


KFFGGGFPHFFFPQKKGFFFKNPPRGFF 
TPPLLGKNFFFPPPVNLGPPRGFF*GP 


3703 


17604 


A 


3726 


2 


392 


LFSTNHNDMGrLYLIiFGA*AGGLGTALS 
LL IRAEIiGQPGSLLGNDHMYNVI VTGHA 
FGIIFFIVIPIIIGGFGNRLGPLI IGAP 
DMARPR INN I S C * VL P P S ILLLLASAI A 
EGGAGTG* TAYP P LAGNY 


3704 


17605 


A 


3727 


48 


474 


KKKKKKKKKTKKKKKKKKKKKKTRGPPP 
KKTPAGPPPPPPREKKNPPPEGGQKKPP 
PAH*KKPPPPRGGQQPPPPPPQKNPPPE 
KQKTWGAEGGTPPPKPPRKKNGSQQIAD 
PPSNAAKNRHPTQEPPPQSQCI PPPPTP 
PP 


3705 


17606 


A 


3728 


387 


53 


PPPKKGVLPSPPNFYTPPK*GVFSPLPP 
*KFFFSPKGLIFWGGGGPNFPPPKKKFF 
SKKPRGGFFFPP* KKKILISPPREKLGP 
PRIFLKGPPPFFFFFFFFFFFF 


3706 


17607 


A 


3729 


1 


397 


PTRPLQPLSPEFKRFSCRTSPDVAW*VS 
LPSPPPSGRSFFVFL*VESFCVWIKKRF 
TMLARLVLDS * PSDPPAWASQSAG I TGA 
SIiRASLLYIY * CFS IQLGLLPEFL IQ YV 
QLLIVYPRIHHLYLSVPHLH 


3707 


17608 


A 


3730 


407 


0 


TLKHTKKKKKKECKKKRGGLLKKTLGGPK 
♦NGGRKKKI FFLKGGKKKQTGEILKKKL 
ILGGEKKGKNPPKKKKPLGEKKNFKGEK 
GKKNLKMFGVIKLGTSSKKIIF 


3708 


17609 


A 


3731 


3 


200 


GG CS ELR S CRCTLA WVT E * DS VS LKKKK 
KKXGPFKVLRGKGPSGNKREGAKKKIQG . 
KPCPFWESF 


3709 


17610 


A 


3732 


3 


380 


TNHKDIGTLFLLFGA*AGVLGTALSLLI 
RAELGQPGNLLGNDH I YNGIVTAHAFVI 
IFFIVIPIIIGGFGN*LVPLIIGAPDMA 
FPRINNISF+LLPPSLLLI^LASAIVEAG 
AGTG.*TSYPPLiAGN 
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NO: in 
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Predicted 
beginning 
nucleotide 
location 


Predict- 
ed end 
nucle- 
otide 


Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 








26 


correspond 
ing to first 

peptide 
sequence 


location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


K=Lysine, L=Leucine, IM=Methionine, 
N=Asparagine, P=Proline, 
Q=G(utamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


3710 


17611 


A 


3733 


6 


394 


AM I S I GFLGVI G * AHHI FTV * I DVNARA 
YCTS ATI 1 1 AI PTGGKVFS * LiATLHGSN 
MK * SAAVL * ALRF I FLFTVSGLTG I VIA 
NSSLDI VLHDTY YVRAH FH YVLS IG AG F 
AIIGGFIH*FPLFSGYT 


3711 


17612 


A 


3734 


27 


238 


GIPGSSHASAHIG* I IAVLPHYLNITIL 
NI/T I D 1 1 LTTTAFLLLNLNS STTTLLLS 
RT*NKKKKKRGGPF 


3712 


17613 


A 


3735 


402 


249 


ASAHAS APL F V* S VL I TAVLLLLSL P VL 
AAG I T I LLTDRNLNTT FFDP 


3713 


17614 


A 


3736 


100 


430 


VTWP KSDSYEE VKI KLE PRQTDLRALGF 
CLSFTGKKKK*KKKKKKKKKKKKKKKKD 
KRGG 


3714 


17615 


A 


3737 


158 


306 


FLS SVFS LVQGVLFSSLCLDRNLPDMMQ 
LWSEI FNK* LYLQELS ILGVY 


3715 


17616 


A 


3738 


279 


3 


PGHCFYNFNIiNFYFQIFFQNVLiANSGSL 
LFLYQTLILSFNIFYILYLIIQISIVFQ 
SVIL*FIFSNSYSWWYGFLVCLMTFEGK 
KQTCAHV 


3716 


17617 


A 


3739 


268 


388 


HFFFFEMEADS VAKAGV * WYDVGLQQPP 
PPGVKRYS*LSL 


3717 


17618 


A 


3740 


395 


153 


GWLMRFKERCGLQNI KV*GEAASADAEA 
AAS * PEDLAKI ADEGSYT*QWI FYVGET 
AFYWKKMPSRTFIAREKSMPGFKL 


3718 


17619 


A 


3741 


271 


386 


TCPPPLKX,WVYELGL,Ii*RKNMRQGAVAH 
ACNPSTIjKG*GGR I SRSGDR 


3719 


17620 


A 


3742 


406 


190 


QL>LKRLRQENPWRWGG*GCS KPRSPPRP 
QAWVTERDFVSKSKTKTNQPKQNGERKV 
FFWFFFLRKPKVQFEI 


3720 


17621 


A 


3743 


272 


408 


MGLQACANS PL* YVDIAI PCNNKGAHS V 
GLMWWMLAREVLRMRGT 


3721 


17622 


A 


3744 


211 


407 


KGKRNILFGPVKNXFCDTQFTSCVNFFR 
EEVSLDCPGWSQTPALK*SSCLGLPTCW 
D*RREQPRL 


3722 


17623 


A 


3745 


405 


189 


CVSQDGLDLLTS * STRLGLPKCWDYRKC 
WTNQLFVNGQKSLNS FVCYS FMVLDTTG 
FKEIVMLWHLVPLF 


3723 


17624 


A 


3746 


120 


362 


LKA * E I SWIiNGTIQTHQEKPEVQNQVIIi 
EENIVFLGPQDRIFECQAVTAELEPEXK 
A I VAGCS DSHLYPSTLGGQDGW I T 


3724 


17625 


A 


3747 


401 


233 


NSGGRGCSEPRSLPCTPAWVTDRDPISK " 
NKNKTNHQ * QKKCKR * TGKVI FMTRS I L 


3725 


17626 


A 


3748 


353 


109 


LGLANFCIFSRDGVFPCWPGWSRTPDLR 
*SAHL.SLPKCWEYRA*ATMPGIiLMCFLI 
KALiD FQG C VGR WGGGS WTDIKLCQM 


3726 


17627 


A 


3749 


412 


221 


HGLKECLVYPKWCQNSNISLQGIHFSEV 
AM*QRAHNHGIHWFYHDSFPRKIWINWA 
RRGGSCL 


3727 


17628 


A 


3750 


1 


355 


FHHVSQDGL,DLLTS*SPRLGLPKCWDYR 
REPLRPAENTVYVHFS SWS FTNHSGKIN 
* TLDQWKEQLPGWL * I LRGVFFFKPS I W 
NS * QKKAS I YVFPLCWKRDI FLYHFPVS 
GTFRGF 


3728 


17629 


A 


3751 


423 


125 


Q VCEKVL I S L I TEMQ I KTTVRYHLTP VK 
MAFRQKTDDNKCWRGYGEKGTPVHCWWK 
CKWQPIi*RTAWRFFK*RKIELPYDPAT 
PLLGI*PKERKSGQV 


3729 


17630 


A 


3752 


3 


171 


FQPIAFLYANSEQSGKEESTSIYSSYEE 
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correspond 
ing to first 
amino acid 
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Predict- 
ed end 
nucle- 
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correspon 
ding to 
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sequence 


Amino acid sequence (A=.4lanine 
C=Cysteinc, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K = Lysine, L— Leucine, IVI=IVIcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T— Threonine V = Valine 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 

nucleotide insertion 














YLG I NLTKEM KDL * KENY KTLME K I ELD 


3730 


17631 


A 


3753 


2 


247 


FHHVS *DGIiNLLTS * S AACLGLPKCWDY 
RH E PR RP AIjT R FLFVFL YLQRLDH L I R I 
NSFFIMSVNGMFTFWLKLFNLIHA 


3731 


17632 


A 


3754 


3 


197 


GLELLTL*SACLGLPKCWDYRHDPLCPA 
QYIiSFLYAINKLFSSTYLEIHNRLVCCI 
VTLLIYKP 


3732 


17633 


A 


3755 


2 


103 


VSQDGLHLLTL*STLiLSLPKCWDYSCEP 
PRPAW 


3733 


17634 


A 


3756 


103 




FRPGMVAHACNLSTLGG*GGGITRSGVQ 
DWPGQ 


3734 


17635 


A 


3757 


394 


244' 


ASTHPANFFCRNGVLLCCPGWSQMPGLK 
QSS*LSLPKC*DYRCEPPHSAS 














AEIAPLHSSLGMKRDSVSKKKELKKLNK 
QKPNNPLKK*AKDMNRHFAKGIQGANEH 
KIKCSTSTNLREMQIKTTVRSLLI *VRM 
AI IKKSTNISCW*ARREKGILIYTWWEC 
K 


3736 


17637 


A 


3759 


2 


322 


PPRPTRPLNNTINQMETICMYTKCYLII 
AE*TLFPSAHGTLIKIGHILGHKASLNK 
FKRIKIIQSAFFKHNG IKLENDKRKITG 
KHPNTWKLRKHTSKPMSQEMFF 


3737 


17638 


A 


3760 


444 


422 


VRLPHRPSPPPDPRVRFHHVGQACLKLL 
TSSDPPTSASQSAGITVPSLQ*QP*SVR 


373S 


17639 


A 


3761 


266 


1 


GGGVPLGPVIPPLRPPQRVGFQALAPQP 
GPGSKIF* I FFGPQFKNRVFFKGRULV^ 
FPPLFSLLPFFFFF*DGVSLCHPGWSLE 


3739 


17640 


A 


3762 


70 


381 


HSITSYI*AFIWGFFLFCFVFFLIKEME 
SHYVAQADHELPASWAP PTSASQYAG I S 
GGSHPTQHYFIFYWIFYFTVFCGGGFLR 
RSLAPS FFAQAGAQWRDLGS 


3740 


17641 


A 


3763 


229 


390 


DI PYS FCFFFFFWKGS FVFVPQAGGKGR 
KLG*LKPPPPGLRGFSCLTLPGSWEN 


3741 


17642 


A 


3764 






QTSCRPGAVAHTYNPSTLGG*GVDHLiRS 
GVRNQPDQHGE 


3742 


17643 


A 


3765 


400 


200 


ITAYCSLDLLQPHFLFFSRDKVSLCCPV 
WSRTPELK*SPCL,SLPECWD*S*ATASG 
LKHFIKSLPL 














TQPIFLjGPGFQAPPGPMG^PNFLF^EKK 
LVFFPKKPKTAPAGGGGPKFPLPRKPKQ 
KKGPHPGSQNFQKPRPKFPPRPPTWGKK 
REFFSQKKKKKKRERDVKITDLNTST 


3744 


17645 


A 


3767 


403 


3 


DHFFFKGPPSQKGEFPLFLKTFPSPLGG 
FG* ISLiFPPPINFPPPPKKRIFPGPLSF 
FFFKKKKTYPAGKGLLFFPFPKKPGGGF 
KRWPGPPKKFFFPPPGFWACSPPFFFFF 
FFGAKVSLVAQAGARSRSRTS 


3745 


17646 


A 


3768 


407 


177 


FHHVS QDGLDLiLTS * S ARL.S LPKC * DYR 
HDPPRIARCI KNFCRGWRCNILRKS PLS 
LPPSAGELRAPFLDFLEDAQ 


3746 


17647 


A 


3769 


292 


391 


RPGWAHTCNPSTLGG*GGQITRSGVQD 
QTGQY 


3747 


3 7648 


A 


3770 


395 


198 


LDMGFHHVGQAGPEIiLTSSDIiPASASQS 
AAITGMSHRAQPY*VISMCQQCYKHFIR 
IPFKTLHNNV 


3748 


17649 | A | 3771 


1 


241 


RTRGRTRGRTRGLLSAHTGRYQKHPRIR 
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peptide 
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Predict- 
ed end 

otide 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutnmic Acid, F=Phenylalanine, 
G=Glycine, H=Histidinc, I=Isoleucine, 
K=Lysine, L=Leucine, MHMethionine, 
N— Asparagine, P=Proline, 
Q=Gluti»mine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TPQIFSTGTTKAEAGELLEPGGGACSE1, 
RSCHCS PARATRVKLHVKKFKKI K* KHP 
RIRTPQIFSTGTTKAEAGELLEPGGGAC 
S ELRS CHCS PARATRVKLHVKKFKKI K 


3749 


17650 


A 


3772 


112 


406 


FFFFFFFFFFFFFFQGEGVPPI*KPPPP 
K* FMFPKKKLPPPPPPALKLFFFSPPPP 
FFFFF 


3750 


17651 


A 


3773 


24 


197 


I FS VEMGFCHVGQCDLDLLGS SNPPAS V 
S * SAGVI GVSHHAW PLVKLFVKRNKLS Y 
V 


3751 


17652 


A 


3774 


85 


201 


VNFFHLiI * FVNKRI KLEKKKKKKKKKKK 
KKKKINWKKKKK 


3752 


17653 


A 


3775 


1 


382 


WENHLS PGSRGS S E P * SHYCTPA* VTE * 
DS VSKKKKPNLQEAEKRLTLLLI KKAI F 
KNRGS PCFS I KWAGFKKI KQGWFFVATR 
MGDRSRGTPNLGENFLKSFLSFSIKIQR 
FYYLGLKNSPSRNIY 


3753 


17654 


A 


3776 


1 


181 


IKMATVTKATYRFCVIPSKLPMLLFTKL 
ETKIIiKLIWNLERA*IPKAILS * KNKMG 
LSNS 


3754 


17655 


A 


3777 


70 


385 


LNKMQTKTFDSDTHTSICFTLHFINTKL 
QFIFLLAIYLLFVNFPFIFINFFEEPGF 
CFDDLSS CFLFY+ CI P * FYYFFSAFFRL 
HLFFFYRFLS LLTLDLFS FLI 


3755 


17656 


A 


3778 


2 


406 


KDDRNTEHQVHPNASDQEDEAFDPYENF 
FEDSDSPTKSSSTEPSPHIHPVDIQMTI 
FHCADNFARQ Y I IAKLAKKKKKKFCKKKK 
KKKKKDSSSPGGGPKKGAPPPPPRGEKN 
FFFFGPPIiIFSGGGF*NGGGGGP 


3756 


17657 


A 


3779 


403 


176 


FLFP ILFFFLRQSFTLVTQTGGQQRDLG 
SMQPPPPRFK* FNEGKGKRKLWGGA 


3757 


17658 


A 


3780 




248 


GLNQTQLRKILAYSS ITHIG* I IAVLPY 
NPNITIliNLTIYI ILTTTACKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKGGGGA 


375S 


17659 


A 


3781 


3 


330 


VDRLRTGVQDKPGKHSESPSLPKKKKKR 
* KILPKRGGVYLNPHFFERLRGENNLTP 
GGEGQRETESPPCL>PAGGKNKIFV*KIQ 
KKKRGGPFKGNKFFPRGGGKKIFF 


3759 


17660 


A 


3782 


399 


187 


FPPPAKGDFLPSHliFGLPPGFSPPPVFK 
PRPRNL ILGPP * KKFYLPPPRP * TWFL* 
NGPPPFFFFFFFLDF 


3760 


17661 


A 


3783 


392 


11 


KKLKVNLPDHPAVPPLGIFPKEMKAGI* 
RAICTL.MFIAALLTIEK*WKQPNWLLVD 
K 


3761 


17662 


A 


3784 


324 


80 


PGPFFFFPPPAKRGFFPPPFFLVPPGFF 
PPPLF*TPPPDFFFLAPLKKFFFSPPPA 
LNFFFFKGPPPFFFFFFFFFFFFFF 


3762 


17663 


A 


3785 


354 


24 


PPNKIFFSPPPQKKYFPYKTPPPTFFFL 
SSFFFFFFFFFFFFFFFFFFFFLC*MVH 
F I VKYRKCKRGTR 1 1 LHTPTLRQLVLCC 
LSDVLLGCCIFIYVTMCDLCAPKLG 


3763 


17664 


A 


3786 


3 


369 


S CRGLVWVYGE LLLHP CRGNQ I S R VMQK 
KKKKKFGASPPREGAPPGEPP*RGP*TQ 
KEP P PL KK * AP PPGPGGGRPP PMGLGGG 
PAPPPFFWPKAFFKKGPFSGQEFGFPLK 
GLFFWKFPF 


3764 


17665 


A 1 3787 


1 


125 


FRHRFSRDGLDLL IF* S ARLGLP KCWDY 
RREPPRPAHSIII 
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nucleotide 
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1 M 
eth 
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NO: in 
USSN 
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Predicted 
beginning 
nucleotide 

correspond 

amino acid 
residue of 

sequence 


Predict- 
ed end 
nude- 
location 

ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidinc, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosjne, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3765 


17666 


A 


378S 


■All 


251 


LTPKGRGCSEPR*CHCTPAWVTEGDSVS 
KKKQAKNRCSLRMKYERNE FTSSPC 


3766 


17667 


A 


3789 


250 


2 


GFPPVEKIKKNPFPIF*KKPGFFFFPNP 
FF* ISGGKGKIFFLGPPPFSFFSPFFF* 
DRVLLCHSGRSAVAQSQLTATSTFRR 


3767 


17668 


A 


3790 


412 


3 


GSTPITTPFWGVGGALPPGGGVPPPPAP 
PGGTPFFFKKKKKKKFQNFPRPRCSRYN 
GGLGR KN FFTLEAKGS INPNFS PAP PPG 
GQNKTPFPKKKKKSR*MTLQSFPRPGTA 
PGAAM+PVSPPEPCGPPAPALSTK 


3768 


17669 


A 


3791 


10 


187 


GCS EL * L I hctpawvhsetls qkkkgrg 

KGGFKKNGI PCFGKNKGGKRFGKSFGHF 
STG 


3769 


17670 


A 


3792 


1 


331 


FRVPLGRLSRGEHQQQ*QQQPPPPPPPP 
GPLRPLAGPSRKGS FKIRLSRLFRTKS C 
NGGSGGGDGTGKRPSGELAASAASLTDM 
GGSAGRELDAGR*DRPGAGRQTSFSW 


3770 


17671 


A 


3793 


1 


431 


FRKGSARCRAPKPAGADGVHWGHHRASP 
QTAQAPPVTCSARGSWMDFSHASGPQDR 
DLHSPSPVCPRGWAPGTHSPTGASGA*G 
RTPVGGGK*VAGVQQRIHPGLAPG*LVL 
PGIAVHITKKPGSKGTITPGTVAHACNP 
NTLG 












48 


KRENMPGVAYNPSTLGGCNPS I VGGFCN 
RVCNPS ALGGCNP S FLGGFCNHVCNP S P 
LGG * GGR I S RSAAQDHPRQHGEKMSPQK 
IQKISQACFARVG 












189 


LNPGGRGCSEPRSCHCTQAWATE*DSVS 
KKKKKKGSRFQNTI PSALPGFPVSGELI 


3773 


17674 


A 


3796 


1 


401 


LNPGCKGCS EPRS CHCTPVWVSE * DP I S 
QKKKKKKKKK 














ILAKVYPMVNSFLEKIHYFLLVELCCFL 
QHVLYYLTFKTTETGSRSVAQAGVQWHD 
RSSL* PLPHGLKRSFFLSLPRSWDYMKT 
PPCLATE 


3775 


17676 


A 


3798 


229 


394 


VFVKSLLCQPGMVAHACNPSTLGG*GGR 
ITMSGD*DHPGKHGEVSTYNTKKKKGG 












206 


AGHM*S*LLRRLRQEDPLSPGVQGCSEP 
* S * PCP PAQVTVR I S LKKKRLHERDKS S 

AS ... 


3777 


17678 


A 


3800 


450 


3 


IYLLRGRHSUHYSRPSFYFSNYRGACSY 
MRPPQQATLLGRLNQGITREGPAIAHSP 
AVRPRLSLAHAHVRRLCLPRPGAAPEFP 
AF FL P CLLVAQAG VRW RDLGS LQ P P P P R 
FKRFSCLSLPSSWDDRRPPPLSS*FAFL 
VEMGFRRV 


3778 


17679 


A 


3801 


301 


10 


KKQPPKKKKRCFPPLiFPPPKGFKGSPPF 
F FLTP P P LKNLPWTLKALPQ I FLRT FF F 
F FFERGS PS VTQAGVLWHDLGS IiQ P * TP 
GFKHMPSCPVLLC 


3779 


17680 


A 


3802 


420 


51 


SASSPPASFFFFPPPKKKIFSHPFFFFF 
PPFFFPPPFF*PPPPFFFFFPPKKKKIF 
PPPPQKIFFFLKPPPPFFFFFFFFFFFF 
FFFCGFVNCQFS VL I 


3780 


17681 


A 


3803 


253 


73 


QTTPSKRYS YVLNCCNN* ITTCKRI KLD 
P FLTLDLK I N* KS FADLNVRAKT I DTI K 
KVKG . 
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M 
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09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno»vn, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3781 


17682 


A 


3804 


385 


55 


AGTR P VF AALAAAAP AP RliLPIiAPHG S A 
ATGFPPPPCPHGEGLRGAPAASSSPPIA 
HRSQTPNWVDCGES P I PDPQFPLL*NGP 
PALCLPQGLETVNVESLALCFTSHGN 


3782 


17683 


A 


3805 


191 


2 


QKHLKALNFLPRCVCVCVCVCVCVCVNV 
TF*ACKICTCSDFPGSPRKKYLEFGTKI 
INIVDFN 


3783 


17684 


A 


3806 


1 


367 


KXiRQVI YLEDLPT I 1 DEGS YTKQW I FNV 
N*TAFYWKKMPI*KTSVT REEKS MPGFKA 
SKDRLTVFLGANWSDFKLKPMPIYNSE 
NPKAIQNDAKSTLPMLYKRNNKACIAAH 
LFTAWFTEYF 


3784 


17685 


A 


3807 


3 


145 


IiGPGGGGCSEPRSCHCTPAVJATRGKIjCL 
QEKKKKR* RLDPEAEVAVSRGRAIALQP 
GQQEGNS VSRKKKK KGRD KVS LCC PAG F 


3785 


17686 


A 


3808 


362 


86 


NPPFICSPPFFPPLWGPKQWPPWCEG* 
TPPGPPCEIPLPPKKKSKSPPGVWGPPF 
FPPPWGGGAPKFFLPPKWGFPQTQICPL 
PFGLGEKN 


3786 


17687 


A 


3809 


245 


377 


FGYMPSFFETQSRSVTQAGVQWHDLGSL 
QPLPPGFK*FSCLGLiP 


3787 


17688 


A 


3810 




268 


KRNFLNLI KRRYE KP I ANFVFNS KIVNT 
FHLRSSTS*GYPLLPLLLNILKVLPNEI 
RQQKE I KGRKNKGKKTI SVYS IMLSANT 
EKPK 


3788 


17689 


A 


381 1 


2 


388 


SAENCLNPGGRGCGDLRWCHCTPAWVTE 
RDCV S KTNKKE TE K YMRN * TGNMLT I ID 
AGRRYMRCHQTIFTFFFFKRGSLIFLAP 
PPSQGEG I FLPHP PE KWKKKVRI TTPTY 
FCFLRTDGFLPCGPGGF 


3789 


17690 


A 


3812 


292 


136 


PPTVIARKGKPRPKLSSEPWQFSAEQIjG 

L PS flcffvcvr ccclllflllfcrde V 

S I CCPGWS * TPGLKQS SHLS FPNCWDCR 
CEPLCRAIF* FFCLCTLLLFAFVSAAVI, 


3790 


17691 


A 


3813 


359 


52 


KKGRFFFFEKNPFGDLVPKKKKKKTPPP 
PNPKKKKRGAP P FF FFFFF FRH+ VPLC C 
L1GWFRTPGL1KGSSCVGLPKPWDYRHKPL1 
PLTKDDYYVYGLPFVSLQ 


3791 


17692 


A 


3814 


382 


1 


GGPGMEAPLGYKI KTPFPLKPQKNWPGG 
GAKPWPPPPGG*AGKMGLPPRQRFPLT 
QKGPPPPPLGGKREPPPQKKKKKRVTAM 
GDP*NSECR*GCHLRVQKEMRLPGLRNS 
P PHY I PNRNTDM CSC 


3792 


17693 


A 


3815 


472 


340 


TGFHRVSQDGLDLLNS*SACLGLPKCWD 
YRRGLF I I I FMMGE WS 


3793 


17694 


A 


3816 


247 


378 


PPPPGGGXGXNLGLLKPLPPG *REI PPP 
PPRGGGNPGGAPPPPG 


3794 


17695 


A 


3817 


25 


465 


TPPLGG*GKRFPRGPGFSPPPPPKGNPP 
FF*KFKKKPRPGGGAP * FPP PGGGRPKN 
PLPPGPKGSH*PKFPPPPPPGETKRKPP 
PPKKKKKKRKKEKKKE I * AYKAY I CFDC 
PLTGPGDLKSDKTLAFPSVIGL.PRPCTC 
LSPR 


3795 


17696 


A 


3818 


475 


147 


PPKG*RHPPGPPRFRRGPGNPPPHRFFF 
VLRTPGYSFPF*KKRGPPPTFFFTLPGP 
PGFLKGPPSPGGGGQKIPFPRGGGGPLK 
* I PPFPPPPGKKKDFFFLTKKKKKKP 


3796 


17697 


A 


3819 


461 


57 


TNQKKAAPPPLjLKKKFFFFFKKHTGGAP 
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M 
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nucleotide 
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sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Jsoleucine, 
K=L,ysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y= Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














PGEKKKRGGGEKKKRGVYRKGPRAPSRG 
CAPPLYYRGGLFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFHVKNILFVF*DYIVI IADL 
IQSLRHKDFQELLRR 


3797 


17698 


A 


3820 


1 


418 


HAYHI * KPSP * PLAGALSTLRMTSALAM 
* FHFHS I TLL ILGLLTTTLTIYQ* WCDG 
TPESTYQGHHTPPVQKGLRYGIIL.F3TS 
QAFFFAGFF+AFYHSSLAPTPQLGGHWP 
STG I TPLNPLEVPLLNTS VLLASGVS I 


3798 


17699 


A 


3821 


385 


1 


LQFPPFSPLLFGVQRPFFSSFPSCPWWG 
PVPS KLLPLLQ I LGRG FYPQKS LLVS LG 
FFFFFFU I KKHRNEVLLCCPGWSRTPE 
LK*SSHLGTPKCSDYRCEPPHPPDPWDI 
CSYVPFLQSTHFGLNY 


3799 


17700 


A 


3822 


72 


424 


KKKKKKKKKFFPRKKGAPPAPPLFGKAG 
GEKLFPPRGLIFKKPKKAPGEKKKNLFK 
KKKKRGGP * KKKKLKAGGGEKKI FFKGA 
QKKNPGAGFKKGGRGKNRGAPQKKGGEG 
SSS 


3800 


17701 


A 


3823 


154 


2 


GHHTQHTEPQRLiNNSYRYAELiKHIjSNTK 
NNPI*KWAKDMNRHFSKEDIQM 


3801 


17702 


A 


3824 


422 


212 


MGSSNPSPTASWVDEPTGICQHT+FYVE 
MGFCHVSKAGLGLLGLRDPPPSASQSAG 
VAGVSHCTRLWIIL 














KRVNTRGFPPPPFFKTPPPKIFMGPQKK 
KNFSPPPPIKIFFF*KAPPSP 














ACGRGKYFKSGGRVKPFCAVDFPPSGKK 
ANQPTPKNPRKKQXPSGRGMSPMGKPPG 
CPCPMPKASPGPVS FNKGGVAPATKDFC 
PIGKTA* IPSKKKKKKSDPG*VKKGKPN 
LYSIYNK 


3804 


17705 


A 


3827 


285 


136 


FFFLFFXFLFFFFFFFFFFFFFFFFFFF 
FFFFFFFFSGA*KIFIIATIHY. 


3805 


17706 


A 


3828 


425 


316 


QDGLDLLTS*SARLSLPKSWDYRHEPPH 
PAQTFFKQ 


3806 


17707 


A 


3829 


3 


169 


LSNS * ANNL I * IiAYTIAF I VKI PLYGLH 
L * LP KAHVEAP I AG S I VLAAVLL KLGG 


3807 


17708 


A 


3830 


6 


452 


IGFCQGFAKDFCSKKFQRNVRVSPKGGE 
GRKFLLFKGGGFQKFPPLPPGWGAKTPF 
LKPPPKKKKKKERKEKKEKEKKRNPGPP 
VSPSSWSPVP*APSEQRSGPQLPPSRGD 
♦WRLPSPPRPFRDCVRRSATRGNSPPC 


3808 


17709 


A 


3831 


45. 


387 


NFGVKVFS*LATLHGSNMK*SAAVL*AL 
RYIFVFTVGKKKKNKfCKKKKKKKKRGGT 
PPGGEKRKFFS P KKKKKKGFLKNP QP KK 
NFFWGKPQKKKGGKTPFWGGGKKAPKKT 


3809 


17710 


A 


3832 


463 


288 


DGLDLLTS *SVRLGIiPKCWDYRHEPPRP 
APCILMATFFQFEVLFSEFLETIVKSVL 
LH 


3810 


1771 1 


A 


3833 


294 . 


90 


FFFLFFFFFFLFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFF*AGIYFI 
TCAPPPAYPRS 


3811 


17712 


A 


3834 


427 


37 


SYHFGRLRRADHEVRRL*NRKGRPGAVT 
HACNP I TLGG *GGRIMRSED*DH 


3812 


17713 


A.: 


3835 .... 


499 . j 53 


AKLMYFQSRENVAKRSYRRNQKASMVHP 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isolcucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine ( 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibte 
nucleotide deletion, \=possible 
nucleotide insertion 














NQEPAVIAGQGTIALEVLNQVK*LSSYH 
CRLLVGMYHGCLLVSSVTPFCNIFSGLK 
VH 


3813 


17714 


A 


3836 


304 


3 


RLKI TKTNNKQTE INS * Y I KDLNARAET 
I RLLEEN IGGKPHD I RFAM I FW I * HLKK 
TKN I Y KLD YTK I LD FCI S KD T I NKVKR * 
PLEWEKIFANHISDKG 


3814 


17715 


A 


3837 


475 


135 


MEGLFC PAPMVHPF IPPP LGG * GGGS PG 
PGIQTPPGYPGEPPFFFTL.KKLPRRGAV 
FFFPPPLGGLGPKNVFTPGAGASLEPGW 
APSPPPGGPP*NPFPKKKKKKRLKIGWH 
F 


3815 


17716 


A 


3838 


415 


245 


ETGFHHVGQDGLDIiLTS * SACLSLP KC W 
DHRRE PPHPD PQVS F IRAL FP FMRAPLS 


3816 


17717 


A 


3839 


460 


413 


WDYI KLTGFCTAKETRVKRQPTEWDKTS 
AKYI SDKGLKTRLHKEQ* KMTNLI *NWV 
K*LGRYSH 


3817 


17718 


A 


3840 


2 


470 


GGDAPHFPDGVAARRRGSSLL.RWGGCQA 
QGLLTSQTGWPGRDAPHLPDGVTAGPRR 
SPHLRRWAAGQTRSSLPRCDGGREEALL 
TS*VGWRPGRDAPHFPDWAARQRGSSHP 
RRWAARQRRSSLi PRRGSGRAE AAI SNS A 
S LQRVS KVS VS CRPGG 


3818 


17719 


A 


3841 


54 


332 


Q FGFHHVDQAAP ELLTSDHLP S SASQ S A 
GITGVSHRARPSLAHSFTHSIiAL.*PTPA 
HSLIHWLTHSLTHSLSFPCSFSSPPSHT 
HSLARPLGV 


3819 


17720 


A 


3842 


208 


468 


ENPTPPPLWCFALGVL*EDGFSLYSL,SA 
HVCE I VFLYCP VLESARPLMFLWNLAHV 
CYYEWHLFSVSLEFHTVS 


3820 


17721 


A 


3843 


406 


2 


PSSSSSSSSSPPAAFFLGCPGVFPSPPF 
IjKPAPGFFFWGPKKKNFFS PPRGVKFFF 
FKGAPPLFFFFFFFFFF*DGVLLCHPGW 
SAAHAS 


3821 


17722 


A 


3844 


370 


253 


VGLLEPRGQGCSELRSHHSTPAWATE*D 
PILKQNETEQK 


3822 


17723 


A 


3845 


369 


73 


LRQDNGMNPGGRACSEPRSHHCTPAWAT 
E * DS I S KKR KSQNS VFTES YFASEHCYS 
LTGGGVAILMEGPTGPDGAI FMCHLRNL 
CRCSPILPIALQCDH 


3823 


17724 


A 


3846 


198 


388 


FRKSINVRYHVNRINLKNHTI IPIDAQK 
P * EKLQQHLTMKYS I IMNSRKLSQPALR 
NLF+KPS 


3824 


17725 


A 


3847 


391 


20 


GFPPPRVF*HGPANFFLiGPPKKKFIIjPP 
PR* *TWFLLKGPPLFFFFFFF*DGVSLC 
RPGWSAVSKN*QTNKKSIILPIRGKLNL 
FVHSML I KG FTCDLLI FKNRKVKESTKT 
IDHIFSRQLMA 


3825 


17726 


A 


3848 


227 


2 


KRRNPFHKAI AAI DSSDGPR * SKLKNFW 
RGFSIADAIKIFRDSWEVKISILTGFWK 
KW I LTLMDD FEG FNRTRGR 


3826 


17727 


A 


3849 


373 


2 


SSPQKYWGGPGFFPPPGFKTPPPPFFFW 
APTKKKIFFGAGRENFFFLRGPPPPIIF 
FFFFFFLFFFFFFFFFFFFFFFK*YNSF 
I *DIKTHVFIiIjHSVGFRVLVYKTRIjYFT 
LYHSFLKNELG 


3827 


17728 


A 3850 


394 


2 


EACS VTQAGVQWRNLGS LQAPRRQS fcl 
RGSLLSPQH*EDRT*HIGGTQQTSMNE* 



785 



JSDOCID: <WO 0164635A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 
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nude- 
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Inst amino 

residue of 

peptide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glut»mic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MNLYLQWLCPLLYQLCSTS YT * EVIQLN 
IAFCVCLVGFFCFFVFLFFETASQSFA* 
AGVQWCHHHSLQPQLPKL 














NHLNLKGRGCSDLRLCHHTLAWATEGDS 
ISKTKTVQ*RKLQNNIPHEHTCKNL 


3829 


17730 


A 


3852 


35 


390 


NFVSNSFFFFFFGNGAPRGPRGGGQGPN 
LG*GEPPPPG*KQFSFLTQPRGGNNGGG 
PPPRVNFGFLRKNGVPPGGPGGPQPPDL 
GTGPP P PPEGS PPPGPQFSKTKRTAPGQ 
PKGGPF 














PNK* KTPGLPPPPFFYFFPPPPPFLSFL 
KPFSFPPPPPPPLFFFLPPP 


383] 


1 7732 


A 


3854 


1 


236 


PTRPLRWLRQENRLNPEGQGCGEPRS CY 
YTPAWATE *DS I SKKKKGGGRLKGPTLT 
SGGGQGLFFLRGPQKSTSGAAF . 








3855 


52 


351 


TVGIMEPQTTDVYVKLHKCTDFLLPDYI 
FNKOHYPKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKRTKQKKKKREKKKKRGGGAPQ 
♦LTLSPPPPPLYYFFF 




17734 




3856 


72 


420 


NTYGKKKKKKGKKKKKKKRGGRE KEKLW 
EAGGGGTTRGL* KENSGGNMGKPRIKKK 
GGKARGAPLGVSSTQLL*RGEGEKCLSP 
GGGGCSEHKTTPLAYPGLGKKKETQSGK 
EKK 


3834 


17735 


A. 


3857 


410 


107 


SSSSSPLARYFLGVAPGFPLPTLFKTGP 
RIFFLGPP*KTFSLPTPGR*T*SP*KGP 
PFFFFCRDRVLLW*PGWSEMCGLKQSSC 
LCLQKCWRYRYKPPQLA 


3835 


17736 


A 


,3858 


419 


239 


TTRSQLLLFLVEMGFHHAGQAGLKLLTL 
R*SAGLPKCWDYRRESLYLADA* GFYNL 
EDPL 














TAAQEFNFGGPIKKIFRCQPGEVKIjDPF 
KRPPFFFFFFLFFFFLFFF*RRVLPCCP 

PAS 


3837 


17738 


A 


3860 


357 


133 


NTPPPGAGRNKENPLFPKKKPRGVGPLF 
PPPRKAQTEEWLYPGPK*FFFFPPPPPG 
GKNTKKKPPPQKKKKKKSL 


3838 


17739 


A 


3861 


253 


375 


TLSPQG*GGPPPPPPRGGGLEAPPPPPG " 
LFFVFWGKRGVS P 


3839 


17740 


A 


3862 




482 


YITDEVNDPSLTIKSIGHQWY*TYEYTD 
YGGLIFNSYILPPLFLEPGDVRFL 


3840 


17741 


A 


3863 


1 


!29 


LANSNYERTHSR III LSQGLQTLLPL IA 
F* * LLASLANLAKEG 








3864 


289 


372 


NHL VKKWAKDLNRH F S * ED I Q I AHRHM r 


3842 


17743 


A 


3865 


266 


31 


KICFIFSPPQKKSVFFPKVRFPQKRKPP 
PALL FS AY FFFE AE S S S VTQAG WQ* CN 
LGSPQPPPPGFKRFSCLSTLGG 


j 843 


17744 


A 


3866 


1 


395 


NTSDFCIKPRAYNESEHHWDMVRRHL*G 
KEDNLTLD I S KLKEQ I FETS KAQLNLVS 
ETEAMVKAVDSLTNLNPVTWVKTIGNST 
IANFVLILGCLASLLWYRYIQQLRRDS 
DQREGAMMTMAVLS KRKGG 


3844 


17745 


A 


3867 


137 


2 


APLIKKHECLRAVAPACNFSTLGGRGGE 
ITR*GV* DQPDQHGETP 


3845 


17746 


A 


3868 


1 


108 


ENRLNPGGRRCSE P RS C YCT PAWATERD 
SMS*INNK 


3846 


17747 


A- . 


3869 


198 


302 


LLNINHNYGGV*GCSEL*SHHCTLAWAA 
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peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoph»n, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ASLHPKKKKKKRKKKSVKSQGAFYSHYA 
MAILNYIREKKLFLKWK*LLNINHNYGG 
VLEKKKP* SYTYNLKKSQNLPQLL 


3847 


17748 


A 


3870 


237 


419 


KEIQSKQGLRAICIPMFIAASLMIDKG* 
RQPKCPWREE* INKXiWYKHLMAYHSALK 
RNEIL 


384S 


17749 


A 


3871 


239 


232 


KAFLHTRVKRG I Q PG AVAHACN P SALGG 
RGGRITRSGDRDNPA*HGFSWGKCLGIE 
LVGHMFNFLRNCQTVFQSGYI I YILPMP 
LDS 


3849 


17750 


A 


3872 


417 


219 


CHVAQADLKLLSSSDPPASASQSTGIAG 
ISHRAWPRGTNPIHKGSTLMT*LPPQRP 
HLLTLSPRGF 














NILFFFFFWKKSNLVPQGEGQGQSLG*W 
NPPPRGLRGFFNLTPPRTGNKGGDPPPQ 
ENFGF+RKGGVTQGGGGGSKTPALGGPP 
GLTPQRGGNNGQDPRPPAKKFGIiGKKKK 
NRQLiPFLiGFFR 


3851 


17752 


A 


3874 


278 


2 


LISIYVSIIYLYHSGFQLSKISPPQGQL 
AYLSIYLSVCHFLIYEIGVITMPASKID 
K*DLIKIKSFCTAKETIIRANR*PTEWE 
KIFAIYSS 


3852 


17753 


A 


3875 


233 


379 


QCDYFNELLRME I LGPGAVAHACNPNTL 
GGRSG * I TRSGV* DQPGQHGE 


3853 


17754 


A 


3876 


6 


247 


MLITSASYPEDLAKIIDEGGYNKSQIFN 
VDETALN* KKTPSKTFTARDKSMPGFKA 
SKDRLTLLLGANFKLKPMNDLILI 














AKI P PLTSS QGHRTRVCQKKRE RERE KE 
KEEGRKERERERKKGRKKERKEERKGS * 
VRMLFLDKVDFRSQKIITDKEGHYIMIR 
E * MHQ * D I I LNM Y I SNNRALKYM KQKLT 
E LIGRVGRPGRSGTVPGRRNG VQ 








3878 


174 


375 


SFQQ*TAELEIKNTTSFTLTPPlCLiKYLG 
INLTKYVQDLYK 


3856 


17757 


A 


3879 


1 


106 


FHRVSRDGLNLLTS * SARLGLPKC WDYR 
REPPS PA 


3857 


17758 


A 


3880 


383 


24 


FALNNFPQGGAFSGPLTLSPQVGLFFAP 
GPPPPGGGFSCPJuAQSFFRIFPQKSGPP 
ALSFSPLFKKNPVVFFLRTQPLLPLFPQ 
PRVFFFPPFFFFF*DRVSLCCPG*SAVA 
QYGSLQP 


3858 


17759 


A 


3881 


400 


71 


RIRQENRLNPGGGCYSEPRSRPRTTAWV 
TG * DS I SKNNNKNK * NLS F I YFY YYFLR 
WSLTLSPKL 


3859 


17760 


A 


3882 


282 


3 


I FNFCGYIVS VY I YEVQEIF* YKHAMCN 
NH I VENGVS IPSSIYSLCYKQSNYI IX V 
ILKYAG WAQWLTPVI PALWEAEAGGSRS 
QEMETILA 


3860 


17761 


A 


3S83 


1 


175 


li YKNF FKliAEHGG VY Q I LRRLRQED GLS 
PGG*GCSEFKSCHCTPAWVTQQDPVSKK 
FF 


3861 


17762 


A 


3884 


415 


233 


SQDGLDLLTP*SARLGLPKCWHYRHEPP 
CPAANFFLKARYFWL.MVARYFWLWCNYS 
ALPS 


3862 


17763 


A 


3885 


237 


367 


GPVLGPGMVARTCNPRILGGRGRWIT*G 
REFETNLANMAKPCL 


3863 


17764 


A 


3886 


359 


108 


TGQGGGGPPPPPLGRLKKKNGVNPEGKN 
SNKPKLAPSPPPGGKRKPPSKKKKKKKG 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vnline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, v, =Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














G*HLPMLPRLGSNSWAHDMIIjPPYPPK 


3864 


17765 


A 


3887 


245 


377 


QFIKNISWPGAVAYACNPTTL,RG*GGWI 
KRSRV* DHPDMVKPHL 


3865 


17766 


A 


3888 


406 




SLNI FFFFFFFFFFFFFFFFFKPGRVFP 
LKNFFFFPFKGCRIPFPPIFPLWLKVFF 
GPTPL*NPRG* KEGFCPPPFFFFFFFFF 
*GRVLIjCRPGWNAVAQSRLTATSAHAS 


3866 


17767 


A 


3889 


3 


114 


EFHCVSQDGL)DLPTS*SAHLGLiPKCWDY 
RRKPPCPA 


3867 


17768 


A 


3890 


395 


-j 


MVL I S * PCD P PTS ASQSAETTG I SHRAR 
QFPMLIiSQ*LVCHSVTEVFLIFLLKLFY 
IYDILLYIIYNYItiLIIYYLFLFLLFFE 
TGSCSVIQTGVSSTITAHYSLKSHPSGL, 
KRSSHLSLPSSQDYRCAPT 


3868 


17769 


A 


3891 


2 


189 


GRVGERGGTLL*SQLLRR*ENHLNLGGK 
GCSEPRLHHCTPSWATEQDS I S KKKRSF 
IKALHM 


3869 


17770 


A 


3892 


410 


258 


GC*GCSEI*SRPCTPAWATE*EPVAQKE 
KRKKKR KEKK JCHVLATLA ISSLiICL 


3870 


17771 


A 


3893 


244 


412 


GIEEACGNFRVIVLKSSLALEGEEPSGR 
DSS * EASGGRARWLTPVI PALWEAEAGG 


3871 


17772 


A 


3894 


208 


2 


RISPPITPFPPWVTKKTKG* PGFFFQKN 
* KS FFF FLR W S LAL VPQAGAQ * LDLGS L 
LPPPPGYKPFSC 


3872 


17773 


A 


3895 


407 


IS 


PLLGHPPKKAGAPL,IFNLPRRPK*KPEF 
PPPPLSPTQPPPPPPPPICPAWLPPPGV 
PSSFKG*MGGPP+RPEV*GPINPFWAPS 
LPVWPQNPIPFPKKKKWGEIHCMVCPLS 
FNETFSFFKMESLRDGK 


3873 


17774 


A 


3896 


82 


401 


KGFLKNSPKGEGLFLFSFPFFFLFLATW 
LWPRLKCSGAIIARCSL*LLGSSNPSDP 
PTSTFGEAGITAAGHAQFPPPDSPPPKT 
RAHHVTQ PGLKL P S * S S P PAS A 


3874 


17775 


A 


3897 


263 


396 


VWWPGGLG* Y * TPPLLGGQGGRITRSGD 
QDHPG*RGKPVSTKNT 


3875 


17776 


A 


3898 


2 


386 


KTGLE LLGLS NP PAS ASQS AG 1 1 GMSHC 
AWPNLFLYWNHHLYPHKK*VWPGTVAHA 
YNPSTLGG*GRRIA*AQEFKTSL 


3876 


17777 


A 


3899 


140 


33 


IFFFHL*GEIGFTMNMLDIGGGFTGTEF 
QLEEVNFSVNDINKIIAFFVT*GVDV*S 
RIFTMAKTKPYNISLKLCS*KTF 


3877 


17778 


A 


3900 


377 


2 


TPPKENGGLPKIKVPYFRGNKFKKFPLC 
FSKVPP I STMWVLKNQFQPP I PFLKKNF 
FKTPKKGVSHYNFPPQPKS*GVFFFSKK 
KNFYPFFWEEQKKQRGPNAFLFFLKKKK 
RAAARPTRPPTRP 


3878 


17779 


A 


3901 


2 


123 


RVSQDGLNLLTS * STCLGLLKCWDYRHE 
PPCPALRCVFYG 


3879 


17780 


A 


3902 


188 


3 


LRCTWCPGTDGWVSHLAHL* FKKGRPGA 
VAHACSPSILRGQGSWISRSGVQNQLGQ 
HVETP . . 


3880 


17781 


A 


3903 . 


422 


238 


ILHVQIiPSSWDSKRKPFFFFL*R*GFTM 
LPKLLLNS*GSSDPIiASACQRLGITGMS 
HYTWP 


3881 


17782 


A 


3904 


.414 


1 


STHLGliPKCWDYRHEPIjHPASMSYL** I " 
FFIYLFTHWSVSLTARMQEAEDRDLIFL 
LHCYISSDSNSTSWt^YILVFVRSPWL 
YTTIAL.ILGFYVFVFVFLFFFETGSRSV 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G— Glycine, H=Histidine I— Isoleucine 
K=Lysine, L=Leucine, M=Methionine 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AQAGVQW*GLSSLQPSIPPGFK*FSC 


3882 


17783 


A 


3905 


396 


236 


DRVSLLSPRLECNGTISAHCNLCLLGSS 
NSPGL*NRGHQIFSIEGQIFLQAMSLSQ 
LLCS Ala WQKQPYTMYKGMEMAVCL 


3883 


17784 


A 


3906 


377 


57 


RGLVFFFQNPQA* KNLGRGRVFFPSGAQ 
KNPQKGGWAFFGRPPPRFGGRPRGPGAR 
CPLNLTPGGPKGKALGPPPWGWGSKFQK 
GVPLQWVGGNSGLFFSKKKHKL 


3884 


17785 


A 


3907 


341 


3 


TRFFFFSPLGKKGFFSKGFKLGTPRVFP 
LPPFLNPGPGIYFWGPIKKKKTFPPPGV 
KFGS FKRAP P FFFFLMFI AALFMWKTW 
KQPRRPS VSE * * NKLS YL* TTE Y/FFS I K 


3885 


17786 


A 


3908 


3 


162 


MMRKAIRGHL*NNPALEKLLPHIRGNVG 
FVFT KEDMAP FQKP ALLQHHSQ Y I 


3886 


17787 


A 


3909 


1 


421 


PTRPPTRTRWGFTMLVRLLLN * KHSQAG 
AVAHACN PS T LGG QGRQ I TR TG VQEQ P D 
QHGETPSRTRGRTRG 


3887 


17788 


A 


3910 


2 


424 


PRVRTSPGLALSGLTGLKFS KNQKKKGK 
KPQTTKTQKKKKSPKKKKTPKKKKGGAP 
LKKKNFNARGGENFFFF*GPKKKNPGAR 
FK KGG GG KTRG T P KKKAVG KT P F FRGGE 
KKKES PSQFYKKI KKKVFGGKKI FP 


3888 


17789 


A 


3911 


3 


236 


GFCHVGQADLELLATDDLPASAPQSAGI 
SGLSHCAWPV* PHVMSSLKSLLATRPSL 
HRVKLQKNGLRKTRELYMKNS 


3889 


17790 


A 


3912 


304 


2 ' 


VAKKGKSKPGGLKKNPPKIGLGYPQGGP 
QNQIWEKFFFLTPKNRWGAPL*GGGGPP 
SLPKKRGPPPPGWSNPPPPFFFFFFLRQ 
S CSVTQTGVQ WRDLE L 


3890 


17791 


A 


3913 


91 


344 


HAAPVCGRRTLKVHPflAPSWPPSSWAHP 
PPTWPEPSGTTWAPEGTWTLGPPQELS 
LRRGVSYHCPPP*PLIWGRKWPPPIjPCL 


3891 


17792 


A 


3914 


328 


402 


KSWGGWPXAVAHVCHPSTLGG*GGR 


3892 


17793 


A 


3915 


16 


179 


GGGCS ELR* RHCTS AWVTRARLSQKKE K 
KKKG P RKG E VATG S VW TRGE LKS WAL 


3893 


17794 


A 


3916 


229 


46 


AGPELLASSDPLTSASQSSRITGIPFLT 
GLY S KDH 1 1 ET AN I S YTNA * AL * I TLRT 
RGSTR 


3894 


17795 


A 


3917 


42 


383 


AFPCFVIGRVISKGKECTLYFFLFLFFF 
FFFEKKFFFFSQGGKKGGEFGSMEPFPS 
GLKETPLPHLLEEGKFWAPPPPPVYFFV 
F* *KRDFTLVARWSNRLPSREKGLDYR 
P 


3895 


17796 


A 


3918 


402 


286 


PPPPF*TPPPPKIFGAKKKKKNSPPPPG 
ESFSF*KRPPP 


3896 


17797 


A 


3919 


349 


380 


KKSPFPQ*RNFGSLQPPFPGLKPSSPLS 
FPSSWVPR*TPPNPANLiFFFKKMRISHV 
AO 








3920 


28 


327 


KTTLLNQATXIQAKAGTWGHPPPLSPRE 
NKQNQKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKNGGGGP * KKKNFS GGGGEK 
FFFFWGAKKNFGGGGF 


3898 


17799 


A 


3921 


414 


1 


NFPKVSGPKLMSPSPPHFFFPKKSFPPP 
PLFLE KKNGRFS PPFPE KKV * F * KKP PF 
FFSPKKKPRFPPKVLKVGGKKRVLKNLG 
PGLLSKKGRGTPPPPPLFFFFFF*DGVS 
LCLPGWSAWRSQLTVTSASQVQVQE 


3899 | 17800 


A 


3922 


1 


169 


LRSCHCTPSWATE*DSVSKKKKKKTLKN 
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last amino 
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sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G— Glycine, H = Histidine, l = Isoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N = Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














*NRAGKI FLRVFPLEKKGQTQLL 


3900 


17801 


A 


3923 


263 


74 


LLSVWRIPCAQKKERKKRYECECFLIVF 
VFDLSEENSSYISFKPIIiCVCVCVCVCV 
CVCVW* VKNFRKS * *SSDRSKTKTIKKH 
SHSYLFFLSFFCAQGILQTERSHLLPHL 
EVSLFKE 


3901 


17802 


A 


3924 


3 


129 


TGFHHIGQAGLKLLTL*SACLGLPKWWD 
YRHEPPHPAPFCD 














AWVTGAKPHLR KK * KKKRQMLKT I S KE M 
GNFNMVYILDKITK* FLYFKQLILSFLM 
GELGVK 


3903 












QLGAVAHACNPS T * GGQGQAEHLRSG I Q 
DQPGKHGK 


3904 


17805 


A 


3927 


180 


368 


EINFITNYYIILFFNRQGLALSPRL*SG 
GAI ITHGTLKHLDSSEPPTSDS*VAGTS . 
GTYHHA 














DH 


3906 


17807 


A 


3929 


2 


146 


HLNLGGGSCNELRL.HCCTPAWATE*DPA 
SKKKKSFKITSAFTCQIVML 














KERAIWGTELNMVSKIFPI *LKQRKNL* 
PPNI KCYNNLLRP * LGPGTVAHACNPST 
LGSRGGR I TRSGDQDH 


3908 


17809 


A 


3931 





102 


GGRGCSEPSSHNCTPA*VTKRDSISKNI 


3909 


17810 


A 


3932 


351 


69 


GQGGGFRFIQPPSSRGKRFFPPPPPPIW 
G*RIGPPPRANFFFLFF*KRGFFLFGRE 
IFFFPPPGVPPPWPPQNFGFQGGAPPPP 
PPFFFFFFF 


3910 


17811 


A 


3933 


401 


94 


RGPPPLNP PLWEKNHAGP PKVGG * KP P P 
PTKKKPLFS * KTKI FPPGGPPPIiIPPPR 
GGKPKKFLYPPKFRFREPKKGPFPPPRG 
TKKKTPPPKKKKKKKKVS 


3911 


17812 


A 


3934 


184 


2 


LKEFFPSRNSRSVLHTSNF*HLFI*LII 
Y FFE I E S YS VTLAGVQ WSDLS S LQ PR P P 
GLKQ 












393 ; 


KILIKVEIEGTYRKITNAICEKPTANIV 
PNRKKLKAFPLRNGTR*GCPVSPVLFNI 
TLEVLARVIYSEK* IKGTKIQISIATME 
NSKKFPQKTENKTTICSSNSTTGCISKR 
KEMH IS KRHLHS PVYCNT 














FPPPPQTRGPKSSSRAHSPPGPGPGSTT 
GAR* PFRPLLLVRASbRPPLGLPTPTGP 
TGGMQRPEA*RTPATDKPQNSRRGP*GA 
WGRPGCRCRGGA 


3914 


17815 


A 


3937 


382 




QWTAQTWSVQNELLLFCLKPHPPPSLLP 
PSSCPGPHLGTPWTPAFPSHPTSTLKAH 
PIGSGPSSPLLLTSPRMPPEIQTTATG* 
PASSLTPTVCSQHGNRFFKKGVRAGRSG 
SHLESQHFEGPEAGG 


3915 


17816 


A 


3938 


398 


2 


PPPLLLNKIGAKKKAFFCPSPFGPRKIP 
CGQKTKKSVFPFRVP*DKSTFDCFLCPI 
VYVKKQIWLAPLHCVFSGRVAQGSKKSP 
LCLSLPFFFFEMESHPVTQAGVQWRDFR 
SPQPPPPGPKPFSCLGLPSS 


3916 


.17817 


A 


3939 


240 


385 


KJjLiFLFIjFIYFF*DGVS1jCCPGWSAWQ 
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Amino acid sequence (A-Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^ProJine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TLLTEP S AI PGAGGTP VS P I 


3917 


17818 


A 


3940 


230 


381 


LIS SS YREY1AKDLRFSS KQS * WLCS KE 
FRGLGAVAHVCNPSTLGGQGRW 


3918 


17819 


A 


3941 


1 


107 


CWPVWS*TPDLKQSTCLGLPKCWDYRQA" " 
QPHRNLL 


3919 


17820 


A 


3942 


2 


95 


DLLTS * SACLGLPKCWDYRRKP PRPALF 
AFS 


3920 


17821 


A 


3943 


163 


404 


CRHKAQWLPQLPGHL.LiLSEKTIiKYSSPK 
GLRSQH*KLLFLFCFG*TESRSVTQAGV 
QWRNLGSLQPLAAGSKRFSCLSLL 


3921 


17822 


A 


3944 


3 


200 


GLDLLTS * SAHLGLPKCWDYRREPPRLA 
YFTFIKKKKKDGCELLNLFPNSCMDTAH 
VHPAHL.TGG 


3922 


17823 


A 


3945 


423 


2 


bFFFFSEV*SHSVAQAGVQWYNLGSIiHS 
S 


3923 


17824 


A 


3946 


209 


413 


TTCIEQCNKTRRLGAVAYACNPSTLRG* 
GRWITRSGVRDRPDQHGETTKNTNISTT 
KKSTKNTKISQA 


3924 


17825 


A 


3947 


425 


3 


SPSQGLRDKTP I SLGQSTWGKGQMWAQR 
QQT*TSWPDGSEESSRSPSTFEQQISER 
S SS AKGQTNS S SGS LNSVS PDWETP P SR 
SQQTPHTGELQLASGRCSSGMNLPEEGT 
DSNLCCSATHAS 


3925 


17826 


A 


3948 


299 


3 


ASQTTFREGCFLF FSFFISVCL FVCL FE 
MEFHS CR PGWME CNGDRARLHIiKTT ITT 
TIEIVKDDTNKWKHIH*RLPNIKMSILP 
KVIDILFNAIPIKI 


3926 


17827 


A 


3949 


331 


374 


NAPKSERASKAF*AILEENKEKLPLDIN 
PWHPHGHIFKCVAPPHCLGWDLSAVPA 
SHALGFPQPQNSAWHVARSKSAASCLGT 
AVTSAGSGVLGTPNKMPQKVRERARLFW 
R 


3927 


17828 


A 


3950 


439 


69 


TNQKVWVPPKEKGLFKKKQLRPRVGGPA 
FHSPPLEGQGS*FP*GQRFKPRPPPWVK 
PLFSKNPKNYPGGGAPPFFPPSPGG*GG 
KFPPTPKAKVPFIQISPPPAQAGGPTQV 
PFPKKKKKKSH 


3928 


17829 


A 


3951 


422 


3 


CFPALGIGPPHPPKRVG*KGGTPPPGLF 
FGKFLKEKNFGKNPGPGQGGPFFPPKKQ 
RGPFNNGRPGKVKKKGAPGPGVPKKKPP 
RVPPPPGFFGNPPGAPPKFFFSPRGGED 
QKRKNFF F F FFLR WS FTNEAQAG VQWR 


3929 


17830 


A 


3952 


223 


3 


ALFFWGRGTLPRVKGLNPKGGGGKFFFF 
FFF*ETESHSVAQAGLQWHDLG*LQPPP 
SRFKP FS CLRLPGS SLS 


3930 


17831 


A 


3953 


1 


388 


EKGVSPGGRTCSEQRWCHCTPVWATERD 
SVSKKKGMSNNFSLILRSWNNKGQGGTY 
FQQKFI YPCFLKI KGHSYINCKI FKK* * 
NLSLHVHVTDVNVRI ILQKLDCQIHWVD 
FNFELNILQCQGLGGCK 


3931 


17832 


A 


3954 


1 


396 


KLRPKDQDTWPTQQSAIWEFVPPSVYF 
VKSISEILFFFFLKKNFFLGPQLKGRGQ 
ILVKGNPCPGGKKKFPAPPPQRGGKKGG 
PPPPQF I FVFLKKTGV* KIGPGGV* TPA 
LG I APP FFP KGGE * RGGPP P 


3932 


17833 


A 


3955 


2 


199 


PGGGGCGEPKTRGCTPAWATERDSVSKI 
NK*KLKLKNNSWLYAKSLASSPTGR*EE 
WGCFTNEHI ~ - 
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Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, iW=Methionine, 
N=Aspamgine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W = Tryptophan, Y = Tyrosine, 
X=Unknown, *=Stop codon, /^possible 

nucleotide insertion 


3933 


17834 


A 


3956 


393 


2 


KEFPLFPPGAPPWPLSGPFSVLEKPKCG 
GG WPG INTP PGGGGF PQGS LWAGD FP * 
PLFSPQIPFFKKQKFFFFLPRGGLWAPF 
FPPFGGPLFSQNF* KTGF*GGFFFFFFG 
DGVS LCYPGWSAVAPHAS 














LVY I FK I RTCW PG AVAHACN PGTLGG * G 
GRVTRSGDQ 


3935 


17836 


A 


3958 


385 


311 


GPKTIPPASASQEGSrTEV*HHAWLIFFL 
FFVELWSSHVAQAGLKLPASSDQS*TLY 


3936 


17837 


A 


3959 


410 


119 


SPPPRFKTPPPEFIFGAPKKKFFSPPPR 
PLNLILLKGPPLFFFFLPKAFKPHINFV 
FPPQKKKNV* INLYFPPPKKENESS*GK 
HFALHCNPSCISK 


3937 


17838 


A 


3960 


77 


400 


TAL P LiKP KKKKKKKKKKKKRGGPL KKKK 
FKAPGEEKKNFFKGAPKKKFRGRV* KTG 
ERKKPGGNKKKTFGKKPPFPRGEKKKKP 


3938 


17839 


A 


3961 


2 


194 


CL * SQLLGRLiRWEDHLS S GN * GCSEP * S 
PHCTPAWVTE*DPVSRKRKKYPVVNQTL 
RGFLSPAH 


3939 


17840 


A 


3962 


52 


409 


NS KTS S QEKKRRVI RLS LRSRVGDDTKV 
WTRRWGVFGWHLKNLPTTLVLiKKMEKIlj 
ETYFMTPDLHDPHTTSY FNGCQLQAI KV 
GGTITDLIVF* IVSKPNAAGLSLNENW 
AKFNDLI 


3940 


17S41 


A 


3963 


382 


96 


LSSQLLREAGERNE*TGWLQQ?LQMQIH 
KNPCIEPRSRYCTPAWATEQDPVSKNKQ 
TNKQTKNSLHL 


3941 


17842 


A 


3965 


3 


169 


DAWELRSHRCPPAW*QSETPSQKKKKRG 
KKKI FRFYLFFNKRKKFLNGGNLGVIP 


3942 


17843 


A 


3966 


1 


121 


AGFHRVSSDGLDLIiTS *SARbGLPKCWD 
YWCEPPHLAENS 














PPFRIYWGEVLTPL.FLRVKPKFFKPWGF 
PFCLFFFFFFEMESRSVAQA*MQWRDLD 
SLHPPPPAFK 


3944 


17845 


A 


3968 


471 


440 


MELRFYHVGQTGLKIjLTS*APPVMASQS 
AGILGVSHSTWEAPNSLAITFLSSHLS 
PRQP * THLGC 


3945 


17846 


A 


3969 


214 


2 


EGQPECQLQAGDW VS LWNLVCSGLEVRA 
WREAPRLEWS FCLFFVSEM * SHS VI * DR 
VQWHNIGSLQPLPP 


3946 


17847 


A 


3970 


53 


398 


GDLPKKQQQQKPLNFCFQLLLYARLFGP 
IFLV*HLATYNNKHLLECVGKVMINCQA 
DILNKMEISKVGAQKKKKKKKKKKKRGG 
PLKKKKFIPRGGEENFFFLGAPKKKTGG 


3947 


17848 


A 


3971 


396 


160 


KRERI KEKEDNKEEKDKEKKASGKKERK 
KEKE IRDRESKEESSNLLLQEELiSLCCP 
SPGHPPNS SL *DPYFATGNSLRC 


3948 


1-7849 


A 


3972 


416 


3 


FPFPRVPEGVPGGEGVFPPGFFGKSGVP 
FPRPRGGGAP I * KPPPFPGGPPNFFF * K 
KTG P PFS P PQGAP PAKRAP P P F FGGG GG 
KI FTPLKKGGLF FFG PPKKGVF FGPKKT 
LFFFQKKKKKKKGRPPRSHASAHAS 


3949 


17850 


A 


3973 


232 


381 


YKTFSLIF*LFFFRDRVSFCHPDRSAW 
LS * FTVAS ISWAQGILLAQPLE 


3950 


.17851 


A 


3974 . 


396 .... 


. 141 


CFFPPPPKRGFFPTPIFGGPPGFFFPPF 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamtc Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L-Leucine, M=Methioninc, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














F*NPPPKIFFGAPKKKKIFPPPGGKNFF 
FLKGPPPFFFFFFFFFFFFFFFFFFEDG 


3951 


17852 


A 


3975 


286 


422 


RKQLISPHSQDNSKQKEQSWRHHATCLQ 
TILQGYSNQNS * * AP I AKT I LS KKNKAG 
GITLPVFKLYYKATVTKTAWYWDQK 


3952 


17853 


A 


3976 


3 


116 


GFHRVGQNGFDLLTS * SACLGLPKCWDY 
RHEPPRPAR 


3953 


17854 


A ■ 


3977 


223 


399 


EGPRTQSFFDTEVHSVTQDGVRRRNL.GS 
LLPPPPGIK*FFYLSLPSNWDYRRLPPR 
PVN 


3954 


17855 


A 


3978 


227 


422 


SFPSPCIFFSIiVDLGVSL'YCIjGWSQTIiD 
LKGSSLFSFPKCWDYRQ*ATTPGLPVYF 
FPFPVFTFF 


3955 


17856 


A 


3979 


519 


313 


■ KKNLLNPGDGGCREIKSSHCAPAWVTEK 
DCLKKKKKKTSVCVCVCVCVCVCV*NHL 
SSL.GLFPEPVGLN 


3956 


17857 


A 


3980 


217 


414 


IYIFCGDGVSLCCPGWLKQSFCLSLPKC 
* DYRHVPPRPTS FHLLTN IGVLQFLKLQ 
ATLS FTLFLL 


3957 


17858 . 


A 


3981 


2 


409 


KXjTFAV*DY*EKLiLiIjSIjASLiTSQAAF1jF 
PEHPYTVISNPCSPANYIFTLSSGTTFH 
QFYAKS * CVTYE * KKKNLL S P P TGDKPN 
*GRSQTKKIQTKY*SEICFFVHLFETGS 
HHVAQAG LELLS S RD P PTS AS QS T 


3958 


17859 


A 


3982 


234 


416 


SVLPSTPLI^NKL* KSKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKSRGGAPKKK 


3959 


17860 


A 


3983 


414 


291 


GGRACSEPRSCPCTPAWATERDSV*KNK 
QPNKQKNKKDLL 


3960 


17861 


A 


3984 


192 


415 


TNANKLDNLDEMDRFLETQNLLRLNNEE 
TEYQNRPVTREE I E * VIKNLLTEKKNPG 
PDGFTGEFYQTFKEKLIP 


3961 


17862 


A 


3985 


403 


145 


AFFFFFFPPGEKGDFSPTVLFGGPPGFS 
PPPVFKTRPRNFFLGPPKKKYSFPPPGP 
KNWFP + KGPPPFFFFFFFFFFWKL.GIEV 
IS 


3962 


17863 


A 


3986 


255 


425 


GGRFKGSNFTSAGLQGFXXXXGPPKFIS 
RPGV* QRGEWKNPGVPNLFAIiSTS PFGR 
W 


3963 


17864 


A 


3987 


247 


2 


EEAVRVSNLYTKLDEEQ* WKKQLNYVE 
RLKK*ELFLQDLYRKVPLKI * IF*VKNL 
NSFWPDAVAHACNPSTLGGRGGWIT 


3964 


17865 


A 


3988 


1 


394 


HHHTWI IFKFFCKDGVSLCCLSWARTPG 
LKRSS CFRLTKCWDYRHEQLCPAFKHFF 
LY*HNFMSTEKMQE * HKNS * YPS PTVPK 
GVTFYY I S YI CF 1 1 L I LS A I TYKFVHTY 
IVSLNHVCYSNEAPLSLNI 


3965 


17866 


A 


3989 


208 


402 


ETDRRLKSGGSLSKTLIKCMTCFWFRD 
GVSLCCPGWIQTPGLK*SSCISI J PSSWD 
YRHMPPCPA 


3966 


17867 


A 


3990 


2 


408 


THRPYHATPTYLPFYTNNLI KGGKKKKK 
KGGKKKKKGGPPFKKKHFSPPGGEKNFF 
FYGPQKKKSRGRV* KRGEGKNPGAPQNK 
PLGKNPLFPGGEKKKKPRFAS * PPPPPF 
S FGKKRAP PTKY PGAGLHP 


3967 


17868 


A 


3991 


81 


406 


KGGLFLLPRGKGKGKIWVKGNLGLKNKK 
NFPA * P S E EGEKKGG PHQAGK I L KFGGK 
RGLNKGGKTGQKLKPGGI PPPGP* KVGE 
KKEKPPAPGQKLFFKKRKGQKKNF 
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peptide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Gluramic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3968 


17869 


A 


3992 


396 


64 


YSSPPPTKKTDDSDASRGPFFFFPPPEK 
RGFFPNDFFWVPPGFFPPPGF*TPPPVF 
FFWAP*KKIFFPPPRE*KFFFLKGPPPF 
FFFFFFFFFEHLQVLL I C 


3969 


17870 


A 


3993 


367 


193 


KLW E KKK I LWKKG *GEIKSCHCT PAW AT 
RVKLCLTKKKKKKKKKGGEGE I FKKKS F 
S FIjGKQKKKRGPAFF 


3970 


17871 




3994 


1S8 


2 


KRANTLPDFKTYYKAIVLKTLWYWRKDK 
HRSMKQN * ESNEVELLYHNI HRNKLKMD 
HRLiKCQ 












147 


QENHLNPGG * GCS E LRS * H Y I P AWATE * 
DSVSKKNYI**TSHIFSGDF 


3972 


17873 


A 


3996 


23 


329 


RNSFIRSAFSNVLNKWLFLTSEKKKKK 
KKKKKKKKGGRKREYLTRRGAGENNKYF 
YK* KKKKNTCG* KKKGWGRKREREKKRL 
GGKKKFPPFEEEGKKKPG 


3973 


17874 


A 


3997 


164 


392 


KPLGRAGLVPCLCSSFTGFLM* KTEPCS 
WQAGVQRGHLCSLQPQSPMFKRFSLLS 
liPSSWDYRVLLRHPSWSAW 


3974 


17875 


A 


3998 


396 


3 


FFFFFPPGGKWGISDSPQLFFSGVKNFS 
PPPPPVNWGLRVWPPPQKNFFFFF*KKV 
FSFFWGGPPNPPFFFSL.LLGGPPPPPIF 
SGEKX P PSL FFFFF FFEMES CS VAQAG V 
R*RHPGPLQPPPPWLKQF 


3975 


17876 


A ■ 


3999 


383 


2 


VPSPEKVGIKKIQPLDPFF*FLGDCQRK 
KKSPS*RPPPPPEGKKLN* IRLMGPPFN 
SKEKFF*KF*AFKPFFFPPKGEGI*GFF 
SRAHSFPNQRALFWFFFFEMESHS ITQA 
SMQWHDLRSLQPPPL 


3976 


17877 




4000 


213 


1 


RGRQNKTLFLTKPNQTKPNQTKPNQXKP 
TTFL*SVIEIRVKKKYKTTRKTTKNPTT 
SLiTRSLGRYKSPTRP 


3977 


17878 


A 


4001 


412 


129 


GSL.SSPRLECSGTIPAHCSLNLPGSSSP 
CTSAPRVAGPE*MGQACMPPCLPVNFFF 
WGGGGIIjVETRSC*VAQADLELLSFSDP 
PAS AS KVTLE L 


3978 


17879 


A 


4002 


411 


293 


RDTRCLPP*LDNFVFLVEMGFHHVGQAG 
IjELiLTSSEEIA 


3979 


17880 


A 


4003 


46 


392 


QDFFFFFFFLGKGAPPPPGKKGGTPFPK 
KKTKGGGPRGPPLEPPPLGGPGPPEPGG 
GDLiKKPWPPQGTPPIi*KKPKMGGGGGGP 
P*TPGTGGGGEKNPLTPGGGPKNGPPPP 
PPG 


3980 


17881 




.4004 


360 


1 


KCLGFQFFFRFWNICIIFISQLSIASQV 
QKSEIQNAPMSIFFEHHVSFKQFQILEH 
FGFWIFILGMLSTYKTLILLjKLK*RFNA 
MPINIPWIFMEIDKLILKFTWKYKEPR 
RVKAILRN 


3981 


17882 


A 


4005 


152 


408 


NKF VSKFYWMGNVLSRYPQFT* QFS FCH 
VRLPSVF* YLYTRNKIJj I FFAHKAFFLD 
AKYVCVCVCVCVCVCVCVYFGIGEGNLP 
S 


3982 


17883 


A 


4006 


411 


67 


YLPPPSSPTPGVFFFFPPPKKEFFPNPF 
ILGTPRFFPFPPF*KPPPKFFFLGPKKK 
K* FSPPRGKKI FFFKRAPPFFFFFFFFF 
FFFFIJLGGSLS FMYFHIKTYK 


3983 


17884 


A _ 


4007 


217 


2 


PLFFFFKHLKAYYSDVIHISFAILRYDL 
.F.*DRVSLCHSGWSAVARSWi,PVAGQNTF 
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ITRQEAPYLLAI PRV 


3984 


17885 


A 


4008 


235 


406 


GRHRVSSNP I S I * RRRGLSLLPKLECSG 
VS I AHCSFDLPG * SDPSAS AS RLLGRLR 
Q 


3 985 


17886 


A 


4009 


833 


237 


SLDNRARPCLVSFIFKKGRSTAERCGAP 
GTTSPPFPRLSERGHPIIPS *RGWERKS 
QLCSHLQSSRDPSGLGRAGCGGGGDLRE 
SPGTFPGNGGSDIiVPVPTAPIPIEPQIC 
QATCAVMKLS FDEE YRRAMNE LGECPRS 
SGKPSSSPRTGGGSSAAKGTQAGSSELG 
LGIGARRGTDSLESHSTSHEWTVTLGPL 
SP 


3986 


17887 


A 


4010 


276 


2 


VEVKGNNSFSEGSITLIPNPDKQPKETT 
DQPLiMSTYAKIESPGKNPSHDG*LIFDK 
GAKI IQ* VKNSI FNKWCWDNWDFHVKTM 
NPHFTLiI 


3987 


17888 


A 


4011 


188 


27 


REKFQIYTIKNDKGSITTDTTER*KI IR 
DYYEHLYAHKLENLEEMHKLIGIPP 


3988 


17889 


A 


4012 


1 . 


202 


CLHQKVIISN*L*ATTQTTQLiSLSFKLD 
YFSIIIIPVALWTWAIIKFSL*YINSP 
PKKKEAEKDLN 


3989 


17890 


A 


4013 


1 


403 


CLDQEVIISN*H*ATTQTTQL,SLSFKL.D 
Y FS I I FI P VALF VTWS I I £ FS L * Y I NS P 
PKKKKKKKKKKKKKKRGGAPLKNSRGGP 
NFWGGGKNIFFFFGGGDKKPPRAFWKKP 
LFLGGGNLGPPPPQKFTPWGKK 


3990 


17891 


A 


4014 


67 


439 


TEIiIIFYIFFETTIjIPTIiAirTR*GNQP 
ERLNAGTYFLFYTLVGSLPLLIALIYTH 
NTLGSLNILLIVTLTAQELSNS*ANNLI* 
LAYTIAFMRKKTKKKKGGGliLKDPWGGQ 
ILAGREKIKFFP 


3991 


17892 


A 


4015 


3 


260 


LIVPTIILLPLT*LSKKHII*IINTTTHS 
LIISIIPLLFFNQINNNLFSCSPTFSSD 
HLSHP I LLKKKKKKKKKKKKKKKKGGGA 
F 


3992 


17893 


A 


4016 


258 


3 


LKKIFSPPTINLFFPPFPLKKFFFPLSL. 
*FFLGVFSPFFPPPKKGFFPKIPPGVFF 
SPPFKKKKFFFPPPFYFAPPRVFFKGPP 


3993 


17894 


A 


4017 


287 


2 


KKQKNFIFFFKNIjSFFGRGAFFPPIiFPP 
ILRGKGGDFFYPRGLKPALGNQ*NPFSR 
F* I FFFFFFFLRRSLAPVTQARVQWHDP 
GSLQPSPSGFK 


3994 


17895 


A 


4018 


228 


487 


SYDVKQEKPNGI *ASKYWAYVFHSFFliS 
FFFFFFKKKNFFFCPQGGGQGPNLSLRE 
PSPPGVKKLFGLNLSKSWE*QNCPPPPI 
IF 


3995 


17896 




4019 


35 


228 


EELNHLNQGGEGCSEQRSHHCTPAWATE 
*DSSQKKNKFWYIHPCNRIVTTTNTPMT 
ESPKHYAE 


3996 


17897 


A 


4020 


2 


402 


ARGNLNTTFFDPAGGGDP ILYQHLF* FF 
GHPEVYILILPGFGIISHIGTYYCGKKE 
PFGYIGMG*AMISIGFLGFIV*AHHIFT 
VGIDVDTRAYFTSATI I I AI PTGV*VFS 
*LATLHGSNMK*SAAVL*ALG 


3997 


17898 


A 


4021 


1 


420 


TRGATELIILDILVETTLIPTLAIITR* 
GGQP * RLNAGTYFLFYTLVGSLSLLMGL 
IYTHKTLG*VNILLLTLTAQEL*NS * AN 
NLI * IAYTIAFIGKIPFYGLQL*LPKAH 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
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ding to 
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acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Gluramic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutainine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














VEAP I AGS IGLAAGLLKLSGYG I IRLTL 


3998 


17899 


A 


4022 


409 


3 


LEMAS CLI I CNNNNPFFYQ I VTCDE KW I 
LYENW* LPAQWLH* EEASKHSPRPNLHE 
KKIMVPVW*SADGLIHYSFLNPCETITS 
E * YAQQI NVMH * KLQGLQLT F INRKGPI 
FLHNNTKLHVAQPMLQKLNELA 


3999 


17900 


A 


4023 






HLiPPWEDKARRHLPDARTFIliAFLASRN 
VRNKFFSLQATQFLVLYASSTKITKPLK 
TNQKKRLKKSEQSLRNMWTPIKKTNICI 
VGIIEGKEGEKGAERIFEEIVPQTF*NL 
VKDIK*KVQ 


4000 


17901 


A 


4024 


58 


323 


RHKJQLCEHVKKCSAVLIIREMQTKTTLK 
YHFLLVTIAKIFNLIHF*RSVGETLICQ 
WAGRNAKS * P F WRR I SQYLTKL YVCVCM 
YTHS 


4001 


17902 


A 


4025 


480 


150 


RNSLLRRLRQEDGMSFGCQGCSEL*LHH 
CTPAWVTEQDPVS KKKKKKKAYPDP I PK 
KTDKSNGKKGPDFSRKNPPKNSPPPPWQ 
RSTAFRGRAAKS CPTH 


4002 


17903 


A 


4026 


204 


1 


KKKFGALFFFFMQGLiTLLPRLEYR*WCD 
HEFLRSGWLSK 


4003 


17904 


A 


4027 


116 


472 


MPPLPKPVSGPRPHSCSPLGGPQLSTPL 

Y LVLQM LGLS L FMA * RGAG S S C S AH KKW 
WLGLAS PSC*TFFKNRTKGRARWLTPVI 
PALWEAE 


4004 


17905 


A 


4028 


355 


444 


VSYKKK*SRPGAVAHACNPSTLGG*GGR 
IT 


4005 


17906 


A 


4029 






AS I I LL I AVFFNN I LSGQ * T ITNTTNQ Y 
SSLIIIMAIAIKLGIAPFHF*VPEVTQG 
TPLTSGL1jLjIjT*QKIiAPCF 


4006 


17907 


A 


4030 


49 


474 


PXRXXFCHIVTYYSGKKEPFGYIGMV*G 
MISIGFLGFIV*AHHIFTVGIDVDTRAY 
FTSATIIIAIPTGVKVFS*LATLHGSNM 
K* SAAVL * ALGFI FLFTVGGLTG IVLXN 
SSLDIVLHDTYYVXAHFHYVLS IGAVFA 
II 


4007 


17908 


A 


4031 


274 


427 


FFF*EMESHSVTQAGLLGRNHSL,IiVPQT 
PGLKRS SLLSLLSNQYYRSGPHT 


4008 


17909 


A 


4032 


205 


393 


TWGGFKNPS'PGEG* IKKMHDFNEMFLQN 
CWKI LFFGGGFFTGQI FPRAAPP INKNI 
LVWGGGK 


4009 


17910 


A 


4033 


2 


386 


SEPLSRKKLYLSILISLQISLI ITFTAT 
ELIIFYIFFETTLIPTIaAIITR*GNQPE 
R LNAG TYFLFYTTj VGS £iPL2j I AL I YTHN 
TLGSLNILLLTLTAQELKKKKKKKKKKK 
KNIGGPPLRAPSGAPN 


4010 


1791 1 


A 








G * I IR YLHANGAS I F FI CLFLH I GRGL 


4011 


17912 


A 


4035 


3 


317 


HASADAWAFIGYVLP*GQISF*GATVIT 
NLLSAIPYIGTDLVQ* I*GGYSVBSPTL 
TRFFTFHFILPFIIAALAALHLLFLHET 
G SNNPLG ITS QKKKKKGG P F 


4012 


17913 


A 


4036 


446 


194 


KKKNQPGGPPPPPAFPVGGLIKKRFPPT 
PGKKFKFR * KFFP P KGG FFVRD P P PKPG 
CWEREKKKKPLLNTDKWVILEGSEAGLF 


4013 


17914 


A 


4037 


213 


1 


GMAKVKARFLPQKRGQRVLNP.PWWGKGF 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Mcthionine, 
N=Asparngine, P=Proline, 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PNSGTKETPFFFF * DRVLLCQPGWSAVA 
QS * LTTASTFWAQAI 


4014 


17915 


A 


4038 


307 


36 


RKGFFFSKCGEFKKLPGPPLGGWYLKLI 
FCQKKFFHPKFPGFPG* IFLPPPKIFFI 
FSFEMEFCSCCPGWSAMGRPRLTATSAC 
WAQANS 


4015 


17916 


A 


4039 


2 


372 


NDTSLTVKS IGHQWE * TYE YTDYRGLVF 
NCYILPPLFLEPGDLRLLDVDNRVILPI 
EAP I R 1 1 NTSQDVLHS * AVPTLC * KTDA 
IPGRLNQTTFTGTRPEVYYGQCSEICGA 
HRRSMLIVLELI 


4016 


17917 


A 


4040 


107 


279 


LLLLFFFFWRKSLAIiAPRWECRGKI *GH 
CKLRLPSPCHSPA*ASPEAGTTGWTYGC 
F 


4017 


1791S 


A 


4041 


3 


351 


GSHYIAQAGL.KNPGSSHLPASAAHSAGI 
TGVSHHAWLKI FFKESVSDYS PHLNSRK 
NAVYVS FFFFWKKS FVFVPQAGGKGLNL 
GSL KFS PLRLKQ FS CLTLPRS * E YGLAP 
PPPV 


4018 


17919 


A 


4042 




385 


RPRRPDIE* QERRTQEVLQ AVAKKVKE E 
SQLPGTGGPKNVLQPVPRAKAERPKKQA 
EASGLKKETDWLKVDAQEAKTEPFTQG 
KGGGQTTPESFEKAPQVTES I*SSELVT 
TCQAETLAGVKS QEMV 


4019 


17920 


A 


4043 


3 


282 


TAILIQTP*SFTGAVILIIAHGLTSSLL 
FCLANSNYERTHSRIIIIiSQGLQTLLPL 
IAF* * LIiASIiANLALPPTINIiLGELSVL 
VTTFSVDAA 


4020 


17921 


A 


4044 


2 


407 


GTRLEIMSRNHGIFPFTLEIFKDNEFEE 
PYREALPTLiKLRDSLYFGIEPEEHVSG* 
ESLEESCFATPTSKTDEVLKYYLIRDG* 
VSDDSVKQYTSRDHliAKHFQVPDFKFEG 
KDHKEVILH*RDLVCGVMDERSR 


4021 


17922 


A 


4045 


4 


380 


ENWLTI IRQAWHEADRNLNTTFFDPAGG 
GDPIIjYQHLiF*FLGHPEGYIIiIIjPGFGI 
ISHIVTYYSGKKEPFGYIGMG*AMISIG 
FLGFIV+AHHIFTVGIDVDTRAYFTSAT 
1 1 IAI PTGVKVFS * IATLHGSNMK*S AT 
I I IAI PTGVKVFS 


4022 


17923 


A 


4046 




94 


ADRNLNTTFFYPAGGGDPILYQHLF*FF 
GHPEGYILILPGFGIISHIATYYSGKKE 
PFGYIGMV* AMIS IGFLGLI V*AHHIFT 
VGIDVDTRAYFTSAT INI AI PTGVKVFS 
*LATLHGSNMK*SIPTPILIFRPP 


4023 


17924 


A 


4047 


376 


138 


LLARQRQENGVNPGGGACGEPRSRHCTP 
AWATE*DSVFLjLWREALLTQTAPFNRIjQ 
IiLALPSEFFHFRILIVLQPHHSQ 


4024 


17925 


A 


4048 


145 


226 


E P I QACS SRRVCE P I QACSSRRVC VCVT 
PS RHALPG C VCV* E P IQACS SRRVCVCV 
RAHPGML FPACVC V* E P I QACS SRYVCV 
RAHPGMLFPACV*AHPGMLFPACVCVCD 
PIQACSSRLCVCV*ERAHPRLLF*ACVC 
VCESPSRHALPGVCVCVRAHPGMLFPVC 
VCESPSRHALPGVCVSPSRHALPGVCVC 
V 


4025 


17926 


A 


4049 


2 


414 


ADRNLNTTFFDPA*GVDPILYQHLLi*FF 
GHPEVYILILPGLGIISHIVTYYSGKKE 
PFGYIGMV* AMIS IGFLRVIV*AHH I FT 
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peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartrc Acid, 
E=Glutamic Acid, F=Phenylnlanine, 
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K=Lysine, L=Leucine, M=Methionine, 
N—Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W— Tryptophan, V=Tyrosinc, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 














VGIDVDTRAYFTSATI I IAI PTGVKVFS 
*LATLHGSNMK*SAAVL*TLGFIFL 














LTRDGGACPFS KLFRRVRQKNPLVLGE * 
SCSEPKLCPCP P AKKTKDY PVS KKKKKF 
I WI INRTKI PRKRKKNKAYHKVKYF 


4027 


17928 


A 


4051 


238 


3 


RRHGS FAPLPPGLPLC I S Y I WL, F I TI LC 
NILYLYFETGSRYVAQVGLKL.LCSSEPP 
PSTS*STEITGMSHGARPHNIP 








4052 


225 


172 


IARYPFHIQSCTNMPSPNKSITSKEEDW 
GPGAVAHACNPSTLGGQGGQITKSGVRD 
HPG*HGLV 


4029 


17930 


A 


4053 


471 


30 


GPGGIKISGALFPLNKVKRVPWFPRPP 
LIFPQKGVFNQNGGDPGGGFWPGETPGR 
VPPPPGVFFVKSPFLSLPFPPPPV*KTA 
VKTGFPYRPCGVPPRGLVFPKKKKSVLN 
RPDLSMGPQPE + SLPLPGEPSVSHAGCL, 
PQAWRG 










425 


41 


NWGSGTLCPQKLGVFPPPPPGEGGVPGA 
PPPARLFFFFF*KKGFS PVGRGGFKFLP 
PKNPPPLPPQKVG I PEGGPRAGPI FS PP 
QLFFFFL.TWSLTPSPRLECSGAILAHCT 
PAWETEQDSISKQTKK 


4031 


17932 


A 


4055 


312 


1 


IKNARAHLLPQGSPPPLIPIWIIETIS 
LLIQPIALTVRLAANITAGHLIMHLIGS 
AALAISTIILP*TRIMFTIIIERTIIiEI 
AVALIQAYVFTLWSLYLHA 


4032 


17933 


A 


4056 


648 


123 


DPDPVPSARGCVPSGAPGRGAQLKKAWD 
GALA'LPPCLCAQEIjPSFSLGTGGPAVG 
LSVNSERDGRLGEVSINAEFTVAAFLKG 
TQSRRGINGMPLPQREPSSRYCLGLGGE 
SSAQGTSGLTGQRV*QNSDLPELASGDD 
KGQRSSQGGAVSQSSPRGRQIPPSSPPA 
ALFNFL 


4033 


17934 


A 


4057 


135 


464 


QHNSRLMQQAKKGVTVLARVIDLDYEDE 
I S LLLHNGGKEE * A WNTGE PLGCLLVLP 
CPVINVNGKLQRHNPGRTTNGPDTSGMK 
VWVTPAGTKPQPAEVLAEGKENKECG 


4034 


17935 


A 


4058 


3 


575 


L.RSRPLHIILSCGL.LjVTPRSLSPPPPPQ 
RLRLCRPS RGAAE F FFSLRTKLHFATL P 
L.RIEGS KTL* NCC * LYHGC* I S * I HHGG 
IiFDVAAKNI IHE EVE KYDKQ YRG KELLG 
FATYKTFEIIVHQYIQ*LVEPALSMLQK 
AMET I Q 0 A F I NVANKHFGE F FNLNQT VQ 
S T I ED I KVKHTAQAE YM I QLQ VR 


4035 


17936 


A 


4059 


389 


3 


TFKGPKVFRYPTFFL.CPPPRGFFSKNIW 
TPPPPPFFGVFPEREKIFFFLKGAIjPLF 
SFRVTPFFGLKRGFFLSPTPPFFFPPFF 
WPPPFFFFFFFFFPLRRSFALVTQAGVQ 
WRDLG * LQPLPPGFKR 


4036 


17937 


A 








F*KAPKGFCAKFFPFFYPPLPKENGKTH 
RQKTPFPHFCQIFKNGFWVLHAFSLGDF 
PARVWGPPFKRFFKGIFKEFPPYGSPPP 
KGPKKKKRTAARDLELADAW 


4037 


17938 


A 


4061 


38 


410 


AWISI * ERPFTLVRVFKNFLAISGL.LFS 
YFRINFLYVQIHIHTFFRKLFVLSYVFR 
FTDVKFFFYI LPFFPPHFLFGW I CSYAA 
Li*FFIYFCFLPFFFNGPTSP* IGLELLI 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFKKPTFGFGG 


4038 


17939 


A 


4062 


405 


81 


GG ANVFPQKS P P FGNKKGGH FFSYKILN 
NPGHPPFLKKKKNAPGGWGNPWFSPPWK 
GAL* PKKKNPPKKKKKSPPPKGGKKKNP 
FPPKKKKKKKKA * GDNFDMGTGN 


4039 


17940 


A 


4063 


415 


239 


RPSFFFFPPPKKSCFSTFFFFLSPGFFF 
PPPFFFTPPPFFFFFPPKKKKYFFPPPG 
KKFFFF*PPPPSLFFFFFFFFFVFFFFF 
FFFFFFFFFFFFF * GLTS PHKQIGL.VLA 
FLIiALS KITHAS I PVPVS S PS KKKKKKK 
KKKKKKKKKKK 


4040 


17941 


A 


4064 


95 


241 


SPCRSPHRWVNS*ANNLI*LAYTIAFIV 
KI PLYGLHL* LPKAHVEAPI AGS I VliAA 
VLLKLGGYGI IRLTLILNPLTKHIAYPF 
LVLSL 


4041 


17942 


A 


4065 


28 


288 


YFDI FVEARS P YVFQVGLELLGSSNP PA 
SASQSVGITGGSHCAQP+VTILFTTCTL 
HVNG * I * C I KFL I HN I P I L VI S LQY I FG 
FFL 


4042 


17943 


A 


4066 


338 


134 


PGGGCSEPRSGHYSHCTPAWQ * SETPSK 
KNKKKKXTYFSVKNWGIRVFIEIIIjNSF 
TTVRP 


4043 


17944 


A 


4067 


26 


365 


WVSLMTS ALAMRVDFY F I TLL X LGLLT S 
TLTIYQ*WRDVTRESTYQGHHTPPVQKG 
LRYGriLFITSNDCFFAGFF*AFYKSSL 
APTPQLGGHWPPTGITPLNPLEVPLLNT 
S 


4044 


17945 


A 


4068 


49) 


174 


TQLKTH*GSTTDNRTEVRVEPRVRTNYK 
DLLKFLi*SKGYDFESETETETIAKIjVKY 
MYDNRESQDTS FTTLVER VIQQLVLSHI 
FKDNYANINAE KLSGN 


4045 


17946 


A 


4069 


210 


402 


NVS KGLRKVPSTQ * VIN I S CRPG WARA 
CGPSTLGG*GG* ITRSGVRDQPGQHGET 
PSLGPGAV 


4046 


17947 


A 


4070 


202. 


1 


DSAIALQPRQ*EQNSVSKKKIYRTIDLR 
SEYFARWSFTLAARAGVQWRNLGSLQPP 
PPGFKRFSHAS 


4047 


17948 


A 


4071 


412 


143 


FLGAGV* PRLGPKG*TPFLLKNQKLIGH 
GGGALYS QYFGGWGRRNS FNPGGKG FTN 
QNFP PSLQTWGKKG VS FS KKKKKERKTR 
KWGGE 


4048 


17949 


A 


4072 


2 


324 


RRGNFCMF*SDRVSSCCPSWS*SPGLKR 
SSCLSLPKCWDYRFEPLYLARLVLMRCY 
STHNTYIMYCQMQRLFQGHRYLSVSTSS 
ATRPYL.DNFFFFCKNKVMPCCQG 


4049 


17950 


A 


4073 


2 


326 


RRGRMLPRTPGRPGNDAQQPQKHSLVD* 
LiFIYLFYFIjWTGSHSVAQAGGQWRNHGS 
LQP* PPSLKGSSRQSAGRVKGVSHCAWL 
LLDFLSFGKPYTYKKVDKWSFEA 


4050 


17951 


A 


4074 


422 


116 


EIKQEKNPGPFFFFPPPAKRGFFPTPLI 
WGPPGFPPSPFLKRRQGGFFLGPL*KGN 
PPVFPPPKVFFFWAGPPFFFSFKKKFFF 
XjVPQSTFIHFIFHICYLL 


4051 


17952 


A 


4075 


3 


294 


VFCHVGQAGLELLDSSNLPSVASQSAGI 
TDTSHLT* PDI.SFCCKTKFD 


4052 


17953 


A 


4076 


60 


389 


PSAQLGSTKHIHMAVQP*PPPSPELLHL 
PKLMLCPHSAHTPRPPPNPGTVCTYSPR 
SSPAFPQEGPSSGEGLGGLVSLFSSPPN 
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^Lysine* i>=L*u rii 1 ^M-Mct n^onine' 
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nucleotide deletion, \=possible 
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HVHPAPTSPESGRGRGGRVPREGAYV 


4053 


17954 


A 








KPLFAPGVLFGGPGEPKREKKGGAQGGP 
KKFSPPPPPFSFPPPKTLWVAPLFQETQ 
GLRLCP PFFF * RGKGAL F * KRGGGRFKP 
PPPPPGLLGPPTP 


4054 


17955 


A 


4078 






RKVEGRVS*DEDLKLiTELIjRYYMLiNIEA 
AKDLLYRRTKALIDYDNSNKALDKARFK 
SKDLKSDCAHPRDC*RA*APYFLFAKNE 
LWPTGEQRHFLQDVPLIVQRTLTLRTP 
DHTSLPLSL 


4055 


37956 


A 


4079 


6 


298 


PLAIMGGFFTLAETNRTPFDIiAEGESEL 
VSGFNIEYAGGPFALFFIAEYTNIIIIN 
TLTTTIFLGTTYDAPPRP*LSPSLFYYE 
PPSPYPTPWSTST 














PIPATREGKAGNSLiNPEGEGCNKRRSRH 
CTPAWQEGKTPSQKNKKVLLLALKV*SI 
FHLMES*KYLKPGFSLDLITVIPTYFDL 
FRC 


4057 


17958 


A 


4081 


347 


1 


VLKPRPGNI I FS PKKKKNLP PP PGRNYF 
FFFPPPPFLFFFFFFFFFFFFFFFFFFF 
FFFFFFFF*RGGLLLFRLKIQLPPTPHP 
YLSNTLRGWNPGEGSCWRSEGGRGEDCL 
QDH 














STFI ISLFPTTIFMCLiDQEVIISN*K*A 
TTQTTQLSLSFKLDYFS I I KKKKKKKKK 
KKKKKKKKRGGGGLKKKKFNGGGGGKKF 
FFWGGKKKNWGGGVKKPGGGKKRGGKKK 
KGGGEK 


4059 


17960 


A 








LEFPIKKKLSPPPPPLKNFFFYTPPPFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFLLM * AKKF I SWR 


4060 


17961 


A 


4084 






FHRFGRDGLDLLTS * SARLGLPKCWDYR 
REPPRPAFLPQP 


4061 


17962 


A 


4085 


223 


401 


NGKLVSVFFGFCFCFFEM*SRCVAQARV 
QWRIFSSLiHFLPPGFERISCLSLLSRWD 
YRC 


4062 


17963 


A 


4086 


160 


408 


AJjPPRRGGGGAPS FNFCLLPPGP*GKKG 
GYGDAPSCRSGLGQHGETAAPLKAQKSF 

APPPPRLGGKAAP 


4063 


17964 


A 


4087 


261 


81 


YD I DGRI FCQKLES SH YS PGAVAHACNP " 
LQA 


4064 


17965 


A 


4088 


428 


140 


QPLKNPPLGGRAPPFLRGKKIiPPPPPQG 
EPPPF*KKKKKKRGGGGPPFFPPPPRGE 
PKKSL* PQKGGGKKKKKSPPPPPQKKKK 
NPPPKKKKKKKK 


4065 


17966 


A 


4089 


1 


205 


FRLRQENCLNPGGGGCSELRSHYCTPAW 
VTEQETLSQKKNFFN* KTFRKAKSTGNF 
HMPNTS IQQLP I 


4066 


17967 


A 


4090 


154 


410 


ITGCS I I KYFYRAMREREKSEVGYFPSS 
SINTPLRHRLSDGVLDSRITSLSTHSPA 
P FI KS KNRPGTVAHACNPSTLGG * GGRL 
TK 


4067 


17968 


A 


4091 


428 


67 


LPHPPPPPPPLSSSSSSSSSSSSSSFPP 
PPPLLPGQGRTLTRPLPAPAGQLGSRES 
CGAPPSSLGASLHVA*NADGRQ * RAALS 
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TPHGPLTGS PAAW RGR R EQDLLL LLC S 


4068 


17969 


A 


4092 


431 


3 


APTSNKGSTIPILQIRKLRATPCSSIGP 
SPSTLPVKLEL*SVLLPGQWLNFSFCHF 
FPRPKCSNRPRYTLPVPKLPSSGLPPRP 
SPSQQVTAFPPLTSRKEFSSPSTPASDA 
FPFPPAPVSARRQE VLCES IPS VTMHE 


4069 


17970 


A 


4093 


151 


1 


S IMPMWMDAVAHACNPCTLGG * GGWI I K 
SGVRDPPDQHGETSSPPKTQKL 


4070 


17971 


A 


4094 


35 


363 


LQSKIYCTFFFFLERGFFFSPRVGRPGA 
HPGLGAKTTGVKGNPPP*PPKGGDPGGV 
PPPPGCFFWFFLGKGGFFLLPGGG*NFG 
KKKKPLVWPPKGGELKGGTPGPPPY 


4071 


17972 


A 


4095 


376 


1 


IiGKS P P FLG AP F FL KKGSL LG P LLG KG F 
TPP*LGGPHPLEKGPLLGQWNPNQKVWK 
KRGFPQKGGENPPFFFFFFFFRDGASLC 
CPGWPQTPGLKQS SNLSLPS S WDYRCLS 
KSLFKENTETLAS 


4072 


17973 


A 


4096 


65 


446 


PFFLFFLNKGPLFTPKRQGKPRNFPT*T 
PCPRGKKNPPPPPP*GGEKRGGPPPPGK 
FFFFFGERGFPGGGVLFLDLGTPPLGPL 
KGGE KRGR P PG PG P KK PFL * KR KKNS P P 
TGGERGRGGKKHCLP 


4073 


17974 


A 


4097 


34 


387 


I KEADE YVLTRLACVFVGDMEPLSSPHR 
LHTRIGSSPSTTCCTPACPCSSW*HGAS 
VIPSQTAYEDWFITLYNVLYTSLPVLLM 
GliLDQVGAS HKQGH FWTDENALE KSQAN 
VSMHSSGVYFFLIKPYYNSWRHQGSWL 
CFLFCNKERGF 


4074 


17975 


A 


4098 


80 


422 


ILHVLYTCI ILCFVLQS * ILWFVLQNYI 
LHVLYTCI ILCLHVLYTCII YMCYIHV* 
FCILAIiFAGLiS*DSPYHESL**SFFMEA 
DISLCRCVITDQLNFICSLLQH*ELRMF 
QF 


4075 


17976 


A 


4099 


74 


402 


IYLLSTHLHYQFSWISITFDISFFFFFF 
LKKESLFFPQRGGGGGNLGLLKPPPPGE 
SPFPPPIFRGGGKKGPPPPPWVIFFFF* 
KKGGSPIYPGGG* IFGPGDPPPPPP 


4076 


17977 


A 


4100 


3 


335 


DAWASAWVTEYDSVSKKKKKKRGGGGKK 
KQFPKKIGPRGSLFKPRGGEKPLSLKTP 
PPPFF*F*KPPFKKGGAEPWEPFLKPPF 
CLKDRGGAPKIYFFPNPPGAPRGAAFIK 
RGGGK* +GPPGGKLL 


4077 


17978 


A 


4101 


406 


1 


RFPKPFPPKPKMSKWALFK*KGFFPPRE 
LLIGPPPPPPKKKKFFPTLFFAKVLPPR 
FF*NFNQVSKKRPTFPLKVGQ*PPPMLV 
QKKPTRGFPALFKKKKRSSLVI RE I KRK 
PTMRYHLTPIRMAVFHKSKNNRC 


4078 


17979 


A 


4102 


2 


183 


VNPGGGACSEPR* CLCTPAWVTERDSVS 
KKRADNDKQGALRSFFLGFPFFLPLHPT 
SFAF 


4079 


17980 


A 


4103 


119 


391 


DNLQCFFFFFWKRGLLFLPRGGGFKSTG 
PFSFGVPGTPPPPPPGEGGLTPKPPPPG ' 
* ILFFLEKGDFPLLARW+NRQKKKNPP 
PSPPKGG 


4080. 


17981 


A 


4104 


413 


64 


GFKPPPPKDPPPPPPQKGGFMGWRPPPR 
PFAPLFSPPPPRGGGGIFP*GGGGGAKK 
FHIWGPFG* KKPPPGGG*NSGGQKNPPG 
GGGKNPPQKKKKKKQPTTANIWFLCLEI | 
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LSGM 














RGRVGKVLiISLi* W iDLLWXjLCILICFHK 
TQ * LCNG * CHS RMKCVCVCVCARVC VCV 
CVCERERD 


4082 


17983 


A 


4106 


1 


335 


FLVET*FHPVGLAGLELLTLGDPPR*ST 
LiGDPKCWDYRCEPbDPASVSFFLCLKbF 
IFVLLKLFFFFKPPLYSKLTRGRDNNYN 
YFPPPNPGPLPHWRADKS I I GGNRRLL 


4083 


17984 


A 


4107 


1 


148 


GERGCSELRSCHCTATWAAE * DS IYKKK 
KKKE P P KFFLAG FSGGGKKTP 


4084 


17985 


A 


4108 


331 


41 


GLNFLAQKFSGRFSPPPPGRVPLGPKPF 
FFLGKPPILEGFPGP*KVFFSPILPPKE 
VLTFFSPPGEKIGPPFSGGACSGFPPKG 
AKKKKKKRPWSN 


4085 


17986 


A 


4109 


137 


3 


TKKERN I NRG PG AVAHTCN P S TLGG * S G ' 
RITRSGDRDHPGQHGE 


4086 


17987 


A 


4110 


3 


269 


GFHRVSQDGLYLLTS*SACLRLPKCWDY 
RHEPPHPGFLGFFCFFFFFRDSVQPGQQ 
SENPSHKKKKDEGPFWKGAGKIFLFKPN 
SSAL 


4087 


17988 


A 


4111 


196 


330 


' GAIARGRSNCDCI*GSVGHVL*NDPTLN 
CILDQDTLYGS IMINEYYERASDAWALL 
PDEEQLPEGDQTVIA**A*DMSGGQQHR 
VSLiARAVYSGADVYLLiDDP 


4088 


17989 


A 


4112 


426 


241 


LLKRVRHEILFTIRGKGFSEPRSPPCTP 
AWATE R I S VSHKE KKRKVL VR P * VCLSN 
SSLSVL 


4089 


17990 




4113 


394 


1 


GPGVFSPSGCLYHARAQYFWPHKKKKYF " 
PHPGQKNSVFLKGRPLFGFGFLLIFFFF 
LVEMGSCCIAQAGLELPTSNNPLTSASQ 
RAE I KD VSHRSQ * FF VFLFE TGS CS VTQ 
A*VQWCNHSSPRTPGLKQF 


4090 


17991 


A 


4114 


328 


2 


KKRAH P I WPDTGI E VPNSPS PNI RFLGA 
PESFFFKRNLPQGDFLEKPNPCYFFFQG 
VTTPPFPLFPPPPKIFFFFF+DGVSLCR 
PGWS AVARS QLTATTHLPGSS DS P 


4091 


17992 


A 


4115 


220 


3 


FKKGE PGDPPPVWFGLiGWVWVWVGFFFF " 
ERHS FTLVTQVGVQWCNFS SLQP PPRGL 
KSFSHLSLPSS*NYRH 


4092 


17993 


A 


4116 


342 




PINFLYLSFNPNQV*RSPNFSKILHFCF 
KILSFFYTLAFLPFSSQLIRQSFFRVYL 
INTI CFLQDLYYLYI LNLiYLI liFLFFHL 
AS RLLDFCLFFVFETVLLCHPG WS TVAQ 
S 


4093 


17994 


A 


4117 


124 


352 


TSGFT*TRMKSKIPT*RSFI IDLLEAKM " 
KEKFLKAARETQL I T YRGTS IQMPVDFS 
WRKTEAKAQWNLMFEVLKEKNCETRSLH 
PATMS FRNEGKTKMFSDERKLRDS VTLP 
XiKD 














KTGKLLATS APGDG I MVMVETFCLWS F 
FLSVSKKKKKNPLPQIKNPLPKPCWVFK 
RAPTPNH*KAFPGPPKTPP*KQPKKQIP 
CRPPKKGPFCI*PNPGINPSPLFFEKKN 
KGKAPPFKNSKQK 


4095 


17996 


A 


4119 


2 


407 


NTFQDQSGS S SNRE PLLRCRDARRDLE L 
AI GGVLRAE QQ I KDNLREVKAQ I HS C I S 
RHLECLRSREVWLYEQVDLIYQLKEETL 
QQQAQQLYSLLGQFN*L.THQLECTQNKD 
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ZjANQVSVCLERLGSLTLKPEDST 


4096 


17997 


A 


4120 


157 


2 


LTGSHSVT*AGVQWCICHGSL.I j PRPL,GSK 
RSSSLNFLYSWDHRCGPPHSANF 


4097 


17998 


A 


4121 


3 


168 


IALKSGRKVDIRELGHWCPGWS *TPEL 
NQS TCLGL PRC WDYRRE P P WliALNMF 


4098 


17999 


A 


4122 


2 


376 


ETGFHCVSQDGLCL,LTS*STRLGLPKCW 
DYKMREPPRPADGFINIRDFNPLLRTST 
CCCHVKKDIFASPFTMIVSFLRPPQPCR 
TDPIQNTLPLVIMSPVALLGCDSFSAFP 
VLMNLPFSSDCTK 


4099 


18000 


A 


4123 


1 1 


221 


GTQLLIjRLRGEDLLNLGRGGCTEPRLHH 
CAPFWETEPDPV*EKKQAAKAKNSLLTA 
ALLKTQVKKDSVAW 


4100 


18001 


A 


4124 


2 


125 


AI I KKTKSNRWW * GCR*RGMLVHCF WKC 
KLVQPLi * KAAWRL 


4101 


18002 


A 


4125 


425 


152 


TPVI LAIiWEAKAGRSLE PRS * RS AWATW 
QNPIS TKKY KK * GGRITSAWGG * GCNGL 
* WCRSTPASATETPSQRKKKK* EKNLFT 
LILKVIK 


4102 


18003 


A 


4126 


3 


99 


CQAGliELLTL*SDSLRLPKCWDYRCEPP 
RPA 


4103 


18004 


A 


4127 


396 


58 


ENCNPLSWSGGI ISGLLLVSGYSNKFPS 
FLVCLLPVIGPPENTAGPSLVPGTKWHD 
HSTL*TSQTPGPSLVPGTKWHDHSAL*T 
SQTPGPKLSSRLSLPSSWDYRHKTPYPA 
H 


4104 


18005 


A 


4128 


51 


549 


LGQSYLLLXRKCFSFNFQVGDLDISYIN 
IEGITATTSPESRGCTLWPQSSKHTLPT 
ETS PS VY PLSENVEGTAP P * AHQS FMSP 
PSWGGS PLNLFFGGGGFEKEQS PLKKKS 
FTLYPLGPPSEGEGHF* PSPLCFPPGFQ 
SPPKTGI PSGDELGFF*TTPGPDKKK 


4105 


18006 


A 


4129 


97 


362 


RK*SACLSLPKRWDYRHEPPRPACSC 


4106 


18007 


A 


4130 


376 


279 


DQDDLDLLTS * S THLG I S KCWDYRHE P P 
HRAS 


4107 


18008 


A 


4131 


133 


344 


IIFFFF*NFVLQAGGQGGNLGSLKPPPP 
GLKQFLGITFGRSWDHGPTSFTRANFCI 
FRKKRVL1L1CCPGWS 


4108 


18009 


A 


4132 


164 


3 


MNEMSYESSLLDHLLKRQ*NQCCLGSSE 
PSPPRLKQFSKLSLPSSWNYRKTSK 


4109 


18010 


A 


4133 


355 


2 


GKKQNRKTGNFKTHSASPPPPPKERSSS 
RATEQSWMENDFDEMREEGFRRSNYPEL 
REDIQTKGKEVANFEKNLEECITRIPNT 
EKCLKEIjMELKTKARELREE CRSLRS * C 
DQLEE 


4110 


18011 


A 


4134 


193 


357 


HHCNPSLCQNDLFWHL-VLSPPTGVQKYN 
HGSLQP*PPGLE*SSCLSLPGS*EYRH 


411 1 


18012 


A 


4135 


357 


104 


LRRL*YENHLNPEGGGCSEPNLLiRCTPA 
WATEQDS ISKI 1 1 1 1 *CNFLH* QNMQKI 
KQNPKTNSSKILTEIKISLKGNLRELRE 


4112 


18013 


A 


4136 


179 


381 


DTSLHIIQNRLSISKKKL*TLYFIY*DR 
VPLCCPGWSAWQS*FTAALTSLKRSSC 
LSLLSNWDCRC 


4113 


18014 


A 


4137 


89 


318 


CLE I S CTKIGQDLYTENYKTLLGKVCKC 
PNK*NTTFMDQAEVSILKISFLPRLIYR 
LRI I PVR I PAG I F VE ILKFG 


4114 


18015 


A " 


4138 


315 


60 1 KKLPAEIHAPLAEEATD*AELKPAGETH 
j AEVQAPTEQTPAAE AATT I AE AS VfCVQ P ' 
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K==Lysine, L=Leucine, M=IVIethionine > 
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PPAEEAS LAKFPAEIQPPADQERGSEDI 


4115 


18016 


A 


4139 


377 


51 


GFTSPFPED*RIPGLFSCIGWTLQSFHF 
RNIGSVPFSPPGFFFFPQEMRVLLCCPG 
WSQTLGLKPSSCLSLSNSWVYRCAPLVP 
GMPIF^LSCFLAIGFLEFVIYIGY 


4116 


18017 


A 


4140 


413 


175 


NFG I FCRDGVLLCCPGWS * TPRL.KPS S C 
LPKCWDYRREPRTVPGLPFLFLNLRKDG 
NS VLE I YLEYKI TQ I I VQRLLVL 


4117 


18018 


A 


4141 


1 


394 


EIFVTKRWSGLGWAHASNPSTLGG*GG 
QI TR S GV * EQ PDQHG S PE RRGL KERRGL 
MVMKATFLGLRHHLEHIVGHRELRDLHN 
NSKKKISSHPLSTPHVPGTVIjKPCRLIL 
CFKHHQKPVRKPPNS PS YR 


4118 


18019 




4142 


22 


376 


L vnkgkti ffflkrg wlppkgkgrg in 
tvngslnfrgqgnpl.a*ppkkpgtkggg 
hnpgei * *ffgkkea*qcgpggsgipgp 
rrpsgltlqkggtygrepsplpgptken 


4119 


18020 


A 


4143 


161 


1 


P I PAKYEHFC F FVS LF * DRVS LCHPG * S 
AW*SQLTTASTFQGSSDPLTSATP 


4120 


18021 




4144 


18 


159 


KHLDPGGGGCREPRSCHCTPAWVTTAKL 
HLKK* K*VELKSPSVIHT 


4121 


18022 


A 


4145 


383 


109 


LFFYKIQNYTGHGGPCLLTPFLQRVKQK 
NCFNPGGGGCR* PKLVFCPPTWGKKQGF 
VFQKRLKKPKPKPKPKPNKRGLFIiPPFS 
KRPKPKI 


4122 


18023 


A 


4146 


.374 


164 


QCQLLRSLRWEDCLSWGG*GCSEP*SCP 
CIPAWVTARSCLQINKSLPAEIVIPSLY 
WQIARG 


4123. 


18024 


A 


4147 


25 


375 


RKKKALiFFCQGGSQPPPSNIjMDPPPPGE 
KK I S WLNLi FPKKE KKKF * K KKKF F * KVK 
KNPFLKPKGRPPPGEKKKGKKGKKPTPQ 
KGKGFLFAPPPKKPIKKSPHFKIPKKQK 


4124 


.18025 


A 


4148 


376 


202 . 


HYNSKVFPGGPKRSFLFFLKALSFFFAP 
APIPFWLHSKIFFFFF*DRVSLCHPGWK 
IiR 




18026 




4149 


3 


144 


LFCPDWTPTPGLKQSSHLGLPOCWDYRH 
*VTVPGLPISFVFFFFPP 


4126 


18027 




4150 


3 . 


188 


QLQSQLLSSLRPEDHMNLGVEGCGKL.SL 
HHCTLA * MTE * DS I S * KKI LKKKCF * KK 
HSFLY 


4127 


18028 


A 


4151 


83 


358 


GWARWLTPGIPALWEAKAIKSLEDLNA* 
A* QTYAPS * EES LHP IMNTL, I RDPVWT 
SFRVTVDTSTIARHWLFCWDLiLQASIGH 
ILILGPG ' : 


4128 


18029 




4152 


162 


1 


TVWY WHKNKL.SNQWNKI * FL.ELDP I TYK 
HLVYDKTGI SNHWVKDALVNTQCRAG 


4129 


18030 


A 


4153 


369 


20 


FQLilARQRQENGVNPGRGAFG* PKSRPC 
PPA WGTK * DS VSKKKRKKKS CE F VHRI L 
KLEEAYHSLGSNTAQRLFVSCFHKKILC 
USXSI PDPYLQFKMWPVPPSKPL YQID W 
VGLL 


4130 


18031 


A 


4154 


117 


2 


QIFFFFFFFKMESHS IAQAGVQWRHLGS 
L*PLPPRFKR 


4131 


18032 


A 


4155 


369 


217 . 


FFSPPPFFKPPPPPKKIFPP*KKKNPPP 
PKKKKI FFFFFFFFFFFFFFQTP 


4132 


1S033 


A 


.4156 


214 


283 ... 


ASPSEGQGLSSGP*GLIN*SDKPSVTHS | 
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nucleotide insertion 














PKPHCETA*GGTSFI J SSSGSSPQAPKSP 
HHGPDSPL,SDSIPMVISPPEPPPLRAKG 
CPQGPPGASWWGTT 


4133 


18034 


A 


4157 


319 


712 


QDFVSKQHSYCLKNliKSSSLIQVAMNRA 
QVCL ISSS KSGERHLYL I KVSRDKISDS 
NDQESANCDAKGNNFICIMLYKIWLN*K 
RLMK* HWTAI * LKI Y FSNKMKFSNLNLS 
SVINS FDVKKERLNANTLN 


4134 


18035 


A 


4158 


183 


369 


LKRKQSQRGEVNFLKVTQPECGKAGILF 
IYYFFEME SHS VGQTGVQWCNLG* LQ PL 
PPESSD 


4135 


18036 


A 


4159 


190 


336 


DPISIKNFWPGTVYFLIETGFLHVG*AG 
LELPTSGDLSTLASQSAGITG 


4136 


18037 


A - 


4160 


10 


390 


QGILLPCFSMSSEERGRIYSNTFSFFFF 
FWKGSPINPQGGSQAKKPR*REPSPSGL 
TPQRPGNGGPPPPPGQNFFFKKKGG*PG 
GPGRA* TPGPRGTPPLGPPKAGNKRGGP 
RGRAKNFLKNQNGPT 


4137 


18038 


A 


4161 


53 


376 


YLFAFFFFSKREAWQGGQPGMEGAKKKF 
MAP* PSKGGGGEFSGLTPPNQWNPRAQP 
PPRGELILILEKKRPAPNAQPGPENLGP 
REPSPLAPQKGGNKSENPWPPPP 


4138 


18039 


A 


4162 


1 


356 


GVFILVSFNFWAFFLSFFFLGKQILVLM 
PQPESQGQNLAPQNPPPPG*RGFPALTP 
PRTGNKGMAPQAPQF FG FLKKGG FSHGG 
QGGFKTPTLGNLP PLAPQGLGNNGLGP W 
PPPFFF 


4139 


18040 


A 


4163 


345 


26 


APGFKPALGPPGGPFFSKKKSKNLPGLR 
GSTLQAVGPTFLGG*GFa?SLKPPGG*AS 
RGPGWGPSPPPGGPN*IPVSEKKEREGK 
KKKGRE KKKET FSSNLLIFLSF 


4140 


18041 


A 


4164 


18 


284 


TLQGTSGIFEGNFFFFLGTGISIiYCPGW 
RAWGGDHGSLQL*PPGSSSPASTPLGRS 
WDYRHV P PG WAS F F F FKTL PNG VTQAGti 
ELWG 


4141 


18042 


A 


4165 


377 


30 


FSGKKKFFQLGWGCGPPPLIPPLWGV* P 
GQPPRVGGFNPPLPPWKNPVFFKKPKQP 
GVGGASPYSPFLGKFRPRIPFTLGPEGS 
G*PNFPPFPPPWAPKKNFSPKKKKKFYN 
CL.W 


4142 


18043 


A 


4166 


305 


1 


ESRQVLTSRLTLLSIHEFNYLLNNSTYK 
HITSLPINIRQYGQDLKTLKKETKDLNK 
* GNI PCLGIGR INI VKMSLLS KL I YKFK 
AMPVKIPGELFLRNQQA 


4143 


.18044 


A 


4167 


394 


78 


EGKKQGRKKGRREGS*EGRREREKTR*K 
AEKEGMKEGGREGKKEGRKEGRRKGRKE 
GRKEEGQKGGRTDGGWWVLRSLDTSQLI 
S FASG P KWREAS S LVWASLCL 


4144 


18045 


A 


4168 


1 


431 


CNTCVR* CAYV3CV*MCGERLCWCVCSCN 
RVCCV*VERVCVLVCMSVCWCVCCCARQ 
CVSVCHSRIESSRPSSGPVAPSKCQTHL 
YLPKLSAPPAAGLWDLPSEATASLSGSL 
CQPSCPRTRSFATLVSSMYHRPVTGELQ 
SIT 


4145 


18046 


A 


4169 


3 


423 


YNARE IEAAAGRDHATALQPGYRVRPCL 
KKKVLKLGQAWWLA*F*NFFFETGSHSV 
PRLECSGVISACCSLNLPGIV 


4146 


18047 


A 


4170 


1)8 


402 


Q I CT CTP KP KMKV KK I TMLSNNMR YL Y I 
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last amino 

peptide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=lVlcthioninc, 
N=Asparagine, P=Proline, 
Q = Glutamine, R=Arginine, S— Serine 
T=Threonine, V=Valine, 

X=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














HLTQDI *DLDIDL*NTADMNLKDLNK*R 
EIHCLRI *RLNIIKTSIFLKLPYRVHAI 
PFKISARSFVE 


4147 


18048 


A 


4171 


404 


285 


RGCGEPW*HHCTPAWVTE*DLVSKNKMK 
QNDNLSHLSLA 












431 


ESI/VGQRNWVAKYFYRITAPGRAQRLiMP 
AI PAP WEAEVGGS QGLE I ETLL * NNS S R 
PGAEAHACNP S TLGGRGGRI TRSRD * DP 
PGQH 


4149 


18050 


A 


4173 


24 


366 


FLCLFHTCVLAFPVQQLCSVGWGCKVEN 
ADREEKQPCDSQAREEPRLCRRGFNRAP 
NINTRHEKLFFVCLF*TGSHSVTQAGVQ 
GCGHSSPQPQLPGPKRSSHLSLRSSWDT 
GM 














TEPVLANETWGPWDRGPLGLGGEjGPENT 
DNSHAVGADQLLKEPPFPLSTRSIMEPM 

QVM *SWGIRGILPQQ PWKMGASLS LTAS 
LSLS FAHTIjKHKHTHTHTLY 


4151 


18052 


A 


4175 


90 


387 


KGRWLFFPQGGERGHNG * TAPPQTGGKE 
NPPPPPPKDRGKKANAPPPGGIWYLKKN 
GFFPIGQGGPEPPPPRGPPPPAPPKGGE 
KRGGPPHRPGKKliLK 


4152 


18053 


A 


4176 


398 


280 


RRLKWEDPLNPRV*GCS KP*SHRCTPAW 
VTE*DPVSNFF 


4153 


18054 


A 


4177 


253 


395 


LFNFFFLVKMGSRYVAQAGLKPLA*GNP 
PASAS PRAGITGGSHHTQP 




18055 




4178 


366 


48 


PWASPGISLSFSFIjRTKPTVKVR*YRVG 
PQRQPCPSRWAPPACPFSMFPPAWVSPP 

APAWTAGLGEKQKLAARMRGT 


4155 


18056 


A 


4179 


44 


413 


GDGVNSAFFFFFFLEKKVWFIPPGGGPK 
PEFGFRAPPPPGVKKIFGFTPLRTGE*R 

DPKGAPPPGPPKGGE*RGEPPWVGWNFA 
LLKERKPLFKE 


4156 


18057 


A 


4180 


396 


235 


HEVSLCCPDWS*TPGLKYSSCLGLPNCW 
DHRCRNHGPGKTQLiEQGHRQVAGLiSR 


4157 


18058 


A 


4181 


85 


408 


YTCSLRITQDHFFFFFFFLGKKIFFFPP 
GWGAGGQI LTS PKTPPGVKGI PWLNPPR 
GGGPGGP PPPPTL.FLGF* KKRGFPGGGQ 
KPRPKGDPPPLPPKGGGLKGGPTP 


4158 


18059 


A 


4182 


1 


149 


NHLNL*GRGCSE PRLHHCTPAWVTE * DS 
VLKKKS Y I HLGQNPTVRKVLL 


4159 


18060 


A 


4183 


310 


3 


ATAPSQFIRIN*RNHCHYIEVQSEAASA 
NX.ETAVNNPENLGK I MDEGG * MKKQ I WK 
EMTSRIFIVEKSMPGFKTSKAQALILGA 
NIAVNFRLKPMLIYHLEN 


4160 


18061 


A 


4184 


1 


359 


PTRPLVLDRDRPPFFFFLGSGKKKNPGV 

GGGVFFKPLPFLGGGPFLKNSGGVDKKK 
RAPPFFTKTPL*KKPPPPFPPKIFKARG 
FW*KMGEPFFFCPPPPSFLKKGPPPKKE 
GV* KKP P P P PFLTP P P PNQGNF FFFWGG 
STPPQNI KKPLGFFSSQTPKKKKRAADR 
DLELADAW 


4161 


18062 


A . 


4185 


1 


362 


HFTKYLFISC*K**LL,CWMQDCCKKGIP 
MNTNVIQEKATSLYGSLKHKEGEGPKAG 
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correspond 

ing to first 

amino acid 
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peptide 

sequence 


Predict- 
ed end 

otide 
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correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutatnine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EFNASKGWLDNFRKSFGFKNVKITGEAA 
SANQKAADKFPDAIKKI IEEKGYLPEQI 
FNAGESAL 


4162 


18063 


A 


41S6 


366 


47 


AALFLVDPPGRGGVKGPPPPPGPFFVFF 
QRRGVPPLSPGGFFFRPPNSPPPGPPKG 
GGSGGGPRGPA* THGVFLKKNPPNCGAG 
KKKGFFLKKKKKKPVFWFSVK 


4163 


18064 


A 


4187 


394 


112 


KNTENLKISWAWWHVSMVSAPNEAETGG 
STEGTGCS KL * SCHCTAWVTVRETLS PR 
KFFKKRANYLTFRI *KVCSLYHRKTKNS 
FKKHTQGRKF 


4164 


18065 


A 


4188 


230 


372 


KPMFLPFNPAILLLGIYLKEYKSF*HKD 
TCMRMFITALFTITKTWNQ 


4165 


18066 


A 


4189 


149 


2 


NRSLRFMRFKEGSHLPPI*VQGKAASAD 
VEAAASYPEGLGKTINKGGST 


4166 


18067 


A 


4190 


2 


159 


MHQ P WLL.RR P KQENCLNPGGGGCSE PRS 
HHCIPA*AIERDSLSKKKKKGGAF 


4167 


18068 


A 


4191 


180 


415 


LG FLQLLL.FVGRSMFI FALELSHGWLG S 
SLSF'PVCVKQSSHLSLSSSWDHKHMSP 
YPANFCIFCRDKTLPCCLGWAQ 


4168 


18069 


A 


4192 


424 


41 


PPLLRAPPKGLL*PKNSFPPGAPP*IPF 
PPKKIKIFKFTPGGGGAPLFLLPRQVKA 
EGSLPPRVFGPPPFFFPPLPSPLGAKPN 
PFFFFKKKKKKEWLiHKSSLFYSAHFVLC 
LKYS hS FFYQS VKTK 


4169 


18070 


A 


4193 


105 


424 


ELKRLVI KLIRG I PE KGKAQCKE IQKLA 
QEVKGE I FME I GSLKKKQ * K I QETXDTL 
LKMQNALESLSNRIEQVEERNSELEDKV 
FKLTQSNKDPSQIKKKILYNV 


4170 


18071 


A 


4194 


3 


240 


LCLQSQLLGRLRQENPLNLRI*GCNEP* 
LHHCTPAQVTERDPVSKENKGSFIPMKI 
GKLVSLTLNRMCNCKNSEKENV 


4171 


18072 


A 


4195 


385 


1 


KMI ILTKKMEIKHKDEKELQKTEVDLKI 
ENNTTRAR LTNNI KKKP PES EKTNTKNH 
TKKSL*DQDTKKETRPQKKKNPFKSR*V 
RHWRNILNLESRNRGKKNPSGVQKRRDS 
VPTLITQTQETWWWCF 


4172 


18073 


A 


4196 


3 


272 


LFYLIMALKHKSSDVGNSNI YAKEKLES 
AS FKER I * KS YVE VAKI YGKNE Y CI YE I 
VKKEK* IMHSI YRVQYYLQFQTSLEVLE 
RIPHG 


4173 


18074 


A 


4197 


236 


379 


GGRFKGSNFTSPGLQGNSFFMGPPKLNS 
RAGV* QRREGKNPGVPQFNR 


4174 


18075 


A 


4198 


375 


1 


NFKIPAPPPQNKGPLRVGTPPFLIFSPP 
GSPPKSFPKAKWARRDRPF*NPGETLQE 
G*VFFSGRFFFFFFFRDRVLLCHPGWSA 
VAQSRLTTSSDSRVHTILLVGRGCSEPR 
S HHCNP AWVTR VK 


4175 


18076 


A 


4199 


1 


270 


PSRTGPQIPRRPTRSNCPNQFliGF*GCG 
EPRSRLCTPAWATEQDSISKKERHGLLV 
FLKGFI H Y IRLL INLFAARGRKRVLFAL 
GGREMW 


4176 


18077 


A 


4200 


388 


1 


PPGIINLFDPGL* PPPPWVMGPPPS*KL 
ISPQKKKNKVPPPVPPINIFSGPPPPTL 
VFWVLFFVSLKRQLIL* * KSGVSAASLV 
RHDGAPRLTSSAGKGHAEPTFQPRPTHH 
RNACSFVLIHIQSYKVF 


4177 


18078 


A 


420! 


298 


158 


FKI VINFQFLTP I FAPSEGGRGG *GRGE 
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Predict- 
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nucle- 
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correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutnmic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L-=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LRSCHCTPAWVTE* EPI * KRKI KGKKRT 
EQESRRQNLA 


4178 


18079 




4202 


128 


358 


KLMIPPSPGQPPRPGGRWGGHPPARPAA 
PSGR*GAPLPGRPY * EKRI PSARTPPRL 
GGWPSMS LRTGHDDNGGVWE 


4179 


18080 


A 


4203 


406 


1 


FFFFFLRQSFTIiVAQAGVQ* RNL.GSQQP 
UJPGF 


4180 


18081 




4204 


322 


2 


KPRPRKTPGGPFPI *GGGGAPPPRGGAP 
KGGSPPPPPFPPPGGKKGGKKKNNPPGG 
*KPWGGFFFFPPPF*TPPKKRGLFLKKK 
KTQNPKKKKKGGRSRYRTSPRV 


4181 


18082 




4205 


3 


127 


GFHRVSQNGLYLLTL*STRLHLPKC*DY 
RSEPIiCLACFLFL 


4182 


18083 


A 


4206 


1 


216 


GFRRVGRDGLDLLTS *SACLGLPKCWDC 
GR+PPRPAHLGGNSNAKEPGL.PACPLL.S 
GHMHRAVWVRWATATL 


4183 


18084 


A 


4207 


427 


0 


LiFFTTNFFFWARVFFbPPFF*NPPPGFF 
FLPHKKKKNF PPPPGFFFFFFYAPPPFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
PWLKTWSC 


4184 


18085 


A 


4208 


2 


404 


PRVRLGIWCKERliFLCYFFFFliERGFCF 
LAQGGG PGGNFG * RAP PPPGEKKIPGPP 
PRGGGEKGPREPGRKIFGFLKKKGGGPF 
GPGGVGPPKKGPPPPKPPKGWGYGGKPP 
AQKRGGFFFGKKRAPFFNWVNG 


4185. 


18086 


A 


4209 


40 


412 


PLFCLSEKPRYFEYAIYLMLSLKFLESC 
IMSVNSSAVSASVYY*W*KIEREPLTSW 
TLEKLDPEVFHQKFAFT*RAKAILRKKN 
KFEGLTLSGFKTHYKTTIITTVWY*HKD 
TQIDQWNRIESS 


4186 


18087 


A 


4210 


219 


392 


HFFFLNYNFtjGKG VS FCPQAEGQGRNLG 
* LHPPPPGLKQFFCLTLLRNWNHRLVPP 
PP 


4187 


18088 


A 


4211 


409 


2 


LERKTVFFSPPFIFFFAPVFFLSPFFYT 
PPPLYIFCPPKKKKIFPPPPGK.1FFFFK 
GPPPIFFFFFFFFFFFFFFWS*FFIMYQ 
IVFNISYNLIKVDSHLYLFKDKKIL.LCT 
ISSDAV) 


4188 


18089 


A 


4212 


24 


399 


ADAFSTTNLiHGLGPDLFTPTTQLYINLG 
MAILL*AGAVWIGYRSKIKNALAHFLPQ 
GTPTPLIPILVI IETINLLILATALWH 
LTAI ITAGHLLMHL>IGSASIiAGSTI SLP 
STLTVFTILILLT 


4189 


18090 


A 


4213 


411 


197 


SWLTAPSFKRFSCLSIjLSS*DYRLPVPH 
PDNFC I GS RDGARM VS I S * HHD P PAS AS 
QSVGFTAPKVIjGLQT 


4190 


18091 


A 


4214 


218 


126 


KEPEPLLFRSSWSVGAI DLKNVMRRVLI 
KSCTPGWAHTCNPSTLGGRGGQITRSG 
DRDHPG*HKSALGSCASCFIYLWD1jLLiH 


4191 


18092 


A 


4215 


1 


332 


MAPSL.CLSNKKRGFIGPGFAGAPKHPGR 
GQGHPREKLAFFKKGPPKPGLKSFLVLK 
TPPPKPFFFLS PKGGALKTEEEP * TKKK 
GGAEKKPPPGEKRPLGPFCPKKPNWG 


4192 


18093 


A 


4216 


1 


185 


KLYLSILISLQISLIITFTATELIIFYI 
FFETTLIPTIiAI ITR*GNQPANFLYLLV 


4193 


18094 


A 


4217 


1 


388 


LRFCWETLFPLPYNPFSFHFSFLFFSFS | 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFLGGKFCFCPQPGGGGPNLNLRDPP 
PRG*KEF*APPPPRGGENQGSPPPPPNG 
GGF* KKGGLNHGGGGGFKPPPQKKPPPL 
ASQRGRNKGGAPPLLKF 


4194 


18095 


A 


4218 


3 


195 


VIYSTIFAGTLITALSSH*FFT*VGLEI 
NMLAF I PVLTKKINPRSTEAAI KYFLTQ 
ATASHPA 


4195 


18096 


A 


4219 


261 


1 


E P RRRERKE RKE KKKG ATP S R I CM KRE R 
EERFS *ERRERGEEGERGEKEERERGER 
ERERSEEERERERERPKN*MRLNGRTRR 
TRG 


4196 


18097 


A 


4220 


5 


375 


DEMLHLKFTYILN* TLKDTI I PKVNENL 
YN*DFLNSKVQGTPPPPRPSRPPSSPSP 
DPP PGP PPAGAGRRARRPGRRGS PPGG P 
PAPPPAPPPRAGRAAGGRGRPPARPGGR 
G PAAR PAGGGG 


4197 


18098 


A 


4221 


258 


267 


GGGALKKKI YCGGGGGNFFF *SPPPSFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFL 
I YFIFILIF 


4198 


18099 


A 


4222 


127 


330 


KKKKKKKKKKKKKKKGGGPFKKKKFLiPR 
GGGKNFFF*GGQKKKIiGGGVKKKGGGKK 
PGAKKKKKRGEK 


4199 


18100 


A 


4223 


1 


377 


RRFHLRRENLQEKQTWMGIFFFFGVAGP 
PGPGGKSFSPPSFFF*GKNTFFTPPPGG 
GGQNLVGENSNSNP P PPRDPP PSGPSEK 
I * KKTPPPFLVGPNTKNKKHFGGGGSPK 
KKTLLPPPPLDPF 


4200 


18101 


A 


4224 


223 


431 


IRKTGPLGFGGQNNPLPKV*PPPGLrFF 
QKGPPPPPFPPFQKPRFLKFYPKREGPP 
PPKVGLTPPPGLL 


4201 


18102 


A 


4225 


173 


2 


RCKLHCLFPTLAI ITR*GNQPERLNAGT 
YFLFYTLVGSLPLLIAIIYTHNTLGPTR 
P 


4202 


18103 


A 


4226 


11 


377 


FLTWFICISLFGRLMGI*FFVUFFCEL 
FFHAFFSLFtiKTDI IFSIDLiLEI IIYAH 
VCHICCNLIFSLYFVFYFNGSLKYIKC* 
NLFAI ILDFADVSFLIFMFRTGKRQNDN 
QFSLKILIGF 


4203 


18104 


A 


4227 


1 


412 


KNSKVKNATDLLKNASESPHSRIDEAEE 
R I SELEDRLFENTKSEETK* KR I KK* SM 
PTDLENSLKRANLRVIGLKEKVGKEL*V 
GSFFKDII* QNV PN1>E KDNI Q AQE AYTT 
LSRFNPKTTSRHLAIKLSKVKDKER 


4204 


18105 


A 


4228 


373 


163 


IGVFLGGAPFFFFFFFFFFPQGPKRGGG 
KGGASGKTG*GGGGLKLGGGKQGWPPFF 
FLRTWPTFTRAPGL 


4205 


18106 


A 


4229 


392 


3 


PPC*NRAPGFNFGGPFKKIYSSPPPRGK 
FGSLKGPPPFFFSAPPGGPKFSKGGGPL 
FFGAPDF+GGFGGKNYFWGQGFP*PHSP 
P P P FS WG PKRRAP FS KKKKKKRRRG WGE 
GEGQGSNHLLKHGYFQ I 


4206 


18107 


A 


4230 


211 


400 


YALIWLYCILNNYALHSVFFWCCFCFIjI 
FKFI YVFLNLLFLFAF* LVLCIiLKVQNH 
SGSLSIVSYFPLLEACRIFPLFPLF*NL 
* M I CFNMWLYFKQLCSALS VFLVLFLF 
FNF* IYLCI FKFIVFVCFLRRNPTLAQA 
GVQWHDLGA | 


4207 


18108 


A 1 


4231 


69 


386 1 


KRXFFFLGPRGGGKGKMWVNGNPPPKGK | 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
C=Glycine, H=Histidine, l=Jsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Aspnragine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GNFSA*PPQRGGPKGPGPKNKVIFGFKK 
KRGFYGGGQPPNLGKEIiAGPPKGGEKRG 
KTH PGG KFYAFLGGGLFS KKRN 


4208 


18109 


A 


4232 


394 


287 


FKEKKKPPPPPLFSPPFF+KRGGGDPLF 
SPRGEKK 


4209 


181 10 


A 


4233 


442 


1 12 


LLGRLRWEDGLSPQGRSCSEP*SCHCTP 
AWATQ*DPVSNTNNKNPNPQNACSSLNP 
SLPWTAAVWRTLGEHCNGREAXjPARAKN 
ATKAPVLTTHHICQTVRTISCALLPH 


4210 


18111 


A 


4234 


415 


58 


SLQKGHFPPFQEL.DPQNPGRSGWSPVLP 
TTGLFFSLDFIRFT* FMAGVLVGYKHCW 
EWTAALVLLL FLRDRVS LC FPGL I HTLG 
LKRSSCLSLLSSWSYRHTI PHPAST* FF 
G 


4211 


18112 


A 


4235 


406 


168 


HGETPSFLKIQKLAGRGGTLL*SQLLGR 
WRQGNHLNWGGRS S VY I YTE KHTVHTQQ 
SVTTPFMSMSQTLAYSPLSLKSS 


4212 


18113 


A 


4236 


247 


397 


TVCFSVYIYTELLPPQVK+ FPCLTLPSS 
WDYRSVPPLPAYFCIFSRDGVS 


4213 


18114 


A 


4237 


36 


405 


RLYCFIKGLNVKNKTFRIVFFFFLETNF 
PFGPQGGGEGANSGFPEPLALiGVKKGPR 
PPPPGGGE*GAGPPGPGNFGLLKKKGVP 
RGGGGGPKPPTQRFPPGRPPQKKKNFFG 
PALGKKKVFFF 














KKKKKLTRPGGGGPFSPLPKRVRQKKGG 
NPGGGFSKKKKSPPPPPPRGKKKKPFSK 
KKKRFLSGNNTGKNRLGKDPQS * PAGNA 
RKKPGWERAIARGRAPGMTDRKPQHSGL 
DGSQQSGTGQEPGNSGPVPKVH 


4215 


18116 


A 


4239 


144 


405 


PTVKLVLLYLVLLKVAVSKNLR* *GGLT 
V*QQRQILWKRILSNTEFLYLRQSKMMH 
LVL * PTVNTLKSLKSGQARWLTPVI PAL 
WEA 


4216 


18117 










PLLQGTPASWPRWGRAGDRPRKEG* LHP 
RETRVPPALHHQPAPAGVSLTSPGPPFW 
LSLQRLRGHSSSDSLPAVCQYSGSWREE 
KAAAEAPALTPAICTCTICSVliTWL 


4217 • 


18118 


A 


4241 


397 


1 


ISKSLFFPIPFWKKKKKGFSPPLiNFLGG 
VFFFFFLKIFEIFFLKRFFFPIFLGAPG 
FKIFKKYFPFPRGFKTPPKKKIFLNPFF 
KKKGGGGGALFFFFFFETGSRSVTQAGV 
KWHDHGSLQP*PPRLK*SSDP 


4218 


18119 


A 


4242 


230 


393 


ADRPGS PLPS APETGSHS I AQTGVQ *HN 
YGSP* PLTPGLKLS S CLSL PNCWDYR 


4219 


18120 


A 


4243 






KQPTFQSTTGGSPSAGRPPPPRAKKKKP 
PFF*KKKKKSGNIGGPLFPPPPGGEEKK 
NLFPRKGEGSHKTDSSPPH*RGGKKKNL 
SPKKKKKKKAWCWDYRRKPLCPATYF 


4220 


18121 


A 


4244 


2 


425 


FVFKIILKNLSEIQENLQFSKIKKTIHD 
LNKKFNKETGNI TKNQTE I LELKNSMS E 
I KNTVKS FNNRLNQAEERI S ELEDRS FE 
MTQADKKRKKGEESLQDVLYTMK*TNIL 
WSFQKEKRKEKGLQNI LNE I VAENFPSL 


4221 


18122 


A 


4245 


12 


424 


IYYKTLNCT»IF*FGGALACFHFSYPFY 
IHYVGFVPSPVYLFSHVI ISVWTSGYLL 
YSL.GCNP ILSLFI FLLKLFHFGYLTI QV 
GIYVWILFVCLFLSNSFINI * FTYHAIP 
I PI Y I PLKVYNSMFYS I FTTK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, V=possible 
nucleotide insertion 


4222 


18123 


A 


4246 


215 


425 


LSIQISQIRNLIYLFINIMGKILRAKRK 
♦GTSDAWNQTIFLRPRTVAHSCNPSTL 
GGRGGRITKSGDRE 


4223 


18124 


A 


4247 


413 


2 


WEVESFSLGKAFMWGCATENPGPPHPKF 
LLFIMKAPPPWRVGFQDCFWPKDGRFr* 
PRTP PCPRARGAKKNSVS KKKKIKIWKR 
WPPTPLPPPGNGGTRKFPSSPPPPPPPT 
TPKITPPGGERQL*PQRAEGAGDTK 


4224 


18125 


A 


4248 


61 


281 


ITWKDFDSFSVLRVSKAFHLALPAFIGN 
CIKCHIKMGIYPEEKFKNIHKYLYLEID 
VCGH+TWKKIFFLFFFLRQRFTLFAQAG 
GQWLDLSLLQPPPPGFKRFSGLSFLSK* 
DFDS FS VLRVSKAFHLALPAFI GNC I KC 
HIKMGIYPEEKFKNIHKYLYLEIDVCGH 
IRLGKKFFFCFFF 


4225 


18126 


A 


4249 


23 


13 


R VR P FAS PGGRE VTVCGGLGREGQERRL 
RWHRKP FLPLAATS CAWPRS VTPS VTS 
GSADGDGQSLAVGAGTLATVGGLELLNS 
NDPPASASQGAG IAGVSHRAWP*T 


4226 


18127 


A 


4250 


182 


342 


KGFFFLPPGGGGGGEF* FNEPPPPRVXG 
IFPPPPPGKGEKPPPPPFPGYIFVF 


4227 


18128 


A 


4251 


2 


219 


PS LRKMQKLARHGGTLLW SQL FGRLRRE 
DCLSSGGRGSQTHGSEL*SYHCIPAWAT 
DGDPVSKNTKFFFFFF 


4228 


IS129 


A 


4252 


223 


324 


LGAVAHICNPSTLGGRGRRITR*GVRDQ 
PGQHGE 


4229 


18130 


A 


4253 


165 


336 


ESDAQGHQVAARVSHVLEKDALLVFRSL 
CKLAMKPLGEGPPDPK*ADSSSWPPSVQ 


4230 


18131 


A 


4254 


3 


233 


ETAFCHVGQAGLELLTSSDPPASGLQHA 
GI TGPATKS AP * W S * VSG PHLGAFDS LA 
MLLAQDWRALLESAKFFPLL 


4231 


18132 


A 


4255 


394 


44 


AKIFPPGVKKFFASTPPGGGKKRGPPPP- 
PVNFFPFKKGGGFPPWPGGV*NPAPINP 
RPWPPKRWGFKGEAPPPPPKLNLFFPQG 
GGGKNFFGKKKGAPTGGPPFFFFEMEFH 
CRPD 


4232 


18133 


A 


4256 


152 


2 


YWS PF YCRSCC I RQGTVAHTCNPSTLVG 
QGARFTRSGVRN* HGQRGES PR 


4233 


18134 


A 


4257 


404 


190 


AQLIRGWRQ*DHLSLiGGGGCNEPSSCHC 
TPAW I TE PNQSLS QKTQKQKML I PN * YG 
♦TVSPPKSHPELQFS 


4234 


18135 


A 


4258 


288 


407 


ENCRPRAVAHACNPS* ENCRPRAVAHAC 
NPS AFGGRGEQ IPRS GEQEQPVQHG 


4235 


18136 


A 


4259 


406 


163 


GWGVQTHPGHHGETLFFLKKKK*VGGGG 
PTRYSPI FGGVGPQNGLKPQGGGSKNPK 
LNPLLPPWGQKGVFFSKKKKKKERD 


4236 


18137 


A 


4260 


101 


276 


LTS I VPS LWEAEGAKLRGLGRPACRE S S 
LCHCTLPWVTG*NLVSKKKKKRGGALGF 
FF 


4237 


18138 


A 


4261 


2 


414 


WVAAT PN L VFVGRGNCQ DWFTGPFLTRG 
GWGGVLFPPVKRVIKGLVAPFSGPIPIR 
KNHQKVGG FLE PNKTNFKE * NFVENKNT 
RRGI KKNVPEKKRPQKGIPLPPQI FNEN 
QTWGDFDFFFFAKKKKI INSFLGWA 


4238 


18139 


A 


4262 


337 


382 


RDARFLHCSLV*LIT*LQKEFFFVGQAG 
VQWHYFGFLQPLPSGFNQFFCLSLLSRW 
HYRGPPPSLGKFLVGWLVGWLVFETKFW 
FCWPGWRAITGFWVPATYTFRVQAIFL 
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SEQ ID 
NO: of 
nucleotide 


1 SEQID 
NO: of 
peptide 


M 
eth 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 
ed end 

location 
correspon 

last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G = Glycine, H=Histidine, I==lso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosino, 

nucleotide deletion, \=possible 
nucleotide insertion 














QLLGRLRQENCHLGGEGCS*LRSPPCIS 
AWATKLDPVSEKPKTKNQKPTGLGGLNQ 
GPASLLKKIRKLGKKRPG 


4240 


18141 


A 


4264 


233 


440 


1.PVHHGWGSLRKLTILGKGEANPSFFTW 
GQQGKAKQKGQKQPGAGAHACNSSPLGG 
QGR* ITRSGVQDQ 


4241 


18142 


A 


4265 


2 


230 


GARL*SHLLVRLRKENHLNLKGKNCREL. 
KWPHCTPAWGTKKDSAAKNKNQPLPPPK 
G VKS P PQREHGL I VTCMFVQ 


4242 


18143 


A 


4266 


! 


349 


HKTNIHFKLVMVSNVFTFFFFGKGFSFC 
APGGRAGPLFGLVETPPGREKAFFGPNP 
PRRGE*RAMAPCPGKILVF* KKTGFPRG 
LNFWPGGPPPFGLQRGGNNGGNPWPGPE 
FFWF 


4243 


18144 


A 


4267 


2 


375 


RSEQPAGSASSGNEGLSTRASGCGGCTG 
S PNSAS PPALCS I SRRALAAFPRGRARD 
LQPAMPEPPTHSVGSCAAKPPR*APPPA 
PRRPVPSTTQGLRSAGAQRGTGRHLHLQ 
PQCRIHWVKPAG 




18145 




4268 


377 


3 


TPAWMTERDCIWRRRTSAPGGSWPSGPV 
PSPGAQ*RPPSQGLGLWWAAAAAPRC*T 
APGPRPPPHGPGSPQGASPPTRPPRCRP 
HPRAGSAGPTGATPPGSTQGQRRRHSHQ 
LPGHPGHRVALG 


4245 


18146 


A 


4269 


I 


294 


ZjEDWGGRGARAHYDGFSLEPESDHYRLR 
LGQYHGDAGDSLSWHNDKPFSTVDRDRD 
SYSGNWALYQRGGCWTHACAQ* ILDAVY 
ATGPKYQVHCEALH 


4246 


18147 


A 


4270 




218 


TGR IIILS QGLQTLL PLIAF* * LLAS LA 
NLiAIjPPTINLLGELSVLVTTFS*SNITIj 
LQKKKKKKKKKKKI FF 


4247 


18148 


A 


4271 


32 


443 


LHSDVDQLAGLVFPGCPWPLAS PARRAP 
' AGPWPRRAAAPPS *DAPAPRLAVSAGSP 
AWPPPST*GLPAPAAWASPASASPTSA 
RSRS * ASPTARRCRPGPGTAAPTSASRQ 
CQWRP* PCQRSPPSGTCSSAAAAPT 


4248 


1 8 149 


A 


4272 


434 


57 


HLSFPPASAAPKPLPQAASLWSSPPSIV 
LAQVPPMNTDPCPQPSA*PASEL,SPEMP 
PARPQAPPENWSHP *GCGAQTVAPLVFG 
ATPAHRPASPSPVTLPVLTAGGHCFYLY 
TYQIFLKFHYIKR 


4249 


18150 


A 


4273 


288 


424 


GLS LS VAQAWQWCDLVS L*PP PPRVKQ 
FSCLSPPRIWDY*HPPP ' 


. .. 


18151 








429 


NTRGAAGPPQMPHPPRASAFPENPCGRK 
N*GQVSGP * ASGS S PVKWAGPAGTWRKG 
GLiGWGPTRVRGRGPRRRPGASSGYARQQ 
GPGHPGFPPSPRRRLSVPACALCPGQTS 
GTRAGLAVWPQIGLCLKAQGARSCPRDH 
SSD 














I WLFCFVKTGVS VTQAGV* WHDHSS LQ 
P*TAGLKRFSSLTILSSWDYKSTPPCI 


4252 


18153 


A 


4276 


406 


3 


PMVSQGGVCAK*QARAI PR EG I FRRS AD 
TQWREVQQKYNMGLPVDFDQYNELHLP 
AVILKTFLRELREPLLTFDLYPHWGFL 
NIDESQRVPATLQVLQTLPEESYQVLRF 
QTAFLVQ I SAHSDQNKMTNTC I 


4253 


18154 


A 


4277 


373 


3 


DG VSLCPPGWS AVAQS WLTAVL *ALG F I 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PIienylalanine, 
G=Glycine, H=Histidine, 1-Iso leucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Scrine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














FLFPVGGLTGIVLANSSLDIVLHDTYYV 
VAPFPYVL.S IGAVFAI IGGFIH* FPLFS 
GYTLGQTYAKIHFPIIFIGVNLSFFPQH 
NLGLSGMRRCI 


4254 


18155 


A 


4278 


71 


176 


AGGRG CNESRSHH CTPAWATE * DSVKE K 
KKGLFFW 


4255 


18156 


A 


4279 


2 


379 


SPFVPLQSSLGNKSETPSHGRKKKRRKR 
KRTDENQP*KHPLRARPGPGAPRAQGGA 
QGSGGGRRAGPCRKLVHRGHTGNPKREP 
GLIPQGEGRSLGIHPSNGCHKPCPSRGR 
PPPRPKQGRGRMQN 


4256 


18157 


A 


4280 


2 


338 


CDMPRRKLDPLSGRNTLGFVCWWAQTS 
GLK* SSCLCLPKCWDYRHEPLRPVWVYF 
KLWHPSPWKPGFYFSVFFPFYLPLCTTA 
SLFCSCLPPCCSRVTLVGSESSCLWPAG 


4257 


18158 


A 


4281 


407 


1 


E ERG VATGHTAE RGRADN EE E RGE VAN K 
RGGREVQPEARRMTTTATGTTAATRGAR 
T * TAATTATAP VTRTAAPATTAS STRTL 
RTPRRARTASVLWARCRSRWSATWRASR 
TCDCRTASHSRPTATRSRTARMA 


4258 


18159 


A 


4282 


367 


225 


PCDSACLGLPKCWDYRREPPRPAGVLFL 
♦ASFS+QSKEVYVCKLTHA 


4259 


18160 


A 


4283 


452 


1 


NTCGLQSVCRGTQRGGPKRCPCHPRERW ' 
DLGDEQES SKEGRPGREGVQGQGLLGAT 
PGAQG S H * GL * AEAAMS SGHALG PGQ VP 
LTPLSPSPPNPHRRPRAGRASRQSREST 
EAQRAVPSQGAAPGWETDWGSSHQWQPC 
QAQQEGRTGR 


4260 


18161 


A 


4284 


436 . 


22 


CHVTGTQPIKVSWAKDSREIRSGGKYQI 
S YLENSAHLT VLKVDKGDSG Q Y TCYA VN 
EVGKDS CTAQLNI KERLI PPS FTKRLSE 
TVEETQGNSFKLEGRVPGSQPITVAWYK 
NNI E I QPTYN*E I TFKNNCI AAARRI 


4261 


18162 


A 


4285 


301 


403 


IiSIS*PCDL 1 PAT J VSQSAGITGMSHHARP 
RVKSLI 


4262 


18163 


A 


4286 


380 


2 


AQVLYSSREQERRQDLiPEQVIQAEGE*V 
KASACQLTFEDEEAMESGPAALDKDFQC 
TRKHHFAEVQGSPRCMSSRYLiVDGCPKT 
FAEAQNVCS * CCEANL.VS IHAFTFILRI 
QWCTSTVNQAQVCI 


4263 


18164 


A 


4287 


453 


3 


YIYEGS IMEEPRKPVPKGPLGLHCPGKF 
QG*RNSYNHHAVRVGTRCAPEGVKDLTS 
SLQSVITKPEQNIQELMKHFKKEKSEAE 
NHIRTLKAESLEEKNMAKIHRGQLEKLK 
SQCDRLTEELTQNENENKKLKLKYQCLK 
DQLEEREDV 


4264 


18165 


A 


4288 


52 


400 


LDLYFFYRQCLALSPRLQRSSAILAHCN 
LKLLGSGDPPTPASQSKEITSMRYHI*P 
Nli*NSMC*NCVFDTTHRiGENIWESYI* 
* E INI QNACRIPKTQQQKTHFQNKQGLH 
SNLiN 


4265 


18166 


A 


4289 


2 


263 


I HGAHLWS QL PGRLKWEDRRS PGSRGCS 
EL*SHHCTPVWVTEGDPVSKKRINICKQ 
NLNEKTNLTVDTPRHSFTHKIKSLNHFQ 
MFL 


4266 


18167 


A 


4290 


3 


142 


YLGVHRIiS IDGLDLLT* *SARLGLPKCW 
DYWRDPPRPANFYFILFK 


4267 | 18168 


A 


4291 


324 


19 


EMSKRHEQKFHRREI.QMANKHVKRTSLA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 
peptide 


Predict- 
ed end 
nude- 
location 
correspon 

last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyJalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, V=possible 
nucleotide insertion 














I REM K I E TTMK Y YHT P I QMA* VRNN DT S 
QVQWHTLQVAS PLVAE VGGWNEARNLRL 
QIVKMVPMNSHCTPAWVT 














IKFCRDRDSLCCPGWPRTAELiKQSALLG 
LPKCWDYRH*ATKPGLQSVFL,LGLKIFL 
ILLVRGVDRYL 


4269 


18170 


A 


4293 


427 


3 


RRKTVTVPRTGERGI ISNEDNGLCKAVG 
NIPGSPLCNVSKLNIMKHAPGGHAARFC 
IWTESAFRKIiDEIiYGTWRRAASLKSSYN 
LPMHKM INPDLS RILKS PE I QRALRAPR 
KKIHNRVLKKNPVLRPL* R I KLTSRHKR 
R 














FICPLQDYVICSDVTSCWKKILATTGR 
RLRDDSVQFSFFFFGKGGSLLAPRVKEQ 
GGDLG * WNPRP PRLRE F PGIALPRCWNN 
GLAPPPPLILVFLEKRGFPLAGKMGLNIi 
LHSR 












332 


IHSSYFFPQS*FFGTINTSDKPLRILIK 
EKKREGTN+HIMHEPWKFGIDPEDIIKV 
KKKYYKQLCTHKFDNLiEEMVHFLKKKTI 
HL I * NR * FE * L Y KY YRN 


4272 


18173 


A 


4296 


407 


30 


WSVIYGEN*ESL.GWGAHPSHVANIRVTG 

QMS S E I DVDNLGH I S LCNAVQ * IRNLNP 
GLKTSKI ELKFKELHKS KDKAGS EVTKK 
EFIEVYHELYAVG 


4273 


18174 


A 


4297 


350 


1 


YKTWMNKEKW * YVG YN IEQEQKLALKT 
TVLDEWYTLPDGR I I KVGGERFEAPEAL 
FQ PRJj INVEG VG VDE IiL YS T JQAAD VG T 
RSEFYKHIVLSGGSTMYPGLPSRMEREL 


4274 


18175 


A 


4298 


623 


1 


SRRGCAATCDGSITAWPQRQAAQGKPSV 
HSKLEAEAKPTPGDHAASESTGFSCLGP 
GGVHKTAHAHARNMPGDSNTGSGQSPAG 
RRWEARGSAPRHHPTQPDWTHPDALRQA 
MARNPASS F* S F* RCCTAAS PPVPTPPS 
PVIiMRGPVPGGGGGQGKI RPLQEEAP PP 
SSPVVLSR*PQAGTPSSPAVSSL,YHGGL 
SPTGRQDRWGR 














PVRNCLiGARFRVSGRAAHHAXj*QSASTIj 
QGDPRTKRQAI SA* S SAFDVQDVSHVTL 
PFYPKRAQSKDLI KEAILDNDFMKNLEL 
SQIQEIVDCMYPVEYGKDSRIVKEVDVG 
SCVYVMEDGKVGVTKQGVKLCPVGPGKV 
YEELAILYICRRPCI 














QENCLNLGGEGCSGLPKFWDYRCEPPCP 
DCWPS 


4277 


18178 


A 


4301 


454 


1 


NGEKPTYSGKKYVFLF*NTRRPTLFTWG 
KTN*GERPRSSVYSCSSWRLSSSSL>GRS 
+NSVGSLNIILATLTAQELSNS*ANNLI 
* LAYTIAFI VKI PLYGLHL* LHKAHVEV 
PIAGS I VLAPVLLKLGGYG I I RLTLILN 
PLTKHIAYPLY 


4278 


18179 


A 


4302 


3 


463 


AVSLPLPLPRFGRLPPLRVGWCPLRCH ' 
I LRTQRLE G AP S R * QTGAKQ AGVWE ELR 
SRLSLGPELGRIRPYSGDPGQVTDSSGA 
LASSLERG CYPAALLVGEGEGEMYLGPY 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
oil 


SEQ ID 
NO: in 
USSN 
09/5 J 5,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

iocation 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cys<eine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, >=possib!e 
nucleotide deletion, Wpossible 
nucleotide insertion 














HPT*DGLKRLSLAKPPGGETRLTRSGHT 
FWGEGIPPSAVCL 


4279 


18180 


A 


4303 


469 


2 


FFFFFSETESCSVTQAGV*WHDL.GSL,Y 


4280 


18181 


A 


4304 


194 


3 


FFFFFSETES CSVTQAGV* WHDLGSLY 


4281 


18182 


A 


4306 


517 


254 


WEETAFRLIGKTGIGLVTSGDLAASAS 
QR GGMTG VSH CNR KEFVLMN I I VMSSGL 
TFIiPFSPDEV*GS*GLSFLPSPLALNTG 
LRN 


4282 


18183 


A 


4307 


276 


3 


KIKMEGIPLHIPNPLVWLNLGLLFILAT' 
SSLAVYS IL* SG* ASNSNYALIGALRAV 
AQT I S YEVTLAI I LLSTL 1 1 S GSFNLS T 
LITTLV 


4283 


18184 


A 


4308 


234 


398 


KKFFFFPPLEGGGPITTIWSPPLPG*RE 
SPAPP P PRGG I KGLiAPPPNLFFFLDKR 


4284 


18185 


A 


4309 


403 


100 


SGGQKAVGPPWAAPPGYKQNTNKLGPLP 
QPSTEGGAFWPTQGPKPTGLPPLPG*LN 
PRNPTPP*WPPPPPPFNTRSKGPHTPSP 
TPLCGPPPKNPRLFFFF 


4285 


18186 


A 


4310 


399 


85 


WKP I PLAPEWRGVIRFRALP PCRPLHC P 
TLKLTAPSVIHQRT*VHWGFL,VGVSLSV 
FLERESCSFTQVECSGTKLALRSLELLG 
VSDPS ISAFQRAGIAGVSHHA 


4286 


18187 


A 


4311 


230 


2 


WTEEDTRRCLVLFS FFLS FFLFCFVLRQ 
RPACC PG WSAVTQS QLTVAS TSLAQAKR 
SSHLS PPQRS * DYRHTPQCI 


4287 


18188 


A 


4312 


1 


420 


NTS WGVGELSL I VI VINMLL P YVWLPKG 
KFFFPQNSRFPSAPPHSSPPGLRSDFSH 
SGGLFFHLEVLWGLPLPPPPPPASTHVR 
RPLGTQGCP*LAWYIHLISASYQKANAA 
PQLSCILQDCIRSKGDILIFTFFTLCLS 


4288 


18189 


A 


4313 


403 


207 


ESSEG*IiNPIAHLAMRYKGCPFKDVREK 
SEFILKSIQVRKSILNTRMSLLQLDFTC 
RYSRCVCFY 


4289 


18190 


A 


4314 


469 


328 


TSEGGGFNELRLCHCTPVWLTK* DSVS I 
SQRIRKKNLKTQSLWRCP 


4290 


18191 


A 


4315 


1 


212 


NTLPGVEVXjKRLRWEDSLNP + GQGCREP 
*SCHCTTAWATEKDPVSKKKKKKKNPGG 
FRPHSLSQPFGPPK 


4291 


18192 


A 


4316 


63 


559 


SNLTFWQCAVPAVETTPSSLCGPVPSPT 
SSATPKPIPS *AACPPPDCAIoAAEVRAL 
PAAGRPRFSEACLTPQPNPCVLWPCLWD 
HSPTSCPTGSS+APHSCAFLPVPRLGGG 
MTTVHPTVHLPIHSSIjTFAWE* PSPPAC 
LTAPRPTGHCLLS VP PS AYS S S S P S 


4292 


IS 193 


A 


4317 


135 


405 


PAPPS PHPKPDS VSCVIPPS * PAPPSPK 
PKPDSVSCVIPPSHVAGPSGLPEMTLLE 
PRGPCPPEVPPTSPFVG*AWWPHPPARR 
ASGRMDGRTDGRGRHLDLRS THSFLTP 


4293 


18194 


A 


4318 


413 


1 


LYLVGRA*SFHRSFPGQERIHPGGKPYD 
CKECGETFISLVS IGRHMLTHRGGVPYK 
CKVCGKAFDYRSLFRIHERSHPGEKPYE 
CKQCGKAFSCSSYIRIHERTHTGDEPYE 
CKQCGKAFSCCKYIRINERTHTGGV 


4294 


18195 


A 


4319 


479 


342 


G VGVDNDG IL, VLGATN I PWVIiDSAIRRR 
♦DNTVNILFLTADVTSR 


4295 


18196 


A 


4320 


400 


26 


HSEAVTTVRSHHSPVGHIHSLNAPPTVA 
IjTP*YNKPIjNIFV*ARAQKIQGQFLTHT 
I PHILLLVITRDTSRACVCVCVCV CVCV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CVYAYIWVCMCVYIYVCVYIYICNYTFI 
PTLVHLPINKMY 


4296 


18197 


A 


4321 


370 


1 


SDRGQGKAGQATRLGSRQTGQDEDKGTE 
KSIPSWDKGPVNNEFGKSVNVSSNLVTQ 
EPSPEETSTKRSIKQNSNPVKKEKSCKC 
NECGKAFSYCSAIjIRHQRTHTGEKPY*C 
NECEKAFSRNV 


4297 


18198 


A 


4322 


342 


82 


GLiGLTTNWLLTGKSAGRGCAPSSGGLWIi 
AVGDS VYYHRETEGGKPS PGRPCC* PS * 
AGLAPPS PPPPGCPEAPGS CCLHYRCLL 
TAH 


4298 


18199 


A 


4323 


284 


3 


ITKKFPRVIGGGCFSPFFMGLSKKNGLF 
PEGALS I SLI FGPAI PPGGAQVI SSAKE 
KKKVLSRPGWAHACNPSTLGG*GGRIK 
RSRVPDQPV 


4299 


18200 


A 


4324 


365 


3 


QVEVYWENHPFRLEES I YQ* CRLEGAT 
SVAGEQISEYNTSMRASDGGSPPLSTET 
HITLHVIDINDNPPTFPHLSYSAYIPEN 
NPRGAS I FS VTAQDPDSNNNAR ITYALT 
EDTLQGVY 


4300 


18201 


A 


4325 


253 


3 


PSFLSRD*SHIHKRLiECSGMISADYNLH 
LPGSSYPPTSPSQVAGTAGPSMSLQEGR 
ASHDITSSSRSIGAKELLRPADHPQCI 


4301 


18202 


A 


4326 


307 


95 


FLFLiIiKDKI LLS PRLECSGT I IAHCSLK 
LLDSSYPPAVAS* VAGTSGMCHYTWLRW 
KNHLS PG I QGCS E I 


4302 


18203 


A 


4327 


371 


2 


WAPNHI S PTPRSGSTTRI WAPT PPSNS P 
RPCQDPGHRTDPWYPEEEFLLTNPDPPR 
APASWSFPFQEKRLHFPALPCP*HLDSS 
LGPTTLFS FSFPPTS IRPSQTNHSKGTP 
PPLSYAQHDCI 


4303 


18204 


A 


4328 


409 


I 


RliLEARQPELEMAAliIFFTLKYKHVERE 
QKYHQLQDEYFTSAVVLTLILAALFGLV 
YLLI FPQSVWLLLLiVLCICFLVACVLY 
LHITRVQCFPGCLTIQIRTVLCIFIWli 
I YSVAQGCWG * LP WAWSS KPNLY 


4304 


18205 


A 


4329 


262 




LHGAYLVLDI IGAQNRKPRYS FKWRVGQ 
FFFLVFS PRDSLTLLPRLECSGTVMAHC 
HLDLLGSGDPPTSAS * I * GTTGAYHHTQ 
HV 


4305 


18206 


A 


4330 


1 


399 


NTF* FLKGMGLVNHVFTEDNLKKLYVSN 
LGIGHTRYATTGNV 


4306 


18207 


A 


4331 


408 


3 


SSVGIHVRTHTGEKPYECKHCGKAFSCH 
SSLREHVRTHSGEKPYECNQCGKAFSHA 
QYFQKHVRSHSGVKPYECTECGKAYSCS 
SSLRVHVRTHTGERPYECKQCGKTFRYL 
ASLQAHVRTHAGA* IYKYSGHV 


4307 


18208 


A 


4332 


110 


426 


GI>S PGTWS DMTGGPAVTAFPCTLTGTHA 
GHIII DG ID I AKLP LHTLRSRLS I I LQD 
PVLFSGTIR*APPPLRPTQPQAGSVPSD 
LEHKEEGVGGDQDPRGVSCSC 


4308 


18209 


A 


4333 


133 


1 


EPCCPSALAAPEVX.GPEKYDKSCDMWSL 
GVFMYIL*VPSPPPLY 


4309 


18210 


A 


4334 


165 


1 


1 1 LTF FKKNERKWKAVQ YW * KNR * VDQ 
WNRIASPE IDPHE * SQLISDKGAKECI 


4320 


18211 


A 


4335 


42 


440 


SARRAGDPARGAPSRNNASLPDPRELTG 
RPAGGLQ*GNCPAPPTPQLPSPSVSRPV 
SGRGP PPPPS FGDPRANRPQNPKGAKW 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D~Aspartic Acid, 
E=Glutiimic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \-possible 
nucleotide insertion 














LTVQMGEPGEIRRAACPQLSWAMTFPGS 
LPEPKRTGPQIQKSPPSRRGG 


4311 


18212 


A 


4336 


1 


160 


NTCGGGAQL*SQLLRRLMWEDLiNSGSQG 
CSELRSHHCTAA*ETKGDPLSINEY 


4312 


18213 


A 


4337 


236 


1 


QRSLL.CYIMEIRTVAVRIVAIKGVESEV 
YLAMSEEGTVYAKKECNEYGIF*ELVLE 
SHYYTYAAAVLRPVCCIESKVS 


4313 


18214 


A 


433S 


408 


3 


CQSSVSKKERTNGAQNPGAAKQGNNELR 
DSTEQFQEYYRQRLRYHQHLEQKEQQRH 
I *QQMLLEGGVNQEDGRDQQQNRNEQFL 
NRS IQKL.GELNIGMDGLGNEVSALIQQC 
NGSKGNGSNGL*VNS FDTPPHV 


4314 


18215 


A 


4339 


363 


1 


RSQSSLPKSFKRKISWSATKGVPAGTS 
DT* GVQPGWQ* RWGASTATTQKKPS I S I 
ATESL.KSLIPDIKPLAGQEAWDLHADD 
SRISEDETERNGDDKTHDKGLKICRTW 
RARYSINEV 


4315 


18216 


A 


4340 


1 


353 


DVFLDTLARPLRHSCNFFLCYLL*DFFK 
FIPQFFYWMFYFDYIFISRRNFYFLLLT 
HFFTL.PLFLVIRIQCLHLFLFFSFFFFF 
FGKGNP FLPPGWRARAQFWVWGS PPLRV 
NALLP 


4316 


18217 


A 


4341 


376 


3 


LELREGGFLPKIADEVREERS PALLDDR 
AGRCQGQPRIQVCLTPKSMLiFliAFHM*T 
CEHCLEQCFSTSL*PIEIRTLH*D*GCV 
CMCAYVC ICVCVHMCMCAYICI C VARMA 
KRPLES IHTSCT 


4317 


18218 


A 


4342 


158 


2 


LAFFFFCETQSCCQWHDLSSLQPLPQRF 
R+FSCLSLPSSWDKKHTSPHPTCI 


4318 


18219 


A 


4343 


145 


2 


I FGEQ WFDYMNKFFSGDF * D FGASLTQ 
AVHTVPNV*SVIPCHPPRV 


4319 


18220 


A 


4344 


1 


434 


RSLIFRATAYEYRYIDQADLKLPASKRS 
S CLGL P KG WD YRHE P PCLARF I FWQI F I 
AYKLR*KHYKSRQRSLPSRGSC+CACDT 
FVSDPSHWGTMAQILGGWITLGQVQRSC 
LADOPLLPLLPPT1APCLP *NDVI,LFFF 
TQSQ 


4320 


18221 


A 


4345 


4 


475 


KHS CRS LiEHLDR P L P PALQE T C P VRAE P 
LtiLVRINASGGLIl.RMGAINRCL.KHPIiA 
RDTPVCIjLiAVIjGEQHSGKSFIjLNHLiLQG 
LPGLVRAGRGRREAGSKDGGSCLGEAGS 
GIPVCGDKEPTN*CSPFSPLQESGEGGR 
PRGGEASLQGCRWRANG 


4321 


18222 


A 


4346 


313 


442 


DSTLNFFFFF*TESYFVAQAGVQGHNLS 
SLKPPPPGFKRFSCL 


4322 


18223 


A 


4347 


437 


338 


AEVQEFYGDYIAVNPHLFSLNILGCCQG 
RNWDPAQLSRTTQGLTALLLSLKKCPMI 
RYQLSSEAAKRLAECVKQVITKEYELFE 
FRRTE VP PLLLI LDRCDDAITPLIjNQWT 
YQAMVHELLGINNNRIDLSRVPGCMRPL 
*RIHPGTLtERSIRliLFMPSSSWTMA 


4323 


18224 


A 


4348 


409 


118 


GLHDASFVAAIPSTQWYFFFEETASEF 
DFFERLHTSRVARVCKNDVGGEKLLQKK 
WTTFLKAQLLCTQPGQLPFNV I RHAVLL 
PADSPTAPHIYAVFTSQW*AAGPSGRWS 
TSSAAAFRRPRHSCRL 


4324 


18225 


A 


4349 




418 


YTLPDFPHHPHRLSHCRLLQSRTAETYQ 
SPGACTPISQGHPGQRSSPCSSSSNPAC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEO ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














*SSSSCGSTGFRCPGSAHPGCCSYVL*M 
GR*RSSQAPRASPIAQGPRSAPAGRAGH 
PALPS PL.P * GAP P P PSAAS S PG L* S PA 














VSKTNTHTNKKHL 


4326 


18227 


A 


4351 


446 


277 


AQDFVLIKDGEVIGGICFRMFPSQGFTE 
IVFCAVT*NEQVKVSAAPRPQPQSTAHS 


4327 


18228 


A 


4352 


408 


3 


PKSSLFTSRPNGKSWSTKRPAISLAYSN 
ENAQGSWNGDQDAGPLLVENPRDYSTEL 
SVT I AVGASLL FLNVLAFAAL YSR KDTR 
RQ*PLCQLSPHRGIGASELGHAPQ*ELP 
ALLLG P I RHE CEAG PPHDTLP V 


4328 


18229 


A 


4353 


62 


280 


VRIGCLTSVPEASSCGLPTTPGCC* PLG 
TLPCRSRSRTRSFRSRSF.TQSRSRPKTY 
SPGRRRRSRSRSPTPP 


4329 


18230 


A 


4354 


364 


3 


PVGEEGKSLVPKS PVEEKGKS PVSQS PV 
EEKAQYRVPKSPVEEAKSKAEVGKGDQK 
EE * EKEVKQAPKEDKVEKKEEKPKHVPE 
KKKAES PVKEEAVAE WTI AKS VKVQLE 
KETNEECI 


4330 


18231 


A 


4355 


3 


558 


LGPTIjLWRRGKVPKANSPIPTSLRRTLP 
RAS PGPSRGPLFMRQAGSTPMTLPRS TP 
SMSPML*MVWPPTAVPVP*KPLMWAPPA 
PLVLLVTILTE IQEPATPAPLTQF*KPT 
SLMVSRPVCPWQGPRTTRSTLiCATTIA 
PSHATIjRPGLSTTTSPLWQTLiSIjIjIjEGQ 
ASLPKG*NTSITLPSVS 


4331 


18232 


A 


4357 


258 


_ 


LGFHDPGPAGWSQPHGGRSAQAQ*PPQP 
PTVPRTSLADVSDAPL>P P CGHS RPQHH P 
FRLPHTSFSTPGLSAVCLIVHPAARVSR 
CI 














DRDLGDEYGWKQVHGDVFRPSSHPLIFS 
SLIGSGCQIFAVSLIVIIVAMIEKLYTE 
* VPTLNCIMSLLFLNSFNHWNLLSQS 














KDYFGSSVENGGEEELKQEDQLGGYSVI 
♦ARDDEDEDDDDDDDDEDEEEDDEDEDD 
DDCI 


4334 


18235 


A 


4360 


88 


2 


RGSRTDTG*RGSDRGTG*RGSHTGTR*R 
GSRTDTG*RGSRTDTGGRGSRTGTGGRG 
SRTGTGGV 


4335 


18236 


A 


4361 


2 


250 


ELRDEGKASSAKQRLKCAGLQKFGERAF 
KAWAVARLS QR F P KAE FAE VS KLVTDL T 
KVHTECCHGDIiFGCDEESAGLAKYIM 


4336 


18237 


A 


4362 


169 


456 


EQLLCAGQGQLSCEPIjHLPSCPVGPQTP 
ahrpgrmapgpps flspgspcgliglai p 
CSCQ PWVAGVPDHHLFPS PMGRAGLS PS 
ERSAGEGVIPWP 


4337 


18238 


A 


4363 


1 


419 


PEFKLQKLKRSQNSAFLDIGDENEIQLS 
KSDWLSFTLEIVIMEVQGLKSVAPNRI 

DFTTTHPRPWKVKL1FTESTGVX1ALEDK 
ELGRVI LYPTSNSS KS AELHRMWPKN 


4338 


18239 


A 


4364 


375 


448 


PGAVAHACNPSTLGGRSGRIMRAG 


4339 


18240 


A 


4365 


1 


444 


DFLTNHYliHFLRIAGSQLTGLGTAVQLY 
SAYEEKNRTFLLAAVKRNHNQYVNPSGV 
ATFFESIKEILLRQSGVKVESVDHDSCV 
HGPCQNGRSCLRKIiAVSSVLKSRESLPV 
I IVANEPLQPFLCKCLPGYSDSWCEIDI . 
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SEQID 
NO; of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 
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NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, J=Jsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DEWLPFPW 


4340 


18241 


A 


4366 


52 


402 


CPQEWFHSFCHCTSAFMKDLYFL.CLVR 
FGDIiKEDKVTRHDGASSDGHLiAHIFRHA 
AKELFNEDVEEVTYRALRCGAVSTACLC 
LWLARVHGVPVWI CACGGSMDGVYTSQN 
QKDFF 


4341 


18242 


A 


4367 


119 


247 


WVLLSFQLHGVAVLGIALIAMGEEIGAE 
MALRTFGHLLR YGE P 


4342 


18243 


A 


4368 


2 


430 


GGT WDGQPG FHGDS bSKAPGMNSLEQG 
MVGIiNIGDVSSSAVKTVGSWSSVALTG 
VLSGNGGTNVNMPVSKPTSWAAIASKPA 
KPQPIMKTKSGLPPSPIKHNMDIGTWDN 
KGPVPKAPVPQQAPSTQACPQSQQVGQT 
LP 


4343 


18244 


A 


4369 


1 


406 


ETSTPEGEAGPIQRLDIPVENPVESKNI 
FLGAPLIICHVIDKRSPLYDISATDLAN 
QDLEVI VI LEGLGETTGISTRARTFY I A 
EEIQWGHRFVSIETEEQRPDSVDYSKFG 
KTDLVTTSRCNARELDEKPSILI 


4344 


18245 


A 


4370 


2 


284 


GTGTL CDLTALLS AR YDGVRTC I LPC WK 
TI ST I P VALLIH YHQNTQAS CGKRAI I LE 
TRQHRLFCADPKEQWVKDAKPHliDRQAA 
ALTRNGGTFE 


4345 


18246 


A 


4371 


3 


184 


EDYNI LLS IMKGTTRP INKE I EALNDT V 
DQLELTDLCRTLTPTIAEYSFFSSAPKV 
LIQP 


4346 


18247 


A 


4372 


44 


304 


GLPRS IGCQGLPRLPEWPTWVGGCHRGL 
IMPSVPTQVSERPLMFLLDTPGVLAPRI 
ES VETGLKLALCG E PGDGLGLG PGPLP P 
HL 


4347 


18248 


A 


4373 


1 


422 


LHLFNPAASAPSRSLFSGPIIiDPLSRAL 
GVG PG I CGLAS SPGIS EGWDQ IRSWTHP 
PDPDHLS G FCR S QVYMH PS L.S P PS TM I L 
SGGTAIjKPPYSAFPGMQPLiEMVKPQSGS 
PYQPMSGNQALVYEGQIiSQAAGLSASQM 


4348 


18249 


A 


4374 


2 


423 


nsgthtpglledlskngrlpeiklpvng 
csdledsftilqskdlkqeplddptcid 
tsetslsnqnklfsdinlndqewqelid 
elantvpeddiqdlfnedfeekkepefs 
qpatetplsqesasvksdpshspfahvs 


4349 


18250 


A 


4375 


292 


429 


LKISVFPSSTEISLQQRIiEYIARAILTA 
KSSTGIASISADGESLRE 


4350 


18251 


A 


4376 


544 


32 


ALHACACLGLSPEACFL.HLLQLTGTSPG 
PVDGQGLDNQGFRLLKAILGVSPAPEPV 
HG AS EHNADTGV SVARACVL P TS P TLAR 
TAAAKDESPPLAGLTMGPSREPGGSCLP 
IiPSGIiLSSPTRLALPRRPPSVSVWQDAG 
SSGSMPTEPLVQCGHRFYGTLVSWAVMV 
LK 


4351 


18252 


A 


4377 


378 


232 


PAKRAEEELLLHDTRCWLNGGAMPEARH 
PRTG AS ALHVAAAKG Y I EVMR 


4352 


18253 


A 


4378 


291 


1 


QMLSVDVASRYRAPSTYVLNSLKEGMDG 
LHGES CSS FLLGPSVAMNMQTAGIjEMGI 
CDGHFRQNVGCGYVL KPDFLRD I QS S LH 
PEKPISPFKGQSL 


4353 


18254 


A 


4379 


171 


458 


RGPLFLTLHIVNSLQGNNFRREYIVTQG 
PLPGTKDDFWKMVWEQNVHNIVMVTQCV 
E KGRVSKQLS WHQ S FAFCHLDRI LPHHQ 



819 



NSDOCIO: <WO 0164S35A2J_= 



PCT/USO 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
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sequence 


M 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 

corrcspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SETPQNNVGIM 


4354 


18255 


A 


4380 


214 


410 


LDHLPTHPSPTGEIEVKWFEAVQTGLPM 
C I LGAFFG P I RLGAQS LQ VLD S EL I PWA 
VQKGRIAPC 


4355 


18256 


A 


4381 


2 


74 


IGDSGVGKSNLLSRLTRNEFNLGL 


4356 


18257 


A 


4382 


290 


119 


IATVSLNIVKMPRLPKAIYRFNAIPIKI 
PMXFFTEIDKTILKCMWNHKIIPGNSPA 
H 


4357 


18258 


A 


43S3 


2 


423 


LERVCWIKDI WAVTGENMEVMKSI IQK 
YQHKR I S LVEAGVTRHRS I FNGLKALAE 
DQINSKLSKPEWIIHDAVRPFVEEGVL 
LKWTAAKEHGAAGAI R P L VS T WS PSA 
DGCLDYSLERARHRASEMPQAFLFDVIY 


4358 


18259 


A 


4384 


356 




WYPCCLLASLICVFRGI,FYLSL.CLVLRV 
PVPLCLQFPGSLLGSLLLFSLLRQGLAL 
WPRLECS SA I RAHCSLELGSNDSPASAS 
QSAGITGMSQCAWPLSLLSVSPRVPGLS 
SVPLIF 


4359 


18260 


A 


4385 






SGSKKTLRSSLEEPTILDIEKFHRESFF 
YTHLINFSGKRYC 


4360 


18261 


A 


4386 


53 


215 


WQLYWTTWCQKISRNQGLAHVPRMECSS 
MI IAHCNLKLLGSSDPPASASQIDPS 


4361 


18262 


A 


4387 


2 


390 


PSLAARVLARGYGNFSVFVLWLRKNDRM 
HGECAPNVSVAVSTSHTTI IGGGIRGGG 
GGGYGSGGSSYGSGGGSYGSGGGDGGGR 
GSYGSGGGNYGSGCAGSGHGSYCSGSSS 
GGYRRGSGGFGSCNSGG 


4362 


18263 


A 


4388 


2 


441 


IKTRISVIHKEEFGLTPIEGAIEDMKNK 
TLQLAFAINQEPSDAKMLQMVLKGSEGA 
TVNQGPLEVAQVFLAE I PADPKLYRHHN 
KLRLCFNEFIMRCGEAVEKNKRLITVDQ 
MEYQQELKKNYNKLKENLRPM I ERKIPE 
LVKPIF 


4363 


18264 


A 


4390 


242 


24 


KLGN FLGGC VKES P FVPQGGMQWPDFGS 
LQHPPSRFKQFSCPTLPGNWNFRNAPLR 
SGKFFFSPVFLVETGY 


4364 


18265 


A 


4391 


1 


244 


VDQQMRQNLFTTGAHHLQQANIQFRTDI 
ARTE YL S NADE RLR WQAS S L P ADDLCTE 
DAIMLKRFTRYELGCQGEAWEECGW 


4365 


18266 


A 


4392 


2 


146 


LDLLSQPCRAVYI FAKKNDI PFELRIVD 
L I KGRS S LGFG E PKS QEGDR 


4366 


18267 


A 


4393 


204 


489 


GAAS EHPIPKGESHGTS KAS S L CADAG P 
AGHFASVYNPLAWTVTTIVTLTVGFPGV 
RVTDEAGHPVPSQVSGIQRCLQGMKAPP 
VTVTVSGEKGS 


4367 


18268 


A 


4394 


2 


171 


LSDGFQPSRSIIFASWSAGDFGSVGATE 
WLEVFFIIPSHIGHELVGLGCCPEVMFL 


4368 


18269 


A 


4395 


2 




WLELLKFYTLDCALEE YVT CVR I QDI LT 
RENNNWPKRRIAI EGEMI CCI S FVEYFI 
CATHYLIQC 


4369 


18270 


A 


4396 


2 


285 


TVDFPFQPYKCQQEYMTKVLECLQQKVN 
G I LES PTGTGKTLCLLCTTLAWREHLRD 
G I S ARKI AERAQGELFPDRALSS WGNAA 
AAAGDPIGPS 


4370 


18271 


A 


4397 


366 


461 


TSNFS CSLVLEEEASDYLELDTI KNLVK 
KYSQ 


4371 


18272 


A 


4399 


157 


3 


DVLRNNFARSAVLYLFIDRVTLWNAPRL 
ECSGAIIAHCSLKLLGSNDPPTS | 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamiue, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4372 


18273 


A 


4400 


2 


343 


DIDFKYHFDFFSVNFNEELVALYGGSLQ 
KQTKFVHECIKTILKX.YKGQEFAPKSVA 
I IGHSMGGLVARALLTLKNFKHDLINLL 
ITQATPHVAPVMPLDRFITGSTGWLKDH 
G 


4373 


18274 


A 


4401 


214 


430 


E LGN I LNYWRWEAEAELSNRW FNGQAV 
HGELSPVTDFRESCCRQYEMGECTRGGF 
CNFMHLRP I SQNLQRQ 


4374 


18275 


A 


4402 


3 


227 


XjTQVS PQMTGHAGLNTAQAGGMAKVSEL 
KHFQYFLPNRGLS LRVMRKMGPRGPLLF 
FLLGS PRATHAWKRQ I S F 


4375 


18276 


A 


4403 


158 


313 


NRDEFCHVAQAGLEXjLGSSSPPAATSQS 
ACITGVS YHVR PTLSNKFGCS S L 


4376 


18277 


A 


4404 


3 


628 


HCIREGGQDVPSNKDVTSLDWNSEGTLL 
ATGSYDGFARIWTKDGNLASTLGQHKGP 
I FALKWNKKGNFILSAGVDKTTI I WD AH 
TGEAKQQ F P FH S AP ALDVD WQ SNNT FAS 
CSTDMCIHVCKLGQDRPIKTFQGHTNEV 
NAIKWDPTGNLLASCSDDMTLKIWSMKQ 
DNCVHDLQAHNKEIYTIKWSPTGPGTNN 
PNANLMIiAS AS L 


4377 


18278 


A 


4405 


35 


180 


M CVD YHY F FCLLG PNNNNP QTS AVRT PT 
QTNGSNVPFKPRGREFSFGK 


4378 


18279 


A 


4406 


1 


438 


DFQRPRDDHGDVDWEKLVELLTDCSNLQ 
DQTY IIiYI LYVI KG P S WDTNLSGQNG VT 
VQNLLGELYGKTGLNQEWGLIPYISSLF 
RKKVDVLAEACTDLLSDQKQLTVGLSDE 
PREKI IYAPL.PPKELTKLIYEASGQDI I 
IGVLTQ 


4379 


18280 


A 


4407 


1 


428 


TESVNAYFKGADPTKCIVKITGDMTMSF 
PSGII KVFTS NPTPAVLCFRVKN ISRLE 
QILPNAQLVFSDPSQCDSNTKDFWMNMQ 
AVTVYLKKLSEQNPAASYYNVDVLKYQV 
SLNGIQSTPLNLATYWKCSASTTDLRVD 
YK 


4380 


18281 


A 


4408 


174 


383 


KNSW PGAVAHACNP S TLGGKGRQ ITRSL 
YRDHPVQHGEIPLYLKI/TALAVTFLGLL 
TGLHDLYLTTNLT 


4381 


18282 


A 


4409 


1 


288 


LMAEKDSLDPSFTHAMQLLTAGKGASRV 
PLGRPAI PGMSGPGFVPLASRPLGTEAV 
GSSGLILIFGSLDSTFPPQPGVRGRISR 
LQWRPEGDFAIP 


4382 


18283 


A 


4410 


292 


422 


liVHSSCFQVSSKILELARKQRMNTDIRR 
NT FCTIMTSEDFLDA 


4383 


18284 


A 


4411 


412 


238 


FFSRHRVSQWDQAGLELLTLSDLSAIiAS 
HSAEITGLNHHAQPLTTNFNKYCWISLQ 
VH 


4384 


18285 


A 


4412 


1 


405 


VTSSDKS LKVLLDAED KVFNE I RNEHFS 
NVFGFLS QKARNLQAQYDRRRGMD I KQM 
KNFVSQELKGLKQEHRLLSLHIGACESI 
MKKKTKQDFQELI KTEHALIjEGFNIRES 
TSYIEEHIDRQVSPIESLRLMCL 


4385 


18286 


A 


4414 


2 


149 


GFHHVGQAGLEVLTSDNLPVSASQSAGI 
TGVSHCAQLRSDGFKKRSSPA 


4386 


18287 


A 


4415 


1 


366 


D I CHR KLG VE CPS HAS I NRLWQWS S I 
TASLRFEGPLNVDLIEFQTNLVPYPRIH 
FPMTAFAPIVSADKAYHEQFSVSDITTA 
CFESSNQLVKCDPRLGKYMACCLLYRGD 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyfalanine, 
G=Glycine, H=Histidine, I=lsoleucinc, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^Glutamrne, R = Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y— Tyrosine, 
X=Unknown, *=Stop codon, /=possible 

nucleotide insertion 














WPKEVNAAI 


4387 


18288 


A 


4416 


1 


164 


RRRCEMSTMFADTLLIVFISVCTALLAE 
GITWVLVYRTDPFGTLAGSEGPGGRY 


4388 


18289 


A 


4417 




141 


DIAI PCNNKGAHS VGLMWWMLAREVLRM 
RGTISREHFWDLTSSVMG 


4389 


18290 


A 


4418 


136 


1 


EQTKAFGQLHKVLGMDPLPSKMPKKPKN 
ENPVDYTVQI PPRDPMQ 


4390 


18291 


A 


4419 


1 


280 


GNYCKYYGYRGPSCEDGRLRVLKPEWFR 
GRDVLYLGCNVGHLTLS IACKRGPSRMV 
GLDIDSRLIHSARQNIRHYLSEELRLLP 
QTLEGDPGA 




18292 




4420 


3 


446 


WSTSQVRQNYHRDSQAAINRQISLELY 
ASYVYLSMPYYFDRDDVALKNFAKYFLH 
QSHEDRDHADKLMKLQNQRGGRSLLQDM 
RKPDCDDWESALNAMEWALHLDKNVNQA 
LLELHKL.TTDKNDPDLSDYIETHYLNEQ 
VTAINELG 












182 


DPSFCSEPRSCHCTPTWRTEQDSISEKK 
KKRNILiKNWAKNYITL>ESEKERKSKQRS 
RR 


4393 


18294 


A 


4422 


1 


456 


GPISYWVMPKKRQALVEFEDVLGACNA 
VNYAADNQIYIAGHPAFVNYSTSQKISR 
LGDSDDSRSVNSVLLFTILNPIYSITTD 
VL YT I CNPCGPVQR IV I FRKNG VQAMVE 
FDS VQSAORAKAS LNGAD I YSG CCTLKI 
EYAKPTRLNGSK 


4394 


18295 


A 


4423 


2 


257 


FHHVGQAGLKLL I S VDP P ALDS QSAE I T 
GVSHCAQP VLCI LNQLCAKAVLS PDSH P 
YPPKSGRHEGQVREQKMS PCSY I YFKRT 
V 


4395 


18296 


A 


4424 


106 


259 


CCCFKYFFSNVKICFYRDLKPENILLNE 
DMHIQITDFGTAKVLSPESKQGV 


4396 


18297 


A 


4425 


3 


364 


MDEIE KYQEVE EDQ DPSCPRLS RELLDE 
KEPEVLQDSLDRCYSTPSGYLELPDLGQ 
PYSSAVYSLEEQYLGLALDVDSEYLTVK 
VISLHLVFQIGVIFLFQVPLTDPRDVIA 
AGRTYGRI 












241 


HHVWLICFVFFVCVETGFCHVAQVGLER- 
LGSSDLPTS VS LS AG ITDVNHHAQSKPS 
PSKGTGMNRP IGLRVGKGVWSRL 


4398 


18299 


A 


4427 


2 


438 


TRLIGPNCPGVMNPGKCKMGIMPGHIHK 
KGR IGIVS RSGTLT YE AVHQTT QVGLG Q 
SLCVGIGGDPFNGTDFIDCLEIFLNDSA 
TEGI ILIAEIGGNAQENAAEFLKQHNSS 
PNSKPVESFIAALTSSTGRKMGHAGQLL 
AEEKV 


4399 


18300 




4428 


34 


427 


AGMLPAVGSADEEEDPAEEDCPELVPME 
TTHSEEEEKSGLGAKI PVTI ITGYLS AG 
KTTLLNYILTEQHS KRVAVILNEFGEGS 
ALiEKS LAVS PGGELYEERLEL I NACLCC 
SVKGNGLIAJENLMRQKGE 


4400 


18301 


A 


4429 


330 


8 


YSDRISHWIDMCPSKYIFRGPVHPPHIS 
TFLHTAHNLVARGNYENMSPHDEEKGAY 
QRKEKCCDDPMAGAAR I HFLCFFEMG FR 
SVSQAGVPWPHITSMQPQPPRIK 


4401 


18302 


A 


4430 


446 




LTSSDPPASAPQSAGITGVSKRPQPIWG 
VSIQLYFSGCPDTPGQGRLTREHCDGQD 
GEQGGPCS E S VAQ ECHDWS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
0(1 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
ncid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, IY1=Methionine, 
N=Asparagine, P=Proline, 
Q=Glu tain trie, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4402 


18303 


A 


4431 


3 


243 


AREDSADFVSFFPDFVWTLRDFSLDLEA 
DGQPLTPGTSQKDNNFILPRLCIRKFTS 
LIRTSRSYQSPDTVSVASSRGAR 


4403 


18304 


A 


4432 


433 


0 


PPPSPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPSPSSPSSSS 


4404 


18305 


A 


4433 


25 


185 


ARWVACSEL.RLRCCTPAWATEPEPVFKK 
KKKVQQQGGWPGGLTPVSYRFGRLT 


4405 


18306 


A 


4434 


2 


329 


VRDIKEKLCYVAVDFEQEMATVEFSSSL 
EKSYELPDGQVITIGNERFRCPEALIQP 
S FLGMES CGMHETTLNS IMKCDVDIRKD 
LYANTVLTGGTTMY PGI ADRMHNE I 


4406 


18307 


A 


4435 


2 


374 


WVTFISLLFLFSSADSRGGFRRDAHTSE 
VAHRLKDLPEENFKAL.LLIAFAHYLHQC 
PFEDHVELLNEVTEFAKTCVADESAENC 
DQSLHTLFGDTLCTVATLRETYGDMADC 
CAQQEPERNECF 


4407 


18308 


A 


4436 


2 


352 


KVRRIDSRLAELDREIRNPQISCQLIYS 
PSLFLGAFFLFFFFGFFFPPPYLLAPKN 
QFNPPPLCTGGPVPRPKPPI PPWEPPRP 
APPLFSPQWDLGFQIAPFLWVINGGGPS 
GGSRA 


4408 


18309 


A 


4437 


2 


369 


QSCETLFHSWKDVEVCSSALS CL.SQVS V 
HLQGLESLFILPGMEVEQRDSQMALVES 
LE YVRGE I S KAMAD FTTWKTHLLTSDSQ 
GGNQMLDEGFREDFSEQMEIAIRAILCA 
I QNIiE ER KNE 


4409 


18310 


A 


4438 




231 


SDTSRPQLPCPAARERFPDGPLSLRPLP 
FFLSQVYNEQIHDLLEPKGPLAIREDPD 
KGVWQGLS FHQVWDWARVG 


4410 


18311 


A 


4439 


1 


365 


MAHAMEEVKKCLGPDMMDDI CHEQFLEL 
SYLNGVPEPSRGRGVPVRGRRAAPPPPP 
VPRGRGVG P PRWALVRGTPVRGAI TRGA 
TVTRGVPPPPTVRGAPAPRALTAVMQRI 
PLSPPPATK 


4411 


18312 


A 


4440 


92 


510 


LAAIiPDGGQELHIPRSRAGPAGELSWAV 
VGTGALSSLS ILFQMARCRPWPPLRPNS 
PLQGWI FNWPWLVAI PASLFSGFLSDH 
LXNQGYRAirVRKLMQGMGLGLSSVFAXi 
CLGHTSS FCESWFASAS XGLQTFNHR 


4412 


18313 


A 


4441 


2 


266 


FPPHLPELGKWECPWHQCDECSSAAVSF 
CEFS PHS FCKDHEKGALVPSALEGRLCC 
S EHDPMAP VS PEYWSKI KCKWE S QDHSE 
EVKE 


4413 


18314 


A 


4442 


2 


279 


KGAELVFLPSNTRSLMYPLiDQGWRTFK 
AHYAGYSMERIVSAMEENPGREKIMKKL 
LKLSS PRE I PAGEKLCPNDAHDFTELDD 
RVIHENHE 


4414 


18315 


A 


4443 


2 


371 


KWVTFISLLFLFSATDSRGVSRLDAHrS 
EVAHRFKDLGEENFKALEAIAFAQYLQP 
WPFEHHVELGNDVTESAKTCGADESADN 
CDKSLHTLFGDTLCTVATLRETYGEMAD 
CCSQPEPCEKG 


4415 


18316 


A 


4444 


2 


372 


WVTFISLLYLFSSGYSRGVFRRDAHKSE 
VAHRFKDLGQENFKAL VL I AFAQYLQQG 
S FEDHVKIjMNEVTEFAETCVADES sddc 
DKSLHTLFGDTLCTVATLRETYGEMADC 
CAQQEPERNEC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 

correspon 
ding to 

acid 

residue of 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=G)ycinc, JH=Histidine, I=Isoleucine 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparngine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 

\V=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 

nucleotide insertion 


4416 


18317 


A 


4445 


1 


283 


FRKNAEDILTMEGLRATMKHEIjEAAQKK 
HSLWE LL.RI PN I CKR I W I LS FVRSAG VC 
WRSTPDP VCLG ITSGGCRTAQ I PACS FL 
W 1CLHS I GAPA 


4417 


18318 


A 


4446 


173 


2 


KKKKCV1,YPEGHGVFHALVCVCVCVCVC 
VCVCIQVNFRNFCFISWIN1.SVQVSIjL.D 
Y 


4418 


18319 


A 


4447 


104 


221 


SCHLHYSLGDKSETLSKNHHYHHHHHNH 
HHHPAGYFKMK 


4419 


18320 


A 


4448 


241 


1 


WVSWDKAPSWVRGLGSGSRGGVWGGGSP 
PHRVTDVSCSFLPDGRHVWEMEAKTDRD 
LCKPVSAFGAAGATVGPQGLGGRC 


4420 


18321 


A 


4449 


229 


349 


WKRSHS FGLDWAHACNPSTFGGQGGRI 
MRSGVQDQPGQY 


4421 


18322 


A 


4450 


76 


189 


TESKPAATTRSSGGGGGGGGKRGGKKDD 
SHWWSRFQKV 














RLLIGLKIWEKKVPQVSTPTLVEVSRNL 
RKVGSKCCKHPEAKRMPCAEHYLSWLN 
QLCVLHEKT PVSDRVTKCCTES 


4423 


18324 


A 


4452 


2 


371 


KFQNALLVRYTKKVPQGSTPTIVKVSRN 
LGKVG I KCCKHS EAKGMPCAEDYL.S WL 
NQVCVLHEKTPASDRVTKCCTE SLVNRR 
PCFSALEDYETYVPKECNAETFTFHADI 
CTLYENERIiFQ 


4424 


18325 


A 


4453 


118 


369 


GRYILLLKTKKQTANNNIKTPQYLSNMS 
KKFRHS E FF FFYLKWS LALQRCNI.GSLQ 
PTPPRFNQLSCLSliPNSWDYRCVPPNLV 


4425 


18326 


A 


.4454 


364 


3 


GHLSLQRLLLPFVWLCPAPRGGAYRGRQ 
ASLSCGGLHPVRASRLLCLtPNQACTMAG 
APPPASLLPCSLISDCCASNQRDSVGVG 
PSEPCAGYNLLVCRFI.SPSEKRSPSAGV 
MRFSRCRL 


4426 


18327 


A 


4455 


1 


379 


AAFTECCQAADRAACLLPKLDELRDEGK 
AS S ARQRL KCASLQ KFGERAFKA WAVAR 
IiSQRFPKAEFAEVAKLVTDLTKVHTECC 
HGDLLECADDRSDLAKYICENQDS ISSK 
LKECCEKPLLEKSH 


4427 


18328 










LRPARSLVFPWFAPGGSGLiRGIjKLLEAK 
CQGDGVSYEETTIPRPSAYHNLFGLjPLI 
SRRDAEWLTSRELDSLALNQSTGLPTL 
TLPRGTTCLPPALLPYLEQFRRIVFWLG 
DDLRSWEA 


4428 


18329 


A 


4457 


250 


330 


EWWTHLWLNEGFASW I e YVCVDHCFPE 


4429 


18330 


A 


4459 


96 


319 


YLTI P FS QAMEN CKDARLTTS I GAYNFN 
HRLLE M I LNKPGLT Y K P VCNQ VS ALS LL 
S FLFFMPliFLSYCQVS IH 














NSQTXjKLSNVQCIjKTRALFFFPKGI YPQ 
IKTLPPCPTSGPDPYVRVNLLPKGKGPC 
LKKISGNRKTLKPLFDKTLKFLVPMEKV 
PKSPLNVPVKNI 


4431 


18332 


A 


4461 


2 


357 


GYDGVKRWTRDGD I FNKELLLI PIHLEV 
HWSLISVDVRRRTITYFDSQRTLNRRCP 
KHIAKYLQAEAVKKDRLDFHQGWKGYFK 
MYCKHLALSQPFSFTQQDMPKLRRQIYK 
ELCHCKF 


4432 


18333 


A 


4462 


182 


325 


IRLSSWLCDCASCQVYEFRVKESSI IAP 
APAEDVDTPPRKKKRKHRL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
loca tion 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianinc 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4433 


18334. 


A 


4463 


1 


338 


SRHSPGPTPQPDCSLRTGQRSVQVSDTS 
SCSQLSSSSGSSSSSVAPAAGTWVLQAS 
QCSLTKACRQPPIVFLPKLVYDMVESTD 
SSGLPKAASLLPSPSVMWASSFRPLLSK 


4434 


18335 


A 


4464 


2 


343 


IIDFFLGAYLKDEVLKIMPVQKQTRAGQ 
RTRFKAFVAI GD YNGHVGLG VKCS KEVA 
TAIRGAIILAKLSIVPVRRGYWGNKIGK 
PHTVP CKVTGRCGS VLVSL I PAPRGTG I 
VS 


4435 


18336 


A 


4465 




371 


INSTLKMS FVGENSGVKMGSEDWEKDEP 
QCCLESI FFALAS SLS S ASAEKGS P ILL 
GVSKGEFCLYCDKDKGQSHPSLQLKKEK 
LMKLAAQKESARRPFIFYRAQVGSWNML 
ESAAHPGWPIC 


4436 


18337 


A 


4466 


246 


12 


LCFTPLPSYILFFFETKSHSASPKLECS 
GVITATSEVILLGGGGCSELRSCYCTPT 
WVTKLDSITKKRRKRKRKDLY 


4437 


18338 


A 


4467 


2 


355 


WVT F I S LLPLFS S AY S RG VFRRDAHTS E 
VAHRFKDLGEENFKALALIAFAQYLQQC 
P FEDH VKLANE VT E FAKT C VADE S AENC 
DKSLHTLFGDKLCTVATLRETYGEMADC 
CAKQEP 


4438 


18339 


A 


4468 


2 


361 


EDHVKLVNEVTEFGKTCVADESAENCDK 
SLHTLFGDKLCSVATIRETYGEMADCCA 
KHEPERNECFLQHKDDNPNLPRLVRPEV 
DEMCTAFHDTDETFLKKYLYEIARRHPY 
FYAPELLL 


4439 


18340 


A 


4469 


1 


373 


SSQPSSPSESVSGTVSVSPSSLPSSPCL 
SLSVCLLVCVSLSMFFSLPVCLSVPLCV 
SPRPHPSVSHAHPHRASALVNCPWGPAP 
TSAPVS FYKPQS PALKTGQAS PHPHPHP 
FPTPSPMQSQKK 


4440 


18341 


A 


4470 


236 


371 


KVLGAVLKDKGGLFCFVFGDRVSLCHPG 
WSAWQSQLTAAVTSQA 


4441 


18342 


A 


4471 


345 


201 


AWMTAHLFMAWFTEYFKSTAQKKKFVFK 
TLLL I DNFDPLWDS KEGKL 


4442 


18343 


A 


4472 


1 


144 


AGFHHVGEAGLGLLTS RDP PAS PCKRAG 
ITGLNHHTW P FS VI LTYTYY 


4443 


18344 


A 


4473 


3 


333 


KNHLEASIGEHPFFPDHSPSCPHLLSSS 
NYHSLSCFLGGFFSETGSAPSPRLECSG 
VIAAHCSLKLLGSTSPLTSASWVAGITG 
MCWLYSPAVSPRPSPDLTCTSQLENI 


4444 


18345 


A 


4474 


288 


1 


TSAVNRISIKSQADQEATEQKLRDYSHQ 
GIQILAWQKFEYKPGKVTMDPEDCILQL 
AKKKEEEEEEEEEEEEEEEEEEEEEEED 
QEEEEEEGEEEE 


4445 


18346 


A 


4475 


3 


124 


PRLECSGMIMAHCSLDLLDSSDPPTLAS 
QVAGTTKKISLH 


4446 


18347 


A 

- 


4476 


196 


2 


SCYEHPGRIVYGFLFLRQDLAVTQAGVP 
WHDHGS LQPLP PELKR PSGTSHLLS S QG 
YRRAQPRR 


4447 


18348 


A 


4477 


3 


162 


LLERNKELEGSLQQMYSTNEEQ VQE I E V 
RALHVLAHTPRGPLGLCVPLCNLE 


4448 


18349 


A 


4478 


167 


2 


ACGISPFSLSSSCFGHVRHASLPLAFHH 
DCGFPEASPAMLPVQPSQLNLFIKYPV 


4449 


18350 


A 


4479 


195 


26 


WVS M LD I HDNGPH E VA V I C F VC FLSQS L 
ALLPRLECSGVISAHCNLQPGRQSQTPS 


4450 


18351 


A 


4480 


175 


1 


NRSATKCDWGEKRTSQELNLFFWFFVCL 
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Predict- 
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peptide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N = Asparagine, P = Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














FEMESHSVAQAGVQWRNPGSLKPLPPRF 
KR 


4451 


18352 


A 


4481 


122 


357 


YHAFLYQWYNLKREHKRFRCSLLI I FLG 
GERKLQSIQLWNISHYIQIRRYVLEIPS 
GLGAVAHACNPSTLGGQGEW IT 


4452 


18353 


A 


4482 


137 


2 


FYSYLKILEEMGPHYVPQAGLELLGSRD 
PPDSAYQSTEIIGMSHH 


4453 


18354 


A 


4483 


347 


208 


GQAGLiEIjLTSSIjPASASQS AG I TGAGHH 
TRPRIYFLKKYLAFVLGS 


4454 


18355 


A" 








RLNRRGGGCSEPRSCRCTPAWATEQNS I 
SKKNQQKTTKLPliCCRLILL 


4455 












PCLLKMQQLAGHGGMHIjCSQIjXjRRIjRCF 
LHSYP 


4456 


18357 


A 


4486 


1 




HSAHSTHAGHAGHTSLPKCWDYRSEPPR 
LAKFLNTRY 


4457 


18358 


A 








QVAATLNNXiAVLYGKRGKYKEAEPLCKR " 
ALE I RE KVQKKGAWFFE I F VLC VCGF F 
FFFKRDP 


4458 


18359 


A 








RGCSEPRSHDCSAAWLTERDS VSKKNKN 
KSPKKQ 


4459 


18360 


A 


4489 


337 




YSINFALILI FLFYLGVIiVWAAIINYHR 
LHGLNNKHLL IM I WMVGS PRLSCG 


4460 


18361 


A 


4490 


229 


345 


LENETLGRARWLTPVIPALWEAEASRSR 
GQE I ET I T ,ANM 


4461 


18362 


A 


4491 


180 


365 


ENNLKTLA/ALDMSFYFILFLFYFILFYF 
ILFYFLRWSFALVAQAGVQWRYLGSLQP 
LPPRFK 


4462 . 


18363 


A 


4492 


269 




GNLQNLE ECVFYKI QHPFM I KTLNKLVI D 
RTYLNIIKAIY 


4463 


18364 


A 








Q L3j V S L I NXj F LQ A F KLQ I D CG H T CM H F W 
RQNLCLLPPTLPTPRRNLTLSPSLECSG 
T I WAHRKLRL PGS HHS PAS ATR 


4464 


18365 


A 


4494 


305 


379 


YIKIENFCLGWAHTCNPSTLGGRG 














ETATKTQRRWETAXjVPQHPHRPIiAPVTW 
VKTTGNSTIANFLL.I LVYLS FLFLVYSC 


4466 . 


18367 


A 


4496 


3 


129 


ITLNPGGRGCSELRS CHCTPAWATERDS 
VSKKKKKFPPPPF ' ' 


4467 


18368 


A 


4497 


396 


112 


LPELE FEAAVSHDCASELQLGHQSKTLF 
QKKINRIINNNNNSHIAISLLGIYPKEF 
KAGTQRD ICTPMI TASLFT I VKRQKQPK 
CLATNELVYRM 


4468 


18369 


A 


4498 


1 


152 


LSLPGSQTWWRALIVPATWEAEVGGSPE 
PRRSRLQVAMITPLHSSCGRRG 


4469 


18370 


A 


4499 


92 


2 


KRPG WTHACNPS TLGGRGGP I TKSWR 


■ 4470 


18371 


A 


4500 


116 


1 


AGIAGMCLHAQLIFLYFLVETRFCHLGQ 
AGLELLTLWS 


4471 


18372 


A 


4501 


99 


1 


QGTPVHRWWKCKLVQPLWRTVWS FLRKV 
KIDNS 


4472 


18373 


A 


4502 


140 


288 


RQFHYVAQAGLELLGFSNLPAL.TSQSAG 
ITGASHHNQPTSRFLKEKKIG 


4473 


18374 


A 


4503 


330 


181 


FFFFLFFFFFFFFFFFFXXFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFF 


4474 


18375 


A 


4504 


171 


2 


VETVSKESSLRICIiQDIRYFFFETKPHS 
IAQAAVQWHYQGSLQPQPLRLKQSSRLS 


4475 


18376 


A 


4505 


345 


74 


SVERTCHSPKPLMLFLPLVFQARDDILN 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=ProIine, 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GSHPVSFEKACEFGGFQAQIQFGPHVEH 
KHKPGFLEACKPYQTAEPASALTLDFPA 
SRTVQK 


4476 


18377 


A 


4506 


268 


102 


KNIRELWPGAVAHACNPSTPGGRGRWIT 
RSGDWLNLRTCEHVILQQKGKFRFLIS 


4477 


18378 


A 


4507 


1 


67 


GLPKCWDYRREPPHPARNCKYF 


4478 


18379 


A 


4508 


339 


180 


ANFFVFFEEMGFHRVAQTGIiEVTjGSSDL 
PILASKSAGITGISYSVRPQTNTE 


4479 


18380 


A 


4509 


174 


3 


nepkwiyiipgdplffllllffetesh 
yvtqagvqwhdlgsl.qplpprlqrfscv 


4480 


18381 


A 


4510 


84 


3 


AHHI FTVGIDVDTRAYFTSATI I I AI 


4481 


18382 


A 


4511 


127 


2 


LLIKIHCWPGWAHACNPSTLGVRGRWI 
TRSGVQDQPEQYG 


4482 


18383 


A 


4512 


109 


345 


E I FL I QGHEGS CLCFLFGS FMCFVATGF 
PHVTWLI LTLSNLPTS SYSRAG I TGVSH 
HARANNLFCFGSMPSHGITGLNF 


4483 


18384 


A 


4513 


290 


13 


RDLRPTPQKGPKNHTAPNPLQRAQTHVK 
KGRGPTPPTPTPSKDPGPHPPLRKPKPW 
GR VWGEKKKKNKNG P KDKKG KKKKKKKE 
RKIQHSRF 


4484 


18385 


A 


4514 


234 


362 


DGDVXiSKFVNFFSFKWEGTPLIDGRRL 
KYRLNGDFLFLRIiKLi 


4485 


18386 


A 


4515 


190 


330 


CLALQS E CFRRWS FAL I AQAG VQWRDLS 
SPQPLPPVFKRFSSLHPFE 


4486 


18387 


A 


4516 


2 


199 


.CVCVCVCVCVCVCI FFFFLSKKWGPPPL 
CPPGGPKTLGPKESPPLAPKNRGNSGLT 
PPPPPTFFF 


4487 


18388 


A 


4517 


94 


339 


SAYLGLKFFLFFFFFFFFFFFGVFPPFF 
FPNKKKLKPPQIFFPPQKRPPIFFFPFP 
PGGKNFFFFGKFFFNPREKGPPKNLG 


4488 


18389 


A 


4518 


95 


3 


IjYLGPGAVAHACNPSTLGGRGGQITRSG 
VQ 


4489 


18390 


A 


4519 


14 


121 


LIFFGRNRISLCCPVWSQTPGLKPSSWL 
SLPKCSD 


4490 


18391 


A 


4520 


207 


2 


ISREKLCTVRIPDMHVIEDSILFYYLFY 
S LAL PPRLE CNGT I MAHCSLHLPGSSNP 
PAS AS QVAGTTS 


4491 


18392 


A 


4521 


169 


1 


ANLTQVGSYYSLSLHTLIQVTFFKNIRA 
VRPGTVAHACNPRTLGGPDGWITRSGDR 


4492 


18393 


A 


4522 




82 


G F I LC VCVCVCVCVCVCVCVL YF IFF 


4493 


18394 


A 


4523 


300 


3 


LFLPPS PLPSQ I PGGQKPLPL I FPPRVP 
KKFIPPNFWAPPLISPRPFWVRFLGPPT 
QGFNYPLGPGPPPPPPFSLGFKKSPNPF 
FIKKKFSPPPKKQK 


4494 


18395 


A 


4524 


348 


94 


KPGEGGKGPQNPPPGGLKPGNPLNPGGG 
GSPEPNFPPWPPPRGAKKKTRPQKKKKK 
KKGRKESRLNYGMEKRMKEKEKVFIiSAL 
N 


4495 


18396 


A 


4525 


313 


161 


RGHPKNLKSKEVNIRREAPGHPRRNKKK 
TPNLKKKKKKKKKENKKQKQKP 


4496 


18397 


A 


4526 


118 


3 


FFFFLRGS FALVAQAWQ WRYLS S LQAL 
PPGFKRFSCP 


4497 


18398 


A 


4527 


153 


2 


KDAINKMKKWPGVVAHACNPSTLGGQGG 
WITRSGAQDQPSHYGETPSLPK 


4498 


18399 


A 


4528 


258 


2 


GGQKRGLPGFIKKPPFRKIFLKWLGEGP 
GKNSLSKFFFFFFFETESHCLTQAEVQW 
LNLSSQQNKPPRFKQFLCLNHPSSWDYR 
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Inst amino 
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peptide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Gluramic Acid, F=Phenylatanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K.=Lysine, L=Leucine, M=Methionine, 
N=Aspnragine, P=Proline, 
Q=Glutamiiie, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 
















4499 


18400 


A 


4529 


296 


39 


SIRI INIQILLGFFETGCHSVAQAHLKL 
LGSND P PAS AS E STE I TEMQH CTRLS T Q 
IFVAFPYSNIKLSTKKEPILIKLLLYNC 
K 


4500 


18401 


A 


4530 


292 


37 


SIRI INIQILiLGFCETGCHSVAQAHLKL 
LGSNDP PASASESTE I TEMQQ CTRLS I L 

K 


4501 


18402 


A 


4531 


3 


138 


GLWNGSCLCVCVCIFMGMGACLVCICTC 
LiYCCVPVNTCLCMDGR 


4502 


18403 


A 


4532 


380 


2 


GGPPKKRESAKAPGGRNPQGGGREKTPP 
PKKKRPPQKGGGKKTPPGGGGGPPRGGG 
KKKPPPKREKECHRGGRGGGGPPWTPK 

FILFFPNIVINGFY 














LSKITHAS IPVPVSKKKKKKKKEKKKKK 
KKKKKKKKKKKKKKKGGGGFKKTLGGHN 
FSRGIiE 




18405 




4534 


370 


48 


GPAPEVSSNGGKGGGCPKGLGKPLGSQF 
NGVKGYPSPKSSPLFQAPLGGQGFPKPR 
GNGGTPPFFPKKKKKKCALGDGTPTSGC 
PRRSWNHPETQDGCGGWGSRVPR 


4505 


18406 


A 


4535 


147 


1 


KGPPIiFFFFFFFFFFFFFFFFFFFFFFF 


4506 


18407 


A 


4536 


198 


1 


NPPYFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFLiSFIFI FIiYFNFI 
SSNIVFIGFY 


4507 


18408 


A 


4537 


2 


356 


QNRTTGNS KTQS AS P P PKECS AS PATEQ 
S WMENDFDELREEG FTRSNYS ELREDIQ 
TKGKE VENFE KHLHE C I TT I TNTE KCLiQ 
ELTELKTTARELRDECRSLTSRRDQLiAE 
RVSALE 














KCSGTI IAQCNIKLLiGSSNPPVSASQSA 
GITGMSHHVQPKYC 


4509 


18410 


A 


4539 


308 


2 


FSISCPHDT P AAAS QS AG I TG VS HCARP 
I FSFI CFFF I CIMCI KYLLRTGDSVMYF 
HGLGC FGLCCYLE S Y I RKLSHMKKLS IL 
ITSREKCKCILQINIGDR 














IKQVVSTALiSPRLECSGTITAHCSLiKLiP 
GSASASWARTTGTCHHAQLIYFSETGS 
HYVDVDAA 


4511 


18412 


A 


4541 


101 


2 


MEENLGNT I QD IGRGKDFTS KTP KAMAT 


4512 


18413 


A 


4542 


1 


84 


AEEAEAEAEAEEEEEEEEEEEEEEEIKY 














KI Q I LFFVS S KTRLRPSAWHPRFVFC FL 
RWSLAIAAQAGVQWHHLS SLQPLLPGFK 


4514 


18415 


A 


4544 


1 


335 


RKKKKKEKKKEKKKKKQGRKKRKRERKE 
EEAKKKTKNKDK 


4515 


18416 


A 


4545 


104 


2 


QKKKKKKKKKKKKKKKKKKIYDFFLQIN 
CMCSHK 


4516 


18417 


A 


4546 


5 


108 


FKRLSCRSFPSSWDYRREPPRPANFCIL 
SRDGIS 


4517 


18418 


A 


4547 


131 


1 


RPRRPQVGLQDATSPIIEELITFHDHAL 
IIIFLICFLVLYALF 


4518 


18419 


A 


4548 


156 


3 


KGGQGMS CLS RS PS ALLKFE CRV I S S YW 
AW WRAP WPATRGAE AGES LE P 
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nucleotide 

locution 

correspond 
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Predict- 
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nucle- 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalnnine, 
G=GIycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparngine, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4519 


18420 


A 


4549 


2 


340 


LNPGSAGCRELRSHHCTPAWTTERGSVS 
KKKKKKKKKKKKKKKKKKKTFFGGGPLW 
EREFLLGNFWEKKGGGANFFPLENPNGP 
QLKTWGKKAFFPRKGGGPLNPPQGFYQG 
A 


4520 


18421 


A 


4550 


165 


335 


PHYLPRRSIIHRPFKSHSPEPGAVAHAC 
NPSTLGGRGGWITSGQEFETSLINMEKS 
L 


4521 


18422 


A 


4551 


123 


3 


GRVDRLSCLSLLNSWDYGHLPLHLANFC 
IFSRDGVLSSW 


4522 


18423 


A 


4552 


3 


131 


HPEVDIPILPGLGIISDIGAYYSGISEP 
FGYVGMDVSYDPVRL 


4523 


18424 


A 


4553 


87 


364 


IliASQSWFRYAASAASGGLLETESQPS 
AAGTAS AAVPS LEAQT PAAFR PTPPGLA 
PSSPPAPGPPAPTGKVCGRHFLRSESSA 
SGPLRALL 


4524 


18425 


A 


4554 


114 


0 


LTWAPHSLIETIRNQIIQALLITILLGL 
YFTLLQRSE 


4525 


18426 


A 


4555 


91 


3 


TTTTYIAIATACVCVCVCACVCPCVCVC 
M 


4526 


18427 


A 


4556 


239 


364 


CIKITFSRPGAVVHACNPSTLGGQGGRI 
AMS RDRDHPGQQGK 


4527 


18428 


A 


4557 


250 


1 


STEHVKQLRNEFMKISVILIFLEILIiYI 
ECVHLKLCTLTTDHLLLGIKSLKILFRPG 
AVAHACNPSTLGGRGGRI PRSGDRDHS 


4528 


18429 


A 


4558 


2 


90 


QDGLDLLTWGSACLSLPKCWDYRSEPSL 

F 


4529 


18430 


A 


4559 


65 


394 


DPVSKKKKKKKKIKNEKEKKFFGKKKPL 
KKKLRADLILKPKKKNFLKKREWKKREK 
GGDPFWKNKPQVGKTGNNLNLGQNGIiGK 
GPS DLKKNTPE TKLNL VQKKFQI FF 


4530 


18431 


A 


4560 


93 


2 


GIISAHCNLCLPGSEDSPASASQVAGIT 
GQ 


4531 


18432 


A 


4561 


125 


4 


GAYILYLNKKSKS IRPGMVAHACNLSTL 
GGRGGRITRSGD 


4532 


18433 


A 


4562 


3 


84 


RSCHCTPAWATERETLSPKKKKKKNF 


4533 


18434 


A 


4563 


145 


3 


GFYHWPGGVEQSPCLSLPKYl'JNYHCEP 
P PLAS I GF FFFFFETQSRS 


4534 


18435 


A 


4564 


125 


388 


IRKPQILYSPQSSENMQHLHISVLMGVT 
GKRGKFMDILDVTERCRFSLKKCTGIQE 
DLNKWEGIPCLWI IRLNTVKMAVFSKLI 
CRFS 


4535 


18436 


A 


4565 


59 


325 


VARKGNLVSFLV I F Y PLFD I EE V I E PLW 
VLISSFDEGEKVELEFFFFFFFKKKSPF 
VTQAE FQWPNLSSLKP QPPRLKQ F FCLG 
LLIT 


4536 


18437 


A 


4566 


326 


3 


PEKE FRRLVI KL I REAPEKGEAQCKE I Q 
KS I QEVKGE I FKE I DRIKKKQbKHQETL 
D I LiLVMQNALESL S NR I EQ VE ERNS EL E 
DKVFDLTQSNKDKAKRIRKYEQS 


4537 


18438 


A 


4567 


333 


209 


EQAFD P YGTLSQG I FS PKFHE P FHKLMA 
RPAVALPFQVNLK 


4538 


18439 


A 


4568 


125 


2 


HNQSMSRS IDCSDPILGHCNLCLPGSSD 
S PASASQ VAGTTG 


4539 


18440 


A 


4569 


212 


2 


P I CLSGKKC YGQNVRVIKCHMWKHNHRL 
GWVHACNPNSLVGQGGRFMRSGVLEQH 
EQHAETPSLLKTLK 


4540 


18441 


A 


4570 


142 


14 


GRVDRSNPGRFLSTSNSSLYERPRSIRP 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
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K=Lysine, L=Leucine, M=Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














TSQAPSPVNHIIST 


4541 


18442 


A 


4571 


2 


419 


ARDS FLHDSQTS FCFSDS I PTPSNMEET 
QQKSNLELLRISLL.LIESRLEPVRYLRS 
MFANNLVHDTSDSDDYHLLKDLEEGIQT 
LMGRL.EDGSRRTGQILKQTYSKCDTNSH 
NHDALLTNYGLLYCFRKDMDKVETFIjR 


4542 


18443 


A 


4572 


3 


194 


HEGRLQSQLLRRLRQKNCLNTGGGGCSE 
PRSHHSFPAWVTEHGFISQISLKPLFIS 
HPEPFLLi 


4543 


18444 


A 


4573 


2 


286 


CRDGLTYND FL IL PG Y I DFTADHVDLTS 
ALTKRITLKTPLVYSPMDTVTEAVMAIA 
MALTGGIGFIHHNCTPEFHANEVRKVNK 
YEQGFITDPW 














RQWLQEEAYITKEQKYSFLHDSQTSLCF 
SDSIPTSSNMEETQQKSNLELLHISLLL 
IESRLEPVRFLRSTFTNNLVYDTSDSDD 
YHLLQDLEEG 


4545 


18446 


A 


4575 


374 


33 


LRLARRSGASLCGSRSGGPPTCANICPR 
YAPLI PPCRTSLLEAQGLVMWGPDFRV 
RLVTSSERRFRYQPPPPGFNRFPCLSLP 
SNWDYRHAPPRLANFNSLFFFFFLVETG 


4546 


18447 


A 


4576 


400 


225 


PIFFWFFLFVLFLVETRLCHWQAGLE 
LLGSSNLPISASQSAKITGMSHPAWPRA 


4547 


18448 


A 


4577 


2 


372 


ARETIFCFSNS I PAPWNMDETRQKSNLQ 
LLRISLLILLYRTRLTLPCITQLiHILQT 
YKVNARCSHFHSS S TCGPLYSCTLYVS L 
TGLDMKLCHSTLIiKQNSALMSLLKIQVN 
LNQVS PVFNAH 


4548 


18449 


A 


4578 


349 


174 


GCGEPRSHHCTPAWATRAKRCLKKKLKL 
QLKNWKESKHQPLQNFSHLTTSTINTQ 
EN 


4549 


18450 


A 


4579 




124 


HELBNPEGEGCSEPRSCHCTLAWVTRAR 
LRLKKKKKKKKN 


4550 


18451 


A 


4580 


181 


1 


MSSYYFSPCSRLLMATVYNLHLPSSSNS 
PAS AS Q VAGNAGARH YAWL I F VLLVE TG 
LCSC 


4551 


18452 


A 


4581 


227 


3 


HRKRNRCVSRTALLFSLGDGVS PCLKI I 
III KVLFWL.GE VAHACSPS PLGSRGGR I 
TRSGVRDQPGQHGETPSC 


4552 


18453 


A 


4582 


257 


2 


RVRQLGAFSLSPGSGS FSKI LGALKIMG 
P PRAG PVGS PRYS RPLGGPGRQ I P RSGD 
PGPPGLPRGNPVPLKKNPKRSLLILPAR 
A 


4553 


18454 


A 


4583 


3 


94 


HE I AMAFL ILTERKI LGYIQLRKGPNW ™ 


4554 


18455 


A 


4584 


345 


50 


IYIYICIFLNRNRVSLHCPGWSKTPGLK ' 
QSSHLGL P KCWD YS HE PLCPAKTSHFKA 
QDR I P KND P S CTPS PAQMSTNL I GWAS S 
YLQSYSPAPYNNKN 


4555 


18456 


A 


4585 


338 


19 


FQLSRRFRP KNGGN PG S G AC S E PRLG P C 
PSAWVTKRDSVSKKKKEYHTLSGLSTTG 
IYFSQFWMLGVQGASRFGVWGGAVSWSI 
DGAVLLRPHMAEGARQIiSGLGF 


4556 


18457 


A 


4586 


2 


224 


ARGVGRAGLELQTSGDPPASASRGAGIT " 
GVSHR VQLLGDRAKFSLKKKKRGI I LPG 
EFYSPLKKRFTLNTPDNP 



830 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylnlanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thre6nine, V=Vatine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 




18458 




4587 


344 


168 


PPGTWHTYNPSTLGGQAGRIGSI/GVQD 
KPGQHRQGLSLGVEDKPGQHSETPSVQS 
NP 


4558 


18459 


A 


4588 


212 


328 


GLAVAQAGLELLSSSNPPASASQSAGIT 
GVSKRTQPYF 


4559 


18460 


A 


45S9 


110 


323 


LSNQKNQLYNPNLLIYCVVRKLKSSYDK 
QAEVQGSKLGSLQALPHGFTPLSGLNLP 
SSWEYTRPPPRPTNF 


4560 


18461 


A 


4590 


2 


127 


ARGCSEPRSRHCTPAWATEQVSKKKKKG 
REKTFGQFFFWVI 


4561 


18462 


A 


4591 


211 


2 


ARGSLEAGSSTPAWATINQSTNKKTDKT 
ALYWKKMPSRTYKARQKSMPGFKSSKDR 
LTLFLGAKVSCLV 


4562 


18463 


A 


4592 


3 


120 


HEKTKTNNP I KKWAKDMNRQLS KEDNDN 
KINGWPGAVF 


4563 


18464 


A 


4593 


388 


227 


GHVGQDGLELLTSDAPPSLASQSAGITG 
VSHRAQPKWLLFQHRSPCLPEAAGK 


4564 


18465 


A 


4594 


1 


348 


PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PLPPPPPPPPPSSPSSSPPNPPATTSVA 
C 


4565 


18466 


A 


4595 


2 


138 


ARAARGFRHVGQAGLELLTSGDPPASAS 
RSAGITGVSHHTQPTNF 


4566 


18467 


A 


4596 


315 


2 


ALKPNYSSTSMNI KARLPQYNINNLNLA 
PL.YSSTSNQINI I VX ILINKENNHTLFP 
CLVSNSRLKQSSHL.GLPKCWDYRRELLC 
IjACDKFFISYSAPAIQLLSC 


4567 


1 8468 


A 


4597 


1 


146 


GTSVDQGSLELLTSGDPPASASQSAEIT 
GVSHRTQPPLTI I QLYLGYC 


4568 


18469 


A 


4598 


293 


3 


GGTPWPRKKLGQRKFPHRPPGGSGNPDF 
SLLFFFFLRQSFALVAQAGVQWCDLGSL 
QPPPPVWPFLLRKQFGFSTSCFPILiPAS 
GSFFGREASASC 


4569 


18470 


A 


4599 


174 


2 


RSVAPFLKSARPQTYRKEPTPDTGSPQP 
PPPRFKRFSCLTRLSSWDYRHAPPHPAS 
C 


4570 


18471 


A 


4600 


59 


252 


IiAPFGHELGTSESAHQTDNRHRLEGYQK 
RLDASGLERASYPLAAEFKVGGRGCSSQ 
PLTLCGYP ■ 


4571 


18472 


A 


4601 


233 


366 


IHYLPDAVAHACNPSTLRGQGGRITRSR 
DRDYPGQHGETPFIiLK 


4572 


18473 


A 


4602 


182 


325 


IiFVFCLFLRQCXSVAQAGVQWHDHCSLY 
PQPPSLKQSSRLGLPSSWDY 


4573 


18474 


A 


4603 


202 


2 


GGSSSGLLRPCAVLGLKQSSYLSLVRRW 
DHNSWLKRSTHLGLPKGWDYRREPPHLA 
CILKFSINLV 


4574 


18475 


A 


4604 


317 


2 


FRLGPQVFFSPPGAGFFFCGPVWGPGPF 
SRSLQLLTFGAKFFFCLSLPESFGAPLR 
GPPSGFMAHCFFFLVT.LFFFFFFRDSVL 
LCCPGWSAVAIHRRDPTTARA 


4575 


18476 


A 


4605 


233 


71 


FF VLLVE I EDQLNE KKQEG KI RKKKRMK 
RNKQSLQKIWDYVERPNLAGHSGSRL 


4576 


18477 


A 


4606 


114 




HPRPARVSSCCPNWSAMVWSRLAATSAS 
RVQAILQSRA 


4577 


18478 


A 


4607 


348 


3 


LKFQNFRGWHGARCPSFFGGLGRGIPF 
TPEVEVLVNLGPAPALPFWPPSQTLFSQ 
NQKKNQKKKQERKREKERKEERKKKKER 
KKKEKERERKEGRKKGQEKKRKITAKSL 
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NO: of 
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sequence 
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NO: in 
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Predicted 
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nucleotide 
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ing to first 

residue of 

sequence 


Predict- 
ed end 
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residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Giycine, H=Histidine, I~Isolcucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutaminc, R— Arginine, S=Serine, 
T=Threonine, V=Valine, 

X=Unknown, *=Stop codon, /^possible 

nucleotide insertion 
















4578 


18479 


A 


4608 


174 


3 


TWMWQQCVFVCLLKQGLALSPRLECT 
GI I IAHCNLKLLGPSDP PTSVEKTKKLV 


4579 


18480 


A 


4609 


171 


334 


DPHSSLNIRFSRSVAQAGVQWCNLNSLQ 
PPPPGFQRFLCFSLPSSWDYRCAPP 














ARGSRGHIWSAFCHVGQAGLELLTSRPA 
LASQSAGITGISHRAQPPLNTLYSMS 














AREPRGHIWSSFCHVGQAGLELLTSRPA 
LASQSAGITGISHRAQPPLSTLDSMS 


4582 


18483 


A 


4612 


180 


334 


CCLMGSSYLLPRKPNTLRTAGFLQQRKS 
LIRPGMWHACNPSTLGGQGGWI 


4583 


18484 


A 


4613 


189 


1 


SGFSRWENGLKCKGQKGGQSLVLLPIR 
VFFFFFFEMKSHSFAQARVQWGNLGSLQ 
PLPLGFK 














KSTPRLGl'V/AHTCNPSTLGGQGKRITRS 
GDRDHSGLHKGTPRLSKNI 


4585 


18486 . 


A 


4615 


179 


393 


EKQEEEGSWGWAKTEDGVALMLSEAIGS 
CWI KAFUTDAETFTFHAD I CTLSE KERQ 
IKKQTALVELVKHKP 


4586 


18487 


A 


4616 


2 


126 


ARVQRYLFLFFTLDGPLPDPSMIRGSVP 
NQMMPRITPQSGK 


4587 


18488 


A 


4617 


338 


181 


RVGEAGLELLALS DLPASDS QS AG I TD I 
SHHTRPGSPFLEHHLYPRYRI SGK 


4588 


184S9 


A 


4618 


3 


197 


HEKRPEVQNSTVEFIASSDYMLRPPLPL 
PFCFFLFFFKIKMGVSPCCPGSCQTPSL 
KQSSCLNV 








4619 


1 


339 


GTRTFGSGNGPTKPDLLQELRVATERGL 
VIVNCTHCLQGAVTTDYAAGMAMAGAGV 
ISGFDMTSEAALAKLSYVLGQPGLSLDV 
RKELLTKDLiRGEMTPPSVEERRPSLQGN 
T 




18491 




4620 


3 


306 


LEPGDGGCSELKSCHCTPAWVTERDSIS 
KKKKKKKKGGPLKG TLGG P KFNGGWQRK 
I FSQKGGQKKPNLG I LGRNLNFGGEKNW 
NKFLTKIKVLREKKNF 


4591 


18492 


A 


4621 


397 


29 


NTKEKILSARKNNQVFPFHTQKKNFFPK 
KRKNFREGIPPNFPPPKKSPPQKNHPPQ 
KKKPPKE KKKT I ARP PKKRAP PGAFKKK 
PPPFFFLSSLCISYSFIYFFSLINLFFF 
FFFFHDSTVY 


4592 


18493 


A 


4622 


2 


235 


VSLCHPGGTISTHYNFCLPGSRDPPTSA 
TNS I PAS AS QS AG I TG MNHCT 














SGVRDQPGQY 












287 


NPFSFLFFSFLSFFLFFFFLEKGPRRGA 
RAQKGKPHPKKKKKKKRGGGKKNKTQKP 
F S PLKG PRG P PRGKGQTT P KG P 


4595 


18496 


A 


4625 


102 


] 


THSFPVS I SLSLSHTHTHTHTHTHTE I Y 
RVREIR 


4596 


18497 


A 


4626 


125 


1 


DRHVPQHTANFFWLIEMGFLHVGQAGL 
KLLTSGNPLVPLV 


4597 


18498 


A 


4627 


3 


167 


EREiRQQNCTNPRSEGCSEPGSPHCTPAA 
WATEQDSVSKKEKEKRKKEKKKKIW 


4598 


18499 


A 


4628 


135 


3 


AASTLSRGTITAQGNLCLPVSSNSPASA 
SRVAG I TGARHDTWL 


4599 


18500 


A 


4629 


320 


210 


WFLGI I FRRVENAESVNIDLTYDIQSFT 
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sequence 
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SEQ ID 
NO: in 
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09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predicl- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon,/=possible 
nucleotide deletion, \=possib]e 
nucleotide insertion 














DTGKTPSS 


4600 


18501 


A 


4630 


98 


326 


SSPTLAFQFQYFFSCATKEAVKIjSFPKS 
LLGTGSRSVAQPGEQWHNHGSLQPQPAQ 
LRLSSCLTLVSGWDHRCAPP 


4601 


18502 


A 


4631 


275 


2 


RVKLLLFYMRQSCSVIQVESSSTVNGSL. 
QPQ P PGPRQS S HVS LLSRWDHKRMP PCL 
ANF FLF FWRAKGLAI L P VHSNLVNTESP 
LISIKSN 


4602 


18503 


A 


4632 


180 


1 


KGLTLYFYFFI FKMRRGLAMLADWSQTP 
E LNQ FS CLGL P KCW DFRHE P PH P VKE FK 
PFHR 


4603 


18504 


A 


4633 


234 


5 


KPLRDKPLRSFSRLPKKNLNFFWPFFPQ 
KIWGFFFFFFETESHCVTQAGVQWLDLS 
SLQPPSPGLRQFLCLSLSSS 


4604 


18505 


A 


4634 


171 


2 


GRVD I LTPWQHKETIiFSFFS FF WQGLS V ' 
TRLE CSGVI MAYCSLDLLGS GNP PTS AS 


4605 


18506 


A 


4635 


324 


211 


LRNI IFMPVTMNPGMYTGGCMCVCVCVC 
VCVCLCVCVC 


4606 


18507 


A 


4636 


95 


1 


MRQSINLSPRIiECSGTIPAHCNLCLiPGS 
SNS 


4607 


18508 


A 


4637 


234 


1 


KKGHDTLTEKAMPVTLSSRMLS INVCKC 
IYKVWKNSPNYSSLTHHINNIKPSFTRE 
NTLMFIHLSPILLLiSLiNPDIIT 


4608 


18509 


A 


4638 


2 


243 


WGKVNVDEVGGEALGRLLWYPWTHMYF 
EFFGAFYISFCLFFLIFPHLFFVSSIHI 
VSSFYLLFYNFPIIFFTLFCHFFL 


4609 


18510 


A 


4639 


118 


2 


GR VDLQCSGAI S AHCKLKLQRSRHS PAS 
ASQVGAHATT 


4610 


18511 


A 


4640 


50 


168 


NKINKMHLITFLKKKKKKKKKKKKKKKK 
KKKKKKKKKKN 


4611 


18512 


A 


4641 


315 


87 


GIFLRDRNIiALLiPRLECSGTIIAQCNIjE 
LLSPKDSPASPSPVARTTGVQPRPDTLT 
LFELIYIRYKGMIFSQGDC 


4612 


18513 


A 


4642 


383 


2 


KFKKKKFAGGKAPARLSPPLWGPKGGGP 
PRAGVLSPPGPPGETPFFLKNPKFPGHG 
GGPRKSPFFGGLKGENSLTREKGGPINP 
NLAPPAFPPKGQNPGPFPKKKKKKKNRS 
S PAWNWDW IGTNAGK 


4613 


18514 


A 


4643 


3 


205 


LKRFFCFSLGGQGCSEPRSCQCI PAWET 
QRDS ISKKKKKKKTRPGGLTLEPPPFGG 
VKPKGPLGAGI 


4614 


18515 


A 


4644 


149 


1 


FFP I KADLVFFFFFLRRNLADSQGAVQW 
HDLGSLQAPPPGFMPLSCLSL 


4615 


18516 


A 


4645 


474 


344 


AIHEHY1.PESSASGDCDEDLQAACTILV 
NALMDFHQRLAAHTV 


4616 


18517 


A 


4646 


165 


357 


LCNALSHLLPQCTPTPLISILCI IDTIS 
LLIQPITHAVRLSANITACHI.I.MHIjILIj 
STLSICTL 


4617 


18518 


A 


4647 


2 


339 


ARGENDLSPGGGGCSELRLHHCTPTWAT 
RVNCVSQKKKKNQPKPKPNQTKPNQTRK 
GSPGFPLLNLKVIS PEKSLNPNILKTAP 
KVTFRNWIiTPE I PPFWKMEEKGS FDPR I 


4618 


18519 


A 


4648 


313 


187 


LTSGDPPASASQSAGITGVSHHALPPLV 
QFPSHSLTPTPLFC 


4619 


18520 


A 


4649 


236 


324 


GQVNWPGMMAHACNPSTLGGRSRWIVRS 
R 


4620 


18521 


A 


4650 


225 


313 


GQVMWPGMMAHACNPSTLGGRGRWIVRS 
R 
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26 


Predicted 
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nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

nucle- 

location 
correspon 
ding to 

acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G— Glycine, H~Histidine, I=Isoleucine 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P— Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W= Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 

nucleotide insertion 


4621 


18522 


A 


4651 


240 


76 


FCVKAHTHTHTPS FQASGSSEHMHTYTH 
PSGS PEHTHT YTHTTFLASRSAE PQLP 


4622 


18523 


A 


4652 


273 


379 


KQLAAQCVRPLSPSQAI AS PVTCTYM PR 
WPEVTEE 


4623 


18524 


A 


4653 


369 


189 


GTVKQAG AQ CRNLG SLQPTPPPG FKQ F S 
CKSLYSIMICYSKDCTMSTTLGIRANNR 
LHL 


4624 


18525 


A 




168 




DRICRASSYTIHLMLYFVFLIFIjFYWYI 
YFLFFVLFHFTIFI ILFYSFIIIjLLLLY 
FYN 


4625 


18526 


A 


4655 


56 




WKRTFPFFFFLETGSGFASRPCTLLLFL 
YSLGPNPRVLIRVFLFLSS S RCYY YYLF 
LGLSLLFLFFYIMYLLFIRLFFFVLSGL 
LLLLS S YYRFI I FFF 


4626 


18527 


A 


4656 


50 


186 


SACIiGLPKCWDYRHEALRPAMYLLTGSS 
YVAQ AGLNLLGS AD P PE 


4627 


18528 


A 


4657 


131 


366 


DHFVASSESVKEEGLFFVFVFFFFIFI F 
FFILYICFFYFKFIFIILLYIIIIFIFI 
FYFFYLYYLILIFFLIFFIFIG 


4628 


18529 


A 




290 




LEYIVEKGFHQDDKASPELLTSRYLSDS 
ASQSAGITGHRAWPTKG 


4629 


18530 


A 


4659 


163 


372 


LRLFALFAFLFMFFFYFLFLLYVFCFVF 
FLFFFFTFFLYYFFLYFFLYFLCSLVFL 
FFICFIFFWFFVFF 


4630 












FIFFSNCTSTALVHPLFYLSRLViiSKNR 
NKFIGRARWLTPVFPALWEAEA 


4631 


18532 


A 


4661 


3 


141 


RSRHCTPAWQQSEILSQKERKKERKKKK " 
ERKKEKERKKERKKKKKK 


4632 


18533 


A 


4662 


346 


3 


FLPPFLGENFFFAPPVNFGPPGGFFKGG 
PPFFFFFFFFFFFFFFFFFFFFFFFFSQ 
DSFDNLGIFVIPYKIQDDFSSSMSNATS 
ILIKIAIj 


4633 


18534 


A 


4663 


302 


3 


GDINRRSHSPFVWSKPFAPCFMSSLDVF 
YKSSDFFYFFFETEFHSCCPGGISAYCN 
LCLPG S S NS PAS AS QVAG I TGVRHHTQL 
I FASWETFKITYKN 


4634 


18535 


A 


4664 


2 


76 


RLAPTPQLGGHWPPTGITPLNPLEV 


4635 


18536 


A 


4665 


1 


210 


SPTRSPKPTPPYYQTTLAKPFTKKKKKK 
KKKKKKKKKKKKKRGGAPLKKS PGGPKI 
NRGKKKKFFFLKGG 


4636 




A 








GS R PGGGKRTARR W TR PWRLRGACLG TG 
MAFAFHHQVI YKI KS FLRKKKKKKKKKK 
KKKJCKKKKKKKKKKKKKKKKKKKKKPRG 
GGFLKKLWGGKFFMGGGKIFFFFIRGGK 
KKMWGG 


4637 


18538 


A 


4667 


1 


323 


RTRG I FCYDVCVESGCADIQLIjLLiCIiKK 
KKKKKKKKKKKKKKKGAPLKKNPGGAQN 
NPGKKKKNFSPKRGPKKTPPGNFEKKPF 
FGGGPNGAPPPQKNKTPKEKKKF 


4638 


18539 


A 


4668 


167 


446 


ELNKGLCKPKKKKKKGGGRSQFKPTRPK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KS 


4639 


18540 


A 


4669 


321 


52 


GVFSPFSPKKFFFSPKGL.NFWGGFAPIF 
SPPKKRFFSKNPQGGFKYPPLRGKILXW 
GPRENWGPPGGFLKGAPLFFFFFFFFFF 
FFFFWG 


4640 


18541 


A 


4670 


82 


359 


NAYGIILIKGNEDTKTVKWKSSCQLANT | . 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














WEFHIRALLGFNCFFLIFKRRGLTLLPR 
LDCNGVI I IHNNLESLGLSNPPMLASQV 
ARTTGLCY 


4641 


18542 


A 


4671 


249 


60 


LLIFIYYFYFFLFFFLFFFFFFFFFFFF 
L.FVQVHLPVHLPCYDLSPLYKCVGVLVK 
CPFKYT 


4642 


18543 


A 


4672 


2 


319 


AAAS TL, V F VFS F F WVLG F F FFF FFF F WG 
PPKKILKKKGGGGGGGGGGPRGGPRGGG 
KRNPGGQKKGIGGANGGQKKERNFQGGG 
PPRGEKRGGGGGPGNPKGGGLK 


4643 


18544 


A 


4673 


3 


451 


HESFNPVI ISQZjERGEDPWVLDRKGAKK ' 

SQGLWSDYSDNLKYDHTTACTQQDSLSC 

PWECETKGESQNTDLSPKPLISEQTVIL 

GKTPLGR I DQENNETKQSFCLS PNSVDH 

REVQVLSQSMPLTPHQAVPSGERPYMCV 

ECGKCFGRS 


4644 


18545 


A 


4674 


446 


189 


SISLSTLVNVCQLISLSDPASGFVDHVC 
CIFTFYSINFALILIFLFYLGVLVWAAI 
INYHRLHGLNTKHLLIMIWMVGSPRLSC 
G 


4645 


18546 


A 


4675 


26 


170 


HLWFLVIVAHWASSRQI ISHWKNCTRH 
DCPVCLPLKNASDKRNQQSK 


4646 


18547 


A 


4676 


3 


378 


HEDEFDKMGNQHQALLEAMEQQSISLAK 
AGWCI LPARTS 1 I AVAHS FAGLYLI FN 
TVSWFFFFSNFIFliILSLFNYFCISIY 
ILFLIIFIFIFISSLIVFFCLISFRFLL 
FIFFSALSRIFIY 


4647 


18548 


A 


4677 


18 


242 


PIGLALSPRLECSGVI IAHLLGSNNPPT 
SGSRVAGTTGIHHHAQPHCFNCIFPMSS 
EVEC-LFICLIjIjIHTIjSRA 


4648 


18549 


A 


4678 


288 


2 


P KYRWVPE KPTTAS ENAE IPS ERQKEG I 
KLTIRISSRKKKPDSPPKVLEPETKQEK 
PEKEEEKTNVGRTLRRS PRI SRP PAKVA 
QIRDPKAHKKK 


4649 


18550 


A 


4679 


340 


20S 


SVGQAGLEIXTSGDPPATASQSAGITGL 
SQCYPPDSVPETFLPI, 


4650 


18551 


A 


4680 


4 


379 


EENTG INLHDLGFGNGFLDMAPKAQATT 
AKKKKKKKKKIKIKSFLFLGGLFKKGKK 
TPPKGEKNFGNFKFWGGDLTPKI FGDPG 
ASTKKKKPFLKGGRGLKKTFFKKKKKRG 
PHPLESPSPFSVF 


4651 


18552 


A 


4681 


1 


350 


SRFSGAKE AKMEKKI TG YTTVD I SQWHR 
KEHFE AFQSVAQCT YNQTVQLD I TAFLK 
TVKKNKHKFYPAFIHILARLMNAHPEFR 
MAMKDGELVIWDSVHPCYTVFHEQTETF 
SSLW 


4652 


18553 


A 


4682 


365 


45 


SPKWFVASYDPYAGPGRPTRPDNSRDET 
GSHSVSESSSGSDSKRESTSSDREANEP 
SHRAS PE PE PP PTNKWQLDNWLNKVN 


4653 


18554 


A 


4683 


238 


372 


SATPLISSFSFLFFFFFFFREGVWLiCHP 
GWSAVSRDCTTAAVWPG 


4654 


18555 


A 


4684 


3 


462 

0 | 


KTKESREAVE KE FE PLLNWMKDRALKDK 
I EKAWSQRLTES P CAL.VASQYGWSGNM 
ER I MKAQAYQTGKDI S TNY YASQKKTFE 
INPRHPL I RDMLRR I JCEDEDDKTVLDLiA 
VGLFETATLRSGYLLPDTKAYGDRI ERM 
LRLSLNIDPDARG 


4655 


18556 


A 


4686 


410 




FFFFFFFFFFFLLGKGGFGIIKFL 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
cth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 

amino acid 

peptide 
sequence 


Predict- 
ed end 

location 
ding to 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M^Methionine, 

Q=Glutamine, R=Arginine, S=Serine, 
T— Threonine, V = Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 


4656 


18557 


A 


4687 


151 


2 


IFFFFFDRVQWHKLGSLQPLPPRFQQFS 
CLSLPSSWEYRHAPPRVRPRV 


4657 


18558 


A 


4688 


342 


183 


FFLFLRWSFALVPQAGVQWCDLGSLQPL 
SPGFKRFSWAEVGGLHEPRSQRLQ 


4658 


18559 


A 


4689 


126 


404 


NCQVWPLRFTGHSVFSVHPQWFILFYCF 
ALYFIFLEMGSYSVTQAGVQWHSLGSLQ 
PPQWSYVLjTFIjWHPSSWGYFVQNHLTPC 
FDGKVPWV 


4659 


18560 


A 


4690 


2 


128 


GESLFYIVSPRDVWAKERDQDDHIDWIj 
LEKKKYEVPLIXjLL 


4660 


18561 


A 


4691 


407 


2 


yfffplprmkifffphqkgfffggcpii 
spppnevffspkeggfffsppkkrvffh 
nppegffppppffmsppppffffffslf 
fffffffffffffffffffffffffsfr 

IHjLNQKKNLSACVELiRSCCTFM 


466 


8 


A 


- 






SVS KKKKKRFFALKFLGE ANVKPT IGGQ 
KN 














EVKS ATPMNTRM I RKRNSL I ADMEKVLV 
VWIEDQTSRNI PIjSQSLiIQNKALjTLFNS 
MKAERGVEAAEEKFEASRGWFMRFKERS 
HFHNIKAQGEAASADVEAAASY 


4663 


18564 


A 


4694 


343 


425 


HCFLSXJQLNSFEQLCIKYTNEKL.QQL.F 














CNVT I LAHCNLHLPGSSNS PAS AS RVAG 
ITGSCHQG 


4665 


18566 


A 


4696 


429 


0 


VTKWCYITKKKKKKKKKKKKKKKKKKKK 


4666 


18567 


A 


4697 


371 


217 


IPVFKQSSCLSFPRSWDYRHMPPCVTQK 
KKIEYQHISIjYKKPSKRCALFLI 


"4667 


]8568 


—g- 


~4698 


~~4T5 


— — 


LQQQCECFXSELTKKKKKKKKKKKKKK.1C 
KRGGGLEKKKKKGFFAPYPPH 


4668 


18569 


A 


4699 


431 


0 


LKKIFPPGGRVLSP9PPPPPPPPPPFPP 


4669 


18570 


A 


4700 


371 


245 


TLGD P PAS ASQS AG ITGVSHHAQ PLFGI 
FIYKFTYICRKTRT 














KQCWFTQKKKKKKKKKKKKKKKKKKKKK 
DRG 


467] 


18572 


A 


4702 


340 


402 


PFAGGGYRLGAAPEEESAYVA 


4672 


18573 


A 


4703 


188 


2 


TSLPKCWEYRHHPPCPAHTLLIFLFFIF 
VDMGS CHVAQVALEIjIjGS S D P PTSAS QS 
AGIIGM 


4673 


18574 


A 


4704 


423 


34 


LKKKKKKKKKKKKKKKKKKKKKKKQKDL 
KKMVGGGNYKNVGGAQILGYKKLiCFS FF 
YRVF 














PRVREREKBMASMKEEFTRLKEAIjEKSE 
ARRKELEEKMVSLLiQEKNDIjQIjQVQAEQ 
DNUU3AEERCDQLI KNKIQLEAKVKEMN 
ERLEDEEEMNRSXjP 


4675 


18576 


A 


4706 


2 


179 


QENGMNPGGRACCEPRLRYCTSS WATER 
DSVSKKNEKKRSASLRRFDIIiFGALTGR 
KAS 


4676 


18577 


A 


4707 


432 


578 


PRIVCFQELRKPSYAEICQRTSKEPPSS 
PLQPSSSSSSSS 


4677 


18578 


A 


4708 


419 


35 


GSFFFFFLKDQPKKGRKKEGCPPPPPKK 
IIYPPPATKWGGGGGKKPPPPKKKGPPK 
KNRGRKKKPPKKKKKKFPPPPKKRAPQK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 

ussN 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKPPPPIFFYLYPFKNFIFFFFFPFY 
FFPFFFFKFIFNLKKF 


4678 


18579 


A 


4709 


147 


31 


NFFFFFFFFFXFXFFFFFXXXKFFLFFF 
* FFFLKLKI FF 


4679 


18580 


A 


4710 


3 


229 


HASAHASDQPERLNAGTYFLFYTLVGSL, 
PLL I AL I YTHNTLGSLN I LLLTLTAQELi 
SKKKKKGGAVLKNPWGAQS 


4680 


18581 


A 


4711 




158 


PTRPPTRPRSCSELiRSHHCTPAWVTERLi 
CLKKKKKKFPNLS VGFKI LG I L.KG 


4681 


18582 


A 


4712 


163 


404 


KKKKKKKKKKKKKKKKKKKKKKKKKKKP 
KKKKNKKKKKKKKKKKKKHPS RGGAQ KK 
KKGGGAQNRAGGKRRPSRGGGEKQN 


4682 


18583 


A 


4713 


3 


378 


FEEPVYIKSRQKRKESNFPKLVSSQPHG 
LHDFFKKKKKKKKKKKKKKKKDGGGPIjK 
KKPGGAKNKPGGEKKNFFLKGGGKKNPW 
GFFKKKTFFGGGKIGAKPPKKKKSLKKK 

kkflrgkggkkt 


4683 


18584 


A 


4714 


2 


208 


vsnpavsvphlpfsvyksspmasmtfsk 
kkkkktkkkkkkkkkkkkkkkkkkggpq 
kkkpwgaqnkpg 


4684 


18585 


A 


4715 


290 


56 


sapppifflffflflfiffifffffkqi 
l.ekkggpppfflffffiffffffyrmgc 
drwwfiliqgtfhrkqkpvlkv 


4685 


18586 


A 


4716 


50 


424 


GGFKIKFLFTQFLFLLFSPSMFLISKSP 
AYIiWQSSPKQVPMWFGKTNFLVSIECK 
KEGIYFFCIPFVPFGFPHPKILCSLLPF 
LAFLRRSLALLPRLECSGAILTHCNPCL 
PGSSNSHAWA 


4686 


18587 


A 


4717 


161 


2 


GRPGPADFRVRPQLLQRFLFI YLFTEME 
S CS VTQAG VQ W CNLG S LQP L PPG LQ 


4687 


18588 


A 


4718 


2 


115 


VYTTAMAIEEYPEAWGSGVEVLVSVLVG 
LAMEVGSTRP 


4688 


18589 


A 


4719 


1 


416 


GNQGGGYGGGYDNYGGGNYGSGNPSSSS 
SSSSSSSSSSSSSSSSSSSSSSSSSSSS 

sssssssssssssssssss 


4689 


18590 


A 


4720 


1 


454 


QPDSGISSIRSFKTILPAAAQDVYYRDE 
IGNVSTSHLLILDDSVEMEIRPRFPLFG 
G WKTHY I VG YNL PS YE Yli YNLGDQ YALK 
MRFVDHVFDEQVIDSLTVKIILPEGAKN 
IEIDSPYEISRAPDELHYTYLDTFGRPV 
IVAYKKNLVEQ 


4690 


18591 


A 


4721 


3 


173 


DAWAGFHYVGQAGLELLTSSHLPASASQ 
SAG I TD VS HRTQLD VP FSFSLSPL PHQC 


4691 


18592 


A 


4722 


2 


201 


LKPGGGGCSEPRSCHCSTPAWRQSETPS 
QKKKKGKWGLERIGWGGSTTNMGAKNQR 
GEKFGKKGGF 


4692 


18593 


A 


4723 


2 


183 


LKPGGGGCSEPRSCHCSTPAWRQSETPS 
QKKKKGKWGLEIICWGGSPTNKRATNSP 
VEKF 


4693 


18594 


A 


4724 


1 


395 


QDATS P I IEELITFHDHALI 1 1 FLICFL 
DLHALSLTLPSALT 


4694 


18595 


A 


4725 


388 


3 


SFSPLEENLGNTIQDIGRGKDFTSKTPK 
AMATKAKID 


4695 


18596 


A 


4726 


223 


44 


WLFFFXPLFFFFFWGGIIFFLLKKLYIF 
FFFMSKI FFFFFFFFFFFFFFFFFFFFF 
FFFF 


4696 


18597 


A 


4727 


41 


374 


KKKKKG KKKKKGG GGGGNP KKKKG KME K 
PPGGNKRGEGGKKKNFLKKKGGGLFKGK 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 

ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N— Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KPPKKKNGGRKPWKKKNFKKKKKKAPFF 
LGPKI FKKKKKRGliKKKLGFFSQSVGP 


4697 


18598 


A 


4728 


323 


69 


FFFLRQS LGSASQAGAQGHNLGS LQPLP 
AGFQLFLQPPPPGSKQFGRLNLLGNKDL 
WGLPVLRAHPQAPPLAFIL 






A 




3 


197 


CLiALLHLS RQRQYALP LLGLPGC PHI HT 
HSHTKAHDHTHYTS PHAHRHAHTHTHMH 
TLPYTTHCI 


4699 


18600 


A 


4730 


309 


443 


VLTLSPRLECSSMNIAHYSL.KLLGSSDH 
PTSASQSGGI IGMSYHA 


4700 


18601 


A 


4731 


3 


172 


DAWAFFNGPGVSLCCTGWSRTPQLKRST 
CLSLPKCWDYRPEPLSPAPKTLFSYVLL 














QTVFLL I WLLALCS PGVLQTVKRFLEFR 
CIFSLGIVTFICLYRN 


4702 


18603 


A 


4733 


238 


1 


LGPPPPSPLGPPPFLVIFRGFSFPKTPL 
FFSPFFFGPPPFFFFFFFFFFIRRSFAL 
VTQAGVQWRDLGSLQPLPPGFKR 


4703 


18604 


A 


4734 


121 


424 


L I QGHRW I YFSQLRKVRGRQACD S TLL 
GHLPKWRRMFPFLSFTVAGLEPTSHYR 
MFVDWLVDQHHWRYQTGKWVQCGKAEG 
TMPGAR 


4704 


18605 


A 


4735 


319 


8 


NPPQKKKKKIFFPPKKKTPPKKIKKNPP 
PFTIFFFFFFFFFFFFFFFFFFFFFFFF 
FFYFQHVWFCFEIIHVIFFLIEVLFLAF 
HVGDS IADAWSTKNSNRTK 














TQGFFGGGPRKIGFFFRGLKNGGGGNPK 
LRFLKPGQIA/GRGKTKKNQG 


4706 


18607 


A 


4737 


430 


0 


CVLpGSKDKKKKKKKKKKKKKKKKKKKKK 
KAKKKDS 


4707 


18608 


A 


4738 


137 


1 


INYYFFLKQGFPLL.PRLECSGAI IAHCS 
LEFPGSSDPRPSASQVA 


4708 


18609 


A 


4739 


188 


3 


ISPKALNPRREVGPIFPPPKKKVPPQNP 
PGGFFTPPHKEKNFSLPPPVNLGPPKDP 
LKRPP 


4709 


18610 


A 


4740 


188 


2 


TSLPKCWDYRHEPPCPAHTLLIFLFFIF 
VDMGSCHVAQVALELLGSSDPPTSASQS 
AGIIGM 


4710 


18611 


A 


4741 


410 


1 


RRPGGGFCPPPPPKIFFSPPPPPFFVGV 
LPQTPPPPKKFFFLNPPPGFFFPPPKKK 
KIFFFSPPFFSPPPIFFFPPPPPFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
GKDRVSLCCSDWSRTPGPKPSSLL 


4711 


18612 


A 


4742 


225 


444 


ENLKIiKSFPCSHFHWIiRWFNLTIGALSR 
S FDEAALLFFQNDTYWPGTVAHACNPST 
LGGRGGR I TKSRDRNHS 


4712 


18613 


A 


4743 


2 


452 


VSNPAVSVPHLPFSVYKSSPMASMTFSK 
KKKKKTKKKKKKKKKKKKKKKKKSGGPQ 
KKKNR 


4713 


18614 


A 


4744 


3 


144 


IaAA V LL KLGG YG 1 1 RL»TL I LN P LT KH I A 
YPILLLSLNPDIITGFSS 


4714 


18615 


A 


4745 


206 


2 


IQNPQTERVNSTSKLKISSKDTINRAKK 
QPMDWEVICKKHIFGKELISRMYKELQL 
NNKKQIVPPTRP 


4715 


18616 


A 


4746 


523 


178 


RHRRFFMDIEREQVKEQQRQKEQKKDIE 
KIKDKREQECYAAEQRILRMNFHEDPYS 
GEKLSEIliAQLQLQEIKGTREKQQREKE 
YLRYVEALRAQIQEKMQLYNITLPPLC 


4716 


18617 


A 


4747 


3 


518 


ENRLNLEGG.G CGE P RS RHCT P T W AT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
L'SSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


4717 


18618 


A 


4748 


491 


145 


LCHVGQAGLELLTSSDLPTLASRSAGIA 
GVSHRTRPHSHFYFYFLYSS S FFFFFFL 
AQRGNRAPRGKKTPFGLATLNI KSNGGV 
FWFLFLGFGFLKREPTFALGLKDRGGNL 
V 


4718 


18619 


A 


4749 


6 


362 


. NKLPNGTE FLSSLVPNTFFFSFLFFPFF 
FFFFFWGKGGPPRGGGAKFYPLEFPPPG 
VPPPPKGGKKGGPPPPTNFLFFKKEGGS 
PFGPGGVKTPGPKGFPPPSPQRGGKKGQ 
WAQKPP 


4719 


18620 


A 


4750 


380 


3 


SLLLAFALLCLPWLQEAGAVQTVPLSRL 
FDHAMLQAHRAHHIiAIDTYQEFEETYI P 
KDQKYS FVHDSQTS FCFSDSI PTPSNME 
EPQHKSNLELLRISLLLIESWLEPVRFL 
RSMFANNLVYDPS 


4720 


18621 


A 


4751 


1 


345 


LFYIFKKTWSLFLCFLWMMCVFIFFFFL 
L F VF FLCLGGII.iVSFIPPGG E KRE KKKR 
RGGEKKKKKKNWFKKKKKNPWGGGGGL 
KKQRARGGKKKKKKGGGGSGGGKKKQSP 
LRG 


4721 


18622 


A 


4752 


1 


407 


QEFEETYIPKDQKYSFLHDSQTSFCFSD 
SIPTPSNMEETQQKSNLELLRISLLLIE 
SWLEPVRFLRSMFANNLVYDTSDSDDYH 
LliKDLEEGIQTLMGVRVAPGVTNPGTPL 
ASRAGGEKYCCPLFSNKARDQEN 


4722 


18623 


A 


4753 


357 




PLFFFKRNLFFFFFFTKKKEKKAPVFFF 
FPPPSREWGGFFFRVGGVNTQFLHPFFF 
SQNTSVFFSTLFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFLiFYCFI YFSR 
DRGLTLL 


4723 


18624 


A 


4754 


1 


207 


LGLAGVEEEVASRLNAGGRGCS KPRSHD 
WTPAWAMEQDSMS KKTKNE YQRRQL I HL 
QSSTSGFLYVHEQ 


4724 


18625 


A 


4755 


2 


336 


HEERERERERERERERERERERERERES 
RAQFRARKNSQAPHPRVCETLWTEEGGS 
FYPLTEKDFAQCERAHFCVLGVSHPPGW 
GLCKKMN PECERATHS LHGDAD I SG VL 


4725 


18626 


A 


4756 


230 


446 


VGFSS LFSRYCT I LIS ASSLTKELCEKK 
RSFLFCFAFWLLGFLRQSLALSPRLEYN 
GAI SAHCNLHLPGS SD 


4726 


18627 


A 


4757 


3 


390 


QTSFCFSDSGPTPFNMEETQQKSNLELL 
R I S L.LL IE S WLEPVRFLRSM FANNLVYD 
TSDSDDYHLLKDLEEGIQTLMGRLEDGS 
RRTGQ I LKQTYSKFDTNSHNHDALLKNY 
GLLYCFRKDMDKVETFL 


4727 


18628 


A 


4758 


234 


387 


LAKIFLSFSSATGWIENRPQSPATGRTP 
VFVSPIPPPIiPPPFPFAFVT 


4728 


18629 


A 


4759 


396 


0 


I YTHTHTHTHTHTY I YMYVFMI I 


4729 


18630 


A 


4760 


176 


384 


MVIPLFNKDTQHIILFWGYVLIFFFLKR 
NFVFVAQAGGQGRYFGSLKPPPPRLKPF 
SCLSLLGTWDYRR 


4730 


18631 


A 


4761 


419 


208 


IYPPPPPWEPKGGSFLKKKKKKKKPGPG 
VPPFIPPPLGGQAGGFPNSKIQIPPAPQ 
GKPLFFQKNQNYLA 


4731 


18632 


A 


4762 


158 


2 


GNQLNPPPPSGSAFCFLFFLRRSLALVT 
QARVQWHDLGSLQPSPSGFKRFSC 


4732 


18633 


A 


4763 


489 


398 


AGFELLTSSDPPASASQSVGITGMSHCT 
QP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleueine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W— Tryptophan, Y = Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V— possible 
nucleotide insertion 


4733 


18634 


A 


4764 


420 


1 


FFFFFLGEIFPPGGKGGVFPPPPLKKFF 
FSPRGLFFLGGGGPNFPPPKKRVFFKKP 
RAGFFPPPGVLKRGPGPGFLTPPQKEKN 
FFFPPPGVLGPPKGFFKGGPPFFFFFFF 
FFFLGKKKKKKALFFTKKKGGGGTVRKA 


4734 


18635 


A 


4765 


167 




I YFFFFLKKI FFYKKKINKLKKKKKKKK 
KKKKKKKKKKKKKRAAARDLELAHAS 


4735 


18636 


A 








FQNIHKKNFPSPPLLKTGPAPLFKKPPK 
KKKKYFFPPPEKLGPPREFLKGPPPFFF 
FFFFFFFFFFF 


4736 


18637 


A 


4767 


254 


423 


VIKSWARRNKNKWDWPGAVALSCNPSTL 
GGPGQANPLRSGVQDHPGQHGETLSLLK 


4737 


18638 


A 


4768 


363 


o 


KTTTS QARG P TS V I PAAQE TEAGDS SN P 
SGYGSYSSSSSSSP 


4738 


18639 


A 


4769 


51 




YYTHTHPRTGFVKKKKKKKKKKKKKKKK 
KKKKKKRGGGF 


4739 


18640 


A 


4770 


188 


3 


ISP KAIiNPRREVGPI FPPPKKKVPPQNP 
PGGFFTPPQKEKNFTLPPPVNLGPPRDP 
LKRPP 


4740 


1 8641 


A 








KLDKEMALDRAEQAEADKKAAARGKQLE 
DELGSLQKKKKGGPF 


4741 


18642 


A 


4772 


363 


452 


RLECSGMIIXHCSLNLPGSSDSPTSASQ 


4742 


18643 


A 


4773 


121 


1 


RPRRPVAQAGVQWHNLSSLQPLPPGFKQ 
FFHLSLLSSWDY 


4743 


18644 


A 


4774 


31 


226 


MISSQLS I I PTPTQEQGLKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKGGGIjIjKKIIjG 
GAKFLGGEKKNF FFF 


4744 


18645 


A 


4775 


161 


1 


KPPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFLFFFFLKKSWRP 
WGLKSAPAL 


4745 


18646 


A 


4776 


394 


67 


SPPRGEGGVAPPPPPKNFFFPRAVFFLG 
GGGGPMPPPQKRGFFPKTPGGVFKPPPK 
GGKFFFPPPGEGGPPPEFFKGPPPPFFF 
FFFFFFFFFFFFFFFFCFFFFFFFF 


4746 


18647 


A 


4777 


2 


409 


GVCIFSGHKHNKAPLPFSVLYSIQYINS 
LEKKKKKKKKKKKKKKKKKKKKKKKKKK 


4747 


18648 


A 


4778 


456 


0 


FFYKKKKKKKKKKKKKKKKKKKKSS S 


4748 


18649 


A 


4779 


438 


0 


LLKIDKKKKKKKKKKKKKKKKKKRTSKR 
GG 


4749 


18650 


A 


4780 


58 


330 


SPFYQLKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKPQQKGGE KKKNPGE L 
GKKNCGGGGVLVGKPHKEKKRGGGKKKN 
FSG 














FFPPFPLKIFFFPSTPFFFWGGFPPFSP 
PPKKVFFPKFPPGFFFPPPLKKKFFFSP 
PPFFLPPPGFFFSPPPPFFFFFFFFFFF 
FFFFFFFSP 


4751 


18652 


A 


4782 


420 


109 


LFKKFFSPTEREGVFPPQPPKNFFFSPN 
PIiFFGGGGAQNFPPQKKNFFPKNPPGFF 
FSPPKKKKKFFSPPGKILAPPGYFFKGP 
PPFFFFFFFFFFFFFFFFFF 


4752 


18653 


A 


4783 


230 


2 


LGLWGALYSPFLGGLGGRKFLLPWRPR 
VPLTP IS PPPSRLGGP PKKKKSQAWWHA 
PWPTTRSAEAGGSLSPRRL - • • 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I^Isoleucine, 
K=Lysine, L=Leucine, M=IVlethionirte, 
N=Asparagine, P=Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4753 


18654 


A 


4784 


271 


440 


VLYSKRDNNKSFKNNTYTGIGPGAVAHA 
CGPSTLGGRGGQIMRSGVRDKPYRHGET 


4754 


18655 


A 


4785 


2 


266 


KKYG 1 1 1 FCS FFLS LFLKDQGLAPL PR L 
ECSWFAAHRGLGLLGWSDPPKSAFLVA 
G ITGMHHHAWLI FKVQFSNliMGPWSYTW 
SAVD 


4755 


18656 


A 


4786 


216 


435 


HRSGDSSLKKKKKKKKKKKKKKKKKKKK 
KKKKKIWEFWKGPFLWGGKKRKSGGG 


4756 


18657 


A 


4787 


249 


437 


MNNEWPLKTETSRPDWAHACNPSTLG 
GRGGQITRLGVQDQPCQHGEAPSliPENR 
KYLAWSV 


4757 


18658 


A 


4788 


107 


368 


AHGSLDFLPGGPKKSHQWFPKPKDGKPP 
PFSKKPLKKKTPGKKGGGHPTPGARSKK 
GGYPRGGDPGGPFFFKKPGQKKKTPFLL 
RRD 


4758 


18659 


A 


4789 


1 


225 


PTRPFITNNIFPASPFQTTIPLYLKLTA 
IAVTFLGIiLTTLDIiNYTjTNKIiKI KS PLC 
AKKKKKKKI KKKKKKKNEG - 


4759 


18660 


A 


4790 


2 


216 


SG RNS R VDGRQE AACRD YQ S S L E DLT FN 
SKPHINMLTILAEENLPFAKEIVSLIEA 
QTAKVFIHPAASYYF 


4760 


18661 


A 


4791 


389 


30 


EKKSPPQKKKKKFFPPPPQKKFFSPPNQ 
KTCGGGGPKKPPPKKKGSPQKNPKGVK.K 
PPPKKKKKIFPPPQKKGPPPKILKKPPP 
PFFFFFFFFFFFFFPFFFFYFSIiFFSGG 
FSYKIFli 


4761 


18662 


A 


4792 


164 


3 


LFPGFFNPGVKGEKGISFPPFFFFETES 
HSVIQARGQWRRLGSLHPLPPGFKR 


4762 


18663 


A 


4793 


550 


413 


AEMG FHHVGQAGLE LLiTSSDP P AIjAS Q S 
AGITGISHRAQPHRPIC 


4763 


18664 


A 


4794 


456 


352 


RSQDGLYLLTSSFACLGLPKCWDYRREP 
PCLAVFF 


4764 


18665 


A 


4795 


242 


3 


KPFSGGGGAFQTAFFITFGKKFWSRLG 
LLKKKVFLKFLPQAFFFFFFETESHSVA 
QAGVQWRSLGSMQPSPPGFKRLS 


4765 


18666 


A 


4796 


218 


346 


GFS CFVFETVS IiCR PG WS AWRSRLTAT 
SASRVQAI ILPQPPE 


4766 


18667 


A 


4797 


383 


80 


KFFFNQGIGGFFPPFPPKNFFFSIiKPFF 
FLGGFSPFFPPPKKIFFSKIPPGFFFSP 
PLKKKIFFFPPRLILAPPRFFFKAPPPF 
FFFFFFFFFFFFFFRPI 


4767 


18668 


A 


4798 


1 


126 


ILIKLSSTWEGIQAGKELEEQHGIHCNM 
TLLFS FAQAVACAE 


4768 


18669 


A 


4799 


81 


356 


ILFFFFFFPGGGTPKKSQVFFKNPQNPI 
GKPTRMGKGEKPPPPRGKKKKKGGPPTL 
GKNKNPVPGWPGAFLPPLEEKKKPFPGP 
PPLKKKK 


4769 


18670 


A 


4800 


1 


99 


GCSEPRSRHCTLSWATRAKLHLIKKKKK 
KKKGA 


4770 


18671 


A ■ 


4801 


238 


5 


NHQNSVQNYFKNLRIFQLIYSLSMNLPR 
SSFKLKSMTHSISFSFLFSFFEIQSHSV 
AQDRIKWHDLGSLQPQPPGLK 


4771 


18672 


A 


4802 


166 


62 


I FLFS FFLFLRDRVLLCYVGWS VMVQST 
QTPGLK 


4772 


18673 


A 


4803 


102 


6 


AASTTES P S VAHAGVQWCDLGS LKPGSR 
RFQ 


4773 


18674 


A 


4804 


17 


374 


FGSRAATLFFFFFFLGRGGFFLGPRGPK 
KKTPGGEGFTRFFDPGWGKGPEKKTPPP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 
otide 
location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylalanine, 
C— Glycine, H=Histidine, I=Isolcucinc, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \ = possible 
nucleotide insertion 














PRGTTPNFFGPPLFFSPPKTQGPGNFLD 
FSFKLKN 


4774 


18675 


A 


4805 


59 


175 


S FYFFLMMKKKKKKKKKKKKKKKKRGGA 


4775 


18676 


A 


4806 


406 


300 


SSGVLYSRAPTYFCGQTLTFRQVDIKTS 
HLPQEAE 














KSIAWLGTMAHSCNPSTLGGQDGQTTRS 
GVRDQPGKHGETP 


4777 


18678 


A 


4808 


319 


406 


LYKSYSSWPNWAHTCNPNTLGGQGGQI 


4778 


18679 


A 


4809 


236 


351 


GFWPAPGVHACNPS ILGGQGGRVTGPKL 
GNLLGQHGEI 


4779 


18680 


A 


4810 


173 


2 


EVLFLGFRSILISLLQMNYLLYLIAKFA 
FVAQAG VQ WHNLS SLLPPPPRF KR FS CL 


4780 


18681 


A 


481 1 


2 


94 


TRILKVWQKRKCSVKNGFLTISHGTVS I 
LF 


4781 


18682 


A 


4812 


358 


2 


FFFFFSETESCSVAQAGVQWRDLGSL 


4782 


18683 


A 


4813 


362 


3 


YSEVCIISLQQAHDSL.SVINFHLAECVI 
QLPVNLRELPGAVAHSPLGSELCFSDSG 
SVPSHTRSFPKAAIRLiTANHRRGCTESG 
WPFSLVFLFERCMWPGTVAHTCNPSTLG 
AQGGQIT 


4783 


18684 




4814 


341 


1 


WNQKRAHITKSILSQKNKAGGIMLPDFK 
LYYKATVTKTAWYWYQNRD IDQWNRTE P 
SEIMPHI YNYLI FDKPDKNKKV7GKDSLF 
NKWCWENWLATCRKLELDPFLTPYTKIN 


4784 


18685 


A 


4815 


2 


283 


VYTGEEILQKKEITIRDQEAPYLLRNLS 
DHTVAI S S S TTLD CHANGV P E P Q I TW F K 
NNHKIQQEPELYTSTSPSSSSSSPLSSS 
SSSSSSSSS 


4785 


18686 


A 


4816 


266 


3 


AASTLQAHRAHQLAI DT YQEFE ETYI PK 
DQKYSFLHDSQSLRQPLiHTHTGSVGTRT 
LAASLS SGRTAHTRTHMHTQHTR KRTAH 
TNA 


4786 


186S7 




4817 


365 


1 


SFFFFFSETESRSIGRLEFSGTISAHCN 
LRLPDSSNS PASA 


4787 


18688 




4818 


87 


372 


ASYKKDKYCMIPLIRVIKFPQTERXKWL 
SGLGGEGNKEbLFNGCNMAVFLVLGGGF 
ETEPPSVTQAGEQWCDIjSSLKPIjPHRIjE 
RVSCLSLPSGW ■ 












175 


FAFLTIPLSRLFDNAMLRARRLYQLAYD 
TYQEFEEAYILKEQQDSFLQNPQTSLCF 
S 


4789 


18690 




4820 


235 


351 


FFXFFFXFFXFFFFFFFFFXFFFXFXFF 
XFFXFFFFFXF 


4790 


18691 


A 


4821 


2 


155 


IFLLLIESWLEPVKFLKSMFANNLVYDT 
SDSDDYHLLKDLEEGIQTLMGRL 


4791 


18692 


A 


4822 


90 


2 


LFFFFFETESPSVTQAGVQWHDLGSLQP 
P 


4792 


18693 


A 


4823 


247 


3 


KVKGSPSHRRAMVAKGNLSLHERMKSPG 
ISNYMGNSKILFSYFKNCWPGTVAQAYN 
PSTLGRGGWITRSGVRDQPGQYSET 


4793 


18694 


A 


4824 


93 


407 


QHPKKKKKWGGRGPLKNKEKVFKHKTFN 
FFSRQKKKKKKKKKKKKKKKKKKKKGGG 
GLLKKFWGGPKFPGGKKKFFFFFGGGKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 
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NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino ncid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, P— Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleuctne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valinc, 
W=l ryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NPPGVFLKKNFFWGGEKKKKK 


4794 


18695 


A 


4826 




129 


TTAHCSLNLPGSSDPPTSRSLVAGTTHV 
CHNAWVNFVFCRRG 


4795 


18696 


A 


4827 


342 


140 


GGFFFFPPPPFFFKFPLFLPPPPPPGPS 
CVGGVQKFTFPPPPQLSRSLFFLFFFAP 
PPETDNP FFFS 


4796 


18697 


A 


4828 




259 


GGGCSEPRSCHCTPAWVTEYDKKKKKKG 
RGKGKKKIGKGGFQRKFKGPNPHGRFLR 
ESHNSPKKGFPKKFFGNPPPGTSKKVPL 
PH 


4797 


18698 


A 


4830 


125 


1 


KGTRTFETESRSVAQAGVQWHDLGSLQP 
LPPGFRRLTCLSL 


4798 


18699 


A 


4831 


1 


323 


ARGERERE'REREREREREREREREPPPL 
SRGGSESYLKSAWREPJLLLERDPLFFFG 
GKMGNLFPPPPPPVGRGYTGGGLITGGP 
FWWGECVSHTHHTHTHTKTYMSV 


4799 


18700 


A 


4832 


3 


297 


TRRERERERERERERERTL 


4800 


18701 


A 


4833 


1 


150 


AREREREREREREREREREREREKKKKN 
HSRGGGPGPQFPLGGGVKKFGGGV 


4801 


18702 


A 


4834 


1 


98 


ARGERERERERERERERERERERERERE 
RERGAGGGPT 


4802 


18703 


A 


4835 


2 


66 


HEERERERERERERERERERAAR 


4803 


18704 


A 


4836 


2 


73 


PEERERERERERERERERERETAR 


4804 


18705 


A 


4837 


146 


2 


FGLVYS I S Q FCLFFVRRS LALVTQVGVQ 
WRNLGSLQALPPGFTAFSCL 


4805 


18706 


A 


4838 


354 


0 


PPPPPPSPPPPPPPPPPSSPPPPSPPSP 
SSSPP 


4806 


18707 


A 


4839 


62 


358 


GLVHDKHSPHGGCGSHHVLPSLiNPSFPT 
FSTKSPSVPPARAL.WPPDSRWAL.QVGHL 
GLTPTTRQESAQAPRRKLALLEGSFQGQ 
AQWLTPVI PALWEAE 


4807 


18708 


A 


4840 


121 


13 


RPGRPPPEFTPFSCLSLPGSWDCRRPPP 
RPANLLYF 


4S0S 


18709 


A 


4841 




221 


LVMGFVGFSSKPSPIYGGLALIVSGWG " 
CVI I LNFGGG YMGLI VFL I YLGGMM VG F 
GCILDFSLNFKVGLNF 


4809 


18710 


A 


4842 


3 


84 


CHCTPAWTANLDS I KKKKKKRKKFKN 


4810 


18711 


A 


4843 


33 


230 


LiTVT I S KAAE VLKHKKKKKKKKKKKKKP 
KKKTPKKNPNPPPQKGELGKKIFFFFWE 
RFLPQKKRP 


4811 


18712 


A 


4844 


3 


368 


HELNPRGRSCGE PRPCHCTPAWATRAKV 
HITKKKKKKKGFNLFKPGGSLSNFFPKK 
KETPWSPQMFEKKGPRRFSPRGKNPPLA 
LRGKKKGGKSLSGKIFPPFPRGKTPPWA 
FMGTQKGEYS 


4812 


18713 


A 


4845 


163 


2 


HHGHSISKFLLTVSSPKSFNQLRPDAVA " 
HACNAS TLGGRGGWI TRSGVQDQLV 


4813 


18714 


A 


4846 


387 


234 


KGCGEPRSCHCPPAMATIAKLLFKKKKT 
KPKKNPKKQEKTFPYNL.NRVCY 


4814 


18715 


A 


4847 


189 


349 


KFSRFSGQ KALLLKD I LRLGTAAHTCKP 
STLGGQGRWIMRCQVFKTSLAKMVK 


4815 


18716 


A 


4848 


3 


95 


HENFYCPGSALGRNEGNIFPNPEATFVK 
EM 


4816 


18717 


A 


4849 


2 


325 


DSHTI I MGD FNT PIiAI LDR S TRQ KVNE D 
IQDLNSALHQADLIDIYKILHPKSTEYT 
FFSAPHRTYSKIDHKIYSQGNHETDCRY 
GKKGGGWKDERFPDVDCEKFKSQ 



843 
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SEQID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
cth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 

amino acid 
residue of 

sequence 


Predict- 

nucle- 

location 

ding to 
last amino 

peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 

X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 


4817 


18718 


A 


4850 


241 


348 


SFFSQTQASEQLiTKSKQGLiALiRTLPPVQ 
FPPPPPRP 


4818 


18719 


A 


4851 


2 


176 


L.VETG FCHVGQAGLELLTSGECWDYRCE 
PPSPAYCFLHTIVTIYPLLCLAFFSYLiC 


4819 


18720 


A 


4852 


2 


191 


VGRVGLELLASSDLPTSGSPSAGITGMS 
HCPWPNFLLIFIGYFFTVLTPCEYYSGG 
MTVPLWL 














ALKKKPWGGQKKTGEKKKKFFSKRGGKK 
KRG P GR VLKKG E RE K I F F WN FLKKNF F W 
GGGNLGQPPPKKLRALGKK 


4821 


18722 


A 


4854 


278 


363 


IWPDWAHTCNPSTLGGQGRWTMRSGVR 














RGCSEPRLRYCTPAWATEQDSASKKKKK 
KKTQI FWEGPGGGVSPFKPHFFKGPGGK 

SPGGGGPP ' 


4823 


18724 


A 


4857 


382 


224 


PVIQLIGGVRQEKNFNLGVKGFNEPKSP 


4824 


18725 


A 


4858 


392 


102 


FFFFF FLRQGLS WQARVQWCFCSNLGL 
LGSSNPPTSASLS CFLNKKEYRWENVTN 
KV 


4825 


18726 


A 


4859 


3 


111 


FHRISQEGLILIiTbGSAHLGLPECWDYR 
REP PR PA 


4826 


18727 


A 


4860 


282 


83 


AQ I YKVQNWGS SGNKCICVCVC VCVCVC 
VCI YI YMLCMSRFKYKCIVGCMNIDSYV 
VYIPNIQCYI 














nhsnlggrgcsdprsrhctpawvterds 
ISKKKKKKKKKIFFGLiREKKIjTIjFPFLG 
KKNENSGFPLNPNFFFGGKGKKATLGPL 
GLVLRGVPFFRGGETQNGENQKPGTTLE 
KFEKHFWVGPPPLEKISPHKGKQTQK 














K ■ ■ ' ' 














VSLCCAGWRAAVRSQITAALI SRAQAI L 
PPQP 


4830 


18731 


A 


4864 


37 


444 


DPRVRSEKKIPFKILLLIDNAPSHPKAL 

STFKSYYLRNTFHKALAAMDSDVSDGSG 
QSKLKTFWKGFTILDAIKNIRDSWEEVK 
LSTLTGVWKKLIPTLIDDYEGFKT 


4831 


18732 


A 


4865 


341 


210 


VYMCHVGLLYHQPVIYIRNRIEASEITP 
HIYNYLIFDKPSHQF 


4832 


18733 


. A 


4866 


206 


412 


QLCLSQACTLARGNAKGFLKYIHRNNVS- 
MPSVAGHTWGPDQIiVKGQGETCPATIKF 
SDSPSLKYTMYPM 


4833 


18734 


A 


4867 


248 


78 


SHFEAGLAAQLTVS WF PQGTP VF VHAG P 
FANI AHGNS S I I ADRI ALKLVGPEGFVG 


4834 


18735 


A 


4868 


637 


2 


EFDGRWPRQIVSS I GLiCR YGGR I DCCWG 
WARQSWGQCQPVCQPRCKHGECIGPNKC 
KCHPGYAGKTCNQDEHIPAPLDQGSEQP 
LFQPLDHQATSLPSRDLNECGLiKPRPCK 
HRCMNTYGSYKCYCLNGYMLMPDGSCSS 
AIiTCSIYLANCQYGCDVVKGQIRCQCPSPG 
LQIAPDGRTCVDVDECATGRASCPRFRQ 
CVNTFGSYICKCHKG 


4835 


18736 


A 


4869 


276 


426 


VS FFL F FF F WGANP P F VPQ AG GQGGFLG 
SLNPLPPGI.KHFFCLTPPSSGN 
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M 
cth 
od 


SEQ ID 

NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4836 


!8737 


A 


4870 


100 


485 


EERNREMTERCSVTQAGVQWHDLK3SLQP 
SPPGFKRFSCLSLPSSWDYRLECDGMNL 
AHGKLRL PGS S ES PASAS RVAGI TGMCH 
HAPIiARLVSNTLPQVILPPLGLQQAEEL 
PRRQEVEHNTPSLVPI 


4837 


18738 


A 


4871 


718 


408 


EVLTLQLAHFPGFLGAHWWNQQDAALGR 
ATDS KE PPEELCPDVLYRTARTLHGQET 
YT PRL I LMDLH YDVS LGAL VLTAAGYRA 
SS CSQS VAVCTGSVLEARA 


4838 


1S739 


A 


4872 


392 


246 


RGGCSEliRSCHCTPAWATRVKL-HLKKKE 
KRKCKIRPEGNEIIiVNNAKG 


4839 


18740 


A 


4873 


1 


156 


GGGGYSEPRSCHSTPAWTTRAKLRLLKK 
KKKKKGGVCFGVGFL.GPRPKGGFF 


4840 


18741 


A 


4874 


246 


2 


AHG I S I YASMLAMS I FSRRNVS FRRAGQ 
VKAVFPVSWFGPLHRHHLGEHFLFFLR 
QS F TIi VAQAG VQ WCH LGS PQ P L PPA 


4841 


18742 


A 


4875 


72 


2 


CFPAAPDEDSTTN I TKKQKWTVE 


4842 


18743 


A 


4876 


375 


221 


GRLRQENRLNPAGGGCGEPRSHYCTPAW 
VTEQDSISNICIYINIFSWHGQ 


4843 


18744 


A 


4877 


151 


2 


RAP F F F F FF F F VAQ AGVQWR VLG S LQ AP 
PPGFTPFSCLSLPSSWDYRRP 


4844 


18745 


A 


4878 


401 


70 


PHRREGGCVPPPPPKNFFFSPGGYFFWG 
GGGQNPPPPKGGFFPKTPPGFFFSPPQK 
KKIFFFPPPERGPPPGFFLRPPPPFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFF 


4845 


18746 


A 


4879 


4 


152 


LPYSTPSTPFRAKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKRGGNFK 
KR 


4846 


18747 


A 


4880 


267 


1 


TPPKKKKKKI I PPPKNGPPPHI FKKTPP 
PLFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFRTRGRTRGRTRGRT 
RGR 


4847 


18748 


A 


4881 


2 


408 


LQDATSPI IEELITFHDHALII IFLICF 
LVLYALFLTLTTKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKGGGGPKKKNRGG 
PKKLGGEKKIFFFFKGGKKKTRGGVGKK 
KIFLGGGNFGPPPPKKKKPRGKI 


4848 


18749 


A 


4882 


413 


69 


L.PPPPPGFKHFWAPPPPGRGVPGPPPPP 
RVNFVFLGKKGVSPFGPGGPRPLFPNPG 
GVGAGDPLDPRGQGFHGPKFMPCPPARG 
TKREPPFPKKKKKEKEEKKEPERKEKKK 
EI 


4849 


18750 


A 


48S3 


46 


394 


KQYQVSVKLLFVTQNLGTKKKKKKKKKK 
KKKKKGGALKKKPGGGQKKPGGKKKNFF 
LKRGGKKKPPGIFEKKTLFGGGENWGTP 
PQKKKPPGKKKKIL.KGEGGKKTLYFGRG 
KNFS 


4850 


18751 


A 


4884 


399 


3 


FFFKKFFSPNEFWFFFPPFPLKIFFFPP 
RLFFFWGGLiAQFSPPPKKVF FSKFPRGF 
FFPPPLGKNFFFFPPGYFWPPPGFFLRA 
P PL FFFFFFFFFF FFFFFSVS I RNAIAI 
RMGTMRSRRLAMGMLLRRG 


4851 


18752 


A 


4885 


391 


47 


PHFFWGGVFPPPPPKKKKFFFLPQKGGG 
GGAPKKKKKKKKFLPPNFSGGFFFFSPP 
GGPPRFFFFIFFFFFFFFFFFFFFFFFF 
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eth 
od 
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NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 

ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 

Q=Glutam1ne, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 

nucleotide insertion 














FFFFFFFFFFFLSKMFTFIIiKYTLKQNE 
FY 


4852 


18753 


A 


4886 


32 


379 


bLiPVKKKKEKKGEKICKKKRKKKKKKKKK 
KKQKKKGGGPSIKI IGGGHLSSGGRKNN 
SFSHRGGLKKPLFFLGRTYFLGGGGLWP 
HPPPKKI AGGAKNX I L»RGWGEQKPL*LFP 
PGEK 


4853 


18754 


A 


4887 


178 


33 


IFKRKWFFFRFFFFFFLYYFFFKFFFF 
FFFFFFFFFFFNICHHPKLY 


4854 


18755 


A 


4888 


238 


3 


KPFNPFKKKKIGPQKKKKKFPPFLKINP 
PKYKTQNINIYKYIKIFFFFFFVRGSDS 
VAEAEVQWHHLGSLKPASPGFK 


4855 


18756 


A 


48S9 


93 


1 


DGILLLSPRLKCSGTISAHCNLCPPGSS 


4856 


18757 


A 


4890 


6 


219 


I E KAHKALGTVPGS T FVR YHHYHHHNHY 
HHHCHHHHPHQDNSEWEKAQSLEPDKLF 
LNPGFATSQLRGL.R 














TSSRSRAGRPMDFGFFFKKKGHREPPGG 
VEKKKKNWGSRKTPPSGVKNFPGFDPPK 
MWNKGGP PTPPVNFFG FKKKRG FHKGLK 
PPPPEIAPLNPQKGGEKKGGPPPPPTPF 
LGGKFQGKR 


4858 


18759 


A 


4892 


387 


81 


GVFSPPRGGPPPGGKTPPFFPRGGPPCG 
PGGRGPIFPPLSHKKFFFSPPPPL.LGGG 
GGPKSAPPPKGVFKEFPENFFWGAFIKK 
SFFLKKGGFFGPPRGFF 














KTPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 

SL 


4860 


18761 


A 


4894 


384 


10 


PLPPFIPPPPPPPPliFGGGVCPYFSPPP 
HLGSPSPPPGGRPSPPPPPPRSTPPPPP 

FFFPPPFFFFFFFFFFFFLPFFFLKKSD 
FIFSLQMKYFMLI 


4861 


18762 


A 


4895 


240 • 


355 


ul ITPALFFKI FVLWLGAVAHACNPSTL 
GGQGGQ I MRS 


4862 


18763 


A 


4896 


339 


3 


GVGWHLFFPPQKKFFFFFQVTGASCAAK 
SS KESHKGLHXiNIYLPKIFFYRPPPPFF 

FFFL WREG YNQHFRGMGP I GR VGGRV 


4863 


18764 


A 


4897 


56 


335 


TCFLYKLKKKKKKKKKKKKKKKKKKKKK 
KKRGGPIjKKNXjGGPKKNRGGKKKX FFFK 
GGEKKNLWGFFEKKPFFGGGKICPTPPQ 
KKKPPGEKK 


4864 


18765 


A 


4898 


7 


206 


LCLQ PDNR WNDDQS KS FL I EKKKKKKKK 
KKKKKKKKKKKKKGGGPLKKNPGGAPNN 
PGGEKKNSPP 


4865 






4899 


74 


250 


I N I CS EKL P KKKKKKKKKKKKKKKKKKK 
KKKGKKIKRGLNSQTSLLKKREKKKKRG 
KI 


4866 


18767 


A 


4900 


151 


312 


KYHFHKNYFFSLLYFSQNVSQLS PDGPL 
PQLPLPYINS SATRVFFGHDRRPADG 


4867 


18768 


A 


4901 


1 


364 


IiNLGGRGCSEPRLRYCTPAWATEQDSAS 
QKKKKKKNPNF 


4868 


18769 


A 


4902 


1 


96 


GGGGCGELKSCHCTPAWVAEQDSVSKKK 
KKRG 


4869 


18770 | A 


4903 


3 


225 


GFHRVGQDGFDLPTSWTARLGLPRGWDW 
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NO: of 
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sequence 


SEQ ID 
NO: of 
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od 
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NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystetne, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalnnine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,7=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RREPPRPAVWAGLQLLTSSDPPASDSQR 
AGITHLSHLTLFCHLRF 


4870 


18771 


A 


4904 


384 


194 


HWVGRPFFNLWGQKNPPPPPPKRLEYQG 
WGPPPPPPFFFFKKTPIFSQVFNPPPFF 
FFFFFFG 


4871 


18772 


A 


4905 


3 


387 


TI I TP I LiLTLFL I TQL.KI LNTNYHLPLP 
AQYPLPQKKKKKKKKKKKKK 


4872 


18773 


A 


4906 


1 


327 


PTRPIEPIPENPKFSVPPITPHPKKKKK 
KKKKKKKXKKKKKKKKKKKKKKGGALKK 
KTWGG PKKRGGG KK I FFFLRGG KKKPRG 
EFLKKTS FWGGENLiATT P P KKKKARKKK 
KNF 


4873 


18774 


A 


4907 


134 


2 


ADYSLKVKWPGAVAHACNPNTIiGGQGEW 
IRRSGVRDQPGQHSET 


4874 


18775 


A 


4908 


59 


449 


ATRHYTI MLAYTSVG I P I T AWLGKQ A I 
KGARFQVRRTTENYDKQQLIHKGGGRPP 
GCHSTHTVLLPPVTWVKTTGNSTVANFV 
lilLVCLSFLLLVYRCIQQLQRDSNQREG 
AMMMVWLS KRKGG YAG K 


4875 


18776 


A 


4909 


337 


3 


LKT AW ATRGN P P L Y KKNTQ I S G ARG EG P 
PIPLIGGGLSQKNFFTPGGENSINPDPP 
PSPPPGAKKETPPPGKKKKKKPTTKELG 
KDWTLELWDSDSQRAGKLAEVPGESSP 


4876 


18777 


A 


4910 


14 


162 


AE SGAKRETAF P FAPQQGATDKRLN 1 1 Y 
AAKKHMKTCIiPS LAIREMQ I K 


4S77 


18778 


A 


4911 


331 


2 


LPLLAPKEGQGKILGYKKPPPPGL.PPFF 
GSTPPRNGEKGGPPPPRGNFWVFKKNGG 
FPRLGGRGPFPGPQKPPPRPPQKGGKQG 
GNPAAGPFFFFFETEFHSCCPGWSA 


4878 


18779 


A 


4912 


116 


316 


ACAHAHTLF FFF FLKKKPP F VP LLGGKG 
GNLGYLKLWLPGLKKVCLTPPRTGNYGG 
AP PHR 1 1 FCFL 


4879 


18780 


A 


4913 


93 


16 


SPPGWPGTVAHACHPSTLGGRGGRIT 


4880 


18781 


A 


4914 


316 


2 


KPRPGNPLRARVFNPPGPPGETPFFPKN 
PKFTRGGRGGPIYNPFPGRVRPKNFLYP 
RGSRFHWPNSIPCPFAWGTKPNFFSKKK 
KKKRKKKEKLCFSASVPDAQ 


4881 


18782 


A 


4915 


3 


289 


TSCNPSTLGGRGGRITRVGVRGQPDQHG 
ETPSLLKNYIYIYICVDVYLYVCMYVGA 
YIYICICGYIYIWRYLSLTSEHTHTRAP 
GQYRVIYFFCG 


4882 


18783 


A 


4916 


1 


222 


ARGERERERERERERERESRHQGGGAGP 
PTHLWGEEGWLS S PQTGVRE KNAP I WG 
CAAP VLAPR VGRTP FG VD 


4883 


18784 


A 


4918 


1 


281 


ARGERERERERERERERERERERERERE 
RERERERERGAP P P PPRGGVRERGTP CP 
SARGARALSPTISCGDPHRGPKIYRGVG 
VHTQRYFSVGSSLCINTHT 


4884 


18785 


A 


4919 


1 


259 


ARGERERERERERERERERERVFVEKGD 
MVTLPAGIYHRFGGGEKKYTKAVRVCVR 
KPGWRAHNRPADH FGARG PHVKFLAQT V 
YE 


4885 


18786 


A 


4920 


797 


910 


NTMQPGVVTHTCNPSTLGGQGGRITGSG 
VQDQPGQHGE 


4886 


18787 


A 


4921 


2 


67 


LEERERERERERERERERERERSS 


4887 


18788 


A 


4922 


1 


698 


TLLLAELGTICDPYRSCSISEDSGLSTA 
FTIAHELGHVFNMPHDDNNKXKKRGGPF 
KGPNFSPPVRASKNFFFGPPNLNSWARF 
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SEQ ID 
NO: of 
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sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 
beginning 
nucleotide 
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correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 
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correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=JL.ysine, L=Leucitie, M=Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W = Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-possible 














LTPGEGKTPGVPPFNPFATKAEL 


48S8 


18789 


A 


4923 


323 


2 


LVCRGTIHKFRCVPHLTGRRFEHGVTDC 

QNLYLDNLEATGLYQVPLSAAQPGDVLL 
CCFGSSVPNHAAIYCGDGELLHH 


4889 


18790 . 


A 


4924 


3 


225 


HEEALHLFQTLMECMKRKELITVFHIGS 
DEHQD IDVAI LTALLKGKPLRTFLF VRP 
F I LCM YMCVYVC I CVCX 


4890 


18791 


A 


4925 


371 


169 


HTQSISVVLVERRFHHSGQENPLNPGGR 
GCGELRLCHCTPAWATRAKSRLKKTKQT 
KKDAI KCSLCN 


4891 


18792 


A 


4926 


2 


352 


ARAARAGRIIKELFFFFFFGNLKKKKGL 
FWWPGGVKNPDLGNPPPWPPKGGGITGL 
TPRPGPMGVIFKKFGKKGSPSPTGPKSL 
GPREFFGLAIjQRGGDLiGLNHRGPPFFFG 
VLETM 




18793 




4927 


2 


379 


ARANMEETQQKSNLELVRISLLLIKTGL 
EPVRILRSMFTNNLVYPTSDSDDYHLLK 
DLQEG I QTLMGRLENGRRRTGQ I LTQTY 
SKFDTNSDNHDALLKNYRLLYCFTKDMD 


4893 


18794 


A 


4928 


175 


31 


ILGDLFPPAWLHPFLFLLLPFSRPSLAV 


4894 


18795 


A 


4929 


368 


247 


VDRLFPCCPGWSPSLELNQSACLSFLKC 
WDYRCEPLCSVS 


4895 


18796 


A 


4930 


2 


110 


ARGEPRSHHCTPAWAMSETVSKKKKSGG 
LFFLPRLiV 


4896 


18797 


A 


4931 


150 


I 


KYVAPCRPLFLVLSPCRRSCFPFPFCHD 
CKFPES FSEATMFFLQPAE PRA . 


4897 


18798 


A 


4932 


2 


155 


ARDDLNPGGGCSELRSRHCTPAWATERD 
SIS KKKKNPQN I WGNLKKKWNLK 














CLCVCV 


4899 


18800 


A 


4934 


2 


153 


ARESFVEKGFRHVGQVGLKLLASGNPPN 
LATQSAGI ISISHCTQPWRQGL 


4900 


18801 


A 


4935 


1 


355 


GTSQEFEGRKDRLASPLETGNAGAGRER 
LAEVSTCPSGASKPLQTPRPGGQEGTPG 
LGGRRMRHSAGPSPNPAWESIRDPSLTS 
TALLVACSIFHIHEDPILHDVLISRCMS 


4901 


18802 


A 


4936 


112 


358 


NIIKFSKAFFFSFYIFILINSVGLASTF 
MWGGEIiKFSFTSGILIIFQPFKIKGHPQ 
RGILRTGTFKEDPDDPEGNKVSSLLQ 














G PS P Y'V ruQr FSGWGRQGJjTLS PKLEHS 
ELRLCHCTPS WVTERDS IS KKKKKKLFY 

KK 














FFFFFSETEPYSVAQAGVQWC 


4904 


18805 


A 


4940 


255 


I 


IjGNNIiAGYKIRGLjYSISLRISfNRHPSPVI 
S LCGNLFYVIR F FDMES CS VTQAGVLWC 
NIGSSOALTPGIAPFSFISLPSTRQNSR 
P 


4905 


18806 


A 


4941 


10 


173 


KYIEYDTNKWKHNSCSQIGRINFVKMSL 
PPKAIYTFSVFPIKIPSVYIiPVHLSI 


4906 


18807 


A 


4942 


144 


3 


DKC I RRPGTVTHT CNS STLGDQGGRVMR 
SGVRDQQGQHSETPSVRV 


4907 


18808 


A 


4944 


484 


274 


PSSQRSPRTARMIALSTRPTTRSMKFCH 
VAQAGLKLIX3S SDLPASAS QNVG I SG VS 
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nucleotide 

location 

correspond 

ing to first 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalaninc, 
G-Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HHTWPRFYYLS 


4908 


18809 


A 


4945 


103 


412 


KKKKKKKKKKKKKKKKKKKKGGG P LKKN 
PGGAKKKRGRKKKIFFFKRGEKKNPGGI 
FEKKPFFGGGKNWAPPPKKNKPFGEKKK 
FLRGKGGKNPPI PPGKKKL 


4909 


18810 ' 


A 


4946 


309 


81 


KFFFFLKGLIFLGGFCPIFPPPKKIFFF 
KIPPGVFFSPPFKKKNFFFPPPLIFGPP 
RVFFKGPPPFFFFFFFFFF 


4910 


18811 


A 


4947 


18 


264 


NYTQRFKVEYL.NNI INQLDLTDI YRILY 
PTVEYTFFKNPHGTLCRTDHMLGHNTGL 
NKFKRTECTQSMSSDHNGIKYHSAP 


4911 


18812 


A 


4948 


418 


293 


QLLRRVRQENCFNPGGGGCSEPRSPPCP 
PTWVTKQVFFSKP 


4912 


18813 


A 


4949 


3 


252 


DSAKHLGLKHVVYCGLENVKRLTDGKLE 
VPHFDS KGEVEEYFWS IGIPMTS VRVAA 
YFENFLAAWRPVKASDGDYYTLGKTK 


4913 


18814 


A 


4950 


434 


270 


KRGFPPKTPPGFKKPPPKKKKKIFPPPP 
KIGPPQKILKKPPPPFFFFFFFFFFF 


4914 


18815 


A 


4951 


399 


1 


VGFSLFPPPQKKGFSPKPPRGFLFPPPK 
GKKKIFPPPGKIGPPQGFFKRPPPLFFF 
FFFFFFFFFFFFFFGPPSVTLLFLVTIL 
LQIQTTALGRKSWGKGFRIPKPVCPVPA 
QQTKQGAKHLLGQWE INSCA 


4915 


18816 


A 


4952 


235 


56 


FTWPPPKKKKKNPPGGGGPVPPPWGGK 
NKKI FLP PKFKG P FS FGAPLP F P PGQKK 
KPL 


4916 


18817 


A 


4953 


13 


194 


SILISLQISL.il TFTATELI I F Y I FFET 
TL I PTLAI I TKKKKKKKKKKKKKKKKKG 
GGAF 


4917 


18818 


A 


4954 


177 


1 


PPQKKKKNIFSPPGKIGPPQGFFKRGPP 
LFFFFFFFFFFFFFFFFRFEGECSDLDG 
RV 


4918 


18819 


A 


4955 


450 


0 


LPNKADKKKKKKKKKKKKKKKKKKAR 


4919 


18820 


A 


4956 


29 


158 


AMIVPLYSSLGDFFKKKEKKKKKKKKKK 
KKKKK KKKKKGGGG F 


4920 


18821 


A 


4957 


397 


0 


PSSPPPPSPPPSPPPPPPPPPPPPPPPS 
SPPPAPPPPPPPSPPP 


4921 


18822 


A 


4958 


257 


397 


FFFLSFTFVAQAGXQGXXYGSLDPPPPG 
FNLFSCPSLPRIWNYTAPP 


4922 


18823 


A 


4959 


419 


266 


RFHHDGQDGLELLTSSDLPASASQSAGI 
IGVSHRAQS I KSLAPQYWYFPS F 


4923 


18824 


A 


4960 


300 


190 


FFFFFFFFFIVAFFFFFLFCFLFFFFFF 
FFXIFNNIF 


4924 


18825 


A 


4961 


3 


179 


DAWVLPFI IAALAALHLLFLHETGSNNP 
LGITSHSDKITFHPYYTIKDALGLLLFL 
LS 


4925 


18826 


A 


4962 


2 


182 


RVNAKDS KWLTPLHRAVAS CS EEAVQVL 
LKHSADVNARDKMKQTPLHIAAANKAVK 
CAKL 


4926 


18827 


A 


4963 


273 


440 


KIHIQHCWWEYKLLQSLWKAAWHFLKEL 
KAELPFNPAIPLLDIYPEEYKSVYLKDT 


4927 


18828 


A 


4964 


179 


3 


SLPFCCNLTPKKKLFFKFCRKNGFFFFK 
MESHSVTQAGVQWCNLNSLQPCPHGLKQ 
FS 


4928 


18829 


A 


4965 


268 


364 


TVCVIELLTSGDRPASASQSARITGLSH 
HAQP 


4929 


18830 


A 


4966 


374 


2 


QN FFLKKKKKKRVGPG VP P FNP PPFWGQ 
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Predict- 
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nucle- 
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ding to 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
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N=Asparagine, P=Proline, 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GGGFPRSILGNPIKPLFFLNSPKISPPL 
WPGPFFPLPWGVWAGKNPLPRGGGFPLT 
KFPPLPSSLGKKKKFRFKKKKKKKGQAR 
WLTPAI PTPREA 


4930 


18831 


A 


4967 


79 


346 


DKI FSNI SCGI VI DL.IAI YWWDAYATRL 
GIYKHWDFIIIDKDTSSCRLSFSSYPRF 
LESLDDFYILSSGLILLQTTNSVFNKTL 


4931 


18832 


A 


4968 


3 


93 


FLHVGQAGLGLPTSGDPPASASQSVGLQ 


4932 


18833 


A 


4969 


158 


1 


SKSNFAYIFPIKDNGLTGKKTINRVVTK 


4933 


18834 


A 


4970 


2 


266 


VGQAGVELLSSSDPPASAYEGAGIPGVS 
HHTWPKHFFPALFVACISSLVNCLFQLF 
ACISIGLSFFFFFFREGDLGNLLKIAGL 


4934 


18835 


A 


4971 


3 


96 


GVRHHAHLIFVFXiVETGFHHVGQDGLNF 
L 


4935 


18836 


A 


4972 


253 


348 


NFI FL.L.F I YFEMESHS VAEAGL.QWHCLS 


4936 


18837 


A 


4973 


326 


2 


PMEIPQHKLSLWPGFAISVSYFERKLLF 
SADVSYKVLRNETVLEFMTALCQRTGIjS 
CFTQTCEKQL IGL I VLTRYNNRTYS I DD 
IDWS VKPTHTFLKRDGTE I TYVDY 














DAS LVFKVAETANEEEVKKMCMYKYPGN 
KCGREGGREKKREEERREGRKEGREGLR 
EGETDEEREEGEEGLSYSPFKNSI 


4938 


18839 


A 


4975 


6 


79 


THFSLTITSLQPEDIGTYYCQQYD 


4939 


18840 


A 


4976 


15 


339 


PGEAGNCLNPGGGSCSEPRSRHCTPAWA 
TERNSVSKKKKKKKKKKKIPRGRGLPPV 
SHPFW KAGGADWFDLGTLE P PWPTGET P 
VFKKKKKLTGGGGAPLVAPTWGGG 










204 


275 


I IEVDPDTKEMLKLL.DFGSLSNLQ 


4941 


18842 


A 


4978 


3 


352 


RRPS PHGLVGAVSVGGAGVMAVETLSPD 
WEFDRVDDGSQKIHAEVQLKWYGKFIjEE 

WRRL PYEQSSVLEL I KTENKVL.NRWTG 


4942 


18843 


A 


4979 


219 


3 


AAS TMAMS FEWPWQYRFPP F FTL QPNVD 
KSRLTSLSITVMLDC 


4943 


18844 


A 


4980 


142 


3 


FQGFFFLRDKVLLCCPGWSRTPELKRSS 
CLGLPKCWDYRRELPRLA 


4944 


18845 


A 


4981 


351 


2 


FFFFFSETBSRSVAQAGVQWRDLD 


4945 


18846 




4982 


133 


3 


SFFFFLFETECCSVPQAGVQWCDLHPLQ 
PLPPGFRRFSCLSLP 


4946 


18847 


A 


4983 


3 


161 


LGSLHDTANTLWPRLERSRT I MAHYSLD 
LPGSSDPPrSASHVRGTTGMRSTRP 


4947 


18848 


A 


4984 


363 


3 


GPSNKKRGRSKKAHVLAASVEQATENFL 
EKGDKIAKESQFLKEELVAAVEDVRKQG 
DLMKAAAGE FADDP CS S VKRGNMVRAAR 
ALLSAVTRLLILADMADVYKLLVQLiKW 
EDGILKL 


4948 


18849 


A 


4985 


101 


1 


LQVCYGRRWMNSREYGAWKQQVESKNM 
PFQDA 


4949 


18850 


A 


4986 


341 


1 


PPPMSPPDVLKTSPVADAAGVfVDVDKET 
L.QHQRYPNVFG I GDCTNLHTS KTAAAVA 
AQSGILDRPISVIMKNQTPSKKYDGYTS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
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Predicted 
beginning 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
Inst amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla!anine, 
G=Glycine, H=Histidine, I— Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CPLVTGYNRVILAEFDYKAEPLETFPFD 
Q 


4950 


18851 


A 


4987 


86 


1 


KCSGTI SAHCNVRLLGSSNSGPSASRAI 


4951 


18852 


A 


4988 


124 


3 


LPWLCSKFETKTFQFQIYYKAKGVKTVW 
YSHKDRNIDQWN 


4952 


18853 


A 


4989 


305 


150 


WL.LNYRYSHSAEVQVCFQFLTWVFSTLG 
SPDHFS KFFNOYFFKI CFDYWKN 


4953 


18854 


A 


4990 


64 


340 


KKKKKKNLL.KFPGQREKPLS PWFLGGLG 
GGKNGPPGGKPSRDPDWPPGIKPGEKKK 
NPLKKKKKKKKKPLSKGGGGKKTPLKKK 
KKPPPKGF 


4954 


18855 


A 


4991 


3 


158 


PSLVQTRLRHAGQAGLKLLTSTDSPASA 
SQSAGTTGVSHHAMPKDCSLNSN 


4955 


18856 


A 


4992 


116 


66 


FFFFFFFFFFFFFFFXE 


4956 


18857 


A 


4993 


3 


107 


GQAQWLTPVIAALWEAKVGGS FEVRSLR 
PAWPTQ 


4957 


18858 


A 


4994 


1 


105 


KL.DRLARHGL.YEKKKTSRKQRKERKNRM 
KKVRGTW 


4958 


18859 


A 


4995 


110 


3 


RRIGRYKTVFLCTQLEILMARYPLPPSP 
KPIKIKN 


4959 


18860 


A 


4996 


109 


3 


DEVSILCPRLECNGTILAHCNRRLPGSS 
DSPALAP 


4960 


18861 


A 


4997 


1 


148 


ACCPFCTIYLLPMFMIIKAPLMGTSKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
K 


4961 


18862 


A 


4998 


3 


110 


V I LQG S ND VEL VAEGN S R FT YT VL VDGC 
SKKTSTRP 


4962 


18863 


A 


4999 


3 


330 


PIADRGAEYVSAREWMTICFEL.LQL.LKA 
HKKAIRRATVNTFGYIAQAIGPHDVLAT 
LLNNL.KGHERQNRVCTTVAIAIGAETCS 
PFTALPALMNEYRVPELNVQNGVLK 


4963 


18864 


A 


5000 


3 


110 


VI LQGSNDVELVAEGNSRFTYTVLVDGC 
SKKTSTRP 


4964 


18865 


A 


5001 




329 


NS RRRRNDQGS PNLCQTFTFMTLPYLPE 
HRSLLLKIRSCAERETKKKDDIPEEDKG 
NIKQCEINYVKKFQSFQDHKLKISKEES 
KILKKAQKDGFLHETLLN 


4965 


18S66 


A 


5002 


278 


345 


EDEEGYNDGEVDDEEDEEELGG 


4966 


18867 


A 


5003 


101 


3 


VRII I SG TGKKKKKKGKI.PKN YDPKVTP 
DPER 


4967 


18868 


A 


5004 


18 


350 


VSHECLI FHIQNVLTGLVISLSCPSVPS 
HTHKHTHIHTHTHTHTHTHTFS 


4968 


18869 


A 


5005 


26 


147 


KEEVRMALFANDMIVYLENPFVSAPYVL 
KLISCFCKGRLG 


4969 


18870 


A 


5006 


85 


1 


SASRVAGI IGKHHHTRLIFVFLVETGQS 


4970 


18871 


A 


5007 


370 


3 


RAR KPLLW VLKKI WS P W I YKKKTR INFQ 
KEKKSPFFSRPFFLFKKGGFLPPGVFFS 
RGFLKKI PP FFFFFFETESRSVTQFGVW 
SVQWCNLGLLQPEPPGSKQLLCRGLPSS 
WDYRCLSTRP 


4971 


18872 


A 


5008 


1 


85 


I R I LS KI KNAL.TH FLPQGTPT PL I P I LF 


4972 


18873 


A 


5009 


218 


93 


FFFFFFFFYFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFF 


4973 


18874 


A 


5010 


3 


250 


RPRRRRIiRQENHLNLGGEGCSESRSRHC 
TLTWATEQDSVSKKKKKRVFPPPHPRGK 
FFWAGGPTFFPPLKTVPFPGGGKPNF 


4974 


18875 


A 


5011 


336 


0 


SSSSPPSSSSSPPPPPPPPPPPPPPP 
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nucleotide 
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amino acid 
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C=Cysteine, D=Aspartic Acid, 
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G=Glycine, H=Histidine, I=Isoleucine, 
K.=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro(ine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 

nucleotide insertion 














SNQNGTGGESTYGEKFEDENFHYKVK 


4976 


18877 


A 


5013 


53 


324 


FLVFFVPENSEKISLQLHLALTSNSSWV 
QSPSHLELiMNLCRHINIRVDPTGLREGM 
HYTE VC G YD I AS P NARS L F RG P I T A V I A 
AQVKES 


4977 


18878 


A 


5014 


329 


1 


FFFFFSETESRSVAQAGVQWRYLGSLQA 
PPPGFM 


4978 


18879 


A 


5015 


193 


32 


PLLILCSERLCHCTPAWAIERDSVSKTN 
KTKQQQQQQQNQGTKCMFLALLGQ I 


4979 


18880 


A 


5016 


354 


235 


CGEIGMLLHCWWECKLVRPLWTNLFLRD 
GSTLTVGSKSSF 


4980 


188S1 


A 


5017 


348 


2 


RLLVGKLMELHGEGSSSGKATGDE 


4981 


18882 




5018 


76 


) 


I F FLSQVS PLS KEDAGE YECHASNS 


4982 


18883 


A 


5019 


68 


356 


YFGGVGGFFFFFFFFFWFGPSGGVFFVG 
VQ A IFFFFFFF LGG FF FL VR D YFFVA V F 
WWGLLFFFVFFFWFCFVFFFFFGWKRK 
NIFYGWVFLFCW 


4983 


1S884 


A 


5020 


1 


288 


FFLNLEKNIKMLSSYTDNGIWATAEDF 
MQNFKNLVGYHNS I TEENLPQLGANENL 
ESQSGNFSWFI FFNADRKRGMVLLLPN 
NEMTILKPRTSV 


4984 


18885 


A 


5021 


133 


1 


WATETMWSTKPKMWPGTGAHTCNPSTL 
GGQGGQITSPQKFKTS 


4985 


18886 


A 


5022 


2 


337 


RRSDPNFKNRIjRERRKKQ KLAKERAGLS 
KLPDLKDAEAAQKFFLEKIQLGEKLLAQ 
GE Y E KG VDHLTNA I A VCGQ P QQ I LHVLQ 
QTLP P P VFQMLLTKLPTI S QR IVSAHSL 


4986 


18887 


A 


5023 


1 


325 


VDGCPANLLSSHRSDVLRAETISLGEHP 
CDRGEQVTL.FLFNDCLEIARKRRKVIGT 
FRSPHGQTRPPASLKHIHLMPLSQIKKA 
LDI IETEDCHNAFALLVRPPTEQA 




18888 




5024 


135 




ATMFLNSKVSKYSGYLLGFHECREKGWM 
TWDGERDPSPGILQLQ 












94 


KTAT Kb I GGXH YDSXN I KAI RDGLLARR 
HAL 




18890 




5026 


2 


362 


QELERSMAQRCVCVLALVAMLLiLVFPTV 
SRSMGPRSGEHQRASRIPSQFSKEERVA 
MKEALKVFPTWSTSFIQHEWEEYSHL 
FTIQGSDPSLQPY1.LMAHFDWPAPEEG 
WEVPPFSG 


4990 


18891 


A 


5027 


222 


84 


AASTFCHVSQAGLELLTSDDPPASASQS 
AGITGMSHRTQPMVHLY 










366 


0 


SGRSKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKRGGR 




18893 




5029 


231 


68 


AGMGSRALPKPLLMSHSSLKAVELPDSF 
S PELRSLLEGLLQRDVNRRLGCLGRG 


4993 


18894 




5030 


2 


366 


LNLGGRGCSELRSCHCTPAWVTETPFKK 
KKKVPHGGYFVAQLLPPRKMRTHRRPQA 

PSAFKGFFHPTNKSSSEAKKKKKKPRVG 
RKKNVGYIL. 


4994 


18895 


A 


5031 


241 


36S 


QVERNFKS QSGAEAH I CDLS TLGGCGGQ 
I TRSGVHDQPCQHG 


4995 


18896 


A 


5032 


279 


396 


DGGMWPGTVAHACNPGTLGGQGRWITRS 
GVRDQPGQHGE 


4996 


18897 


A 


5033 


173 


398 


SNESSL.RVNPFFFFFFLKTNFSFCPPGG 
RQGPNFSLLDPPPPKVKEIFCLTPPKRW 
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26 
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nucleotide 
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correspond 

ing to first 

amino acid 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
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correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylaianine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \-possiblc 
nucleotide insertion 














EYRPAPPPPRNFGFLIKTG 


4997 


18898 


A 


5034 


_ 


143 


SNPPALTSQSMGI TS MTRHALP IACFLV 
FVFLFFEMESHSLAQAEVQ. 


4998 


18899 


A 


5035 


350 


2 


QEISSVQTSTQLFNGMTVKARATTREVM 
ATYTIEDIVIELI IQLPSNYPLGSIIVE 
SGKR VG VAVRLF I F I TQKS FIFLFSFLT 
LCLCLQHFHNDFLLLI VP I LI AMAFLML 
TERK 


4999 


18900 


A 


5036 


1 


100 


HECGS S SQRTLSVQEAAAY LKVSNE IRI 
LIAIF 


5000 


18901 


A 


5037 


247 


334 


LQIDISAWAYTIAVKEDGELNLMKKAA 
S 


5001 


18902 


A 


5038 


2 


296 


DKAPMLKVIVNSLKNMINTFVPSGKVMQ 
WDEKLPGLLGNFPGPFEEEMKGIAAVT 
DI PLGKVHLEALKKKVI kff YKFPLRCD 
IHTAKCTGLRYTAQ 


5002 


18903 


A 


5039 


152 


335 


RPIWSYYLLFFFLFWGLYLDVNKRIFTF 
I LSEKYFDMKKNQCKEGLD I YKKFLTRM 
TRISE 


5003 


18904 


A 


5040 


336 


1 


GGLT ISSLLiKEKEGS EVAKFTL.EQLCL I 
CNIMSTAEYCVAATQQLEEKLKEKVDVS 
L FDR I NLS GDMGT F SPVISSSIH LLVQD 
VDAACDPAMVAMS KMQCQNVQHVG VKS S 


5004 


18905 


A 


5041 


3 


204 


LNLGGGGCSEPRSCHCTPSWVTARLHLK 
KKKKKKRGKNPLKNGGKENFKFLQILVN 
PKNSLENLAV 


5005 


18906 


A 


5042 


382 


148 


WCNHRGPRSRKKRSEGSTKSRRLGATIR 
MVTPHTTRTCARDPELTSKEKCVYIEEH 
THTHTHI YIHTHTHTCIYVKTH 


5006 


18907 


A 


5043 


137 


1 


RPRRRYMKKFSTSLIIKKMKIKITMKYH 
LSHLI PVKMAITKKTKD 


5007 


18908 


A 


5044 


198 


34 


KPASRFCHVGQADLEIjLTSSDLPASASQ 
SAG I TGVS HCSQ PNF I TLCLWTDHFF 


5008 


18909 


A 


5045 


11 


357 


LLTYFIIMFKILEIYEKEEQLIIKQISE 
GQEKVKELRQFKEHRKAKDSALQS iesk 
MLELTNRLRESQEE I Q I MRKKKRKNAAR 
GPYWKFKYSQRWKSPPGPFQWGWGPRKN 
FFV 


5009 


18910 


A 


5046 


2 


193 


GLQLLTSSDPPASASQSAGIAGMSHCTQ 
PQVHLMPSLYHFRFLQVDTKDLLRASAD 
LIHRGIT 


5010 


18911 


A 


5047 


1 


196 


KLCLVMNSPMKAAPRNFSCAPSIjSLLPF 
SFRKTTTTLTSNTADDFAYFDLCKMEPC 
SMVLFFVFC 


5011 


18912 


A 


5048 


386 


276 


AQAVLELLDSSDLPPSASQSAGISGVSH 
CTQPDSIF 


5012 


18913 


A 


5049 


1 


324 


VDAAAEKLEASTGWLMRFKERSCLHNIK 
MHGEATVADTEAAAGYPEDLAKITDKGV 
YTQQQ I LNGDEIAFCWQKIPCR I FLARE 
QAVPG CNAS KARI/T VLLWANAAGD 


5013 


18914 


A 


5050 


1 


129 


PRS CHCFPAWVTEQDS VS QKKKKKKKKE 
KEKKKNPVEKSAKVI 


5014 


18915 


A 


5051 


193 


356 


RS F I P S AS AS AS TL CLKCSQNGQ P GAVA 
HACNP S TLGGQS RQLTRS GVQDQSGQ Y 


5015 


18916 


A 


5052 


192 


341 


AKRVKRNNFFFFFLKQFCSVPQAGGRGP 
DPGSLKPLPPGLKGFSCPTPLN 


5016 


18917 


A 


5053 


340 


1 


KRIPDKPQKELRRLATKLIREAPEKGQA 
QGKE3HKS I QEAKGE I FKAIDRI KKSQF 
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NO: of 
nucleotide 
sequence 


SEQ ID 
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M 
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NO: in 
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nucleotide 
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ing to first 
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peptide 

sequence 


Predict- 
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location 
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residue of 
peptide 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Gtycine, H=Histicline, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 














DRQEALDILLVMQNALESLSNRIEQVEE 
RNS E LED KVFD LTRS S KAKG KR I RNYDH 
S 


5017 


18918 


A 


5054 


2 


383 


WKVATQPADNPLDVLSRKLHLGPNVGR " 
DVPRLSLPGKLVFPSSTGSHFFMLGIGD 
IVMPGLLLCFVLRYDNYKKQASGDSCGA 
PGFANISGRMQKVSYFHCTLIGYFVGLL 
TATVAS R I HRAAQPC 


5018 


18919 


A 


5055 


2 


383 


GSVLSKKGDYLKYHYNASLLDGTLLDST 
WNLG KTYN I VLGS GQVVLGMDMGL KEMC 
VGEKRTVIIPPHLGYGEAGVDGEVPGSA 
VLMFD I ELLELVAGLPEG YM F I WNGEVS 
PNLFEEIDKDGNGEV 


5019 


18920 


A 


5056 


2 


383 


AVIDEVRTGTYRQLFHPEQLITGKEDAA 
NNYARGHYTIGKENIDLVLDRIRKLADQ 

LSVDYGKKSKLEFSIYPAPQVFTAWEP 
YNFILTTHTTLEHSD 


5020 


18921 


A 


5057 


26 


452 


KLMSLRLRQAWHEAAIDEFRTGTYRQLF 
HPEQL I TGQEDAANNYARGH YT I GKENI 

GTGSGFTSLLMERLSVDYGKKSKLEFFI 
YPAPHVFTAWEP YNFILTTHTTLEHSD 
CA 


5021 


18922 


A 


5058 


2 


385 


AVI DE VRTGT YRQL FHPEQL I TG KEDAA 
NNYARGHYTIGKEI IDLVLDRIPKLADQ 
CTGLQGFLVFHS FGGGTGSGFTSLLMER 
LSVDYGKKSKLEFYIYPEPHVCTAWEP 
YNFILTTHTNLEHSDC 


5022 


18923 


A 


5059 






GDAANMYARGHYTIGKEI IDLVLDRIRK 
LADQCTGLQGFLVFHSFGGGTGSGFTSL 
LMERLS VD YG KKS KL E FS I Y P APQ VS TA 
WEPYNS ILTTHTTLEHSDCAFMVDNEA 
IYDICRRNLDIERPTYT 


5023 


18924 


A 


5060 


103 


3 


KI F F FLRWSFAL I TQAG VQWRGLGSLQP 
LPRAT 


5024 


18925 


A 


5061 






KASPFRTGTAFGNGKTSDYLLLGNFGYT 
FGGITGCLKAGLETSYWTWFTH 


5025 


18926 


A 


5062 


293 


406 


VI I GS I FE VI WAVI KPGTS FG I SVLRAL 
RLLRIFKVTK 


5026 


18927 


A 


5063 


419 


15 


WEEEGPLPKKKKGGFSNKRGNI IMGPFL 
KRPPEKKPPPPPRKINPFFFFFKRKGPP 
PPRVKGKGAIKGSRNPPLFGSRDFFSPA 
PQKSGAPGGPPPPPVNKNQFFFFFFLVE 
TGFHHVTQAGPELLSSSSPPTI 


5027 


18928 


A 


5064 


2 


216 


GLTNLFIFCRDRILLCCPGWSQTPSLKQ 
SSHLSLPKPWDYMYEPLCLYFLYPWFCL 
SLSLSFPLSHTYFFG 


5028 


18929 


A . 


5065 


259 


36 


SQPHWDYRASRAEE IATFLEVWLQAIFV 
CCLFLRQSPFIAQAGVQPVHYLKSLQPPP 
PGLKQFSCFSLLLVPRLA 


5029 


18930 


A 


5066. 


2 


404 


GKGAPTTSLISVAVTKIIAKVLEDNKLP 
GAI CS LTCGGAN I GTAMAKDER VNLLSF 
TGSTQVGKQVGLMVQERFGRSLLELGGN 
NAI IAFEDADLS LI VPSALFAAVGTAGQ 
RCTTARRLFIHES IHDEWNRL 


5030 


18931 


A 


5067 :J 


1 


400 


GENMITGTSQADCAVLNVAAGVGEFEAG " 
I S KNGR TR EHALLA YTLG VKQL I DGVNK 
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ISDOCID: <WO 016483SA2J_> 



WO .01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K- Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MDSTEPPYSQKRYEEIVKEVSTYTKKIG 
YD PDTIAFVPI SG WNGDNMLE PRAN I P L 
VTGWKATPIDCDASGTTLLDA 


5031 


18932 


A 


5068 


41 


392 


GSPHHPCAHIERKKKPYNSNIGFYTKRN 
ALRVAEVWMDDYKSHVYIAWNLPLENPG 
I D I GD VS ERRALRKRLKCKN FQWYLDHV 
YPEMRRYNNTVAYGELRNNKAKDVCLDQ 
GPLEN 


5032 


18933 


A 


5069 


3 


395 


GTPTRPHILLQADFIRAMLPCPDYDTDT 
KTGLDQALKICQAMLDEAANQGWIiVTVD 
NNTNLIQMAIQGRWVKDSSLLTLPYIEN 
HHLHLFKKRKPIMKGPHAKSRTSIECLP 
ELIHACGGKDHVFRSLVQQ 


5033 


18934 


A 


5070 


3 


405 


PRASEVCGFSCHTTCVNKAPTTCPVPPE 
QTKG PLG I DPQ KG I GTAYEGHVRI PKPA 
GVKKGWQRALAIVCDFKLFLYDIAEGKA 
SQPS WI SQVI DMRDEEFS VSS VLAS DV 
IHASRKDIPCIFRVTASQDSAS 


5034 


18935 


A 


5071 


3 


393 


ITRQEFIDGILASNFPTTTIEMTVGADI 
FDREGDGYIDYYEFGAALHPNKDAYRPT 
SDAPKTFHQGTRQVAQCICAKRFLVEHI 
GENKYRFFLCNHFGDSHQMRLVRILLST 
VMVLDGGGWMALDQFLTT 


5035 


18936 


A 


5072 


3 


394 


ITRHEFIDGILASKFPTTKIKMTVGADI 
FDRDGDGYIDYYEFVAALHPNKDAYRPT 
SDAYKI EDEVTRQVAQCKCAKRLLAEH I 
GENKYRFFLGNQFGDSHQLRLVRILRST 
VMVPVGGGWMALDEFLVND 


5036 


18937 


A 


5073 


1 


393 


GEDAANNYSRGHYTIGKEIIDLiGlaDRIC 
KLADQCrGLQGFLVFHRFGGGTGSGVTS 
LI..MEHLSPDYGPKSKLEFSIYPAPQVFT 
AVHEPYNSMLTTHTTLEHSDCAFMGDNE 
AI YD I CRTMLD I ERTTYTN 


5037 


18938 


A 


5074 


39 


482 


LGLHSAWRDDKIGYNPDTVSFVPISGWN 
GDNMLEPSANMPWFKGWKVTRNDGNASG 
TTLLEALDCILPPTRATDKPLRLPLQDV 
YKIGGIGTVPVGRVETGVLKPGMVATFA 
PVNDTTEVKSVEMHHEALSEALPGDNVG 
FNVKNVSV 


5038 


18939 


A 


5075 


1 


390 


GVSMAVSLVIIYQYWRNMPDPHNLPIV 
AGW KKYPLFFGTAVFAFEG I RWLPLEK 
QMKESKRFPQALNIGMGIGTTLYVTLAT 
LGYMCFHDEIKGSITLNLPQDVWVYQSV 
KILYS FGI FVTYS IQFYV 


5039 


18940 


A 


5076 


330 


462 


VNFFFREGGTESPSIAQARVPWCSLNSL 
QPLPPGFKRFSCLSLP 


5040 


18941 


A 


5077 


1 


403 


GGWGVEAEDFEYAPDVEPLEPTLSNI IE 
QRSLKWIFLGGKGGVGKTTCSCSLAVQL 
SKGRESVLIIFTDPAHNISKAFDQKVSK 
VPTKVKGYDNLFAMEIDPSLGVAELPDK 
FFEEDNMLSMGKKMMQEAMSAF 


5041 


18942 


A 


5078 


348 




RPQGNNI ERWQGGEVLiI IQLLR IAKS VK 
NVQDLWKTQFLIKLHILLFYDPRIPFVS 
NYPKEKKTYVHKKTYAQIFISLPRTFLF 
LRQSLKMLPRQDLSS 


5042 


18943 


A 


5079 


248 


3 


PGQVLEREHTCRRPWAWNVC I CLAGPA 
KPNREDVSSSQEESLQLNSIPPTPTLTS 
TAVKSRQPLWGLKEMEEEDGSELDF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
bd 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspoh 

last amino 

residue of 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutflmic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop coclon, /=possible 

nucleotide insertion 


5043 


18944 


A 


5080 


3 


192 


GDGSEIjRSCHCTPAWLTERDTSSQKKKK 
KEKKTWNPAPELFFFLLLEKKGQNQGKP 
NMKKVL 


5044 


18945 


A 


5081 


10 


171 


CFCLGPVGVCRSLLKPQLCFLGFPWLQD 
ARQKFRSVLVEATVKLDELVTRSTRP 


5045 


18946 


A 


5082 


94 


1 


DFWPGKVAHACNPSTLGGRGGWITRSGV 
QDT 


5046 


18947 


A 


5083 


142 


3 


AAS TGSHS VAQAGVQWHDLS SLQ P LP PG 
FKRFSFL.SLPSSWGRKIA 


5047 


18948 


A 


5084 


257 


177 


I S LG E VAHACN P S TLGG RGGQ I TR PG 


5048 


18949 


A 


5085 


143 




GGESHSDTQAGVQWCYLGS TQP PLTLTF 
KQFSCLNLPSSWNYRCVPP 


5049 


18950 


A 


5086 


3 


87 


HVGQAGL1VLL1AS CDPPSLASQSDG3TV 


5050 




A 








LHSLSQHI ATRS KKPAFNLFAFGTLS PV 
STNWVGFNP I S FEGLML I I FFFFF 


5051 


18952 


A 


5088 


40 


203 


I FLP LLiLVS FFFYI LTWG FTF FWGLTL I 
NDLLNF FFGNSG I S FS WFGS RAGELG 














MDMRVPAQLLGLLLiLWLSGARCDIQMTQ 
SPSS LS AS VGDRE DGD VDAA 


5053 


18954 


A 


5090 


146 


328 


FMPFPLLESLEIFSSKLFNPPNLFFFFE 
TESCSVAQDGVQWYDLGSLQPPSPGFKR 


5054 


18955 


A 


5091 


338 


I 


PNL P S VQLP PTTS CTPL IGDIiLVARAHF 
AGWKHQGINSPSLPMAPREGMEAPRNPF 
HPHIFLCLTHGFHLAVSKLYSLFYYYYY 
YFSRWSFTLVAQAGVPWRDLGSLQPLPP 


5055 












PPLTPPIFPSSPKNKNPPPPHYNPPPPP 
FRTPPPPPYPPLSPPKSPPPPPRVDPPL 
P YS I FPRPNIi I S PP P YS PFYLLPQLQAF 
P 


5056 


18957 


A 


5093 


133 


3 


AQTCTPSTQINSKWITDLNVKGKTIKLL. 
KDNIREKLDDLGCGE 


5057 


18958 


A 


5094 


170 


350 


AGGQGGNFYSLQPNPPWIRESSHSTLPK 
FWNYRHTPTGPPNFGFFFIjKMGFGIjVAQ 
TIFN 


5058 


18959 


A 


5095 


133 


274 


RDIIIFLESGIKGYFLFFSDKITSELVS 
KIGDKNWKIRKEGLDEVAG 


5059 


18960 


A 


5096 


2 


188 


REMQIKTTVRYHLTTARMVLI KXSEXNR 
CWHGCSEKGTLLHCWWECKLliQPLWKIG 
WRVDAA 


5060 


I896I 


A 


5097 


235 


81 


FPKKKKAQIKKPQAAQSLNLSLSLYIYA 
FTYTHTHRHTHTQTHTHTY IH I S 


5061 


18962 


A 


5098 


355 


169 


KILLLITDSCAHISCRFSECMKYFNFFF 
HPHS V YVTDDK I R LLEEQL PHVFSNKME 
PFKVCN 


5062 


18963 


A 


5099 


160 


1 


FSASYFNSKKKLIFLLLKLRWNFFVCFF 
GME S H S VAQ AG VQ W CHtiGS L Q P Ij P S 


5063 


18964 


A 


5100 


364 


151 


QMGFCHIGQADLELLTSSDPSASASQSA 
GITGVSHYASQEFLKKEFHSAHLIPLQG 
TCVQGKTAQPYSEAL 


5064 


18965 


A 


5101 


205 


3 


AASPGVQWPDSRL.LQPRPFGVKQFFPLS 
FPSSWGYRGAAPPPGHFFFFFPSRDEIS 
VCLPGWSPTPE 


5065 


18966 


A 


5102 


349 


207 


DGASLCCPDWGLELLSSSGLPALALQNP 
EITGVGHHTQPWTLSLLIY 


5066 


18967 


A 


5103 


425 


0 


CEFFKNKDFCFLSKKKKKKKKKKKKKKK 
KKKQS S S QTSYS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopl»an, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibte 
nucleotide insertion 


5067 


18968 


A 


5104 


437 


184 


LFLFFFPPPKKVFSQKTPRSVYSPPPKK 
KKKWSP P PLKLGP PQE I FKTP P PP I FFF 
FFFMFFFSFFFFFKGHRPRFISLWSPPL 


5068 


18969 


A 


5105 


401 


10 


MFSPPFGGPRGGVPLRPGFKNPPGPKGK 
TPFFLKNQNYLGPPLYSPFLERLNKKKA 
FTLEAKGPNNPNSSRSPPPGAQKKNPPF 
KKKKKKKRKI I FMPQAHPAVYWCSGNQ 
LPITFPYSIISGPLSSFT 


5069 


18970 


A 


5106 


413 




PNKKPSPPPFSPPHKTPSSPSPPPPPNK 
PTPTPPPPNKKNFPPPQPPPKKKNPPQT 
PPPQKKTPPPPKKKKHPPPPHKKTPPPP 
HTKNPPPPPPLPPSLSIPPLFPLYINSP 
FFIPFFFFFFFKIFFYLGRVGGRV 


5070 


18971 


A 


5107 


269 


115 


GFFFFFFFSFFFFFFFFFFFFFFFFFFF 
FFFFFFIKKCHFIIFKYIGSTSN 


5071 


18972 


A 


5108 


238 


4 


QWLPLVDRIWVILLFFDSLKLKMFPQYQ 
IGLQKI KVKNNTTFYFLFSFFLRWSFTL 
VTQAGVQWCDLGPLQPPPPRFK 


5072 


18973 


A 


5109 


496 


0 


STPSRASPKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKSSK 


5073 


18974 


A 


5110 


1 


243 


DHLiSLGGRGCSEQRSCHCTPAWVTERDS 
VSKKKKSFRALDIFFFNGKKILGTLRGK 
NQKPPREVSSGNYFHFKGLFGPCYK 


5074 


18975 


A 


5111 


78 


4 


LGMVAHACNPSTLGGRGGRITRSGV 


5075 


18976 


A 


5112 


2 


96 


AGVQWRDLGSLQPPPPGFEQFSCLiSTMP 
NFL 


5076 


18977 


A 


5113 


128 


6 


IYKDVLEPGVL.RLLDVDNRWLPIEAPI 
RIIIPSQDVLHS 


5077 


18978 


A 


5114 


132 


2 


DDLSSLQPLPPRFKRFCCLSLLDSWDYR 
RP PLRPAI QEAEAAE 


5078 


18979 


A 


5115 


1 


146 


AVETGFCHVAKAALELLHSNDPLASASQ 
SAGITGMSHRAWPSLFSILS 


5079 


18980 


A 


5116 


213 


3 


GAVGFSSGCFLLYLYALLGPFPPLPLLQ 
LKF I CWPGAVAHVCNPSTLGGRGGRITR 
SGVRDQPDQHDET 


5080 


18981 


A 


5117 


407 


0 


LGWAWWFTPV I PTLREVKAGGS PE VRS 


5081 


18982 


A 


5118 


215 


414 


KIRPKE IRNNGGSQQRKRAQLGKLEQTN 
PVLNFKAP FFGEQGDYYKS FFKTCLDNL 
PRRGKGVFFF 


5082 


18983 


A 


5119 


40 


326 


KKKKKKKKKKKKFKKTGGAKWAPFLNLK 
KKGIKPQKKKGFFQGKKKRGKKKKIFKI 
LIQGFFQKKKNFSKKKISQKKKKNGFWP 
LGKGFCPKKKT 


5083 


18984 


A 


5120 


99 


2 


GRSTRHS PAHTHTHTLRHTGTLLHTHSH 
RHSP 


5084 


18985 


A 


5121 


3 


219 


ELRLCHCTPTWATRVRLSQKKKREKKKK 
NFVGKKKGKKGKKPPNPKKKGSGLGFKG 
FLFSKFFKTRGTTII 


5085 


18986 


A 


5122 


262 


351 


TRPGTLAHACNPSTLGGRGGWIMRSGVQ 
DQ 


5086 


18987 


A 


5123 


340 


0 


KKKKI SQAWWWVP WPATWEAEVG 


5087 


18988 


A 


5124 


94 


2 


LMP 1 1 PALWE AE VGGAPGQE FQTS LANM 
VK 


5088 


18989 


A 


5125 


310 


68 

131 J 


LSPFYKLGFWGRVIYPPPPPKVSPPVFG 
APPPPEKKKKPPPPPPKSSIKKPTEKKI 
CLAASGKGLEVCFCKKPLIQSYFI. 


5089 


18990 




5126 


343 




SRPRRPGliELIjTSDDLPAPATQSAGITG 
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M 
eth 
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beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=IIistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methioriine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VSNHTQIWPISVGDLMNTGVVIP3NRIV 
SELLREIHFIiRICL 


5090 


18991 


A 


5127 


2 


145 


FSLLGSSVGRLEGTGTISTHCTLPASRD 
S PASNYRI/TPPCPADFLYF 


5091 


18992 


A 


5128 


1 


130 


SRSVTRLECSGTISAHCNLPGSSDSPAS 
DYRLTPPCPADFLYF 


5092 


18993 


A 


5129 


1 


338 


WRKEGIRFEKEKSKDFKNHVIKYLETL 
LYSQQOVCKLWVTFI 


5093 


18994 


A 


5130 


346 


3 


DFFFFHP I LFFFFFKKTLFFS P PEGNGG 
HTLFLKNTPPRGRIFFFLPPLLL.WGGWA 
PPPPPKKNFFFFFFFFFFFFFFFFFFFF 
FFFFFFFMPPTQSLFKDQEGPVQGGEEA 
PV 


5094 


18995 


A 


5131 


3 


114 


FLHIGQAGLELLTSDDPPASASQSAGIT 
GMSHHAQP 


5095 


18996 


A 


5132 


292 


180 


AASTDSIEGHGASLPSKKTPSEEDFETI 
KLISNGAYG 


5096 


18997 


A 


5133 


338 


216 


RGENRLPPGGRGFSGPKSHFCPPAWATE 
RDSLS KKKKNPV 


5097 


18998 


A 


5134 


337 


3 


IWLGNGKGIIRNNQIFSNKEAGIYILY 
HGNP WSGNHI FKGRAAG I AVNENGKGL 
ITENVIRENQWGGVDI RRGG I PVLiRSTL 
I CFGYSDGWVGDEGKGLI EGNP I SAN 


5098 


18999 


A 


5135 


227 


343 


NYVKEKLIPTWNWMVS IMDSTEAQLPYG 
SALTSWDPGQ 


5099 


19000 


A 


5136 


166 


375 


ATFVSLGVFCSAVI LLYFKNMMKLDLSL 
TPHTT INSKWIKRLHTRPEAI KFLEVN I 
GKKFFDIGHEIIL 


5100 


19001 


A 


5137 


90 


2 


KTTLWPGAVAHACNPS ILGGRGGRITRS 
G 


5101 


19002 


A 


5138 


213 


379 


PTFQSVGETGSLLKMHILGPGAVAHACN 
PSTLGGRGGRITRSGVQDQPGQYGETP 


5102 


19003 


A 


5139 


384 


2 


FFFFFFLSVMESCSATQAGVQWCDLSS 


5103 


19004 


A 


5140 


41 


317 


TMSRDRPSDKTWTYNRSNWMPDDGAPF 
RYSFSALKDRHNAVEVNWIDPNNGWETA 
TELVEDTQAIARYGRNVTKMDAFGCTSR 
GQAHRAGL 


5104 


19005 


A 


5141 


29 


523 


VKAEAAYKKADDIWNLRKDDYFVNDEAR 
ARYWDDREKARLALEAARKKAEQQTQQD 
KNAQQQSDTEASRLKYTEEAQKAYERLQ 
TPLEKYTARQEELNKALKDGKI LQADYN 
TLMAAAXKDYEATLKKPKQS S VKVSAGD 
RQEDSAHAALLTLQAELRTLEKHAG 


5105 


19006 


A 


5142 


432 


144 


GGFLPKAFLPPKKKKGAFFPPPGKGAIP 
PIiFWGGWTVFL.IPEKNPPRGFPPP.KRGE 
KPVGGPFFRIFAPPPQKKIGGGGFFLER 
FFFWDWGKTGP 


5106 


19007 


A 


5143 


403 


6 


FFFFKKQIGPRGGGPPQKSPPLGGGGRG 
GSPRPGVLTPLGPQGKNLFFLKKKKKLG 
GGGGPRNPNFLGGGPGKSLYPGGQRFQG 
PKILPFFPPRGKKKKIFFPKKKKKKSHR 
KL I QTLI LQTGKLMKMLWRT 


5107 


19008 


A 


5144 


115 


427 


KKKKKKKKKKKKKKKGGGPKKKKNFSPG 
GGKKKFFFNGAKKKKPRAPVKKTGEKKK 
GGKKKKKCFEKNPFFGGGKKKKKKCSSS 
YPCPKKQ 


5108 


19009 


A 


5145 


2 


79 


GCSEPRSCHCTPAWVTEGDSISKKK 


5109 


19010 


A 


5146 


317 


444 


IHQPVCVAHAYNPSTLGGRGGHITRSGD 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, II=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RDHPGQHGETPYPL 


5110 


19011 


A 


5147 


3 


413 


S FYRGF I P VLQTVTVLI IGDPS LHGDAW 
S WME FFLTVI F I AL WMLPL I GLT I WNA 
IWFQDIADLAFEVSWTKPHPFPSVSQI I 
ADML FPLLLQALFL I QG IAVS LFPIHLV 
GQLDSI1L1HMTLLYSLYCSEYRLVIN 


5111 


19012 


A 


5148 


409 


247 


QTKFRHVAQAGPHFLGSSDVPTSASQSV 
G I TGM S LHAQHQHFFTASMGRAFGEN 


5112 


19013 


A 


5149 


284 


I 


AEASRAHMEAHARTQGTPGSKAEGVGPV 
EEYRIWRRPGPDRAHASDSEESGEPGGP 
PPHPANFVFLVETGFFRVGQAGLEFPTS 
GDPAASASPV 


5113 


19014 


A 


5150 


439 


120 


FGSPRLECSGVISAHCNLRLLGSSDSPA 
SASHVAGI TARPNARI IKAFGAPSTCTV 
WERKEQQTCIQRSDLPAGRQKKTGPKN 
RPGLVAFPFNASPLGGPGGRIA 


5114 


19015 


A 


5151 


415 


0 


FFFFFFFFFFFFLPFVQSFYIYIYMYI 


5115 


19016 


A 


5152 


67 


179 


ICLSLFLTVHVCVCVCVCVCVCVCVCEM 
VFGVSIPCN 


5116 


19017 


A 


5153 


280 


2 


KTPFFLNLKEKTRAPFFLKKTFFKKKPF 
KIFFS KKRNFFFFP PPGKRP PTLFKKKF 
FFFFQKKKNFFFSPKKKKKKKKKKGRRS 
RSRTSPRV 


5117 


19018 


A 


5154 


296 


406 


DR VLLCH S S CS EVAES RLTAAST S WAQA 
ILLPQPPEW 


5118 


I90I9 


A 


5155 


114 


402 


VWRI KAS VEKKKKKKKKKKQKKKKKKKR 


5119 


19020 


A 


5156 


110 


I 


QRFLL TLKS L I LRLG AVAHACNPS TLGG 
QGGRIMRS 


5120 


19021 


A 


5157 


382 


95 


SSSSPHFLTPSQLRLFFPFFPLKIFFFP 
KAFNF CGGVFP I FS PPKKKF F FQNSQVG 
FKNP PQKE KNFS FP P P VKFGP P KG FFKR 
PPPFFFFFFFFF 


5121 


19022 


A 


5158 


2 


350 


TLQPGRQSETPFQKKKKKKKKKKKKKKK 
KKKKKKRGGGFLKNLGGGPKIWGVEKFF 
FFFFGGGYKKPLGFFL.EKIFFLGGGIWA 
PPPPKKISFWGKKKFFWGGGGKTPLFFC 
GGKK 


5122 


19023 


A 


5159 


1 


208 


PTRTLYL VKLGLMG PAEIFLS S FQEIRL 
I D YD KMVDHRGVR WAFGQW AG VAG KYA 
RAYITHFLNWLGE 


5123 


19024 


A 


5160 


1 


208 


PTRPLYLLKL.SLMGPVEIFLSSFQEIRL 
I DYE KMVDHRGVR WAFGQWAGVAGKYA 
RAYITHFLNWLGE 


5124 


19025 


A 


5161 


419 


25 


AQKKKKKKKKKKKKKGGGPLKKNFGGGK 
NKRGKKKKNFFLKRGGKKKTGGILSKKN 
FFGGGKKGEKPPQKKKALKEKKKFLRGK 
GGKKPLNWGGKKKW 


5125 


19026 


A 


5162 


475 


150 


KPRTSGTARVPGDPPPRCLDDKRLPSRP 
VIFVFLAETECDRVGQDGLQLLTSGDHP 
HL.TFPNCYNYRREPPRPTSSGFQHDIET 
LHYRRGKADRFKAHLPK 


5126 


19027 


A 


5163 


237 


409 


FHNQKTYLDSALCTLVYFWFVLRDGI F 
LLPQLECSGTVMAHCNIjICLIjGSRSPPAL 
A 


5127 


19028 


A 


5164 


373 


476 


LGFILLS fletrshs vaqae VQWCDHGL 
LQPQTP 


5128 


19029 


A 


5165 


374 


132 


I FYSPFLYYHMLGKYHKIKNLKI KHMWL 
DAVAHACNPSTLGGRGGRITRLGDRDHP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

lo cation 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidirie, J- Isoicucinc, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, ' =Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GQHDETPSLLFLWTFASSLSGFPL . 


5129 


19030 


A 


5166 


332 


2 


RPQGLANCFPGALPPTKSEGFPGQSPGF 
QIWGGKAFFLRGFPFFPQVESKGAVSPP 
CKVPPPGLRPFSAPTPPKNGDKRGPPPG 
RATGFFFFFVFLVETGFHHVSNDGLY 


5130 


19031 


A 


5167 


238 


1 


GGFTGTGGKYEPAGIVHRGEFVFTKEAT 
SRIGVGNtiYRLMRGYATGGYVGTPGSMA 
DSRSQASGTFEQNNHGSAGESRG . 


5131 


19032 


A 


5168 


1 


89 


FFFFKTESRSVAQAGVQWCTLGSLQPPL 
P 


5132 


19033 


A 


5169 


3 


127 


DEDFSDFDEKADDEDFVPSDASPPKTKT 
SPKQVSYLIWVLS 


5133 


19034 


A 


5170 


20 


254 


IAPPARIRHEERERERERERERERERER 
ERERERERDTRAEWRARLFFFFFFFLPA 
DQCFFIKPSGQRFLPPGGGGVP 


5134 


19035 


A 


5171 


36 


479 


TXGXDNKKDLLISVGDLVDRGAENAECL 
ELITFPWFRAVRGNHEQMMIDGLSERGN 
VNH WLLNGGG WVFNLDYDKE I LAKALAH 
TADELPLIIELVSGDKKYVICHADYPFD 
EYEFGKPVDHHQVIWNRERISNSQDGIV 
KEIKGADT 


5135 


19036 


A 


5172 


278 


448 


RFLCPKLHPFFSSETKSLTLLPRLGCSG 
VIIAHCSLKLLGSSNPPTSASRVAGMTG 
S 


5136 


19037 


A 


5173 


205 


1 


HARLI FVFLVETGFHCVGQAGLELLTSG 
DPPSASQSETPSPLKILKSAGCGGANST 
AtASGLLFSLGP 


5137 


19038 


A 


5174 


409 


194 


FFFLRWSIALSARLVQWRDLGSRQAPPP 
GFTPFSCLSLLSISFQPSATYMRFLNTI 
TIVEYSFAYFPISQP 


5138 


19039 


A 


5176 


339 


472 


AKFTC I SSN I KLSNTRPGTVAHACGPST 
LGGRGG Q I TGSGDGDH 


5139 


19040 


A 


"5177 


426 


148 


GGRGPPPLSHPFGGGGGGGPLRAGGQKN 
PGPKGETPFFGQTQKKNRGGATPPFSQK 
LGGEKHKNSFTPGKENSIKFDFPPAPPT 
WGEKKNFFF 


5140 


19041 


A 


5178 


328 


474 


NNTEYNKNRLGAVAHACNPSTLGGRGRW 
ITRSRDRISP 


5141 


19042 


A 


5179 


448 


330 


ETGFCHVGQAGLELLGSSDLPASASQSA 
GIIGVSPHAWQ 


5142 


19043 


A 


5180 


259 


14 


LRYKAILCSWIRTTNIVKMAELPKVIYI 
FNAI P I FKIPDMFCRI ARF I LKCMWNVK 
VSTIDKTIFKKENRVGIPPRVLMVV 


5143 


19044 


A 


5181 


2 


423 


KFYATLVRYVGDRKNLVCREMSMALLSN 
LAQGDALAARAIAVQKGSIGNLISFLED 
GVTMAQYQQSQHNLMHMQPPPLEPPSVD 
MMCRAAKALLAMARVDENRSEFLLHEGR 
LLD I S I S AVLNSL VASVI CDVLFQ I GQL 


5144 


19045 


A 


5182 


182 


312 


FFIQVGFHHVAQAGL.KLPSSSSPPHLAS " 
QSAGITGVSHCAQPG 


5145 


19046 


A 


5183 


301 


472 


GIPFFFFFLRRSFAIiVAQDGVQWRDLGS 
PQPPPPGLKQFFRNSTAWTKGLQPFPRL 


5146 


19047 


A 


5184 


216 


413 


KPXXKKKKKKKKKKKKKKKKKKXGGAVK 
KKGPQKTP VKPPGGFFFAPPLGVS PPPG 
VFFAGGGAPP 


5147 


19048 


A 


5185 


357 


465 


I KPTK I LAPG PGAVAHACN PS TLGG QGG 
WITRSGD 


5148 


19049 


A 


5186 


467 


0 


FFFFFFFFFRFLFFFFFSEKK _ . 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenylnlanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X"Unkno\Yn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5149 


19050 


A 


5187 


346 


3 


RFLPLGQGGVEILTPRSTCLGPPKWWDY 
KGE PLRPAQKCFF FKKKRRR KKVQNS VA 
VYM VYHSMCGENKLRWRWWG I FTHTHTH 
THTHTQKKQNI LSRTHKKLV I TTGS EEG 
TL 


5150 


19051 


A 


5188 


II 


109 


GETPSLLKVQKIXWAWWHMPVIPAMWEA 
EARES 


5151 


19052 


A 


5189 


3 


151 


QLLRRPRQENHLNPGSRGCSEPRLHHCT 
PAWATVQDS I SNTNNNNN KC P 


5152 


19053 


A 


5190 


2 


373 


AVAADKGVPLYRHIANLAGNPDLILPVP 
AFSVINGGSHAGNKLAMQEFMILPVGAS 
SFKEAMRIGAEVYHHLRGVTKAKYGKDA 
TNVGDEGGFGP I I LDNNDALELVKTAI Q 
AAGYPDKGAIGR 


5153 


19054 


A 


5191 


55 


421 


ARVADVCESMKEHLLVLVERAKYI PGFC 
ELPLDDQGALVTAHAGEHLLLGATKRSM 
VFKDVLiLiLiGNDYIVPSHCPEIjAEMSRVS 
IRILDELVLPFQELHIDDNEYAYLKAII 
FFDPDKETEA 




19055 




5192 


163 


466 


TCFLGSQSAGITSVSHCPSREVFFLKLI 
HWRQGGQVALLVATPHS PCCPQYRLAP I 
PRARHDFACAS L I FAC I LLVHVLLMPRV 
SAG RG VG VR PAG I Q AG R 


5155 


19056 


A 


5193 


470 


0 


ECLWPGTVAHASNPTTLGGRGGRIMK 


5156 


19057 


A 


5194 


491 


281 


RGLALI.PMGQCSGMMLAHGSLDLLDSSD 
PPASASQSVGIMGVSHHAWPSL.VNISFV 
CLIHRSPKTKPEGR 








5195 


3 


222 


LSFHSLiHRCLYMLGTTSENVSPFSLLEL 
LSRLATLLGDYCGSLS EGT I SRNVALVY 
ELLDEVXjRXiESRCVAQA 


5158 


19059 


A 


5196 


20 


191 


STWWNSSRGGGCSEPRSHHCTPAWVTER 
DSVSKTKKKINNKPALALPAVHPTSHPG 
S 








5197 


460 


350 


RRLRRENHLNPGGGGCSELRSSHCTPAW 
VTERGPSD 


5160 


19061 


A 


5198 


174 


41 


SLGELLTECLiFETKSHSVTQAGVQWRHP 
GSLQPPPQGFKRIPPH 


5161 


19062 


A 


5199 


131 


454 


FEYFKNRKPFFFFFFFGKGVSFCAPGGK 
AGDPFKLREPPPPRVKGFFGPPPPSGGN 

TPGGGP PPPAPQRGGINGLDP PAR 














RSGARAQPNR 


5163 


19064 


A 


5201 


414 


497 


LLFXXXXDNGSIYNPEVLDITEETLHSR 


5164 


19065 


A 


5202 


381 


614 


VQPGAPPEWVALSTCPSAAPEGSQQPYI 
PPTFCFFPV KWLLS VTWL YL FIYFLSWI 
SVAQAGVQWCNHGSLiQPRPSRIj 


5165 


19066 


A 


5203 


377 


485 


PKEPGPQPCAPQPQPHAPQPQPRAPQPQ 
PCAPQPQP 


5166 


19067 


A 


5204 


54 


434 


MRTPENLELTNPQEFGSSWAAVECPDTL 

EICH P WDVT WFYSNCLTDTCGCS QGGD 
CECFCASVSAYAHQCCQHGVAVDWRTPR 
LCRECPRQSPEGTE 


5167 


19068 


A 


5205 


201 


418 


GDAGAHAVLRLLPGHPGTSHLPFSSLTT 
SPLGLKRVTGGFNSKNRCDARTYCYLLP 
TFAFAHKDRDVQDETY 


5168 


19069 


A 


5206 


27 


177 


NS FHHAGQSGLE LLTSSDP PAS ASQSAG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

nmino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

tocation 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GUitamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, Y=possible 
nucleotide insertion 














ITGVSHRAQPKAGTFLSPFHR 


5169 


19070 


A 


5207 


6 


1175 


DL.GPHTPAWTRPKREDLVYQSTVRLPEV 
RISDNGPYECHVGIYDRATREKVVLASG 
N I FLNVMAP PTS I E WAADTPAP FSRYQ 
AQNFTLVCI VSGGKP APMVYFKRDGE P I 
DAVPLSEPPAASSGPLQDSRPFRSLLHR 
D LDDTKMQKS LS LLDAENRGGR P YTER P 
SRGLTPDPNILLQPTTENIPETVVSREF 
PRWVHSAEPTYFLRHSRTPSSDGTVEVR 
AliLTWTLNPQIDNEALFSCEVKHPALSM 
PMQAEVTLVAPKGPKIVMTPSRARVGDT 
VRILVHGFQNEVFPEPMFTWTRVGSRLL 
DGSAEFDGKELVLERVPAELNGSMYRCT 
AQNPLGS TDTHTRL I VFENPN I PRGTED 
SNGSIGPTGARLTLVTiALTVILELT 


5170 


19071 


A 


5208 . 


401 


3 


FFFFFSETESHSVAQAGVQWHHLGSLQG 
PPPGF 


5171 


19072 


A 


5209 


402 


229 


GFFKKGPPFFFFFFFFFFFFFFFKMVWL 
RLS GS KVEW VWGS Q KL YMNFGC CRGRS W 
FT 


5172 


19073 


A 


5210 


198 




PPQAKILSSSSPPIRPPPKKGFFPKNPQ 
VGFYSAPHKEKTFTLPAPVKFG PPKDPL 
KRPPP 


5173 


19074 


A 


5211 


516 


222 


GSTLRLTQRSMP AS S S TMAPS SKRLLLP 
RPERSVPAAAGTAGHH3ASRNCGRGGAG 
ADEGPATKGDSHPKPGYCRAAHPSAAPW 
PPGPEKNFMRVG 


5174 


19075 


A 


5212 


103 


203 


DKVSLRCPGWPQTPGLKQSSCLSLPKCW 
DYKCE 


5175 


19076 


A 


5213 


409 


249 


RRNQVGGPRQLNPGKRI KSPGIDP YI YG 
QLIFDKAVRAIWWRKGSSFLQKALE 


5176 


19077 


A 


5214 


2' 


170 


ERKILGYIQLRKGPNWGPYGLLQPFAD 
AIKLFTKEPLKPATSPYPLQSPAFPLKP 


5177 


19078 


A 


5215 


270 


464 


GQLNKNVWGWDPGKVTFRDGVSLYCLHW 
NAVAIHGHRH S VPQ PQT PG FKQS CLRLP 
SSWDYMPLY 


5178 


19079 


A 


5216 


156 




PDKSYI ICVCVCVCVCVCVCLCVWVCME 
CP 


5179 


19080 


A 


5217 


412 


2 


PPPQIFFFSPPHFFSPPPKGGFFPPPPP 
KIFFFPPPLFFFWGFFPPFSPPPKKFFF 
PKSPPFFFFPPPKKKKFFFFPPLFFFPP 
PGFFLTPPPPFFFFFFFFFFFFFFFFFF 
FRMGCDRWHGEFWILRDGFD 


5180 


19081 


A 


5218 


165 


1 


QVKYHNKKTHNPIKIGKRSEQTFHEKDT 
QMANKYMKRS PTS LR I KG I Q I KT I MR Y 


5181 


19082 


A 


5219 


408 


88 


PSSSPKFFSPPPLGGVFPPFPFKIFFFP 
PGLFFFWGVFPLFSPPPKRVFFPNSPPG 
FFFPPLFGKKFFFSPPPFFWPPPGFFLS 
PPPLFFFFFFFFFFFFFFFF 


5182 


19083 


A 


5220 


3 


259 


NLLLLI VP IL IAMAFLMLTERKI LGYIQ 
LKKKKKKKKKKKKKKKKKKKRGGAFKKK 
PRGGQNFPGEKKKKFFFLREVKKKPRGN 
F 


5183 


19084 


A 


5221 


10 


479 


KLMPAESDGRHRERERERERERERERER 
ESS SARAR CVS LSLDRYKNRVYMKVAE A 
LCECRLLAYISQAPTQMFFLLRLINIIH 
AHTLTQENDMCLHTTLEI LTLARQRERV 
PLYLRVLQRMEHKKKHPGFVLNNSHNL.V 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


1 Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino ncid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, MMMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














RLWQQLYLHRDRDSIC 


5184 


19085 


A 


5222 


142 


56 


KFQPGAVAHTCNPSALGG WGG QIKRSGV 


5185 


19086 


A 


5223 


14 


471 


LEMRSEIDSQNYCNCDAGRNEWTSDTIV 
LSQKEHLPVTQIVMTDAGRPHSEAAYTli 
GPLLCRGDQSFWNSASFNTETSYLHFPA 
FHGELTADVCFFFKTTVSSGVFMENLGI 
TDF I R I ELRGKLATLDKSQGTH YLAI KA 
LTQKKKKKKTRG 


5186 


19087 


A 


5224 


480 


0 


FFFFFFFFFFFLIVAQFYFL 


5187 


19088 


A 


5225 


16 


422 


VRRTARIRHEGKPYECNACGKAFNRSAH 
LTEHQRTHTGEKPYVCKECGKTFSRSTH 
LTEHLKIHSCVKPYQCNECQKLFCYRTS 
L I RHQRTHTGEKP YQCNECG KSFSLSSA 
LTKHKRIHTRERPYQCKKKKKKK 


5188 


19089 


A 


5226 


99 


436 


GHPS FE I PATMTAAPAGFPPQVP WEDVR 
YLFDEIMYGGHI TDDGDCKLCRVYLEE F 
MNPSLLAALSGTHACGEKPRbPANSHVS 
EPSWKWILQPQSSLQMTAALANILTATS 


5189 


19090 


A 


5227 


473 


9 


FQITATPHLAVYDPTVQFEFWFSEKQIA 
DIRQVEASTRYLGTALYWI AAS INIKPG 
HDYYFYIRSVNTVGKSAFVEAVGRASDD 
AEGYLDFFKGKIAESHLGKELLEKVELT 
EDIASRLEEFSKEWKDASDKWNAMWAVK 
I EQTKVGERDVTSG 


5190 


19091 


A 


5228 


1 


473 


PPPIDRLPNCTACRNSARVMTDAGQPHS 
EADYTLGPLLCCGDKS FWNS AS FNTETS 
YLHFPAFHGELTADVCFFFKTTVSSGVF 
MSNLGI TDFIR I ELRGRLATLDKSQGTH 
YLAI KALTQKKKKKTRGGAGP PFPLIGS 
RITIHGPPFNNAAMREK 


5191 


19092 


A 


5229 


170 


25 


KTPLLGPGTVAHACNLSTLGDQSGCIMR 
SGVRDQLRQHDETPSLLKRI 


5192 


19093 


A 


5230 


465 


0 


WLYPPQKAQKKKKKKKKKKKKKKKKTK 
ARG 


5193 


19094 


A 


5231 


419 


56 


CVX.LRSTKKKKKKKKKKKKKKK 


5194 


19095 


A 


5232 


79 


451 


AGSNLQEHRGLRPESPFNPGGGGCREPR 
RRHCPPAWAKRAKLRLKKKKHGRNQEVA 
QGRHGPFFPGLGPEGFLPRRPGSRKRRK 
GRNVPG P ELKGPGAVL FSRHRRGGADLR 
PPRKGPRGGGGG 


5195 


19096 


A 


5233 


39 


467 


VQQQQRAQEQQQQHPVLHLQPQQIMQLQ 
QQQQRQISQQPYPQQPPHPFSQQQQQQQ 
QAHPHQFSQQQLQFPQQQLHPPQQLHRP 
QQQLQPFQQQHALQQQFHQLQQHQLQQQ 
QLAQLQQQHSLLQQQQQQQIQQQQLQRM 
HQQ 


5196 


19097 


A 


5234 


194 


45 


MILiKFFFNFKIHFFFFIjRQSIiALVAQAG 
VKWCNVGSQQPPPPGFKRNST 


5197 


19098 


A 


5235 


204 


475 


AS ITVHLW YVEKCYSQSTEKVTPMPSSS 
WFGPQPDAQLKHGLRPGAVAHACNPSTM 
GGRGGRITSSGDRAHP 


5198 


19099 


A 


5236 


59 


357 


NHWRKI YCKVYNRKKI KHNTTNTTKNYQ 
NFQRNRTKSSFTIFRIQSHYNKHIKKHI 
QRKRQS I ETDLEMTQMLELVWI FKWIV 
I LTKI KTPKTLKING 


5199 


19100 


A 


5237 


447 


166 


ElitAFWQTQYNNMNAVNLFFFWIKAPGI 
PTPSFASRRTHSPFFSFLFFLFFFLGKK 
GFPLCGPGGPKSLNLKTPPLGPPKGGK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 

od" 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 

acid 

peptide ° 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G~Olycine, H— Histidine, l=Isoleucine 
K=Lysine, L=Leucine, M=IWethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 

W=Tryptophan, Y=Tyrosine, 

nude^trdTd^'letio^^^Msi'ble ' > ° SS '' , ' e 
nucleotide insertion 


5200 


19101 


A 


5238 


34 


296 


LSVAE FLAFWQTQ YNNMNAVNLFFAW I K 
VPG I PTPSFASRRTHS PFFS FL.FFLFFF 
LGKNG FSPCGPEGPKFLiNLITPPLGPPK 
GWE 


5201 


19102 


A 


5239 


340 


472 


SLNETHVRSPAHHRPGATAHACSTPITLG 
GRDGL I MRSGDQDH PG 


5202 


19103 


A 


5240 


460 


0 


PKKKIFTIYFFYFFFFFFFFFFFFFL 


5203 


19104 


A 


5241 


161 


46 


sfysthrkpchwpgsvahacnpstlggr 

GGRITRSRDR 


5204 


19105 


A 


5242 


.175 


36 


LKYYLIFIYLLSFFFLRQSRSVGQAGVQ 
WRDLS S LQP PNS TALQ Y R 


5205 


19106 


A 


5243 


11 


442 


DT I RWGLPTLGS KSTTNEKKRE KRRKKK 
EQQQSEANELRNLAFKKI PQKSSHAVCN 
AQHDliPLSNPVQKDSREENWQEWRQRDE 
QLTSEMFEADLEKALLLSKLEYEEHKKE 
YEDAENTSTQSKVMNKKDKRKNHQGKDR 


5206 


19107 


A 


5244 


459 


333 


■ FLRVTQAGLKLLSSSDPPTLASPKCWDY 
RHEPLCPAQWSVS 


5207 


19108 


A 


5245 . 


229 


436 


FNS TWRKRGLGG F I HLNL I QTS FFAHL 
WS LTYVLiC VS F FVCLFLRQSHFVTQAG 
VQWHNLGSLQPLP 


5208 












VFFARWVFLRQDLDLSSRLECSDVIJjVH 
CNVRLAGSRGTPSSVSKVAGTTGMLYHT 
WVFFYEFHR 


5209 


19110 


A 


5247 


173 


442 


FLGSSSKAAITLYYCQYMEFFFFFFFFE 
KNFFFSPPGGGEGPNFILLEFPPPGLKK 
ISCPTPPRGWDYGVAPPPLVNFWFFKKN 
GVPPPW 


5210- 


1911 1 


A 


5248 


255 


358 


. TG PGTVAHTCDPSTSGGRGGQTTMSGDR 
DHPGQH ' 


5211 


19112 


A 


5249 


3 


130 


QPQLAAPS IS WAPTSASQVAGTTGVCHH 
AWLIFLFWVESRRG 


5212 


19113 


A 


5250 


90 


1 


STRLENNGTMSAQCNLRLTGSSNSPNSA 
SR 


5213 


19114 


A 


5251 


358 


447 


SFISYRRRSPSPYYSRYRSRSRSRSYTP 


5214 


19115 


A 


5252 


55 




CWNTSMRGP I WEAEAGESLEPGGRRS CG 
EARTCHC I PAWWRAKLVDAA 


5215 


19116 


A 


5253 


330 


3 


EEMGFSHVGQGGFKI LASGDTPAWAFQK 
GGISGVSPRAWAGFIIFIWNFSGGPQTG 
FPFYPFRGIFFPRAKINYIRVLNPKGIjC 
QKG VFF F F FSG F PH YRCRLCQ I S V 


5216 


19117 


A . 


5255 






mylrfplglslserpgmvahacnpctyw 
ESKAGGSIjSLGVQDQPREHGGTLCLQKS 


5217 


19118 


A 


5256 


263 


1 


RPRRRFNRDKTALLQEMPFRTFTALGKL 
ISDFKDSKDGLTLYFVETASRSWLAGL 
ELLTSSDPPTSASQSTGITGVSHFSQPG- 
LAF 


5218 


19119 


A 


5257 


1 


103 


CHVSQAGLELLASSDPPASASQSVGITG 
VSHRTW 


5219 


19120 - 


A 


5258 


128 


2 


IHKWRDCKMVQPLWKRAWQFLMKLNTHL 
SCDPTVLLLDILTQ 


5220 


19121 


A 


5259 


381 


2 


FFFFFSEMESPSVAQAGVQWCGLGSLQA 
PSPGFT 


5221 


19122 


A 


5260 


2 


173 


ENYLNMGGGGCSE PSLHHCS PALAIERD 
SVSTTTTTTKQQQQNNQLGLFSYPFNPL 

I • — - 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucinc, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Vnline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5222 


19123 


A 


5261 


1 


175 


I KGNQ I GKKE VKLFLFTDDM I LYMENPK 
DSAKNYYKQS KTFCLP WLGRS CKRTC P 
LA 


5223 


19124 


A 


5262 


2 


145 


QVSLKLLGSSD P PALTSQGAG I MGVSHR 
TRPRNTDFKNNTQKSKNTQ 


5224 


19125 


A 


5263 


182 


3 


STTSFLNFFLTEMVSHYVSQAGLKVLGS 
SDPPSSASHSAGITGMSHHTRPQQPLLN 
LKV 


5225 


19126 


A 


5264 


183 


369 


WNSLDFFFFFPSFFRGYSSMGKVQDAFI 
FYRRF I DKS KPRANTWGS IGGLYQQQ I H 
PMDALR 


5226 


19127 




5265 


329 


207 


RGCSELRSRHCTPVWVTEQDSVSQKRKK 
KSPCHLQLGTSQ 


5227 


19128 


A 


5266 


177 


13 


TLTILLNFFFLRQSYSVAQAGVQWHNLN 
SLKPPPPGFKRFSCLSLPEFQQGHNIR 


5228 


19129 


A 


5267 


474 


122 


FICSPHFGGFLTLGPKLNILGPSQPLFP 
IjRGGS KKQG I P FYS YKI KGPPGAG P PG S 
ARYSNPFGGYGGRVPLPPEFPDPPGEHK 
PRFFLKKKKKGHQVPTSKYLPEYLFAYY 
NASLE 


5229 


19130 


A 


5268 


55 


311 


GIQTFGKNVWAGRSKNVGMPIAMLLHT 
DGEHERPGGDATVT I AHRYTPKEQLKIH 
TQLADI I I VAAGSLS ISHAGVQWRNHGS 
L 


5230 


19131 


A 


5269 


238 


133 


NKNIWVQKKCVFYIYLPLICVCVCVCVC 
VCVCVCV 


5231 


19132 


A 


5270 


336 


517 


PVLY I LEHSPYNAS I YQLALKKYQSRPG 
AVAHACDPNTLGGRGGRITRSGDRDHPG 
QRGE 


5232 


19133 


A 


5271 


262 


467 


VLSMR PR IHGS AAR EEDEHP YELLLTAE 
TKKWLVDGKTKGTFPTTPGRTNS KG I F 
KVCDPEWKGKMS 


5233 


19134 


A 


5272 


213 


2 


MKI YINVY I LLFSLKKRQGPTLS PRLE C 
SGMIIVHRSLKLVGSSDPPALTSRIAWL 
IGARHTPDCSVCKF 


5234 


19135 


A 


5273 


284 


105 


YSCSYSYFDEPVELRNSSFFRWNHSSDS 
YWKKKS S KDTE P VLK P PG YS AR YE CKT V 
GSS 


5235 


19136 


A 


5274 


290 


449 


IFYFSGRVRAKLSAPLAGMGNAKADSRG 
RSRTKMVSQSQRMYCLSALLYLSLC 


5236 


19137 


A 


5275 


248 


423 


NKVNITLMTKLEKDTTKKENYRP ISLLN 
IDMKILSKILQNQIQQYIQS3 IHHDQVK 
LL 


5237 


19138 


A 


5276 


425 


30 


TRRGMPHRQGSPRKAPVSVRRQRVREEC 
GREPLLWFLQSQLLGRLRQETHLNWGGR 
GCSEPRSCHCIPAWATRANSIFCNFQAS 
SVEVRRSARKKLFSDILKRHNTITWRVS 
GLLLVDSYFGRLATPVRTQ 


5238 


19139 


A 


5277 


25 


185 


SLFS VAE FAFNAYKVQTE KEEKEEARS K 
YKEAKES FQRFLENHEKMTSTTRYK 


5239 


19140 


A 


5278 


263 


483 


VFTWFFQLFCVYKLIFS FLTECRSVAQ 
AAHAGAHWHNLGSLQPPLSGFKQFSCLS 
LPEFPRMYSVLSPSP 


5240 


19141 


A 


5279 


25 


252 


LETLGLMTLVQCGGIPGFCHVGQAGLKL 
LTSGDLPAS ASQGAG I AGMSHHTRP I LL 
HFIYFILLYHPPCLAYFIIF 


5241 


19142 


A 


5280 


326 


203 


IETESCYVAQDGLELLGSSNPPTSASQS 
AGITGVSHHTWTT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


1 SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nuclc- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, '—Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5242 


19143 


A 


5281 


210 


459 


NKNNKKFNEKKRKNYKKWARELYRHFSK 
DDIHVANRYMKRYPTSLIIRKIQIKTSM 
RYHLTPIRIATIPR 


5243 


19144 


A 


5282 


262 


454 


ULFTDYHEGMLHSVKIiLFCFLGWSFTLA 
IQAGGHWRDLGSLCPLLPRLKQSSCLNL 
LSSWDYKR 


5244 


19145 


A 


5283 


451 


303 


GTEFHHVGAGLELLGSSDLPVLASKSAG 
I IGMGHHAQPKQYS FVILTVQ 


5245 


19146 


A 


5284 


217 


489 


PVS I S 1 1 LRCLS L PTPG SR VLCMLEGMD 
GISLGWVPLPSLPVPLHPLDLSLPVCRQ 
VGGTKTG WRYVGETDFAKGEW CGVE LD 
KPLGKND 


5246 


19147 


A 


5285 


247 


99 


FFFFFFFFFLYGFFFCFFFFFFFFFFFF 
FF FLKNKTQP YF I YKRN I LSM 


5247 


19148 


A 


5286 


2 


1 17 


PRVRPRVRPRVRPRVRPRVRPRVRKKKK 
KKKKKKKKKKGGGF 


5248 


19149 


A 


5287 


121 


3 


KFWPGAVAHACNPS PLGGRGGW I TRSGD 
RDHPGQHSETP 


5249 


19150 


A 


5288 


4-16 


48 


IGKPGSFFPPPPSPRGAPPPPPKIFFLP 
PPPFFFRAGFPLFPPPPQIFFFPKIPPP 
FFFTPPLKKKFFFPPPPIFESPLiGLFFF 
PPPPSLFSFFFFFFFFFFFFFFFFFFFF 
FFFFFFLGVR 


5250 


19151 


A 


5289 


503 


0 


FFFFFFFFSPFFFFFSSSFFFFFFFFFF 


5251 


19152 


A 


5290 


70 


236 


IFCNSQLTS PH KHQ KKKKKK K KKKKKK K 
KKKKKKKKKKKKKKGGGDFKKKIGGAK 


5252 


19153 


A 


5291 


443 


287 


QFTKKKKKKKKRKKKKNKKKKTKQKKTI 
IPKPPLF 


5253 


19154 


A 


5292 


334 


115 


KNPNALFFFFFFVDRISLLPRQWHNLGS 
LQPLPPGFTQFSCLSLPNSWDYRCEPLR 
PALLFFSSKSKIHIKPL 


5254 


19155 


A 


5293 






SSSPPIFFPPQKKKIFSPPPPKKFFFPP 
KTFFFLRGFFQIFPPPKKNFFPKKPQNF 
FFYPPKKKKIFFFPPPKIFPPPKIFLKT 
PPPFFFFFFFFFFFFFFFFFFFFFFLVS 
LEKGFPDAWADAWADAW 


5255 


19156 


A 


5294 


291 




IQPYIKRLMQHDRVGIiSSEKQSWFRIYI 
YINVPGFQNKGFI I IGSSQ KAQKMI FFF 
LDRVLLCHPGRSTWQSQPTAALKNWAQ 
KTLPPQPSE 


5256 


19157 


A 


5295 


2 


401 


NPRSTEAAI KYFLTQATAS I ILLIAILF 
NNILSGQKKKKKKKKKKKKKKKKGGGPK 
KKKIGGGKKKPGGKKKI FFFIRGEKKNP 
RGE FEKKTFFWGGE KRANP PPKKKTLGE 
KKK I LGGGGGKKFFFFVGKKK 


5257 


19158 


A 


5296 


49 


363 


RLQKKNKEKICKKKKKXKKKKKKKKNKRK 
KKKKKKKRRNGGGGKIiLHKGGGQILTKE 
KILILVAVSYTCNINNLGRQRKWIIGGG 
KSEPNYEFKRNIGAGRGGSLL 


5258 


19159 


A 


5297 


100 


323 


EYLNTCVRICHSHAHFLPPSYVSFALLP 
RFLFKWIiliYMQVFGEGTEAVKKSLEGI 
FDDIVPDGKVKINVCFLF 


5259 


19160 


A 


5298 


595 


0 


VCVCVCMCACICIC 


5260 


19161 


A 


5299 


412 


141 


TVDTSYSEKKILFQILLLIDNGPGHLRA 
LMEMYKE INWSM PANTTS I MQLMNQGV 
ISIFKSSYLRNAFCKAIAAIHSDSFDES 
GQSNLL 


5261 


19162 


A 


5300 | 216 


2 . . . 


S QDWLR KFC YHPRVFQRQP I Q KLMGLF F 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proltne, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valinc, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FCLPNMVKNGVFFFFEMEGRSWQAEVQ 
WHDNSPLQPKPPGLK 


5262 


19163 


A 


5301 


368 


3 


IFFFPQGKEGGGLFYIFFPPKKKGFFFK 
KKIFFFYPPPKKKKKIIRAAEKRGPPLF 
FLKKTRPNFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFLVREK 
VFNSTISTQ 


5263 


19164 


A 


5302 


389 


2 


AMAQ AS GS CLCSRLL VDG F I PWGVFRYF 
IVEAGSCYVDQGGLAILASSYPTTLVSQ 
SAGMTGLSHRARPPLNGPPPGLTPSIiSM 
VPKANVFSLLEKPTGLTQALRITLANVS 
NAPGQTRSGLEGWGCFF 


5264 


19165 


A 


5303 


84 


333 


FHDCCRPERSRILRNFLLSSAGIQSIAD 
AYRLALPQGLLYGPTNFAP I INHVARFA 
AHAAHQGTASVRWGGSGSGGNQGLP 


5265 


19166 


A 


5304 


485 


93 


PPLFRRFSPPPPPKFFFSPRAFFFGGGF 
FSFFSPPKKSFFFKNPPGFFFFPPLKKK 
KIFFLPPLFLAPPKDSFLAPPPFFFFFF 
FFFFFFFFFFFFFFFFFFFFSRENIFFI 
IIIFIYLFRDRFLFCCLG 


5266 


19167 


A 


5305 


157 


2 


PPPPFFFFFFFFFVWWVITLNVELEPSF 
SPNTESQIGPEEAMERLQENRVE 


5267 


19168 


A 


5306 




133 


TQES IMDLENDKQQLDERLKKKKKRPAV 
LKNPWGGQSLPGMARE 


5268 


19169 


A 


5307 


239 


-3 


PPRNWGFFSPLSPLKSSSPPKAFNFWGG 
VGPNFPPPKKRFFSKNPPGGFFYPPLKE 
KNIFSLPPVNLGPPRVFFKGPPP 


5269 


19170 


A 


5308 


454 


116 


FRLPSSSNSPAPASRVAGISGFRCVGRA 
GLQLLTASDPPASASQGAGIADGVSFTQ 
CSMVPRLECSGM I S ACYNLHLP AACLGP 
PKCRDCSLCLAATPSGK 


5270 


19171 


A 


5309 


123 


1 


CSVSKWYPIAFLKNI IMLWEAEAGGSRG 
QE I KT I LANTVKP 


5271 


19172 


A 


5330 


153 


33 


WPG WAH TCN P P PLGGRS GL I TRS G VQP 
QPGQHSESPSI 


5272 


19173 


A 


5311 


350 


3 


NSRNLIL.TQEHSPEEKNEFLFSLPLQSV 
SMNTTHSPLLSNSLPHFIISFLPTRFLV 
GPSPTRSFPPAQSPDRISFSSRLECSGT 
ITAHRSLNLPGSSDPLISVSQVAGTTCM 
CHH 


5273 


19174 


A 


5312 


1 


392 


RTRGRTRG I CKS I T I I HH INRTNEKKNH 
MIISVDAESAPLQIHSAKLKK 


5274 


19175 


A 


5313 


1 


239 


LKTSFHHVGQAVLEFLASSDLPALASQS 
AGITCMSHCAQPYISINFIPAAETQQSS 
PELVPLPASQKTPSFSTPLFSLP 


5275 


19176 


A 


5314 


345 


237 


LYFYIYFLKSACVI ILSTLCVCVCVCVC 
VCVCLCM 


5276 


19177 


A 


5315 


268 


412 


GMI S SNNSSDS FGNCLNPGGKGCCEPGS 
YHYPPAWATDTFSKKKKKKL 


5277 


19178 


A 


5316 


112 


224 


PWMLENELGLHASYIiAMSTPLSPVEIEC 
ASKKIFTFC 


5278 


19179 


A 


5317 


377 


139 


KPPPRIFYIjGPQKKKKIiFYPPPFKNCFF 
IjGPPPPPIiFMIFFFFFFFFFFFFFFFFF 
PQWVMFLFFIoLSASNQNSPSWSP 


5279 


19180 


A 


5318 


216 


385 


RGVLLCCPAWSQTPGLKGSSCLSLPKCW 
DYKHELPRPATS CNS IL.FFNLQGAFMYH 


5280 


19181 


A 


5319 


376 


3 


RlDFGGPKKKILLPPPPAVKIVSliKGPP 
LFFFFLNS FFAPAGGQWGVFQL ISSGDP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
' sequence 


M 
eth 
od 


SEQ ID 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

(ast amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PPPAPESVRIPGVSFCAGVPQGFFFFFL 
NSSVAQAGVQWRDLGSLQPLPPRSKRVS 
CLSHPSSWDYSH 


5281 


19182 


A 


5320 


377 


287 


ELLTSSDLPALASQSAGITSMSHHTQPG 
SF 


5282 


19183 


A 


5321 


381 


65 


SSSPSSRTSGPFFFFSPPKKKKNFQKKK 
NFLYPPFFFPPPFYKNPPKKKLKGPKKK 
KKISPPPPKKNLSLKNPPPFFFFFFFFF 
FFFFF 


5283 


19184 


A 


5322 


132 




EGGFVLKIIREGIGPHFLGIiEEEGSKFH 
EHIFWEKHRESFPKQGPIPHFMELGTWG 
LSKNPYFRVKQRVEHIEGFKNFFNEKKD 
FLKE 


5284 


19185 


A 


5323 


366 


1 


GFLKKGVPFFPPGKRGGGNQTPPKPFVL 
GGKQPPPPPPLKKKNPKGGPPQQKKIFF 
FLTRGGVSLFGPGGGFPFFFQPPKKPPP 
GGFPFFFLKKPPPPPLFFFFFFFFETGF 
CSCR PGRTRG 


5285 


19186 


A 


5324 


131 




DGALSPRLEWCSGTIIiAHCSliRLiPGSSA 
SCVSASRVAGITGVC 


5286 


19187 


A 


5325 


1 12 




DGI LL1L1L1PRLESNGAISAHCNI1HLPSTS 
ESTASAFLS 


5287 


19188 


A - 


5326 


404 


18 


PVS PHGKNHE KHTVTCVGSTSGAFLHGE 
EHCHGNTHHPFEPSNPQTACQSQANR3R 
HGNSFKNLWRSWAWWRTPWSATQDAEM 
GRPLEPRGLS LAWATQQDPS LKQKQKQK 
KPTHLWSYILGHHQLP 


5288 


19189 


A 


5327 


109 


3 


KCVLTP VIRALWEAETGGSRGQDIET I L 
SDTVKPR 


5289 


19190 


A 


5328 


1 


117 


ETGFCCVAQSGLQLLDSSNLLALVSQSA 
GVTGMSHRSRP 


5290 


19191 


A 


5329 


3 


120 


DAWVAAEAEGQNDT I EEPNKVQKRKRG I 
NDNVPAGQAH 


5291 


19192 


A 


5330 


105 




RSGWAHICNPSTLGGQGRWITRSGVQD 
QSVQDG 


5292 


19193 


A 


5331 


204 


41 1 


S KKKQKKKKKKKKKKKKKKKKKQDKKG 


5293 


19194 


A 


5332 


185 


2 


FFFETESHSITQAGVQWRDLSSLQPLPP 
GFKSLPSSWDYRCLPPCPANFCIFSRDR 
VSLCW 


5294 


19195 


A 


5333 


561 


325 


WSTLHSFPIWLLNTGYYFKELTGFFFFL 
TVALSPRLECSDTI IAHYSIjRLLGSRDP 
PTLVFQSTKITSVSHCTQPSVRF 


5295 


19196 


A 


5334 


3 


96 


EGKAAFSQEKSRRVKEENPEIAVSASTI 
PE 


5296 


19197 


A 


5335 


375 


228 


LSPWLECSGTILAHCNLRPLGSSNSPGV 
FGFYFFLTLILGSGVHVQLC 


5297 


19198 


A 


5336 


403 


o 


NISVPASKHISPFPKKKKKKKKKKKKKK 
KKSQKKKSRG 


5298 


19199 


A 


5337 


397 


80 


HITS KKKKKKKKKKKKKKKKK 


5299 


19200 


A 


5338 


406 


34 


KRTLKDSKKKRKEKKKKKRKKKKKKKNR 
EKKKKPKKKKKKKKRGGAQKKKNNHPAG 
GKAYFFFLGGTKKKRGGGDTKPLGEKNS 
PAQTKKLGGNNPPFVERGKKKKTPGI 


5300 


19201 


A 


5339 


410 


0 


KKQKKKKKKDAPGGGGGGRKKKGGPQKK 
KKGSHIiKTPKKEKTGGA 


5301 


19202 


A 


5340 


306 


392 


I QYKI LPQKENDWDKIQGQLAI PVSSLN 
F 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

ottde 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleurine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutnmine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5302 


19203 


A 


5341 


3 


173 


TFTATELI I FYI FFETTLIPTLAI ITRG 
GNQPERLNAGTYFLFYTLVGEGSLLARS 
R 


5303 


19204 


A 


5342 


247 


426 


GRAWWFTPVITPLWEAEAGVQWRDLSSL 
QPLLPGFKRFSCLRLPNNWDYRHVPPRP 
ANFL 


5304 


19205 


A 


5343 


3 


149 


SWFLHFGEAGLELLTSGDPPALAI.PKCW 
DYRLVSNSWPEVILQPQPPK 


5305 


19206 


A 


5344 


464 


147 


LRRLRWENRLRPGGGGCSEPRSHRCTLA 
RVTESLSRVQCQRDPRRIAQTSRRPGAS 
S FPFAGPAFSAVNSTYQRALPPAQGAPW 
. TPRLPLG PHGTTS S TS S YL P V 


5306 


19207 


A 


5345 


228 


1 


VIRLEETYFLLFFVLFLFLFFRQTRSGS 
FAQAGVQSSHLSLPSSWDHRYTPHLANF 
CI FLQKQSLAMLPRLVSNTK 


5307 


19208 


A 


5346 


2 


106 


FVLELLGSNNSPS SAS ES AG I TGVSHCG 
QPLiFNF 


5308 


19209 


A 


5347 


140 


436 


FLCDMRVLYVTFFFFGKGVSFFPPGWRA 
GGEFGL I ENP PPG FNP FFLLNL P KKGEL 
RAPPPAPIFFFFFFEKRGFTRLTQEGLN 
FWTWEPPPLTLPRGG 


5309 


19210 


A 


5348 


1 


465 


LESRRFEDGTRVPGSTISWDPPEEAVCP 
FSDLQLRAGRTTTL FKAVRQGHSS LQR I 
LLPFVWQCPAPRGGVYRGRQASLSCSGL 
HPVRASWLLCFPSEAWAMSGTPPPASLP 
PCNLISDCCASNEQGSVGIGPSEPGVEY 
NLLrLRHLiLKPEEKRS 


5310 


19211 


A 


5349 


183 


58 


ERDYEEGPGAVACACIPSTLGGRGGWIT 
RSGDRDHPDEIVDP 


5311 


19212 


A 


5350 


288 


404 


I EXNGM I IoAHCN FQ1>PGSSNS PASAS PV 
AVITGMNHYAR 


5312 


19213. 


A 


5351 


413 


305 


AEAGLKPLS S TYLRAS ASQS AG I TGVSH 
HTQPLLFF 


5313 


19214 


A 


5352 


91 


356 


TVPKRNTFSPKMVAITTAGTVSTSVIiAL 
GNRESALPTRLGWALCCFIFWGGRGHRV 
PLCHPGWSTVARSRLTATSTSQVQAILL 
PQPP 


5314 


19215 


A 


5353 


2 


410 


FVQWQFLKTGSDRSHKAVESLGYEKVEK 
AMTVNYS FRS LNTKGKLELLG K I SVNLY 
AKWKHFREKGRRYTNTLGPTATAAFYST 
EKTEFLPWEQTCKPFIDSFNKYLIiPGA 
ADHACNPSTLGGRGGKITRSGDRD 


5315 


19216 


A 


5354 


3 


288 


SLDDDLLKLLLPLMLQYSDEFVQSAYLS 
RRLAYFCARRLSLLLSDS PNLLAAHS PH 
MM IGPKNS S IGAPS PGP PGPGMS PVQLA 
FSDFLSCAQH 


5316 


19217 


A 


5355 


187 


3 


KKI LWQTPG VNLFS FKGAPLFFFFLRR V 
L.LCCPGWSAVARSPLTATSASRVQAI LL 
PQPPK 


5317 


19218 


A 


5356 


383 


261 


GYKINGQKSQAFLYTNNRQTESRLWYCY 
DSPQRLIHIjSIK 


5318 


19219 


A 


5357 


367 


241 


F FFRHVAQAGLQLLGSNNP PAS PSQS VR 
VTGVSHCTQ PLCFV 


5319 


19220 


A 


5358 


217 


1 


SCDRENTGLKLLSILLL.IHCETDTRISL 
LCISIiSIKREIWLGAVARACNPSTLGGR 
GGRITRSEDRDPGETP 


5320 


19221 


A 


5359 


255 


412 


TQIiCRKRNAFFFLATNHSQKGQSRWGTV 
AHACNPS TJJGGRGGR I TRSGDGDH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


'M 
cth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


1 Predict- 
ed end 

otide 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=IIistidine, I=Isoleucine, 
K=Lysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5321 


L9222 


A 


5360 


391 


271 


RS CH I AQ AGLKL LAS S NPP AS AS QNA E I 
TGVSHHLTPICC 


5322 


19223 


A 


5361 


232 


397 


EGLKEPAGLFTMEYYTARKRNKIMPFW 
TWMQLEAIILGESTQEQKTKFCMFSPV 


5323 


19224 


A 


5362 


213 


392 


LFIKVEGKFFLRS INI LNESLLFLSSNG 
. PGLLKKLGVWPGWAHACKP S TLRGQGG 
RTA 


5324 


19225 


A 


5363 


306 


418 


SDGITGTLPLASQESAVVEDLLYVLVGV 
DGRYVSAQP 


5325 


19226 


A 


5364 


347 


48 


KKRWLiKPGGGKNPPKKGGKKGGTPPFFQ 
KKGFFFPPFFFFLEKRDLSPQAGGKGEK 
KKPPQPWKAEGKKNPWGNPPQRGGYKGN 
PTRPPPQGFFFFFFL 


5326 


19227 


A 


5365 


412 


172 


PDSNLSFGAHYQAS LLGS FSLMP LLLYS 
KCSGQWDVARSAECTSKSSQNKERKKER 
KKERKKERKKERKKTRKQKKNPLT 


5327 


19228 


A 


5366 


400 


296 


IjVEGGFHHVGRAGLEIjPTSSDPPTSASQ 
NLGLQA 


5328 


19229 


A 


5367 


162 


1 


apssldetptllkktlfgpglvagicnp 
stlrgrggritrsgvrdqpdqhgetp 


5329 


19230 


A 


5368 


152 


403 


TTVLCKLVQPLWKTVWRFLKELEVDLPF 
NPAVPLLGIYPTEKKSYEKGTCTHVFIA 
AQFQIARIWNQSKCPPINEQKKKKGAPV 


5330 


19231 


A 


5369 


170 


367 


IVFEILVISYFKSFPQKTTEWNTFFFHP 
FKVFNWPSTVAHACNPSTLGGRGRQITR 
SGDRDHPAR 


5331 


19232 


A 


5370 


396 


1 


QMCVRAHTFAGGRARTHTHTHTHTHTHS 
WATPRVSHNHCVCVCVCVCVC 


5332 


19233 


A • 


5371 


139 


3 


PKLECGAS r I AHCSLELLiGS KD P P PS AF 
QVAETTGCAPPCLAFr F 


5333 


19234 


A 


5372 


1 


341 


PLINILSHIiILLCRNYFSFVKFCIRHTS 
LL YWI S L FHVLL CWS FLTDRRAHN AS 
VT YG FVL P VTFKNS PL PG AVAHACNPS T 
LGGRGGQIMRSGVQDQ PGQARS S RPAL P 
T 


5334 


19235 


A 


5373 


280 


449 


ICHNLQLCPSKDTIEKVKRQPIEWEKIF 
ANHISHKGIiSS 


5335 


19236 


A 


5374 


170 


1 


QKRGFSMGTPMVSIFRPQDLTPLGFPKA 
WDPRVNPWGPWPCPLFFKGNPQLFQAPG 


5336 


19237 


A 


5375 


462 


165 


PPTRFLHWQTGIiELPTSGDLPASASQG 
AGITGVSHHAQPRGHVFDVLLEYSQQGV 
FTLLSFLGFSVFFEPLFC 


5337 


19238 


A 


5376 


3 


451 


P RAKLGTRRGLRNLD T I G VAVD L I L LFR 
ELRVEQESLLTAFPCGLQFLRNIASRHE 
DSQSIGWVHAFPPPWPCLSLPPPHTGS 
SSSITLPPLLPHRPHPPPHSHPPMATLS 
RPPYPPPPHPPPHPLPPSLjSPPALSPPP 
PPPPPPPSL 


5338 


19239 


A 


5377 


460 


300 


KVLLCCPRLECRGAI MAHHSLELLGSSN 
PPTSASQSAGVTGVSHCRSPWEFL 


5339 


19240 


A 


5378 


452 


191 


DSRFHHWQADLiEHPTSSDLHASASQSA 
VITGVSHRTRPDIAFAPSISSYFSCVAE 
ISLLHL,SESFPKQFL 


5340 


19241 


A 


5379 


240 


395 


LATSSVANLRNTFYLKLKSSVLGVEAHS 
CN P S TLGGQGVR I T KSGVRD QPGQ 


5341 


19242 


A 


5380 


210 


411 


TSVNSAPNSGQKASVEECFGQVTKVKIQ 
NVFLGWAQWLTPVI PALVJE AGAGGS RDQ 
EIDTILANTVR 



870 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5342 


J 9243 


A 


5381 


406 


300 


FHYVGQPGLELI/TLGSTCLGIiSKCWDYR 
REPPHQA 








5382 


364 


479 


INDGISKISLFLSHLKDQAGQVWWIjTPV 
IPAL.WEAEAG 


5344 


19245 


A 


5383 


182 


395 


HHLRGSAASFLKSAQEPTDFRHTGFKRF 
SCLSLPSSWDYRCPPPRRGGGVLCLPGW 
RTWQSWLTTTSISQ 


5345 


19246 


A 


5384 


37 


478 


GGIHSYPVICLFILMGSSVSLPFSSVAC 
LLAWLLPSFLPSLLPSFLPSFLPSLTGR 
AFIi 


5346 


19247 


A 


5385 


201 


458 


WNP P FLSPRKW AARADTE WGVGKTAQE P 
GLSSRLQAYEERQRHWQREREALREDCA 
AYAQCAQRAQQLLQLQVFQLQQEKRQLQ 
DD 


5347 


19248 


A 


5386 


206 


407 


FFSPGGRAGEKFGFMEPPPPGFKKTLWL 
NPPGKWKQRAPPP I PVYFLEFKKKGGWP 
NMVQVGLNPRP 


5348 


19249 


A 


5387 


468 


0 


NFFPPRPAPPFFFFFFFVAQAGVQWRVL 
GSLQAPPPGFTPFSCLSLPSSWDYRRPA 
RVSPHC 


5349 


19250 




5388 


136 


2 


LFFFFLiFETGFCSVPRIiECSGTIIAHCS 
L.KLPGSCNSPTSVSQV 


5350- 


19251 




5389 


397 


0 


ILYPSQKKKKKKKKKKKKKKKKKKSQ 


5351 


19252 


A 


5390 


299 


3 


AQPLAPFPGSFILMYFFMLLLFGIPLLY 
MEV I MGQWLHVDNI RVWKQLVP WLCSMS 
YAHSLVNEGPS PRPSP I SLCGLQVS AHR 
VHPQVCASVSLYNS 


5352 


19253 


A 


5391 


264 


202 


FFFFFFFFFFFFFFFFSENKI 


5353 


19254 


A 


5392 


2 


150 


PRVRFL I ETE FCHVGQAGLKLLTSVDP P 
TSASQSAGITGVCHLVRPEKS 


5354 


19255 


A 


5393 


382 


0 


NSRAQAWWLMPVISPHWEAEVGGSLEPT 
S 


5355 


19256 


A 


5394 


395 


239 


SLPS SWDYRHAPQCWANFI FLVETGFS S 
VIL.CSFRLGGFGLSLNILCSLKSL 


5356 


19257 


A 


5395 


24 


265 


LLPHRKS RNYRTRP TRS LRQKNGMNLGG 
GACSESRSHHCTPVWATERDSVSKKKLF 
TIELVYVFLKKNSKGEPLPKVliGIi 


5357 


19258 


A 


5396 


174 


118 


FFFFFFFFFFFFFFFFRSS 


5358 


19259 


A 


5397 


406 


134 


PSAPSSPPPPPLIWGPPGFFPPPLFKNP " 

PPKFFFGAPKKKIYLPPPRALNFVLLKG 

PPPFFFFFFFFFSFF 


5359 


19260 


A 


5398 


294 


64 


I FFFFFFFFFFFFFFFFWELHI L WGALF 
LLKSR I PPGKKKWTCKIFLCTPLTGYI P 
LLKCFQKQNKKSQSQNTELKS 


5360 


19261 


A 


5399 


1 


211 


LLLPTPYHHINQDSSNKYHETKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKRGEKKKK 
KKKKGGGGGYKKKR 


5361 


19262 


A 


5400 


305 


1 


SPPRPPFLLPFLFFLPSFFFKAPHTINH 
FFSPPPLFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFLKLYLFYKMHRINYTSNI 
LKINI FVFSS PGSGGRR 


5362 


19263 


A 


5401 


416 


3 


ENTE RDRGRQRREDRSRAETQ E TNRAG Q 
QEST P PPQKKCNTYS IPSSSSSSPTPQK 
GKKGGPPGGPPPPAGEKGGPKKKKRGPQ 
KKKIRPPPPGKKKPPPKTPPLFFFFFFF 
FFFFFHKPQTFIVGVNTYILLRSVS 


5363 


19264 


A 


5402 


154 


416 


KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Liysine, L< = l>eucine, M = IVIethionine, 
N=Asparagine. P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKKKKDSGGGAF 


5364 


19265 


A 


5403 


327 


2 


PPPQKFFLGGVFFFPPFFYNPPPPFFFF 
SPKKKKYFSPPRGKIFFFFFPPPPPFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
LR W S LALVAQAG VW WLNHG S LQS L 


5365 


19266 


A 


5404 


372 


181 


YKGLAHTRCSIRNFLNFILI IEMGSRFV 
AQAGLEPLGSSDPPTSASQSDGITGVSR 
RTRPINIC 


5366 


19267 


A 


5405 


110 


1 


PVPVSSTEVKKLVXiWPGTWHACNPKTS 
GGQGGRIT 


5367 


19268 


A 


5406 


2 


401 


GFHLTTYGNRKQMQRHVREASARPLQGT 
P P PGHALE WT LENVIjLAAPR PRKAQ VXi F 
AIVASETSSWDREKLWTLSLEAKCKGIT 
LFVLALGPGVGTHELAEIiAELVSAPSEQ 
HLLRLQGVSKPEVNYAQGFTR 


5368 


19269 


A 


5407 


3 


181 


QLILFHDKLSPKX.LSWGNVDILINFFLS 
HCSPERARRLQAKMMTNLVMAKDRLQLL 
GIN 


5369 


19270 


A 


5408 


371 


222 


SVEMGFHHVAQPSLEPLGSSDPPASASQ 
SAEITGVCHCARPNHDSYRRS 


.5370 


19271 


A 


5409 


204 


416 


NQVG E WVLRKG TLGDL IMS MLFHFLWR 
SKDEKNTKIWPGAVAHTCNPSTLGGRGG 
UITRSGVRDQPGRHG 


5371 


19272 


A 


5410 


236 


308 


AGR PGTVTHTCN P S TLGGQGGR I T 


5372 


19273 


A 


5411 


375 


216 


GCSEPRWCHCTTPAWVTVQDSPPKKEKK 
RKEKKKKLLTTENS KKALKRNGYi, 


5373 


19274 


A 


5412 


1 


381 


RFHHVAQAGLNLLGSNDLSAS AS QSAG I 
TGMSPRTQP 


5374 


19275 


A 


5413 


144 


3 


RPPPI FFFIAKKLRALFFFFHS VIQAG V 
QWRNLGSLQPLPPGFKQF 


5375 


19276 


A 


5414 






LiFS KKKKKKKKJCKKKXKKKKKKK 


5376 


19277 


A 


5415 


414 


2 


NNNS FVFFSKPIFLKTVGGAPPPSPIYN 
PIPIFLYIYPPH KWGGVY I PTHM YGGGD 
IWGYMYTPPPPPLFIYFFPPPPFYIEKK 
KKTVFCFFAQGGVRLCLKKKKNKDRGWG 
DTMTMGHGGRLEMVANVCGLKRYW 


5377 


19278 


A 


5416 


1 


240 


GGEGCSEPRSCHCTPAWATERDSISKKK 
KKKGGPFKRTHFSSPGLPSFFFFFGAPK 
FNSRARFLTPREGKNPGLPPFYPA 


5378 


19279 


A 


5417 


2 


63 


LCLKNNNNNNNNNNNNNNND 


5379 


19280 


A 








WPLCFSRCSKKRGSPLiGFPFSPPLGQSK 
KFVRFACKTPSAFKPFPSLSFFQRWAGP 
PPFLQGLGELPKGLFFQQCFFFFFFLRW 
SLTLVAQAGVQWHDPGSLQPPPL 


5380 


19281 


A 


5419 


1 


392 


FRMAGSYPEGAPAI LADKRQQFGSRFLS 
DPARVFHHNAWDNVEWSEEQAAAAER KV 
QENS IQRVCQEKQVDYEINAHKYWNDFY 
KIHENGFFKDRHWLFTEFPEIiAPSQNQN 
HLKDWFLENKSEAPSQKK 


5381 


19282 


A 


5420 


273 


91 


RWYIVLDNLIFILENICCETSLKIYLGP 
GWARACNLSTLGGQGGQITRSGVRDQQ 
GQHGE 


5382 


19283 


A 


5421 


1 


149 


GGGGCSEPRLRRCTPAWATERDSVSKKK 
NLKKFFCPGLGLYQKQNTWTIi 


5383 


19284 


A ■ 


5422 


381 


179 


SSCLGLPKSWDYRCEPLCPARTALNIjKT 
I KKSKDNRRWRGCGEKEMLMHCWWECDM 
VWLCVPTQISC 



872 



ISDOCID: ^WO 016483SA2_L> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucrne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Srop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


5384 


19285 


A 


5423 


195 


369 


CSLSFIKECKLHISGLQDSSISLVNLGL 
AIRNIRNKPGWAHSCNPSTLGGQAGWI 
T 


5385 


19286 


A 


5424 


253 


414 


KKKKKKKKKKKKKKKKKXKKKKXKXKKK 
KKKKKXXRGGAAKKKPHSPAPAERDL 


5386 


19287 


A 


5425 


160 


2 


NLKGSAPFSKKTSPFFLVKENPLFFFFF 
EMESLSVAQAGVQWRDLGSLQPLP 


5387 


19288 


A 


5426 


53 


426 


ASSSFFFFFFLGKGGSIFPPKGGGEGKN 
FFLWKPGAPGGGGPPPPTPRGGGKKGPP 
P PGQKI FLGMFFE KKR FHHGGGGGFKLR 
GKKKPPPIAPKRVKNKGGNPGGGQPLKF 
KKTLPEKMGLKK 


5388 


19289 


A 


5427 


60 


1 


KWPGTVAHACNPSTLGGRGG 


5389 


19290 


A 


5428 


87 


247 


KKKNPQKKKKKKKKKKKKKKKKKNKKKA 
PAGAGDLS LY Y PRAGRNNLF FNYLP 


5390 


19291 


A 


5429 


209 


405 


KUNLVWFSSLSSLKAMISAILLKLSVPI 
FPMKIVILSTLNSGSWPGAVAKACDPSA 
LGGRGGRIT 


5391 


19292 


A 


5430 


324 


237 


VMNAFVC VCVC VCS C I CVCVCVCVYFLF 


5392 


19293 


A 


5431 


409 


3 


FFFFFSETESRSVTQAGVQWRDL.GS 


5393 


19294 


A 


5432 


270 


402 


MKI I FHLIRITGWAQWLTPVTPALWEAE 
TGGSRGQE FETTVKP C 


5394 


19295 


A 


5433 


107 


277 


YNFNNHRYREAIQKWDEALQL.TPNDATL, 
YEMKSQVIIMKNFTFLMKLCYVCSVSEQ 


5395 


19296 


A 


5434 


95 


1 


RNRKNWGQS GT VAHACNPS TLGGRDGW I 
TRS 


5396 


19297 


A 


5435 


412 


105 


GGEGCSELRSCHCPPAWATRARLHLKIK 
KKERWDLAML.LRVGYSGHRQVHCTDHGT 
LQPQPF 


5397 


19298 


A 


5436 


276 


488 


PLPQPNKDS RLL PLLVCLG FLFAPLLML 
GHGPKRARLPIFFPQEAYFITFRLVFAE 
KKGYRGPLTMCLAPR 


5398 


19299 


A 


5437 


179 


2 


GISPLKGGTPSLKKKKKAKDFURRVSKE 
DIQFTNKHMKRCITLLIIKQMQIKITMR 
YLP 


5399 


19300 


A 


5438 


3 


156 


TPLYSQIIjI SLRQEHYFKAEGRGCNELT 
SYHCTPAWAKDGDSVSKNKQNT 


5400 


19301 


A 


5439 


47 


173 


S LQLFLQTE ERKDS DD E KSDRNRP WWRK 
RFVSAMPKGNFIKY 


5401 


19302 


A 


5440 


3 


122 


TTLTLTSL.I PPILTTLVNPNKKNPPKLI 
RILVLFQTTEK 


5402 


19303 


A 


5441 


390 


489 


KYKSYSLQPPPPKFKHFSCVTLPKHWDY 
RREPP 


5403 


19304 


A 


5442 


86 


275 


RQEFETSQQGETPCLLKIQKLAGRAHLN 
PGGRGCSEPRLCDCTAAWVTRAGLSQIK 
KIKKKKK 


5404 


19305 


A 


5443 


424 


166 


CF FVS PKAKS QG WLGS P P PP P PGLKH F 
SAPTPRGLFLFFFFPRGVPPLGKGGSKS 
PPPGVFPPHPPQKGGDSGGYPPPRAKIF 
LF 


5405 


19306 


A 


5444 


356 


3 


AFFWFTNPFP PGFKGFFCPHPLSG I S KG 
GPPPRGNFFFFGKAHFFSPFFKFLPPIF 
LPFLAPQKVGFPGFGPPPKPSQGVFLFF 
FFFFFSFLFFFFFETESSSVAQAGVQWR 
DPDAW 


5406 


19307 


A 


5445 


198 


86 


PWPLGAQAILLPQSSCLNLPKCHDYRRE 
PPHTALLNS 
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SEQ ID 
NO: of 
nucleotide 
sequence' 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of ' 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N = Asparagine, P = Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \= possible 


5407 


19308 


A 


5446 


198 


86 


PWPLGAQAILLPQSSCLNLPKCWDYRRE 
PPHTALL.NS 


5408 


19309 


A 


5447 


143 


3 


TMIYFYFYFFLSQSLAIiLPSLEGNGTIS 
AHCNLRLPGSSDSSADAW 


5409 


19310 


A 


5448 


324 




KKNSATSFFSVEPFSTGGVLANKGALTQ 
EKI LPCFKI KKI RKRVGAVAPARNHFLF 
FSPFFKEGLQG I VYFFFFFFELESRS VT 
QAGVQWRNLCSPQPLPPGFKRTRG 


5410 


19311 


A 


5449 


403 


0 


LFFFFFFFFFFFFPKENH 


5411 


19312 


A 


5450 


114 


409 


LNIWTLKYLISKHIjLIVCLPDTILSSLC 
LIKYIGLQGKVLIFFFKQFSAIDKLGQN 
I A WGK FGFAH YS LLT KKW KL FGN I TQ E 
QNMIVTGGLSWWND 


5412 


19313 


A 


5451 


192 


406 


RNFFLGHLKYVKRFEQIGWGRDFMFKTP 
KARATKNKMDKGDLIKLKSFCPAKETTF 
RGTGHPTKGEKI FAT 














NAFG WL CVH YAL Q F I D KS V KN S VC WLiG A 
VAHACNPSTLGGQGGWITRSGVRHQPGQ 
YGETLRTRASTR 














PLLFILIPSPFSFPHPFCGP I KF I LPRE 
KKKKKKKKKKKKKKKKKKGGGPFKKKKI 
FPRGGKKKIFFFWAQKKKIRGGVLKTGG 
RKKPGGKKNKSG 


5415 


19316 


A 


5454 


285 


415 


TSGCCFVFLVLKLS I FCKGKVLNAIEDN 
GLKNSTFTYFTSDHG 


5416 


19317 


A 


5455 


372 


125 


ITPAFSFSPMGKKGVFPPPKFFGFPPGF 
PPPPFFKPPPGI IFLGPKKKKFFSPPPR 
VYFFFFKRPPPFFFFFFFFFFFFFFF 


5417 


19318 


A 


5456 


3 


159 


AHASAHAYDTIKDALGLLLFILSLMTLT 
LFSPDLLGDQKKKKKKKKKRGGP , 


5418 


19319 


A 


5457 


109 


392 


N 1 1 1 1 KRSTS LFLRP I FPPHLFP FILYL 
FISLRTLLFFFFFKREFCFLPQVGGQGP 
FLNSLKPLPRGLMQFSCLTLLRSGNYGP 
PPPPPINFFF 


5419 


19320 


A 


5458 


243 


402 


FLKIGFIHKYLKEFVWGILNIVFNRFNS 
HERAPHAI VFVNGTT I EGHVEKC YW 


5420 


19321 


A 


,5459 


394. 


105 


IPPPSPRPGGPGAAPPPGGGGSTTPPPK. 
KKPPFFPKNKNQPGGGGGAPPSPPPRGL 
GPKIPLTPGTKVFKETKPPPFPPPRGAP 
PKFLSKKKKKET 


5421 


19322 


A 


5460 


56 


37? 


VCFSPSIYGAWLNKSPLRSHGSSSTSG 
WGGLPS PFPAQRLPLASQVQEHNGHVFA 
SYQVSIPQSCEQCLSYIWLMDKALLCSG 
EWIiPHQAPKPSMSPTTPHCSRGLQ 


5422 


19323 


A 


5461 


184 




FFFFFLNNNLFFFFFFFFFFFFFFFFLG 
RN • 


5423 


19324 


A 


5462 


295 


384 


LGAVAQARNP ITLGGRGGW I TRS GVQDQ 
PG ' 


5424 


19325 


A 


5463 


413 


257 


FLNPGGRGCSEPKSHKSTLAWVTERDLI 
S K5CKKKE KC W IPQRMPERGQ WS PD 


5425 


19326 


A 


5464 


2 


181 


SLYCSGWSQTPGLKQSSCLSFPKCWDYR 
CELPRPVYVLLYKTS FPKRYVFG I HLCF 
MSQ 


5426 


19327 


A 


5465 


1 


229 


TG5 CYVAQANIiEL JCS S S D ASTS VS QKAC- 
TTGVSHHVQPYTSSFSQVPPLRAQSAAT 
ETTSILWVQKLRLQWRCSV 


5427 


19328 


A 


5466 


.1. 


.368 


KERETKKEREEERRMRRRREGRKEKGRK 
ERKKEREKERKKREEKKRNNLGQSAGIT 
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M 
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NO: in 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=A)anine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














GMSHHTRPKSHNVLSKFTNLCWAAFKAI 
LGHMWGRLDKLNVEQCFSTELSALFS I I 
ATSHMRLWVI 


5428 


19329 


A 


5467 


183 


3 


VSLCYGEKFS PQMNESQLYFRPGAVAHA 
CNPSTLGGRGRQITRSGDRNHPGQHGET 
GRV 


5429 


19330 


A 


5468 


1 


128 


GGAGYSELRLCHCTRAWATRAKLHLKKK 
KKKGPLRALKKIGG 


5430 


19331 


A 


5469 


1 


155 


NKFPVLISNVINI IGMCHCTPAWATERD 
S VS KRKKKE KKRS KLARHS VL CL 


5431 


19332 


A 


5470 


113 


1 


FFFFFFQKTSSRLGTVAHACNPSTLGGR 
GGRITRSGD 


5432 


19333 


A 


5471 


407 


281 


NGMIPGGRACSELRSRHCTPAWATEQDS 
ISKKQNKSKNPGLK 


5433 


19334 


A 


5472 


1 


177 


GGGCSEPRSRHCPPAWATERDSYLHKMK 
QSKTKGERMEVACYCI I ILFMLTDALEV 
LMA 


5434 


19335 


A 


5473 


76 


267 


KERREEKEKEKRKRSERKGRKKEGRRDR 
EKEKEERKDERREGEEGEEGRKEKKEKD 
IKRRGWP 


5435 


19336 


A 


5474 


133 


2 


LCRM WLiFC F EAG S HSVTQ AGVQWCGLN 
SLQPLPPGFKQFSCL 


5436 


19337 


A 


5475 


5 


394 


YGVGTKGFRAGVKEFQALFLLI TVHS QL 
CHIEGTRHHSRHI IVWDFINQIENGTF 
FPLVLSNGCI WLL I VYLFWWI FTRQEGQ 
FFYGKLSHYSVFVFCFLRRS FALAQAGV 
RWCDL.GSLQPLPPGFKRF 


5437 


19338 


A 


5476 


163 


2 


SEKPWWALTLMTFSHFYFFIYFYYIFI 
FQMES CS ITQAG VQWRDLNSLQPLP 


5438 


19339 


A 


5477 


111 


2 


FLFLKQDLPGAVAKACNPSTLGGRGRQI 
TRSGDRDH 


5439 


19340 


A 


5478 


232 


0 


HLVWRMESGSVAHAAVQWCDLSSLQPPP 
PRFKPFPSS 


5440 


19341 


A 


5479 


388 


225 


ALFSSKKKKKRKKERKRKKEKKKKKESM 
QRKGGHLPRVCCGASMVSPGHKLSL 


5441 


19342 


A 


5480 


150 


2 


HRVAVIYEVFFVCL.FFYFLFLRQSLTLV 
TQPG VQ WRNLS KLQ PPHPGFK 


5442 


19343 


A 


5481 


156 


395 


DSVCTCVLHTCHSSGIjSQGRSCLHCSLG 
LPNSNCPCVCVCVCVCVCVCVCVCVSFC 
PLHQGVCVCVCVCVCVCVFLSAHFTKGC 
VCVCVCVCVCVCFS PPI 


5443 


19344 


A 


5482 


391 


115 


RACPDCRVRFPHTSPPCLPCGPEAEPGP 
GPALRELVQPLPGQLQPPFGMPPSGLGA 
APSPAPPACAWTRPPHLHPSSFSSSVPQ 
ISSLFLCF 


5444 


19345 


A 


5483 


392 


2 


FFFFFSETQFRSCCPRLECNGAISVHCN 
LRLPGSNNSP 


5445 


19346 


A 


5484 


381 


282 


PVPWGQAIFLPPPPPYLGIPGPPPPPGL 
FLFP 


5446 


19347 


A 


5485 


407 


3 


FFFFFFLRRSLTVAQAGVRWCDIjGSIjQA 
PPPGFT 


5447 


19348 


A 


5486 


2 


417 


QENHLNPGGVGCSEPRSCHCTPAWVTEY 
DKKKKK 


5448 


19349 


A 


5487 




413 


VYLHPSLQPSLSSLVPDQAIjSHDIWIjKK 

kkkkkkkkkkkrggi s pps p vgg vgpre 
twgvwgpganlfqrgggvrelfsqkkkg 
ekgqrnpekkpphkrgpggpkekfiffe 

KKKKKGGKPFGKKTKPPPPRVNPGG 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
Inst amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5449 


19350 


A 


5488 


229 


411 


IHGPGFKNGGRGKXTiGLPNLMALKKIPF 
SPGGYKPKNP 


5450 


19351 


A 


5489 


190 


50 


YHHHHHCHHHHHHQCHHRHHHDHHYHGD 
NDYSSHHSRSTCHVAVTV 


5451 


19352 


A 


5490 


615 


402 


RWCLTLSPRLECSGMISAHYNLHLPGSK 
QLS CLSLPS S WDYRCPPPGKTTFSRGI R 
DAIiKPRKPKR IKLI 


5452 


19353 


A 


5491 


338 


53 


KGPPFFFFCKKGGLFCSNWFLTPGGKWF 
SPPTPPRSGGKRVPPEGGGKGFSLFFFV 
EGGSCHIAQAGLKLLGSSDLVTSASQSA 
GIMGKS PHAWP 


5453 


19354 


A 


5492 


266 


412 


NPNFFERKSCSVSQAGVHRRELNSLKAA 
PSGYTPASCVSLPSSWDYRRP 


5454 


19355 


A 


5493 


285 


412 


RDGVS ALS PGLE CGGM I IAHCNLELLGS 
SDPSALASQSTWNY 


5455 


19356 


A 


5494 


1 


1350 


MGFMTGICIEDNWGPSPESGSVLTVGG 
VR I QMLiDRCHTAHC P V I PGLL.RGQLWTE 
NLAERNS HDRR P F P TWCS H FAQDL W P E 
QS I KDS YQKVIliRKFEKCGHGKTLHFKKG 
CESVDECKLHKRGYNGLNQCLTTTQSKI 
FQCGKYVKVFHQFSNSKRHKRRHTEKKP 
LKYIEGDKAFNQSSTHTTHKKIDTGEKP 
YKCEECGKAFNRSSHLTTHKITHTREKP 
YKCEECGKVFKYFSSFTTHKKIHSGEKP 
YICEECGKAFMYPYTLTTHKI IHTGEQP 
YKCKECDKAFNHPATLSSHKKIHTGEKP 
YTCDKCGKAFISSSILSKHEKIHTGEKP 
YKCEECGKAFTRSSHLTMHKI IHTGEKP 
YKCEECGKAFTWSAGLHKHRRTHTGEKP 
YKCEE CGKAYTTS SNLTEHKTTHTG EKP 
YKCKECGKAFNWSSDLNKHKRIHIGQKP 
RT 


5456 


19357 


A 


5495 


4 


146 


PPTRPrCYYLSFYVSYPTGYKEDTRMVN 
KHMKRCSTSLVYLLWINFY 


5457 


19358 


A 


5496 


5 


388 


CGCFFFSPPPKKKVFPPPFFFFAPGFFP 
PPLFFTPPPQFFFWAPKKKKNSPPPPGK 
KIFFLKGPPPP 


5458 


19359 


A 


5497 


231 


163 


FFFFFFFFFFFFFFFFXXTNKWF 


5459 


19360 


A 


5498 


163 


2 


SVKFTLGKVSQFLKKHTPAGWAHTVIPA 
LWEAEAGGSRGQEI ETI LTNTVKPH 


5460 


19361 


A 


5499 


229 


388 


RTHLQASSDLFCLYSLKTGSCCVAYAGV 
QRHDLS SLQPPPPRFKRSS CLRP PE 


5461 


19362 


A 


5500 


405 


289 


HVAQAGLKLLASSDPPTPASQSAGVTSV 
SHCTH PRCLFV 


5462 


19363 


A 


5501 


286 


375 


GDFVRFSFWPGGGAHSCNPSTLGGRGGQ 
VT 


5463 


19364 


A 


5502 


20 


374 


ILQFQNFWIVLFKNFYHFGKFLIYILNW 
FSGLFWYYFSELSCFSLSSKIWILNLLS 
GILNVFFSISGVRKKIFLIKRYCWLGTV 
THACEPSTLGGQGRQITRSGDRDHPGQR 
WKIQN 


5464 


19365 


A 


5503 


1 


185 


FHHIGQAGLILLTSSNLPASASQKAGIT 
GMSHCTQP III FLYSLFLQLS FSWQDF V 
MRKKF 


5465 


19366 


A 


5504 


202 


402 


NSFSSTAKLREKYRQFLYTPIPTGTQPL 
P LLT F CTRPDTVAHACNHS TLG GQGGQ I 
TRSGACNQPGQ 


5466 


19367 


A' 


5505 


118 


7 j 


I KKENNWPGAVAHACNPS ILGGRDGR I T 
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peptide 

sequence 


Predict- 
ed end 
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location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspnrtic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
■^Threonine, V=Vnline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Y=possible 
nucleotide insertion 














RSGDRDHPG 


5467 


19368 


A 


5506 


205 


422 


ANHFTS IRMAI INNNKQTENNKCWQGQG 
EVGTLVHCWWECKMVWMLRQIVWSFLKK 
LNIQLSRDPATLLLAI 


5468 


19369 


A 


5507 


3 


425 


RSCHCTPAWATERDSVSQKKKKKKKKK 


5469 


19370 


A 


5508 


3 


86 


EKKLQVLL.NCMTE I YYQ FKKDKAERS K 


5470 


19371 


A 


5509 


395 


108 


GVLECNDT ITAHCSIjDLVGS S D P PTS AS 
QVTGTTGTCHHAQLGL.PKCWDDRHEPRL 
PAWNKTLRRKNWHSLLPWPAIQELEVG 
GSLRPRSLRLE 


5471 


19372 


A 


5510 


421 


59 


FFFFFFLRRSLAVSQAGVQWWDLGSLQA 
LQA 


5472 


19373 


A 


5511 


168 


3 


EICKMCFPVPFGVFFLWIVFVFVFVFVli 
RWRMESLSVSQAGVQWCNLSSLQPPP 


5473 


19374 


A 


5512 


446 


149 


FFFFFFFCFFETESCSVARLESSGMISA 
HSSHNLCI 


5474 


19375 


A 


5513 


74 


398 


DRI IDAGPKGNYARFMNHCCQPNCETQK 
WSVNGDTRVGLFALSDI KAGKNHFR I LQ 
LTSEFQGFCFLQTASSDYNFKHFYMYRS 
LELSSFTFLCNHHHHPSPELFSSS 


5475 


19376 


A 


5514 


73 


335 


EGGS LFCPGWSAGAliFWLTAPSASRGWG 
GAPGVPWGEVGQKNCWNPGGGGCNELG 
LHHCIPPWGTGKTLSPKKKKKKGATDMG 
GAS 


5476 


19377 


A 


5515 


416 


0 


WLGPGAVAHTCNPTTLGGRGRWITRSRD 


5477 


19378 


A 


5516 


242 


423 


AGSPLIHQGGOAI ILTQNPTPGWGTMVF 
Q P VLRP VKGL YNANH VTSS P VAS QPIIF 
.TTAG 


5478 


19379 


A 


5517 


2 


151 


GRVGFVGMGSHCVAOAGLELLGSGGPPT 
SASQSAIIAGVSHRAWPYFPS 


5479 


19380 


A 


5518 


312 


405 


GQVRPGAVAHAC S PNTLGGRGR I TRSGV 
PDQ 


5480 


19381 


A 


5519 


3 


387 


ACSGPRSCHCTPAWATHQDSVSKKKRKK 
KREKPFLL3KKRGPTLTWGEPKKGIPLG 
ERNGPPSHFKKGPPLAPKPPGGNLHPEI 
AQGPGSPTDGNKKGDLNPTPVFYKFKTP 
EQAINWAAPPFPSEK 


5481 


19382 


A 


5520 


292 


140 


KKGPPRARPGGGGSPPLFFFFFFFFFFF 
FFFFFWESTLKKRELHRSDLCV 


5482 


19383 


A 


5521 


1 


399 


KKWLFLSSAQ I PTKLSRRRRNRETWPKE 
RSKINLHKPILKKQRHELPDKEFKIALI 
KMIiYELKVTMHEQNENINKETENTKKKK 
KKKENPPLFGTGKNQSPPEKFQMGGQPP 
PPPPQKKKPPPPNRFFVFSPP 


5483 


19384 


A 


5523 


316 


399 


ML L. Y W PGT VA Y ACN PS TLGGRGGR I TRL 


5484 


19385 


A 


5524 


400 


0 


FFFFFFFCFFFFFLFFFS 


5485 


19386 


A 


5525 


3 


429 


TTNIGRAMGATTSIVGSDTSQAERPGGT 
TWSPGASSTSQSSRPGTSVTPDSSASE' 
SETVTTKEFSGTTAISRTSHTGTPAASG 
GQATGSLTATTGVAPGTTVAPGS SNTEA 
TTS VGEKKKDQKKKKAS S P 


5486 


19387 


A 


5526 


135 


3 


GGKICTLIKQSVIQESWPGTVAHACNTS 
TLGGRGGWIMKSGDR 


5487 


19388 


A 


5527 


105 


5 


KKKKKKKS GRWPGAVADACNPN I LGGQG 
GQITI 


5488 


19389 


A 


5528 


411 


31 


GQRVPLSRGVAKKKKFLWNPGGGAGKNG 
GKNFLKKKPPGAPPPSGGPKKGPGPPPP 



877 



JSDOCID: <WO 0164835A2J_> 



WO 01/64835 



PCT/USO 1/0492 7 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


1Y1 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 
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nucJe- 

location 
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last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny}alanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucinc, M=iVlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, "=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














KKPLILVS PPPRKGGAHNMHFRQKKSGN 
KGGEGSTKGGKEKIFLGKFVQKNFFFFF 
DPEFHS CCLGWSAMP 


5489 


19390 


A 


5529 


411 


142 


PRFSFLFSPPRKKEVFPPPKILGGPRFF 
PPPRFLKPPPHFFFWGPKKKKKFPPPPG 
EKFFFFKGAPPPFFFFFFFFFFFFFFF 


5490 


19391 


A 


5530 


174 


401 


QVQPQQKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKASPGGGG KKKKK 
ERGGGGERKNIIWVPLIjFPP 


5491 


19392 


A 


5531 


410 


0 


QPQSKKKKKKKKKKKKKKKKKKKKKKAS 
QKKKKKKKKKKKKKKSSGGGFFKEV 


5492 


19393 


A 


5532 


383 


145 


GGFFFNPPAKKGSFPQPHNLGGPPVFSP 
PPVFKTPPRNIFFGAPKKKKFFPPPGAK 
KYFFKKAPPPFFFFFFFFFFFFF 


5493 


19394 


A 


5533 


418 


328 


KTGFRHVGQAGLKLLASSDLPALASQSA 
GL 


5494 


19395 


A 


5534 


169 


3 


1 1 LI I HPCSHVPRRETLGLKACTFKNL F 
WPGMVAHACNPSTSGGRERRITRSGDR 


5495 


19396 


A 


5535 


1 


254 


PTRPCKPPQTSPNSGKLSPSSNGCMNTL 
HISSTNTVGEVIEALLKKFLVTESPAKF 
ALYKRCHREDQVYACKLSDREHPLYLHD 


5496 


19397 


A 


5536 


43 


404 


QGGLTVLPRLVLNSWPDKRSSRLGLPEC 
WDYRYEPPHQAN 


5497 


19398 


A 


5538 


2 


212 


KENHLNPGGGGCSELRSCRCTPAWATER 
NSVSKKKKKNFRGGGLKGKGPGFYPLEK 
KFPGVGFFPKTNLV 


5498 


19399 


A 


5539 


2 


387 


I FQEHKNCGEMSE I EAKVKYVKLARSLR 
TYGVSFFLVKEKNKGKNKLVPRLLGITK 
DSVMRVDEKTKEVLQEWPLTTVKRWAAS 
PKSFTLDFGEYQESYYSVQTTEGEQISQ 
LIAGYIDI ILKKKKKKD 


5499 


19400 


A 


5540 


317 


407 


GliSHHVFFEEGKLKKPRIKDKDKKVPEP 
DN 


5500 


19401 


A 


5541 


120 


3 


NFFFFFLRRSFALSPRLECSGTISAHCN 
LHLPGSSDSP 


5501 


19402 


A 


5542 


430 


0 


QKAQSNGPEKQEKRGVIQNFKRTLSKKK 


5502 


19403 


A 


5543 


172 


2 


FIFNFLHIFSFWFITKFGCFIVLFLRQS 
LALLPRPGVQWRDLGSLQPPPPRVRPRV 


5503 


19404 


A 


5544 


149 


2 


EKDIQELNSALHQADLIDIYRTLHRKST 
FFSAPHRTYSKIDHIVGSKAL 


5504 


19405 


A 


5545 


84 


276 


TLKKRPKEIRNQGGSPQRKRAQLGKLQQ 
KYPVLNFKAPFFGERGDYYKSFFKTCLD 
NLPRRGKG 


5505 


19406 


A 


5546 


2 


914 


RPAWHEEQKS KNEPEDTEDKEGCQMDKE 
PS AVKKKPKPTN P VE I KEELKSTS PASE 
KADPGAVKDKAS PEPE KDFSE KAKP S PH 
PIKDKLKGKDETDSPTVHLGLDSDSESE 
LVIDLGEDHSGREGRKNKKEPKEPSPKQ 
D WGKTP PS TTVGSHS P PETPVXiTRS SA 
QTSAAGATATTSTSSTVrVTAPAPAATG 
S P VKKQR P LLP KE TAP AVQRWWNS S S K 
FQTSSQKWYMQDSSSSSSSSSPQNQQQQ 
PQS SQGTRYHTRQAVKAVQQKE ITQSPS 
TSTITLVTSTQSSPLVTSSGSMST 


5506 


19407 


A 


5547 


2 


244 


TRLIFVFLVETGFRRVGQAGLKLLTSGY 
VGTSASQRMTAHLCLPRCWDYRCEQLHP 
AQRFFKRMLSVAVDAALLHTSGGM 


5507 


1940S 


A 


5548 


406 


1 


FFFFFSETESCSISQARVQWCNLSSLKP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

olide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, ]=Iso)eucine, 
K=Lysine, L=Leucine, M=lY1ethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5508 


19409 


A 


5549 


417 


43 


I RPARKGCS SGL FGAPLAGGGAPP PP P P 
RQTSPPPPHTKGGGGGPKKPPPKKKRPP 
PKKKGGGKKNPPKKKKKNTPP P PKS AP P 
PAAKTDPPFPLTVPPFLPPYIFTKKIPF 
LKSYFFFFY 


5509 


19410 


A 


5550 


3 


136 


QPRFPTLTNIWENNPTLLNPI KRIAAGS 
LFAGFLITNNISYHSP 


5510 


19411 


A 


5551 


75 


194 


NKELSRNKKKKKKKKKKKKKKKKKRGGG 
LKKKNGGGAKT 


5511 


19412 


A 


5552 


343 


493 


IIFSFFFFFFLKGSFVFAPQAGGHGGNF 
GPIiKPPPLGLKEFS CLTLLS SW 


5512 


19413 


A 


5553 


148 


3 


DAVSLFCPGSLEHSDTI IAHYSPDLLGS 
SSLSASTSRVAGITGICHHA 


5513 


19414 


A 


5554 


407 


252 


KKKFLKKLAGQGGMRLQS LLLRRLRQED 
HSSSGVQSCSELWGGKLIYSKNR 


5514 


19415 


A 


5555 


3 


231 


HEFVFLVETRFHHVGRAGLELLI SSYPL 
ASAFREAKARREDLLSPGVKDKPGOYRD 
TLTLKKKKKS FF I YLGECW 


5515 


19416 


A 


5556 


3 


131 


SWVSWEESKDAKELVGYYIEASVAGSG 
KWEPCNNNPVKGSR 


5516 


19417 


A 


5557 


3 


131 


SWVSWEES KDAKELVGY Y I E AS VAGS G 
KWEPCNNNPVKGSR 


5517 


19418 


A 


5558 


462 


255 


RSHHSAQAGLELQASSDPPALTSQSAGV 
TNVSLRAQPILACSSVEVSSRCQVCLHV 
RRGKCPRAESYR 


5518 


19419 


A 


5559 


445 


29 


KSSNILILQKVENGDLSNKIL.KITDFGL 
ARE WHRTTKM S AAGT YA WMA P E V I RAS M 
FS KGSDVWS YGVLL WELLTGE VP FRGID 
GLAVAYGVAMNKliAL-PIPSTCPEPFAKL 
MEDCWNPDPHSRPSFTNILASCQAWR 


5519 


19420 


A • 


5560 


225 


475 


GAVESSCTEEHMWIPEKAVEGSLTPTPE 
SRSVAHAGVQWHNVSSLQPPPPGFRRFS 
CLSLPSSWDYTSAGEGCAASYRSKVLK 


5520 


19421 


A 


5561 


248 


142 


NKNI WVQKKCVFYI YLPFI CVC VCVC VC 
VCVCVCV 


5521 


19422 


A 


5562 


262 


456 


NFPLKTTYFLKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKASS PPPLI KRGPKN 


5522 


19423 


A 


5563 


314 


444 


LKSMFCFVLFCFFEIESRSVAQAGVQWC 
HLSPL.QPPPPGFKRS 


5523 


19424 


A 


5564 


220 


486 


LNSSHPESPLPLFGLVLQFFYPSNPKSF 
NKKNLE I RMLLTVDRGPNHPDS L F FVHS 
NVDTVFLPQEGPS FLQ PLGQG I TEAFEH 
NQAYV 


5524 


19425 


A 


5565 


176 


492 


CTKTAVASKARYLQALLYALASVTIAAT 
ATRIASISGASPIFRSFPYL.QPTAPGLV 
LPS ILILVDVGLRKEKGPDTVAHACGPG 
T LGGRGGR I TR S G VRDRPDRH 


5525 


19426 


A 


5567 


24 


470 


PMDXKXPXXXETPLNGGAGENQDDMFAK 
LKEKI>FNEINKIPLPPWALIAIAGVAGL 
LLLTCCFCICKKCCCKKKKNKKEKGKGM 
KNAMNMKDMKGGQLPQDDDDAKTGL.TEG 
EGEGEEEKEPENLGKLQFSLDYD FQANQ 
LTVGVLQAA 


5526 


19427 


A 


5568 


45 


485 


GXDGRLLRGHNQYAYDGLDYIAIiNEDLS 
CWTAADTVAQITQRKWEAAREAEQLTAY 
LEGLCVEWIiRTHLENGKETLQRADPPKT 
HVTHHPVSDHEATLRCWALGFYPAEITL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, ' 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TWQRDGEDQTHDTELiVETRPAGDRTFQK 
WAAVEVP 


5527 


19428 


A 


5569 


221 


422 


FFFLTGRVRQSLADLEQKIEVHSMPPFE 
SLALS PTL Y CRG AI LGATSAS QVAG I TN 
ICHHAQPSYLF 


5528 


19429 


A 


5570 


401 


477 


IFSRDKDVAMLPRIVLNSWAQVIUj 


5529 


19430 


A 


5571 


102 


36 


EVENNDD I SHKHHHHHHHKS T 


5530 


19431 


A 


5572 


106 


407 


AGRQG WLEGCRPAAWPTLLT PPATS S R 
GASRPPTATCPTPDPAPATTTWQPWPPR 
PLWAGVQWHDLGSSQPPPPGIKQFSCLS 
CP S S WDYRGRRAVCHH 


5531 


19432 


A 


5574 


253 


363 


YHSNQCVGLRKWPGAWHACKPSTLGGR 
GGWIMRSGA 


5532 


19433 


A 


5575 


1 


132 


MECHYVAQAGLELPGSSDLPASASQSTG 
ITGVSHCAWPGFFFFF 


5533 


19434 


A 


5576 


18 


37S 


TPGGGGCIEPRWHHCTPAWATRAKLSYK 
KKKKKKK 


5534 


19435 


A 


5577 


21 


232 


ILRCTTIETLLKVQDNHLNPGGGSCSEP 
RSHHCTPTWARERDSVSKKKKKGGAALW 
DPGGGQPSAGSQTR 


5535 


39436 


A 


557S 


377 


2 


LSL I QTI TLCLGAITTLFAAVCAVTQND 
IKKIEAFSTSSQIGLIIVTIGINQPHLA 
FLHICTHAFFKAILFMCSGS I IHNLNNE 
QD IRK I GGLLKT I PLTSTS LTIGS LALA 
GIPFLTGFYRTRG 


5536 


19437 


A 


5579 




227 


IESWLEPVRFLRSMFANNLVYDTSDSDD 
YHLLKDLEEGIQTLMGRLEDGSRRTGQI 
LKQTYS KFDTNSHNHVDAA 


5537 


19438 


A 


5580 


113 


3 


NFPFFFFLRQSIiAVAQAGVQWCDLGSLQ 
PPPPWFKQ . 


5538 


19439 


A 


5581 


184 


3 


IMPPHSRLSPTARFCLQKKTNNNNTPIE 
MAS I QCPGWSQTPGLKPPS CFVLPKCWD 
YRCE 


5539 


19440 


A 


5582 


381 


3 


FFFFLGDRAPPGGEKNPLGGGLKTPGKP 
PGGWGVSPPPQKKKMPPPNFFPPGKKKS 
FGGGGPIFFPGKNFPPPTQKKKRGGGGF 
FFFFKKPGGFFFFFFPFFFFFSSFFLGK 
SFIKISFKKRRRG 


5540 


19441 


A 


5583 


314 


389 


FSCLSLLSSWDYRCVLPHLGNFLYF 


5541 


19442 


A 


5584 


133 


1 


FPLWTSEQGVGRNKQTYVTWQADCKENA 
GGDYYWTFFPQPTFV 


5542 


19443 


A 


5585 


484 


3 


MTEILLKQAMVGI VGS I GSAI GGAVGGG 
ASASGGTAIQAAAAKFHFATGGFTGTGG 
KYEPAG I VHRGE FVFTKEATS R IGVGNL 
YRIiMRGYATGGYVGTPGSMADSRSQASG 
TFEQNNHWINIJDGTNGQIGPAALKAVY 
DMARKGARDSNSAGDQ I FHH 


5543 


19444 


A 


5586 


136 


48 


NTMQPGWTHTCNPSTLGGQGGRITGSG 
R 


5544 


19445 


A 


5587 


306 


479 


DS FFPGVGS AFLYQVAAWTTKYFKKRV 
ALSTD I ARSEMRLTFLLAPFT KFL I DVC 
DW 


5545 


19446 


A 


5588 


203 


2 


ILETGFHHVAQAGLNLLASSNLPASTSQ 
S AGI TGVNHHAW P S FL I QGT KHS FKNS K 
PVFHIKLSKN 


5546 


19447 


A 


5589 


30 


477 


TPSLARGVYPCWLYTGFLHDFLFSPKIE 
ERFMDVGVIjRATEPSNSLFAI iystpie 
IiAIKSWHI I GDGLS I elvavergnrtt r. 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 
sequence — 


Amino acid sequence (A=Alanirte 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutnmine, R=Arginine, S=Serine, 
T=Threonine 7 V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














I SRLPE FE KSSLiSDQSLVTLTSG YFAVY 
RVKLTAKKLEGIHEGAFQITTDYEILTI 
PVKARNALC 


5547 


19448 


A 


5590 


208 


91 


KGHFWVTGKKKVPCPPPRFFFFFFLDGI 
SLCYPGWSAW 


5548 


19449 


A 


5591 


325 


467 


ADLCSPCDILQLDFRHIRKTVDTLLALG 
EKAPPPTSALRSRDL.ISFCF 


5549 


19450 


A 


5592 


332 


403 


LSLCVXHTHTHTHTHTHTREIFIY 


5550 


19451 




5593 


292 


26 


FLLDMEGAEAGAELGLYHCLLCAWETPS 
RbAVLQHLRTPAHRDAQAQRRLQLLQNG 
PTTEEGLAALQS I LS FSHGQLRTPGEEE 
GTGE 








5594 


387 


51 


LKKI FLGRGQGVSLFFSPPLGGPGGRFL 
GLEIGALFGLPGKTPPFPKIKKNYPGLW 
GGALFPLPLEGLGGKMAFPRKGGAPFSQ 
KNPPPPPLGGKTKPPPPLKKKKKKKVL 












362 


AET S PG QGS YQDG LRRPG FAEG P P AGRN 
PRRKKTQTDRRGGSAGSQPQACADGVRL 
CTRLQCSGAISAHCNLCLPGSSDCPASA 
SRVAGIT 








5596 


92 


366 


DQPWPGQ1.PGRSPEAGIRGGSTSRKKPQ 
EEENSDRS PGRQRGI PASGVRGRCAALY 
QAAVQWCHLGSLQPLiPPGFKRLiSCIjSLiP 
SSWDYSH 


5554 


19455 


A 


5597 


3 


138 


TRPRTRGVGQAGLELiLTSSDPPTIaASQS 
AG I 1GVS HRARPQS S S 


5555 


19456 


A 


5598 


150 


3 


S C VI YS WMKFFS flrwsftl vtqag vq w 
CDLSSLQPPPPRFKIiFSCPG 










148 


3 


TQHLNLAKKSQ I TPGMVAHSVAHAGRQW 
CDLiGSPKPLiPPGIjNRASCLP 


5557 


19458 


A 


5600 


195 


1 


MVYFKKFCGFFLFVRSRIACSVAQVGMQ 
WCNHSSLQPRTPGLKQSSHFS FPSRWNK 
RHMQPCPGA 


5558 


19459 




5601 


228 


415 


VNVS I PKDVFLL I I I I LFFFGPAPQAGG 
RGCNLNSLLPWPPGIiKQFSCiiTLPRSWN 
YGLPPP 








5602 


405 


2 


AATGVDRHAQRTQAGLPLKTPQTPCPNF 
TSKARGTPSAQHPSCTRQRKQASNPTQR 
RRRRPQAEHS IRTDAHGPGIiPASGHMAG 
ERPTRKAQSRDVSVLRQRIALLPRLECT 
S AI SAHCSQNLTGSGNP P VS CL 


5560 


19461 


A 


5603 


3 


236 


RQENGMNPGGGACSEPRSRHCTPAWATE 
RNSVS KKKG IKIKGHCS 9 AMHHQYLS P F 
GWRQVCSHCVCPKPWLLVRGH 


5561 


19462 


A 


5604 


184 


2 


RP FLR I QGKI RMFSGSRFI FKFL I LETW 
SHSVPQARVQWCHRSSLQPKPLVFKSSS 
YLLP 


5562 


19463 


A 


5605 


261 


2 


GWFLKGPPL.FLGPKKPGGLGFFQKKLVL 
QKKNPPFGIGPPKNLKKTPLFFFFFLRR 
S LALS PRLECSGT I S AHCNLRI PRTSNS 


5563 


19464 


A 


5606 


134 


2 


GS PFKKLVFFLGQS LTLS PRLE CSGTI S 
AHCKLRLPGPHQS PAS 


5564 


19465 


A 


5607 


3 


269 


RQESHSNPGGGGCGEPRSRHCTPAWLAT " 
ERDSVSKKKKRKRNFLNFIGKKKKPCLP 
KPKX.TKKKKWGGITALVLKLYKRAIAPK 
TGGY 


5565 


19466 


A 


5608 


36 


380 


P P FXXP PGGETFGGTPRGPPPKKKKSPR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valinc, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KRGGGG KKKPPRG APP P KHLLGGKNRGG 
GTPPPPRKTPPLNPKKKRGGGKRGVFFP 
PKGGIFQKRGFFFSRGGGGGAPPPHPPP 
GGG 


5566 


19467 


A 


5609 


3 


41 1 


SSRSRAAALFYFFFFSRRGQKEKEPFLG 
GKTPPVFVFFPGGVGKKNRGFPPPPPAG 
KGPFSGGGGREQTPIWSPPPPPRGKWGK 
PPKPVTPNFNFLGRVGFLGKNPSPGGVC 
PPPKTPIFFFGRPPPGGFFWGGGG 


5567 


19468 


A 


5610 


364 


483 


I S WVLFFLFLKWS FVliVAQAGEQWRGLG 
SLQPLPPGLKGF 


5568 


19469 


A 


5611 


399 




GGCSEPKSRPCTPAWGTERDFVSKKKNK 
NNYLVKKSN 


5569 


19470 


A 


5612 


408 


3 


KEVFS FPPGLEHGG VFYSLP PL.PPRVKG 
FSPPPPPGGGGPRGPPPPPGYFL.LFPKK 
GGFPPLPGGFSSPQIWPPRPPQKVGVPG 
GAPRPGPNFFFFFFRESFPFLARGFPKK 
KKKTCWNVTISHSAS VQS SVLL 


5570 


19471 


A 


5613 


1 


112 


LAQHVFFLLNTS IAS PADS SELLMEVHG 
NGKRPSPER 


5571 


19472 


A 


5614 


2 


389 


FVREGERKKEKRKKEKKRKEGTKEKGKE 
EGRKERKKEKVRKKTVDDRRRPQVTFQA 
VNLESEGAPGNCGVTAPPGGHIWLKQVE 
VQTHLENRCHRLKATVTTTTRPGAEPRS 
PPFIITCGEIMNVFSDM 


5572 


19473 


A 


5615 


385 


3 


KPPPGDPGKTPFFLKSQKITGGGGGAPL 
FPPPQKVKAKKRGYPRRQSFKRARLGPC 
PPSRGGKKKAGFKTNPPTKKKNQTKKPM 
KAKTS KLLQENRSG PGTVAHACNPS TLG 
GQGGWIMRSGDRDTK 


5573 


19474 


A 


5616 


218 


402 


RSYSFS QLVGTVI KARKPES S CWLCAPN 
RKNAFWPGAVAHTCNPSTLGGRGGRIMR 
SGDGD 


5574 


19475 


A 


5617 


2 


235 


KS FG CAS RLQMHGRTHTGE KP Y KCKQ CG 
KAFGCPSNLRRHGRTHTGEKPYKCNQCG 
KVFRCSSQL.QVHGRAHCIDTP 


5575 


19476 


A 


5618 


347 


421 


IYLXXKKKKKKKKKKKKKKQKKKKK 


5576 


19477 


.A 


5619 


92 


1 


ENLALLPRLECNGT I LAHCNLRLPGSSD 
SP 


5577 


19478 


A 


5620 


3 


402 


APPPSFFFKKGLIMAISPTLGAIKLINF 
LFFFKKGKKKKKPPLEKKKKKKKKKGGA 
PLKKKKFS PPGGGRKI FFKGAPKKISRA 
RVKKQRRGEKTRGNPKKKRGKKKRPFPN 
TVSR 


5578 


19479 


A 


5621 


420 


3 


HPPPQGSANFFPPPRKIGINGPPPPAGI 
IFVFLECRGFPQVGQGGFKIRPPNKPPP 
PPPKNLGLQECKSPPPPPIFFFFFLWRWG 
LMTLPRLVSNSWLKQSSHLGLLKHWDYM 
SEAPHLAKNFLKEYEWSLSFHVEGS 


5579 


19480 


A 


5622 


246 


132 


EVAPHLLGCISPTMCVCLCVCVCVCVCV 
CVCVCLCSW 


5580 


19481 


A 


5623 


219 


424 


I LNVMVLG IGTFGRLLGLEGRAIiTNG I K 
ALI RRDMGPGAVAHTCNPKI LGGREGW I 
MRSGDGDHPGQH 


5581 


19482 


A 


5624 


32 


454 


GLEKVHSMTXDRNSRLSVQLRDWDGGGE 
LLQFSVHLDGEDTAYSLQLTAPVAGQLG 
ATTVPPSGLSVPFSTWDQDHDLRRDKNC 
AKSLSGGWWFGTCSHSNLNGQYFRSIPQ 
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NO: of 
nucleotide 
sequence 


1 SEQID 
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sequence 
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NO: in 
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Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosinc, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QRQKLKKGIFWKTWRGRYYPLQATTMLI 
Q 


5582 


19483 


A 


5625 


166 


2 


LFGLVKLTALKGLKMILPSQVWWLTPVI 
PTLWEAKAGGSQGQEFETSLANTVKP 


5583 


19484 


A 


5626 


120 


406 


GEAATQENLAELRPEPELLSPSTVLSRE 
PELPSPSTVLSREPELPSPSTVLSRKPD 
LLSPSTVLSRKPDLiLiSPSTVLSRKPDLL 
SPSTVLSRKPDLLSPSTVLSRKPDLLSP 
STVLSRKPD 


5584 


19485 


A 


5627 


324 


16 


LSPPPPLFKTAPRPQKKNPPPKKKKPAP 
PKKKNLFFFFFFFFLVFFFIWCSWRLAE 
WNETLGECDMRWGQAVGTGTSRELRPGG 
SRSQPRRLAGKELGPHCAP 


5585 


19486 


A 


5628 


378 


72 


GRGGERPPPPQKSGAKKKPGVLNFFPQR 
GKRSGPPGPPGFFKIFSQEAPGFFSSFF 
WGRSLALFPSWGARGGFLVPANLGPPAG 
APKGNPPSKKKKEKIED 


5586 


19487 


A 


5629 


3 


204 


QES CI.NMGGRGCSE PRS CHCTPA WATEK 
DSVS KKKKKWKFGG FYE VI GNFLHLF PK 
WSYPPSFRIF 


5587 


19488 


A 


5630 


177 


1 


SFHEMRFWKLIFI IRIYVHFERRSCPVP 
QSRVQWHDLGSLQPLPLGFKRFSCLGIiL. 
SNW 


5588 


19489 


A 


5631 


154 


3 


KWFQMGSRS VTQAGARQCDHSSLiRPPTP 
QLKRSSCLSFPGGWDYRHVPPC 


5589 


19490 


A 


5632 


239 


17 


EGVRNYLILQPRSLCRLNCSFFPNREKE 
LCHHSSFSSPIiAAAKESQGEKGRLLSQD 
EGLLLWEVFVEDVEINS 


5590 


19491 


A 


5633 


2 


no 


GCSE PRSHHC I PAWVTEGDS VS KKQPP P 
QKKKCIAG 


5591 


19492 


A 


5634 


433 


0 


FFFFFFFFFFLKKIPPPEGGGF 


5592 


19493 


A 


5635 


306 


433 


FS I S IERSSSGRAQLLTPVMPAL WEAEA 
GRSRSQEIKTILAN 


5593 


19494 


A 


5636 


2 


285 


QSRTSVI FNSE KAE RGEETTE EKFEAS R 
GWLVKFKERSCLHN I KVQS EAI S ADVE A 
AASDPEDIAKI IDEDGYTKQHI FGQALC 
LMSNPTTLGG 


5594 


19495 


A 


5637 


160 


1 


L YCGKI YWKFI RLNFFLS QGLTLS P KL 
ECTGTITAHCSLKL.ISPSNPPARAA 


5595 


19496 


A 


5638 


3 


261 


KIQLNPGSQGCSGLSLCHCSPAWATETL 
SQKKKKKNPPFGFKIKGEIEPLREFGTK 
KKKGG FLKKF P PH P LG FFL E P FT WGG F W 
N 


5596 


19497 


A 


5639 


2 


121 


SELRSCHCTPAWRQSKTTSHKKKKNIKK 
KNKFFLPFREN 


5597 


19498 


A 


5640 


370 


180 


VEMEFCHVDQSGLELLISGDTPTSASHN 
AGITGVSHRTWQPSFFSFFKERRNLKSV 
TSLNVSP 


5598 


19499 


A 


5641 


254 


1 


FYILRKVFVFLFPAVSYGKGTYFAVDAS- 
YS AKDT Y S KPDSNGR KHMYWR VLTG V F 
TKGRAGL VT P P P KNPHNPTD L FDS VTNN 


5599 


19500 


A 


5642 


292 


106 


NFVILFYLFKQGSHYVAQAGL.EPLSSSD 
PLATASQRAGITSVSHRAQTKQAETYW 
PFSLFT 


5600 


19501 


A 


5643 


3 


100 


DTFFEVEVEMEKEVCRDLMCTSPKDEEG 
FliLI 


5601 


19502 


A 


5644 




282 


EKVPEAKRLYGKRGDPFYEAQENHNLIG 
VANVFLECLFCDVKLQYAVPI ISQQGEV 
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nucleotide 
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ing to first 

residue of 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G— Glycine, H— Histidine, I^Isoleucine, 
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nucleotide deletion, \=possiblc 
nucleotide insertion' 














STGQPWRTMWQVFCFGLFWFSFYLSYV 
LEVDQETKAN 


5602 


19503 


A 


5645 


142 


67 


NKNRPGTVAHACYPNTLGGQGGQIT 


5603 


19504 


A 


5646 . 


350 


. 213 


GLELLTSSDPPASASQSAGITSMSHRPW 
PPL.ASRLDLCCGTLYLI 


5604 


19505 


A 


5647 . 


3 


180 


EHNAERAFLAQAHLGPGAP CT PG F PS F I 
PTQFPPFRSSGLPNIPVQTISRAAAEKL 
FG 


5605 


19506 


A 


5648 


366 


1 


RS W FTTYKERS LLH I I KVQREAAS AD I E 
TAASYSEDLAKITHESGYPKQQIFNVDE 
TAFYWKMI PLSTC I AREEKS I SGFKAS K 
DRLTLWLGANAI GDI KLKPM IVYHFKTP 
RALKNYASSL 


5606 


. 19507 


A 


5649 


1 


355 


QIQSSQSGKGLL.LTSFIDEETKL.FFFGN 
KVWFCIPPGPGGKLCFKKKKIFGLYPGG 
GGFWGPQTPPLAPTFPPGFPPPPGPNFS 
PRGPFPKMGPALRGPGKKKKPPEKTKPA 
KGGTGP 


5607 


19508 


A 


5650 


348 


124 


RAPPVFFWVFLGQFPKKKSTNSPRGGGP 
PRSFQPFSLPPFFRNVFKKKPPPFFFEG 
GRGPPAPPFKNPFFFFSRIi 


5608 


19509 


A 


5651 


194 


43 


KKKFFFFFYEMESHPVTQAAVQWHHLGS 
L.QPPTPTFERFSCL1TLSLIAIP 


5609 


19510 


A 


5652 


27 


244 


KKQQSTLFLFYFLSFILFFFLVKTGSHY 
AAQAGLELRASSNPNTSVSLPKS WDYGH 
EHTVPS PTWFLAAKTP 














GLFLSCLFFFFLEKGVPLTPPGATGENP 
APKKKGTGQKLL KNRDF F LKKAM FG KQ K 
QKKFFFAPTITGDQRAPRVFKGPGAQKE 
PFAQKKKERGGGKKTRAAPNKKKKPGAF 
FSLKKTPLWAGGG PPPFF 


561 1 


19512 


A 


5654 


391 


68 


NLYPHGGVPGVSPGKGSLPKAPNPLGGI 
PPSGEPPPL1PPQGFFPCLKPKKRL1GKAP 
GPFFFFFLAILLGWFQISNTDSCVQVFI 
IQTIWI IFTTIIKGCHLHFTFIjF 






A 








GARFLFFS PVLS PL I EHCVCVCVCVC VC 
VCVCVCVCVLCACMCT 


5613 




A 


5656 


88 




SIFCFIYVFFLRQSFTLVAQAGVQWRDL 
GPLQSPPQKKKGKETFSFLGENRYLQPK 
NFPTFLTQVCFFEKIPPNGLINGKRPKW 
I PGCE FLKTL WGPWV 


5614 


19515 


A 


5657 


176 


371 


HFYFCFSDINLAAEPKVNRGKAGVKRSA 
AEMYGSVTEHPSPSPLLRSGTLLFITAL 
CPSVGIFSF 


5615 


19516 


A 


5658 


3 


113 


CSE PRS RHCTPAWATERDS VS KKQQTKN 
LSYKKIHW 


5616 


19517 


A 


5659 


409 


67 


SSSNKPPTRPVFILPFFGHGVPRLPPGK 
KPPGPPPGPPPPQVEQMYGRKGGFALNL 
PPRRQNKNMLYSPELGEK 






A 


5660 






GR I S CSRNTPLRHRSLMLI VCGLRMPLS 
GFL PWVGQVFLALI PVNPQPS S LGVRT I 
MDMLRCCRDFIYGQFWGQFMARLWGACS 
QQYTEFFRFKI I FI SSLLFNI VLEVLAR 
AIRQEKELNGMLQ 


5618 . 


19519 


A 


5661 


256 


381 


NFNTGQAWWLMPVILAIWEAEAGESRGQ 
EIETIIiANTVKPHLi . 


5619 


19520 


A 


5662 


19 


313 


LYSSLAILGFLPLLINFIIIIIFFFFER 
DPGQQHQNFVSLKKRETP P P P FS LGVPG .. 
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KRSLLP P PPG PKHAGPPVLPAGliWGGPE 
PPTWGRGAKQKRE I 


5620 


19521 


A 


5663 


78 


365 


ETGRKRKETRWNLKKVCLLKMSLQGEKE 
ERKREKYRRTEMKRQKNSEK.QRGREKGK 
KSKYQRRRKKEKKGENQTKPKPKVEESS 
SRQ 


5621 


19522 


A 


5664 


159 


28 


DTSTLPTPTGFKRFFCLSLPSKWDYRRP 
PICLPNFFFFLWTGY 


5622 


19523 


A 


5665 


350 


151 


IRCLCPDKFLAFCRDGVLLCCSGWSRTP 
GLKRS P CPSLP KS WDHRLCLAD IYIYIF 
LKDTSTIMSF 


5623 


19524 


A 


5666 


394 


248 


LTPGGGGCGKLRL.RHCTPAWATRANLPL 
KTKKKRKRFLFTLAETSKVY 


5624 


19525 


A 


5667 


1 


194 


RLNPGGRGCSELRSRHCI PAWATRTKLH 
LKKKKKGDQG FRGDHTKNI WLFPRVFLP 
LHSLNTLK 


5625 


19526 


A 


5668 


2 


312 


RGQNSARG III IAI LKKVCVYI YROGLT 
LSPKIiKCRGTI IAPCSLELMGSCDPSTS 
ASQIARTADMHHHTRLI IASLTFCLLKK 
KKNVFCGTYNHCPPKGFYL 


5626 


19527 


A 


5669 


1 


245 


RLNPGGGGCGEPRSSHCTPVWATRVKLS 
LKKKKKKRPPFFFFGIRQPPQGGPFQTL 
TPLNLLKNIjTTLLGAKKFFFFTGGK 


5627 


19528 


A 


5670 


' 


135 


ARTPSPSLSESSEDEKPTKKHKKGKALR 
L KRRFWWLMS AL PC IH 


5628 


19529 


A 


5671 


97 


2 


KKFTHVWPGWAHACSPSTLGGRGGRIT 
RLN 


5629 


19530 


A 


5672 


362 


173 


VFSPPPPFKTPPPPPKKIPPPKKKKTPP 
PKKKKLFFFFFFFFFFFFFFGTRIVIDN 
IIjSGNRV 


5630 


19531 


A 


5673 


237 


13 


SPGGGGGFAPYWMFFLDFERQSCSVAQA 
GVQWHNLSSLQPSPPRFYFFSRDGVWNR 
PVPR YAPPS PKG I SLRPYK 


5631 


19532 


A 


5674 


245 


14 


ANFLKAHIVSTLGLQIIWSLLQLLNFAI 
VFYFLRQSFALVAQAGVQWRDIGSLQPL 
PPRFKRFACQRFSWEAEVAVS 


5632 


19533 


A 


5675 


2 


249 


SHPTHSHPQHLPLTPTWNWS STPVDFI F 
RKAPPVFPHWQHHRAGPGTSLIPTDAPQ 
EASPPPPYPSSPSSTPTIPRETFLRQ 


5633 


19534 


A 


5676 


115 


241 


SSTLGGQGS ITLAQELRSCHCTPAWATG 
QDSISKKKKTLLGP 


5634 


19535 


A 


5677 


460 


2 


RLGVTGRWQKLKPQERMRAWRENGKGPK 
ALPASWEGPTVNLLWKHEHRAWVSCHCD 
SGFCAWCQGLGSEQSGLPSLIPSWDGLV 
LTSPPNPTGPPPPLKTRSVLRGDDVLLP 
CDQPSNliARALWLLNGSMGLSDGQGGYR 
VGVDGLLVTDAQ 


5635 


19536 


A 


5678 


2 


346 


ENCZiNSGGEGCS KLRLCHCTPAWVTERD 
S ITKEKKKKNFKPVLEKFGE I FFLKGWK 
FFQLNPLKS SKEME FFNTTLPLE PCFNG 
NKPFWAAKI WTLNFFQTQKGS I FKWGGR 
KK 


5636 


19537 


A 


5679 


181 


4 


PKEVRTLFGRFFFFFHMESHSWQAGVQ 
WCNFSSPQPPPPGFQKFSCLSLPISWNS 
RHM 


5637 


19538 


A 


5680 


549 


410 


FFVQTDFCHVAQAGLQLLGSSNPPASAS 
PKYWDYRREPVCWPSSGF 


5638 


19539 


A 


5681 


111 


403 


SRQINGFFWSRTKVAIGQFLVGPSVKI 
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FENNNEICMIKS IQSYIVLNHKHLKVYS 
SNI WQLKNILLRGGAVAHACNPS ILGGR 
GRR I TRSRDRDH P 


5639 


19540 


A 


5682 


189 


3 


NNNRKASWKKKDRVLLYCPGCNWCDLGS 
LQPLPPGFKRFSCLKLPKCWDYKHEPLH 
PAPHV 


5640 


19541 


A 


5683 


2 


227 


I KVC VCVR VCVC ILREDSAQ PARCVC VC 
VCVCVCVCVCLDPRRELSPTSQLESSCH 
PLYVPSGGSRINRPYPGGC 


5641 


19542 


A 


5684 


318 


421 


DVPCFPVAFCCDCKFPEASPAMLNCESM 
KPLSFI 


5642 


19543 


A 


5685 


213 


398 


SLCCLLFCCCYFETESRSWQAGVQWHD 
HSSPQPGPPRLKSSSHLSLWRHDHSSPQ 
PGPPRL 














TAQPHDRPLTAS S S LAPGQRVQNLHAYQ 
SGRIiSYDQQQQQQQQQQQQQQALQSRHH 
AQETLHYQNLAKYQHYGQQGQGYCQPDA 


5644 


19545 


A 


5687 


2 


288 


SDRPRSPSFSFPSSPAGVGRASSIFPIL 
SAILLLLGGVCVAASRVYKSKRNIILGA 
GILFVAAGERQREGATGAAHL.GAHVCVC 
VCVCVRARARA 


5645 


19546 


A 


5688 


314 


2 


CRPGSQLQELGDGGGGESFVGGHGQQHP 
GAPQRGRYLQPLLMCGAS PHASPRQPLA 
ILAAGVKFRVLRHEAGRPLRLLMQINPL 
PYSRWHXYRLPSCGCLTCI 


5646 


19547 


A 


5689 


122 


3 


MGGGEFSKKMNINGQGWAHACNPSTLG 
GRGRQITRSGV 


5647 , 


19548 


A 


5690 


144 


1 


EAV KKYT C P T WAG AV AHACN P S TLGG RG 
GWITRSRGRDNNGKHGEIPR 


5648 




A 








LGVFP P FFFLRG EGLS P P P PGGLVKNQ F 
FFPPGQKGPPGTPERGFQGFLGPKFFPP 
LiP PQGPKG I FFLKKKKKKKI P I CALAVL 
TYQHFCYRTGQERVDGLREL, 


5649 


19550 


A 


5692 


12 


147 


YERRFCHIGQAGLEL.LTSGNPPASASRG 
AE I TGVS YHALMRNIS 


5650 


19551 


A 


5693 


183 


395 


GHRFGFL.HPPPPGFGGFFRPPPPKTEGK 
RAPPPPPRFFFFLGGGGFSGGGPRGVLF 
LEHENPPPGAPKGVG 


5651 


19552 






177 


283 


THFLNRTDGRRWSLASLPSSGYGTNTPS 
STVSVSA 


5652 


19553 


A 


5695 


266 


460 


LSQRKYAFFFFFFFFGKKIFFSSPGGEI 
RAQFFLTEPSPFWVKGFSPLPLPKTWDY 
IGRDPPPLG 


5653 


19554 


A 


5696 


— j 


~4V> 


MQPLKVNLLEQKNLGLEFVAPFTWIHCP 
SYWS IAYNVYFSQYSFIWVFCTMFITVN 
IPSKKKKKKKKKKKKKKKKKKSSSPGSP 
GYAPKKAPKNKKKK 


5654 


19555 


A 


5697 


379 


0 


FFFPSQKKKKKKKKKKKK 


5655 


19556 


A 


5698 




187 


PTRPPTRFRFENGVNLGGGGCGEPRLCH 
CTPAWVTERDS VS KKKKKKKKAVGGPWF 
FQKKKV 


5656 


19557 


A 


5699 


374 


0 


DFLLQKKKKKKKKKKKKKKKKKKKKRGG 


5657 


19558 


A 


5700 


219 


413 


PFCPQGGGGGPNFVFLGPPPPGLKE I P P 
PPPRGGGKKKPPPPPRGGFFFFKKKGVS 
PGGPGGSPP - 


5658 


19559 


A 


5701 


381 


249 


FHRAGQDGL.DLLAPGSAPLGLPKCWDYR. ' 
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REPPRPAFFFFFFF 


5659 


19560 


A 


5702 


382 


197 


LANFCIFSRDGILPCCPGWSGTPGLKQS 
AHLCLPKCWDYTHKPPCPAPQVIKPSIN 
ILIVN 


5660 


19561 


A 


5703 


1 


381 


ERGSKHPPRGLKHIRGRTCFVLKGCMIQ 
LCWPPGQGKKKKEGGGEKKKDKKREEE 
GRE KKKNGNQGERVKKKKS KRKKKKRS S 
SS 


5661 


19562 


A 


5704 


291 


167 


NGPPFFFFFLVKMGFCHVGQAGLEFLTP 
GDPPALASRLLGL 


5662 


19563 


A 


5705 


367 


23 


KNPCPPPPPEFFLAPKKKGPPPRKKTRF 
VPPPLFLIKGPPFKRGKGPPPGGFPPLF 
CPLGGKRGGVFLSPRVGKPRGKGGNPLF 
PKKKKKKKRRRRKKEGKKGRKERQGYIH 
TV 


5663 


19564 


A 


5706 


411 


116 


FLKRWGSHYAAQAGLELLDSSDLPVSAS 
QSAGTTGVTHCSRLSYSALACQLCQPVW 
RIPWRGRGVGRETSFRGWGRRYGRAWSS 
SGGGAE S LPGLS PG 


5664 


19565 


A 


5707 


162 


2 


SYCFPNSDDFIPFQPLPHTWFLWPRPFF 
IFKTAHGWARWLTPVIPALWEAEAG 


5665 


19566 


A 


5708 


1 


416 


LPLLYKIPPPPAILFLFIKDSPPPPPPP 
PFFFFFFPPP 


5666 


19567 


A 


5709 


403 


142 


LSKEKKWGWPRGFSPRIFEKPPPPKIFF 
APPKKKIFPPPPGEKKFFFKGAPPPFFF 
FFFS FFFFFFDCYNSSTWNSAWHIRCS I 
SIC 


5667 


19568 


A 


5710 


339 


461 


PST FILFILRRS FALVAQGGVRW RHLG S 
PQPPPPGFKRVPA 


5668 


19569 


A 


5711 


408 


42 


TGAR I I FSGP P KKT YPS P PRAF I I LS F K 
RAAPQ L I Y F YLNLFAYKRLVKLTTV I M R 
RRPTFFLLLFKTFLFVEPGFCHVAQAGL 
QFLGSSTCGSLPQCWDYRCEPQHLAKRS 
TFIKSESAV 


5669 


19570 


A 


5712 


277 


9 


gvcvcvggvwppmglqtpsapwglsla 
pplgtlcsvqwlaesihlcicqax.ve.pl 
rrqlyq vp vrklllas sivsgfgdsiwn 

GSYLR 


5670 


19571 


A 


5713 


411 


0 


FLQILLFLIDIAFFLVFFFFFWKK 


5671 


19572 


A 


5714 


205 


400 


HYLIQSSSSMASESVQLDWMKIPLTWP 
GAVAHTCSPSTLGGRGGRITRSGDGDHP 
GQHDEAPSL 


5672 


19573 


A 


5715 


1 


89 


FFFFKTESRSVAQAGVQWCTLGSLQPPL 
P 


5673 


19574 


A 


5716 


176 


392 


GVHLVLEWKLSMVCAEDDELFPLCSWKK 
FFFFLFLRQSRSVTQAGVQGRNFGSLQP 
LPPGLKRISCLTRPSS 


5674 


19575 


A 


57)7 


1 


89 


FFFFKTESRSVAQAGVQWCTLGSLQPPL 
P 


5675 


19576 


A 


5718 


306 


42 


GSGLRRCALNSVKCLNTLECILLGIYLR 
QIKICVYTKTCTCKFIVTLFIIAKKYEY 
PSVHQLINKMWHIQI IFLVIKRNEGCGG 
SCL 


5676 


19577 


A ■ 


5719 


1 


263 


PTRPGRFFLILGDRGCSELKS CVCTPAW 
VTEGDSISKQNKKQVELRSPAAHLLGGL 
VPNRPQASTRIGDPCTKKLLELISKFSK 
AAA 


5677 


19578 


A ' 


5720 


3 


278 


NQE ILKNVESSRTVQPHFLEFLLSLGWS 
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VDVGRHPGWTGHVSTSWS INCCDDGEGS 
QQEEVISSEDIGAS I FNGQKKSMHI IKQ 
IKYFXiSH 


5678 


19579 


A 


5721 


127 


3 


K KKKKKE EE EQGEGRRRRR KKKKE KE KK 
EKEEDGEGKEKGE 












96 


V 1 hHisbuc ^TPGaXKASClASIjPACWDYR 
PQPPHLA 


5680 


19581 


A 


5723 


3 


261 


QIWQWKYNDDDDDQLFYTRVYLDPGLR 
VGKAEEWAHPAALWEWRPQAEWDLGSKG 
GCKEKNVCLGWAHACHPRLLGGLGRR I 
A 


5681 


19582 


A 


5724 


134 


358 


PPPPPPPXLLFFVSAVPPSPSPSLSFPP 
HRPCYSPPSPTLSSLPPPIiLFPPPPSSP 
PPPPRPSAPSPPPSSPFFS 


5682 


19583 


A 


5726 


248 


470 


tiLNVQNGIRNVCSCPHYTPFHKWKYLS 
VFLTEVDSSQLRRQLCGGSQAAIERMIH 
FGR EKVQ I LH LAL> I Y FLV 


5683 


19584 


A 


5727 


1 


281 


NQHNRGFFDSCDGFFTNYNWREEHLERM 
LiGQAGERRADVY LG VDV F ARWNWGG R F 
DTDKGGGGFLSKGQRPSASPGTSFPHGP 
SIPVCTSKE 


56S4 


19585 


A 


5728 


460 


3 


SQTRTLTLiDRE I TLiPSSGTALIS LVDGS 
GNPVSVEVQSVTDGVKVKVSRVPDGVAE 
YSVWELKLPTLRQRLFRCVS IRENDDGT 
YA I T AVQHVPE KE AI VDNG AH FDGEQ S G 
TVNG VTP PAVQHL TAE VTADSGE YQ VLA 
RWDTPKGGPVTV 


5685 


19586 


A 


5729 


70 


465 


SQCSPHHSVLCQKVQDLESGTL.QSNWKK 
ITPWDENVCVCVCVCVCPCVSMCVYVCV 
YVCLCVCVCVTVCMCVCLYIKGLLFLLA 
KKETAGSFFHSQLTCLGVPCAFTIHPVP 
FLPQCRSHCVSGGRFWPGVXj 


5686 


19587 


A 


5730 


281 


427 


SWLTAAPNFQLQMESRSVAQAGAQWHDL 
GSLQAPPPGPQSAETQKQMIV 


5687 


19588 


A 


5731 


418 


194 


TKFHHVGHAGLELVTSGDLLSSASQSAR 
ITDMHHCSWPASVLSMNRPSGSTRCLRN 
GIFGPLKDFKPNDLTVQL 


5688 


19589 


A 


5732 


229 


2 


PGGAPPLVRSG FPGWGGAPPFVF VPNGE 
PPKNFFFFFDGVSLCRPGWSAVAGSQLT 
ASSTSRVHAILLPQPGIVA 


5689 


19590 


A 


5733 


435 


138 


PPRYPPPLWGAPPHLSRGGGSFTPLTPQ 
GKPRFFLKIKINPGGGPPPYSPPFGGGG 
PRNFFSPGAKGPLVIjISPLFFPPGDQKK 
NSFFKKKKKERERE 


5690 


19591 


A 


5734 


113 


409 


KDVCGG VLTGLSGVLTS PEY PNNY PNSM 
ECHWVIRAAGPAHVKXiVFVDFQVEGNEE 
CTYDYVAVLGGSS PTPVHQYCGS I S APY 
LFFIiAKNIMSYSLPN 


5691 


19592 


A 


5735 


3 


122 


GGWQVQFVSEPCSLSINGVIFGLTSTDL 
L FHLGAEE I SR 


5692 


19593 


A 


5736 


14 


428 


FLALRWGHIRLPASGPRDTATLFSTLDT 
QLLKTLYVGKDETGFYVSKALVHTGVAL 
VPRGLTliAPADGPTTDEVTLQVSGEREG 
SPSTAVRYPSGSVALPSQWLLIGHHELP 
PVLHTTMLRVHPTLGSGTAETRPPEN 


5693 


19594 


A 


5737 


415 


1 


QHINTLGQFWTTKAFLPRMLELQNGHIV 
CLNSVLALSAIPGAIDYCTSKASAFAFM 
ESLTLGLLDCPGVSATTVLPFHTSTEMF 
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E=Glutamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
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X=Unknown, *=Stop codon, /=possiblo 
nucleotide deletion, \=possible 
nucleotide insertion 














QGMR VR F PNLFP PLKP ETVARRT VEA VQ 
LTQALLLLPWTMHALVILKSILPQAA 


5694 


19595 


A 


5738 


335 


152 


GVEMGFHHVVQAGLEFLSSSDPPASASQ 
SAG I TGVSHQASPFFLPLSG I TLPLS I Y 
SSGSD 


5695 


19596 


A 


5739 


35 


462 


IDASDFGECSOAAINDNVVIVYELLEEM 
VDNGFPUVTESNILKELIKPPTTLRSW 
NSITGSSNAGDTLPTGQLSNIPWRRAGV 
KYTNNEAYFDAAEEIDALIDKSRSrVFA 
EIQGVIDACLKLSGMPDLSLSFMNPTLL 
DD 


5696 


19597 


A 


5740 


429 


325 


NPGGRGCSELRSHPCTPAWVTEQDAILG 
GRSGRIT 


5697 


19598 


A 


5741 


344 


423 


FWPGAVAHACNPSTLGGRGGRITRSR 


5698 


19599 


A 


5742 


431 


103 


RGPFFPRPRQKKGGSPGGGPPQNKKNGP 
PLHTGGAKGGSFFKKKKKKKVPKKGPGL 
FFPGPQGPRKGPVGAPPVLIPNRLGPPR 
IPPGWEPGRQIFFLAPVFFFPSKPK 


5699 


19600 


A 


5743 


31 


224 


EEMGFYHAEQADFELLSSNDPAALAFQG 
AGIASMSHCSWPGKNFNMFCCIWKMKRL 
AIHWECIA 


5700 


19601 


A 


5744 


'217 


400 


HSVMGGSPSFFFFFFFFFGEKGLFFFCP 
GGPNLGKGNPPPPGKXGFPPPPPPKGGN 
NGAPP 


5701 


19602 


A 


5745 


4 


455 


DGLEFPGRR FRGQRCS SPP RRGRGQVEA 
L.LTSQTGRRGRGAPHVS DDGRRGRDAPH 
FLDGLVAGKRRS S LP I WDGGRAE TLLTF 
QTGQ PGRGAPHVPDDGRPGRE APHFPDG 
VAAGQRLQSRHFGGQGRRLGGGGCIEPR 
SRHCTPAWAP 


5702 


19603 


A 


5746 


381 


212 


PRGGGGGRYSPPPGGGGRGNSFYPGGGG 
LiNKPHFGPCPPPGGPKMKPPPQKKKKSL 


5703 


19604 


A 


5747 


334 


432 


MFCENYKE IAWPGAVAHSCNPNTIjGGRG 
RHITC 


5704 


19605 


A 


5748 


190 


1 


IPPKKGGLNFFSPPTPKKGVKSKPPQGK 
VFFFLRRS FTLVTQHGVQWRDLGS PQP P 
APGSKHE 


5705 


19606 


A 


5749 


425 


297 


ESAFHHVAQAGLELLGSCDIiPVSASQSV 
GVTGMSHHTRLYCS 


5706 


19607 


A 


5750 


425 


0 


KKKQRKKERKKERKKERFCKERKKEKKRT 
KSDE 


5707 


19608 


A 


5751 


136 


3 


AQSSRPAWLIFLFLVETGFHNVGQAGLE 
LLASGDLPTSQVPRRP 


5708 


19609 


A 


5752 


3 


237 


GCS EP RSRHCTPAWVTEGDS VS KTNKQK 
TQRGLKS IALFPAIHS LLPWI KLAMP I 
LAMDI FPHPI VEFSYHVWEQM 


5709 


19610 


A 


5753 


1 


358 


GDRGCSELKSCHC I PAWVTEQDLPSKKK 
KKKKGGKKTRNGPQTCGGDLPPPPLGGG 
KNFPKGETFWFHSGKKKRFFSLGKGEKC 
FFIPRAPKGPGKTLGGPNRGGGAKKKLK 
RPPGPVY 


5710 


19611 


A 


5754 


2 


191 


QGCSELCSYHCTPAWETRARPVSKKKKP 
KTKNQTKKNRLG KKP PNLI F P PFQKKKK 
WVQIPGK 


5711 


19612 


A 


5755 


42 


408 


KHAAPPASLSIiSLLLHHGQKRACFPFAF 
CRDCQFLEGSPAMLPVQPAKLTPRSTPP 
HPCSLENDSAHFFN 


5712 . 


19613 


A 


5756 


179 


410 


LQAFLRPSPPVLFLLYADLSNQNLWPGM 
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VARACN P TTLGG RGGR I T 


5713 


19614 


A 


5757 


166 


2 


GRPGPADFRVRPQLLQRFLFIYLFTEME 
SCSVTQAGVQWCNLGSLQPLPPGLPD 


5714 


19615 


A 


5758 


334 




EEKVFGLPSPILLSKGQGGAASGPPCPI 
SPRAMGPQKKGWETMVFITKAPPSFGFV 
LVKQSPGTGPVPRNFFFFLRRSLiALSPR 
AGVQ WRDLGS MQAP PPG FT PFSCLNL 


5715 


19616 


A 


5759 


373 


470 


SSCWSLRNSS PGAVAHACNPNTLGGRGG 
RITR 


5716 


19617 


A 


5760 


210 


384 


EALDTTIFFFFFFETKFCFVPQAGGQGQ 
NLGSLGAPPTGIiTPFSCLTLRKTWDCGP 
PP 


5717 


19618 


A 


5761 


257 


400 


SLGS E YTWVCFCVFETES RS VGQVG VQ W 
CNLSSLQPPPPGFKRFSCLR 


5718 


19619 


A 


5762 


400 


2 


ARAVGS VGEDPGLGTGPS PARETWE WAV 
VCRACRGSQLLWGKLiFIIjRHWDLRKVIjA 
RGNGTEDCLKQIjWSQCGAGMGRGPFQGL 

kgkpvgclcssrevfylflylfrdrvsl 

CHPGWNAVKQTRIiTAASASQ 


5719 


19620 


A 


5763 


34 


440 


RVRAPLGWEEGNYFVLRAKPRSGKRSCL 
PFLRCGHQEPLLWGISTASPLRHLVGAS 
Q I CG VIj F LC FQ VS LKP AG S W KWIGNNG 
ERKAQRLCPLRATRVEG I FCFRFVFLRW 
SFTLVAQAGVQWRDLGSLQPPPIj 


5720 


19621 


A 


5764 


297 


468 


LVGAGVPTGCGQRNGWNSHKSVQAQWL ' 
MPVI PALWEAEAGGSRSQE I ETIIiANMV 
T 


5721 


19622 


A 


5765 


1 


205 


GFKLTDSFASGAKAILRSQLLGRL.RQEN 
RWNLGGGGCMGLKWHNCPPAWAIEGNIjL 
fkkkkkkgggtf 


5722 


19623 


A 


5766 


307 


1 


KERDPPPTPFGKTFFFFFFKRNFCFFPP 
KEGQGGILGPRHPFFSRLSFLGRGVPRG 
FPPGPRNFFFFWILVKMGVPHVGQADFK 
LLTSGDLNKTAS QNAGI K 


5723 


19624 


A 


5767 


2 


319 


LLGNGEELSRFPFQRSSQAWVFESSAGI 
FGGKAG ATGLE KLTGS FQQLTGHPD PRT 
PEELGDPEKSSASEKTRGLQEANGVPEI 
LKAVTYTQAWPKEAKVDPITP 


5724 


19625 


A 


5768 


787 


1118 


EAAWRELEAERAQLQSQLQREQEELIiAR 
RKAEKEQLSEE I AALQQEHDEGLLLAES 
EKQQVREPWRGLCCSLSCSSSGAGPCSA 
TWQ LGALG QATAL LG AS VS L PAGGED 


5725 


19626 


A 


5769 


2 


452 


NGAGTMSVSLEADENPFAQGALKSEDCF 
ILDHGKDGKI FWKGKQANTEERKAALK 
TASDFITKMDYPKQTQVSVLPEGGETPL 
FKQFFKNWRDPDQTDGLGLTYLSSHIAW 
VERVP FDGGNLDT FTAMAAQHGMDDDGT 
GQKQIWRIEG 


5726 


19627 


A 


5770 


2 


130 


GTQGKIVDLVKELDRDTVFALVNYIFFK" 
GKWERPFEVXDWKS 


5727 


19628 


A 


5771 


422 


8 


HIARWGARAPQGGWAbLKAELCLNPAPP 
PIGARTAPLWPGCCFSLRPVFLIiLFPFF 
SLFIQSFRILYPQPWPLQGRRSTTLSRM 
GTTDHVIVXiASTNRADILDGAL>MRPGRL 
DRHVFIDNPTLQERRKCMRRLGASRN 


5728 


19629 


A 


5772 


432 


3 


YSLNLKKRWFVLTHWSVHYYKSSEKSAL 
KJjGTWINSLCSWPPDEKI FKETEYWN 
VAVYGRKHCYRLYTKLLSEASRWSSAIQ 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NVTDTKAPIDTPTOQLIQDLKENCMTSH 
WEQIYKRKKILRYTHHPCIAAAVEDHD 
ND 


5729 


19630 


A . 


5773 


20 


432 


SRAAALIaEAVTETLFYYEVAEKIWSNRA 
NRQCADCGSSRPDWAAVNLGWICKQCA 
GQHRALGSGISKVQSLKLDTSVWSNEIV 
QLFIVLGNDRANRFWAGTLPPGEGLHPD 
ATPGPRGEFISRKYRLGLFRKPHPQ 


5730 


19631 


A 


5774 


433 


2 


RGAFQTLQKKDLQAFVLSSVOLQVLTGS 
CFKLRTVHN I P VPSNKVDDE C PGLYG FL> 
HVIVHFAKGFKQSATLSCALEVDSFGYF 
VSKAKTRVFRDTAE P KWDEE FE I ELEGS 
PSLRILWYEEYNWKSKVNNLYSAAVGFS 
AWQ 


5731 


19632 


A 


5775 


133 


436 


MLISL.PHPSTHLTSLPFCADFGTRQTTG 
AASHVLTLSAHSSVCSCPPLMGEATTSR 
AQDLPADEHTAFAHEL.EAPALSKGQGLS 
ILRALTGHLPLGQEVNL 


5732 


19633 


A 


5776 


397 


58 


KGTIS IGKLEEKLRGAARQALADAIIEF 
QLL P AS L CTE DT P TG MQVKRS LGS R E CH 
RSKQDSVAGERAKVGFRGSTSENYYIRT 
TYSQLSVSLSGHHPGATLAAVNGCSCSP 
T 


5733 


19634 


A 


5777 


1 


324 


MRRCVSAGGVELEEYYPAFLDMVRSMLE 
GSIDPTQYEDTLREMFTIKAYVGFTMDK 
LVQNIARQVSRAGVGPALRTAEMALPS F 
HS PLGRG P WS TANLS CFFLIETK 


5734 


19635 


A 


5778 


1 


398 


RLGSRPSLHDQSPLELRSEIQELKGDVK 
KTVKLFQTEPLCAIQDAEGAIHEVKAAC 
REE IQSNAVRSARWLFETRSLDVINQDP 
SQVRLIRGISLEEGARPDVSATRWIFDT 
QSLDAIREILVDEKDFPPSP 


5735 


19636 


A 


5779 


281 


12 


VACNIiGWRVRTPTPASYPPCVPQLSVRG 
KNRKELLGGFLRNIVKSADEALITGMSG 
LKEVDDFFEHERTNLLEYHTLYCGRSED 
HATSR 


5736 


19637 


A 


'5780 


406 


23 


DSQATGPGISRGRRISSPSSLCTRFSTS 
IVKCSPLSHVLKYLDPPPPPTRCSRPAS 
LRSSPKHCVCX3QFPIACRPGAAAPPPPP 
AQCPNPGPAAASARHEPKWGSRTEGGPA 
RARRREGTGRRKKRR 


5737 


19638 


A 


5781 


415 


18 


TSQAGDDLLHGSLLRRGPEMGYLPGYDP 
D PTLATP PAGQTLAVP SLPRATEPGTGP 
LTTAVTPNGVRGAGPSAPELLTPPPGTT 
APPPPSPASPGPPLGPEGGEEEPPPTI I 
TTTTVWTTVTSPMYCGRSEG 


5738 


19639 


A 


5782 


168 


432 


GMRRQTGLDQQSRGVGKNWRVDFFFPSS 
LSFFFFFFGKESYFVLKFGIHGLKFGYL 
EPLSPGLKEFSRLSPPKGGHLKCRPPSP 
LFFF 


5739 


19640 


A 


5783 


1 


423 


TDDDLNWLDHSRTFREQGVDENETLLLR 
RKFFYS DQNVDSRD P VQLNLL YVQARDD 
ILNGSHPVSFEKACEFGGFQAQIQFGPH 
VEHKHKPGFLDLKEFLPKEYI KQRGAE K 
RI FQEHKNCEKI SERSAKV 


5740 


19641 


A 


5784 


273 


400 


DS WLLMLS PGVCPHVQVWP I SCTLGAT 
FGYVAGLVISPLWI 


5741 


19642 


A | 5785 


445 


28 


DTNRAPNHMEIKGSGGTEAKVNGADS ID 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 

nucle- 

location 
correspon 
ding to 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G = Glycine, H = Histidine, I = Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=V«line, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, "=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KEAAGDETMEARTMEAEATGAKPTGVEA 
TGAKVTETKPTGAEVREMESTEEEANME 
SKPTGAQATDTETTGVEAMGVEATKTKA 
EEAEMQAYGVGAGQAEPPVTGTVLRPL 


5742 


19643 


A 


5786 


380 


2 


QRQQES PEASS LH I LERQVQELQQLLVE 
SQEEKESLGREVESLQSRLSLLENERGN 
TSYDVTTLQDEEGELPDLPGAEVLLSRQ 
LS PS AQEHLAS LQ E Q VAVLTRQNQELME 
KVQIMENFEKDECI 


5743 


19644 


A 


5787 


663 


2 


FGVERRATRTI RLTRTVLDLYS FLAGVS 
ENLRHATQDDASRTRAPGIiSSQHPKPDT 
TVSGDTETGQSPGVFNTKESGMKDIKSE 
IiAEVKDTLKNKSDKLEELDGKVKGYEGQ 
LRQLQEAAQGPTVTMTTNELYQAYVDSK 
IDALREELMEGMDRKLADLKNSCEYKLT 
GLQQQCDDYGSSYLGVIELIGEKETSLR 
KEINNLRARLQE PSAQANC CDS VY 














TRCYHSAXjRYGGSFAELGPRIGRLQQAQ 
liWNFHTGS CQHRAKVIjPPLEQVWNLLHL 
E YKRNYG AKRG G P P VKRAAE P P WQP V P 
PAALSGPSGEEGLSPGGKRRRGCNSEQT 
GLPPGLPLPPPPLPPPPPPV 


5745 


19646 


A 


5789 


116 


371 


RHPKIWGLRGPPGLTLEKRGKGEEPLDP 
RSWGFKEA2TPPRPFTRGKKGTPRLKKK 
P KERKTRERGG KKEG KKKGKKNEGKTNP 

L . 


5746 


19647 


A 


5790 


406 


284 


RRGFFHVDQAGLELPTSGDPPTLASQSK 
PGPLCPALLHTF 


5747 


19648 


A 


5791 


200 


3 


GDALIYMEPEKQVMSRSSDECWALiCDQ 
WLVSYSKKIPLVNLLTFDIiKKNISICPL 
PGTWIMEMY 


5748 


19649 


A 


5792 


372 


278 


LQIQS PAGQSRCGGFL.VREDFVT/TAAHC 
WGR 


5749 


19650 


A 


5793 


1 


167 


VAPPKLDPHSGRVRAQAGLELLTLCDPP 
AS SSQTAAI TGAS HHTRQVLGS YKALS 


5750 


19651 


A 


5795 


112 


3 


SSWGKVGAHAGEYGAEAIjERMFLSFPTT 
KTYFPHCI 


5751 


19652 


A 


5797 


356 


238 


FLFCNNGVYSLEHFRCLAENAGDDAFVK 
DVTVLQNTDGR 


5752 


19653 


A 


5798 


379 


60 


VKIRHCHTMVSMSLLKNIRLILEEEQVE 
GNYSFCVXiDNQNTjQQIjWDWDHRSIjTIKA 
GKMYFAFNPRLCVS E I YRMEE VTGTKGR 
QSKGDINTRSNGERASCE 


5753 


19654 


A 


5799 


377 


3 


KVAFPGEGRQGESWSAFCGHTRWSDLK 
WAVYE PDLMVTSS VG TYIYIWDI KDTRK 
LTVALFWAGASQVKWNKKNANCLiATSH 
DGDVR I WDKRKPSTAVEYLAAHLS KIHG 
LDWHPDSEHNLY 


5754 


19655 


A 


5800 


143 


316 


CPTSPPRLPYTE FFGGVSGLTVEQFRKI 
NGFPNAFWGWGGEDDDLWNRYLCRS FGV 
F 


5755 


19656 


A 


5801 


390 


2 


LFFFFFFFLRQSLTLVPQAGVRWRDLSS 
LQPPPPGFKRFSCIiCV 


5756 


19657 


A 


5802 


342 


465 


VIFFFKLKQGLTLSPRLECSGTI ITHCG 
LDLLGSSESPTFT 


5757 


19658 


A 


5803 


1 


378 


VYSSEFENIKEEYRGRGFPTICYFEKGR 
FLIQYDNYGSTAEDIVEWLKNPQPPQPQ 
. VPETPWADEGGSVYHLTDEDFDQFVKEH 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=lVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SSVLVMPHAPWCGHCKKMNPEFEKASEA 
LHGEADSSGVLAAV 


5758 


19659 


A 


5804 


152 


1 


KPEIFFFESRSVTQTGVQWCNFGSLQPP 
PSGFKLFSCLSLPSSWDYTAHV 


5759 


19660 


A 


5805 


207 


31 


LYLQLLQQTASSGNLNTLSSLHPMGGKC 
FTAAEEQQHPGQQTQLTTVLGVWHRFCE 
NIF 


5760 


19661 


A 


5806 


199 


2 


GSECSCliLSRGGEMHVSLSSFICVTFSCD 
AIIHFSFLFFWRQEFPSVQAGVQWRDLG 
SLQPPPPGV 


5761 


19662 


A 


5807 


385 


239 


QDQPGLQKMCVCVCVCVCVC VCVCVI I C 
KSCIRLSYMKMPINFGLKNL 


5762 


19663 


A 


5808 


39 


184 


GR I I I KFFGEMRSHHAAQTGLELLGSSD 
PPTWASQSAAITGVSLRIRP 


5763 


19664 


A 


5809 


376 


2 


WAEPTSFHVCITSYTQFFRGLTAFTRVR 
WKCLVIDEMQRVKGMTERHWEAVFTLQS 
QQRLLLIDSPLHNTFLELWTMVHFLVPG 
ISRPYIiSSPLRAPSEESQDYYHKWIRL 
HRVTQPFILRSV 


5764 


19665 


A 


5810 


271 


120 


FLCFIGKTSERGSFSIjYSRDTGLPGCQV 
SLMIKKSNEMVFKHINKGSINR 


5765 


19666 


A 


5811 


367 


3 


DKSFKEWHKML.EDANAPTEERDERIAGL 
YEE I ERDLMLLGAS AVEDKXQEGV I ETV 
TSLSLANIKIMVLTGDKQETAINIGYAC 
NMLTDDMNDVFVIAGNNAVEVREELRKA 
KQNLFGRCI 


5766 


19667 


A 


5812 


294 


37 


FRGENPDDGVRGSPPEDYRLRQVASSLF 
RGEHHSRGGTGRLASLFSSLEPQIQPVY' 
VPVPKVSHWAFFPNSSLEPLTYKAIFLP 
TA 


5767 


19668 


A 


5813 


184 


3 


I ISYLKNYLLFSPLQRQNFVLLPRLECS 
GT I TAHCNLKLVGS GDP PAPAS QVARTT 
GMRH 


5768 


19669 


A 


5814 


468 


1 


DDMEYEGSVSVFTQVPRKSWKGLDYTIi 
MVAGE FGLG KSTL VNSL FVSDL YRDRKL 
LGAEERIMQTVEISKHAVDIEEKGVRLR 
LTIVDTPGFGDAVISTECGKRRAMY 


5769 


19670 


A 


5815 


24 


314 


QAPFGPKCFNVNLCFSLSIjSKGEPSFHY 
IAGAHGNEVLGRELLLliLVOFVCQEYIjA 
RNARIVHLVEETRIHVLPSLNPDGYEKA 
YEGVMAIiAHLTQ 


5770 


19671 


A 


5816 


474 


3 


TTLRKGYSRHKGLLHSIRQEVQATPLEG 
IIVSGQMSTMNLSSTLWPSPKRVRRLCI 
GRTLRSLRIRPCKRTPLQTRRMPQKMRA 
LYGDPGDSPPLRSRAIPASSPGSTGSFR 
ARPPHPLHPPPHLTREGEPPLLPVLQPR 
RGPPLPLHSQASRPIV 


5771 


19672 


A 


5817 


511 


389 


GMYCTEGREWPTFRNEIEIEEDHCGRV 
SASIILTHCSNR 


5772 


19673 


A 


5818 


510 


2 


PLSCEVPNPLFPVPGVRMIQHHIEACAQ 
VRGMQE I VL I GFYQPDE PLTQ F I EAAQQ 
E F I LP VR YLQE F AP LGTGGGL YHFR DQ I 
LAGSPEAFFVLNADVCSGFPLSAMLEAH 
RRQRHPFLLLGTTANRTQSLNYGCIVEN 
P QTH EVLH YVE KP S C I AAAI EDQ AS S RQ 
R 


5773 


19674 


A 


5819 


282 


152 


FSLFLFIiLQSERHFIMQWCEATQCPDT 
RVSVRLYEKSVLHLS 
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SEQ1D 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5774 


19675 


A 


5820 


347 


1 


EPHPVTPPAVGTSRYNPLIMMDRNRSAV 
TPPSHPPQQPSSMQTGMNPSAMQGPSPP 
PPPPSYMHIPRYSTNPITVTVSQNLPSG 
QTVPRALQILPQIPSNLYGSPGSIYIRQ 
TKV 


5775 


19676 


A 


5821 


482 


13 


I YEKL I TGC YNI IiANHADPNSGLDES I L 
EECLQYLEKQLESSQARKAMEEFFSDSG 
ELVQIMMATANENLSAKFCNRVLKFFTK 
LFQLTEKSPNPSLLHLCGSLAQLACVEP 
VRLQAWLTRMTTS 


5776 


19677 


A 


5822 


608 


7 


FRFAEKWEGMFI IVNSITIKIHSKAFH 
ASFELWQLQGYSVNPNWQQSDLRLTRIT 
DP CRGE VLT FKE I TWQTLR I E ADATDNG 
DQDPVTTPLRLI TNQGRIQ I ALKRRTKD 
CNVI S S KLM FLLDDL.LWVLTDSQLKAMM 
KYAESLSEAMEKSAHQRKSLAPEPVQIT 
PPAPSAQQSWAQAFGGSQGNSNSSSSPV 
LRPL 


5777 


19678 


A 


5823 


328 


3 


AEVASEDCSLPCFLAVWNRI IEPVAAMR 
KEADMLRLFPEYLKGEELFGLTVHAVLR 
IAESLPGVESCQNYLFRYGRHPLMELPL 
MINPSGCARSEHKIiyTHYKRPSLY 


5778 


19679 


A . 


5824 . 


69 


308 


TSVPSCVRCRYIILRTSSALTNI LGTHS 
NSSFFHASSSALHSCCFPFFSWLQTL.DI 
NVKAP ALM T KA W P EM E KRG YRE 


5779 


19680 


A 


5825 


424 


2 


LQRAFSESIRKLRGYQGADRKQIYHRRC 
SFANHSVRPSADEKCNSNFFEQRHGGSH 
QSSKWTPVGPAPSTSQSQKRSSGLQSGH 
SSQRTSAGSSSGTNSSGQRHDRESYNNS 
GSSSRKKGQHGSEHSKSRSSSPGKPQTV 


5780 


19681 


A . 


5826 


107 


2 


SSLTAGVRMGVPAQSTQGTVNGSSPQMS 
GTAALTS 


5781 


19682 


A 


5827 


2 


437 


FPTEDSRTS KESMSEADRAQKMDGESEE 
EQESVDTGEEEEGGDESDLSSESSIKKK 
FLKRKGMTDSPWIKPARKRRRRSRKKPS 
GALGSESDKSSAGSAEHIGPCDSTGDME 
VSSGLPGSRRPENPVFLCIWVAVRAICP 
GFLWS 


5782 


19683 


A 


5828 


487 


22 


MPEPVFPLSHFRQF IAAI KLQFQARLSR 
CVRDLVRS LAAPNHDTLRML FQHLCRV I 
EHGEQNRMSVQSVAIVFGPTLLRPEVEE 
TSMPMTMVFQNRWELILQQCADI FPPK 


5783 


19684 


A 


5829 


2 


871 


RGICSQRWRREGSQSRGPGLVITSPSGS 
LVTTASSAQTFPISAPMIVSALPPGSQA 
LQWPDLS KKVASTLTEEGGGGGGGGGS 
VAPKPPRGRKKKRMLESGLPEMNDPYVL 
SPEDDDDHQKDGKTYRCRMCSLTFYSKS 
EMQIHSKSHTETKPHKCPHCSKTFANSS 
YLAQHIRIHSGAKPYSCNFCE KSFRQLiS 
HLQQHTR I HS KMHTET I KPHKCPHCS KT 
FANTSYLAQHLRIHSGAKPYNCSYCQKA 
FRQLSHLQQHTRIHTGDRPYKCAHPGCE 
KAFTQLSNLQ 


5784 


19685 


A 


5830 


463 


1 


LPESRJjPSPHKREEGSRARVIMTSYPIE 
PHERRKGS LADWATLKQKKLEEMTRTE 
QEDSSCMEKLLSKDWKEKMERLNTSELL 
GEIKGTPESLAEKERQLSTMITQLISLR 
EQLiiAAHDEQKKliAASQIEKQRQQMDLA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

nucle- 
otide 

corrcspon 
ding to 
Inst amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Jso)eucine, 

N— Asparagine, P—Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptoph:m, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 














RQQQEQ I ARHQQLY 


5785 


19686 


A 


5831 


462 


2 


MILFDSSIiYFLCGWYLSNLILGTYGVRK 
PWYFPFTASYWKSVGFLVEKRQYFLSSS 
L.FFFNENFDNKGSSL.QNREGELEGSAPG 
VTLVSVTKEYEGHKAWQDLSLTFYRDQ 
I T ALLGTN GAGKTT I ISMLTGLHPPTSG 
TIIINGKNLQTVY 














VPSTPASKRKGIVPRCDIFI YDRGGALP 
KD I HVAG I L FASAWMNVWPL LS A I E FH S 
AWAMGMGVNFFVASTHHVSLNMSGSGRY 
APNGRKVYHYDMKTEMGELLLSEVDSHP 
LSSLAYPTADNWNAYATLY 


5787 


19688 


A 


5833 


384 


137 


EEICL.NPGAGGCSEPRLWHCTPAWATKR 
DSSKKKEKNCLSKNKKEKEKEMFACVCV 
CVLFSIYAFPSKAQELSNPCKHYGNF 


5788 


19689 


A 


5834 


171 


2 


FVEGINISGNFYRNKJjKYIAFLRKRMNT 
npsrgpyhfrapsri fwrtvrgmlphm y 


5789 


19690 


A 


5835 


278 


404 


SVLI KFE KKTLKYKAI WPGAVAHACNPS 
TLGGRSRGITRSGE 














S KG JCEGR CE WQVX>LPS GTHA VFQFFLKQ 
Q G R V P G LTQ A VQ VAKM S L I DLAG S E RAS 
STHAKGERLREGANINRS LiLiALIKVLNA 
LADAKGRKTHVPYRDS KLTRLLKDSLGG 
NCRTVMIAAISPSSMTNEDTYSTHV 














AGRAyCYNGMCLTYQEQCQQIjWGPGKALt 
LGSTALPSPPLPSQPPLALRWWNLGFK 
WHL 






A. ' 








LRSVPCKDYLTQNHYITSPIjSSEEAAFP 
LAYVMVI HKDFDTFERL FRA I YMPQNVY 
CVHVDEKAPAEYKES VRQLLS CFQNAF I 
ASKTESWYAGISRLQADLNCLKDLVAS 
EVPWKYVINTCGQDFPLTTTRPV 


5793 


19694 


A 


5839 


385 


174 


GLAVEIGSRRIAEDGLELLASSDPPTSA 
SQSAGSTGVSHHAWPDTVSSVYWCFLCC 
TKQVWGAL.SIVSL 














SSIDEFCRKFRLDCPLAMERIKEDRPIT 
I KDD KGNLNRC IAD WS FY I TVM D KXiRQ 
CI 


5795 


19696 


A 


5842 


299 


3 


FSGIKY2CI IFIjFVCFFIjYIjIiGIGSPYV 
AQAGLTLLGS S DPP PS ASQS PG 1 1 GVSH 
YVQPIHTVAQLSPSSIFRTFSSFQTETV 
PIKHTSHSPSPSLY 


5796 


19697 


A 


5843 


391 


2 


APHAPAFLRARGEPQDPLSHPRVPAVSA 
NCRMWKHLPVHSSPTPRLTPLWKLQARW 

AREFLYPAKMSVSEVCSSGLSSPLLEQH 
KTWL I FYASGD I CSANV 


5797 


19698 


A 


5844 


479 


59 


FVGMEPGVGHASLAMHGLHIRSLGKIKL 
RKRKCTLF FNTQEKSARRRGHLLGENI Y 
LLLFAIALRILNCLLVQTSFVPDEYWQS 
LEVSHHMVFNYG YLP WEWTERLRS YAYP 
LI CAS I YKVLHI.L.GKDS VQMLVSGNKS S 


5798 


19699 


A 


5845 


406 


548 


SVQTKPFFETESHPVTQAGVQWHNLCSL 
QPLPCRSKQFLCLRLPSSW 


5799 


19700 


A 


5846 


2 


394 


TLCHRAKDPT VHHE SGR I MYLNGLCFLM 
GPAQLTQRLSVSRQGEC 


5800 


19701 


A 


5847 


472 | 201 


LSILSDTSLCLGRFFTYENGCAYFHEEE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

ntide 

location 

correspon 

ding to 

last amino 

acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylal«nine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leu cine, M=Meth ion i n e, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Sfop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














REGLARICRLS IHSRYEDFWDGFNVLY 
NKKPVIYRSAPARPGLGQSLCNQVMWYQ 
VAILNK 


5801 


19702 


A 


5848 


127 


3 


SSNVKAAWGKVGAHAGEYGAEALERMFL 
SFPTTKTYFPRCI 


5802 


19703 


A 


5849 


238 


1 


MAETHPNPTGLPIiAPALPCLAGSATHPC 
LFSPEI SLDVDADRDGWEKNNPKKVPS 
FQGRHHPEPRASEDTAPDPAGMY 


5803 


19704 


A 


5850 


423 


2 


LSPLGNWSSANQKWFDSVKPSRKDLFA 
VDTQVGTVPSLTAGEQGFGDGGKGRAGE 
QATQAAGDWSSNMWGSDG I LSHSAPLTV 
PQPLLTGPQGPCLCRPLPNQQTSFPALL 
PFSTQLCSMCPGPASARPPPLLLKPTMY 


5804 


19705 


A 


5851 


326 


682 


PSYSLLSLLQVKNEVEKLPRQQRKESMK 
QKMEEHTQKKQLLVSPWPQLGSAGAGAR 
PGMVTRPPLCLCP I PAPQI PRS EYI.NLA 
SSFMAPLSPPPCSLHRLSCLHLEVRNSI 
SGYPNCI 


5805 


19706 


A 


5852 


263 


30 


HEKTDDERGPGQSARSGAI TKPPGPPLP 
IEPHETTPEHPAPSGTIPEPPLPVEIiHE 
TTPQHPVPSGTIPEPPYIiLSQ 


5806 


19707 


A 


5853 


3S9 


2 


GLPTQREKFGASMRTRMTIQS IAWTSF 
VCLVGEGNNVQGFRAESRCWRYDPRHNR 
WFQIQSLQQEHADLSVCWGRYIYAVAG 
RG YHND LNAV E RYD P ATNS W AYVAP L KR 
RCMPTKAKRWRGRCISP 


5807 


19708 


A 


5854 


379 


2 


GRSLRYSGSCSGEENSTNNSAGQSRAVI 
AAAARRRGNSHNEYYYEEAEHERRVRKR 
RARLVVAVEEAFTHIKRLQEEEQKNPRE 
VMDPREAAQAI FASMARAMQKYLRAAKQ 
QNYNTMESILQHC 


5808 


19709 


A 


5855 


374 


2 


SDAGAPVNIYEFRHRPQCLEDTKPAFVK 
ADHADEVRFVFGGAFLKGDIVMFEGATE 
EEKLIiSRKMMKYWATFARAGNPNGNDLS 
LWPAYNLTEQYLQLDLNMSLGQRLKE PR 
VEIWNSTIPPCI 


5809 


19710 


A 


5856 


516 


18 


PYECKECGKAFNCGSSLVQHERIHTGEK 
PYECKECGKAFSRGCHRTQHQKIHRGET 
PIKCKECGKAFSWGSSLVKHERVHTNEK 
SYECKDCGKAFGSGYQLSVHQRFHTGEK 
LYQRKEFGKTFTHGS KLVHERTHSNDKP 
YKNKECGEAFLWTTYSNEKCIASAK 


5810 


19711 


A 


5857 


392 


3 


CERGMGGSPRALGRHWTSFLKLRLNCSV 
PGDSTFYFDVLQALTGPVNLHGRSALFG 
VFTTQTNS I PGSAVCAFYLDE IERGFEG 
KFKEQRSLDGAWTPVSEDRVPSPRPGSC 
AGVGGAAKFTS SRDLLY 


5811 


19712 


A 


5858 


391 


3 


ARRTTGMVPKAGGGKGRRGAVFRSYIRE 
IEELRSKLVESEAMNESLRRSLSRASDR 
SPYSLGASPAAPAFGGSPASSMEDASEV 
IRRAKQDLERLKKKEVRQRRKSPEKEAF 
KKRAKLQQENSEETNGE 


5812 


19713 


A 


5859 


409 


3 


SLPGEGDPWWAGQEELLFVQEGKLSLPV 
AERVLLRIACRYDPRSNSWAEIAPMKNC 
REHFVLGAMEEYLYAVGGRNELRQVLPT 
VERYCPKKNKWTFVQSFDRSLSCHAGYV 
ADGLLW I SGGVTNTAQ YQNRQC I 


5813 


19714 


A 


5860 


405 


135 


NDSRLSLCRQPMTLVTHVLLFYDSEEKA 
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otide 
location 
correspon 
ding to 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=lVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FHLLVALWKRMliPDYYNTRWGECPRVLi 
PRLGLQVPAGSMRVAALPRPSSRL.HVGS 
LGVRG 


5814 


19715 


A 


5861 


389 


2 


YYSELEEQLTDEFNAELNRVPLKRLDLI 
FVTFQDSRMAKRVRKDYKYVQCGVQPQQ 
SSVTTIVKSYYWRVTMAPHPKDI IWKHL 
SVRRFFWWARFIAINTFLFFIiFFFLTTP 
AI IMNTIDMYNVTRPCI 


5815 


19716 


A 


5862 


525 


0 


SRLPESERIHHTVGQKEQVLMDTSKTRP 
NNDVPE P PM P I ADQ VS NDDRP E GS VEDE 
EKKESSLPKSFKRKISWSTKGVPAGNS 
DTEGGQPGRKRRWGAS TATTQKKPS I S I 
TTESLKSL, I PD I KPLAGQEAWDI.HADD 
SRISEDETESKGDDGTHDKGIjTIC 


5816 


19717 


A 


5863 


399 


2 


ATRRNRNRVPSGMTRTNVREMIAAVGPG 
PSPYPLP PPPEGTS S I EYSNQGNTCQGH 
GNFDFPHGNPGGTSMNDFMHGPPQLSHP 
PDMPNNMAALEKP I SH PMQE TMPHAGSS 
DQPHPSMQQDKARNTPQPMY 


5817 


I97I8 


A 


5864 


3 


714 


RRPFIALCIiSNVAFMLPWQFAQFILFTQ 
IASLFPMYWGYIEPSKFQKI IYMNMIS 

VTLjSFILM fgnsmylssyysssllmtwa 
1 1 lkrne i qkxgvs klncwl i qgsawwc 

GTIILKFLTSKILGVSDHICLSDLIAAG 
ILRYTDFDTLKYTCSPEFDFMEKATLLI 
YTKTLLLPWMVITCFIFKKTVGDISRV 
LATNVYLRKQLLEHSELAFHTIjQLIjAFT 
ALAIL.ILRLKLVL 


5818 


19719 


A 


5865 


423 


1 


APPVSTAVAQSNSSEEEAREVGSPAQEF 
KYQKSLPPRFQRQQQQQQQEQLYKMQHW 
QPVYPPPSHPQRTFYPHHPQMLGFDPRW 
MMMPSYMDPRITPTRTPVDFYPSAIiHPS 
GLMKPMMPQESLNGTGCRSEDQNCVPPC 
I 


5819 


J 9720 


A 


5866 


497 


2 


AVGAGQKGGRGGGGRE.LMPFQKIWGGGA 
FTNNAHVGPLKIMPRLIRTGMLEEIHLE 
PGWQGLFPCVDELSDIHTRFL.SQLLER 
RRQALCPGSTRNFVIHRLGDL.LISQFSG 
HSAEQMCKTYSEFCSRHSKALKLYKELY 
ARDKRFQQFIRKVYCGRCRGSQQGR 


5820 


19721 


A 


5867 


382 


74 


LALSPRLKCSGQI I AHCSFDLLGSSDPP 
PSASRVAGTTGARRHARLPYSIjRHYHFV 
LRLFNPSKHSKLKHFHPILIRNPGKVGS 
YS FQMSDLLQSRRARQQS 


5821 


19722 


A 


5869 


378 


3 


SFSRSANLISHQRIHTGEKPFQCAECGK 
SFSRSPNLIAHQRSHTGEKPYSCPECGK 
S FGNRSSLNTHQG I RSGBKPYECKECGE 
S FS YNSNL I RHQRI LTGEKPYKCTDCGQ 
RFSQSSALITPV 


5822 


19723 


A 


5870 


375 


3 


IHVIVFNQATGHVMAKRVFDTYSPHEDE 
AMVLFLNMVAPGRVLICTVKDEGSFHLiK 
D TAKALLRS LGSQAG PALGWRDT WAFVG 
RKGEAECHWADTELNRRRRRFCSKVEGY 
GSVCSCKDPTV 


5823 


19724 


A 


5871 


373 


1 


QPEEVSGALS PPSAS A YVKLVLLGL IMC 
VSLAGNAILSLMVLKERALHKAPYYFLL 
DLCLADGIRSAVCFPFVLASVRHGSSWT 
FSAIiSCKIVAFMAVLFCFHAAFMliFCIS 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, l=lsoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VTRNMAIAHHLY 


5824 


19725 


A 


5872 


382 


2 


KSFSQKSQLIIHLRTHTGERPFECPECG 
KAFRE KS TVI IHYRTHTGEKP YECNE CG 
KAFTQKSNLIVKQKTHTGEKTYECAKCG 
ESFI QKLDLI IHHSTHTGKKPHECNE CK 
KTFSDKSTLIIPHV 


5825 


19726 


A 


58 73 


370 


3 


MGRVRAQNISGVMSGPQKMliMQSQFPTQ 
GQQGFCEGKEPYQAMSQNMGNTQDMFSP 
DQSSMTMSNVG ATR LS HMPL P P ASNP PG 
TVHSAPNRGLGRRRSDLTI S INQMGS PG 
IGHLNSTTCI 


5S26 


19727 


A 


5874 


362 


2 


GGKFLVLGNLPSKLEESMVQYYRLVTAA 
SLVRGQISEYNISLRASDGGSPPLSTET 
HITLHVIGINDNPPTFPHLSYSACIPEN 
NPRGASI FSVTAQDPDSNNNARITYALT 
EDTLQGVY 


5827 


19728 


A 


5875 


369 


1 


R I RPR PTARLAS ARTLHEVS LQESIRYA 
PGDAVEKWLNDLLCLDCLN I AR I VS VC P 
L.PEACDLSYVNRDTLFWCHKASEVFLQR 
LMAL YVASRFKNS PNDLQMLS DAPAHRL 
LNKCLLCPPPV 


5828 


19729 


A 


5876 


119 


39 


VI QDYTTP PNEELS RDLVNKLKP YMR 


5829 


19730 


A 


5877 


461 


2 


RRGKTSRRKPKEDPSGAAVPEMPKKSSK 
IASFI PKGGKLNSAKKEPMAPSHSGI PK 
PGMKSMPGKSPSAPAPSKEGERSRSGKL 
SSGLPQQKPQLDGRHSSSSSSLiASSEGK 
GPGGTTLNHSISSQTVSGSVGTTQTTGS 
NANSVQLPQPLCI 


5830 


19731 


A 


5S78 


503 


3 


PSPQVPGECPSPKKXGARAAFTTPDPAP 
LSPQSRVASSGSEQTEEQGSSRNSFQED 
GSGMKDVPS WLKSLRLHKYAALFS QMS Y 
EEMMTLTEQHLESQNVTKGARHKIALS I 
QKXiRERQSVLKSLEKDVLEGGNLRNALQ 
ELQQI I ITPIKAYSVLQATVAAATLY 


5831 


19732 


A 


5879 


421 


34 


LVHKVLSASNVLVDAEGTVK I TDY S IS K 
RLADI CKEDVFEQTRVRFSVNAMHYRTG 
NKGDVWRLGLLMLSLSQGQECAEYPVAI 
PSGLPGGFQDFLKKCVCLDDKKSMESPA 
WETKQNKSPANVLRPL 


5832 


19733 


A 


5880 


379 


3 


AQGLLLAWPNLNRQHCPCSCSHNLSTF I 
QNWKS PGLSLKTRAPGQGS VLS SS DLPR 
AVLTGTFAVMSVMVGSVTESLAPQALNG 
SMINETARDAARVQVASTLSVLVGLFQV 
WNSQEYVPPTTCI 


5833 


19734 


A 


5881 


429 


1 


SLVSYMDTESTAEKLGKELGLQAKELSA 
VHSSHHEIGVNDSNLFSLEMREPLESSN 
TKSFHSAVEIKNRSQHELPCFQHHGIDT 
PTSLQKRSACCPPSLLRLEETASSSQDG 
DPQIWDDLPFSESLNKFLTVLESEIAVT 
QCI 


5834 


19735 


A 


5882 


436 


3 


WSLVSYMDKKSTAEKLGKELGLQAKELS 
AVHSSHHEIGVNDSNLFSIjEMREPLESS 
NAKS FHS AVE IKNRSQHE LP C FQHHG ID 
TPTSLQKRSACCPPSLLRL.EETASSSQD 
GDPQMWDDLPFSESLIKFKTVLESEIAV 
TQCI 


5835 


19736 


A 


5883 


195 


353 


DS YS YVRS TAPAVAYDS KQYYQQP TATA 
AAVAAAAQPQPSVAETYYQTGGFS 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteinc, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5836 


19737 


A 


5884 


432 ■ 


1 


LSAKVPVASVSDQAAAMHLSQCPKNLAT 
SIAELRTASQKAHKLCGPMEIDSALNTV 
QTX.KNELQDAKMAAVESQLKPLPGETLE 
KCAQDLGSTSKAVGSSMAQLLTCAAQGN 
EHYTGVAARETAQALKTLAQAARGVAAS 
TTDP 


5837 


19738 


A 


5885 


313 


1 


GKSPHLLILQTSPLSKAPQPLILLLLYW 
SWVGVVSVTAASRETEAGIQVSSEEPGP 
RNVSPHSRLQDETNLRLEAENNLAAYRQ 
VREVQGRDREGRKGLMATLY 


5838 


19739 


A 


5886 


432 


3 


GSGFCSPILCLWWVEEAVGI IRLHLWLR 
ELGCWHSFSLPGTYSLEFLSTCTQVLPY 
SWSGLLFASGNYLSLYLVPSRAETDSRG 
SQPLPVARHGVGACKCKPRQTRPGLS PQ 
VCVEKliMPLSSFCSAFQQNTYNKQPMSR 
PV 


5839 


19740 


A 


5887 


417 


2 


ASLMVAPDYAE I SPLAMPAPS SGWCTP 
IMSTSSSEAMSTPLMLAPDSGELSPILM 
QDMNPGVMSTQPVPAPSFEAMSPLQITD 
EDTEAMSKVLMTALASGEI SSLLMSGTD 
SEAISSLIMSAVASGGTSPQPTSTLY 


5840 


19741 


A 


5888 


420 


3 


KFDL I KKLDLKTMSSYDLDI EASDGGGL 
SGKCSVSVKVLDVNDNFPEliS ISSLTSP 
I PENS PETEVALFRIRDRDSGENGKM I C 
S IQDDVLFKLKPS VENFYRLVTEGALDR 
ETRAEYNITITITDMGTPRLKTEQSV 


5841 


19742 


A 


5889 


321 


414 


FNMR I WPGAGAHACNP S TLGGRGGR I TR 
SAD 


5842 


19743 


A 


5890 


415 


1 


PGL PGTSVKG I PAS KQS PHES PRTLHL.K 
TSPI IQQLGLYLSHTAIRYHPQETLKEF 
VQLVCPDAGQQAGQVGFLNPNGSSQDKV 
HNPFLPTPMLPPPPPPPMARPVPL.PVPD 
TKPPTTSTEGGADYPTSPTYSTPSLY 


5843 


19744 


A 


5891 


193 


3 


KPS S KVWSRRD PFL.LSS VEQ P I KDAV I T 
VPVFFNQAERRAVIiQAARMAGLKEMQL I 
NDNTACI 


5844 


19745 


A 


5892 


412 


2 


KHQVSDNKDS FYVSLYPDFSSLSRAI LD 
WQ F F KW KTVT WYDDSAGL IRLQELIK 
APSRYNLiRLKIRQLiPVDTKDAKPLLKEM 
KRGKE FHVI FDCS HEMAAG I LKQALAMG 
MMTEYYHYIFTTLDLFALDVEPSV 


5845 


19746 


A 


5893 


413 


3 


ELLLCSNTSCCRCFCVECLEVLVGTGTA 
AEAKLQEPWSCYMCLPQRCHGVLRRRKD 
WNVRLQAFFTSDTGLEYEAPKLYPAI PA 
ARRRP IR VLSL FDG I ATG YLVLKELG I K 
VGKYVASEVCEES IAVGTVKHEWR 


5846 


19747 


A 


5894 ■ 


424 


3 


CSGRREPSVRGVQCKGDQGHHSARMAPS 
E APGTRS CTP SHGQNTAAE AT P AQ KTP A 
KWYVFSTEMANKAAKAVLKGQVETMVS 
FHIQNISSSKTERSTAPLNTQISALiRND 
PKPLPQQPPAPASQDQNYSQNTRLQPCI 


5847 


19748 


A 


5895 


400 


1 


AS VQN P ALRLVT REE FAIMQT PAGELYD 
KS I IQS AQQDS I KKANMKRENKAYSFKE 
QI IELELKEVSTRRHLRKVAAELGGLKL 
VAFPTAHMLFTWNILRPMAHSSDAPDQC 
LVHWHTFPS PLLHS F FHPQLY 


5848 


19749 


A 


5896 


421 


3 


VSSIQMGAVNIiLGGGLDSLLGSDLGGGI 
AGSPAVGQSFIPSSVQATCAPSPTPAW | 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

last amino 
acid 

residue of 
peptide 


1 Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparag»ne, P=Proline, 
Q = Glutamine, R = Arginine, S = Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Srop codon, /=possib!e 
nucleotide deletion, \=possible 














SSGLNDL.SELSTWIGMAHGGYVAPKAVW " 

LPAVKAKGLESSGTFTRRQGHIYMAMDF 

TNKALQHMTDSAIQFTKNSFGVIPSCI 


5849 


19750 


A 


5897 


384 


1 


PRASRFIHEAGIVPLPMRTHTPKKPFTC 
E VAG VQ FS RNVKVK I HMR KHTG ER FYS R 
QSCSVRFVHSYDLKNHMHLHTGORPYEC 
HLCHKAFAKEDHLQRHLKGQ I CLEVRTR 
RRRKDDAPPHYPPPCI 


5850 


19751 


A 


5898 


429 


2 


QDIAAWQSLTQVLTPESWRKANIMTEPQ 
KSQERYKGIYVKEKLYRRARHDESLNWT 
SCDHHESQECKGEDPGRHPNCGKNLGMK 
STVEQHHWH VL PQ P FTCNNCG VAFADD 
TDPRAHPSTHLGEKSYKCDQYGKILSQS 
LY 


5851 


19752 


A 


5899 


424 


1 


G TMLQ ALAAHDAG SRAHVL LS L S QQDG I 
EQHMDFDSRYTLLELFAETTSSEEHCMA 
FEGIHL.PQIPGKLLFSLVKRYLCVTSLL 
DQLNSSPELGAGDQSSPCATREKSRGOR 
E LEFS MAVGNL I S ELVRSMG WARNLS EH 
V 


5852 


19753 


A 








YFMLKAEHVISYVCSENSLMVWNMRFC 
LIGLIHIQEFYFFEMEPHRAIQAGVQWC 
DIiCSLQPPPPGLKRCSCIPPSSSRDYRC 
APPYP 


5853 


19754 


A 


5901 


418 


10 


GRSPKPGDRLWESVHLSFPGPADSLSGG 
SAPASSYEPSQRSSFSSNRSQRGSTSTR 
NSSQKGSS VLS I KQKGKRELYMEKLQEH 
LIKAKAFTIKKKFSQNLVAQLWYFVKCV 
Y FGLSAYQ I PRG YPTRVLGNFLT K 


5854 


19755 


A 


5902 


402 


— 


GYRHPELEIKSVDGFQGPEKQPVILSLV 
RSNRKGEVGFPVEDRR INVD VTRARRLV 
TVI CDYRTVSSHAFLiKPLVEYFTQHGEV 
RTAFEYLDDIVPDNYSHEDSQGSSQAAT 
RPQGPGTSTRTKKQRQEGGMY 




5855 


19756 


A 


5903 


346 


3 


GICPLSHIKMTDARCQIYEMKGGEKSPK 
DTGKE PGHS EAKTGPPQVLAGVPAQPEA 
PQPGPNTTAAPVDSGPKAGLAPETTETP 
AGAS E TAQATDL ShS PGGES KANCS PED 
PMY 




5856 


19757 


A 


5904 


139 


212 


EVENTJDD I SHHHHHHHHHHSNS KS 




5857 


19758 


A 


5905 


2 


345 


PCGCWGRCALLLISAAAKAKSKCGPTFL 
PCASGIHCIIGRFRSKGFEDCPDGSDEB 
NCTANPMLCSTARYHCKNGLCIDKSFIC 
DGQNNCQDNSDEESCESSQGVGYVCGFK 
KA 




5858 


19759 


A 


5906 


432 


17 


PQTTPHRTFGGGKAAWLLAVGGQFLLC" 
WSPYFSFHLYVALSAQPISTGQVESWT 
WIGYFCFTSNPFFYGCLNRQIRGELSKQ 
FVCFFKPAPEEELRLPSREGS IEENFLQ 
FLQGTGCPYKSWVSRPLPCIAAALRV 




5859 


19760 


A 


5907 


407 


1 


PYACGECGKSYRQSSSLVSHRRIHSGVR "' 
PHHCDECGEFFSRKYDLLIHQRVHSGER 
P YKCS ECGKS FSHS CS L I AHQR IHTGMR 
PYECSECGISFIHSCSLITQQRVHLGTR 
PYMCSECGKSFSQSCRLIKHRSV 




5860 


19761 


A 


5908 


419 


3 


VliAKKTI ITKSARDCHEFGNILHIiSTNL 
VAS I QRPDKHESFGNNMVDNLDLFS RSS 
AENKYDNGCAKLFFHTEYEKTNPGMKPY 
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SEQ1D 
NO: of 
nucleotide 
sequence 


SEQ ID 

NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
ncid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalar»ine, 
G=Glycinc, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GYKECGKGLRRKKGLiSIjHQRIKNGEKPF 
ECTACRKTFS KKSHL I VHWRTHTGVY 


5861 


19762 


A 


5909 


441 


57 


KVMGIFKSSLFQALVDIQEFYEVTL.INS 
Q KS CE QK I EEANQ VLQK WE KTSL-LAP CH 
DRLQKSSELQSLIGQHSSLETTDGPQPW 
LFPRGSEGHRGKTYDHCHSIGLQIAVDQ 
RKMLHVLS KI KKIRV 


5862 


19763 


A 


5910 


3 


317 


CLLML WRRWG S RG S RQRRL Q AT P S WE S T 
KTADGQGDRRESGSQRHCSSPHPQSKST 
TTSSVPAAREGAQEAAQGGPRKGQTSRP 
QVSTRQCAMPPARHRKIiENDA 


5863 


19764 


A 


5911 


183 


1 


QNSDSKDSLNRVASRDHAKPNLTCHVSP 
AIQGTGSISESSIPSVSDTSTPRRSRRQ 
LPPCI 


5864 


19765 


A 


5912 


395 


1 


KTI Q FN I C VKVFHR FS NSNKDK I R Y AGD 
KTFKCKECGKSFHVL3RLTQHKRIHTGE 
NP YTCEE CGKAFNWS S I LTKHKR I HARE 
KFYKCEECGKGFTRSSHLTKHKRIHTGE 
KPYICEKCGKAFNQSSTPV 


5865 


19766 


A 


5913 


449 


2 


FGSHLEKEDEKKQELVDKAI KPS IEATL 
ESIQRKLQYKRAESSRPEDIKDMTKAQI 
ANEKVALQKALLYYES IHGRPVTKNERQ 
VMKPLYDRYRLVKQILiSRANTIPIIEEE 
EGSEADSNVKPDFMVNKKTDFSARMVKR 
PLGGSARPD 


5866 


19767 


A 


5914 


464 


23 


SAHMTETRSKSFDYGSLSLTGPSAPAPV 
AP PARVAP PERRKC FLVRQAS LSRPPET 
ELEVAPKGRQESEEPQPSSSKPSAKSSL 
SQISSAATSHGGPPGGKGPGQDRPPLGP 
TVPYTEALPVFHHPVAQTHKHEKPYLPP 
LYCGRCG 


5867 


19768 


A 


5915 


366 


1 


LVSFINFFTSVtATLWFAVLGFKANIM 
NEKCWENAEK I LG YIjNTNVLSRDLI P P 
HVNFSHLTTKDYMEMYNVIMTVKEDQFS 
ALGLDPCKLEDELDKSVQGTGLAFIAFN 
EAMTHS PAC I 


5868 


19769 


A 


5916 


246 


366 


TSVKEQQMPGAVAHACNPSTLEGQGGRI 
TRSGVRDQPCQH 


5869 


19770 


A 


5917 


400 


3 


NDNAPEFYQSVYKVTVLENAFNGTLVIK 
LNATDPDDGTNGD I VYS FRRPVWPAWY 
AFTINPNNGEIRTKGKLDFEEKKLYEIS 
VEAVDKGNIPMAGHCTLLVEVLDVNDNA 
PEVTITSLSL-PIREDTQPCI 


5870 


19771 


A 


5918 


410 


200 


CTPPQPVKCRFLNRDRVSSCCPGWSQIP 
GIKRSSYliS LPE CWDLRPVI LLFC FLSH 
ISSKQPYFLPPSYR 


5871 


19772 


A 


5919 


214 


407 


MYFYIDRASLiLSPRLECI WTIAHCSLE 
LL-CSRDLPASASQSAGITGIRHHTWUCT 
HFYSSFKT 


5S72 


19773 


A 


5920 


476 


39 


VPYGLFVGSTGRLGLP YLRGTSH PL.RGC 
L.HAAALNGRRIiVQPLTRNKHEG CAEE FS 
ANDDVALGFCGSHSLAALPAWGTQDEGT 
LEFTLTTQSWQAPLAFQAAGWHGDFIHV 
DI FEGHLWSMVEKGQGTVLLLNSVPVTD 
AQPHM 


5873 


19774 


A 


5921 


430 


2 


S1.RPSSTGPS PSGGLSEEPAAKDLDNRM 
PGLVGQE VGS GEG PRTS S PL- FNKA VFLR 
PSSSTMILFETTKSGPAL.GKAVSEGAEE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alaninc 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparngine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














AKLGVSGSRPEVAAKPALPTQKPAGTLP 
RSAPLSQDTKPPVPQEEAGQDHPPSKAS 
RC 


5874 


19775 


A 


5923 


41 1 


2 


MSLKVDVEALENSAGATYIRKKGGKVTG " 
DSQPKEQGQGVLKKKKKKKKGMVPKNYD 
PKVTPDPERWLPMRERSYYRGRKKGKKK 
DQ I GKGTQGATAG ASS ELDAS KTVS S P P 
TSPRPGSAATVSASTSNIIPPRLY 


5875 


19776 


A 


5924 


402 


2 


LQSGAIMNKFYQPHEAHIPYLLQLFIDY 
MLYGMNLINLAAVKFRKARRKSNTLHAT 
GSCKNHLSGNSLADTLFRWEQDEIPSSL 
I LEG VE P QST CE LE VD AVAAD I LNRL D I 
EAQIGGNPGLQAIWEDEKQRR 


5876 


19777 


A 


5925 


215 


403 


AETSTYFFFFLERQFPFAPQLGGHGRNL 
NSLNLLPRGLREFSRLSLPGGWIYGGAP 
PSPTFFC 










320 


1 


PGRRDRWIWGQGECLEYLLASMTQWCER 
EI S S IAPGELCCLLLSFLPQEB CQNYVR 
VL I VAGRKVFMCGTNAFS PMCTSRQVGN 
LSRTTEKINGVARCPYDPRHRV 








5927 


400 


1 


WATDGGLPLLAS S ATVS VALQDVNDNE 
PQFQRTFYNASLPEGTQPGTCFLQVTAT 
DADSGPFGLLSYSLGAGLGSSGSPPFRI 
DAHSGDVCTTRTLDRDQGPS S FDFTVTA 
VDGGGLKSM VYVK VFLS DEMY ■ 


5879 










62 


HNTEVLVGESVTLECSATGHPPPRISWT 
RGDRTPLPVDPRVNITPSGGLYIQNWQ 
GDSGEYACSATNN I DSVHATAFI IVQGG 
YPSSVPE PGLS P I LTPPLARAAI PTDAP 


5880 


19781 


A 


5929 


403 




LASDGGDPVLSGTSRICVKVLDANDNAP 
VFTQ PEYR IS I PENTLVGTR I LTVTATD 
ADEGYYAQWYFLEKSPGETSEVFELKS 
TSGELTI IKDLDYEDATFHEIDIEAQDG 
PGLLTRAKVIVTVLDVNDNANV 


5881 


19782 




5930 


408 


3 


VASDGGDPVLSGTSRICVKVLDANDNAP 
VFTQ PEYR I S I PENTLVGS R I LTVTATD 
ADEGYYAQWYFLEKSPGETSEVFELKS 
SSGELTI IKDLDYEDATFHEIDIEAQDG 
PGLLTRAKVIVTVLDVNDNANV 


5882 


19783 


A 


5931 


126 


391 


PVLTCAHFHFSAGAGGSYFMISRSLGPE 
FGGAEGLCLYLATTFATAMYILGAIQIL 
LVSQAE PWSTLAM I KQFLWTPNRRLMVL 
I PTC 


5S83 


19784 


A 


5932 


436 


359 


NKEWLPGSKVPEKS INDVKNTSGLL 


5884 


19785 


A 


5933 


365 


3 


LI IGMLTAI IGDLASHFGCTIGLKDS VT 
AWFVAFGTSVPDTFASKAAALQDVYAD 
AS IGNVTGSNAVNVFLGIGLAWS VAAI Y 
WALQGQE FHVSAGTLAFS VTLFT I FAF V 
CISVLLYR 


5885 


19786 


A 


5934 


394 


3 


FQYFSRTDRVLKHERMCHENHDKKLNRC 
AIKGGLLTSEEDSVFSTSPKDNSLPKKK 
RQITEKRSSGMDKESALGKCDLKKVKND 
YS PLYS S STKVKDE YMVAE YAVEMPHS S 
VGGSHLEYASGEIHPPCI 


5886 


19787 


A . 


5935 


393 


1 


REDLIAGIDEFLDEVIVLPPGEWDPNIR 
IEPPKKVPSADKRKSVFSLAELGQMNGS 
VGGGGGAPGGGNGGGGGGGSGGGAGSGG 
AGGTSSGDDGEMPAMHEIGEELIWTGRF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla!anine, 
G=Glycine, H=Histidine, I=lsoleucine, 
fv = Lysine, JL=jLeucine, M = iVlethronine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W^Tryptophan, Y— Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














FGGLCLD I KRKLPWFP SCI 


58S7 


19788 


A 


5936 


410 


126 


P I CYYKKAGRARWLTPVI PS S WDYRHQA 
QL I FVFL VE TG FHR VAQAGP E LMTS GD P 
LAYNFLCSYPNEVFRSQASASDLQFVLR 
NLEPPDAGNV 


5888 


19789 


A 


5937 


426 


1 


DATFHE IDI EAQDG PGLLTRAKVIVTVL 
DVNDNAPEFYMTSATSSVSEDSLPGTI I 
GLFNVHDRDSGQNAFTTCSLPEDLPFKL 
EKS VDNYYRLVTTRALDREQFS FYNI TL 
TAKDGGNPS IiS T DAH I LLQVAD I NDNAP 
CI 


5889 


19790 




5938 


438 


3 


ADPTKGLLRNGSVCVRAPGGVSQGNSVN 
LKNSKQACLPSGGIPSLNNGTFSPPKQW 
SKESKAEQAESKRVPLPEGCPSGAASDL 
QSKHLPKTAKPASQEHVRCSAIGTGESP 
KESAQLSGASPKESPSRGPAPPQENKW 


5890 


19791 


A 


5939 


3 


191 


YSVCLW3QLLRRLRQEDHLS PGGGGCTE 
PWNTERLSKNKKTKKREREKQCGFHSM 
WYCNLK 


5891 


19792 


A 


5940 


399 


1 


QYS DDNDMS WKVTRAAAKCLDA WS TRH 
EMLPE F YKTVS PALI YRFKEREENVKAD 
VCHAHHS LFKQTS PVQS WLCGLDAMDQG 
GTPLAMLQSQVANIVKAVHKQMKEKSVK 
TRQCCFTLLTELVNVLLGAVY 


5892 


19793 


A 


5941 


411 


3 


SPCEGPRRFQCKSGKRVDGGKVCDVQRD 
CRDWSDELLKVWCGACLRPLAGLSLLPS 
PSWYLGSRPSSAPCPDTFCSGPLFGFMC 
RPMASHGAFRPQASGLHLYKVLRACPSQ 
VLKNYVFSHKLGLSSFLPRSDHV 


5893 


19794 


A 


5943 


431 


26 


KAWG I PGDMG PPGI TVR PG YNGLPGNP 
GVQGQKGEPGVGLPGLKGLPGLPGIPGS 
PEEKGSIGVPGVPGEHGAIGPPGHQGIR 
GEPGPPGLPGSVGSPGVPGIGPPGARGP 
P VGQG P PGLSG P LV I KGE VS R VR 


5894 


19795 


A 


5944 


396 


3 


PLPVELIRVPAFLDLFMQSLFKPGARIS 
QDLKHKYIHILAYAASVVETCKKNKRVS 
INKDELKSTSKAVETVHNLCCNENKGAS 
ELVAELSTLYQCIRFPWAMGVLKWVDW 
TVSEPRYFQLQTDHTPTV 


5895 


19796 


A 


5945 


465 


32 


ERVTLAD IT WCTLLWLYKQVLE PS FRQ 
AFPNTNRWFLTCINQPQFRAVLGEVKLC 
EKMAQFDAKKFAETQPKKDTPRKEKGSR 
EEKQKPQAERKEEKKAAAPAPEEEMDEC 
EQAMAAEPKAKDPFAHMNKSTFVLDDCI 
AAAL 


5896 


19797 


A 


5946 


414 


2 


ATPPVRCI RKE IRNW YVD I Q PVQEPKAQ 
ACGNGHG I I I I AETSTGCLFAGSSLGKR 
GVNADKVAIEAAEMLLANLRHGGTVYEY 
KQDQL I VFMAIiANGVS 1 1 KTGP VTLHTQ 
TAILYDEQIVKVNCKENHSVDEQVY 


5897 


19798 


A 


5947 


411 


1 


EPCWRRIADSSVQTDDEDGESRYLLSR 
RRRARRSADCSVQTDDEDSAEWEQPVRR 
RRSRLPRHSDSGSDSKHDATASSSSAAA 
TVRAMS S VGIQTI SDCSVQTEPDQLPRV 
SPAIHITAATDPKVEIVRYISAPCI 


5898 


19799 


A 


5948 


153 


409 


LPFFPILLPFISFFRYRVGLLSPKLKCS 
GVI I AHCSLELLGS SNPPTS ASRVI FLY 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 
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X=Unknown, *=Stop codon, /=possible 
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LSLCLFVFEMESCSVAQAGEQWRDLGSL 
Q 


5899 


19800 


A 








RAAAMSSRNFRFLSSHRELYFGVDLPSG 
NLVVREPADREQL.CRAKAACVLTYDLVL 
EDPLELHKIRIHVLDTNDNSPLFPAGDV 
QLHIPEFLTPGARFTLPNAQDDDEGSNG 
ILSYSLSPSQHFRLDMGSRVDGSEYPEL 


5900 


19801 


.A 


5950 


268 


19 


QDNWAYYCTVPEYCKTKLNI LAWWLVIN 
YLSGDFYFFFIFFLDGVSLCHPGWSAVA 
RSQLTATSSSQVQGRFSCLSHCIAAAL 


5901 


19802 


A 


5951 


389 


3 


QQMDLARQQQEQIARQQQQLLQQQHKIN 
LLQQQIQVQGHMPPLMIPIFPHDQRTLA 
AAAAAQQGFLFPPGITYKPGDNYPVQFI 
PSTMAAAAASGLSPLQLQQLYAAQLASM 
QVSPGAKMPSTPQPMY 


5902 


19803 


A 


5952 


2308 


307 


RRR PAR KKT WVRDGG PHQGL/ YPQLPSA 
VLQPTQPGHGPRALGKGRELESWQGKRP 
GK/GQTHRRKTRGTAS PAVYFS VEWGD \ 
SGGCPMTGKGAKQQRAGLGAAPNPELSG 
EQGTGRGRAQDSQGHGGGKGRRGQHPPG 
ND/PRGVSGMGQESVHSPEARPTGGGTG 
/EMLGRTWRHQQGLVGGTGNLLETGSWS 
GSGSWGRG WLRPQEGQVTTGMGLAGR / 
YQPARQRAVELS PGAQGLRQRRGGWSGP 
PQETEQSGVPGG*GSWPLESHQGEQ\PA 
AGQKQSLQMLPGGFPCWS/SGMGGGPQQ 
L.LES EG AG P S PGG GRHHGKGR VAVT TT P 
REGD \ RGQS PGGHTLQLFPCPL WS * ETQ 
GQGSRTVQHQEWGGEGREGEGAGSDQSR 
ALGIA* ICPHTLRPTSMEPQSTAPAPGQ 
PPGPPSWGHRGHQGMGYGGRCPGQ/ICG 
RGRAQLGSTVG*RDG/ QRG*MQGRDQQG 
PRSNGAGVGSCSHTSQK*IPPSSLCT*N 
SSHGPASG\QLWWSSPFIHSPGETNIPH 
TLTEPHSVPGWCWDTLRRHGAGQGHPGM 
ARSGTGEGQ/ QRGRY*ERGRRGRRQQKQ 
KKQGLKEPG*RAAPT3_iKGATRPIjCRCIjR 
KVQKPKQDGDVGS *LLKVFRAPGALGTK 
PQRTCRGPADFFHALSGLSNVHRSYCST 
NSGFGACM 


5903 


19804 


A 


5953 . 


461 


117 


HPRDIIGPVHQGLPPSPQPPPPCKQAPT 
TPSS \M* P * TNRHDTGLSGPVQVPGADW 
KPLNAP*LPEIAFGEEPCRPGVPDPPYV 
WLPDSAPEHCVTWATSPGL.SDLLSYPRR 


5904 


19805 


A 


5954 


845 


610 


FFETGSY/ SVAQVGAQWCNFGSLQSQPP 
RLKRS CHLS PPSSWDYRGAPPRLARF FF 
FC I FYRNGVS PCC PGWS * TPELQ 


5905 


19806 


A 


5955 


1 


378 


KTPVSDR\ATKCCSESLGNRRPCFSALE 
AYETYVPKEFNAETFTLHAD I CTLSDKE 
RQINKHTALAELVKHKPKARQEQLKAVM 
DDFADFVEKRCMADDKETCFVEEG*TLV 
AASQAALGLLHRIK • 


5906 


19807 


A 


5956 


723 


343 


GCQEG I CPSPCPVI PATREAEAGESLEL 
RRRRLH*AEIVPLHS\KAQRSLCSSDFI 
RH.VIFSGMFLVFTLAGAL.FLHQRRKYR 
SNKGESPVEPAEPCRYSCPREEEGSTIP 
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E=Glutnmic Acid, F=Phenylalanine, 
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IQEDYRKPEPACSP 


5907 


19808 


A 


5957 


55 


491 


AQHFNPHNWP VT* LLPHFTNEGGQVTEA 
RCLKWQAR\CLVPMQKVPNAMPGT* * PS 
AEVTPPVCNQHHWPQVEG I PS S /APAFQ 
WPLTMGHPC\WESAPRQDASTSPRPGTA 
SHCPSAVAFQERGSSPCPPHHSDSGLCL 
LPRLLPAP 


5908 


19809 


A 


5958 




413 


KKPDQ \AE \REHLCTSL \ WSGRNTDKNG 
EELHGGKRVMERLKKAVKIANQCMDPSL 
QMQLY I E I LNRY I YF YEKE YDAGTIQVL 
NQLIQKIREDLPNLESSEEPEQIT*HFH 
NTLEHLRLRRESPESEGPIYEGLILYE 


5909 


19810 


A 


5959 


871 


564 


WMCHATFGSHLN*AWRLRPVNSG/LLM/ 
PSEGKAKKESTLMVGVTNPD\MKGKIRL 
LLH S DGKDE Y VWNAGHPD I LG \H P LTLQ 
CPIIKIKKKL* *PKSGRTADGPDHSRIK 
V*VTPLGKEPTTDEVLAEGKENME*VTE 
TC* IDMN* VLQGQTWNLQR 


5910 


19811 


A 


5960 


390 


271 


GINFFSRN/RSLTT*SRLV*NSWAQVIV 
LPWPSRVLGFQA 


5911 


19812 


A 


5961 


333 


922 


GLSCRVFGRTLSCPPCILRHLSEVETLQ 
TLQKKEI EDLYS RLGKQP P PG I GAPAAM 
LSSRQRRLSKGSFPTSRRNSLQRSEPPG 
PGIMRRNSLSGSSTGSQEQRASKGVTFA 
GDVGRM* IQNRSHVSPPHQGPPWSLCSQ 
NliMLSDQQN*ARKI PTLKG* KA\GGHGE 
CSSI I VK S QT YVNCLLCGG VS S AAYHLH 
L 


5912 


19813 


A 


5962 


705 


387 


CVAQTGVQ*HDLGSLQPLPPGFKQSSHL 
SLPSS WDYRRVTPRIjPNFN * FFVETDFC 

hiaqavlqllNssnnlpasasqsagvtg 
vshrarptltld i qae i pdlkp s g 


5913 


19814 


A 


5963 


425 


182 


ASNQ * DS VG VG PS E PGAG YNLLVRFLKC 
\LEKHSSWVGVTQFSRCHLSPLSLTRKG 
NSLTPCTSRVRQCLALLWLTHGSRTH 


5914 


19815 


A 


5964 


388 


14 


PPMYTQLCSIRSTQAFCFVFFSRDE/SL 
ALLPRLVLNSWVQAILLPWPPKVQG*QA 
QWLTL VI PAL WEAMAGGLLE PRSSRPAW 
ATTQDL ISTKENKTKSLSAS D AAE LCVH 
WRLLKI 


5915 


19816 


A 


5965 


1 


373 


DDQRVKSVINLLLAAYTGDVSALRRFAL 
SAMDMEQRDYDSRTALHVSAAQGHVEW 
KFL\LEACKVNPFPKDRWNNTPMDETI,H 
FGHHD VFKILQEYQVQYTP * GDSDNGKE 
HHTVHKNLDGLS 


5916 


19817 


A 


5966 


3 


329 


HE ETS RYEIYELI HVHP YAVKQS FDLEE 
YSLSQS TLEQVGHF *GFVSSMVYVYKTF 
PGSCRGSTALFSSIjYFLPPFQVFLELiSK 
EQELGDFE / EDFDPS VKWKLLPQEEP 


5917 


19818 


A 


5967 


2 


47 


MEDRRLVRGIPCPQHNARQCPAVPPGIQ 
AYGAAPFEDLQVDFTEMSKCRGNKYLLV 
LGRTYSG * VEAYPTRTEKAREVTRVLLR 
DLLPRFELPLRIGSDNRPAFVADLVQKA 
AKILR I TWKLHAAYWPQSSGKVERMNQT 
IKNSLGKVCQETGLKWIQALPMVLFKIR 
CTTSKRAGYSPYEILYHRPPPILRGLPG 
TP*ELGEIELQRQLQALGKITQTI/YSP 
SKWPVSLFSPVHPLSPGDRVWIKJJWNVA | 
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X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SLCPLWKGPQTVVL,SPPTAVKVE*IPSW 
IHHSHVKP*TA 


5918 


19819 


A 


5968 


2 


150 


KSQSI*QSSLLSYFEKLPQPPQPSA\TS 
TSKQDPPIATRliWliAEGSDDH 


5919 


19820 


A 


5969 


3 


371 


HEGKSGPMTGWLVAVGEVAMKI LLLCL 
CLILLRVRSCRRKAARAALGMEAADAVT 
D*SPDSRL3_,PD\PPHPVPPQSERPCFLC 
RLCMVCPLLDSPDPSFPCHPTSPQTLRA 
KDSAALDTLHV 


5920 


19821 


A 


5970 


396 


58 


YWVLSKGPVTCMLLNSVFSFQASSCLSF 
LINS*AP*PGIFLGMSVFILPFFVFVGQ 
AGLELLTSGDLPDSASQ\ARITGVSHCA . 
QPHTALSQ YLEQCLAPSRQL I S VS h I LR 


5921 


19822 


A 


5971 


442 


120 


ICMKQGLALSPRLDSSSAFSNTVQPMPP 
RFKSSYGLSLP/RSWDYRCTPPHMARSG 
NGV * P \ FMVETGS W * AAQGGliELLS S SN 
LPP/ SAFQSAGITS KGHCTRPAVFLVC 


5922 


19823 


A 


5972 


459 


434 


MWPLPWFSISSPSSPPPT*VSSNNHPCA 
VPRAYPGADI PGPGWQRLPPPHRGLSGP 
PESCLLTTILCSKHPNGCPLTQNPRSLP 
GPRPCLIYSTPSPPP*AQL/YAAPSP*L 
YTWREALIR 


5923 


19824 


A 


5973 


681 


1758 


VANKCQIPDI KAKTYCI CNTK5KRSYLK 
T*NLHRGF\lKKQIEEFNIGKRHLANMM 
GEDPETFTQEDIDRAIAYLFPSGLFEKR 
ARPVMKHPEOI FPRQRAI QWGEDGRPFH 
YLFYTGKQSYYSLMHDVYGMLLNLEKHQ 
SHLQAKSLLPEKTVTRDVIGSRWL.I KEE 
LEEMLVEKLSDLDYMQFIRLLEKLLTSQ 
CGAAEEEFVQRFRRSVTLESKKQLIEPV 
QYDEQGMAFSKSEGKRKTAKAEAIVYKH 
GSGR I KVNG IDYQIiYFP I TQDREQIiMFP 
FHFVDRLGKHDVTCTVSGGGRSAQAGAI 
RIAMAKAIjCS F VTEDEVEWMRQAGLLTT 
DPRVRERKKPGQEGARRKFTWKKR 


5924 


19825 


A 


5974 


1232 


980 


SLSLSPRLECSGVTAHCSLRLPASSNSC 
FSASQVAGTIGACHHIWLVFVFL.VETGF 
HHVGQTGLELLT\+VICPPWPPKVLGLQ 
A 


5925 


19826 


A 


5975 


1 


259 


LTS YDYRRALP CLVN \ FCVY FL KREE LA 
LLPKLFWNSWA*ATLLP* PPKI FGLQAQ 
VAMLRLKFPACLPACLPAYLPPFLFSFI 
PP 


5926 


19827 


A 


5976 


24 


223 


PYLTQ I STNESS VCSHARAEFLDHQNQL 
/WLGTVTYTCNPS/TLGG*GGRIT*GQE 
FKTSLGNTARPCL 


5927 


19828 


A 


5977 


395 


136 


GSWDYTHVPPRSANF/LVETGFRHVA*A 
GLELLGNPPTSASQSAKITAMSHCTWPS 
SYY*CEYNARFGSLHKRSGMTIYIKSYK 
RQN 


5928 


19829 


A 


5978 


1 


785 


GTRLKSGTNTKKALQAVYSMMSWPDDVP 
PEGWNRTRHVI ILMTD/GSEGTSLLSQP 
PHLLR P ACGP * VHL * HYTHG WG P ECD S * 
LAVHliS CDPS * G I LPKPCD QLS LTLP QL 
AHPAMCIPAFSQFIFLISYPHGPVSSAG 
LHNMGGDPITVIDEIRDLLYIGKDRKNP 
REDYLGE* PA*DPAPHFLRAWTIiII J PFY 
PSDVYVFGVGPIiVNQVNINAIiASKKDNE 
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QHVFKVKDMENLEDVFYQMIGREIQGNK 
EPNSPQVPLK 


5929 


19830 


A 


5979 


2 


419 


KAIAPIDSDSSDGSGQNKLKT*KEFIIL 
NAI KS I YDSWEEVKI STLTGVWKKLIP I 
LIEDFEVFK\ADLVEIARERELEVEGEL 
/LQSQDKT 


5930 


19831 


A 


5981 


1 


154 


KKC* MML.T WWCM PVG/ PTPRAE AGGS PE 
TGNLRL* CTMIMSVNSHCTPAWAT 


5931 


19832 


A 


5982 


12 


311 


RKSLNFVEGCNIRL.NKCRHTNFVPQTI F 
FFLLFL*LGQC* * WVFSFFFFFFFCFV 
LFFVFVCWFVFFCFCFF/ GCLFWFLL 
/YFYELFCIVI IMCYYLR* YV* I VYMLF 
FVCWIFVLYLCFHY*LFLFFFVFVFLLF 
I CGVFWFYELFCI VI IMCYYLR 


5932 


19833 


A 


5983 




203 


IGQKRASEDTT *GS AD / PKKSSAGPKRD 
ARQIYNPPSGKYSSNLGNFNYERSLQGK 
YRW PRLGHTRKS 


5933 


19834 


A 


5984 


368 


138 


STVGKMLSGQHPCYRE/IL/REQKSQSM 
*QISLLSYFKKLPQPFQPSATTALISQQ 
PSAS RQNPP PAKRL * LAEGS YDH 


5934 


19835 


A 


5985 


2 


155 


LANF+ IYFC/RDKCLPVLPRLVSNSWPQ 
VILPPWPPKVLGLQGMSPCAPRHS 


5935 


19836 


A 


5986 


325 


403 


FIYYLFFLSSLYLI ITYALFLiSSF*R* V 
VYHM VMVLLKW I CHHFLTL FFS FRSL I 
LYLFFILFM/YLLIFVLFFIQFSFYLFF 
FYLFYYLFFNDYCFDLIFFLCLIYYYHF 
QSYYFIS * FIYYLFFLSSLYLI ITYALF 
LSSFSI 


5936 


19837 


A 


5987 


81 


651 


KLQVAGCRTLSQIDVKFKPSVHSSLAVC 
LRANYFTSQPNVIMQVTLLTRCVAHSKH 
LGTVS YSSWYI KLLFPKSHFI YLFILRR 
SLAPVAPGVQWLDLGSLQPPPPGPKRFS 
CLS/LPR*LDYRHPPPRLANF\ *FLVEK 
GFCHVGQAGLELLASS DLPTSASQNAG I 
TGVTPAPSPQKATLNLILGGQFHI 


5937 


19838 


A 


5988 


301 


341 


FCFCPCKNRVSLW*WHAPPCPANF\*LF 
IYLFLVETGFSHVGQASLKLLTLDDLPA 
AASQSAG I PGMSHHDGLFL 


5938 


19839 


A 


5989 


71 


1252 


TKEQRSLRSQQLGVRQQRSHRLKFGGGG 
APSARGVGGSWSLATRLGPVSTANMSRP 
VRNRKWDYSQFQESDDADEDYGRDSGP 
PTKKIRSSPREAKNKRRSGKNSQEDSED 
S EDKDVKTKKDDSHS AEDSEDEKEDHKN 
VRQQRQAASKAASKQREMLMEDVGSEEE 
QEEEDEAPFQEKDSGSDEDFLMEDDDDS 
DYGSSK/NEKQKDG * E VQT* KKRKENAQ 
TQTKG YS DAKS S ERQRESG S P \ QAS KAS 
KEKTPSPKEEDEEPESPPEKKTSTSPPP 
EKSGDEGSEDEAPSGED*K* * WSGERFY 
* KKKE KKRKKR EE KKE PT * DRTKF WL WL 
DSWGFQCFFPFVESNISLSLSFFFFFFK 
ANHCMCKCLSYLFVYWSLCQPSPFPMKA 
MSN 


5939 

5940 1 


19840 


A 


5990 


630 


316 


RWNVNSVA\QAG\ IQWCDLGSTGNPRPP 
GF\K*FSCLSLLSSWDYPTHATMSSKFL 
YF*WRRGFTMLARLVSNWLVLNS*PSDP 
PASAAQS AGI TGMSHRAQP I YLS 




19841 


A 


5991 


2 


182 


WQEQKAE/ CLRDVWTAMCTAALLTTAKG 
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nucleotide insertion 














R\NIPCPMMKEWVRKMGPIYTFEYYSAV 
IMQVILKF* *HATTGMNPDDVISEVCQS 
QKDK * R I LLS CNNAGNS 


5941 


19842 


A 


5992 


79 


278 


E AK * RVNGCS KP T WHM YTYVTNLHT YV / 
I KPAHCAHVP* KLKYKI I ITIIK* * RWG 
LTILPRLVSNS WAQV ILLPWPPKI LGLQ 
A 


5942 


19843 


A 


5993 


22 


274 


AGDEHRAWGYQPAM* ARLEMNTGPGDT/ 
MPAMASQRAGITGVSH*GCPFFSIiIjNKI 
WRSH*VAQAGEQRLFRE*TPGLSCYSCS 
LPSN * DYTS APLCPPYDT 


5943 


19844 


A 


5994 


347 


195 


YMHTKHTHTHTHTHTHTHTHP \ EKHLKA 
PNEV+KSKNNV*IKPKQSQSCFS 


5944 


19845 


A 


5995 


1 


122 


RYELEDELHLQMTTQLHALKENNYQLR* 
SSHVSIPST*GYRHVPSCPANF/SSLTM 
FPSLiVSSCWAQVIHIjPWPYKVLGL*ALiK 
ENNYQLRTI I IDCTLNNNIF 


5945 


19846 


A 


5996 


365 


229 


R P P PC P AN F \ * FL VETGFHHVG KAGL.EL 
LTSSDLPAPALHQVFYA 


5946 


19847 


A 


5997 


357 


17 


ILVDKINNVWNLFQNKLWC*RGWGYK* 
DRSFHLLKTWAE*WVYEGSLY*SLSSY 
YLF/ C* RQGLTMLPRLVLNS WHQAI IJjP 
WPFSLLLC I LEVFQNRELKKKNQGKPMT 
GS 


5947 


19848 


A 


5998 


1 


207 


P I L YNLFQKI E AEG I LPNS F YEAS VTLL 
PKPDKDITR/ITTY*FILLT*NILNKVL 
SNQIQLYIKRITY 


5948 


19849 


A 


5999 


603 


269 


EDRVL VLLiPKAKS AVA WPRLTATS AS W \ 
VQRSFHPSLPSRWDHRHVPPHSANFCCV 
CL * RQGLS LLPGLVLNS WAQVI LPP WP \ 
P KVLiG I TSMSHHAR PLS FY YGHFKY I QK 


5949 


19S50 


A 


6000 


3 


150 


DYRHAPLHLATFKK/ FFCRD/RGLS VLP 
GLVSNSWPQTVLPP* PPKALGLQA 


5950 


. 19851 


A 


6001 


365 


62 


AITAHCSLNLLGSSNPTASATRIAGTIG 
EAAMSYF*ISSTLTHFSYT/C*L,HHVWL 
I FKFFFVDEVYVAQASLELLSSSDPPAL 
VSQNARFTGVSHCTEPV 


5951 


19852 


A 


6002 


537 


236 


DR VSLCCPGWS AVAREQLTATSAFR \ VQ 
*SSCLSLPSSCDYRSAPPCPANFLSFCR 
DEGGGL PGL P RLVSNS WAQAI HP P Q P PS 
VWITGVINGTWPKIPLS 


5952 


19853 


A 


6003 


22 


368 


NFFLQNKENIKRKIF*ERKHKLMQRTSL 
LSYFKQLPRPPTTSQQPSTWRQDPPPAK 
R/LRTYKGLDDR 


5953 


19854 


A 


6004 


327 


335 


NQS I FTFNNYPRPF / CCKFKFKN / TLHL 
WADKVAHTCNLSTLEGQGGRIA*GQEFE 
ATLANMVKP * LLCKVK 


5954 


19855 


A 


6005 


2 


367 


WQFLKKLNTKIjLiYGPVF / PYLGI YKQEK 
WKHVHAETCTQMFI TVLFI I VPK* KQPK 
CPSMEE * ID*M* FVHMLEYRAIKENKLI 
HATWMNLRNTMLSE I SQTQITTCMNLLH 
KMFRKVTETES 


5955 


19856 


A 


6006 


1 


176 


pgfkqffcijsfwtrgifiemgfhhvg*a 
glgllmsgdppasasqsagitgVvshca 

WTY 


5956 


19857 


A 


6007 




289 


EVLTKIPIR/FFVEIGKIilQDLKDTGPR 
IA*TMLTKKTEMGRIVHSDLVAYYIVAV 
IKMVWYYW*RDRHINQ/WERLEISEIDS 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, JL=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QNMFNWVFTKMQKLF 


5957 


19858 


A 


6008 


439 


223 


FFSPGSH/SIT*ARVQW/PHHSSLQPEP 
PSLKQSSHLSLPSSSDYRCAPPHPADLC 
RDWVSPCCPGWSQTIiGLK 


5958 


19859 


A 


6009 


1 


252 


MVSIS* PRDPPALASQSAGITGMSHHAQ 
LFFLN/CIS/ELJSSTVTPGLKESACLG 
LPKCWHYRREPQHPACTVLYLQFQSDFQ 
LK 


5959 


19860 


A 


6010 


131 


514 


PWPYSLSSSGLLSSFEN/VEKWVPEI IH 
\HSLFLGTTIDLRS\EPCTTEKLN\KDK 
YNPLTPQMADMMVCDLKWE* LECSAFM 
WKRLK\VFDEALWAALEED*RSSCRKET 
STLLQSTPL.LVSYAL>PCTP 


5960 


19861 


A 


6011 


625 


141 


E TG E R F VTHAGVK W HNHDS LHPQPPRLK 
* FS YFS LLLSS WD YRHVP PRPANLVYPL 
*RGGPPSMLPQASLEF\LDSK.*IPPCLQ 
PS/AKVLGLQVSHRPSPKVTFHQRAKEG 
DWSHWSRGKSIPDRGNSQCMHRSIPC 
VFEEQKGGQCYCSIESKEETSRI 


5961 


19862 


A 


6012 


302 


42 


GLAMLPRLPGTPGLKQSSCLSLPSSWDY 
KHILS * PAVY/C + FL.KNIK 


5962 


19863 


A 


6013 


363 


82 


VSVSLSPSLKTKVNYAVASASFTIAKIW 
KQLKCPSMDKQMK* I WHIHTMEYYSA/I 
KQKIMSYSTTWMNLEDIMLSEISISQAY 
KGKHHMIYGI 


5963 


19864 


A 


6014 


329 


99 


H I FGNT YF FQVAS S F I YLFRDS VSLCRP 
GWSAVA* S WFTATSG \ FRP SACLGL.PKC 
/WDYRREPPRPANVLFSMRVTH 


5964 


19865 


A 


6015 


324 


238 


LAMLSRLVSNSWPQAILLPLPPKVLGLQ 
A 


5965 


19866 


A 


6016 


2 


308 


F WKAI AG I EGDS S DKCRQ/.S KLKT I WKK 
FISLDAMRNIHDS * EKVKMS IFWKMIPT 
PRDDF/ EGLKTS VEKRTEDWKTARQ * E 
LKVDRT* TDEELLLLDQQRK 


5966 


19867 


A 


6017 


2 


457 


F FFLRWS L / DSVAQAGVQWRDLGS LQAP 
PRGFTPFSCLSIiPSSWDYRRPLPRPANF 
FYF* * RRGFTMLARMVS IS* PRDLPALA 
SQSAGITDVSHRRAERVISKQRIVSVMW 
KPLPEIHIPCILNLSLRFNHRSVAGLRN 
SLIVRMLS ILTHG 


5967 


19868 


A 


6018 


389 


142 


AHMLFAAQGDSS I PMI/VAPLFTWKMWE 
QS KLPIiADEWSGFW * HIHVTE Y/ YAIKR 
RE ILTHATTSLKALCYVI * ALTKGRTL 


5968 


19869 


A 


6019 


384 


232 


LNL,PSSWDYRRAPSHSA/NFSYF*RDGG 
LTML\ PGWPQVIQMPRPLKAL * LQA 


5969 


19870 


A 


6020 


333 


84 


GLIDQEFATDTTKSTSLSLT*T/IPSRC 
S1.CLRGSSDSPASC/SQPPASVAGITDT 
CHHAH I I FVFLVETRVHHVGRANTEKPR 
LH 


5970 


19871 


A 


6021 


106 


366 


DTISFFFFLKAPIEPVFIIFIFFYFFFI 
FFFI*FFFLLFLFLFIFYFVF/IFFFIL 
LCLFFFFLVLiIiFYIiLiFFF/ SLFFLFCFV 
FCFFF* FCYFYFYLFFILFFFFFYIALF 
I FFFFGS SFLFIVFLFL.FFLFCFVFCFF 
F 


5971 


19872 


A 


6022 


384 


18 


REKKPGPGTGKPPKKKGGPKFGPKGPKK 
RGE KKKKGG P /NS FF P RGGGT PGGN F * R 
GFPPENQKKRGRKKFFQGGKRGANPGIP 



909 



01 64635A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 
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location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phcnylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PLLGPKAEGGFIPGKKKKKKKNSNKKSK 
QTKKALSRTSHS 


5972 


19873 


A 


6023 




191 


LVKGKIRSWEETWVKTW/RFYPGVMAH/ 
TYNPNPLGGQGRWIA*AQEFKTSIjGNVV 
KPHLYKKKI 


5973 


19874 


A 


6024 


364 


142 


IINGLIIHLHNFKINGHTLSTWYTFHIL 
CVCVYI YICLCN CIYVSVCLCVCVYTHI 
QYTY* INKHIC*RLEAK 


5974 


19875 


A 


6026 


338 


102 


WGFTILARLVSNS*HLVICHLDLPKFWN 
YRHE P P C LiAL F LI F VGT * * V Y I FMG YRR 
Y / C * YRHAMCNNH I RVNMVS VGLNF 


5975 


19876 


A 


6027 


99 


77 


REICTPTFTEAFSIEAKGWK* PKQPLTN 
EWINTMWCIHMRGKF*P/YAKTSMKLED 
VFHE I SKSQKDKHYYI * FLSPTYVKIQA 


5976 


19877 


A 


6028 


2 


328 


TITYRGAKIRITSDSSSETMQAREE*SE 
IF*VLK/EKKKTH*PKILYSGKLSFKSK 
GEIK/YF*GKQN/L.KEFVSSRPVLQGML 
KWLQREGKLYRS ETQTYKKKEKAS E KE 


5977 


19878 


A 


6029 


8 


241 


GFAPLPRLECSNTNMAHHS LNLSGSS DS 
PASANFS I F\ IETRFGHVAQAVL* FLSS 
SNPPISPSPSVEITGVSHHARP ' 


5978 


19879 


A 


6030 


108 


362 


CCYLMVTE I SAHQNPAKQGLA/ HTAF P * 
KTFFIiKVSWTGTVAHV*KPSTL,GG*GGR 
TS*SQKFKTSLGIMRPPSLQIKKKVFKI 
YP 


5979 


19880 


A 


6031 


1520 


77 


ASSSSSSSSVILSS*NIPNYNYIVIFM/ 
CL+IIF*AASSSSIYYVPGICYIYH\IL 
CVCVCICMCIYVCICIYSIC*YIVYVSI 
YYIHY 


5980 


19881 


A 


6032 


405 


100 


EGFFLPPRGGGGKR/PPGPLTPGGPGNP 
RPKPPQKWGKPGPPPGGGPF*WVLLSPS 
LPGGQVES PGAVSG PRQVP PSGSRPFSP 
TAVGTTVTFLS KKKKKQV 


5981 


19882 


A 


6033 


42 


309 


CDKFFHKASNHIYVSTYIWVFQICVCVC 
VCVCVMCIbC*AY/CGCLGIMIWFLFVL 
*CI/CYCCY*SSLFSVIICIFLMCFVSY 
ERRFILFA 


5982 


19883 


A 


6034 


397 


2 


KGPPP*GGPDFF\*PRGGPPQKNPFFPP 
GGGPPPPKNGVGGPPPQKAPFFGPKRGA 
PQKGGPPGGGPPRGEKKSLCPPPNWENT 
PRGLKKGPPPFFFFFFFFFFFFFFFVIjR 
STGLRAPFQSVFIVELVEDMS 


5983 


19884 


A 


6035 


343 


2 


LWPPQGSFKTAAPFFFFFFLFISDFITF 
HFLYM\YVFFIPLFLL*SFFCIK*IFYY 
VTF*FLQSFSILFFSWFL\WLL*SLLY 
SF*LIRIYFRFILT*FQ*DTETLFFYYS 
IP 


5984 


19885 


A 


6036 


319 


68 


SLGEICAPK*FFPPAQKKGPFQKFPSCK 
FSPSPVFKTRPRAKF*KGPP*GKKFYVA 
NPGKI\GPPKGSFKRPPLFFFFFFFFFF 
F 


5985 


19886 


A 


6037 


419 


191 


RIHTI P I TALFAVAKR/ WQSKYPLVDE / 
MDKIWYIHTMED*AAYGKKEIMSHARTW 
INLKNIMLSEISQSQKDKYFPFI 


5986 


19887 


A 


6038 


434 


401 


YGKNFSFSLjKV*IFCEGLVLVFFPPKKK , 
/FFFKNFHRWFPLYVFF*TGRGWFFK 
P PFLEKI FFFLTRVNLGPPRGFF* GAP F 
FFFFFFFFFFFFFFFFFFFFFFFFFFFS 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucinc, 
K— Lysine, L— Leucine, M—lVflethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=posstble 
nucleotide insertion 














*NFYFMN*IFTVI 


5987 


19888 




6039 


12 


346 


SFSFVRRSLTVTQAGVQWCNLGSLQPP\ 
LPPSFK*FSCLSLQSNWDYRCLPSCLSN 
FCI F/M* RRGFSMLARLVSNS * PQ/ CDP 
PASASQSAGITGVSH/SHLV*NA*SFKT 
ESHF 


5988 


19889 




6040 


275 


45 


KDVQHQWKIVQPLWKQYGSLKASTESPH 
DPSILLiLGTYPRKMKT* IHIK\TCIRMF 
AEALFLISKK* KLPKCLLADK 


5989 


19890 


A 


6041 


355 


1 


FGTIKNFFFYIFFFKNYF/FFLYIIFFK 
KKVLFFF*KKKIFFLFIPNKKIFWVFFF 
FFFFFFFFFCPRFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFLVTRHLF 
EMRKGN 


5990 


19891 


A 


6042 


635 


209 


PLNI KKTFFFLRQSFTLVAQAGAQWHDF 
GSPQPPPPRFK*FSCLSLPRSWDHRRPP 
PRPANLVF \ D FLVS TG FHRVS QDGLDLL 
T/S / GDLPAliASQSAGITGVSHCARPTL 
EFLKVKEMIFCSTTVKAVLDHANFLILR 
FSIT 


5991 


19892 


A 


6043 


12 


348 


ESRYSSICLPFSSVYCSRIISLQLYSYV 
LLVYILFFF/CFLKREFGFCPPLGVQGP - 
GLGSLNPPLLG*KQFSGLIFPGIGNTGL 
APP S P PNFG FFRKKG VS P CG PNRPA I S K 


5992 


19893 


A 


6044 


192 


1 


I FHRFCT/H I TLP / TTALNNN * PWLGTM 
AYAYNPNTL*G*GRRIV*AQAFKSSLGN 
MTRPCLYKK 


5993 


19894 




6045 


338 


42 


WKTAW W / FLKKLD / MN YC * D PAI LLLG I 
YPKELKAGT*TDIRTSMFTAELFAIAKR 
WKQNVDLKNLRQNECRQF I WAKVEGNWL 
PSWFGEWSIQHRLQAGF 


5994 


19895 


A 


6046 


300 


3 


RLDCHYLHSHTHTHTHTHNPAGIYTHH/ 
HCWNLHTHNPGGILGVALDL*MQFKLNR 
GWSWGGEQMAWPCPQGHVMWLGDPGKMI 
LKPPLWVCQLQGSSYDG 


5995 


19896 


A 


6047 


332 


3 


SLESAGFT*YKMYFYYYYKIK\YIYKSI 
FIILFL*IFFFFDLFFIFFFFIFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFLKSP 
KRVIYFGFGTQNPQMFSLLTLKYVLC 


5996 


19897 


A 


6048 


2 


152 


KNLKISWT*WLMPWSAT/WGGSLKPGR 
LRLQ * AV I APLHS S LGDRAR P YL 


5997 


19898 


A 


6049 


1 


28S 


PII I YVUL1jLFFETDSHSVT\RLi*CSGA 
FS TH CNL CLPG PGD S S AS DS / RVAG VAP 
ACLF* PYRGFSGPATLAFRALDPGLPLH 
PGFSLQRPSCSRGG 


5998 


19899 


A 


6050 


358 


1 


FSIFSILIFDFLGGFLKFFAFFYGWDFL 
RFPFFYR*PFGPKSSFIFPHVF*R\WIW 
FLLLSSWFLKWS FDPQFFLKRFFFFFFF 
FPKGFFYKRVFFFPGFFFFSNLLLSSQG 
KGVWLI 


5999 


19900 


A 


6051 


308 


45 


ADLSAEAL*TRREWDDIFKVLKEK/NKG 
QPKILYPSKL.SLINE/NEIKSFPDKQKL 
REFTTRLVL*EMLKGILHMEAQGQYLPS 
*KHTKV 


6000 


19901 


A 


6052 


2 


124 


I FCVLVETGFHWVAQAGL * LLSSGSP PA 
S/DLPNAGITCVSH*AGITCVSH 


6001 


19902 


A 


6053 


1 


330 


LC* PGWSR\FLTS * SACVGLPKCWDYRR 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Asp.irtic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EPLHLVFFFFFFFFT* IYLKGFYLNPRK 
IFKNFFPLLKNPFFGVPPKIQFSPIPNL 
PSLWPNMVFFKRVLGEKI FRGFFPP 


6002 


19903 


A 


6054 


34 


293 


SPILFLLCLRHQSGCHPSWTAVAQSELT 
AASNSWAQAILPHSWDYRHAPPH/LNLF 
GWSHAALPRPDSNF* PQAILLPQPQVLG 
LQA 


6003 


19904 


A 


6055 


319 


217 


YHFTPTGMAISKNRQKTGDNKCC*GCRE 
IGMLVCC*WQYKMVWLL/WKTVWHSLKG 
IK/L*EI.PYDPAIPIj*GIQRVSWKPIPH 
SSEG 


6004 


19905 


A 


6056 


341 


151 


VHAGGSPES/REFETNLGNKARLHLYKK 
* ISQMWWCI PAVPATHAGPSLHFNLYYS 
PWLFQCDY 


6005 


19906 


A 


6057 


2 


558 


FFFLRWS FTL I AQAGVQWCNLS SLQPLP 
PRFKRFSCLSLPSSWDYRHVPP\RLANF 
FVFLVEMGFLHVGQASLELLASGDPPSG 
DPDLR * LiAFQSLG I TGVT / HHAGQI FLF 
/CLETES PSVAQVGVQWCNLGSLQPLPP 
\GSE*FSCLGLPSSWD\YRKALTHPANF 
\SIFSRDGVSPSWPGWSQMPDLS 


6006 


19907 


A 


6058 


320 


21 


EICMTTNHPSVSNTIW/SKMTL.ASVYLT 
LLPISKTSKTSGLPASKEVNRYLGTCAI 
RYFVFYFIIMIiFV*RQGLTMLPRLVSNS 
WAQVI LPPWPP KVLGLQA 


6007 


19908 


A 


6059 


294 


50 


CYFSLSFGL.+QFLSLNQSFVTLSSHCDR 
DTFE + SWS VI / PGIGVYSS * E * TRGLAM 
MPRQVYNSWAQAILMPWLPKML.EVQT 


6008 


19909 


A 


6060 


616 


354 


ERVS PCHPGWSAVALSQPTAALTS W\ VK 
QSSHLSLLNSWDYKCMPPLIiANF\KFFC 
RNE/ SLLMLPRLTSNP WAQVI CPP* PPK 
VLGLHA 


6009 


19910 


A 


6061 


214 


11 


AASTRPSAWQPPLLGSEEPLCPATTPSG 
RCTQQ/LH* ERAMMTMAVLWNRKGGKVG 
KRLRNRLVAMSV 


6010 


19911 


A 


6062 


349 


54 


QSKSAFPKKRINRISDSSEGYGQSKLKT 
FWKGFTILDAIKNI CNSSEEVKIATLMG 
V/WDDFEGFRTSVKEVAADVC/ETARQL 
EVEPESVTG*QQSHVQP 


6011 


19912 


A 


6063 


332 


3 


SQPASGQ I S KVTQ S TLRCLKMS VQQVKS 
CYKCTATYLCAKFNSHEFRS SG VCEN P V 
SYFYTHSIRSKIL/WPGMVAHTCNPSAL 
ES QGG CMT *GQHCETS M VNMVTPR VSRA 


6012 


19913 


A 


6064 


174 


416 


NENELFFCFFFXIFFFFXFFFFFFFFFF 
FFFFFFGFFXFFFFFFFFFFFFFXFFFF 
FFFFFFFFFFFFFFFFFFFFFFFXF 


6013 


19914 


A 


6065 


826 


571 


DGSHSIA\RLECSGVISAHCNLHLPGSS 
DSVASAS * VAGTTGTCHHARLI F/ WFL 
VETRFHNLGQAGLELLVIHPPWPPKILG 
LQA 


6014 


19915 


A 


6066 


1 


284 


GTSFFF*KRVLFCCPG*RAVPQ\SWLTA 
ALTSWAQGSS +NHRLKP/HMLS FFNFCK 
QG LTML P KG AIJSIS WAQ AI LQ PQ P PM * LG 
LQ ACT YAWS LG P 


6015 


19916 


A 


6067 


36 


384 


VSKNKPI HLFLFFFS KTKSWSVT \ RLEC 
SGVTSAHCYLRLPGSREGAASAL* LS AT 
TWLCJCVAL PLGE0LGALCRNP WGAG VKG 
S S RES P P LGEARGGEL P WGQ VR I RL.WGR 
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6016 


19917 


A 


6068 


393 


116 


PQPIFPRY*QNKDPNL*KTF*HKYMPKC 
PAK\LIIFIEKGFSRFGQPGL*HLSSSN 
LPALASQSAGISGINPGGRARFPLFFFF 
FFSLLLiLSF 


6017 


19918 


A 


6069 


53 


280 


FPRFCFC/LLESESHSVA*ARVHWRRIjG 
PG / LKRFACLKLPSNWNYRHTPTHPANF 
CIFIRDQGMTL*PSWS*T\PDIK 


6018 


19919 


A 


6070 


3 


377 


HEEPQGCATVPGEFFKFLWRHGLITYPR 
L VS NS WAQAAL P P Q * METH IIFSIGRLH 
MVK\RQYWSGAVAHAYNPRLLGS * GGKA 
A*AQEFETSLGYVMRPCLHKNLKNSPGT 
VAHPCGSSC 


6019 


19920 


A 


6071 


337 


3 


RESQAFFS KKKKKGFTKMWFEQRCERG 
KG ARQQS HLR KS I YV * E IAS AKALRRHM 
PGVFQKQPGG/WPGWAHT*NPRTLGGR 
CGWIT * AQELETSLGNPEKP CLYKKYKL 
V 


6020 


19921 


A 


6072 


1007 


628 


FFLLAHSPHSVAQAGVQWGDLGSLQALP 
PGFMPFSCLSLPSSWDHRRLPPRLANF\ 
FIFLVATGFHRVSQMVLIS * PCDPPASA 
SQSAGTT/GARPKVF* IFVYMYFGVRKH 
SILMSMPQHDLFGYKI 


6021 


19922 


A 


6073 


27 


364 


MDLPSVSIFDLRSLWADISRATSTLFWL 
bFSWWCFIjHLIjSFNIiFLSLGQK*VCCR* 
HTQSCVT*QWRY\WSGI.VTHACNPSTFG 
S QGGWVASV* E FETS LCNMAKP CLYKK Y 


6022 


19923 


A 


6074 


2 


68 


ARACSHGTVALTASCFPLQLDYKCLQYS 
FVCFFFFETKSNFVT\RLKCNGPISVN* 
NFNLPGLTRSQA* ASREAGTTGTCYHA* 
*ILLPSSTRL 


6023 


19924 


A 


6075 


330 


49 


KRRFALWQAGVQWRDLRSLQPLPPVFS 
CLSLPSSRDYRCPAS/RPS*FFLELLTS 
GDLPTSASQSAGIIGMSHCAQPPLIYFL 
FQIKVILFFS 


6024 


19925 


A 


6076 


2 


267 


ARGVEVKVGKTQKFLNMIRRTIjVHCWWK 
CKLIQPIiF/WRILPKVKEELPYHLAILP 
LGIYPKQMKSISL\KYICSYI*IVDHFV 
*AILPLGIYPKQMKSISLNIHMFLYIDC 
RSFCIGTFLLL 


6025 


19926 


A 


6077 


379 


236 


PPPPQTRETPPPGHVHKDPPPPHHPYPP 
PQEPQPRTPVPNTHPGEEQNHELFNSSL 
LEPWW\YYSSLLPPEQA*EPHIH 


6026 


19927 


A 


6078 


2 


290 


RRSXjPSS PLEDGRLS IAP3LTWEPPSHP 
PPQ*RSPPPPKKKKKKKKSPPGPKF/HK 
FKTGKPEKPPRVL * RGHWPPFKFF * LKP 
GL>PFF*GTKPPKV 


6027 


19928 


A 


6079 


1241 


1487 


KMYCIFFVE*KPCLSCAIFIRLIiKWS*P 
TVCNSISGGPPHG* IMPSIIL* FLFLET 
VFHSV\TRVGVQWPDHGSLQPGPPGL.K* 
SSCLTASQSTQIIGVSHHAWPPSITFEI 
FKRLAYSMSADGW * KS PPMKLDLGALKK 
KLSFARPG 


6028 


19929 


A 


6080 


333 


1 


LCHHVQLSFKFFVEMEVSLCGPGWPFK* 
FSHLGLPKCWDYRHDPPHLAFFFSLS \ * 
FFCRDRVSMLPRLVLKSWSQAILLL*PP 
KVFFLFVCFLRRVSFHHAEPYAVSSC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN . 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucirie, M^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophnn, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


6029 


19930 


A 


6081 


2 


239 


ARGKYIYNRDKISFCCPGWNAII/RSLQ 
P*TSELKRSSHINRPNSWDYRCMPSHLA 
NF* IFRRDGGLASLPRLGTGWLKS 


6030 


19931 


A 


6082 


323 


185 


SACLGLPKC/WDYRREPLCLVSR*VLKS 
*AQAI CLPWPPKVLGLQA 


6031 


19932 


A 


6083 


296 


2 


GER I LEECI YARSHRAEI TSLHCSLS VS 
K/HSVAQWVKWPDFCSMQSMPPRFK* F 
SCVSWSSWDYGLPPPCAGLHIWRRNVT 
LALQGLPTTSLALV 


6032 


19933 


A 


6084 


443 


1 


QKPK* *KTQTWSSISLQSRKMKKKILSV 
ITFFKT*GEEEEEKEKKRK\KEE*EK*K 
EPSQKKEVEEVEVEDERRGRK* KVDMEV 
KEEEKDKGEEKSGEEQ*KESAEAAVRE* 
VDS /SELVTFAVIiPGDSSEEEQGMMVPR 
A 


6033 


19934 


A 


6085 


328 


1 


SNILIFKAQVTKHIYSLETCSPDYPGIS 
QAIKANFSPCTVG*LPSNWAYRH1jPPCL 
DNF* IFCRG\RSILCRLISNSWLQQVKL 
QEFPQFLGCHARFQGAGFLSHCLV 


6034 


19935 


A 


6086 


299 


46 


LDDLHLLSLCLSLFRDRVSLCYLGWSSV 
A*S*LTVA\*TPVILSLRLK*SSHLNLP 
SSWDYRWSNS*PQGLIiLPRSPKVHLLL 


6035 


19936 


A 


6087 


266 


48 


GIE I VRQMRSRTGADKVI YI VQS PLLKM 
QK1 / WLGTVAHTCNPSNCGGRSGR I T * G 
QEFKQSSHLSLPKCWDYRL 


6036 


19937 


A 


6088 


2 


185 


ARVTMIiPRLISNS \GFK+ SSHLGLRKCW 
DYRCE P P CPAE I S F I SSGYLTRSG I AGS 
YCNSV 


6037 


19938 


A 


6089 


520 


297 


LGYRDFLHLS FP I S / WNYRCLPSCPA/N 
FFILVETN* FHHVGQAGFEHLTSGDLPT 
S AS Q \ S AE I TGMT YRAQ P AAGS 


6038 


.19939 


A 


6090 


388 


187 


SQHLGKLSREDCL/RLIVPDQS \GQHSK 
TPSLKK*F*K*AWVW+HMPWPAIQEAE 
VGGS LE PRKS KLQ 


6039 


19940 


A 


6091 


188 


329 


TLGLKRFFCLSLR*FIYFFFRDRVIjLYY 
PGWKAVGIHRRDYSMLYSLELIASKDS / 
SCLSLRSGWDYCVWLIFFGCM* CKRQDL 
SIIPWQRSLIGQ 


6040 


19941 


A 


6092 


124 


342 


R P AWP T WQNP I S AKNTN I TWA * AARVAG 
IT\GTCQRAQVMFVFLAEMGFCHVGQAG 
LYPKI I ILYDKLSHYKCSHLQMRKALET 
YSG I YS LFTI YLV 


6041 


19942 


A 


6093 


169 


2 


SH*PFTSSTIRSIWT\GAVAHTCNPSSW 
GE*GGRMA*DQKIETSLVNMTRPHLYPR ' 
A 


6042 


19943 


A 


6094 


2 


324 


FFFLRWSL, / DS VAQAGVQWRDLGS LQAP 
PRGFTPFSCLSLPSSWDYRRPLPRPANF 
FYF* * RRG FTMIiARMVS I S * PRDLPALA 
SQSAGITGMSHRARPAVYILTSTSN 


6043 


19944 


A 


6095 


76 


340 


LRKKFCFSVYPFIHRWTFELLPLNDFLM 
NFYFRIVFRFMKKLQR\WPGAVAHAYNP 
STLGG*GGRIS*AQEFKTSPGNIVRDFI 
STKNM 


6044 


19945 


A 


6096 


349 


84 


FIiIiSLF*DTISLCHPGQSAWRTMTSQIj 
KQSSHRSLLSSWDCRCVPPCPVNFFSFC 
RDE/ SLAMLARLVSNLWDQALVAEAGVS 
VWARQE 


6045 


19946 


A 


6097 


346 


184 


LTMLPRLVSNS\GLK*SSHFGLPKCWDY 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G)utaniic Acid, F=Phenylalanine, 
G^GIycine, H=Histidine, I=Isoleucine, 
K=L,ysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *"=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RHEPPCLAKISLYITKFFHILSLPI 


6046 


19947 


A 


6098 


378 


250 


FKQFSCLSLPISWDYSREPPQMANS\*F 
LVETRFHHVYS YVP 


6047 


19948 


A 


6099 


3 


225 


GFKRFSCLSLLSSWDYR/HPAPYLANF\ 
* FL VE MG FHH I GQ AGLE L LTS GDP P TLA 
SQSAG I TGVSHHAQPIiI IF 


6048 


19949 


A 


6100 


1129 


457 


YCVSSKYRRIISFSLIA*LLLTKLGS/V 
QKVGVQWRDLG*LQPPPPG\FKLFSCLS 
LPSGW \DYGHVPPHPA*FCVF * WEMGVS 
PCWLRLV\LNS*PSGGSRPPSASQSAGS 
LQGLI \HRARPEH* F*TS * PLGEGKTLD 
E I Y I KTKTFYQ F C IVCLKLH * RLME FAL 

vaq agmqwhdlgs lqplp pqfk wfs cls 
lpss*dyrywllyhpalelkeapsfpes 

PCTH 


6049 


19950 


A 


6101 


3 


296 


eiiqallitivlglyrtllqas*yl*sa 
ftisdgiygstflaa\tglhglhviigs 
tflticfirqlifhftskhhfgfeada* 
ywhfvded* lflng 


6050 


19951 


A 


6102 


1 


749 


RHEGGLFKTILLI *TSLTIGSLALAGLL 
FFTGFYS KNH I I E \ TANI S YTNA* ALS I 
TLIATSLTSAYSTRIILLTLTGQ.PRFPT 
LTNFKENN\ PALLNP \ I I YVAAGSLFAG 
FLITNNISPASPFQTTIPLYLKLTALAV 
TFLGLLTALDLNYLTNKLKI KSPLCTFY 
FSNILGFYPS I THRTI P Y \ LGLLTSQNL 
PLLLLGLT*LEKLLP\NTISQHQISTSI 
ITSTKK\GILKFYFLSFFFPLILTLLLI 
T 


6051 


19952 


A 


6303 


377 


1 


SKAFGPPGF*APYGLKAHCFPPGFRRGV 
WAPSGFWARPPIGYPFGALIGAPVWPSP 
GGPPKPGPP\GPFGVPKLGGKRLGIGGP 
FPGSPGFLTTPGSKKKKSTKKGKPQKKI 
TPLDTSCSRTIVM 


6052 


19953 


A 


6104 


1001 


531 


FFLRRS FTLVAQAGMQWCDLGSLQPP P P 
GFKQFSCFGLLSSWDYRHVPPTLANFSV 
FLVEM\GFSMLARLVSTS*PQ/CDPPAP 
ASQSAGI TGMSHCAQLPKP I FNYL* AID 
SVTTSISLARDWPNSSENRLLGYMGKPS 
KFKRQEQKMECYRDLAPS 


6053 


19954 


A 


6105 


376 


117 


CSDWSAVSRSQLTAAPNS* \VKQSSCHL 
SLSSCWN/YRCQPRHLANFVLFCYVLF* 
RWALAMLPILVLNCWPQAVFLPWPPKEM 
GLQA 


6054 


19955 


A 


6106 


689 


433 


LCHPGWSAWQSWLIATSPSLV\KPSSH 
LSLLSS WDYRCTS PCPANFFVFFY* R*G 
STM* PRLLWNS WAQAI LPPRP P KVLGLQ 
A 


6055 


19956 


A 


6107 


2 


176 


AREYPANFFVCLFFLILCRDG/DFTMLL 
RLISNSWLKRSDRLSLPKW*DDRREPPH 
QAY 


6056 


19957 


A 


6108 


328 


69 


LYIFSFG*ERVLFCHPGWSAVAQSQLTA 
ASTFRTQVLSLIGRWDNRHVPPHLADL/ 
SFYRDGVSPCCSG*SQTQAILLPWPPKV 
LGFQE 


6057 


19958 


A 


6109 


777 


329 


FLEMGSHSVAQAGVQWRDLGSLQPPPPG 
FKRFSCLS FPSSWDYRRPRPRPA/NFFV 
FLVETGF\TILARLVSNS*PRDPPASAS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 

Oil 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
acid 

residue of 

peptide 

sequence. 


Amino acid sequence (A=Alanine 
C=Cysteinc, D=Aspnrtic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 

nucleotide insertion 














QSAGITGVTYPRPALTQNFSNLTFGEEN 
RKPNPEDSCVRYLLLSIRQRSPTALLHV 
I LFQQM I QLVL 


6058 


19959 


A 


6110 


345 


309 


ASLIFV*LVEMRFLHSGISLFNSGKNGP 
TAFSQDPPG\SSVSKKAI2IIINKINK* 
IQHVTV I Q I E VQ WHNHVS LQP L.NPELNR 
FSRLSLLSSWDHRHMPPHS ANF * NFKN 


6059 


19960 


A 


6111 


332 


3 


RKVIiVCPPGLSAVAGFQVPARSAFQVHP 
LFLEAEVAVSPGCAPAFKPGPQGPF* KK 
KEEERKICKGRRKEEE/EEEEEEEEEEEE 
EEEEEGEEGEEEEEFYFLRAGKVKSPRA 


6060 


19961 


A 


6112 


498 


191 


STDRVLLCQKARVQWCDPGSLKPPPPGF 
K*FLCLSLPKSWNYSHAPPCLANF\*FL 
VETGFCHVGKAGLELLASSDPPALASQS 
VEITGVSHHAWPMQEYFQI 


6061 


19962 


A 


6113 


1381 


172 


RNIDSKAILSKKNKAGGITLPSFKLYYK 
ATVTKTA*YWHK/NH/DVD*WNRIENLE 
MRPHTYNF\LIFDEPDKNKQWAMDSLFN 
KSCWDNWPAIWKRLKMDAFLMPYTKINS 
RWIK/DLNVKPKTVKILEDNPGSTIQDI 
GTGKDFMTKTPKTIATEAKIHKWDLIKL 
KSCTAKETIKRVNRQLTEWEKI VANYAS 
EKGLKS SIYT*FKQI*KK\NHPI KKWAK 
DMNRHFS KED I CGQQS Y / VKK / CPAS L I 
IREMQIETTTRYHLTPVQMAI I KMSENN 
RCW*GCGEKNMLIHCWWKCK/liKDVQPL 
WKTVW/RIKTEIPFNPAISLLGIYPKE* 
KS / C / CYKDTCTRMFVAA\ CPS VTDW I K 
KMWY IHTREY YTAI KRNKTDWAWWLMPV 
ILALCEAEVGGSLEVRSSRPAWPTW 


6062 


19963 


A 


6114 


3 


155 


HEKKISQVWWCMPWP/TILWWLR*EDS 
LSPRKLRLQ*SYDCTTVLPAWATE 


6063 


19964 


A 


6115 


47 


368 


STPTPSAFLCLAF/ YRQRWIjCWPDWGA 
V\ I QHT YS LPT * LS I LKQD * P LNLLGT / 
WSSRRVAACPENQIYRRNGVS/MLPRLV 
LNSWTQVILLPWPPKV/LGLQA 


6064 


19965 


A 


6116 


1 


592 


RQRIFLLECGGAISAHCNLHLPGSSNSP 
ASVS * VAG TTG VRHHAQL I FVI LVETEF 
HHVGQDGLDLL/NLMIHPPRPPKVL*LQ 
G 


6065 


19966 


A 


6117 


I 


306 


LWKTLWQFVLKLNI LLPYNPAI VPLG I Y 
PNKMKI YVHTKTCTQICIGALFI IAKT* 
KQPTCSSIGEW/IKKLWYIQTMEYYSTP 
ERNEPSSHENTWKNXiKCI 


6066 


19967 


A 


6118 


2 


284 


QTGVHWHDQ\NLL*PETPG\SRDSPGSA 
SQVAGTTGMHQHARLNFFSFSFFLSFFF 
FFSLGKTLGLGLE FFPRWLQMSG PKK I L 
RLYLQNPKNKG 


6067 


19968 


A 


61 19 


2 


315 


SR VAG I TGVGHHARL IFVF/ CF FFFFLE 
RDFLFVPRVGFQGPNLG'LKFPLPGLPL 
FSGLTL P KTGD * GAPLQPRVN FW I F * KK 
G\V*YCGPGWFEILDLRGSPPF 


6068 


19969 


A 


6120 


42 


392 


LQWRNLCSLQPLPPVFKQFS CLRLLSS * 
DYR\HAPPRLG*FCIFSRRIVFLHGWVR 
AGSRNS* PQDGSAPPQASQLAGITG\VS 
HHHTWAPHFLFFKNSGCSTQNR*QVHKV 
GDKETS 


6069 


19970 


A 


6121 


2 


202 


VHLSDQNQPKCP * MI E WIKKMWHIDTME 
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SEQID 
NO: of 
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sequence 


M 
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od 


SEQID 
NO: in 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glut.imic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YYAAIKKDKFMSF\ETITLGEETQEWKT 
KHRLLSLIRGS 


6070 


19971 


A 


6122 


355 


161 


FIXYLFFFFFFFFFFFFFLFFLSFLFFF 
XXXXXXXXXLXVFTGM IS* LWVCLYVYM 
NVYM* IYMF 


6071 


19972 


A 


6123 


1 


133 


LC*PGWSR\FLTS*SACVGLPKCWDYRR 
EPLHLVFFFFFFFFTLi 


6072 


19973 


A 


6124 


3 


212 


HPGLPSSWDYRCT* PCLANF/ SCCCC*R 
LGLTMLPRLASNSWTQVILLLWLPKLAQ 
NTSLINNFYIDQMLK 


6073 


19974 


A 


6125 


156 


2 


ALLTHAKNKAL \ WLGAVAHDCNLSTLEG 
* GGW I T * G * EFN I S LGNI VKHHL Y 


6074 


19975 


A 


6126 


345 


34 


HPSTIHRVFCSL*S*SSPQARNNGSCLS 
SEHFGI TGVNHHTWLLFFKKN Y / CIL I K 
IETGS* YVAQAGLELLSSSNPPISGSES 
AGFTGMSHGIRPVSPSLKWS 


6075 


19976 


A 


6127 


2 


328 


ARAYLPLCFFI I FF I * VSVI YFI WLL * G 
LI * LNKRYLAHPK\ Y * LNVS YSVSFCKT 
HFFPRHKEKAI IQLGWAHACWPSTLGG 
QGGWIT*GPEFETILTSMEKPRLYY 


6076 


19977 


A 


6128 


1 


341 


GTRKFSDYRGKPPHPAKNRNLLFS IFFY 
FETLSCSVLLCEVTSSQL/ CNSFHFPDIi 
PSSNDP P S S AS RAAET TGACHHAQM I F * 
FFVETGPDWS *TPG /LLAVLP PRS PRML 
ELQA 


6077 


19978 


A 


6129 


264 


2 


TAISENKVASRSEHRRHRML*LQVI /LI 
ALQHRSQRDNLSH* KN * KKKM LGE VAHA 
CNPSTMGGQGGWIS*GQQFQTSLSNMVK 
PPRA 


6078 


19979 


A 


6130 


1392 


1157 


LSLP KCWNYRRE PP CPALCCPGWS * TPR 
LK*SSPLGLPKCWDYR/R*TTAPGLFFI 
S KSSLRS PAYS CVTVCTLQLSHH 


6079 


19980 


A 


6131 


383 


108 


MRLCSQPHHHVDFSVLETESCCVTQVGV 
QWHDLSSLQSPPPRFKRFSSLK*LGAIL 
LLPSSWDYRCTPPHPAN/LCRF*HGPVC 
LYMFGEKVTP 


6080 


19981 


A 


6132 


307 


1 


LKTLPALCE LESHSVS QWRNLS S LQPP P 
LRFKRLSYLSLPSS *DHSHP*AHQANF\ 
CIFSRDRETGFHHVGQENLIMLIVQQSS 
GLGFS I FATLLAVMQLV 


6081 


19982 


A 


6133 


351 


66 


RWS FTMLHRLLLKS * LQ / CDP PAS AS QS 
AGMTGMSHHTRLHNLF*TANNLTQSHTK 
FYSFISLT PHFAINVTNYI FLY YTS INI 
VCIFLSFKFYARI 


6082 


19983 


A 


6134 


2118 


1392 


FFCKGK* I FIQCPDWGG/DLCV* QEKIQ 
KKTYHTGKDKYV*LF* KVKPITKI *RQI 
HTSGNTDAQLSYKENHDQGH * QAKNFKF 
FLFF WF YLRRS L/ NS VTQ AGVQWRDLG 
SLQPPPPGFKQF/SCLSLPSSWDYRRPP 
PRPANFC/ I FE*RWGFTMLARMVS I S* P 
RNPPTSASQSAGITGVSHCTRQEFKVKT 
GYHFHLLPSGCLFLFYVFYLYACLLYVS 
VYYPLRQQESRKISKIEMFLTTWH 


60S3 


19984 


A 


6135 


383 


47 


HLFLKFYYYLSHYWYLYYFLLHLIPSSP 
KI FFFFT * KI I FFL I KFNKVF * NLS F* K 
KIF/IFSYTF*FWPP*NFF*KPPPFFFF 
FFFFFFFFFFFF+QTLFLGGCGYNTKLR | 


6084 


19985 


A 


6136 


2 


196 


TRPSILVTALYYLYIFTTTQWG* \LTHH ) 
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nucleotide 

correspond 

amino acid 
residue of 
peptide 


Predict- 
nucle- 

ding to 
last amino 

peptide 
sequence 


Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutnmic Acid, F=Phenylalanine, 
G = Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N— Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














XNNXAPSFTRENTLMFIHLSPILLLSIjN 
PDIITGVCS 


6085 


19986 


A 


6137 


700 


433 


FLRWSSASVTRLE*SGAISAHCILCLLG " 

VETGFLHVGQDGLDLL/NLMICPPRPPK 
VLVLQA 


6086 


199S7 


A 


6138 


377 


298 


QMKWKLLLLFRYRVLLC/HPWSAVA*LQ 
LTAASN\ FGLK* SSCLSL / MLTMLVLNS 
WPQVILLPWPPKVQEL*AIAAYTIKMWV 


6087 


19988 


A 


6139 


1 


199 


LYTTYFVTKTLLLTSLFIj* IRTG\YPRF 
RYEQLIHLL* KNFLPLTLALIil *HVS I P 
ITISSIPPQT 


6088 


19989 


A 


6140 


381 


28 


STPNRLIERQEDPPES*NTNPPGLPRKT 
PSFTKNPKINLARWGGPENPPNPEEKPR 
KVPESPKRKKP*TKIRP3jPSSPGDPTKP 
P\LKKKKKKKKSWPATQEAEKGGPLKA 
RSLRL 


6089 


19990 


A 


6141 


1 


209 


LKLLASSDLSTLGSQSAGNIVGMSHCTQ 
PKINMGILLKLVYRFNINPVKTPAGLiF/ 
TKKKT*KTEIDKRILKFIWKCTEHRIAK 
QSLKRTKLEDFLLNFKTYYEATEI KTS * 
YRFNINPVKTPAGLFKKKKHEKQKLTSG 
S 














KKGKNPPGPKIGKNNFGKTPFWAKKI*K 
FNLI\PKKTPPFFFQKLKKTGVKFFWAP 
KGPKGFLSKK 


6091 


19992 


A 


6143 


382 


158 


FFF*DSLAVSAPGVQ\WHDLCSL,\RPLP 
PRFRRFWCLSLLSGWD\YRHVP\QYPAK 
ICIFS RDRVLP I LAKASRNVS F 


6092 


19993 


A 


6144 


185 


316 


PCLANF* IFFCK/NGGFSMLPTLGLNSW 
AQG I LLPKPLKVLGLSA 


6093 


19994 


A 


6145 


489 


231 


SVA*AGVQWHDHS*LQPQ/PLGLKQSSC 
LSLSNSWDYRHMAPHPTNFLKKFFVETG 
VLSCCPGWSSTPGIiKGSFCLNLPKF 


6094 


19995 


A 


6146 


179 


1011 


GTILNPQSLPFKPFFFSTLIFLKTVLLC 
CPGWVSNSSGSSCRSLPKCWDY*HEPPH 
SAL * TFPWPFP * LVKDTS IHSTQLLKAE 
IFSASL.DAAPSPTPYIRTHPSSGL.QRWL 
LDAARSPTPYIRMHPASRLQRWCSTCTL 
LPSPSYHTPA*AMTTASSLISPFFPLPP 
PAHFSQCRMTFCLFVL/CLFEMESRSVA 
QGWECSGAIIjGSIj\QPPPPGFK/RGFSC 
LSLLSRW\DY\GHAPPCPG\NFCIFSRD 
G\VSLCWPGWSRTPDLWHPPRPPKALG 


6095 


19996 


A 


6147 


348 


3 


GYHKT INWHCCMS VFANLTCGFNAI P I K 
/ IPAKYFVDPDKLVXiKFTWKGKRPRIAG 
TILKEKKVRRLILPNFKTHYKATVIKIG 
W*W*NNRHNHWNQIGSPEIAPHKYSQLI 
FDKEA 


6096 


19997 


A 


6148 


22 


228 


THRVALFVRT*NWKQPKCPSTGEWLNTL 
CYIHTIEY* YYSAIKRNKL.S IHVKT\WI 
DLKGIVLNENNQI 


6097 


19998 


A 


6149 


406 


279 


RS IQGS SHSHASA/ S * VAGI TGARHHTQ 
VI IVFLVETGVMALTY 


6098 


19999 


A 


6150. 


406 


10 


MRYHFTPDERLKSKGLKLEYW*GSGETR 
SLIHCWLEYKMSHPLWK* /TVWLFPIKL 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
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NKNVPYGPAISLLDIL*EK*KHMSTKNT 
YK* MLQALFI ITKNWLLS I KRRMTKQ I M 
VYSNRNYYSTIKNELPIQTRI 


6099 


20000 


A 


6152 


367 


1 


PTPWEAEAGGSFKPRR/LQ/ CPN*PPMC 
SCTPAWAPGSPPVSKKKKMQFHRRMGHV 
S S AWW PCGADVRNLRAGGY VRHRER FPG 
LPWAAGGRDEVIRCLTVCTACGEDHPGS 
HLAGPRGGGFP 


6100 


20001 


A 


6153 


441 


38 


ILNPGEAITSEKYVQ*FNEMYQKAKRL/ 
QVNRKGPVLLSYNA*PQVAQPVSQKLNK 
LGHEVLSHRPYS LSLSPTDYHFFKCINI 
FLKEKR F YNQQDAENAFQEFTES * SA/D 
FCTTR I NTF I SHWE KYVD YNRS YFD 


6101 


20002 


A 


6154 


308 


121 


GESFPAEKTFPKENDYRWPSFFQWSSVL 
/CKIHGLLMLPRLV*NSWPQAILLPWPP 
KVLGLQA 


6102 


20003 


A 


6155 


1 


420 


QNLQPHWKTLRQF/LYKVKHMMLSPAIP 
LLDIT*GRCKPCPYK/ESIC/RMFIEAL 
FIIAKKLKLSKCGSAGE* INKV*DIYTT 
EYYSAIKSNKQLTHTTTWIKLKSITPSE 
*SQT*ETACFTMLLIWHSRKGNIIVTEN 
RSMF 


6103 


20004 


A 


6156 


77 


415 


TLGLKPTSSSHLSLSKCWDYRQEPPCPT 
SEANFFFFFGGEFPFWSPS *LKQFFFVE 
MG \ FTMLPRLVLNS WPQAHL ILP/ASAS 
QSVGITGKSHRAPPQKLTFFFFLGGNFP 
FGPQAGCPGAESNYPEP*TPGLKAFFGL 
NLSKCliKYGDSPTAPGKLEIFFFKWGGV 
SPSCASRVLNPQPR 


6104 


20005 


A 


6157 


436 


206 


LWYIHTMEYYSALKKNAILIHTI\STWM 
SLENMIiNE I KQENH KRAN I * FHLYEI PR 
KESRMEVNQGLGRRGIWEFTVY 


6105 


20006 


A 


6158 


416 


50 


RFFTHPPAGEFFSPPPPKNIFFPPPP*K 
FWGGGGKASPPQKKFPPLPPPQNGVLCF 
PPHKKK* FFPPPEKTVEPGPLHVGQRPP 
QS\PFLCYPPLSFFLYFCYFIFFPFVFF 
HFLPFFFFFIR 


6106 


20007 


A 


6159 


443 


145 


FFFFSIRFLFWWLYQG*EYLCFFMT* 
* IGVLLFLFEICEFFFFLLVGILFGMFL 
/YFFATASPGLFFFCFFFIFFFFFFFFF 
FFFFFFFFFFFFFFFFFFCEKS 


6107 


20008 


A 


6160 


322 


2 


TQD KNKRHRARH I YKRARKS ARD * RSL V 
HR* K* KVQKRNEY* PEGTLHTCIALSQG 
NN I * DA/ HSQE * W I T PVI +TLWEAEAGA 
SRGQEFKTSLANIVXPDAWANAW 


6108 


20009 


A 


6161 


421 


57 


FCEMKFTPGNNAVNIV*PTTKDLEY/WI 
NI /DKTVAGCEKTDSNYERSSTMGKMIA 
SNSIAWYTEIFRGRKSQLMWQISLFSFK 
KIj*QPLQYSATIALIRQQPTTSRQDPSP 
AKRLRLPEGSGDH 


6109 


20010 


A 


6162 


333 


41 


GAWACHLHVGAALNAPTPGLSRFHEEPG 
WSCRTAGGGSRLELSVFFVQMGSCHVGQ 
VGFFCI FCR/DRGIiAMFSRIiVSNS * TQG 
ICPPWPPKVLGLQA 


6110 


20011 


A 


6163 


2 


423 


KINNTKSQCFENI LKIEKTLSNLTKKE \ 
REDTNY * YWE *NMD I TVESTD IKRIIRN 
TT\ETLHKFDNL 


6111 


20012 


A 


6164 


434 


5 


ICHIFFSISKLLNILRPVSKLFFSINFI 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 














FGPELYYL*IHFNP**GKCI,SFLFLQNI 
CIGFFFNSFMQI*SYFLRISLFIPVSP1 
LL.G \ FVQ IFIVSINI L YLS I * LRLFFS F 
FIiSLFliFFFYYYTLiS FRVHVHIVQVS YI 
RIHF 


6112 


20013 


A 


6165 


1238 


1017 


ETGSHSVTQAGVQLiHNSSSLQPAPPRliK 
*SSYLSLLSSWDYRRTPPQPAN\FCIFG 
RDKVSPCCPGWSRTPEFR 


6113 


20014 


A 


6166 


504 


210 


FFFKTRSHOQGSLQP*PLGLKGSSCLSR 
TLPHSWD/YRV/PPPHPAKKYKNIFIFF 
VETEFHHVAQAGLKVLDSSNSPVSASQS 
VGITS I SHCAQP ILCQF 


6114 


20015. 


A 


6167 


258 


65 


YKTKNTTSIFSL.PSSF*TQILTF*NTYY 
Y YY / CRDE /SLAML PRLVSNS W PQ VI LL 
PWPPKWELQV 














LiARMVSIS*PHDPPAPASQSAGITGVSH 
CAQLFFFFFWRQPRFFLRAGGQGRDLGK 
VKPLLSGVRPFLS * TLPTEGRKSGPPTH 
FG*VLRE*FGGSH\PVSSLW*QSAGITG 
VSHCAQLFFFFFWRQPRFFLRAGGQGR 


6116 


20017 


A 


6169 


3 


285 


HASAHASGNILSYVGSFISLTAAI \LjI I 
IMI * EAFASKRNVLI VEEPS INLE * LYG 
CPPPYHTFEEPVYIKSRRIRRSLIDQDA 
ARKETQOGAAD 


6117 


20018 


A 


6170 


645 


15 


LPFIYCSPCRNFNSLFWQERDSNNPLG/ 
ISLPHSDKITFNP\YYT\IKDALGLTSL 
HSLLKDLHS FPHETS * GPQAY YSL/QPT 
PLNNPSPHARPEWIFPYCAYHNFSGSGP 
LTNLGGVL \ALLL \ S I LHPKQ * FPHP PS 
YPNKQRH*YFGPLRPITFIDFLGRRPSF 
ISNRESGGQPVSYPFTI IGQVAS VLYFT 
TILILIPTISLIENKILKWA 


61 18 


20019 


A 








YTTPGGKI FFKSN PGS KIiFP P P P F / PPF 
FSPFSP*KFFFSPKPLNFFGGFFPFFPP 
PKKRFFSKNPPLVFFSPPFKEKVFPFPP 
PLNFAPPRVFLKAPPPFFFFFFFFFKGG 
EVKWLS EALDCKS KDRG 


6119 


2002O 


A 


6172 


73 


52 


VQLLKCRIHIF*DKVLiLCHPGWSTVARS 
* LTATS RS S CLS PPS S WD YRRVP P CLAS 
F/SF* YIALNN*A*AI ILPQPSKVLGL* 
AVAHAYSPSTLEG 


6120 


20021 


A 


6173 


325 


470 


IFTFFFLIiAFI*FFCRNR/ SLALLPRLV 
LNSWLQAVLPPWPPKVLGLQT 


6121 


20022 


A 


6174 


2 


228 


TLEDNSTIPCKTIQSSSCTPWYVTQKAA 
CITLLSM*PKKLWYVYTKSCTQMFIVAI 
FI IAKNWKQPRCLSVSE* INKLWYI * TM 
E*YPVLKRNELSSHEKTW/MEM* IQLLI 
ERS QS * KAWCLHKKLHTDVYS S Y IHNC 
QKLEATKMSFSKRMNK 


6122 


20023 


A 


6175 


3 


408 


RYLTLAIRDMQIITPVSYHFVPSGDYGQ 
KDNNKCW*RG*KMGSLMHC*W*CKMVQP 
LWKAAWQ FLK/D VKRRAI LFLG I YPREM 
KTYVHAKTCMHIFTLALFLYVHLNALIS 
VMDWMLGHP I EE I S RDS YSCVCLL 


6123 


20024 


A 


6176 


2 


237 


WVWWLIPWLAAWEAEAKGLLELGSSRL 
* \WTMPLHYSLGNRARLCLKKKKKKKFK 
VFPPQKKKKKKKKKKKKKGGAL 


6124 


20025 


A 


6177 


387 


154 


AWQWRDLSSLQAPPPGFE*FSCLSL\W 
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nucleotide insertion 














DYRHKLLTSGDLPASASQSAGITGVSHC 
AQPLSLFVFFLTNQRFVAALS 


6125 


20026 


A 


6178 


1 


239 


FFFFEMECLTVSQAGVQWYNLHSLQPLP 
PGFKQFS C\LSLPS S WD *RVPT3RPAKF 
/CVIF*DGVSHCQPGWSAWQPPIiH 


6126 


20027 


A 


6179 


325 


50 


RKFPLTWRRVSFFSLLRSSTDWMPTPHR 
HIGGIICVTQSTDSNIGLIQKHPHRHTQ 
KNV* PNV\WHPMAKSS*HPKLTITVYPL 
STWHTYTSL 


6127 


20028 


A 


6180 


157 


338 


SQHFGRPRQEACSSSEVRDQPGQQREFP 
HLYKN*KEITQVWWHMPPEAQDY/LEPG 
RSRLQ 


6128 


20029 


A 


6181 


3 


273 


HEGQSKLETFWKGFTI*DVIKNIDDL/W 
RGVKIPILTGVWEKLIPTLINNFEVFKT 
SLEEITEYWEIGRELELEVEPKDVTEL 
LQSHEKT 


6129 


20030 


A 


6182 


2 


359 


YYPAIP1jLGIKPK*KNSIYOMHTCTHWF 
TAVLFT I AKI WNQL * CSSMGEW I KKM / W 
MHTHTHTHIHTME YYVPIK/NE I I S FAA 
TWMELG VI ILTEI I QAQKKTKG I FSHLY 
VGAKIFDYM 


6130 


20031 


A 


6183 


3 


211 


NPGKLSVS INQDCFPWTPCP FQMGRC PT 
LLF\ S FYGGQAI PL I TPSVGTQI FRLKM 
/ PEVTHHFHSFP* EPLTAAALNQPSLPL 
SLLVDF/RDRGLFCHLGWSAVD*SHLTA 
ALNS \LVTQLSCPSFLSSWDYRRMLPCL 
GNF* IFRIjKMIjKSLTIFILFHESH 


6131 


20032 


A 


6184 


707 


384 


LQIFFYNRVKCVKNKSTQNQFYIFSRIY 
MHANRHGKYE I/WI IKKMPMREG * AVQL 
RMKWKI KKNTHI KLG WVHG YNLSTLG 
GQGGRIT*AQEFKTSLGNIARLHLYT 


6132 


20033 


A 


6185 


720 


363 


FFFWGRSLi*TILF* /QTGSHSVAQSGVQ 
WHHHSLLQP*TSGLKQSSHLSFQSSWDH 
RCAPPRVANFFLDFCRDRVCVAQAGLEL 
UVSSNPPTVASQSAGITGGSHRTWPFWD 
ILGIMYWGC 


6133 


20034 


A 


6186 


439 


66 


KPKTLGSKKVNSNPRVPPGK/PWGVGPLi 
GPGP*MGGSKP/S*NPSPPPPLFKTRFG 
FFPPG*RQGHYSGFLQPPPPGGKTFFPP 
S PPKKWGHKKKI PFQKKKKKERKKRNI I 
GSWTTSETVGNPAPW 


6134 


20035 


A 


6187 


83 


287 


NKMFKI FRFRF/ CGFLGQSHS VTQGGVQ 
WHDHSSLQP*TPGLKRSSLPTCWDNRHK 
PPHLVLFCFYYRIi 


6135 


20036 


A 


6188 


290 


609 


RRMDDPPFFKYFFERRVLVCHQGCS*V\ 
K* SSHFS FPCSWDHRCV/ PM PH PAN \ FN 
F FHTDKS LTMLPRLVSNS WAQV I LWPLP 
P + VLGFAGMSPCPQPP SQFCI EAV 


6336 


20037 


A 


6189 


2 


371 


RYI FEKVFAKYSELTNDSLS I LSGKSGI 
LWKKRLSLFS*Q* SYKWFSLGAIYCGMK 
* KRFIjGTSIIj/FTE*VETGFHHVDQGGI» 
E LLTS CE TIiAS A Y * SAG I TG I S HCAESQ 
STLiKKYIPGSTI 


6137 


20038 


A 


6190 


531 


205 


RWRFAIiVAQAGVQW/ HDLGSLQPLPP * F 
K* F I CLRLQRS WD YRCAPLRPA\NFFVF 

lvetgfhhfgqtXnlqlltsgdppasas 
qsagitgvsh*arpwfiiftallyk 


6138 


20039 


A 


6191 


52 


375 


NRCETSFLFFFFEKKFWFCPPGGGEWRN 
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LK* ME P \ LPPGLKE PS CLTLRRGGN * G P 
FPPIiQENFGVFRKKGVPHCNPG*FQTPD 
PRELARLTSPRGGDGGGRPPPRVK 


6139 


20040 


A 


6192 


3 


158 


FFFLLLNCGLKKETG/WLGWAHACNPS 
TLGGRGGWITR* ELKTSLANMAEPY 














GLKEFSCLSLLSSWDYRHTPPHPANLFV 
F/SFLVETGFRHVGQAGLELLTSGGPPA 
SASQSAGITGVNHRAWPEVFKK*RVSFC 
RPGWSAW 














FILPSSWDCMCAP /RPDWVLYFIjVETG 
FHR VGRAGLQLLiT AS DL PAS AS * GAG I A 
DGVSLTQCSMLPMLECRGVILARYNLHL 
PAACLGLPKCRDCSLCPAATPSRK 


6142 


20043 


A 


6195 


619 


243 


SPETGS CSGAQAG VQWHDLGSLQP * LGL 

VQAGSCHLAQAGLELWSSCHLPALVSQN 
AG I TG VS HHARIiAVAW KRT PL YGL P S R I 
SNIiCVXjLKFLFFYLiY 






—fr 








KRPPWPPG*FF/VFL+KRGFPPFGRWP 
PSLFP/SDLPPPAPPKGWVSGLRPCPRT 
FFFFFFFDKEC/RliSCPANFCIFSRDGV 


6144 


20045 


A 


6197 


498 


437 


ENLQS S I NLVDKAMAWFER I DSNFDRS F 
TTVKMPSNLTS I INACCGE I FHERKS * S 
IWQTSLLSYF*KLLQLAPPLLATTILIS 
KEP*TSR/QDRLLLAEG*D 


6145 


20046 


A 


6198 


79 


394 


SVKLlAIiFFFFFGENFCFAPQVELKGPN 
LG*LNPPPPGLKNFSGLT/LPKKWELGG 
HPPPPSKFFFLFF*KKGMAPFSQGWFKT 
LGLNQFACLTL P KGGNLRREG P 


6146 


20047 


A 


6199 


2 


313 


SASHSAGIIGESHCAWPIMPVYSLVLNF 
FFFLEKKPPFVTQLEGQGGNLG/SMEPS 
PSGLKGIFCLGPPKKWELRAPPPCPGYF 
WFF*KKGGLPIRPGRFWKSVP 














AFQQARKGTQ \GRL*S S PAAGALiP \PVIA 
PSGPPLPA* \GSSS \ PASVAPYPPPQAT 
G/S P PTP PGELRRGGWSKGGLDHPRNTF 
YCWESSEP/PPN*GQS*APGGVNSEGRG 
Q/LPPSS I ARCVP * TPSHVPPHPLAFPP 
SLPHPGPASRAP P / TGSSDLSASAVSAP 
GREGHP * GTRMVPQGAGRQKPGHRGAS * 
GSSCSKRTLSPLSPRQSGFQSRCCFHPP 
RQNHSRC 


6148 


20049 


A 


6201 


424 


7 


KNMKRS P I S CVTRD PQ I K I TMR * H YTG I 
KMANV/RLTGNTKC* *GYGAKGPLIQC* 
WECKIV*PLRKTAWQFSQS/LKIV*/PY 
DPAIALPGIQLC*KCTS\KKNCM*IFIT 
ALFIMAKKKQKPRYPSIGEWKYKLLHL 


6149 


20050 


A 


6202 


371 


144 


RWALPRRTNF\* FFVETRSLHVAQGGVK 
LLGSVSPPISPSPSAVITGVSYCAQPYG 
HPFELSFSLiIEYEYIYLiDFF 


6150 


20051 


A 


6203 


409 


110 


LRRSS*LCHLGWSAMVQSWPT*FKP*LL 
GLSDPLTSASRVDYRGMPPCPRNFIIFD 
M*FI*CVCVC/RTWGLIVLPRLVSNFWP 
EVLLHPGPSKVLELQV 


6151 


20052 


A 


6204 


15 


221 


KASRKMQIETTLRYYLSPMRIAKMQMLI . 
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nucleotide insertion 














NTSCW/RKSQETRTLIC*W+EGKMVQPL 
SKGIWPYLIQAEYP 


6152 


20053 


A 


6205 


23 


371 


KVACTCMFITVLFTIVKL*NQSRCPSTD 
DWIKKM/WLCVCIYSSVCVCLYVCIDR* 
I ERHD I YVYIME YYLALK* /DEI MFSAA 
TRVKLDAII*S*ITQNVKCCIVSTHSGN 
LIMVST 


6153 


20054 


A 


6206 


113 


417 


SAVTLLVI YLNDLKTYVHTKTCTQMFTA 
ALFIIAKI*KQLRCPS\LIWIN*C*KEM 
SYQAMKRSYARNLWSSKKLWRNLKCLLL 
RERSQSEKSAYCVIPXI 


6154 


20055 


A 


6207 


510 


167 


GLGQAISPSLPKSQGIGAYGFGFFFFFK 
R*DLVCCPG\SLNLVASSQSSCLSLPKH 
WDHECEPPCLAFFFFLL*RQGLAMLPRI 
LPLLKALASGGGMAKKKCIRYFSMKIHL 
QAQ 


6155 


20056 


A 


6208 


418 


40 


S YTHTHTHTHTHTHTH IP* *LTGS/RLQ 
GICRKGSRKHPRGIiSQLiLP 


6156 


20057 


A 


6209 


1 


258 


SGFLLS*VPSVRDPTGNRTV*LTWQPLP 
APJjE F \ CP RLS D 


6157 


20058 


A 


6210 


229 


310 


DFSEAQKSRQCGGGISNVNHPGWLKPV 
IQHF*DGVLLCCPGWSAW*SQLP*LPR 
LK* S S QLS LPR / S WD YRHVLQ \ HPANC F 
VFFVEMSLAMLPRLVSNSWALAILLPQP 
PKVLDDWLEHHTWM I YITYATTTLP 


6158 


20059 


A 


6211 


496 


1 


CCQES FE I LGS INS FGPGLKKGGVKNMR 
PGAGPTLNKRPFSDNPLQPGRRGGCFPF 
VSQRGSQKDRVPPKWFFFFFSF*DRVSL 
WPPGPSARKPSWLPVASTSR\V*QSSCP 
PLWFPGPPTMVSYFFFF\KRQDLPMLPR 
LV*NFWPQVIIiLSQPPKVLRH»QARA 


6159 


20060 


A 


6212 




340 


VTAGI I I IFMI*EAFASNRKVLIV*EPS 
I*SLQV/*WRGLLLLGSPPPYHTLEEPG 
YVTSKRKRKESNPPKLiASSQPHGLHDFF ' 
KKKKRGGPFKRILEGAPFGGGWGHREFF 
P 


6160 


20061 


A 


6213 


60 


493 


KSQQNFFCFETESRSVTQAGIQWHHFSS 
LQPPPPGFXVISYL/SLKSNFSGYP*DF 
FCNGPPCS+GPIRFTYYICVIPSSWDYR 
RLPPHPAKFCVF \ I KTVS PG WS QTPDLR 


6161 


20062 


A 


6214 


414 


1 


LCNGRPGLLiVSPIjPIjGEEIjIPCLiPR*KTj 
GPPRVL* \KGGPFL*SSFAPFF*EMESH 
SVTXRLECSRAILVHCNLCLPGPSDPPP 
SAS * VAGR I ARAQKVEAAVSHDHDAALQ 
PEQHS ETPSQQTNKHNQTKLTQKYP F 


6162 


20063 


A 


6215 


2 


403 


SLLLPLVQLPLRLCFFFFLKRGFNFGAP 
DKREWGEFG \ *LKP WLPG*NNFSGQTLR 
RSGD* RKTPHGGVNL VF * EKTGFSNLGR 
GGLKPPPSGNTPAWPPKGVGITGGTPPP 
PPVFCFLGFCENTNLRAKKKFI 


6163 


20064 


A 


6216 


2 


388 


KMEQTLWKTDWEFLTKIiNILYPYSPAFA 
DLGI YPNELKT/C/ CYTNTC I * KS I APL 
FIVARTWKQPGCPLRDGWLNKLGCIQAL 
EYYSP *KRNEL*NHNMAYKKLKLI FLLR 
QEDRFSLGVQEQPKHNGEM 


6164 


20065 


A 


6217 


363 


302 


NKEFKTTIANMGEPPFL* KRGFS \MLAR 
WLNSLFHDSPPPASQKAGIPGVSPPPR 
PLFFFFFLLETVSHYVAQAGQLISNYDL 
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Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 

ding to 
last amino 

residue of' 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Asp.irtic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 

K=Lysine, L=Leucine, M=Methionine, 

Q=Glutam1ne, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KLPSCLSLLDTWDYRH* PPYPVFKLLNS 
TYKAC . 


6165 


20066 


A 


6218 


576 


6 


RAVkRGLLVL.LLTKHTFFLFPPRQSLAL 
LPRVGVQW\HDL\GSL\QPIj\PPG\FKQ 
FS\CLSIPSSWDYRYPPPRPANF\*FSV 
ETGFHHVAQAVLELLTSSDLPALASQSA 
GITDLGQSETIHGGSGLDKPPDHKVDKG 
PTEERACLLSYSAGRKYKEYHILPEQGS 
EPPHHGDILSISFQAASHIAQTPPAHT 


6166 


20067 


A 


6219 


44 


496 


G FRVD PRVR P RVRLN S*PRDPPTS GS AG 
ITGV/ SSHHARP * N I FT*KIiG I S FLVFW 
GFLFLFFVFCFFFFWKGGFLLVAQVEGK 
/WPKFGLIEPSPSGVKEFFFFNPPEGWD 
\YGLAPPPGVIFWVLIKNGVSFFCRAWV 
QTSGLSWINRLNLP 


6167 . 


20068 


A 


6220 


1 


421 


QRGRLTAIS RETS FTLVAQAGVQWHDLG 
S LQHLP PG FQG F P CL S LLS T\TGNCSVT 
QAGVQWCHLGSL*PLPPGLKPSSHLSPS 
SSWDYRL* KPHQGSD* VCVT\GAASPEP 
NTMTAWNKPPDIRMKETLLLPPSSSEKM 
L 


6168 


20069 


A 


6221 


3 


36 


AYCI VKPS P* PLTGALS ALLMTSGLAM * 
FDFHS IT'LL I LGLLANTLT I YQ * WRDVT 
RESTYQGHHTPPVQKGLRYGIMLFITSE 
AFFLAGFF*AFYHSRLGPTPQLGGDWPP 
TGITPLNPL* VPLLN\ TNE* LA* GVS I T 
*AHDP 


6169 


2007O 


A 


6222 


895 


13 


ANSWANN* FELATQ I AFSGR I PLTDSHL 
GLLKAHVEAPI\AGLIVLAAVLLKTGGL 

EA*FITSSHLPSDKQDLKS\LMAYSS\I 
SPHSPS*LTPILIQTP\W\SFNGAGQFS 
I IAHGTYHAYITYSA*ANFKITERTSQC 
AC I NPAAQGLSNS * LQLIGLFE * LPRQA 
\SLTMGLTPPLINPTGENCSVASNQRFS 
/WTQISLLLPYRDSTILGSQALYSLYIF 
TTTQWG S LTHH I NW I KP S FT RENT LMF I 
HLSPILLLSLNPDI ITGFSS 














HS YALVYNPVGIRVKVHLSKC I LYT WH I 
SNSSQEVCEMPNLNLKRVPTTLAPRF*N 
IS*GISQLGAVAHTCNPSTLGG*GRW /I 
QQLKTSLANMAKPHLLLKKKKNFPNFPK 
RG 


6171 


20072 


A 


6224 


591 


283 


GTSTQAGVQWCNCGSLQPPPPGFKRFSC 
LNLPCSWDYRRVPPHLANF\*FLVEMGF 
HHVN/ Q AGL E LLT S S D P P ALAS Q S AG I T 
GMSHCARPRVAF+GEKNHRSL 














VI LPLWPPKVLGLQA 


6173 


20074 


A 


6226 


417 


81 


TQTRGPPPPPFLGGPPSGPERTVPPPRG 
GVPQTGSRFFLFRQNPGPALFPKN/EGG 
F* KKFGPPIiPPF*KTPFFFTNRGNPCPF 
FVGENPQAPPVFGGFNPFPRKKKKRNDF 
LS 


6174 


20075 


A 


6227 


41 


474 


TDATETFHFTSKH\HFGF*AAA*YWHFV 
DW*LFLYVSIY**GSCVPHPA 


6175 


20076 


A 


6228 


503 


28 


AAHAPPRVRKVIGH*AFFTQRCTNSRH* 
I I KFPKDKDKARILKGAREK/RTTYNGA 
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NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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od 
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NO: in 
USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=IIistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














PIWLAVNFSVETLQ/A/RREWHDMFQMP 
KEKTFYLRILYPMKISFKHEREIKTFPD 
KQKPRDFISSRPVLQEMLKSWPGARVHT 
GNPTLLGG 


6176 


20077 


A 


6229 


418 


175 


VSPLSPLKILFFPKGLKFWREGGPKCPP 
PKKRVLPKK/PPAMF* IPPPKRKKSKLP 
PPRKIWPPQGIFKSPPPQFYFFFFFF 


6177 


20078 


A 


6230 


45 


308 


ACVRLKH*TDN*QPNIYNQP/HKSLIiP* 
KKKKKKKKKKKKKKKKKKKKKKKKGGGG 
FKKKPGGGPISWGGEKIIFFFFGGGKKN 
TLGIF 


6178 


20079 


A 


6231 


440 


209 


IQHFRGPSRRGLL.GPEARASPGQHRESR 
SFLKMQP\NKKFS*VWWCTPAIPASPKV 
EARESLEPGRRRLHLTEALQPG 


6179 


20080 


A 


6232 


419 


2 


SSPPPPGGKFF*KKPGKKKFSPPKKKRV 
FSPP\PPKKFFFSPKA*NFWGGGGQKFP 
PPKKGVFSKKPKGVFKTPPKKKKKFFFP 
PGENWG P PRVF * KGPPPFFFFFFFFFFF 
FLS FNVNI L.FKPVQAPCLT 


6180 


20081 


A 


6233 


448 


112 


QSARITGVSHRTRPMLRTSTPLSATDRN 
AGQKI SKDIADLNITNEQDHRDIYRNSTQ 
* QQNTHSFQAAIRNKIDP ILGHKKMLTH 
/ CKQSDQTWNKLE I SNKYAKPQKI KHT I 
K 


6181 


20082 


A 


6234 


2 


351 


MDQGI I/SDFESYYLRNTFCQAIVARDG 
DSSDGSGQRKLKTFWKGFSLL.DAIKHTE 
DSW/EEIKISTGVWKLIPILVDDFEGFK 
TSVEE/VADVMDLSRELEL*VEPEDGTE 
LLQSHSQT 


6182 


20083 


A 


6235 


3 


252 


FHTYNSLLVCHSNLCYQNSLLFFIIH* / 
CI /HFHS * AKVS IWKHTYTHTHTHTHTH 
MHTHFFQATDRKICVNVYMFSVAYIHKG 
LQ 


6183 


20084 


A 


6236 


3 


294 


I ITF* LPQLSGYIEKSTPYECGFDPI SP 
ARVPFSIKFFLVAITFLLLDLEIA\LLL 
PLP* ALQTTNLPLI VMASLL* I I ILALS 
LAYEGLQKGLD*AE 


6184 


200S5 


A 


6237 


3S1 


121 


PVPASF*FFFFYRDR\VSLCYPGWSQTP 
GLKQSSCLGLLSHWDYKYDPPHPAFISF 
I)LS S S LL W VL * R P WLLCAT VP AFLD I R 
EQ 


6185 


20086 


A 


6238 


395 


62 


PFFPIPKFFPWGPNFFF/PLGPPPFLGF 
FPGEFAPPSHFSPPD*NFPRPPKNFPPG 
GDLFPQKKGDWFFPKKRGVFFFLGGFSK 
FSGGPPFSLGPQKKKRFKIKGPKJCFCF 


6186 


20087 


A 


6239 


1066 


761 


S FFLRW S F I L VTQ AG VQW CDLS S LQL P P 
PGFK*FSCLSLPSSWDHRRPPPRLANFC 
IF/M*RWGFSMIiARPVSNS* P/CSDPPT 
PASQSAGITGLSHRAWRLVFL 


6187 


20088 


A 


6240 


390 


27 


GELPTP I YLAPPAS PKGGVSGGS P PSRV 
KFFPLEVPPPPPPGC* PGPFFYI WPQGG 
ARVP * QQIRFFSLRW\ PPRPTRPKNPHL 
QVGPLRICLSPETGGGVIKVFRGSLLKK 
KKRAAASRSN 


6188 


20089 


A 


6241 


366 


1 


KIKNFDRPI/TSKGA*SW/RNLPTKKS 
PE PNGFPG VFCPTFKKKLTP VF /SKLFP 
KF * KEGI LPNS FF * S S LAL I PN PF YPSK 
KKKKTKKNYRAISLMWIDEKIHYKY*QT | 
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nucleotide 
sequence 


SEQ ID 
NO: of 
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M 
eth 
od 
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■USSN 
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beginning 
nucleotide 
location 


Predict- 
otide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L= Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vnline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














E FRSTLKVLYT I TK 


6189 


20090 


A 


6242 


23 


439 


IASGRPFFFFFFFFFCLPPPPGGNPSFS 
FLKKKKKGGGGGENPPNPPGKAPNPQGV 
GVPPFPPPPVFQKNPPPPFFF*KNPPPS 
PKGPPGGPPFFFPPGGGGGPPPT/PYFL 
KKKGGWGGPPFF* KTNPPPKGEITPLVG 


6190 


20091 


A 


6243 


20 


261 


WAQEILKKFSCLSI.jLSSWNYRCAPPHVA 
N\FVFFVETGSYHVAQSGLELLDSRDPM 
TGESHHAQPHFC* ILYTGESQRIL 


6191 


20092 


A 


6244 


376 


76 


LLPRLECSGVIKPFSHLSLPSSRDYRCV 
/PTMPNF*KFFVELKFNSL*R*GLTMLP 
RLVSNSWCQVIPLPWPPKVLQGCATTPG 
LTDF I TTFKGYRHGDT Y 


6192 


20093 


A 


6245 


222 


233 


KSFYFALIFEKNFCYVKKSQVGLVFFSK ' 
VKVXiVRPFFPMVIKKNQP/FNPL/RFFF 
FFFQDRVLLCHPGWSAW* SKTLS 


6193 


20094 


A 


6246 


2 


358 


EI ILTFQSYY*ENTFNEAITAIDSDSSD 
GSGQSKWKSFWKRFTILGAIKNICDL*D 
*V/KISTLSGVWKKVIPSLMDDFEGF\K 
TS VEE VI ADWE I AR* LELE VEPEDVI E 
LLQSHDKT 


6194 


20095 


A 


6247 


3 


322 


PDHSLLQTPTPGFRRSSHLGFSSSWDYR 
LEPPHSVCVCVCV\FCVYIYFFFFLEKT 
GFGFVPQAGLNLGVXRI FKA* PPGGVE I 
PGLTHTPGPWGGP IKGKGANPAP 


6195 


20096 


A 


624S 


1425 


1280 


SLDFQGLALPGWSAW*SCLTEASTSR\ 
VKRTSHPSLPNTWVYREALPRLANSVYF 
L* RWGLTMLPRLGSNS * TQAILQPWPPK 
VLAL*ADEEMKAKGHISTKWGGQTLNPQ 
PDCGATSVYSLPIRKFQ 


6196 


20097 


A 


6249 


189 


336 


HAP FPVSVI SLS CQKRRRX WLAMVAHTS 
NPSTLiGG*WGWIA*GEE FETSQ 


6197 


20098 


A 


6250 


2 


300 


FLPSFMGFWPVL.GHGRASTVTPSEPLSL 
PFLLPAEPFLFFFFSLFRKRESIFGA/P 
GGLEGGQFNFLEPSPPGIKEIFWPNPPR 
GWELKIWPPPPINFCDF*EKTGFTMVAR 
AGLKL*TGGLEGGQFNFLEPSPPGIKEI 
FWPNP PRGWELKI WPPPP INFCDF 


6198 


20099 


A 


6251 


1 


198 


LTGRL/SKCEWSWRFDVQLKDLKRWQN " 
HLLLSQQLACI VLTSAGIMDHKEMR* KH 
TGGKSLGFFF 


6199 


20100 


A 


6252 


11 


214 


SbRLKSSSHLSLPSSWDLRCELPHLAN* 
KKKKKL/CKN/RSLTGLPKLVSNSWV*T 
I LP PR P PRALGLRA 


6200 


20101 


A 


6253 


402 


3 


KKGFVLSPQVEDKDAISGSFNPGPPGEK 
IPPPQPPEEVGPPGGPPQTRGNLGI/LG 
N*GGGGGF*PPGAGKSPAPGPQKG\PFG 
KDKREKPYGPPPKLFFFFLRQSLCFAQA 
GVQWCDLGPLQPPPPRFKQFCL 


6201 


20102 


A 


6254 


362 


78 


VGVN/MTFFPQHFLGLSGIPRRYSDYPD " 
AYTT*NTVSSIGSFISLTAVLIMIFII* 
EAFAS KREVI S VS YASTNLE * LHGCPPP 
YHTFEEPTYVKVK 


6202 


20103 


A 


6255 


357 


2 


DLSCHSHGEATQKHSEKCKASCTHMFIT 
AIjFLIAKHWNQPTC/PHTMEYNSVIK/N 
EQT I DTTTTWMDLKCI MRHE QRQS * KNS 
VLVCFHAADKDI PETGQFTKERGLMDLQ 
FHVAGEASH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspnrtic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryprophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6203 


20104 


A 


6256 


2 


305 


QWCNHS S LHSNPGERETVSS LKLPKAWN 
PRHMLLCPTTSFILRWGFIYVAQAGFDT 
RGHAS L / LA* AS PS VG ITSKS QRVRPVH 
DIACFHRESTMFSAENFN 


6204 


20105 


A 


6257 


335 


3 


GPFQSPPPGVTPISCFTPPKKGGNRGPP 
PRRGKFWI FKKNGGSPG* KRRVLFPDPG 
VPPPRPL.KKGGLKGGAWAKI FFFFFLRQ 
SHS FTQAGMQWCNLGSLQPP P PGFK* F 


6205 


20106 


A 


6258 


219 


2 


PTLS S QKDT KAQRL YS FPLTTTKDE RLV 
TLVS/WPGWAYTFIPSTLGGQGRWVT* 
AQKFETSL.ENTVKPPSC 


6206 


20107 


A 


6259 


306 


117 


SFELliGSSDPLISAIiSSWDYRCAPSRLA 
NF*FFVCVEMGSSYIVQ/SWPQ*ILLSW 
PPKVLRLQT 


6207 


20108 


A 


6260 


572 


269 


ETDILSVAQARVGGLHLGSLPAPPPGFT 
PFSCLSCSS *DYRRLPPRPA/NFFVFSV 
EMG F\ TVLARI VS I S * PRDP PPSGLP * G 
ALGLPGVSPLRPALCIjLLK 


6208 


20109 


A 


6261 


357 


116 


FAQ AG LELMG S S S P LAS AS Q S VG I TGMS 
HCA\RLL,LFFCGGRW*GG/LK* FSCLSL 
LS S WNYRHEP PHPANL.C I FS FNTISY 


6209 


20110 


A 


6262 


3 


337 


HERHERHERHEQ S AE I TGVSHHAGGG I G 
LLNFFLFLFFEAGSCTVT*T/GV*WHDH 
GPQ * PQ I PRFK* S SHLSLHS S WDYRHVP 
P*LGI..QACATMPG*FFIFCRDPISEINK 


6210 


20111 


A 


6263 


515 


150 


FLRWSFTLVAQAGIQWCGVSSL.QPL.PPG 
FKQFFCL,SLPSSWDYRCPPPCPTNF\*L 
LVEMGFHHVGQAGLELLTSGDLPASAS Q 
SAG I TGI SHRARPGTL.FF* AANGGGKQ V 
FLLLRVLNGL 


6211 


20112 


A 


6264 


1 


296 


GTS FLPS FLP / LLS FLKFLNFFQI VSHC 
RPGWSAVA*SQLTAPSI\FGLKPSSQLS 
RLS S WDYRCP I NFF \ SLPMLPRL VSNS * 
PQAILPTQPPKFLGLQA 


6212 


20113 


A 


6266 


298 


182 


LHFFEVSKIIKLIRAEAKWCL*VLFTIA 
KIWKLH* * I CKYLS I DEW I KKMWYIHTV 
EYYSAIGNKEI/PVIVTIWI YMENIMLS 
EIN* *QKDKCMIALL*GIYIQ 


6213 


20114 


A 


6267 


402 


234 


PPPPLFKNPPPPPFKNPPPKKKKIIFP/ 
SPPVNLAPPKEF*KGPPPFFFFFFFFFF 
FF 


6214 


20115 


A 


6268 


125 


17 


KGQAQWLTPVI PALREA/EEAGG* L.EPR 
SSRPAWAI 


6215 - 


20116 


A 


6269 


1109 


454 


GDSAGAEGTMENFTALFGAQADPPPPPT 
ALGFGPGKP P P P P P P PAGGG PGTAP PPT 
AATAP PGADKSGAGCG PFYLMRELPGST 
ELTGSTNLITHYNLEQAYNKFCGKKVKE 
KLSNFLPDLPGMIDLPGSHDNSSLRSLI 
EKPPILSSSFNPITGTML/VRLPPPHWP 
VAGAVSSDAYSASQEEE*AQAQTEPYPG 
SCPPRNTI *FRSQEEEKEKRRGS 


6216 


20117 


A 


6270 


111 


86 


WGITGVSHCAWSRNGTLTHCWWECEKV 
/ Q PLR KTV * Q FLML KI \ P YQM I * P L.LG I 
YLRKNKAY I HMKTLK* MFSLFLFE I AK/ 
TWKQPKYPSASEWIN*LDSFSLYWLGWS 
RTPDLK 


6217 • | 20118 


A 


6271 


2 


222 


RSCESNSQTSACYKSFYLYPLKS/GGFK 
IXjERM*PIYFLIPYVVTIFNQRYYFYIW 1 
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M 
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od 


SEQ ID 
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Predicted 
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nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

"ocation 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K~Ly stae'^Leuti'ne "iVl-MetlTionine' 
N=Asparagine, P=Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 

X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NLKKTRGIYTVLYKMYKSF 


6218 


20119 


A 


6272 


787 


470 


LPFFFSFFETGSHCWRWERESLGVAQVG 
VQ*CDLRSLQPPPPRLKQSSHLSLPSSW 
DYRCAPP * LDKFFFCI FCRDG FHHVAQA 
GLKLLSSG\NLPASVSPSAGTTL 


6219 


20120 


A 


6273 


2 


322 


FCKAIAAIDSSSSHGSRESKLKTLWKRF 

LIPTLTDDFQRFKTSVEEVTA\ELLVVE 
I TRE I ELEVELDDVTELLQPHDQT 














ALIFVGRLSSFVPPPKIRFFSKNSRNLF 
/ CPVGEFFKTRPPVNFFFSLP*RKSYFF 
SFPG* I WPP*GIFLKAPPFFFFFFFLRE 
RREVSMLPRLMSNSWVQVTPPPWPPKVR 


6221 


20122 


A 


6275 


2 


181 


PRVRGRVGHRKGKKSFVHCWWQWKLL* P 
LWKTVWRVLKTLK\lELPYYP/ATKLL* 
SYPLE 














PPPSKNFFFPPQKDFFLGGL PKKS PFPK 
K\FFFKKTPKKNFPPPSFFKKPPPPAFI 
KTPLKKKKKFFPPPQKFGPPPKFFKKPP 

TKTFYL 


6223 


20124 


A 


6277 


2 


264 


ETESHTVAQSGCSGAI I AHRSLNLQGST 
NPPTTAS * VAGTTDACDHAWLMF * FL \ V 
EMGSHYVAQAELELLSSSDPP\PQSPKI 
LGLQA 


6224 




A 








IRCYRKIFYERKHQWLQQISLLSYFKKL 
PQP \ PPL *ATITLISQQSLI SRQDPPPV 
KR**IjTEG*DDC 


6225 


20126 


A 








I NKANRD KL /GT I DFLE FLT I KVRKMQN 
TKTESKKRN* ES I LHVE* EDTGHVRAAA 
VCRDMTYLGEKLTDEEVDK\ I I REAKYW 


6226 


20127 


A 








SSSSPKACMFVYEWVCTQKCVWIYEYMC 
VFVYV * LVLMCCMNVCACM \ C * S ACV * M 
CVICECVCM*GLSVICEGVCM 


6227 


20128 


A 


6281 


459 


235 


FLRNGVSCCHPGWSAWQS*LTAASNS* 
AQVI LPS S WDYRP VTSHPAKTHNFYMQ / 
CGNKKMCVTHS I VI FS LLQ 


6228 


20129 


A 


6282 


505 


470 


GVP*GQEFEPSLSHMGRPCFFLKCKINP 
AWWQKLF * PRRQKFRLTQ I / LALAPQPG 
QQKVNFFS/NKKKKKNPNNC* IALR 


6229 


20130 


A 


62S3 


206 


374 


GGGGGKKKNWGGEKKRGGEKKIFFF*SP " 

PRFFFSPPYKKKKFFFPPPFFFPPPNFF 

FYPPPPPFFFFFFFFFFFFFFFFFFFFF 

FFFFFFPFFFFFFLNHFF\FF*LYVIIV 

LFMGHSDILMHVKNV 














YXKGGPPPEIETFLI FFYYIYLI I FIFF 
FFFFFFFFFFFFFFF* FVCLKYLI FNHI 
QRF*TYISIPHFMLLM\YTLHLFTFCIC 
SLFTIIVVFYIFV+L+YCCTVKALCNSL 
LVSELDKY I KNNYNCKQ 


6231 


20132 


A 


6285 


298 


44 


YLFFFFF*VELVFFLFCMFFFFFFFFFF " 
FFFFFFFFFFFFFFFFFFEFKNKVSFNS 
CFSLSLFKLYFLKYNFIQLQGNLHFNNI 
I 


6232 


20133 


A 


6286 386 


1 


ANLHIKVNFWNFTLLNFSFLPVWFGFH 
FLW*RVYLSHRGFYVFFQNFLRIFFFF 
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ding to 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SF*VG\FFFLFFLLFFFPFFFFFFFFFF 
FFFFFFFFFLCIAIVFY* I WYISYFTKY 
IEEPRTRGRTRGRTRG 


6233 


20134 


A 


6287 


190 


134 


LE FGRNR FQFSRM ILRGSGLQWRNYADM 
VKIAKT* YLRNTFHKVIAI IGGDSCDGS 
GQNKIj KT FWKG FS \LDAING I HNSQEE I 
KMATLTGVWKK* I PVLKDDFEGFRTS VE 
E / V/TADMVKI AKT \ LELEVE P \ EDVT* 
LLERILF 


6234 


20135 


A 


6288 


394 


2 


RRQAS LSCGGLHP VRS SRLLCL P KQAWA 
MAGAPPPASLPPCSIjISDCCASNQ*DSV 
GVG PS KPGVGYHLLVHRFLS PliE KRS I R 
VGVARFSRCRL,SPLSIiTRKGN*LP\TPC 
ASRVRQCLTLLQliAHGALKP 


6235 


20136 


A 


6289 


233 


58 


DRVSLCRPRWHWSRTRIiTVASTSR/LQ 
AILPPQPPE* LGPQAS ANRS PRS PKLTG 
LRG 


6236 


20137 


A 


6290 


385 


48 


TLQYLCTPTSTV*KKKKPENIKYW*QCG 
ANRTLFQ*WOVCKVAQTLWKPVWHYTVL 
NLNTRI VYDPGMPLLGTNNNAYMCCI C\ 
YQKTFTRMFIAALFEITPK+KMHKYLTV 
VG 


6237 


20138 


A 


6291 


273 


36 


RNGRWT?CGGR*IHPS*GTDPVYIjPVPG 
TIiSSSQASNSHPIiLKAPGSRP\PWPTDL 
GYKDPRGPPSNPACRRRPSLFHSD 


6238 


20139 


A 


6292 


394 


5 


AKIYFSINPLVFSKQ*FLKNVFPNILHL. 
QFFSNINSLMLIKV*ALLQDFPLVHNVY 
NKICDTS KDTTTL.FMYLMFVFKMKTFFS 
LKAYI /WSGTVTHTYNPSTLGGRGMWIT 
*GQE FKI S LANMVI AS DY 


6239 


20140 


A 


6293 


536 


190 


RRSFALAVQAGVQWRDLGSLQPPPPRFK 
R FSCLSLPSSWNYRHAPPRPANFVFLVE 
TGFLHVGQAGLELLTSGDPPVLASQK\A 
GMTGVSHRVQPD*GHGGSRLSSQPFWEA 
KTGGSPEVRSSRPA 


6240 


20141 


A 


6294 


2 


238 


HEADLGTTQVLTHPSTTAMYFVHYCQPP 
* ILYGTINT*PPWHKNPIHIKTP/SPC 
LQASTAINPQLSHINC\NPKDTPH 


6241 


20142 


A 


6295 


2 


447 


FFFLMGVSLFCPGL/WSAMGAILGSLAT 
SCLLGVQVILLCLSLPSSGKLQVCAYPH 
LG+FLYF*AEDGGFSPCWPG*VSNS*PQ 
VIHX.PWAPKALGITSMSHCCPACFHCTF 
FRHFRQNPHLNLFFFLEGLGLSPPPPSQ 
PLEGGKGPGN 


6242 


20143 


A 


6296 


2 


133 


AEERERERERERERE \RERERERERERE 
R* VERERGGVPLRTYR 


6243 


20144 


A 


6297 


3 


244 


HDYSS FLAPPALGYMRMPLHPAYF I LFY 
FIS/FYLFFSQDEVLLC*LGWSQTPEIjN 
QASCLGLQKWWDYRCEPQCLAECDS 


6244 


20145 


A 


6298 


1 


361 


GTRRSAMLARLVIiNSGPQ/ cylpaxasq 
RAGITGVSH*GCPFFSLLKKRWRSHCVA 
QAGVQWFFRG*TPGLSCFSCSLPSN*DY 
KCAPLCP P * DTVF * VRAVSSE VPL.T I TY 
RGEGAGVWA 


6245 


20146 


A 


6299 


3 


348 


HERHELEELIKNLFFFFFFGKKIFFFFP 
GGKNGGP F * FIGAFS FRPQG IFLPYPP* 
KW\DWGPWPPCRGDFWFF*KKTPFSILA 
RWLNSPPWGILS PYPPKGVRFPGGTPR 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y= Tyrosine, 
X=Unkno>vn, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PGPV 


6246 


20147 


A" 


6300 


1 


436 


VDGEFSCLSLPSGWDCRRASPCPANF/S 
VFLVGIGFRYVGQAGLELLTSGDLLASA 
SQSPGITGMSHCTRPRKHH* IANSSSVXi 
GDFRIDYFQIS RKAQW I TLG YNPWAESH 
LLMGAGLRVXiPAPRLMISQVDLiLPATHP 
SSGRRG 


6247 


20148 


A 








P KG * S TLPAS AS RS AG I TAV ILi*SHCTS 
PPM*LRFKHFTHSSTSTPF/CFFFFGKK 
ILI FVPQPGGKGPNLS * LKPLLSWLRKL 
PCLTL*RGGDYPGFPPPP\SYFCFFIKK 
GV * P CGPGGNS * PAS AS RS AG I TAVI L 


6248 


20149 


A 


6302 


1 


285 


KHFFLFTYLFRDRVSVCYPSWSTWE/S 
L*SWLIAASN/CLGSK*FSHLSLLCHHA 
LL VD FN FL * RRR L> I RQ PRL VS NS WAQA I 
LSSQPPKALELKT 


6249 


20150 


A 


6303 


194 


351 


GTFICI KKPVQ / WLDTVAYTCTPSTLGG 
RDGRVT*AQEFETS LGNIVRPHVYKK 


6250 


20151 


A 


6304 


354 


211 


YRHMP/ S*LADFFIFCRDSLFML>PWLV1j 
NL*AQAILLPQPPKVLGLQE 


6251 


20152 


A 


6305 


1 


346 


GTSSLQLISLHSSPTASYPPKHLHTRLT 
ELLSVSLMAHVLS VSPATSS /PILQPHP 
IYPCTWIMFTHSSGSGKPSCQTHTHTYT 
HTHTRM CAHGSGHATL P P * H P LHPRH S A 
YLLW 


6252 


20153 


A 


6306 


395 


177 


WLNTl.WYIHTVDYFLAMKRSDLLIHSPW 
MNLQGIMLSEKRQ\SQR*HW*FYLYNN 
VEMKKSQKWRTVLWLPE 














ITGKSHRALP\KFAFLTSSQVMLTLLAH 
GTTAWFSKLS * F * EPPMTCAVFS WTG 
SCH INS FRIQTCL * YYSVFTSCFYCSM I 
EKLTTGI S * QKEHELPFSQSSDI FK1AY 
LIFFW/CWEMESHSVSQAGVQWHNLSS 
LQPLPPRFK*FSCLSLPSGGDYRCPPPH 
MANFC I FSRDGVLP C * PG W \ S * T PDLRM 
DPPGLRPFQSAGDLTGMEA 


6254 


20155 


A 


6308 


377 


67 


PPPPKIFFFSQTLFFFWGGCPQIAPPPK 
KFFF* KNPQGFFFPPPKKKKI FFFPPPK 
FLPPQKFF*KPP/LSFFFFFFFFFFFFF 
FFFFFFFPTI*IFYSMSFLP 


6255 


20156 


A 


6309 


1 


362 


QTYAKMHVTI I FFGVK\LTFLPQRFLGL 
S GMPRR Y SD YPVAYTT * NI LS CVGS F I S 
LTALILI IFMIREAFASIRQELIVEEPS 
INLE*LLGCPPPYHTF*EPGYLKPKRKR 
KESSPPKLV 


6256 


20157 


A 


6310 


2 


411 


VLPTLWEAGARGSLEARSSRLQ*AMIT\ 
HCTPPWAT 


6257 


20158 


A 


6311 


59 


334 


THACGNMLSYAGSFISLTAVLLITFMI * 
EA\ FAS KRKVL I VKEPS INLE * L YGCPP 
PYHTFEEPVYIKSRQKRKESNPPKLVSS 
QPHGLHDL. 


6258 


20159 


A 


6312 


3 


22 


KPSP + PLTGALLDLLMTSGLSM* FNFHS 
I TLLI LCLLTNTLS I YQ * WRDVTRESTY 
QGHHTPPVQKGLRYGIILFITSEVF\FF 
AGYF\WPFTHSSLTPTPQI*GHWPPTGI 
TPLNPL*VPLLNTFVLIjALGVSIT*AHD 
P 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AI:uiine 
C=Cysteine, D=Aspnrtic Acid, 
E=GJutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoieucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Y=possible 
nucleotide insertion 


6259 


20160 


A 


6313 


11 


182 


1NKMW* I*VRG* YSVIKK\DEILjSFMAT 
WMEVEDILLS KIREAQKVTHPVFSF I SG 
S 


6260 


20161 


A 


6314 


3 


421 


IQASTMAGAPPQASLiLPCSLiIIjDCCASN 
S/RGSVGVGPSEPCTGYNLLVCLLLRPS 
EKCS I RVGVT * FSRCRPS P/TRKGHS PT 
PCTSRMR 


6261 


20162 


A 


6315 


1 


111 


DKGMNVFMPANTASVLWPMDQGVILNFK 
SFYLRN\TFSKYISAIDSDSSVGSGQSK 
FKTF * KGFTNLDAIHDS WEEI KIST* TG 
VGKKli I \ PT FRDDFKG FSTS VAE VS ADV 
VE IGREL*LEVKPGDRTELL * SVLWPMD 
QG VI LNFKS FYLRNNI F 


6262 


20163 


A 


6316 


440 


312 


TSSFSLSFF/CQDG/ SLTMLFRLVSNS* 
PQA I L PS WP P KVLGLQL 


6263 


20164 


A 


6317 


5 


404 


PRYMIVKLLKI/ERNKILEAAREK*HFT 
SKGKAIWIIVDLSSETMEVRKNLHNFQV 
LKEK/NKTWEVRILYSAKISFQKKGKIK 
IFSDEGKLR * S FTS RPTLRE * * RKFS K * 
K*NDRRRNLKTFGRKKEYPKELKP 


6264 


20165 


A 


6318 


394 


182 


MFI VA/ VFSKAKEWRQPKCLSTDEWVy Q 
YM * Y I HTMKYYS AI KSNELV IHVAA* IL 
FENTVLS ERSH I WHGSH I 


6265 


20166 


A 


6319 


1 


378 


APSDFFLFQSLNKPLNCTHFSSVHNVKK 
TSLT*LNSQNPQFLRDEPNGWYHCL/QK 
YLELDGAYVKK 


6266 


20167 


A 


6320 


286 


314 


NRCCFSLPTQVKCFNLYLGICCVCDWGY 
YRWIJtfF *NP I F * FI LRQFHS VAQAGVQ W 
RDLGSLQPLTPRPANFYILVEMGF/TPL. 
LARLVLNS*PSLIiKYKN 


6267 


20168 


A 


6321 


23 


397 


QKTLNMEKEMNIYIHKM*ITPNRSNIKR 
SSLRHITVKFSKVKDKQRI /MKLAIEKQ 
PITYKETTIRLSEDFSAETL/NARR*WD 
DIVRYWLKKKKKKKKKKKKKRGGGLLKN 
SLGGPILTGGGKEIF 


6268 


20169 


A 


6322 


411 


2 


TE PPPPGGNF \ L * TKPGKKNFLP PKNLG 
FFSPFSPLKFFFSPK\P*ILGGGGAPFC 
PPQKKGFFPKIPRGVFFSPP/YKKKNLI 
FPPPVILGPPKVFFKGPPPFFFFFFFFG 
QSGQVKLKSPKCK 


6269 


20170 


A 


6323 


395 


280 


MLARLVSNS*P\CDPPGLASQSAGITGM 
SHHVPQVNHF 


6270 


20171 


A 


6324 


3 


359 


QHYLVFRETQIKAAVRCHYI PTGMAKI I 
KTGSTKC*Q*CGAAGTLIHCWEECR\LK 
QPLWKTAW * FL I * MLCD*TVP FLQI YPT 
EMKMCVCRMTC\RVFLSVLFI IASSWKQ 
PKYKKLVN 


6271 


20172 


A 


6325 


3 


174 


FFFETGPNFVNQAGGQGHDLG * LQPPPP 
\ GSR I FPA * TSRVAG I TGLNHHPH PLGS 
IF 


6272 


20173 


A 


6326 


265 


13 


TASPIRI I ITSQDVLHS * AVPS LGLKTD 
AIPVPJ J N\QTTFSDTRPGV*YGQCSEIC 
GANHSFMPIVLELIPLKIFEIGPVFTL 


6273 


20174 


A 


6327 


32 


259 


CYYEHLCTSL>CVNWYFHFF*VYT*EWQW 
PKYLSADEWINKM* YIWTVEYYLTIKIS 
EVL/SATTWLGLENIMLSKRS 


6274 


20175 


A 


6328 - 


1 


228 


PGLNHCSHLS WDHRHAL.QCPANFG \ YFV 
ETRFTVL PGLVSNDP PALAS QSAE I TG V 
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ding to 
Inst amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K-Lysin^L^L^uc?^ 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valinc, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














SHCAQP*VCFHYCLTFFSLQ 














LP KKKKKKI S RGLFP FQL P KKKKDNE KS 
LTFSSETMPGRRQWSKIFKVLRGKKQPQ 
QPRIISLAKLFLKSEDFL/ETQKLREFV 
ASRPALQAMLKEVL*REGK*YRSLFKKG 


6276 


20177 


A 


6330 


349 


77 


HTTPKKNKHNy RKKI ICLDVLRYNKVDDN 
SKMSGLH*QCPSDECDAGVFMASHKQ/F 
DRHYCGKYCLTYCFNKPEDKAQHCGSLE 
PRSLR PAWAT 


6277 


20178 


A 


6331 


319 


46 


RPRRPETK*ALNK*KDI/PMCSWKQRHN 
TVKMLILSKLINRFNAITIKISTE\FHR 

LLYLISRPP ' 


6278 


20179 


A 


6332 


496 


309 


DRATAF * RQS FTLS L / RAGMQWHDLS S L. 
QPPPPGFK*FSCPSXjQSSWDYRCPS PRT 
G * F\ CI F\ VEMGFHHGGQAGLELLTSGD 
LPASAS QS AG I TGVS HCTW PN I L.NW K 


6279 


20180 


A 


6333 


3 


173 


HHHSRL/ IFFCSVLGTSSHYVA*AGLEL 
VF 


6280 


20181 


A 


6334 


339 


153 


FSPPPPFFKGFFFPPPPPPF/SF*PLRG 
VXjASFFFS 


6281 


20182 


A 


6335 


32 


331 


QLLNHSVIVFLLSFDHSYFLRNLSLFLA 
FKLIGVKNS*YIILSFRYLLHP*HLLNF 
F FF / CKRQGLPVLTRLVSNS WAQG I CQ P 
WPPKVRGLRAWSPGLSL 


6282 












DGVSLLLPRL\GGAISAHCNHDLPGSGD 
SPVSAS*VTQLQYYFCICLGLFTFTTRL 
SAYLGTGKLiTTALi PNTPGLLD S 


6283 


20184 


A 


6337 


302 


59 


KMLLNTI MCNRE IL / RKRKS PRVHQTSS 
VSYFKKLPQPPQHSAASTLISQQPSSSK 
QDPPAAK* L* FVEGRLI AFPSNNVF 


6284 


20185 


A 


6338 


197 


28 


AEKVRFSSAVTWHLGSHSL* *SKKNL/W 
LGMVDHNCNPSTLGGRGGRIA* AQAFET 


6285 


20186 


A 


6339 


46 


317 


EVPSSISSQISLFSGDFSSVPLEELSNL 
FFMFCPEHI IVTCERVTSNYS / YYKWKP 
EFS YP * I F/ CFKMSLTMLFGLSLNS WAQ 
VIIPPQPPK 


6286 


,20187 


A 


6340 


4 


276 


QTTLRGTTVQI TKNLSS ETMNAS RCEKK 

GVEI KTFTN/ KQKIKN/ FITS I TLV+ E I 
LKEVLQAEG 


6287 


20188 


A 


6341 


485 


2 


GVWEFLETESRSVAQAGVQWCDLGSLQA 
PP\PGFTPFSRIjSLLSSWDCGSPPPHPA 
NFFFVFFVETGFRHVSQDGLDLMVS IS* 
PREPPASASQSAGITGVSHRARPKHSKI 
LI AGSRRS IQMFPVKFFQKLAWENFQN 
TKDFLNPYSEILLHTSQIHSSC 


6288 


20189 


A 


6342 


2 


339 


EDLRVKLKKEGYSNISYIWWHQDPMLN 
SNGSVTWALLQAS * YLCI LQAS * ASSQ 
SWTS/ VTLAA VS FQR I SNQEH I P VYQQE 
ENQTDVWTLLNGSKDDFLIYDRCGCLVY 
HL 


6289 


20190 A 


6343 1 181 


12 


RWGLPSGWDYRSMPLHPVNF* I I / CMRQ 
DFTVLPRLILNSQPPVI LLRWPPKVLGL 
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peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
Inst amino 
acid 
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peptide 

sequence 


Amino acid sequence <A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QA 


6290 


20191 


A 


6344 


3 


295 


YWHLSCF* PLAITNKAAMNI RLLWTCAS 
IFPR*RSRKEMAGS*GS/SLFNFVRNCL 
TIFSHDCTTL/HS/HQQWGGV/PVLPFP 
HQHIjLi * S I FLVFD I L> I G I 


6291 


20192 


A 


6345 


334 


8 


KPSNPGGEGP P* KKGPKVFS PWGKKENF 
SPKKKKKI *CPQKPKGPPWGAPAFKKKK 
NF*NPRGSPQL*KKRGIIGPPTLIPPSS 
QRPPPKKKKKK/GPPKGALLAFGGIKFF 
FFFLEKNFIiFFPKGKTLIAPFFKGGLPL 
PG 


6292 


20193 


A 


6346 


68 


256 


WVA\ PVTHTCNPNTLGGRRGRIT * AQEF 
KTS LGN I VGPHLYKNLKKKKKKKGEKKP 
GGGFSPL 


6293 


20194 


A 


6347 


385 


60 


KKKKKNRTPYDAAI /PSLGTSSKEFKSG 
SSRDICTPMFIA/AI TKRWK * LKCPS MV 
NI/VKKMWFIHMMQY*SAKKKEIL.S*MH 
LKDVMLSERMLSQSQKKNFCMILLIWGI 


6294 


20195 


A 


6348 


234 


61 


CHRIEHIKMVNL/WLDWVHTCNPSTLG 
GQDTWIA*AQEFKMSLSNTAKPCLYKKY 
KN 


6295 


20196 


A 


6349 


2 


358 


FLSLGIHFNYCL.VTTESKLSKLRI/TYFF 
FFFLETEFPFCS PA* RPW\ PNLG*LKFP 
PPGSRETPALTLPRTWNNRRAPPHPANF 
GFX.RKNGV* FFGPGGFKTLDLGGPTRLW 
FQKGGNN 


6296 


20197 


A 


6350 


346 


27 


KMV*SLWKAVWQFL,IKLNIH* PYDSAVP 
LIG/I YPT*MTT* I YTKP * TQMLiAAVS F 
IMAKNTRNESNIFNWPIDKLWYIHIMEY 
* *AFKRN*LjLINTKT*MNLKC*LF 


6297 


20198 


A 


6351 


36 


158 


KNSVPSQE*LYYL\RQENRLNSGGIGCR 
EPIWCHYTPAWAT 


6298 


20199 


A 


6352 


2 


146 


RSSHLSLPSS WDYRRAPP /H/ LANCC I F 
CRDR / S FTML PS LVLNS * I QT I LL 


6299 


20200 


A 


6353 


74 


303 


EPSICGQIYLSFSIYKIQK*/WPGAVAQ 
ACKTSTLEG*GGWIT*AQFKTSL.GNMMK 
PCVFSKKKKKKKPHPGGGGRKK 


6300 


20201 


•A 


6354 


3 


210 


NNKKIWTLKNLKFFRDRVLLCFPDWSTA 
VHSRLTAAS\TI.GL.KQTSCLSLVSS*DH 
RCMPPCLAIiKRKKT 


6301 


20202 


A 


6355 


2 


627 


GRVGGFLVPMRDLAEASRGQQGQVKLGE 
MKPSVSCVCGGV*CISKFKLKN*Li\WPG 
TVAHTCS P STLGGQGGW I A* AQE FETS L 
DNMVK\ P CLjY 


6302 


20203 


A 


6356 


584 


166 


RQGHNVAQAGVQWRDLGSLQPPPPGFK* 
FFHLSLPSSWDYTRVPPHLANF\ * FLVE 
TGFHHVGQAGLELPTLSDLPAWASQSAG 
ITGVT/HQARLQLL*KELINPSTLGPTW 
ELLEAWGRCPSWMGDDLALNCFHHFSV 
S 


6303 


20204 


A 


6357 


448 


202 


FTTLVSECI*DIHDSWKDV* ISTLTGAW 
KTWLIPTIiVNGFEG\FKT*VEEVTADW 
EIARESELEVEPEDVTKLLKSHDKS 


6304 


20205 


A 


6358 


3 


237 


DRSSL*P*PPGLK*SYYLSLPSGWDHKR 
A* PHPANFCLFTF/ V* QRGLDLLA* HDP 
PASASQGN* ITGMSHRAGPPLTF 


6305 


20206 


A 


6359 1 49 
1 


242 


SSYLSL*P*PPGL.R*SSYLSLPSSWDHK 
HA* PHPANFCLFTF/ V*QRGLDLLASHD 
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NO: of 
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Predicted 

beginning 

nucleotide 
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correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G— Glycine, H=Histidine, ^Isoleucine 
K=Lysine, L=Leucine, M=IVlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y = Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PPASASQSTS ITGMSHRVWPPLTF 


6306 


20207 


A 


6360 


508 


223 


FVTQAGAR*LDLSSLQLPPPGFKQFSCL 
SIjPSGWYYRHVP/STPIVF/CIFSRDGF 
HHVGQAGLELLTT*STCLGLRSAGITGL 
SHRAQQERGFEP 


6307 


20208 


A 


6361 


2 


595 


AI WG*GTHFLIPWVQKPIIFDHRSQPH 
NVPVITGSKDLQNVNI IPCILFGPVTSQ 
LPRIFTRIGEDYDERVLPSITTEILKSV 

AT FGL I L DD VS LTHLT FRKE F TEAVE AK 
qvaqqdaera / rnslatagdglmelckxi 
EAAEDITYQIjSRSWNITNIiPAGQSVLLQ 
LPQ 


6308 


20209 


A 


6362 


1 


259 


LESFYFLLYYYK*SS*PLRLKRLILFIF 
ETECCPVA\QLECSGTISAHCHLCIiQGS 
GNSPAPAYRVVGI TDS FRI THCILTCI P 
SE 


6309 


20210 


A 


6363 


60 


221 


NTTKLGMVAHTCNPSILGARGGRIT* V* 
EFKTS LDNIVK \ P CTP P KKS LKI KLN 














RPRRPQTGVQWQNLGSLQPPHPGFKRFS 
CLDLPSSGDYRHTPPCPANFSGFS/HHV 
GLKLFDSSDLPWPPKFVGIARHESHRKC 
T*RSRRGRRIADPLER 














CINSKIMQSSRITTLiCKMVQPLj*RALWQ 
CLIKLNIPHPYDSAIPLLHFYQCK*RHM 
NPHN\TFTSMFITALFITAKN*KQSKCL 
SIGQWMKK+ *CIHTVQ+NSGI * *HE* IN 
SKIMQSSRITTL.CKMVQPL 


6312 


20213 


A 


6366 


2 


298 


PPRLAKVQKFLLFFYLVF*DRVTLCHPG 
WT AW * S QLT VTLAHRNLRL PSGWDYRH 
VPQRIaAD F VFL / CR /DRGFSMLiVGIjVSN 
SRPQEV*ATSPSQSLLS 


6313 


20214 


A 


6367 


67 


636 


QDNKKGIRYLFVYFRQSFALVAQAGVQW 
HDLGSLQPLiSP* FKQFSCLSIjPSS*DYR 
HAPPRPANFL / QFLVETE FHHVGQAGLE 
LLTSGD* PASTSQGAGITGVSHCTGPRI 
RFLKTFSHLEAPGGFPEKRQPLRSERLQ 
FRAFPQTAPCAHAYEjLGCVSHLEMGCRN 
idi pels fflcplqcliyrgks p 


6314 


20215 


A 


6368 


3 


251 


RSRHCTPA W APLS TQDHF YVF YFFLHFL 
IGSLFIFACFVSSYFIEFSYCL\YFFVD 
SFGFYRYTMK*NLNSNYFKLSAEILPQ 


6315 


20216 


A 


6369 


333 


193 


FKKLVLMAKAKYCKANFFFQGVPVASKN 

WDYRHEPPPRLANFCI /FLVETGFQHIG 
QAGLALLT /S / GDP PASASGALE * LGGG 
SCL 


6316 


20217 


A 


6370 


36 


369 


RPTRPEMRFHPVGQAG/PRSPGLK*SAC 
LSLSKCWDYRHEPP /'*PG*FSFFTYIYI 
YFFFLFFS FS FLEEMGFHHWKAGLELL 
C*NDPPASASYSDRI IGVSHQARPFFFF 
FF 


6317 


20218 


A 


6371 


2 


375 


DIQMIDRNIKN/CSTSQSIRKLQIKRIM 
*YHLTSVRMAITKKTTHKC*RCGKK\LH 
C* WQCKLVQPL * KTIQRFVQKTENNINI 
*PSNSKN/WKSAC*RDICTAMCITT*FI 
RAEHGGS \ PCKPSTLG 


6318 


20219 


A 


6372 


75. 


372 


NGNSQKKKKKKKKXGGPPLKKPSGGPNL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalnnine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














P PGGAKKS FPL KGGQKKP PGG FLEKN P P 
LGGAPL \ G PPP P PKKRPRGKTKP * GG * R 
GKNPAKPWLGKKI LKP 


6319 


20220 


A 


6373 


605 


350 


ISTGI CLGMF IAAL FTIAKL / WKAT * PS 
FNR*KKKMCYIYASYSAIKK\DAILSFA 
ITWMNLENIMLSEISQTHKDKCCMISLI 
CAI 


6320 


20221 


A 


6374 


167 


378 


KKSQGGTNIKPPAITKYLSQMGG*KKLL 
TGFLKKKS LLGGGAGVTILLT\NRTFST 
KINDPDGGGDHILSQ 


6321 


20222 


A 


6375 


994 


747 


VSLALSPRLECNGVISAHCNLRFPGSNN 
SPASAS* VAGAHHHTQLTFVFLVETGFH 
HVGQAGL E LL /NLVIRPPW P P KVLGLQA 


6322 


20223 


A 


.6376 


1956 


1592 


HEFEMESYSVTQAGVQ*HDLGLLQPLPP 
RFKQLSCLSLP/RYWDYCLAPPCLANFX 
* YLVKMGFRHVGQAGLKLLAS S DAP AS A 
FQSAGITGMSHRARPKSAFKIQAI IGEG 
GVSELHRKINI 


6323 


20224 


A 


6377 


94 


402 


KTQE IiQNK I SH CL I RRVDR KM / WLCT VA 
HTYYPS WLGGQDGR I A* GLEFE TS LSN I 
TRPRLYRKKKKKKNVFQAWPLWSNFLjRR 
LRREIRAEKSF 


6324 


20225 


A 


6378 


463 


146 


GKLGPPKGSIjKPPPPFFFFL*R/P/HPP 
P RGE T P F FF KKP KKS P RGGG AS P P FP P L 
GRVRPENFPPPRGQGSP*PKRGPTPAIiP 
PGGQKETLFPKKKKRKKKKKYMRLI 


6325 


20226 


A 


6379 


3 


207 


LEYQVAQTMCIISSRIQLVAEKASVPFV 
DKTVLS\TMGQL+EAKAEDGFFVPG*YG 
LAVSPLKFQSTRP 


6326 


20227 


A 


6380 


328 


70 


RPRRRLGLPRSWDYRHEPLHPDQLLVE I 
MSHYVAQAGLKLL \ I TGVSHHARPH/ YR 
F*bGHC*SLVRRPWTISYSFLCL.GFPfVlW 
EVSD 


6327 


20228 


A 


6381 


13 


77 


VI FLLQPPK*SSCFSLPNRQDYRRVPPY 
PANF* FS /CRHSLTMLPRLLF* TGLQAC 
ATI PS 


6328 


20229 


A 


6382 


342 


77 


KGQGFYPHL.FFFFFFF*KDSPSLPRVEC 
CGEISAPCNL*LFGSNDFP/ASHAIVGG 
KLPKQRKGAFFS * GGGFFFFF* M * KHSV 
LNSSRN 


6329 


20230 


A 


6383 


1 


208 


RGRVDKI FYLFFETDSQSVT*AGV* WLN 
PGSLQPLPPGFK* FS + PQVI \TCLDLPK 
CWYYRREPLCPVY 


6330 


20231 


A 


6384 


277 


50 


LVLKLGRCKTSSFI *NFKGPQIAKTILK 
KNKAGSLT \ LPNFKTNYKAT VI KTV* Y W 
HKH* YRHG F I SGLS I LFH 


6331 


20232 


A 


6385 


2 


354 


AAASTNFRDR/ Hj/LCHPGWSAVAQS *L 
TAVSTIiN*AQGIG\SSHLSLLNSIiDYRY 
APPLLAVLLCCPGWPQNSWPQVILLSQP 
PKALGYRCEPPCPAPLLLNFFFFFKNRL 
WFCCPGWSA 


6332 


20233 


A 


6386 


2 


188 


FFKYICY/CINMYMCFYIFINIYSLIYL 
*IHIYKYIFAYIFINTYL*IYICLYIYK 
YI FIYYR* * IYICLYIYKYIFI YYR 


6333 


20234 


A 


6387 


2 


241 


WATKCR * G* RRTGTLTGG* WDCKASGQL 
♦KTFQQFLISITY/PYGPAISLLEIDPS 
ELKTYVHTKTRT * LF I AASF I I TKN 


6334 


20235 . 


A 


6388 


142 


429 


YCVCIYIYIFLYLFIFFCTFFTYSSLHL 
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NO: of 
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NO: of 
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NO: in 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G— Glycine, 11= Ilistidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophnn, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion . . 














LIFSFFFFLERGVCFSPRGGGGGKYLGL 
WRFFPPE*IKKNVYIYA\QQYYSAIKQKT 
EILSFAATWMELEAIMLSEIRQAQNDKH 
YMISLICGIKR 


6335 


20236 


A 


6389 


468 




VHRRTPPCPIIFFFF/CKQ*VLiPMLPKL 
VSNS W PQATLL P W PS LTVE FQALTPGWK 
NFKKMFLMVMSWS I KKR 


6336 


20237 


A 


6390 


616 


1084 


LCLCQYAGISVQCPDCVFQFFHNVDSTY 
EDYLRKLKTSLS F F WRRS FALVAQAG VQ 
WCDLGSLQAIiPPGFK*FSCLiGLLSSWDY 
RH P P WHP ANFVLIj VEMG F I HVG QAG IjEL 
LTSGDPPASASQSAWITGMSHRTQQE\Y 
FLNLKFSTSPGELLjCLK 


6337 


20238 


A 


6391 


415 


200 


SPPPRFFFFCPPKKNFFFPPPRSKNFFF 
*RGPPFFFFFKKTPFF/YFFFKPPFPPR 
/RFKNPPKIFFFFNRGPKNPGGGGGKKK 
PPGPGGFSPLWGPPPPFKKPPPPPKKNP 
PLFGKKKKKKKKKKRAAPR * NPRARG VF 
FFPPPPLGFWAPG 


6338 


20239 


A 


6392 


3 




KLiKPSSH\DY*VGETY'GKDITSRGKDKP 
IAVCKTFSPLS 


6339 


20240 


A 


6393 


3 


204 


DRVSLCRPGWSAVA*S*LTATSNS* \VK 
QSYKLSIP*GSWDYSCVP\PPLG*LVIL 
LRQPPKVLGLQA 


6340 


20241 


A 


6394 


501 


187 


AGVKVfYEHGLWQPPPPGLKRSSHLSLPS 
S*DHRHEYPCPANF*KIFF\VETRSHYV 
AQTSLE FLDSSNPPTS ASQNAG I \ * GMS 
HCAQPMQTFSLVKIGTNFLIF 


6341 


20242 


A 


6395 


3 


393 


HACGAI MNSKPHAVTQ YHTPliliM *SVLI 
TT\VLLLLFVTILNDGITILLTDRNRNT 
TLFYPSGGGDPII>YQHLF*FSGHAGG* I 
L/LL.PGFGTISHIVTYYSGKKEPFGYIG 
MV*AMISIGLLGIIV*AHHI 


6342 


20243 


A 


6396 


101 


135 


FDSPTPGIVPGTWQTIRSLNVYLTRLTT 
S FSLM P S * FLF I F FCRDG / SLT I LPRL.V 
SNSWPQAIL.SPQPHKMLGLQA 


6343 


20244 


A 


6397 


395 


187 


SSYLSLLGIWSYRHVPPHPANFLFFCRD 
GWYFIFFC\KMGAMLPRLVLNSWAQVI 
LiLL * PS KVLGLQA 


6344 


20245 


A 


6398 


35 


334 


QNETLSKTKKIKNKKKKKKKKRGGPLKK 
PLKGPKLTRACKKKPFSLKG/WRKKNNL 
GIFEKKPYFGGGKKWNKPPTKIKSFKEK 
KKF*REKGEKQPQNPGC 


6345 


20246 


A 


6399 


464 


146 


H I QI RE FQRTPTRMS PKTSTLSH FKI KK 
QLVMNKRS FIKQWI S * QKPCQMARRD *H 
N I FTVQKGTNCQI R \ I LCPAKLS FKNER 
E I KPFLGKYKLRE F I TA* FI FKG 














RVQ\QSACLGLPSSWDHRHIPSC1ATF* 
♦KRGFAPLPRLLSNFWAQAILPPWPPDV 
\LGLQA 


6347 


20248 


A 


6401 


522 


46 


FFPPFLSFFFFSKIPKQSLGLVFYF*NK 
VWVFCPGWRAIR* FPFPA/APPLGSKNS 
/ S CLNLPS TWGYR I SGMVPPKAH * I P PE 
IFFFFFF/ CRQSIiALS PRLi/WQWQVQA 
I LLPQPPE * LRLQVPGRW 


6348 


20249 


A 


6402 


103 


261 


NIPFTCLI KKKKKKKKKKKKKKKKKKKK 
KKK/ IK* KKKKNVWGGVVKKKLGGD 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 
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NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
Inst amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6349 


20250 


A 


6403 


146 


384 


GFLPSLSGHKELSLFFLFETESHPVAQA 
\DLGSLKPLPPGFK*FSC\SLPRSWDYR 
RVPPCSVNFCVSPY*PG*SQTPDLR 


6350 


20251 


A 


6404 


3 


395 


ETESHSVAQAGVQWRNPGPL.QPPPPKPK 
RFPYLSLPSSWDHRHMPPCPANSAFSVE 
TGL/1jHAGQAGLQLIjTPGDPPT*TSQSA 
RTTG ASHRAR PS LKYFL I S F VI SSLLHG 
LFRSVLGSMYLRISQVFFCC 


6351 


20252 ■ 


A 


6405 


30 


205 


CRLTRPAEFRTRKTHT\REKPYECKQCG 
KFFSCKSNIjIVHEKTHKVETMG IQ* VMW 
LFL 


6352 


20253 


A 


6406 


3 


444 


ARKITGY*ISKTIVFLYT/DELAEKEIK 
KAIPFTIVPNK*DN*ELNLTKQVKDPHK 
EDYKILMKETEQDTK/WKDIPSSWIRII 
N I VKVAVLS KAI YRFSAI C I K I RVTFKT 
ECHWKKKIWIGKVAHACNPTTLENQGRR 
SLEARSLRLS 


6353 


20254 


A 


6407 


480 


307 


QHDSPSAKKPPQKPDKS/HPSSRS/HPF 
PPPPPC*TPPPEF*YPPPKGEWGARPPP 
QG 


6354 


20255 


A 


6408 


923 


833 


KAMKHFIilKTRAFFRHVTAHLIDQSIVK 
TELLYVTS F I AI FALIjWWYGKE PML/ WP 
GAAAHTCNPSTLGD*GGRTA*AQEFKTS 
1 J GS*NKTKP*CLSAV 


6355 


20256 


A 


6409 


382 


152 


ECSGTLLAHCNLCLLCSSDFP/SLFHNV 
SLSSNSSS*VAEIIGLCHHVWLIFVSAK 
LFLGRRVRNLFSPTPKNMFSK 


6356 


20257 


A 


6410 


553 


173 


CLL.I IKNTLPAKKKKKKKKKKKKKKKKK 
AR/WPP*K*RAP*NPVKTPGGSF 


6357 


20258 


A 


6411 


38 


562 


FLRQG L / DS AAQ AG VQW CNHS S QRAP P F 
GLGRSSH\PASLAGWSAVA*SRLAATST 
FRVQV I LLSQPPE * LGLQS CS I TQVGVQ 
QWRDLGSLQPPLPGFKRLS CLS LPSN* D 
YRRLPPRPDSS ITFSRKFFQSPQARNLP 
PTPELKPQQPVPAPSPSELHLSWSSGNK 
CHFLWFL.CP 


6358 


20259 


A 


6412 


414 


144 


FFFETESHSAAKLEC\ISAHCNLHFPGS 
SKS PASAS * VAGTTSVCYHAQLI FVFLA 
DTGFYHVGQDGPDLMFCPPRPPXVLGLP 
RLPLICF 


6359 


20260 


A 


6413 


3 


455 


AEAGGLLEPRSSRLQ/ CSQP * SHPCTPA 
WAT 


6360 


20261 


A 


6414 


462 


162 


TQGPRLFPHPFGAPGGPVPLGWEVRPPP 
APPGKPPFPPKPQ\NSPPWGGQPSIPPP 
LEG*ARKSP*PPKGL.VPVTPKGPPPPHP 
GQENPTPFPKKKKKKVN 


6361 


20262 


A 


6415 


499 


0 


HFTS KHHFGFEAA\A* YWHFVDW* LFL 
YVSIY**GSH 


6362 


20263 


A 


6416 


187 


236 


MCHINGIMQFFFFFFFENEVFFCPPARI 
GN * R / WPNLG * LKP P FPGLTLFS CLTP P 
RSKNKRPPPPARVIFLFFNKKGVFPLLP 
GRV* IPHTPEPLATTNLLSVTIDLLILN 
VS YKWDHAVFFFFFF 


6363 


20264 


A 


6417 


158 


412 


SSDPSRRSCLFSF/CFFFLEREFFFAPR " 
VEWRGPWFCLLTPPPPGKKHFFAPSLRG 
GGDYGHLPHGPVI FGTLRKAGFS PCGPG 
GS * PP P PGKKHFFAPS LRGGGDYGHL PH 
GPVI FGTLRKAGFSPCGPGGS 
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SEQ ID 
NO: of 
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SEQ ID 
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M 
eth 
od 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glntamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=IMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W—Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6364 


20265 


A 


6418 


457 


11 


HTGQS PY RCS E CQK / AFTQKS AI TNHQK 
P QQ KGR KS MQ * LR I * ES FFD ** S HQLRHN 
RSSMNTMIVMGPFGTLVGEKPYRCNEYF 
KPSFLKISLVYRRDLTRRIIGQAHWPMP 
V I SAL WE AE AG R S LE VGS SR PAW PT 


6365 


20266 


A 


6419 


303 


376 


SALSLILGITFFF* IFLETGSHFIAQAG 
VQW/PIHTCDHSILQPQTPGLKPSSCLS 
LLSSWDYRHVPPVPRTSACFECTILRSG 
SVAHTVLWKAKAGGSIiEVRSLKIiACATQ 
GAPCLTKKKCDPQD 


6366 


20267 


A 


6420 


463 


6 


FPPNPHKKQNQQTP\PPKKAPQQQKYTA 
DP/QDP PGND P KAQAYKKS PAATYPAAS 
SSPSSPSS/RPPSTAAHVLRVPL*GSPP 

HHGLQSLSAS PPG I S * E PQLAPRNS PHW 
TRDL 














RGFAVIiFSAVEFPSSSKSHLDLETSPKA 
YKHGAPCSAPGSLQASLPKGRNQRNPFS 
NFFVLF* DKI SLCCQGWSAVIQS * LTVA 
STSPGSDD P PTTP P P PHAPDTWDHRHVP 
PHPAKFYI FYRDR/ S FTMLPRLVSNSWA 
QTVCLSSLFMLLPSLLWSLSP 


6368 


20269 


A 


6422 


11 


242 


FEI IFSAVEFGWSDHGSLQTQPPG\SSD 
P PTLD S * VAGT TG LHHHAQLM F * L F VE I 
GSPGLKWSSVSASESEPLYPA 














bL.yijKfWNt.S ILjQKSCjIRWLiTPVI PALR 
EA/EEAGG*LEPRSSRPAWAI 














IVM*AEGCYLSGIWRTRLKMNPPQ*RKE 
RQIQVKSTRSDIFFFYFLFL 














GSSDSPASASRVAGITGVHHHAQLiIFLF 
LVELEFHHVGQDGLELLT\*VIHPPWPP 
KVLGLQA 


6372 


20273 


A 


6426 


276 


437 


CFHPPEPWDQQWGPPPLGNF* IFWKKRG 
PPFVPQIILNPWPQKNFFAWPPKTLiG 


6373 


20274 


A 


6427 


445 


240 


R PRRADHL / RSGV * DQPGQHGETLRLRK 
RTQMLGSWACSPGF 


6374 


20275 


A 


6428 


114 


450 


LLLLLLFESESCSVAQA/ PSA VL/ WILL 
GSL*PLLPGFKQFLSLSLLSIWEYRGLP 
PYPANFFKFLVEMGFHHVCQAGLELLTS 
GDLLASASQHSG I TDVSHRAQP VT I FLK 
GLS 


6375 


20276 


A 


6429 


380 


224 


LRHCLFTQAGIQRQEHGSL+P+SPGLKQ 


6376 


20277 


A 


6430 


509 


228 


PHNKGRVS PP PPQNFFS P PSRNLGGG WL 
PKIPPPKKRVFSKKPPRGKKK/ ppp*kk 
KKNFSPPPFFGPPPKIIKKPPPPFFFFF 
FFFFFFFFFFFF 


6377 


20278 


A 


6431 


3 


117 


HS WES I * KRGACFNI F IAAQFTIADI WN 
*FMCLSGDE*IKKLGLIYT/MDCYWGVK 
KKETMPLATTWIELKVNILSEIGQA*KD 
KYHVF*MNR 


6378 


20279 


A 


6432 


417 


271 


lskktkfrnspqpplpcsfl*ndrkWp 

RVWHTCNPSTLGGRGRWIT 


6379 


20280. 


A 


6433. 


385 


145 


TPGIKQSSCLHLPKCWDFSCEPPHLANL 
NF/CKR*YIPIVPNSKV*TFSQEVFSYT 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=]Ylethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WPHSSSFTGKEVI ILFPKECPGMFK 


6380 


20281 


A 


6434 


65 


383 


LFPNLLKKKKKKKKKKKKKKKKGGPI KK 
KP*GDKI KTRQKKKNFFPKRGGKKNQPG 
KFKKKNNFWGGKKME KP P \ KKKKTCKE K 
KKI * KKKGEKKPQI APEKKKGS 


6381 


20282 


A 


6435 


3 


454 


EIQRTPVRYYTK*TLPRHIVTRLSKSTL 
K/EKKKKKKFLRAPTEKGQIPPKKIP/I 
KLTVNF* KKS * RR E K WG \ P I FN I LKKKK 
F* PKI FYPAKLTFI REGKKKFFPKKQGL 
GKFVTTRAPLKKI LTGVLNLKMKKGYLF 
PQNTLKYIAHNHP 


6382 


20283 


A 


6436 


424 


410 


NFFFPPQGGGGGGFFF*DKPPPGNPSSL 
FFFFLGGGKKRGPPPPPKG/QTPPPGGL 
F*KWLGGKKETPKKFSPPPPFFGGGGKI 
FF*KPPPPPGFFF*KSPPGGGGPPSVKK 
KKEKKHKKKKKKKKK* R 


6383 


20284 


A 


.6437 


3 


214 


E I DC CAAVWKQ P KC PS ADE * RKKMWYL Y 
TMEYCSATNKNEIQSF/VNNWKEVEIIM 
LS\KKSQAEKEKHGIF 


6384 


20285 


A 


6438 


83 


420 


QS FP * I S CRKYS / WLGAAAHACDANTLG 
GRGGRAAWGQKFKTS LGNLEPGPKTADP 
RTSHGCELRRSSYTYALTHGFSSTPETA 
RPAPLLPPPPPPTQREDYKEEDLYDDLL 
NK 


6385 


20286 


A 


6439 


163 


289 


KMYCYYYY/ C*DRVLLMLPRLVSNLWAQ 
EILPPWPPKVLRLQA 


6386 


202S7 


A 


6440 


459 


431 


DKRIRTR I TPLAIRKRQI KTTV/ RKQLQ 
SHYMWVKETTNNK/CGIFIHSWWECKM 
VQLLWKTVSQVFKRLDLPYNLATLLI FI 
CQLQ I KTLSS PKYLHMKVHS 1 1 1 KIRKK 
Y I KKNLKCPSTDEW 1 1 KI * C* SIHTMEN 
ERA*K 


6387 


20288 


A 


6441 


478 


36 


VFGCRVILVEKLFLCEPLRVTFLRLGKQ 
LEKSL/WNFHDLELNISLHKNQPNRCVH 
QWRIG*RKRS*CSE**R/TMQFINSNLT 
L IN* I PSCHTAMDI I I I AQH / I THTVEY 
YS AI KKNE I MCF PATW VELEAI ILGEIT 
QKQSQIWRSPP 


6388 


20289 


A 


6442 


464 


206 


LVQRSG I HLP YS S TVPL * GL / HQKNE AD 
VYRNTRMFTAALFLIVQNWK*HNYLQTG 
EKINKLPYIHTMEYFSVIKSNQHSIT 


6389 


20290 


A 


6443 


414 


216 


CYIYTCPLTQQSRSWVAT*ASSKEEWIN 
JCLWHIHTMD* Y/ SSNKKNKLQINTTWTN 
LKSIMLTEISLT 


6390 


20291 


A 


6444 


7 


411 


S FYCLHS FAFSRVSYCWNIQYIAFLiDWL 
LSRSNSLHVFSW\WD/CKMV* PL/YETV 
WQ FLTKLNI LLLHDSALTLLG I Y * K/DM 
KSYVHTKMCTQIFIAALFIIAKTWKNL* 
CSSVGEWRNKLWYLQTMEYSSSVLKK\N 
ELSSHEKTWRELLRERSQSKKAIYCIFQ 
QYDTLEKAKLWRQ 


6391 


20292 


A 


6445 


257 


419 


KSLPFFF/CW* ETGSHS VTQVR VQ WCNH 
IALQPQTPGLiKQPSCLSLQGSWDYKFHF 
TQACPY*RP 


6392 


20293 


A 


6446 


20 


417 


HQYCFASFFFFNNFFKTESHSVAEAGAQ 
WHDLGSLKPLPLRFK* FSCLSLLSSWDY 
RCMPPKLDNF\*FLVETGFHHVGQAGLE 
LLTSDDPPALAS * SAGITGVSHRARPVE 
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Predict- 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=Phenytalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K = Lysine, L = Leucine, M =: iVIetliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














LFLLRRQGLRLLQNIGIGRQ 














VIGCQFL / RKTKRYLPYDLP I VLLC I YP 
R/EKKNLCRH*TCTSMFMTALFIITQH* 
KYPKCHS IGEWLNI LWHIHTME YYAEMK 
RNKLLIHEKTYMDLKDILFSGGQRVYLK 
SSILSIKCIRRQE 


6394 


20295 


A 


6448 


46 


332 


LGEHLRQGPIILKVCQVILTHQQYFESQ 
WSIMALS NRKYNANCRYNLKCPNSYI KK 
EML/WPGAAAHACNLSTLGGQGRHIA*A 
QEFKTIAKPCLY 


6395 


20296 


A 


6449 


2 


381 


LCSLVFTH ICS I FLFFFF FFKDRALLP / 
H/PCWRAGP* S YLTAASNSG \ VKQSSNL 
SRGKPLCRANFGSFN CFWFWKKGGPTMW 
PRLV*N\ILLPQPPKVRGL*AAAHAGGQ 
GHDHTSLRPQTPGLKQSSNLSRGKPLCR 
ANFGS FW FIjVXjEKRGS HHVAQAS LKQS S 
CLSLPKLGDYRHEPSHRANMFNFRSLPG 
liRKADCLYPY 














LQNAVWQFLKNCKI VPHY* A/ I PHLG IH 
PNALKTATQRN I CAPM FI AALCATAKRQ 
CK/PKCPSIDKT+YTHTMKYDSA*KKCW 
LLARVWINWSPVHCWWECEMVQLLQNAV 
WQFLKNCKIVPHY 


6397 


20298 


A 


6451 


314 


72 


MLPLTNGSCRTKVSKDIEGLSNSINPLP 
LIGIYRTPFPKTAGTPFFFFFIjV /pngt 
FTKIDH I LGHKTRLN I LKM 1*1 LKYI P 


6398 


20299 


A 


6452 


54 


350 


LRSFCSALFCFETVIiLiCHLGWSAVA*PR 
LPAGSSNPPPS/CLPIPSSWDHRLVPPH 
QVFLEAFFFFFFF/CRKGVFFFLPGGV* 


6399 


20300 


A 


6453 


368 


172 


WLTSVI \QRFGRPRREGHLRPGA*DQPG 
LR * STHLRLPKYWD YRRES QC PARPV I L 
KEANMFHLG 


6400 


20301 


A 


6454 


1 


365 


NILKLNNKLbNNL\VKEEVAKEIKK*LV 

IALSA*RIVSNQCSK*ASTLRNHKKKSN 
LNPRLI*QQKSITLKREKSMKPKAGSLI 
RSIKLIIRLT 


6401 


20302 


A 


6455 


364 


1 


PSTRSCSVWPRLGGVQRQDKGSLPPLSP 
RLKQFSHLSLPSS*NHRGMPPCPANLLF 
KFYLFIYL/FCLFVEMGSHHVSQAGLKF 
LSSRDPSALVSQSVATTGMSRRAWQDI I 
LQQKQAATAA 


6402 


20303 


A 


6456 


91 


300 


GHARHFGRPRWGDCLGPEVRDQPGQHGE 
TLTLQK\ KNKN* PAWWHTPAVP ATQE AG 
ITGVSHRARPCPPN 


6403 


20304 


A 


6457 


2 


195 


RPSCLSLPSSWDYRHEPPRLAIFFFF/C 
FWNFKKRGGFPLWARWVSNS * PRGIGLtL 
WPPKGWGLRA 


6404 


20305 


A 


6458 


522 


202 


THNWRPSLQI FTAALFLLA.KKWKQPKCP 
STDEMIHKMWYVHKMKY*SVIKRNEVLI 
HSA\KWMNFENMLNEEACHERPHIV* FH 
LYEMSRVGKTGETKSRLWV 


6405 


20306 


A 


6459 


377 


161 


EMNIKCPSTDK*VKKRY/LYIHTMEHCS 
ALKGNEVL I I HAMTWMNLAN IMLSEKSQ 
SQNTTYSMITWKCLE 


6406 


20307. 


A 


6460 


92 


484 


LLQACFLRTVLIDYKMWAEKGMVGMCVL 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophnn, Y=Tyrosine, 
X=Unkno"vvn, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MRNMTVI 1 1 FRDRVLL* HPGWHAWRSQ 
LTSA/VKRSSCLKPPKC* DYRCE ILCQV 
E I AFTHRS LFNS FLKRSMRVI LLNY WKD 
PSLGFCSYDSTGMALLRNFS 


6407 


20308 


A 


6461 


483 


192 


MGLMLiILRANFQSTKKKMICLLKKSLTF 
LNC I VL>SNWP CT FNMYI FVCVFAVY KL.H 
VWAE \ PVAHACNPST VGGQGERTA * AQE 
FETSLGNIIRPCLY 


6408 


20309 


A 


6462 


451 


98 


KQVWQLL* KQWQLLESLNMKLLYDLAV 
LNLGAYPREEKTHVHTKTCTQMFTAAPF 
I \ I KK*KKNFS IDMKYEERDKMWTIHVT 
QHYLI IK* S ID \ YAPTWMNLENIMLSED 
SQTQKTTHCI IS FI 


6409 


20310 


A 


6463 


395 


67 


FLRFWCTHHWSS/DILYPVCSLLSLNRS 
HS FPRVHKS PLYHSYALCPHSLAPTYE * 
AHKMF/RLVNFLFLVETGFRHAAQAS PS 
LPSASNPPFSASQSAGTTDASHRAQTK 


6410 


20311 


A 


6464 


485 


220 


FFFETWSHSVTQARVQWCNLGSIAKPliP 
P\GSSDPPTSAS*VAGIAGMSHAPTHEI 
PSIQARQQHLQGEKKIELLSLPTSASQV 
LRLliA 


6411 


20312 


A 


6465 


1 


371 


ISIPPKKYLRINLiTTYVQGLVEQNYKTD 
KRNQGRSSKWRDGPCSRIGKLNIRMSVL> 
SNLIYSFNVILVIVPASYFVSIT/RLIL 
KFT*KCSRPRIVPLFHLLLPLLVFFVCS 
FILCRPGASLVK 


6412 


20313 


A 


6466 


489 


175 


YKIIEI SRCI K/GLL* LWRTTHLFSHPH 
WRRGMAYEKNADFRLGTVAHAFNPSTLG 
G*GR/WNQGFKTSLANVVKPCL,Y 


6413 


20314 


A 


6467 


2 


367 


FGTETLGQAGIKGDPGEKSHWGEGLHQL 
REALKHjAERVLILETMIGLYGS*LTEIj 
PAAAGSAGAPGQTGHPPGSHHLiA\EPEL 
GSGAGPAGTGTPSLLRGKRGGHATNYRI 
VAPRSRDERG 


6414 


20315 


A 


6468 


472 


260 


TTLGGAGNSGYRHIQPHVANYLFICRDR 
/SL.ILLPRLILNS*L*AILL,SRPPKVL,V 
LOT 


6415 


20316 


A 


6469 


88 


435 


RG VLLHMLS CL PLCKTCFCS S FTFKHH * 
TS FS FS FFFFLLGNKVS FYCQAEGQGPN 
LG*LKPLPPGLKQFSCFT/LPEKWE*QT 
EPPGPVKRHFGLSYFPLTRG 


6416 


20317 


A 


6470 


34 


106 


EFAIPFLTIYPIELNTHVliTKICI*MST 
AALSTR*KSPTCPSM\INKMWYIHFMEY 
YS AI SKGRNKANVHQWIH KM * Y I HTVE Y 
YSA I KSNEALT YATTW/MNLENI MLS ER 
SQS*KICI 


6417 


20318 


A 


6471 


390 


92 


DFL I DAINHNLKVF FYS *FI SMC/HP YN 
*SLSFSNIQKSLIKTMINVSLSFKY*YL 
IYFS * KKTF*N* RQGLTMLPRLLSNS WA 
QAILLPWPPKVLELQV 




20319 








167 


HIHI KTCA* I FSAA* FI I AKK* KPKCPT 
ADEW INKM W YVHT I EYYSAI KK \NKLL I 
PWMNLENIMLSTGNQTQKATYCMIPFI 


6419 


20320 


A 


6473 


3 


244 


TTILLGTTYDALFPELYTTYFVTKTLLL 
TSLFL* \ IRTAYPRLRYDQLIHLL* KNF 
LPLTLALLI * HVS IPITISSI PPQT 


6420 


20321 


A 


6474 


12 


224 


IASGRPFFCYDVCV/ VKRLCRHS IVI IM' 
* KKKKKKKKKKKKKKKKKKKKKKKKKGR 
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nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalaiiine, 

K^L-ysine, L^Eeucine, M == IVIethionine 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GGALKKKPGGEPLFPH 


6421 


20322 


A 


6475 


417 


155 


FQS FTENKMLS S S I TCSS E I FGERKS QL 
M*RPSLL/SY*EKLPLIASTTVISQRPS 
AL * VQPL / PTAKNL * LTAGS KGHYYFLN- 
SKVFSN 


6422 


20323 


A 


6476 


] 


131 


RTR P PTRTRTRGR WLTPVI PALREA /EE 
AGG* LEPRSSRPAWAI 


6423 


20324 


A 


6477 


830 


513 


WNSVKESLPADTWFLPFLGPLIDIFLLL 
I FAL CLVNLLVKF VSSRLQ * FHVKMMLA 
QGFQP I PS SDPEDKDILPLSLLEQVS \G 
ILPLQC*AGSMPITSAGSSYRR 


6424 


20325 


A 


6478 


399 


196 


SKQSSHLS LLSR*KYRCAPPWWAFF/SF 
FLINI FCRDE/SLTMLPRLLLNS * PQVI 
LLPWPPKVFGLQA 


6425 


20326 


A 


6479 


1 


411 


NTGIMPANTVS ILLIjIDQEVLVTVKTCY 
LRNTFCGFSDGSGHSV\QTFWIGSTILD 
AIKSL/R/DSREDGKISTLIGIRKK* IP 
ILiMEDLVGFKTSQEEGTVDGLNLA*ELQ 
VEGPRDETELL* PQDQTCMDEKFLLMDE 


6426 


20327 


A 


6480 


381 


162 


AGFTHHNFFFLF*DMVLLCHPGYSAWQ 
SWLTVASNSW\VK*SSHLSFPKH*DYLQ 
AVHFRMASNSPGLYIiLYQ 


6427 


20328 


A 


6481 


379 


85 


SILFFLFHPINFFFFFF*AWIFI IPPPH 
\RAFKWRWGFFLADSVWF I KRLPQYY 
FFFFFPGFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFLSVC 
GG 


6428 


20329 


A 


6482 


355 


117 


PKVKIFPPPPLPRKWDPRGPP*SPGTNF 
SIFFLVEKDFPHVAQAGLKLLDSSSTPS 
VAS\QSV* ITGMSPHAWLQVLLSL 


6429 


20330 


A 


6483 






TAFHLLDI YHKS * KIMFTNQPLYKCPLF 
IIAKN*K*LKCPRRADKQIVA/WCIHTI 
EYYSARKRNVLLKYTRLWMNLKS IMLCK 
RSQSQDVTYYVIPFICHS 


6430 


20331 


A 








NGFALVAQAGV /QWCDLGSLQRLPPIFK 
QVSCVTLLNS*DYRSMPPHPANFCIFSR 
DGV*SYWSEWSLTPDLGLPKCCDYMREP 
PCPANAKSFTVEETHVTYGAYETMLLSD 
HDESG 


6431 


20332 


A 


6485 


408 


146 


LKTETQTETCPPMFIAALFTIjV/RRWNQ 
pqclstdewisqi*yicimniksdikrn 
E alkhavi CMNLKNMVPFIS ptekgkkk 
EKKK 


6432 


20333 


A 


6486 


243 


11 


GMSQYQEYSKSNFFLSNRVLLCHPGWSA 
VAH/SR L TE ALNR FK * S S CLG L PKL WD Y 
RRVPLLLAQNATR * NSL* SCPA 


6433 


20334 


A 


6487 


403 


161 


DMVSPRWPVWSQNS \SLK*SAYLSL,PRC 
W D YRRE P S CQAS RNTS I GRGNR KNQ WNT 
CMYLLF I L I PLKKYMCNGRVMEYFKT 


6434 


20335 


A 


6488 


292 


24 


DWRGRMAI PLKKGSKC * QGCRNTGLLRY 
+ W /MECEVAQPLQEMV* WFLQRLNI KLS 
YDPDF/ PLLGI YPKE * KAGT* TDI CMPM 
LVATLFSG 


6435 


20336 


A 


6489 


3 


203 


SSWDCRCIPPCLANFLICCRDR/SLTML 
LRLVLNS * TQAVLLPWPPKGMPGLLFVF 
YGGIVLSYVNIS 


6436 

- 


20337 


A 


6490 


413 


30 1 RGP/YQPGFL,GFSPPNGLGFG*KVFLPL 
1 PPPPRKGPGKKGSPRARGFPGQAPPKKE 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vnline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NQRGGGFNSRGRLAKKKGFWFG*GPAKG 
KKFFFFLGAGVWGLKVLI P I KKKKGF P P 
PPLFFFFFFLQTIHPIRG 


6437 


20338 


A 


6491 


101 


335 


SSSFLGFPKYWDYRREPL/WPS/L.WQFF 
RLNIHLLYNPS I P I LAVYPREMKTCDLI 
KSCMRMFTVAVF\* ITQTWKQPNVF 


6438 


20339 


A 


6492 


532 


127 


STVFRARQLPCLHEPGRQSSP\PVPTSI 
LYT\PLPFPSSRYQQNGPCLSSPKLSPS 
PELGTQSFPVHKSFSPCLPSPKPLTVPP 
ALPSTGPSQSL*QKHSLASLLHTHSHTP 
PAP S CNP E S WG LDATAAI S TC F YG 


6439 


20340 


A 


6493 


399 


1 


LCPAPRGGVYRGR/ ESSLS WDGLHSVQA 
SWPLCLPTPASAMVDAPPPAXSLPSRSS 
XLDCCISSEQGSVGMGPAEPGAG*NLIjV 
CCLLRPLEKCSIWAGVSRFSWYSLSQLP 
IiARKGKS PNPLS FPNEKMPRPVS 


6440 


20341 


A 


6494 


319 


75 


KKIWGPLKKKFGLQTRRPKKWFFKRPPP 
FFFFFFFFFFF/HVAI FLL FCRDRG \ LA 
MLPRLILNSWP*RILQFWLPKVPGLQA 


6441 


20342 


A 


6495 


333 


190 


LFFFFFFFFFFFFFFFFFFF+GFFFFIF 
YS FRDG VLLCHPG WSAAMP S 


6442 


20343 


A 


6496 


1 


299 


LPRLEYSGTITASNSWAPAI1.PPQLIK/ 
YNFFCCC*GRVLTVX,PRLVLNSRPQIVX, 
PPPPPKSAEITGVNHHVWPKKI ISNEKT 
THCSCSALLAKQQI SS 


6443 


20344 


A 


6497 


51 


266 


PVNVILPSSWNLRHSPPCPARFFVL.FCL 
LFFGKN/RGLAMLLRLVSNSWP*VILPP 
QP PE * LGLQVQATMLSS 


6444 


20345 


A 


6498 


366 


78 


IiLAHCHS SNNKS LLI L.DNFNT YHYGPSN 
TVTS*F\*MSSLMILLSSP/C*GTSWSY 
LiF YALKKI FCKD/ RGLAVLPSL V*NSWP 
QWLLSWPPKELGLQV 


6445 


20346 


A 


6499 


37 


396 


FPRPTRPKSSSPAKATQQNTISFKKKKK 
S KQKKGNTVI I IRVKNQRKRKQKN * KLK 
Q*N*EQ*KINKISKPLAKLI/KKKRVKT 
* IANIRNERSSITT*STDIKRLIWKYY* 
QL 


6446 


20347 


A 


6500 


706 


394 


VPLLLPRLAVQWHNLGSLQPLPPGFKRF 
SCLSLLSS* DHRHTPP CPA/NS FVFLVE 
TRLHHAGQVCLKLLTSG/DTPSSASQSA 
GIKGMSHRTQPIEILFYSVFF 


6447 


20348 


A 


6501 


403 


216 


ILNVIjSFK/NMLY*HYTTVLRTVRPL,SP 
R* ATVTCTYTSGWPEAAEDPQKMTFPHC 
□LFLPHPN 


6448 


20349 


A 


6502 


377 


116 


LLPY*EN*FL,MYLPPICCLPFDFGNSGF 
CHQKKNFFFFFKTKSLSVA\RLECGGAI 
PAHCKLCLLGSSDSMKNFLNEAEFPNIiA 
FNGL 


6449 


20350 


A 


6503 


3 


401 


CVQHLRASWSSRWAWAWRAPHSEQPAGP 
AAPGNEGLS IRASGCGGCTGYPSSAS PP 
ALCSIS\PRP*LPSRRAGLATCSPPCLS 
LPP PPWAP VRPE P PRRVPPPAP WRP VPS 
TTQGLRSASARHRTGRQLHLQ 


6450 

... 


20351 


A 


6504 


406 


55 


wpkgftpprfprkegsp^ppfpkvfilje 
trqpgpqtkppffppifkl.trpggrpp* 
fpppqrvragkpp*pqsfrvpgaifppp 
ppplkghpprpgfqkkkkkerkkrptrk 

RGGT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 

last nmino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycirie, H=Histidine, 1— I&oleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V— Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 


6451 


20352 


A 


6505 


3 


281 


FFFCDRVLLCHPGWSAMVRVRSRLTATS 
/ S FLGS S DSHACLS S S WDYRHLP PLLAN 
FCI FLKRQGFTMLPRPVSNP * AQAKRLP 
QPPKVLGLQA 


6452 


20353 


A 


6506 


1 


247 


LGSLQPPPPG\SRHSPASASQGAGIADM 
SHRTRQSIVFLRTNNKPY* IENMMITIS 
CTTAYKYE ILRHS FNKI CARS LYEKL 


6453 


20354 


A 


6507 


34 


47 


FDPNLTPRHIRI ILWAVNKERMLTASRK 
KKQI I + RSSNLSGDRLLNEEAS WEP* CK 
KSMQET IQATEGWND I FKVLKEKKNL \H 
S KILCPAKLSFK YEGE VKAFPHKPKWRE 
FTVTKPILQKC*RENQRTFDLTQI 


6454 


20355 


A 


6508 


403 


106 


HYVPPPSPPS/YFFFSPPGKRGFFPNPF 
ILVSPGFFPSPLF*KPPPEFFFFAPLKK 
IFFSPPRELKFFFFKGPPPFFFFFF 


6455 


20356 


A 


6509 






LSFFIjFFRDKVSLCNPGWSAV /VTFELK 
RSSCPGLPK/SQSHCTKPQPGF* KYIYF 
SFFIK\KR*SLTTLPRLVSSSWDQVIHS 
ALWKP KVLGL* AVAHAYNPSTLGCQSAE 
WIT 


6456 


20357- 










ISFFFFFFFFLKKFFFFPPGWRGGGEF* 
LPQTPPFWVKKIFFPPPPKVGGFKNSPP 
PPFFFFFFKEKGFSPVCPGFFKTNPKK/ 
SPPPPPPRGVGLGGWPPPPGWDP 


6457 


20358 










I QAR KRRPKRE KHP KKVSCS VAQAG VQ W 
RDLVLLQPLPPGLRVSLCHPS/WECGGV 
VSAH*NFCLPV/IKDGVLPCCAGWS*TP 


6458 


20359 


A 


6512 . 


2 


332 


WEKVKIATIjTGFWK*LI/SGSSRTLMGD 
FERFKTSVEKVIADWEIARELELEVEP 
ENVTELLQSHYKTPGTLAFSRLSPDVSL 
YSQLPETRSHEWQAASKIC-PRKASTSQ 


6459 


20360 


A 


6513 


3 


294 


FFFSEMESLSVTQVGVWWHNLGSLQPPP 
PEFKQFSCLSLPSSWDYRHAPPHLiA/NC 
FCI FNRDG VL PC * LG WVSNSHLG FTGMS 
HCTQPLIiALSFLHG 


6460 


20361 


A 


6514 


1 


526 


FRLGRKPR*GGVM* PVWSRGEPGSVGAE 
AG/RS*SAPRRLLHHPAAGLATGLSASG 
RRSARWKMERASGLSPGGGLGATSRQMS 
PGTQLANPPDHGDKDCLGRI S PGSGKQ I 
QAAGQLPGPPTSLAPAQGRLRSLTPWGL 
QTPEHS E PEGIGHLQAATEAVLPHS TQN 
LITKRNLM 


6461 


20362 


A ■ 


6515 


747 


451 


QAGVQW*DLGSLQPPPPGFK*FSCLSLP 

QAGFELLTSGGPPASASQSAGITGMSHH 
TPLGVHALVGEN I Q 


6462 


20363 




: • 


• 


3 


SNKPKLPPCLPPGGPK/GNPPQKKKKKK 
GFGPG WAHACNP 1 1 PALWEAEAARGLP 
PD AGR YG PG GRRE A VRRE AGPQE CQDPE 
GQV+GDPRGCGAAHAPARGHPRKA*RGA 
EGSSPGLPTK 


6463 


20364 


A 


65] 7 


257 


575 


AS SVTWAGVR WCHLGSLQLLPLGFK* FS 
CLSFPSGWDY/ GHAPPGPAKFCIYSGDG 
VSLC*PGWSQTPHLR 


6464 


20365 . . . 


A. 


6518 


337 


389 


SQHFV* ISCLHDMPALASQSAGITGVSL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 

correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenyinlanine, 
G=Glycine, H=Histidine, I=Isolcucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Gluiamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RARLIFIFR\MDSCYVPQAGL,E/PPEL.K 
RSSHCGLPKPWDYRHEPQAQPVCRFIRA 
TKCAYVSTQALSPAPVQPGMTPPAKRGL 
PCQPPGSPCPSLCHHQIAGTPPYE 


6465 


20366 


A 


6519 


61 


323 


FMWDRVARKGSIEGASLELPLGGSVGGH 
GSCISMWGALGHSS\THV*HTHTHTHT 
HTHTQTNTHTPSS DSKKSS I L.DKEEAEG 
CMTP 


6466 


20367 


A 


6520 


399 


101 


KIQIKATMKY/HNIPIRMGFFFKLSSQG 
/C*+RCRK/IYTI J KHCQ*KCETVWPI,WK 
SVW/RVFLKLNIHPLJISPEIPLLGTYPR 
/DMKT*TCM*MFTATLFITTYS 


6467 


20368 


A 


6521 


1 


276 


RTRGFEGL1RFFLGFFGTVL1I1CCPGWSAV 
E\QSWLIAASASCLW\VK*SFCLSLLRG 
WDYRCIPPCLANF* *R/CGSWAQVILPP 
QSPKALGLQT 


6468 


20369 


A 


6522 


280 


277 


GDDPS CSMGGGHCIiASRSE * GS VGVGPS 
KPGAGKYNLLVCYLLRPLEKRS IRVGVS 
QFS RCHQSWIiPFDRKGNS PT PCTYWVR 


6469 


20370 


A 


6523 




327 


SLFSS "-D/YRHIPPHPGNFRIFCR/SAL 
TMLFRLVSYSWPQVILLPQPPSVGTTST 
SHCPQPCIMFLKADLNPVRCQFRCALAP 
EPYIQDPFLRDDVAFPCLGASAPWHLA 


6470 


20371 


A 


6524 


298 


414 


PGPQSETLTKKKXGGGRLKGSNFTPAGM 
EGNI FF/CFFLSKSHSVAQARVQW/P * W 
LTAGS LQAP P P RFKLFS S LL * V AI Y AS P 
TLLSS *DYRCTPPRLANFCIFPRDGVSP 
CRSCKAAVFNLLDTRDWFHGRQSFHRLV 
AGDGFRM 


6471 


20372 


A 


6525 


376 


180 


KCKKYPGGVAHACS P S * DYRHVP P / RPG 
YFLHF * QKHFGQAGIiDLPTS GDP PAS AS 
QSAGITGVSHRTQPQLNYF 


6472 


20373 


A 


6526 


415 


86 


LYLPYBLAIPFLGMYPRGKKKCAHTKIC 
TRI FTWLFI IARK* KRSKNPSTREW/ 1 
KM WN I YTVESHS V I KRKE LL I HAS M I L V 
H* / WM I FEN I I LSDRSQMQK I TYCMVP F 
T 


6473 


20374 


A 


6527 


39J 


166 


FFFFFRDWFSLSHPGWNAE \S *LTVISW 
SWSQARLI FKFLKLRQGLNLLTRLVSNS 
RLQG I LS S WPAKVLGLQA 


6474 


20375 


A 


6528 


544 


176 


TMRYL,LTPVGMAIIKKAK\NT*C**GGR 
E/lGTL/LQCR*ECKIiVLPLWRTVWRFL 
KKLAIELPYAPPFLFSLEYGLFSLGSTN 
CAQLRSVYRKAVQNHQGVSGECAGFQDS 
SHS APVAI PTARQ 


6475 


20376 


A 


6529 


392 j 


139 


TYVKMHCK/WLGMVAHAWNPSTLGGRGG 
QIT*GQASTKLSHSWGLSRESRFWLECY 
SRSCSFCSCRLPADCTSLTDSKQ 


6476 


20377 


A 


6530 


362 


87 


NGPPFFFFFGEIDKLILILLWKCK/RPK 
I AKTVFKKKNKVGGIil *HYFKAYYKGTV 
IKLVQFWHKVRKTDQ*SRRESPETELHI 
YVLVNARDSA ' 


6477 


20378 


A 


6531 


1 


346 


APTRPRLKKTLSKLKTTNLQLPYDPAFS 
FLSVYPREMKT * R\ FAKIC V*MFMASLF 
MIVTTQ/NVPRCPLMGEWLNKWR\*FIT 
* YHTGMKNT \VMHKTTWMNS QR I MLNE K 
SQSLTY 


6478 


20379 


A 


6532 3 


294 


FFFFRDRVFLYHPGWS AW* S * LTAALS 



945 



WO 01/64835 



PCT/US01704927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

nucle- 

location 
correspon 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possiblc 
nucleotide deletion, \=possible 
nucleotide insertion 














pwlkrhshlslpgs*ghrmh/pphlgnf 
fvXfclfl^rqclamlprqvlrswpqvi 
flpwppkavlglqv 


6479 


20380 


A 


6533 


2 


280 


PRYLVRNPTPTLGSCVTVDK* LSLWASV 
LY I YI YTHTHTD I YRKI YAF/ I FVCVYI 
Y I F I * KHI Y YKEQAHAI MDAE KFQDLPS 
ASWRPGKLVE 


6480 


20381 


A 


6534 


96 


183 


DQLNVFHRKLKRYDTYTFTVALLARHGG 
SC/L.SLGSS*D*RYTPPLQTN*K*FFL* 
ISLPMLDWLVSNSWPQAILLPLPPKSSG 
LEA 


6481 


20382 


A 


6535 


2 


241 


GFKRIDSNFESSVGKVPTN/TIFCERKS 
LSMHQI/SLLSYFKKLPQLPQCSAATTL 
\ I S * QPYTLKQDPPPTRLQPAEDSDGY 


6482 


20383 


A 


6536 


485 


97 


PTTRPILLLGFYTK*LKS\NKSDSCPPM 
F I ATL F/IL KCPSTDKWLKKMW YKHTTD 
Y YS AL KKGNHA / YM * QQMNLEG I ML S E V 
SQSHEGQLLHSTYVRYLK 


6483 


20384 


A 


6537 


411 


57 


LFPSHIKFVLSWQSCFAKTSGPQTPSFF 
FF*LFFF/RDRVSLCHLSWSAV/VSW3jT 
VASTSQAQACLSLPSSWDYRCPPPFPAN 
FSRRGFTILPRLVSNSRAQEICPPWPSK 
VLGLQRRGI 


6484 


20385 


A 


6538 


438 


201 


IYPKEMKT*VHTKTCI*MI>IATLiFTLAR 
KWKQTKCSSTDEWINKMWHIQ\TMEYYL 
AI KRNEAQLEHGE I IMLS ERGQS Q 


6485 


20386 


A 


6539 


466 


131 


SFPKKGGYRAPPPPPPPFFFFFFFFVEA 
VLSLCCPGGSQTLGLKRSAHLVLPKC * D 
HRHKP PCIASN/ S FFNRHRMMTNNCLMC 
IKSWCLCIINPFYHLDSSLLWMLSLIYR 


6486 


20387 


A 


6540 


419 


50 


PLPFFKPGPRIF/ CLGAPKKKI *VPT/L 
REIKLFPLKGPPLFFFLNKGPLCFPGRK 
GRGVLFFLATPKSWGQKIFPPSKEGASF 
PFFFPFFSW* RWDLTMLPRLVSNSWPHV 
ILPSQPPRVLGLQV 


6487 


20388 


A 


6541 


1 


341 


S 1 1 KTAREKCL VT Y * GTS I RLTGDF * KY " 
I/SAKILQVRRQ*NDIFKVIK/EKNCQP 
RIQYPAKLSFINERKIKSFPNKQKLREF 
XTARPDLQEMIKGILYLETKE * YLPS * K 
HMKV 


6488 


20389 


A 


6542 


409 


290 


NCKQPTCPSTNEW/ IKL* YICXTEYYSA 
IKRNKLQLHTTT 


6489 


20390 


A 


6543 


32 


397 


AMIVPliQASLGNRSRJCTLSIiKKRKKKKG 
EKETLPGGST* KEKVRKKQNNLLV* KTH 
IKPG\AGAHPGNPNIWGGGGRQITRSGN 
KNHPG*HGKTPFLQKKKKRGAl>LiKGTNL 
TARGRQSKTF 


6490 


20391 


A 


6544 


402 


0. 


LSWTQ* CKPIAPATGEGGTGGSLEPRES 
RLWCAM I APVNA/NCTPAVfVT 


6491 


20392 


A 


6545 


1 


365 


RKVTSLPSTSRCLS ILPHFSDGPRPVKP 
LQ* *LRCSICSASLPTHLHTARAAGLLP 
\ LAS STAVNQN* SRELGACVKWEHSG I K 
YGLF*RD/RGLTMLPRLVLNSWPQAILP 
GQPPKLLGLQA 


6492 


20393 


A 


6546 


1 

: 


331 


ISTFKSYLRNIFSKAIAAVDSNSTEGSG 
QG*LKTFWKGYTIIDAIKNIYDLWEEGK 
TSTLPGVWK\LDDFEG\FQTSVEEVTAH 
WQIAKELELEMDPEDVTKWLQSHDTN . 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

Amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6493 


20394 


A 


6547 


1 


417 


TRPKINKTKNLLFERINKIKKP*VKLTK 
NKNKNKTPHITKIRNESEAFTIEKLKSI 
LRE\YMNNCVYNLLYANRKPRQMDKFLE 
RYKQLKLTQEETETI,TD\YITSKEIKFV 
I * NFS TK *- S *G PNG FNSE FGRVLWFTP V 


6494 


20395 


A 


6548 


410 


288 


AHFLFFCINK\ SLTMLPRI/V *NMWP * VI 
FPPQLPKTLGLQV 


6495 


20396 


A 


6549 


1 


216 


IDRGFTMIiPQLVSNSIALKRSSHLSLPK 
CWDCRCEPPCPACFHFLSECNTCIKNQL 
HVTLFFPNT*GLTSEG 


6496 


20397 


A 


6550 


479 


146 


VPGFKQFSCLSLPNS*KYQRAPPCPANG 
FAFSVETGFPHVGQAGF*FPTSGDPPAS 
AS QS AG I TGLS LHAWP \ENL * KLRLHAT 
GLRLHDHVPLKVPNYLKVCSFKFEIADV 


6497 


20398 


A 


6551 


42 


484 


S MR WNS MNVT I M PANTTS I P Q P TDQE VI 
SVFNS/YYYYYRYCKATAAIDSNLSGGS 
RQSKLKTFWKGLTIIYAIKNICDSWEEV 
KI ST* T I * KKVI PTFMGDFGGVKNS I K\ 
EITADVAE I AR* L* LEVEPRVGWLTPVI 
PALiRKAEVG 


6498 


20399 


A 


6552 


304 


74 


HTGKAAKKKEC/WRGTVAHAYNPNTLGG 
*NGWIT*GQ*FETSLINMVKPRFYSCIC 
ISGYTHKVIHRII ICKGKKLE 


6499 


20400 


A 


6553 


432 


62 


FWAGNSRMQQWCTDT/VSAKCSLDLLGS 
SNPPASAS * VARTTGARHHARLMLITMN 
EKPLGARYCARIFFENKLGI YPNESVPT 
KTNANWS KPRSCFSHDPDPGHWI YLMS 
SGLIVNKIGKVK 


6500 


20401 


A 


6554 


2 


174 


YCNLCLPGPSDLPTSAS*VAGTTCMCHY 
TWLIFFCI \LVETGFYHVAQAAKLFI YS 

i 


6501 


20402 


A 


6555 


3 


268 


DAWADNIKCR*GYGAI*TLIHCWWECK\ 
W*ATWWFLIILNMLLPYNPVIMV\PRTC 
IQIFMSTLFINTKTWKQPRCHLVGEWIN 
CGTCR 


6502 


20403 


A 


6556 


3 


402 


DAWTDN IQCR* RYGAI * TL I HCWRQC \ K 
W*ATWWFLI ILNMLLPYNPI IMDHK\TC 
LQ I FMS TLF I DTKT WQ P PKCHLVGE W / 1 
KLWSMPAKEYHSWLARTEL/ IVP*NT*R 
RL.SCT * LWEKSQYE KATYCWP TV 


6503 


20404 


A 


6557 


396 


2 


KKQNAFQSPSFLPRL>*TPKITILISFHN 
IYIYTHTYIYIHVCIFTYIHTHIYIFF/ 
RDRFLLCLPGWRAWOSQLTAAS TSQAQ 
GIPG*FFFK/SIFCRDW/SITMLPELVS 
NS WAQAI P PP WPPKVLWDYRCGS 


6504 


20405 


A 


6558 


393 


175 


FKSQGLVLLPRLEFSGMLIAHCSLKLPG 
FSDSLNSA\SQ*LGL + GCTATPGPYIjLL 
I YAGELGLNKRS I SS K 


6505 


20406 


A 


6559 


401 


188 


SPPEPFILGGPRVFPPPPFLKPPPRI*F 
WGPKKKKNF/CPPPGPKNFFF*KGPPLiF 
FFFFFFFFFFFFHRSS 


6506 


20407 


A 


6560 


37 


343 


YEPAWTiLDIYPKEliKT*AHRKTCTRIL 
IASPFKTVKMWQPPD/CP*ADE/WDK*T 
ACLQTVE * CS ARKRS ELS TM / 1 EM WRKL 
KCVALSERG QSEQAAC YVI PTL 


6507 


20408 


A 


6561 


792 


141 


LFPPPP/YLSTSPNAGA*TTSPLPPQFL 
YHSPRRQQSGTKPPFFSKK/PEPPSP/S 
PNKFYPGPWWGVSPFKFPPPPQKS*GPK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 

nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutnmic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop cod on, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KS PL P PEAP / P AP PLTQDC P P FVSNPGG 
KKKTPP /SAKKKKDPTPS/PSPPPRH* I 
GGPPGFSPPPLFKTPPPKLILGPPKKKK 
IFPPPGGKNFFFLKGPPPFFFFFFFFFF 
FSLLFPVGVWSRWLRSEAADLCGECYSS 


6508 


20409 


A 


6562 


1 


356 


FFFNFIFIYLF/NFLRWSLALSPRLNGV 
QWSDIjGSIjQAPRPRIjMPFSCLSLPSSRD 

hrrppprlanflyf* *rwgftmlarmvs 

IS * PQ/CDPPASASQ\STGITGI chgtr 
LIPRNPMIA 


6509 












LiSLiPSSW/DYRRPPPCIiANFLYF* *RRG 

ftmvsis*phdrptsasqsagitgvshh 
tqpiisflksffftlsnrfnrvcidsel 

LLQCKDFYLQKFNLHIAFQEYTHCHGVH 
Y 


6510 


.20411 


A 


6564 


2 


51 


FWREMQIKTVLRYFSLKLVKIKK*DST 
FFWKGC/GETGNLIHKANW /R/SI *QYL 
KKLYVHFbFVPVIPPLEIYLKNKKILST 
K*LFAVFIVTEKC*KKKRCG*T 


6511 


20412 


A 


6565 


2 


511 


VRW/SCAGL/LHTAV*IPGRrrTSEKCA 
QQVDETHRN/LQRLQPALINRKGPILLH 
NNTCLYTAKPMLQKLNELGCEVLPHLSY 
SPDLS PTNYFFNHLDNVLQGKHSHTQQN 
AENAF* EFVKF* SMDFYATGINKLIS/H 
WQKCIACNGSYFD 


6512 


20413 


A 


6566 


3 


248 


GDICT\SMLVPALFTVPK/TWNQPVSIS 
K*TDKMWYIYPVEYYIAIKKTKFC/SFS 
ATWMILEDIMLSKPGTER* ILHVLYLCG 
R 


6513 


20414 


A 


6567 


3 




wrdhs slhs * pfglkwfshls plsnwdy 
RDVPACPANFEKVLi*K*GIjIMIjPRL\LL 
SSFPEW* ILSSRLPASSFQSVGITHMNH 
CS 


6514 


20415 


A 


6568 


392 


166 


GGGGPPRGGGLEPPGPPPFFLKNKK* /R 
PPGGPGPLFPPPGGGREEKFFYLGGKNF 
H*TRVGPPPLSGGKKKKIRG 


6515 


20416 


A 


6569 


234 


411 


PGAVWHAYNPNTLGGQGHRIS*GQEFKT 
S/LGNIVRPHLY 


6516 




A 


6570 






FLQHV*NKFQTMSDQIIGKIDDMNTLID 
LEKNVKDL\LITQTEVEELEGENKIPIL 
PG 


6517 


20418 


A 


6571 


392 


140 


LPSSWDYRHVPPCPANFVLLVETRFLHV 
SQAGLiELPTSASQSAGITGVSHQARP /E 
* *NS IRKGATTIQRDLGPVLQVRYLVRC 


6518 


20419 


A 


6572 


2 


228 


RTCTQMSVATLFLIAKNWKQPRCPSIGE 
WINKL* YI LTMEYYLVI KK\NDFSSHKK 
TRGNPKCILLSKRSQSEKA 














AUHSHPIjGGPGQGVPKSRIPNPPCPQGG 
TPFFPKNPKLTRPGFARLPFPPLKKLKP 
EKFPNPG/MPGVPLN*IPPPVSHPGVQP 
NFFSKKKKARGFCLVLNPSSS 


6520 


20421 


A 


6574 


406 


257 


NKRE KKTRKMLDS I VI RKRH IKS QMRCY 
SAPIQMAKV\SNRQSPKFWVRCRQTISL 
*HC**ECKIVQSMW*TLQKFLPAIVFLC 
I YHTEKLRFTQQP / CT* IFLAALLMIV/ 
RTV/KRI J VCPSMGEWLSKLW*VNLSFSV 


6521 


20422 


A 


.6575 


370 


68 


EQGSVGMRPSEPGPGYNLLVCR*LRPLE 
KRTIWAGVS* FSRCSLSRLPLARKGKS P 
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NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 
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od 
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NO: in 
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beginning 

nucleotide 

locntion 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=IVlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NP\CTS*VRFSNGLS 


6522 


20423 


A 


6576 


1 


259 


IDLIDVLRPSLTVT*AWQWCNLGSL* P 
* P PG \ SGDPS AS ASRVAGT I GQHHHR WL 
IFVIFVEIWFRHILVYFHTAIKNYLRLG 
NL 


6523 


20424 


A 


6577 


418 


218 


KGKTRIiGRW P PCN PSTLG \ GQGGW IT* 
GQAFRTSLANIVFPLM 


6524 


20425 


A 


6578 


1 


401 


EKAGCSGSYL+P+HFGRPRQEDHLTLGI 
*DQPGQCSENLSLQKNF/TNISWA*WPG 
IVAHACNPSTLGSQGRRIT 


6525 


20426 


A 


6579 


3 


356 


HWDYRHEPPFPVMHFFIWLPGCHSLLA 
S PNLTGCFFS I S ITESLLSLF I YLETRV 
*DLQCSYFWERRG\SLCCPCWSRAPGLK 
RS CCLRLPSS WDYSRLTPPHLANYCI LG 
GEGESS 


6526 


20427 


A 


6580 


411 


158 


IGSVAQARVQWRDLSSSDPSAN*APPGL 
KQSSC\PASLPSRWDY\NKCTPPDLANF 
YIFFVETGFPYVARLADCFYPPPL*KMT 
GSY*VFTMCQIPH*AFYKF*FI * FRVGE 
DKNSLPAWQHRETLSLQKKYKN 


6527 


20428 


A 


6581 


399 


218 


GGAELKVPK*GGGPGSPPLPPPLQKPRP 
KGPLWPGG*GPPRPPGEAPSPLKNKN* / 
PLPG/AGPPPVPPNWGGGFYPRG*GFG* 
LPLGP * TLARGKQRDPVFKKKKKN*NVN 
KKKAKL* KS*QFFFFFLKTGSRCFPRAK 
VQGPNGSYPKPQPRG 


6528 


20429 


A 


6582 


392 


214 


GSSDPPNSTSRVAGTTDEVLPCSPGWS* 
NS W / VS K* SACLGL PKC WDYTREPQHLA 
RSNF 


6529 


20430 


A 


6583 


390 


257 


PKRPPPPAFFC* PK\SLGMLPRMISNSW 
AQAI I *RWPPKVLEKQA 


6530 


20431 


A 


6584 


65 


251 


HYFNSFNFFLMGSSNHPTKNPG* FFFFL 
TFYKNK/ SLTI LPRLVLNS WAQGI LLP W 
PPKGLGL*AQGILLPWPPKGLGL 


6531 


20432 


A 


6585 


2 


143 


RHMPPCLSSF/ SYFL*RRGFAVLPRLFS 
KSWVQVILMPWPPKVLGLQA 


6532 


20433 


A 


6586 


142 


334 


GRKLSQLLILNFTTRSVLKQMSW/MACV 
SETWPA/STLGGRGGRIT*GLKEFKTSL 
QNMARPYLYK 


6533 


20434 


A 


6587 


485 


377 


RTRLSCRFFILSLIFRFGNFYCFTLKF/ 
H**VFLLSFVFYC*GHP/FEFIFKVFFI 
IFFHFKVFICVFKYLLFFFILIASTFYF 
FVNIFSFKKFVSVMFVLLIEMFYYGRFT 
IPLR*F 


6534 


20435 


A 


6588 


241 


251 


NSLFTKKKKKKGGALLKEPILRAGGGEG 
1 1 FFKGPQKS I PGPGFKKAGREKPGPS P 
*NWFF+KGPPFFFFFFCKE/RSFTMLPR 
PASNSWAQAICLPQPPKWGL 


6535 


20436 


A 


6589 


429 


54 


YFLFLPSLPTFFPSFSSFLPSVLSFFLP 
SI/SSFLPSLLFYILSSPPLSSPPSSLF 
LS F FLSRVS FCHPG * SAVAQS RLTATS I 
SWAQAI LP /CR/NRDFTMLSRLALNS * P 
QVIRPPRPPKVLGLQA 


6536 


20437 


A 


6590 


411 


137 


KCIGTITTHCSLHLLGPSDPPTSASRW 
GLQAHHHI QLAC\ F * L ID* L I D * LNVEM 
ESRHVAQAGLELLGSSDPPTLASQNAEI 
TEMSHCIF 


6537 


20438 


A 


6591 


412 


196 


QPQPPGAKKTSGGGFPIR/WGHQPPPPG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 
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NO: in 
USSN 
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26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

lo cation 
correspon 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, A=possible 
nucleotide deletion, \=possible 
nucleotide insertion- 














AKKYNFFFFFFW'RQAIAMLPRVVPNCQ 
T*AI I LP WPPKVL * LQA 


6538 


20439 


A 


6592 


28 


400 


IMPNCFPKWLYITFFFFFF+RQSL/DSV 
AQAG VQ *HSLGS TATSAFLGFK* FSCPP 
AFLE * LDYRHVPTHPTNFCVFNKD/MGF 
TMLARLVSNS * PQ / CNPPAS ASQSAG I T 
GMSHRAQPHLFVLKMEKPPPNFLFILLM 
NQSIHI YLLNFYRF 


6539 


20440 


A 


6593 


414 


23 


QTHLFWFPPGFSPPPFFKNRPRIFFLGA 
LKKNFFFPPPRHKNLFFLKAPPFFFFFK 
RSFFPPGWCAIGLFSLQPPPPRFK*FFC 
PSLPKKWGYRPLPPRPGKF/ CFFFFFVI 
LVENGFHHVGQDGLDLLIS 


6540 


20441 


A 


6594 


91 


274 


I LHWS * PLGNYHL / WLG WAHARDL STL 
GGRGGR I AGGQDFKTSLGS KQGS I S ARK 
KEKKKK 


6541 


20442 


A 


6595 


391 


251 


HLKRCSTSSVIREVQIKTTTTCHFSE*K 
R/SEINMCW*ECGQTELSALLIRC*\WD 
SEMVQPLWERKEYLKQISTESSWAPAIP 
LLDI*P*ERCASLSLRKGWGYRPEPPEI, 
ASFLLGSYI 


6542 


20443 


A 


6596 


465 


268 


I LLCHLGCSAVAQD \ LTLCSLKRSSHLT 
SPSSWDSRRGPSHLTKFF/C* *RGIjNTL 
PSLV+NFWAQAL 


6543 


20444 


A 


6597 


3 


229 


CHGSPQPCPPGLK*SSHLNLPTPK*L\D 
YRGMPPRSANFLFL*RWGRCVAQAGLEL 
PDSSSLPASASCPRQPVLYF 


6544 


20445 


A 


6598 


122 


418 


IIVNELCHIMCFFFFFFFKQNLLGTQAG 
GQGGKNNG*LKPKPPRLKGSPWLTLPGG 
WNYGGFPP/HPS*FFFFFLGKKGPYFCP 
PAGGKFFNLKGPPFFA 


6545 


20446 


A 


6599 


1238 


930 


FETDSCSFA*AGVQWRNLSSLQTPPP\G 
SNNSCASAS/RIAGITGTHHHARVIFAL 
LVETG FHHIGQGWS RTPGPQAI RP PQ P P 
KVLGITGVTSRARAISKHFV 


6546 


20447 


A 


6600 


398 


78 


P SS P F YLF I FRQS F P FFARVE E KGGHPG 
FPNPPPSGLKKFFYAPAPPGS KAPGQGP 
PA/RFYLFIFRQSFPF/CCPGGGKRGPS 
GFPQPPPFGVKKI FLCPSPPRK*GPRAG 
PPRPRKIFFFFFSSLFFFFFVRWGFTML 
PRLVSNSGAQAI CPPQPPKVLGLQV 


6547 


20448 


A 


6601 


3 


473 


FPKGKKKGPPPPAGG/QLKNPSKKPNKK 
PGAPPRGPRGPKKGGGGLGSPPQSPQGG 
E KWTLKGGPNLP P PRGGLGGARP P PKIG 
GGPLFPKGGGPKGGTPPPPLFFFFLRRS 
PALSPRLECSGAI * AQTSS/SPPARVGC 
PS P PAPTR PTGLTVS PGSN 


6548 


20449 


A 


6602 


398 


156 


NGRKHTDVKPEELPSWVPTRDLTPTGTA 
GAF / Q / RG Y YGY YNKYVTVKKGS IAGVS 
TVLAA * VL FSYCLSYREI KH E QLLRAH 


6549 


20450 


A 


6603 


380 


77 


PFPKKSPPL*NFPPFF*KKKVPPPPPGK 
LPLSFLFFFFFF/RDRVLLCRPGWSAMV 
HGHGLSPVNFCISTRNRAL/SMLPRPIS 
NTWTQAIRPS* PPKVLGLQM 


6550 


20451 


A 


6604 


352 


204 


KRGPKI *NFFFG*RQSLHMLPRWPNSW 
AQMILPPWPP\KVQGLQA 


6551 


20452 


A 


6605 


631 


357 


FFF*DRVLLYHPGWSAAAR\SLLTTTSA 
SMV\KQFSCLSLPSS*DYRCAPPCPASF 
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sequence 


M 
eth 
od 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=IYlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














CIFSRDRG FTMLARMVSNS * PQ V I R P P G 
PPKVLGLQA 


6552 


20453 


A 


6606 


3 


397 


VLHKI/TQRCSAVPLRAPTAFLEIGTQTP 
KRVWKCR * P /RETQNNLLDKEQD FKTYQ 
KATVTKAWCQHEDTIEQWDGIESTEKK 
FHTQRAGGRDHSHSDSVRKKRKESLFFF 
LKRGLILFPNLEGTRGGISTP 


6553 


20454 


A 


6607 


323 


35 


YLKTKIGKPGGLAALQFLGFRPSKNTQL 
GNRFMCTHTHTHTHTHVHPHTSQYLYPH 
RSISFCSYLDSI/SP*VLTESSNSNPTP 
LKCC*HFPSPCL 


6554 


20455 


A 


6608 


439 


252 


LVPPLFFPRSFI.LFSPPKKRGGPKPV* F 
GPPRAFYPPPF/V*RGPPPFFFGAPKKK 
IFFSPPPG*KCFFF*SPPPG 


6555 


20456 


A 


6609 


3243 


2602 


FFFGRDGVLiLCCPGWSQTPRLKQSSSLS 
L\PKC*DDWH/RATSSGLQVFLSICCSI 
ISIFFMPSICDKSTKQFSN/PQRAYIIiV 
G*R*VIYLGWLiAFLiFVFLiRQHL/NTVA 
QAGVQWCNLSLL* PPPARFKRFSCLILP 
RSWDHRRLPPRPANFFVFLVETGF\TIL 
ARLVSNS * PRDPPASASLSAGIMGVNHR 
TWPILVFLHPSVFFKVLQLFYL 


6556 


20457 


A 


6610 


81 


395 


TWAAL KLYSTCIFSSS ETE PD VS QSQ I T 
DTTACIME PGLFW/YRSIIWTIj*FFHFF 
WRDGGLTLLTRLVWNSWAQVISL.PGPPK 
VLGLQASATAPNSFILIREGIS 


6557 


20458 


A 


6611 


1 


126 


GRLRQENHLNPGGRGCSEPRLHHCSPAW 
AIERDSVSKK/NKNGLKRCFHLRKKTLK 
GEKRI W*PGGRGCSEPRIjHHCSPAWAI 
ERDS VS KKKKMD 


6558 


20459 


A 


6612 


389 


67 


YLFMYLF* /DRVLIiCHPGWSAVAQS*LT 
AASNSWPQVILLSNRFKSWTVIiLIjSSKI 
PWQGSTPCPRCYSWHPKCLQLQGLLGL 
DCSSPKYPRGPSLAAFRSLPKWHV 


6559 


20460 


A 


6613 


1 


384 


RCFKVKDLKMRPFWNIQVDPKSNDKCPC 
KRLKRRGHASLPPWEKRRRPGKDGGRGW 
NSAVTSQGT/VEATRSWKSQGSVLP*SL 
* RERS P V\ ELAHF * QRIiAMLPRL VLNS W 
PQDISPPWPPKALGLQEP 


6560 


20461 


A 


6614 


385 


174 


CLRLSALKKPKLNSWY*K*LYIMEY*SA 
LKT\NKG I LS FAI TWMNLDD I LLNE I SQ 
TQKDKYHMNSLICEI 


6561 


20462 


A 


6615 


1 


370 


MNPRAHl/SFLiLRRKERRIIjWLKALGHIM 
/FTV*GLTGIALANSSLiYIVLHDTYYGV 
SHFHYVLSIGAGFAI IGGFIH* FPLFSG 
YTKKKKKNIRGGGTKNFLLCPALVQREK 
VNPFFALTNLWG 


6562 


20463 


A 


6616 


3 


342 


YRHEPPHSAI FAQLYKL* K* L I YALKTG 
KVYVYKLYLNKAI KSQ \ S YNRASLSLT* 
* PRVKRKRS * PGTGAHTCNPSTLGSQDG 
LIA*VQEFETNLGNIAKPCLHQKKKKGA 
PV 


6563 


20464 


A 


6617 


1179 


915 


FFFETESCLVTQAGVQWRDLGSLQSPPS 
G\SSDFQLIFVLLVHGOAGLELLIS*SF 
ALASQTARITGVSHHAQLVWSDFIKSIN 
LFCCL 


6564 


20465 


A 


6618 


3 


390 


HASVKMELPYDAAIPLPSIYPKELKARF 
*DLCIPM/FITALFTIAKRWRQtiNCPSV 
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Amino acid sequence (A=Alanine 
C=Cysteinc, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Clutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, "=Stop codon, /=possible 
nucleotide deletion, \=possible 














VIRSWGREGVKESCLMGMEFHSAR*KSS 
GDTFHNTENC Y* TVHLKI VSGKF* * L * C 
VLNYVGFFYHNKNIKRLS 


6565 


20466 


A 








QFLFERLFSSYSGYVSGVKMLGSTEVQLi 
FPRKCEMKSCSVMPRLECSGGISAHCNL 
CLPGS RDCPAS AS * VAGL I FY / 1 FLVEM 
GFHHDG / REL VLNS * P Q VI R PHQP P KVL 
VLHV 


6566 


20467 


A 


6620 


193 




TPGLK*SSCLSLQNSWDCIWDYRRLPLY 
PVNCFCC*FVCFVC/IFVEPGSCIAQYC 
LELLASSNSPMSASQSTGI IGMSHGACL 
ILT 


6567 


20468 


A 


6621 


383 


8 


LNDTVSTFFF*DRISL\CPSWSAVARSQ 
LKAASTSWVQAILPPQPKTNLLGSSDSS 
TSA* DQSPGFK* S FCFRLPGSWDYGCVP 
PFSANF*IFCRDKLSL/WAQAILSPWPP 
KVLGLQALWEAEEGG 


6568 


20469 


A 


6622 


395 


42 


HVS S TS FQGDE LHIiR I VALPL P \ PLHNIi 
PLPLS PRSL* C/HSRGWTVLPPSRLTAT 
SLPD/SSCLSLLSAWDCRRAPPHLSGFC 
WSAVA*SW\SLQPPPPSRLPWPPKVRRL 
QPLPSRHPV 


6569 


20470 


A 


6623 


3 


327 


PS PPSRWNPKDVP PNQENLG I F FFFL * T 
RDLATLPRLVPNS * TQVI /WPPKVLGLQ 
A 


6570 


20471 


A 


6624 


395 


47 


KNRALFFFPPPRKRGFFPKPLIWGTPRF 
FPPPPF*NPPPYFFFGAPKKKFFFS/SP 
PGGKIGSFKRPPPFFFFFKQNFFPVPPG 
GG P RGD FGS LQH P P PR VKG FS CLAP 1 1 F 
FSSF ... 


6571 


20472 


A 


6625 


100 


267 


LSIiSKKANISKFWLiGT\VAHAV\DPSTIj 
GA*GRWIT*VQGFETSLGKIVRPCLYFC 


6572 


20473 


A 


6626 


409 


126 


PSSSFSSSSSSSSPSSSSSSP/PFFFFF 
FFFSFW*KWGVSLCGPGWSQTLGLKQSS 
CLSLPKCWDYRNKPLHlAFIiASLILFTF 
QEWDXTD 


6573 


20474 


A 


6627 


506 


323 


REAPGTIjIPCWQKYDMIKAQIjVWQFLRK 
PNIYLPYNPAIPL*HTCPRKINTQYT/C 
PITKT CTRMF IAALF 1 1 PQNWKQPT * P F 
CPFTIISHSCSPRDNWF 


6574 


20475 


A 


6628 


330 


117 


DVCPRKIYTLVLQQLCI\RMFTAILFSI " 
AL * FCKPMCTS I LE W I KKL * S I RTTE YH 
LVMRRNKPLLYS MPQM 


6575 


20476 


A 


6629 


396 


139 


FFSPPEKKGVFPSP*NLGTPGFFPPPPF ' 
KFCRPP*FIFGAPKKKK/HISPPPAEKFF 
FFKRPPPLFFFFFFFFFLFFFFFSAFKL 
KV 


6576 


20477 


A 


6630 


79 


405 


VSGRMTALESYLLASSA\RNKCSSNCKI 
KKMTARLDVIAHRCNPSTFGG* GGRI C * 
AHKFETSQGHIIRPCLYKKKKKGRPFKR 
TQFSPARWQGNGFFWGTLKNSRAGF 


6577 


20478 


A 


663] 




417 


SFIFPCKFLNQFISFYKNVCHDFDCNCV 
ESKDQLG/WPDAVAHTCNKNTLGSQGGW 
IA*AQEliKTNLDNMAKPCIiY 


6578 


20479 


A 


6632 


74 


417 


KTVFRYNGWAL*ASENRISLADLLLPSF "" 
HVLLCLPKAG\PNFPLPSS IKKFFDLPD 
LIRSHKTHISEGVLFPFP / RRKE PRTER 
PRR I * TDRS CWVS SLS L I VLDHAFFVQS 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=isoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
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T=Threonine, V= Valine, 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
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QFYM 


6579 


20480 


A 


6633 


740 


776 


CSFFLKKQTRSRAAVQAGVQWCHYSSLQ 
PS PPDLKQP PAS AS QTV/KD* GAYFWCC 
FFLFLKR WGL /NS WQAGVALL PDLP W 
RHLVIPLSLWAVEPALWAAAQL*LL/SP 
CPLAS QS PRHQTGQS PHQKCFPDI LGLH 
IALRSSFSPGLERPTLRLPPGLASFHT* 
ATWLSWPLPYK/ PPKRVKMYLVKADRPE 
PFIj*GTRGRKSVMQMTHAKTLQETIjYSG 
PEAQAAASVTDEF 


6580 


20481 


A 


6634 


2 


296 


FFFLRQSFALVVQAGVQWHDL.SSLQPPP 
PRFK* FSCLSLPSGWD*RHAPPRPAI FV 
FLVEMGF\TMLATMV\ * IS * PGDLPTS A 
CQSAGITGLSYRARP 


6581 


20482 


A 


6635 


407 


212 


GRPR*TPPPLGKKKKGAPPPKTKKKKNF 
LGGRDSYSQKKPFFF*RNKTRPGGGGPP 
GIPPPLKG*AKKKGLPRGPKFPVT* /NK 
PPPPPPGGNNQNPFPKKKKRKSTR* I *K 
IRISRFILYFFFFFFLERGFGCSPRGGG 
GGAFLSYGKLWPPG 


6582 


20483 


A 


6636 


20 


223 


PCKKIiPQPLQPSAT\TLISQQPLTLRKD 
LLPGKRL* LAEGSDDSYQFLVAKYFKI K 
GLSFGRNKLANI 


6583 


20484 


A 


6637 


866 


647 


FFLDRVL/LCHLGWSAVNRSWLTAALTS 
GLKQS SCLSLPKLRDYRHE PQHQANNI S 
*SSWSDSNMQPRLKTTL 


6584 


20485 


A 


6638 


398 


26 


KGAPPPNIFFFFLFNIFFFFFFFFFFFF 
FFP INFLQCLLYYRAIjHFN KVKVI NN F / 
PLNFCAY YH PL* NM * FS Y I S FQN LY KF F 
UI/FGVFNLSGIYFNVL*IANF**G*LI 
LPGTIYLKLCYFTTN 


6585 


20486 


A 


6639 


1 


341 


NWDYRHALPCSANFFILVETGFCQLPRL 
VLNSCPQVSHHA\ KFLFIFFFLFFLEEG 
GFVGPPRLEYREGSLQP*LPKLKGSTCL 
TTjPKG*N*RGEPIiPPARSSFFFISKFFI 
ET 


6586 


20487 


A 


6640 


391 


118 


NANFEKSFSMGKIL*NSLAWYKKILCD/ 
RKNRSMQKTPLD* * YYFKKLPQPL* PSA 
NTTLIGRPALTSRQGLPPAKTL* FAEGF 
DPMILSLF 


6587 


20488 


A 


6641 


352 


285 


KKKKKGGGGGAYSG I PTF / CFFFLNGVF 
LGPAGVQWQNQRSLQPPPPGVKLFFFLS 
IiPSH*GPRHGPPRPGMFIYKNF/CFF/V 
RDGVLLHCPGWSQTPGLK 


6588 


20489 


A 


6642 


433 


64 


TGNFFLDSLFYFNPGPLGGGGIIiKGFFS 
FFFFLETKFFFFPPAGGQWRYFSSL*PP 
PPKFKQFFFLSM/WPRPGKFFFFFFGIL 
VEAGFHRVAQAGLKLLSSGNLPTSASQS 
ARITAKSHRTVF 


6589 


20490 


A 


6643 


424 


11 


PPS I FLAPPPSRGGGPPPPPKKK/ QS PP 
KKKKKGGGGGPKNPPPKKKGPPKKTTGF 
KKNPPQKKKKKNSPPKKKGGPPANK*KP 
PPRPQPFYFLTFYYKI FIYI FLLKFLLF 
YILFFFFIiKSSKQGRPLAI 


6590 


20491 


A 


6644 


411 


2 


PPLRIWLASPPSKIF*PQPL.HI/YFSPF 
PP*KFFFSLKPLIFLGGFFPKFPPPKKS 
FFPKNPQGVF*NPP* KKKIYFSSPP*NF 
APPKI FLKTPPPFFFFFFFFFVEMGSRY | 



953 



ISDOCID: <WO 0164835A2J_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
bd 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 
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correspon 
ding to 
last amino 
acid 

peptide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S-Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IAQARTRERTRGRTRGRTRG 


6591 


20492 


A 


6645 


34 


381 


AKLGTRGAAKQEKEIKVTRI *WKEIELS 
QFTEYTQKNL/NVTDKI J L*LVYMFSKIT 
GYKVNLKNINCIFHKHTNQQQKIIjKMPF 
IIA*K/TIK*L*INLTKDTQDLYAEIYQ 


6592 . 


20493 


A 


6646 


17 


448 


HFYAKLGTRASLSCSGLHPV*ASQPLYL 
PTQASAMADAPFPARLPPNSLISYCCTS 
NEQGSVGVGPAEPGAGYNLV*HLLRPLE 
KHSI*VAVSRFSHYSLSRLPLARKG\NP 
QTPCASWVRQCPTLLRLiALHGIjHPLSNQ 
SQGD 


6593 


20494 


A 


6647 


462 


336 


SLRLPGSNDSPASAS*VA\GITGPCHHA 
RLVFVFLVEMEGLGS 


6594 


20495 


A 


6648 


447 


62 


KDDIWIAITHMNRWSISLAKRKM*IKIT 
MKYPYTS I KMA/ 1 KQTNKN * KKSS S I KC 
W * ECRTMRTL IHCW* * HITG * PLWKTVW 
KFIIKLNLYXjPYDPA\TCTQMFTA/ALF 
IIIKY*KQSKYPSSGYCVPA 


6595 


20496 


A 


6649 


3 


1492 


liAYARAFLDDTNTKRYPDNKVKLLKEVF 
RQFVEACGQALAVNERLIKEDQLEYQEE 
MKANYREMAKELSEIMHEQI / CPPGGED 
ERLTE F P SHLQRHQWDSNKHNGS RDDQL 
VFGRVITSHGPCVGTCFVICKLRMLSKA 
NH WGDRAQ/ ERTKG KGRERK * RTTLFLN 
R/RFYRSWKKVHIFF 


6596 


20497 


A 


6650 


495 


170 


G AAE F PAS QS AG I AG VS HHARLE E FFS L 
DSGRVMRGTEVLKYLGGYTITI *S IFFF 
FSFF\WPGLGSLQPLLPGFK\LSL.LSRW 
DYRCPAPHLISL 


6597 


20498 


A 


6651 


9 


222 


QTTRPERRNSGDPTHLSL*SSWDYGHVC 
HHAWLIF*K/CFSGGLALLPRLV*NSWA 
QAVL P PWPLKVLGLQV 


6598 


20499 . 


A 


6652 


241 


833 


KKTPLAMWNPWPGP PQIPHPLTKLVQMA 
G / QENGCLS GHDLE S I RPAGLHNS ARGE 
VL.GLS SSLGKELVFLQEELDLSE IHI PE 
AQEVEMASGHFAFLHVPVPDGRAPYCKA 
SLSASSSLSPTPPEDTAISSLRPPSAPE 
MLTQHGAQEQVEDHPG/Q*QPSPHSQSR 
PSPQKDPQALVIASLRSRTERKQASHGG 


6599 


20500 


A 


6653 


1 


398 


FFFDIGSHSLTQAAVQ*HDHGHGSLHPR 
LPGFK*SSHLSFPES*DYSRHTPPRPAN 
F/EVETEFLHVPQAGLMRCNLLPAT/AT 
SKSARITSVSHHVQPEVSIKW*NNISNL 
I KCFLLF YKYKC I F INTKFVFTV 


6600 


20.501 


A 


6654 


489 


282 


PAS / WDY / RHMP PLLANACAFLC * RDFS 
MLPGLVSNWWARVI CLPWPPRVLRI QAR 
ATTPG 


6601 












Syiat-bVlj \KLit.l-h>CjAISAIK_NIjCLIjGSS 
YSSTSAS * VAGTTGARHHTQLI FVFLVE 
MGFHHVGQPGLELLT\ *VIHLSRSPKVL 
GL*APSSARGQNSVSQ 


6602 


20503 


A 


6656 


412 


44 


R I SERRKS LGAVKTKDKKYSS SRVRTSN . 
PNCARSGASS KQS FLFGKKRAPHPG I KS 
F\KDALLWDGITGVHHHA*QI FVRSRE 
R*IRPVIPALWEAEAGGSRAQKIETILA 
NTVKPRPSLLRA 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteinc, D=Aspartic Acid, 
E=Glutnmic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, 1-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6603 


20504 


A 


6657 


3 


241 


FFI FLEAR SHS VIQVGMQ WCDHS SL * TQ 
TPGLKES \ PASS FHRAG ITGVSHHTYLF 
Li*GH*SHSQGPHPHDLITSKRPCS 


6604 


20505 


A 


6658 


467 


42 


F FFFFFKQGFGL FP QLEG KEG I PGPCNP 
CPPGANDSPPSTPLGNGGPGGPPPNPGN 
FGIFGI *GGEGGF* TPGSGKPSGPGS PK 
\IPFCWVIRERPHGPPPRPFFFFLRQSL 
CFAQAGVQWCDLGPLQPPPPRFKQFCLR 
NS 


6605 


20506 


A 


6659 


2 


286 


KNL KINWT* WbM P WS AT / WGGS LKPGR 
LRLQ*AVIAPLHSSLGDRARPYLSVSQY 
IiYFSLSIiSLSLS LTHTHTHTHTHTHKI E 
LLGCVAEIIHFF 


6606 


20507 


A 


6660 


3 


249 


LSPRP*CGDTISAHCN1,HLLGSSSSPAS 
VSQVAGITGACHHAWLIFVFSVEMGFHH 
AGQDCLHLL/NLVIHPPQPPKVLGLQA 


6607 


20508 


A 


6661 


3 


1507 


PEAPVPFLDSNQENDLLWEEKFPERTTV 
TELPQTSHVSFSEPDIPSSKSTELPVDW 
SIKTRLLFTSSQPFTWADHLKAQEEAQG 
LVQHCRATKVTLPKSIQDPKLSSEIiRCT 
FQQSLI YWLHPALSWLPLFPRIGADRKM 
AGKTS P WSNDATLQHVLMSDWSVSFTSL 
YNLLKTKLCPYFYVCTYQFTVLFRAAGL 
AGSDL I TALIS PTTRGLREAMRNEG I E F 
SLPLIKESGHKKETASGTSLGYGEYVIK 
ITL/SSSTDLWTHEIDLHRNSL*NRDSN 
CSNF/LREQAISDEDEEESFSWLEEMGV 
QDKI KKPDI LS I KLRKEKHEVQMDHRPE 
SWLVKGINTFTLLNFLINSKSLVATSG 
PQAGLP PTLLS PVAFRGATMQMLKARSV 
NVKTQALSGYRDQFSLEITGPIMPHSLH 
SLTMLLKSSQSGSFSAVLYPHEPTAVFN 
I CLQMDKVIjDME WHKELTNCGLHPNTL 
EQLSQ I PLLGKS SLRNWLRDYI YNWRS 


6608 


20509 


A 


6662 


326 


1 


IMIFLFFQTGSRFVA\RLECSGTITTHC 
S LNL PGS SN P \ P TS AS * WTHCRTRV I L 
NFSVDPGSRHWKGGLELLDAILKPWPP 
KVLRTRGS TPE FRSGGSGVR IRGFNF 


6609 


20510 


A 


6663 


414 


105 


QGKVGFKVGLNPLSRLLHFFKQKLFVLS 
*KGQVKKLSLKIPFPPGGFFSPPI *NPP 
P*NKPQRFKNPFMGFT/IRKKILFLAFK 
STKSGG*NPPSK/HKKI+FSF/ CFWETE 
SRSVTQAGVQWHYLSSLQPLPPRFK*FS 
FCKPHKRVFKPLGLVLWGG I SNRRRKKT 
SGGKRNFKA 


6610 


20511 


A 


6664 


1 


407 


GCVPWPAAREAEAG * IA*TRET/RLiRW 
AGIVPliHSSVGNKSETLSQKKKGAPSKK 
SLQGVALFLSQLGKLPPFLQRKKRPQRG 
GGRGQLAPGYNPPAGKTLVGPGPHLSFC 
SKKAWKELRPWKKPRGFWFLINST 


6611 


20512 


A 


6665 


1 


280 


FFFF*HRVSLCCPGWSTDLHLLPP*LAR 
FKRSSHLSL.PSSWDYRHTPT/HPGFFVC 
LFVFCRDRVS FVLPRLVSNFWAQAILLC 
GDRKVMGLQA 


6612 


20513 


A 


6666 


408 


89 


WLPNSLILPN*NYFVSYLSDRWSLCRP 
GSTAVAQ * ELTA\A* TLGLKRSAS FSHS 
HSCSLPSSWDYRCTSSHLANFFFKRWGF 
TILPRLIIiNSWPQVILLPQSQRH 
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26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
loca tion 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


6613 


20514 


A 


6667 


484 


20S 


ITWSAAGLIHYSFLNPGEPMTTEKYAQ 
EIDDIH *NLQHLQ PAL VDRKG P I FLHDH 
A*PHAAQPK\LQKLNKLGCRVLPHLPYS 
PDLLPLT 


6614 


20515 


A 


6668 


495 


227 


P AF VK VPPE KDQRQNS THAG RNHL PM E I 
LLL*KKKLIETESHSVT\RLECSGVITA 
HCNLKLLGSSDPPTSASDYRIiKPWHLAP 
SSFSMS 


6615 


20516 


A 


6669 


2 


65 


DGVLLTLSLLARLVL \NS W PQV INLPWP 
PKVLG * QAGLGT PGLK 


6616 


20517 


A 


6670 


403 


224 


ILL * P * P PGLK * / FLPS S WDWRRAS LCL 
ANFLFFVEMGYPRVARAGLKFLGSDPSA 
SQSAGVTGVSRPGRFLIRTVL 


6617 


20518 


A 


6671 


406 


162 


RS YAQAGTQWLDFS SLQP * PPGL I KQSS 
HFSFSQSASH/RHPDNYCIPCRNRVYVG 
QASLE LLGS CDLPAS ASQS AG I TGS CL 


6618 


20519 


A 


6672 


453 


32 


WSATLHTSVWCACATRCVGMCMTIjGV 
CGCECWV*GCVCTSVN* \HTSTGWSEK 
V*LCVCGCV*LCVAECMAFVSRQCVCDP 
VCCLPEKMRLSPISVA*VCAPMKVSPCP 
PQLSELHLLSLNIYINKCVSLSQNAWLF 
L 


6619 


20520 


A 


6673 


378 


180 


NEVIRSYKVSLFSPRIEKWINK/l/HWY 
I Y PME YDS A I KRNE VL I HATT * R KLEN I 
TLNERIFDIIPFI 


6620 


20521 


A 


6674 


378 


94 


PVKNRRTKELH I KDVCTPMLI VILFMI V 
K/S * KQPKCP * IDEWMNKMWYTHMVEYC 
SVSKKEGSPFICYNEDITLNEISHKKTN 
IYNSTYIRYLE 


6621 


20522 


A 


6675 


1 


380 


VCSL*VLCRHMSCFQNCSL*FSFSFWFF 
WRQCLPLSPRLECGGPMSAHHNLRLPGS 
S DS CAS AS LVAG I RGACHHAR VI FVF W 
EMG\FHCVGQAGLELLT\*VICPPWPPK 
VLGLQVLATVPGQPHIF 


6622 


20523 


A 


6676 


424 


135 


SHRWGVKCFKYNIFFFRD\RV*LCCTGW 
S*TPELK*SSCLGLPKCQDYRHETLRVS 
Y*F*NSI *QIGVLAMVSKLVLNS * AQAI 
LLPWPPRVLGLQI 


6623 


20524 


A 


6677 


408 


91 


HTHFVWPPVLIP* * PMR*VLLLTPNTNF 
FLRDRVSPCHPGWSVEAQSQITAASN\Y 
GLKQFSHPSVPKCWDYRRELPCMVSLI I 
FIYVHLHMYVHARKEKPEYPL 


6624 


20525 


A 


6678 


123J 


217 


PPYFNWPLTFPQKRFFKGDKTLKKWL * I 
RE 1 1 LQNRGIG * KTDS IR * MGKTEAPLK 
EKFLLGTALQPSFLPYMLGHFKEE*GEE 
FPGTHSWP FPS F PVWGLGLS VNHRG IHG 
PRVSLQTWGPGQEAMECECGNHGTEDTG 
\GAVLQPG*VT*GFHTPTSEPGLVAPPC 
TCSGLKAEA\GFLKLPGKQG\SEGALPP 
*SPLPSACFPELGLRRLGSSPGLMIHPG 
WGLFTSPLGLKILGPRHPAGPALPQIAQ 
RPSLKFR*RRLGWGPELLPAQTCGAQSL 
GPSTAPLLGQGPWGLG*HASRWGTLGLA 
VAPGNEVPSLSPR* PRGAGPLHRHRVPV . 
RSWG 


6625 


20526 


A 


6679 . 


1 


178 


RGWDYRRPLP/RPG*CFVFLVEMGFHRV 
GQDGLYLLTL * SS QS AGI TGVSHRARLQ 
PPIF - 
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sequence 


SEQ ID 
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sequence 


M 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid. 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H-Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, "=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 


6626 


20527 


A 


6680 


3 


135 


KSEHLEREPI IEQLSTMFF/TAKYGWHL 
RGH * HRCQKKPEP PEDR 


6627 


20528 


A 


668 1 


425 


107 


KGPPTPS/PPPPSPVFPTEKILVPPPFF 
PPPPFKKPPPQNIFWGPKK/IKNFSPPR 
GEKNFFF* TPPPKYFLGAKKKKNFS P PR 
GEKNFFF 


6628 


20529 


A 


6682 


424 


284 


CFLQPGP \PGS KGAS PPS FQDNWGLRQA 
LPP P / RESFFLFLFLFFP I VQTGFHHVA 
RAGLELLTSS DLPAS ASQS AE ITGVSH* 
A*PFP 


6629 


20530 


A 


6683 


1290 


449 


PLSAPPPGDPSP* KSSPRAPNLPAP*PP 
NLSPWPPKCPRPRR*PASS/PP*LPVPP 
KSPSPPV\PWATEASRGHQPQIT*ELG 
SL.VLRMTEKGTWG+GIPDSIMPL.PRRRS 
EPRTCRE*GDTRWRWRRRRRSTGAVSTF 
S PG FACGGGSGA/ PAHPGEQKLGAS AR P 
DE / PVCPAAWVLLPS AR WAS PAAAPP P 
LPSQPVH*RFKP\SPPY*PP\ALPPLSP 
FEPSAPPHTGSVPY*QGVPP/SEGCPST 
SKPLISVRPPVTINGCLPQSQPPKTHTV 
SRLYEEN 


6630 


20531 


A 


6684 


425 


40 


EWASLKN*HPP/GANQAVEAVEAPTHCW 
*GCKS IEPLWKTVWQCLDKLNICLLYDS 
TPE*MSEHVYQAACKKSFTAVLFMIPPN 
*KQPSCPSTGWLGQAWWLTLVIPAPWEA 
KVGGSL.EPRSLRPAWATG 


6631 


20532 


A 


6685 


2 


107 


SWLTAASTSQ\VKRSSRLSLPSSGDHRC 
MPANFFFLKDDVSLLPRLISNSWTQAIL 
LSLPTKGLEL*AHCSLNLPGSSDPPASA 
YQWG TTGACQL I FFF 


6632 


20533 


A 


6686 


12 


224 


TPPGSTHASGAPPCLANIKFNY \ L * RLP 
RLFWNSWPQVILLPWPSKVLGL.QVWATV 
PSPFLIFDRDIDREI 


6633 


20534 


A 


6687 


404 




TAARG 1 1 YGGP I KKL.L PRKPADVK/ SG S 
FKTAAP FFFFLGTRTLTEFLF 1 1 1 QN* K 
QPKRPST/DRINKLWYISTIEYYSAMKR 
NKLLIHTTAWMNYKGIMVSKRGQSQKSI 
FSRTKTGFGRICHISGCFLYPLPLRL 


6634 


20535 


A 


6688 


410 


163 


PRGNTPPGFSPPEKENQKVFNPPPGNIV 
*KKGGPPPQGAQKKGPPY*GPPLSFLPP 
KKKKKKGPPPPPPPKKNPPKGNS/SFSP 
KKKKKFFPPPPKKKGF*GGPPLKTLFFW 
GGGKKFFFFFWGKKEFPLGGFFFGGGGG 
GGPFFFFFFG 


6635 


20536 


A 


6689 


205 


405 


KSPPSPGGPPLIGKLEIGGAPKPKNWPP 
PLGPKGENPPLALGGGGGAKNHPPNPQF 
ILLFLGGKNP* KSPPS PGGPP /ANRKIG 
NRRGP KTQ KIiAP P LGAKGGKP P P CP RGR 
GGGQKPPPQSPVYSPFFGGKKPL 


6636 


20537 


A 


6690 


3 


199 


HGSLQH'PPGFKSSSSLSLPSSWDHGHV 
QPRPAIF'/ CFFLILEKRGVSPGFPGGPE 
TPGLKGLDLL 


6637 


20538 


A 


6691 


8 


350 


SVCLGLP/RCWDYRYEPPHLA* LAFLCQ 
1 1 FHHMDW I YHVS I LWDGYLGCFY FLA 
I INNV\G*AGLELLTSGDPPTSASQSAG 
ITGVSHCTHPYCYTFLTREITTPKEVKI 
GLiCM 


6638 


20539 


A 


6692 


373 


197 


LPSSWDYRFMP*HMLFFK\FLFLLFCRD 
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E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 

Q=Glutamine, R=Arginine, S=Serine; 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 

nucleotide insertion 














E/SLTMLSRLVSNSWAQATLPPQSPKVL 


6639 


20540 


A 


6693 


369 


100 


PMLARMVS IS* PRDPPASASQS AGITGV 
SHRARP IYFFK*R* SHAGYS AVPVHRHD 
PTTDKHNSFLFCFLQTGSCFWQVGMQW 
HDHISL*PRTPGFKQYSHPSLPRH*DYK 
PTPP /HPS * FWQ VGMQWHDHI SL 


6640 


20541 


A 








FLSSWNYRCVPPHLAN*KKNFFFYTDR/ 
SLAVLPGLVSNS*TQAVLPLQPPKVLGL 
QA 


6641 


20542 


A 


6695 


402 


248 


SSWDYRCAPPCPANFYKIL*TGS/LLC* 
PRFVSNSWDQVI LP PWS PKVLRGV 






"A" 








VQE F * DR ILLS PRCHPGWRAVA * S RLTA 
ISASRLKARPSHLSLL,SSWDYR*VPPGP 
ANFLHFFCR/DRGFAMLPRLVSNSWTQV 
I CLLR PPSVGI TGVGHRARS I LLD 














PCLANFF/ VFFVKTGTHYAA* CGLRLLG 
SSALPV*TS*SAGIIGMSHCTC/LQITL 
LKTESHS\VAQAGVQWHDLGSLQPLPPG 
FM* FFCLS IQSS WDYRRTSPRPANFCI F 
SRDEVLPCWPGWS*TPDLR 














LWFLYQGNAGITGISHRAQPNSFVFFKD 
RVLLCCPGWSAVA\ HS SLTVALTSQAQV 
I FbPQP PG* LGNFCI FSRD/RGFAMMPR 
LISNSWAQVTRPP 


6645 












FEME S CS VARLE C S GT I LAHCN LHLL S S 
SSSPVSASRVAGTTGACCHTWLFFFFFF 
V/FLVETGFYHVAQMGLNNWAKAILPQP 
P*VLGLQARITTSSPNSSFTREHTSGSG 


6646 


20547 


A 


6700 


393 


88 


QKLYPYLTPYAKVN*QWIKNLSVTAKNI " 
* FLGENTGVNLHS FRF I KGFLDM / TPKS 
ISTKRK*KIKWSSSKXi\KNFYAIjKNNIK 
kmkrqptkyekihltlnnwv 


6647 


20548 


A 


6701 


494 


239 


KAGSHS\VSRLECSGMISAHCTFCLLS+ 
NNPLTS AS LAAGTTGVHHHAWL I FVFSV 

LL 


6648 


20549 


A 


6702 


2 


379 


RPTWVTWQNPASTKKKKNFFFFKTKSPT 
VPQAGGQGGNLG*RKFLPPGLKRFSGLK 
RPEAPKGQSCSLMGGGLCPLLPP /wsgt 
PPQR 


6649 


20550 


A 


6703 


220 


47 


TGFSGSQIN*EK/IWLGTVAHTCDPSTL " 

GGAGGQVT * gqefkts rheplhltvlcs 
LF 














I CYYF I VFAI YLiNQ FFSNCYRS FTFFFC 

p fetes rs aaq agg ynptltllhp p plg 
ikqfscltlpss\wars*kppcptwgns 

VFVG I YKNKLGGGGAI rrvqtcnqt YSG 


6651 


20552 


A 


6705 


69 




NVEKRNNIHNIKKTPVYFKGFLLYCCKA 
EVFYVLDNPFIFYFLFYFFETHCVAQAG 
VQWHNLRSLQPSHP\SSSDYRCAPPHLA 
NFCRDGVSPCWPSW/S *APGLKLIHPPW 
SPKVLE 


6652 


20553 


A 


6706 


373 


75 

- 


HAQIiTFKFYYFRNTFCKAVASTKSNSSD " 
RSE*SKLKTFWKGLTILDAIRNISDSL\ 
EEVNMSTFRGV*KKLTPI IMDDFEGFDF 
TRE SNHRG ARHGG S RL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6653 


20554 


A 


6707 


331 


379 


K+KINKN*KGLTILDAIKNICDSWEEVN 
IST/LTGVWKKL/ITLIDGSEGFKISVE 
EGNAGWEIARELEIEV/EPEYVTKIiLQ 
SHKKT 


6654 


20555 


A 


6708 


57S 


152 


LQQHNPGSLQPLSPNGFKRFSCLSLPSS 
WD FRHAP P C SAN F F VFLVE RG Y HH VG * A 
GLKLLASSDS PTS ASQSAG I TGVSHRAQ 
PLIGFKVNSVKKSVSVPTRRILKGNMSM 
YEQL 


6655 


20556 


A 


6709 


392 


146 


FFETESLSPGLECSGTVSAHCNLGSSDS 
PALPS/RLLGSSDSPAPAWLYSFKL*I 
Y/FHKTCNYVLKINILLHVAIPYSFSWS 
C 


6656 


20557 


A 


6710 


9 


393 


KSKPCN*DKSQSYRL*LFAQLiFTF/SNF 
FYFLETF*ESHCVIQAGIHSGTIMAHHT 
LKLQGSSNPPASAPPSSWDHRCGPPPLA 
TFLKI FL * RQWAHYVSQARDNGLIMFPR 
LVFELLGLKRFLLPQPS 


6657 


20558 


A 


6711 


34 


387 


QDFGFLSFFLLFFFEKGLGGPTCRAGPE 
SNLME\GGPPGLKQFSGLTLPRTGNNGR 
ASRSRAKFWN*KGKGGPPGPPVELETFT 
LRGPPHLSSPKGKNNGGGPKPDFFKKVG 
KKAPNP 


6658 


20559 


A 


6712 


1 


188 


IOAWWYTPWPATWEA*AGGLR\EPRRS 
RLQ * AV I WLHS CLGDR VRP CLNNKKGS 
FPPQPLW 


6659 


20560 


A 


6713 


377 


98 


SLGLKQSSHLNLPSSWDYRPVPPCPRNW 
FVPVPFFF\RE + GLAMLPRLVSNS +AQV 
ILPPQPSKVLGLQARTTAPGFFSHLLKT 
LVPNNVN LNI 


6660 


20561 


A 


6714 


353 


229 


SSCLGLPKSWDYRHEPPHLAYFYY**RQ 
ALV/MLPRI^Y* FLDVNM*LPLPFHVAP 
SHLLNSVGMV 


6661 


20562 


A 


6715 


371 


64 


ENHGDCTPAWATRAKLRFKKKT/CYQMF 
ITT1jFTIAKRK*LKCPSSDK* INKMWYI 
HMMEY/YAIKRNVJTLVKLNQNPYAL* KV 
LLRE * KQNP*SVRNI FKTHI 


6662 


20563 


A 


6716 


462 


232 


HFNLITIKIIPFF/C*DGGLTMLPRLVP 
NS WAHAI LiLPQPPK * LGLQARACKL 


6663 


20564 


A 


6717 


425 


48 


GFGGPPFKKARFGFLHPF*TPPVLRPRP 
PPVFFGGFKASPPKKKTPK/PPGGGGPP 
PYFHPFFLSPFFPIFFEKEPPPFFFWRG 
LGPPPPPFKNPFFFFFWVEGGKFLPQKK 
KKKFFQSISTRSVIHK 


6664 


20565 


A 


6718 


3 


413 


GTFLTKTSLCGTLLY*VISYMCV*FIS* 
\SYLVYI IYFKDICV/LLYMYIFYIYTC 
VHISVCMYMYVCMYVCMYKA* PANLKTF 
CTQKHLAWGWVQWLISVISALWEAEARE 
SPEVETNLAQLGPTLSLPRKKKKKKKK 


6665 


20566 


A 


6719 


408 


71 


SSWDYRC/LPPGPANFCIFSRDGVSPYW 
PGWSRTPD*RSLPGGDVMPALVIRHVPE 
LLKKQHSSVRSSRPAWPIW 


6666 


20567 


A 


6720 


375 


57 


SVIREMQIXTTMKYPFIPTGLIVIKKTE 
NN/RGWQRCREIKACIY*WWGCKMTQIjL 
QKT I W * FLKKLN I EL / P YDPAI PPL.GM * 
LRELKTCLPRLGWAHACNPSTLRG 


6667 


20568 


A 


6721 


391 


164 


RISETCT*MFPAALFIV/DQNNPDKWTN 
LMWS YTHTMDDYSAI KRNEVLLHSSTWM 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 


M 
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SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


1 Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phcnylalanine, 
G=Glycine, IJ=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














XENIjENLMLSKRGQSPKTPFG 


6668 


20569 


A 


6722 


381 


27S 


SSLQPQTPGLKQSSCLSLSNS\PPRFID 
*SISL,YRQGLTILLRLr.SNSWSQAIVl,P 
GPPKAliGL*ASKKFLYSNI 


6669 


20570 


A 


6723 


2 


363 


QRSLEHRPCKKC/CGLW* VS * KVQYLV 
YKESSSFKNLLRRIHCNFLFSGIVFHF* 
KLLRTPRSFCLCGL,HIPINFTIF*IKIE 
KFKKC* PGTMAHPYNPSTLGGLGGWIT* 
GQDGLDLLTS 


6670 


20571 


A 


6724 


352 


58 


HGGAIGARFFK VFNL I * FNX. VCKDE / S F 
SLLARAVINSWPRVILPPWPPK\VVGFQ 
AKT P LPRLKG FLESLPS LKKKKNIAYAR 
TITAQIGLLVCFIDQQ 


6671 


20572 


A 


6725 


383 


54 


KSSPFFFFEQGLLCCPGWSAWSPLQPK 
CPRLKQFSPLS/LPSN*EYSCTPTTFSL 
HVCVSIHKYI*YIDQYIFKFF/CR/DRT 
LAMLPRLVSNSWSQAILPP* PPKVLGFQ 
V 


6672 


20573 


A 


6726 


3 


343 


LEV*ARATRQE* KINGT* I IKYKKVKQY 
SQNDMILYIQNPKDSAKKXPELVNKFSS 
FKVNRQNS WFL *NSNKKS /EKEIKKKL 
/ PFTI AS * RI KYLG INLTKE VKNLiGRAQ 
WVTA 


6673 


20574 


A 


6727 


3 


615 


HTSQGNL* IQYHTPWHPPSSERVERMNQ 
TLKSHLTKLVLKTRLSWTKCIjP iallkv 
rtvpqkevglspcemlyrlpyshftvdi 
PTFETKSQFVKSYVLGLSSTFSSLKAK/ 
ELFSTDATLGSSPGTSAFSLGDDVLIRS 
WEEGKLKPAWEGPYLVLLTTKTAVQT/D 
NKKM Q THHTP VKKAS PS S KS CA I VP R P I 
PTKLKIKKKKA 


6674 


20575 


A 


6728 


2 


228 


SGAI / SVHCNLHFPG/SSSDSSASAS * V 
AGTTG VHTMA* L I FVFL VETG FHPISQA 
GLELL/NHMICPPWPPKVLGLQL 


6675 


20576 


A 


6729 


98 


359 


KVEYTVLLF*GFKGKFPTLPLYIjSFSSF 
LKNHL I SNFFLNR VLLCYPDWS WG * FT 
AHCNLNLSCLSIiPSTWNDSRCVPTGAWL 
RFI 


6676 


20577 


A 


6730 


10 


428 


RSTRTDYAALREVLPDPRRCRPRTGAWS 
NGRRDRPGAGHSGPKAEAETRETLEKGL 
LFCCPGWSAVA+ PRPTATFASRAQSLTV 
SPRLECGGWSAH*NF\SFHFQRWGLAV 
I1RTLVLNSWSLALLLLWPRKVLGL1YAI1A 


6677 


20578 


A 


6731 


425 


158 


SQTHDLR*STCLGPKCWDHRCAPPRPAL 
FLF/FNKCPMSIRKIENAQ*I.PGI*KKN 
FCRDDHLPLLSRLILNSWAQGIL*CRPL 
KVLGL 


6678 


20579 


A 


6732 


431 


22 


RDR VLLCHPE* S A W * S * LTTVQNSWAQ 
GILSSI *DYSCVP\LiPVNF*TFFKSVYY 
YFI I F I FE I YFCNS KALGL/LGVSHRAR 
PPRLVLNSWPPQVILPSQPPE*LGLEAL 
ATAPAI PRNSNLVWHQYSFKS I PDDL 


6679 


20580 


A 


6733 


44 


414 


AIYDLPFLLIDLYPMELRASIRAHTCT* 
RFVATLFTVTKRLNQPTCPSTDE *QNKM 
WSL/YIMNYYPAIKRNEILIHENIMLCK 
RSQSQKAHTE*FHICEISRISKSAETED 
ELWATRWGKGRE 


6680 


20581 


A 


6734 


375 


49 


QRLTYHCRFS YNTASNKMRLS * TQMHQK 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SACSMCPGRL*GVHAMRPKVLMRLSRSH 
/VARPIAATCAQ*VCDRIK/R/SLLIEE 
QKIWKVLKAQTQSQTAKFKNSTFFRNK 
N 


6681 


20582 


A 


6735 


381 


88 


SLFPRLETPGLRPSFFFFLFSLSHN PGW 
SVW* SRLTTALNS*ARFSLLSWCVPLH 
QAHFF\FFCRDR/SLTMLPRLVLKSWAQ 
AIPPPEPLKRSGLQV 


6682 


20583 


A 


6736 


384 


293 


AHLSLLSGWDCRCMPPCLDNCVCLC/RV 
FVCR * GQAMLPRLVSNS WTQVI LLPWHP 
KVLGL*AQDQEPLWGRSGPGtiPQAASG 


6683 


20584 


A 


6737 


24 


300 


NII>SAFLCLS*P*H*YGLVIL*NVPQFR 
FLPLFFPMQMYTHIFFKIRIRPGVVSHA 
YNPSTLVGQGGR\SHEFKTSLDNIARPR 
LYQVQNKIR 


6684 


205S5 


A 


6738 


2 


353 


IHCW*KCKITYTLSKTVWQFPIQLNIVL 
PYIPA/IPLLGIHTREMKTHIYTK/T/C 
MQMFGWliFI3AKIVFQLVNR*IKL*SI 
HKVDYHS E/I RNKQLYEAT * LNLKCT VL 
SE*S*TQKTT 


6685 


20586 


A 


6739 


~2 


336 


IHATYTNP*HRH\VHTPTHVHT/HVYTH 
NGTHTHTARAWTPAPPTSLILPSPTESP 
PALQI PNLPFGLSSSPHPAGSSQGEPEE 
GGCQRRAKNRVGHRAEKWETELCAGDMN 
L 


6686 


20587 


A 


6740 


3 


385 


KKQATLLWIIHLATTFSVEI *QSKIEVH 
D I FKVLKEKKK* TT FYP KI TNPGKI AF I 
HEGKIKL>Y/IVK*QLRDFINSGLVLKAM 
LKRVL * S KRKGH * QTI RNLRVQNS LVTA 
NTQPNTEYCNTVIWGN 


6687 


20588 


A 


6741 


406 


155 


TGSCSVTQA* /CWSAVIVHCSLDLPGSS 
NPPASASWIAGTTGMCYHAQLPSFNLCL 
QFYCMGCVFSRFYLRFTFGLQSYTRKQK 


6688 


20589 


A 


6742 


1 


215 


LP YS PDLSPTD/ Y/ HFS KHLDNFLKGKR 
FHNQQDAENAF *KFIESQSI DFYAVR I N 
KLISHWQKRVDCKGSYFD 


6689 


20590 


A 


6743 


376 


109 


YHMEVWNLYSLKGSSYLRL.PSSWDYWCP 
PP * QANF*K\ FFCRDKDLAMLPRWLNS 
*LQAILLLRPPKAEPLQLLGQFQFRRES 
GWN 


6690 


20591 


A 


6744 


382 


148 


NIPQFMFDILFQ*LITLSCPASF*SKFP 
IiSSLYFIQ*GR/WLGrVAHACNPSTLGG 
RGQR\ *A*DQELETSLAKKARPHL 


6691 


20592 


A 


6745 


403 


209 


HIVINRI IRKYCA*HYANKFNNLDKLDT 
FLE \ NTTCKNLTQEETENLNS PVSVKE I 
EQGKLCSDRN 


6692 


20593 


A 


6746 


366 


40 


QGSVLSILNVKIVFWFKTGCVHFLFPLN 
FLSLFQNYVCSHTS/YPLSLIPHESLWS 
FPPS FNH S S * Y * H FLKTN * MPDAMAHAC 
NPSTLGS*GRWIT*DEEFETSLANI 


6693 


20594 


A 


6747 


392 


57 


APKKKGGGVYPLYPKKSGRKKKPEG/PP 
PKPKKKLKTPGRKNDAKDI KPGGAKFFF 
S PKKKRGAPPGVPPRLiF * KPPQATI *GG 
PKKKKKP PQEGGKKGAPPKAPPL FFF FL 
YF 


6694 


20595 


A 


6748 


376 


140 


LEDNI /GYSNQNAGAGRVS * INTPKIKV 
S\QSKIAKWDLIKPKRLCAAKVSMNRVN 
RQPPEWDRIFALASAD\RGFLNKI LG 
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NO: of 
nucleotide 
sequence 
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NO: of 
peptide 
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SEQ ID 
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Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

location 
correspon 

last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Pheriylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 

nucleotide insertion 


6695 


20596 


A 


6749 


540 


172 


FFLRQSL/DSVAQAGVQWRHLGSLiQAPP 
HRFMPFSCLTLPSS*NCRCLPPCPANVL 
YF * QRWG FTVLARM VS IS* PRDPPAS AS 

PQNDLL/WPSKSFCGTLSHEKACKDNVF 
TTWAQWLiMPVIPAIiWEAEAGGSRGQEIE 
TILANTVKPHLC 


6696 


20597 


A 


6750 


1 


353 


WFLERESTRSENSVNIVEMTT*DLEYYV 
SKAVAGFEKIDSNTKRSFTVGKMLSTSL 
H/ CYREIFHERKSQSMCQTSLLSYFKKIj 
PQPPQPSASATLIVNIKAKISAKRIKLT 
ECSDDC 














SPS/ PPPPYFGGPPGFFPPPVF*TPPPE 
YFFGAPKKKKNFPPPGGKNFFFFKGPPP 


6698 


20599 


A 


6752 


77 


371 


1 TCPVLYLTSRNTKTKKY YLDCCCFW P * 
WNNRDQTYLHLFFF/CFIKKNFFFQPKR 
GARGGFCINETPPLGGKGNPPPPPPGRG 
EKRAPPKTLCFFFFF 














ETESHSVTHAGVQWHNIjGSIjQPPPPGFK 
RFSCIiSLPKK/WFSVKLN/PNT*SFCYH 


6700 


20601 


A 


6754 


1 


442 


C I CRTAFELTG RATRSEKEER KKERM YD 
*KKERKKEKEGILLKKDGQEEEEGRKEE 
RKEGRRQTQDE FGGQNRIjSTVKQGS*PK 
KRK*GIQ\QTGAPTQERDKETSQNKNEE 
KARHDS * QPQA* RTQSRLQQEGRKLWKR 
GliVSRRK 


6701 


20602 


A 


6755 


366 


262 


FF*RWGLTMPRKLVLNSWAQVILLPPPL 
E*LG*QA 














GLjKQSSCFCFIjSNEDCKHVPP\HRIjFF1j 
FVCLFCFVLFFW *RYGGLAVLPRLVLKS 
WP*VIPQPWPPQMLRLQA 


6703 


20604 


A 


6757 


418 


244 


FSLLSSWDHRYMPPRMATF/SFVLFCLR 
LQV 


6704 


20605 


A 


6758 


46 


256 


KLKNKTEARHSSSACL.*/LPGSWDHRYL 
P PRIANF * TF WRDRVSLLP KG VSNLWAP 
TILLPWPPQVLGLQV 














GS PAS ASQHAGITGRNHHTRLQD I Y * LE 
/TGQTVTLQFS VLADTTLTKRLSWS ILC 
CYNKIPETG 














FGEPPGFSPPPVLiKTRPRNLFIjGPHKKN. 
INFPPP/WEIJSO J VPLKGPPFFFFFI J R*G 
LAMLPRLVS TS WAQ V I LLPQH P KLLE L Q 
VHAMVLSSRTSHSPRSRSLS PRFSQRLL 
PVSCNPAQPSPMA 


6707 


20608 


A 


6761 


2 


259 


LTIA*KNFIFLGIj*QGICQIPKDYQTSL 
R* IKDNLNK/WDTPCSWIGGLNI VEMSV 
LSKLMYRFRAIQMKTTSGFPFFFFWKKG 
IFF 


6708 


20609 


A 


6762 




169 


FFETEFRSFA*AGV* WHDLGSLQPQPSG 
\ S SDS PASAALFFFFFFLKKKI FFFPPG 


6709 


20610 


A 


6763 


1 


496 


MATCTPKHINTQICVSFLLSLKLSEFEQ 
MS IFKKGENELALGTGNIRPYDSCPGNR 
P/APFATSPAPRHPSGPSWRPPTSGGVP 
ARAECPAS PNPLPPGFCGQS PSRGRKEE 
* LRGESTS TLAI F * * APNRYHS F * LPRQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of , 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Al.inine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, ''=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














P * PHGDCPARKKKKKPND PRLQPTVT - 


6710 


20611 


A 


6764 


1 


229 


VLLLCFFFFVL*DRVLVGHPGWSSRDYL 
KTKTQKKQNNIFVCRVGG\LTTLPRLVS 
NS WPQM VLS PQ P LNALGLQ V 


6711 


20612 


A 


6765 


580 


273 


WRQS LTHSVAQ PGVQWHDLGSLQP P P PG 
FKRFSCLSLPSSWDYRRAPPRPTNF\*Y 
LVQTEFHHVGQAGLELPTSSDUPTSVSQ 
S AG IT S VSHHAQLDM I FLR 


6712 


20613 


A 


6766 


220 


239 


AQE FKAAAS TTALQTETLCDKKKKKKKG 
GVSLGGVSSNFRTGGGASVTSPSIF/CF 
FFFF*DRVSLSGVQWCSLQP*TPGLKRS 
SSLSLPSSWEYRHVPLKPA 


6713 


20614 


A 


6767 


89 


387 


FLSPSGSQKEHLQIQLRSTFSFFFFFKR 
EPRFGR VGGQC PNLG * WKP P PPG* RQFS 
GLTLT/SNWERWATPPPPTNFWIFKKKG 
NSPWWKRGAKI FGPWDP 


6714 


20615 


A 


6768 


397 


223 


IREVGWINWQHKDPCD/KWNCSVY*L* 
*WIHKPTHDKT\HRNTHTHTHTHTGMHS 
HTQK 


6715 


20616 


A 


6769 




367 


S WKVCS KRP I PSPSLFF* KHPRVLLRAD 
GT**VSLATL*HMQLS*AGSVKEAEPRS 
NAPPHKATFLFLFFCTD/SSLTMLPRLV 
LNTWPQAILPPCPPKVLGLPARANSSSL 
SVFKLTSFCC 


6716 


20617 


A 


6770 


490 


260 


FFETESCSW*AGVQW\LELLTSSDPPA 
SASQSAWITGMSHCARRCGTHYHLCERF 
LVS FVTALLSQLLELGLAHSRC 


6717 


20618 


A 


6771 


418 


130 


PSPRDL,NL.FPI J NGPPL,FFFFF*DRVI J I,C 
CLGWSAVASNN FDLK* STRLGL.PKCLDY 
RHEPLCPAMWDTLSSL*EIFGLFCHRPS 
LPAPRTG PGSQ 


6718 


20619 


A 


6772 


311 


314 


FAVGELFLLLHHVGHACLPFCHDCKFPE 
ASPAMLPY/QPVEL*AT*TGIEVLTPGC 
RFQAVFGLKVGFHQGPTPVYLGI 


6719 


20620 


A 


6773 


465 


283 


CPSLPSSWDYRHPPPRPANF/*YF*YHV 
GQPGLEIiPTSSAPPASASQSAGITGMSH 
RAWPK 


6720 


20621 


A 


6774 


410 


91 


RDHSKLQPQTFRLKQSSCLSLPSM*DYR 
HMLPCLAGWLAGWLACQTDRQTDRQTDR 
QTDRQTDR*TDR*IDSRDR/SIiAILPRM 
FSDSWPQVNLLPQ\WPPKVRRLWA 


6721 


20622 


A 


6775 


409 


253 


LI FAFF \ CRDGGLAIL.LRLVSNS * PQAI 
LPPWPPKVLGLQARAPGVRPDLS 


6722 


20623 


A 


6776 


3 


278 


FFFFSLHIiWAPLAFFFFGAISAHYNLPI 
PG\SSDSPASASQVAGTIGACHHVQLF\ 
VFLVEMGFHHVGQAGLELLT\*VIHLPQ 
PPKALELQI 


6723 


20624 


A 


6777 


79 


442 


RDEDRGLERRC VS ACS TPQLGNAS TT * G 
CPGVSDPSLHILF/CFFKEN/SCFAPOA 
EGGGHNLG * LE FG P PGLKQFS CLT PQR I 
WNNGGAPPPPLYFFFLKKRGGSPLWPGW 
A*TS*PRS 


6724 


20625 


A 


6778 


77 


417 


HRGLHLS FFLF/ S FLFFFLKRAALLWPR 
EQGRGWIWVNCTLSLRGQGISKALPS*E 
AGTKGGRPHPTLFLVF * KKTS FSQVDQG 
GIDLWARGNPPPGNLKWFELQKETLCPG 
HIF 


6725 


20626 


A 


6779 


62 


235 


IQELRIREGKCLAYEHTARNGKSQDLiHA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence {A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyl»lanine, 
G = Glycine, H = Histidine, I = lsoleucine, 
K=Lysine, L=Leucine, M=Methiontne, 
N = Asparagine t P— Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 

nucleotide insertion 














GLSCLCCCCCCCC\CNCWP*STRKFFTK 
PE 


6726 


20627 


A 


6780 


908 


566 


SKMEFRSCCPGWSDLGSPQPIAPLGFKQ 
L\SCLSLP\RAWYYRHAPTHPANF\*FL 
VETGFFHVSQAGLELPTSGDLPTSASQS 
AG I I GVRHKAHPECVFQKQLSEANLRRI 
WNQED 


6727 


20628 


A 


6781 


11 


483 


TTALLKAVRH/G*HLSLQRLL,LSF\VCL 
CPAPRGGAYRGRQAS * SCGGL.HPVRASR 
LLCLPNQAWAMAGAPPPTSLPPCSLISD 
CCASNQRDS VGVGPSE PHAGYNLPVCRF 
LSPLEKRSIRVGVTRFSRCRIiSPLSLTR 
KGNSIjTPCASRVSSASPCS 


6728 


20629 


A 


6782 


225 


589 


SRRSEPGNFRSCLGVGVTCCMCLSLSLP 
S RS FLRPL FLS LS VCVFVCLGTRVPCAP 
EGGFLARRPFFWSASPRVSAWVLWPVGN 
RFPGGSGLGVCEGLGNVGI\WSEPQGFS 
FPPHPEQPLC*AGSRRALPNQGQR\LPG 
AHRLiPAGGCPQSFKKVWTPVALiCPHRE 
M*PQLDAGTEKEAGKGMGQACLSL»KGWP 
SWPSHPFDTPPRMPVWAWPPRILPGLW 
PLL* PPAGRNARRKKPETDRRGRSAGSQ 
PQ ACADGVRVS LP KSG ALVMS RTGLPVC 

PAVCVSLSLSLLALSFALSFSLCPSVCS 
CAIiGHVCPVRRRVGFLHVGLSSGQPLPA 
S L PGCCGRLA I VFPAVPAWGS VKAKATW 
ASASEPQGFSFPPHPEQPLC 


6729 • 


20630 


•A 


6783 


458 


170 


PCIVTASASQSAGITGVSHRTRLLVNFS 
KVGSVTISLG*EVGIiRHWP /NI/GRLDK 
TGPGE KQHFI LHAPQSAP S VYHCLGNTQ 
ELPPLSTAMISQPKR 


6730 


20631 


A 


6784 


"I 


192 


FFFFG* ILPLSPRAGVQWHDLGSLQPPH 
PGFKQFSCLSLP\RG*DYRACATRDGIiI 
FCNFRLGD 


6731 


20632 


A 


6785 


3 


246 


S LC VCQYYCRCAC V* VPKQVSG * VTMG I 
/CATGFVLI YLCG*QCLCG /C /C*EDPF 
YCEQLCVHVCSCVLGRGWSVCVCISRLL 


6732 


20633 


A 


6786 


408 


128 


GPVPYKARG*SPFWPPGEKPSFPKNPKT 
PPGGGGAVXjFPPPWRGRPENSPPPGGKR 
C/QPN+ IFFLPPHPGHKTQGPPPKKKKK 
KISAEFLSW 


6733 


20634 


A 


6787 


I 


418 


LTILGVMKNICDSWKEVKISAFTGAWKK 
LIPTLTD /EY*GFKASVEEGTAHVLETA 
RELEVEAEAVTEL/LQSHDQT 


6734 


20635 


A 


6788 


3 


237 


FFFLT*VLTIV/PG*SAEVQS*LTAAST 
S*\VK*SSHLSLPSSWDYRRMPPHPANF 
FFFFGKKSLILHPGRGGPSLPTP 


6735 


20636 


A 


6789 


282 


3 


PHPDNF \ RFL V * TGCLHVGQNGLALLS P 
GYPPTIiASPSAGITGVSHCTRPKRYF*M 
VLSNKSLNRFGLCTVIjPTCLQAPRRTGP 
PPATVLGRGT 


6736 


20637 


A 


6790 


62 


328 


FFRDR I L FCHPRWS AV IQ \ S WLTEAS KS 
W\IK*SSRLGLPKCWDCRHEPLiCIjDNTT 
Y*CRQNTLPITLIGCFRVSLPYAKYTGK 
FWGAEG 


6737 


20638 


A 


6791 


67 




TNKHTYFLELNKREKPQLTVFFFF* REL 
RGL I LANGN FRLRG * SNS PA * PSKVAGI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


rvi 

eth 
od 


SEQ ID 
NO: in 
tlSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutnmic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, I=lso leucine, 
K=Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TGPPHQAGLILVFLEKTGFNYVG/RRGP 
PPPPPKELALQA 


6738 


20639 


A 


6792 


744 


382 


MTFKKTAHIYFYLYLFIYLRQSL/DSVA 
QAGVQWHNLSSLQAPPPGFMPFSCLSLL 
SSWDYRRPQPRPA/NFFYF* *RRGFTVL 
ARMVSISRPCDPPASASCSAGITGMSPR 
AWPKLHIFKL 


6739 


20640 


A 


6793 


372 


129 


R VLP CHS G WTA WQ S * L> I AV S TS Q A \ KR 
SPHLSIRHVPPHPA/NFKQFFCRDGLIM 
LPRSVSNFWAQAILPPQPPKVLGLQA 


6740 


20641 


A 


6794 


395 


25 


QPGLFCFTPRKKPDSFSKVFAVLSF/FP 
PSL*FS/PLSI.*VQEMVSICCPGWSAV\ 
TPGLK*SSCLSLPKGRGYRCAPPHSANL 
KFFL*R*VLFMLPRLGLH\SWVQWLCQ 
WPPKGPSLRACTSKI 


6741 


20642 


A 


6795 


412 


176 


KKS* / CLiATPRSENWVLi*NGPPLFFFFL 
* RRDLAMLPRLVSNS WAQVMLPPWPPK\ 
VLGLQAGATMDHS C VQYSHS FAPS 


6742 


20643 


A 


6796 


444 


95 


NFVRNCQS FLQSGCTFLHS YQQ FLL/ PC 
PCQH* I MS VFQ I LTTVIGVYMHLV* I CS 
ALMTFDLFMCKVLICHLHIFFGEVSVQH 
FSLLLFLFESGSIYWQAGLHLLFSSNP 
PASAS 


6743 


20644 


A 


6797 


378 


732 


FFFWLNLGSSQPPP\PAFTR\FSHLGLL 
NSRDYRHLPPRLANF \ * FLVETGFRHVG 
QAGLKLLTSGDL PALASQTTG I TGVTAS 
PSLDEGVFKALPPPVGTTLCLPPWTTGG 
PFLGPAL 


6744 


20645 


A 


6798 


444 


1 


PLCFPPGGGGFLSSHQGFPPPPPKRKGS 
VGAKKKNIPPGPFPFFFFKKNFFF*GPY 
PF/SEPKKARWGPPPPGPGFSPPRSLVS 
PU3EKKKKECKIPPPETQPPSFGA*WGKK 
RAPPPGPRAGGRNS FL.EGGRGQGRTGGS 
VREFRAGR 


6745 


20646 


A 


6799 


462 


83 


IEPLRPAL,CLS*SNPASI*KPHFLL,FF/ 
CLPFLSFFFS/SFPFSFPFLFPFPFPSF 
FLIPFLSYSLCCSGWSTVA* *GL.TIAS\ 
TYGLKQSSHL/SLPASWFPCLTN* KKIF 
FRDGGLTMLIRLAYLFLNF 


6746 


20647 


A 


6800 


433 


185 


PPPGGRLL.VEKTRGEGQLKPGQPGRQAG 
LGMRGNPGKAQPRPASGAPEMQLPARPP 
S GGHLYGCGAKNNEGGANPLLCGGTGSG 
SVAAAGEVSKSAPDSGLMGNSMLVKKE/ 
AGGGGEPQANQETEN*EG*PMSPESGAL 
L>IjTS PAAAT lpl P V p P qrrg fap p s LFL 
APQP 


6747 


20648 


A 


6801 


395 


60 


IEFLLFLPLSVDGRLWSHQLLPVMNEAT 
/MSIP*HAFAWTC/SSFLRGGCLGA*ST 
LLYGGYMFS YLRNKQTI FQS S CC I VNGE 
QPRMRNRLFS*FIQHLVFSVFLIRAILE 
GI 


6748 


20649 


A 


6802 


423 


98 


RGVPY*PGGLQPPPV/REFFFFFIGRVS 
VLQAGAAAPTCRAQS I LPGSWDYRHLHA 
NMSS * FLYFV* RQVFSMLPRLVSNSWDQ 
AI HP S QPHPALFFTCWS VKP IiATL 


6749 


20650 


A 


6803 


396 


194 


S LMKIAHV* VLHTDGGFMHPD S AI S CHD 
IFDSLHLTGEGSAKICRALYELIMQLME 
EHP\GEKQTTIA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, K=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6750 


20651 


A 


6804 


373 


192 


palnpwaqanlphqtpalssnffyf/cv 
emgltvlprlv+nswpeailillpwkpk 

VLGLQA 


6751 


20652 


A 


6805 


370 


242 


KKGTSKFLiFFPSYSFFSKFWYQKKVKIA 
FFFF* DRVSLCHPEWSAVW* S * LSAALT 
/CPGLQGSWGYRHVLPCLANFS\FFC*D 
GVSMLSRLENWPFCLF*PVFQPGQHGNS 
ISTKK 


6752 


20653 


A 


6806 


464 


182 


DGPHRFEPPISS*FL,PPWPPQKIiEIKG* 
APLAKQIFFFFFFL*RDIVLLCC*GWGT 
VYSVYSWVSSRLTAAS\TAVLKQSSHLG 
LL.SSWDYKHHH 


6753 


20654 


A 


6807 


3 


228 


YTCAGFFRRQSLCHLGWGAVARSQLTVA 
LTL/VPPGLK* SSNLGLPKCWDYRCEPP 
HLACVNFNTVKDVCFSCILS 


6754 


20655 


A 


6808 


506 


173 


NLHLPGLKQFSCLSPPPRSRDYRRAPPC 
TANF\ * FLVEMGSRHVAQAGXiELLNSGD 
SPASASQSAGTTGVSHHGQISLFKQILT 
STR I S E V I FLTLLiLR I LHD L P C LKL F S L 


6755 


20656 


A 


6809 


676 


397 


VSSCPRDQPGHHGETLSVLKIQKLARHG 
GGCL*SQLLRSLR*QTRLNSGVGDHSEP 
RLSHCAP /AWATEQVSVSRKCKKRQLHQ 
KERIPEWVRS 


6756 


20657 


A 


6810 


367 


31 


QVFQELYSQDLFIYLETLSLCHPG*SAV 
ARS*WTAASDSW\VKQSSRLSLPSSWD 
YR*VPPYSAHFDVNSSFLTYCTSPGLTQ 
KIEDSWLGTVAHACNLSTLGGHSGQIT 


6757 


20658 


A 








LLSS WK/YQVPPPHLVFIKIL/CRIGRE 
TASCCI * PRLWNS WPQVILRPQPPKML 
GLLSATMPGPISVFCFCFCF 














KRKKENIGE/HI*DIGVGSDILDLIPKA 
WTTKLNIDKWYHIKL/QKLLSS *RVAKT 
NKLKS FCTAEE I I TRVKRQPME WEKTFA 
S QTSDKGL I SNT Y KELKQLNS TETNKQK 
NNFDLLSADKG 


6759 


20660 


A 


6813 


492 


243 


SS WDTGCTPSRPD \ I FVFLVETGFPMLA 
RMVLI S * PRDPPASASQSTIiVCTSR YKV 
YQNIHQQVNMF*NYGWCFFDIFTASVGD 


6760 


20661 


A 


6814 


32 


308 


EYTLPHPYLITGPPPVPFCPETYSHS*YF 
LSPNTQCIQ\PAPRTLNTASSTCLSRRV 
TARPVPIYQDPSLEPLQLPPNSIjTLPTP 
LRHSPCWV 


6761 


20662 


A 


6815 


97 


274 


GRFLELKVKRGSLGLGMWHAYNLSTLG 
G * GGRI P \ QGFKAS LGNI AKS CLYKNKK 
GAA 


6762 


20663 


A 


6816 


74 


376 


DFFI VYSVFTHLVN I KDVKDVLYKSLKY 
FFAIYRMGENIYKLYILGGINIQNISME 
YYTA/ I KKD* ILLFAATWM/RLEDI ILS 
EISHEQKS\KRCMFSLICGS 


6763 


20664 


A 


6817 


1 


1127 


MRVTAPRTVLLLLSGALALTETWAGSHS 
MRYFYTAMSRPGRGEPRFIAVGYVDDTQ 
FVRFDSDAASPRMAPRAPWIEQEGPEYW 
DRETRNMK\ASAQTYRENLRIALRYYNQ 
SEAGSHTLQRMYGCDVGPDGRLLRGYHQ 
DAYDGKDYIALNEDLSSWTAADTAAQIT 
QRKWEAAREAEQLRAYIiEGLrCVEWLRRY 
LENGKETLQRA\DPPKTHVTHHP I SDHE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 

'26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Htstidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
\V=Tryprophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ATLRCWALGFYPAE I TLT \ WQRDGEDQT 
QDTELVETRPAGDRTFQKWAAWVPSGE 
EQR YTCHVQHEGLP KPLT PER WE AI L PS 
PTSPIVGHCAWAGAVLASWWSSGAWAT 
VDV*EGRSSGGKGG\SYSQGWRPSRPVP 
QGFWMCL5TALKKP 


6764 


20665 


A 


6S18 


34 


382 


STAGAIHVCMCMCVHVSTCVCAKMCVRV 
KF* I WD I ETG AAQLS CC VL VAQTE 1 1 YL 
QERNS S L SERE CVCAK\CVCMCMCVHVS 
TCVCVCVCV*V* KRHNWPGMVAHVCNS S 
TLGG 


6765 


20666 


A 


6819 


J06I 


305 


FFLRWSL/DSVAQAGVQWRDLGSLQAPP 
RGFTPFSCLSLPSSWDYRRPLPRPANFF 
YF* *RRGFTMLARMVS IS * PRDLPASAS 
QS AG I TG VSHRARRRNTI LY * HLNFV I K 
SHKP PDI VFRDL *G PRLHLENP VL *GAK 
SHSSCPWLNLSPPPPHAGSHCLSSSSPR 
MSARQDLLFNASSHTGLFPFLQLNQTAS 
RPTSQVPKSLCFPLPWVQPFALPWVRQD 
EGRASVNSPQDPPRVPSCAGFTHSCNLS 


6766 


20667 


A 


6820 


610 


248 


ERRSHPVA*AWKWCHLGSLQPLPP\GS 
SDPP\T*AS* VAGTTG VRHHARL IFVFF 
IETVSHHVAQPTGF* TPGAQAIRLPQPP 
ENARITDVKPLCLAHFCYSKHFIMVPAP 
CISHQKVFH 


6767 


20668 


A 


6821 


3636 


3332 


TCLGSHATPHCGHSFSSCKILLF*FFVF ' 
EM * SH P VAQ AGVQCRDLRS LP P P PL* FK 
RFS CLS LPS SWD S RRP PPRLANF \CIFS 
RDGILS CWPGWSRTPDLR 


6768 


20669 


A 


6822 


563 


146 


APGLSFNCYIPPAPAFMPGELTLFEVD/ 
NRSIFPTEY/TIHILISSEDILHS*TIP 
LLGLKTDAI PRCLNQTTLTTGRPGLYYG 
QCSEICES/N/HSFIPIVLELrLLKYFK 
T*STSTLKYHCKAIQHLPFKLKIDGVYT' 
SLQ 


6769 


20670 


A 


6823 


399 


201 


KAEPPPAKSWSKGGYVLPRS / IRLDHVK 
WALEPDD I AVLNFMKERHPQS KS * TI LQ 
KWIQPFSPIVQK 


6770 


20671 


A 


6824 


323 


323 


LPTVI VI ILSFYNFFFFSMRLIRTLSFI 
CMYVCICIYVCIYVCIYVCVYI IYVCIY 
/IYINISF*NDRISFFFMLE* YVCVYIH 
EWIKKI 


6771 


20672 


A 


6825 


396 


122 


LP PGGHRPAVPGGREMGPSRGNPGNPGR 
F*KKGPPSPGGGPAPGGPPSPEG*ARKM 
G*PRGPSLP*TQIPPPGSPP/WGPPGVT 
PETLVG FKKRGHP PRGGGP PPG VPLPRR 
AKPEKWVNPGGQAFHE PKS PPRGPPRGA 
KPETLS PKKKKKV 


6772 


20673 


A 


6826 


414 


64 


RGVEGGAAC I FKGLS VAGKPG / YLS AMR 
SCFRRMLLFKS*G/QVIATKHMLPrVLR 
MVGD P VANVRFNVAKS LQKIGPI LDNST 
WQSE VKP I LEKLTQAQDVD VKC FAQEAL 
TVLSLA 


6773 


20674 


A 


6827 


2 


519 


KTKTCLVEEIS* ITGRFLERINKIDKPL 
VNL* RKK\ VQVLNI RNEKVD VTTDRT * K 
\ I IKEHYGQLYANKFN/ IDEMDKFLKRD 
KLAKLTQEEIENLNIPVSI*KFLI*NIP 
KMKTP/ GPDGISREL 



967 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
. amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
l.tst amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alaninc 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
vj— vxiyi-ine, n— niMiuine, l— isoieucme, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
VV=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PEPPGLSFPHFRGGFPSSGGCQRGEGRK 
ERRPSWTGPSWMAQEGRTPAARLEGAVL 

GRQVQDVPAFP EGS ARQAR * HEE * LHW K. 
EILL 


6775 


20676 


A 


6829 


479 


261 


PCFSFLSNEDCRHAPP/RPV+F/LVERG 
FRHVARAGL>EFLT*SGLPASASQGAGIA 
GVSHCAWPLQVGIEASLQY 


6776 


20677 


A 


6830 


492 


313 


HPIGWPVNWGRASAVGE FRDWKLFYPEC 
E I RTMGGREHGQS PGAQRTFFQLLLS FX 
VESK 














SNGICLQASSAS PNKIVPSTSLKVCSN\ 
VRVDGSRNPSVAFPPLTPNLFAVAPSIS 
SGMG IETIPI QG YR VDEKTKKCY I P F \ V 
♦ANRHSPSGVYWINVHALVSGPPIiSDDS 
GVNKPQMKHQHCSA 


677S 


20679 


A 


6832 


374 


76 


QLVE KWTGNVKS MTDRVLTMRCELRARL 
*ALRTPGTWNLINDQIGMF\KFSGLSPK 
QVECLVNEKH I YLMPSGRINVSGLTAKT 
LGYVAASIQEAVTI IQ 


6779 


20680 


A 


6833 


369 


56 


LKYLGTSVENRYADQAKWLSPVISALW* 
FE / CGW I ARSEDRDHPG * QSE I PS / PAN 
FLFLVEMGFYHVDQAGLDLMTSGNPPAC 
NTGMSHCTRPGRTDVLAILCSDP 














DCLALSSRLDLQWCDLGSLQAPPPGFKQ 
FLCLSLPSSWDYRCLPPRPANF\*FI.VE 
TG FHHVGQAGL KLLTS S DLP AS ASQSAG 
I TGLSHRAWPDSS FEGSSSS 














TAPAIRGVHIjRGRGWTVYGAV*AVPPAS 
IPAYPGWYQYGFYGADLYG \DMQHSDM 
RGLMVSQHS PMPHT\LQQS \ S VTVMAGC 
TQPSHTMPLPLPLVY\ELALWRVYAEVA 
TSYLPPAEVMLAPCK 


6782 


20683 


A 


6836 


1 


428 


GKTLNPPSLGKSFGLIGLPPENDHPPDV 
SVPRAAFPLRGPRSVQSWREMEQSVAQ 
LL.L * MLTFTPHKR I SAFRAL\QHS YLHK 
DEGDPE 


6783 


20684 


A 


6837 


3 


612 


FKGSGVFNRKKKLEEHPLWQDVALGTAL 
CPQLPSLPGWGVIEWPRRAACLPCNPQL. 
CPHCCGPQEAQYPL * AGGQ * GQHQEKPG 
A\GPHPPSGPQVPSAPGEAMTAMEGRKL 
SPAVAQNPPQRKNTLPRGPLPRGAPTTT 
P*SLRPR*TPVGAGLAAPIRSHHPGCSG 
QEALALCPDRPHKDRGVRRGAGGPRGRR 
VQLGGIPP 


6784 


20685 


A 


6838 


3 


412 


YRWGFTMLTRIjVLSS*PR/ CDPPASASQ 
S VGI TG VNHRVRL FFFFFGTG PHS \MAQ 
AEGQGHNLG*LQPQPPGAKGTLPRTGNF 
KGG PHTP PNF /CI FFGKNG VS P YCPG WD 
FGAKAI LP PG P PKELELEGS P PAPGKVF 


6785 


20686 


A 


6839 


419 


26 


EETSRLWSIGQCPLLIQTAQVLSFTSGH 
GRHREHRKPTDS PLPSACPAP / RLTPPP 
SMCPASHSLPSSQCMPGPTLTPPTACAR 
PDTHSPPPSVCPAPLP/CRPVCPPPRSL 
PTPA*TKPRTQLPSSMYCGRSS 


6786 


20687 


A 


6840 


365 j 


66 


LLERQI GRS VYGPS S KSS I FDVP S L.PHR 
GIKLVFCLCHTSSWDTGVHYAPPA*SWH 



968 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

nucle- 

Otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=lVlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TWHPRPRQVGQ/AGFELLTS CHPSPLT 
SQTARIAGVSHRARPKFM 


6787 


20688 


A 


6841 


445 


188 


GFQ*FSCFGLLGGWDTRHVPPRLADFCI 
F/M* R WGFAMSARLS SNS * SAPSASQS A 
G I TGM S HCARPATLiNS P I CG T S L PLGS F 
FF 


6788 


20689 


A 


6842 


3 


412 


PQQSPQLTTVTAPRPPALLMGTTMAKLQ 
TRVSTSHLQGPRLTSAL.TYQMAQVAVAT 
P P WS VIA VWL S AR VTVL P / LN / VNG I 
SWIRQL.QKAAGRQPPSPHQTRNNSLE* 
VASNLSKFLILEVAELFCHVLTFFSYQ 


6789 


20690 


A 


6843 


113 


410 


NHKCNVNFYNGPRSLEKRLITLPILHYF 
LNF ILCKSS YLTLLNL KLQTNL F CK/ YE 
M/ESHFVTQAGVQWHDLGSLHPKQSSYL 
SLLSS * DYRGMPPHLAN 


6790 


20691 


A 


6844 


2 


357 


FCPIARILQSGRFAVLPWNSGIjVRGRRQ 
EGPHGAFTREHT/SLRAPQPRSENERR/ 
R I ACALRAAHQASGWRLTPRAPRS PSGC 
HCPPAE PTSR* EAWGARRKGHGCCPAPH 
QPAPQDIS 


6791 


20692 


A 


6845 


392 


279 


HFFFPCRD/KSIAMLTRLILNSWA*VIL 
LPQPSKVLGLQV 


6792 


20693 


A 


6S46 


3 


275 


VNLVDKAAAGFERMDSNF \ ETS S TVGKM 
LSSSIARYREIFYERKS PSMQQTLFLSH 
FRKLP*PF*PLATTTLIY*QPSTSRQDP 
LPAKRAR 


6793 


20694 


A 


6847 


1 


454 


FFFFETQFHSLPRIiECSGAISAHCNFCL 
LGSSSSPASASQWGITGKKHHAWIIF/ 
VLFLVETGFHHVGHAGFELLTTCGPPTL 
ASIMCILVYIiLVFQHIiVSLKNLKGQNAS 
FYFSPFGQHLSGFVKCSIWCCGSLSGLE 
GHFLPPHSPPLH 


6794 


20695 


A 


6848 


13 


148 


GSAWRHTPAG\ELLKKLRHRYGLNPEGG 
GCTEP IWCYRTPPWTTD 


6795 


20696 


A 


6850 


204 


30 


TIFSRQVXiRIQNVLSEKLiRVSTVYANNG 
S \ VLQGTLWASVYHGKILIGTVFHKTLY 
CEL 


6796 


20697 


A 


6851 


2 


429 


EVWASGITGKAIMAMPIVIQPKNPSSYP 
CRRQFPLQLEAKEGFQLLTEKFQKHGLL 
IPYN/TPI LPVKKSNE KYRL VQDLRI IN 
EAWPIHPMIPNSYVILPQPPDAQWFPV 
LDLKDSFFCISP/VDPSSQFLFAFE 


6797 


20698 


A 


6852 


3 


160 


SGWDHRRHTQLIFMYSFLFFCRDR/SLP 
MLPKLVLNS WAQVI LLLQPPKVLGL 


6798 


20699 


A 


6853 


2 


243 


ACMILAAATI ILRW \ PLRYI ILIWGIN 
KFTKKLRNPYSIDNNELIiDFLSRVPSDV 
QKVQYAELKLCSSHSPLRKKRSAL 


6799 


20700 


A 


6854 


24 


458 


SRAARRAAGTTRRSSHTGCRPRPADPGA 
RCLPRGSPHPRMDPPESP/PSETSPGPP 
PMGPPPPSSKAPRSPPVGSGPASGVEPT 
SFPVESEAVMEDVLRPLEQALEDCRGHT 
RKQVCDDISRRLALLQEQWAGGKLS IPC 
KEENGS 


6800 


20701 


A 


6855 


3 


183 


CCD PPVSASQRAG I TSVTY/ Q/WPSLGS 
LQPPSPEFKRFSCLSLLSSWDHRHPPLR 
PACKVFC 


6801 


20702 


A 


6856 


2 


242 


GNHPYARKIA\AQRRASTVSSVTQVEVD 
ENAYRCGSGMQMAKDSKSLKTHQTQPGI 



969 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

Location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, Lr=Leucine, IVl^IVIethionine 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














RRARDKAL\LVANGKRLWCIEGSVC 


6802 


20703 


A 


6857 


54 


401 


EGERLQKFMHVLKTHTHTH / AHTHERAH 
THTHTRTHTPRSGMHRSNNRNLAKNLFK 
IFFFIiSSSPPSLFFLIIFPFLSLSLSSL 
SLSLYKHTPLCILSLRTEHLHRMGALLG 
GHILE 


6803 


20704 


A 


6858 


1374 


579 


CENPSRSTRPGQGPLLLPHQLPFH/RPA 
PSQSSPPEQPQSMEMRSVLRKAGSPRKA 
RRARLNPLVLLLDAALTGELEWQQAVK 
EMNDPSQPNEEG I TALHNAI CGANYS I V 
DFLITAGANVNSPDSHGWTPLiHCAAS CN 
DTVTCMALVQHGAA I FATTLS DGATAFE 
KCDPYREGYADCATYLADVEQSMGLMNS 
GAVYALWDYSAEFGDELSFREGESVTVL 
RRDGPEETDWVJ WAALHGQEG Y VPRNY FG 
L.FPRVKPQRSKV 


6804 


20705 


A 


6859 


36 


269 


IAIiGSMDVLTILMFNEHSKNEIKRTIPL 
KIATR/ IKYLGIN1jT\RGKKDIjDTENYT 
TLIEEIEDDTNKWKDIPCSWTGV 


6805 


20706 


A 


6860 


47 


SS8 


TLRARALQARPRTGSSCTAATWTS/SGA 
SQHSLRALSWRRLYLSRAKLKASSRTSA 
XjLSGFAMVAMVEVQLESDHEYPPGLLVA 
FSACTTVLVAVHLFAIjMVSTCLLPHIEA 
vsnihnlns vhqs phqrlhryvelawgf 
STALGTFLFIAEWLVGWVKFVPIGAPL 
DTPTPMVPTSRVPGTLAPVATSLSPASN 
LPRSSAS AAPS QAE PACPPRQACGGGGA 
HGPGWQAAMASTAIMVPVGLVFVAFALH 
FYRSLVAHKTDRYKQELEELNRLQGELQ 
AV 


6806 












/LISTCQPPKVLGLQA 


6807 


20708 


A 


6862 


322 


175 


GCRCVPPPRASFKIFGRV/RGLDILPKL 
VSNSWPQAILPPWPLESLVLQA 














YVLRQGLTL1LPRL/ECSSAI I AHCSLKL 
LGUiNPPTS ASRVAGTIGACHHTRVI F I 
VFWI ES CLLAQAGPKLLGS S D PPALAS 

VliSIaHSHL I LAT I LRGGD VT S QTD K 


6809 


20710 


A 


6864 


371 


77 


RSMFAN\NLVYDTSDSDDYHLLKVLEEG 
IQTLMGRLEVGSRRTGQILKQTYSKFDT 
NSLNHVALLKNYGLLYCFRKDMDKVETF 
LRMVQCRSVEGSCGF 


6810 


2071 1 


A 


6865 






DYLPTYLKLFRLLFF /CLIIETESHYVA 
QAILELLSSSNS PTSASRSAGITGISHH 
TQ PNFSNNTDL KP YLRLS YALLi FA 


6811 


20712 


A 


6866 


1 


190 


GAKGMHHYTWLVRYF/ CCVEMRSHYVTW 
VGLE/PPAIiKHSSCLSLPKCRDYRHEPP 
HLALFCFSF 


6812 


20713 


A 


6867 


506 




EKPSKGHKKPYTWISIAILiFI IAKTWKQ 
S\PRCPSAGEWINC/WYTQTMEYYSMLK 
RNELSNCEKT 


6813 


20714 


A 


6868 


2 


116 


LFPFCRDR/SLTMLPRLISNSRAQGILL 
PWPPKLLGLQT 


6814 


20715 


A 


6869 


10 


141 


GSAWLFF I FCR /DRGIALLPQLVSNP WL 
Q AIIiLWP PRVLQLQA 


6815 


20716 


A 


6870 


2 


603 


APTFINFPAKGKPKRGDTYHLQVRGFSA 
EQI AR W I ADRTDVN I RVI RP PNYAG PLM 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LGLLLAVIGGLVYLRRSNMEFLFNKTGW 
AFAALCF\VLAMTSGQMWDHIRG\PPYG 
PKDPHKGHVNYI HGSSQAQFVAETH I VL 
LFNGGVTLGMVLLCEAATSDMDIGKRKI 
MCVAG I GL,WL F FS WMLS I FRSKYHGYP 
YSFLMS 


6816 


20717 


A 


6871 


375 


1 


GFPGFPVPKFGIPAPVAPQNPFPPPSF/ 
PQGPPPSPGGPNPNGKNPFFPKTPFFLK 
N£PQKKKIFLKKFKRGPP\GPNQKPPFK 
DFKKKRGLKPPKKGGPLKKKKKKKKRSN 
YSLKKIYSPFGFHFF 


6817 


20718 


A 


6872 


255 


23 


GRVDPTSTSQNAG I TGVSHR /GVQWRDL 
GSL/QPLSPGFKRFSCLSLSSSRDYRLV 
/PFCIFSRDRILPFWPGWSQTPHLK 


6818 


20719 


A 


6873 




157 


YLLFTSYTHAHTHTHTHTHTQFLLLWTS 
GLS\SCCYIFSLSGPNTKCNMIF 


6819 


20720 


A 


6874 


351 


132 


SPRRCSAYGVA/IYLFLYFLINLLSLYS 
IFYGFAPNSFLCEVQEPSLG \ SGWEPL.S 
GNHFS S I KMKI AHR EVNM 


6820 


20721 


A 


6875 


99 


337 


AGSFFQKKGKKKISRVNRQPIQLEKIFM 
NCASDKGLVPRI YKELESAR\ KYQS IPS 
KVG 


6821 


20722 


A 


6876 


69 


306 


YVSHHLSFFF/ CFFGGGEKALFFCPRTG 
IKWPQFGLLEPSPSGLKQFFGLNPPETL 
EYRVFFPPPGKFFFFFFFFFFFFF 


6S22 


20723 


A 


6877 


335 


169 


WHDLGLLQPPPPG\SSDSCASASQVAGI 
TGMHHS FSNF I YLFCY FE KVFLTFTLD 


6823 


20724 


A 


6878 


2 


237 


ELGYKVIiPHPPYSPDLSPTDYHFLKHLN 
NFL\ FHNQQCAENAFQVFIESQSTDLYA 
TGINKIjISHWQKCVDSNCSYLN 


6824 


20725 


A 


6879 


2 


199 


RGRVGRQFQKEIEELKKKLEEGEE/VAP 
SSSSSSSDPLISEPDISGSEEDDDEEGE 
VGEDGEKKKKK 


6825 


20726 


A 


6880 


1 


206 


FFFFETESCYVAQAGLELLGLGDPPASA 
SQWGTTGARHHAQLI F/ VFVFLVEMGF 
HHVSQDGFDLLTS 


6826 


20727 


A 


6881 


1 


123 


WYIHTMEYYSALKRRKFIiSFATTWMNLE 
\EIMLSKISRRRG 


6827 


20728 


A 


6882 


15 


105 


SQLL/WRLRQKNRLNRGGGGCSFPLWCH 
CTL 


6828 


20729 


A 


6883 


223 


1 


AASTFLFPNLKNSLRG\SLRTFSSVTNV 
RKTALTWLNSQDIQFFFSRRSLTLVARL 
ECWRSWPAASRRLGCL 


6829 


20730 


A 


6884 


193 


3 


QGEQDFLPPCLPNFFNF\FFCRDTVSML 
PRLI SNSWPQAVI LPEPHKVTLELQARAT 
TPSHVFIF 


6830 


20731 


A 


6885 


2 


355 


RLLTS S DLPASAS QS AGVTGMSHRAR PR 
NF/CILPNLSPCNTHSPLSLPQPLAPTI 
LLSLWI CL/SLDI S /WDYRHMPPRLAKV 
FFFGDG VSQ \ YRPGWSAWP FAASTSGV 
EAILPPQPPK 


6831 


20732 


A 


6886 


1 


151 


HSVAQAGVQWWDHGSLRPGTPGLKQ/ FL 
PPLKQS S CLSLLS S WDHRCATLY 


6832 


20733 


A 


6887 


314 


203 


FLFFCGDKVLLCP\RLVSNSWTQVILQP 
WPPKVliGLQE 


6833 


20734 


A 


6888 


334 


86 


QNRS T FKRGM / MG W AW WLMFV I PATWEA 
EVAAKIAPLHYSLGDRARLSKGGMKCTS 
I FCS KS AKDCQTLP VKKKI SL PTL PS I 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6834 


20735 


A 


6889 


345 


152 


QHARIMSVFIVETGFHHIGQSGLELMT/ 
S /GNMHASAS QSAR I TGVS PHARPS LP Y 
YKCSKKQLCNE 


6835 


20736 


A 


6890 




169 


ARECSGAITAHCSLDLMGSSDTFTSASQ 
VAGTVGVTIMPRYIFK\FFVEMESHCIA 


6836 


20737 


A 


6891 


361 


184 


NQSSCFSLLSSWEHRLMPPHLANF/SFF 
DRDE/ SLTVLSRLVSNSWTQAILPPRIF 
FENS 


6837 


20738 


A 


6892 


3 


330 


HESAAITGVSHCTQTFFFF / CETGPRFV 
PQVKGQGHDLS SGEFG P PRLRN WPGLTL 
QNAGNTGTL PPHRTNFGD FRRGSFS PCG 
PGGFGTRDLGGFPHLTGKRDTGLDPP 


6838 


20739 


A 


6893 


— 


348 


ARAFFFWNFFCK /NRGLSMLPR LALNSG 
AQGILPPWPPKKLGLRALPTWPKTGSSG 
LEGHS S L YPHHS L I FPHHCS KFCFIKQW 
RLFILRITENQGLTPLNAAKPAEPVMNE 
ARTQA 


6839 


20740 


A 


6894 


340 


240 


CRD/RSLRMLLTLVLNSWIQAILLSWPP 
KVLGLQA 


6840 


20741 


A 


6895 


155 


1 


HLCFWAG \ P VAHS CHPGTLGGRGGR I TR 
GQEFETSLLGPGTQDVLAKCSRA 


6841 


20742 


A 


6896 


454 


294 


PSS WDYRHA/ P PMHNWPDFFCRDE / S L 
PML PRLVWNFWAQLi I LLP WT P KALQLQA 


6842 


20743 


A 


6897 


49 


333 


I YICLSFYQS I I YLYDKQHDVLETVSHS 
VS LDLPGLRFSHLS LWDHRL I LPWLAS S 
NNFF\CRNRGLLVLPRLVSNSWVQAILP 
PWLPIVLGLPA 


6843 


20744 


A 


6898 


434 


324 


FFFFCRD/RSFAVLPRLVLHSWTQAIRP 
PWPLKVLGLOA 


6844 


20745 


A 


6899 


.127 


365 


LPERRRYVS I SGYRYHLPQS LLFFYAQH 
ATFNNELW \LGV I AHACNPS TLGGRDGR 
MALGQEFEIGLDS I VRLHLCNKIK 


6845 


20746 


A 


6900 


208 


1 


FFFFFWILVETRFHHVA/RAGLELLSAG 
NPPTSPSQSARITGIWFFVWFSSNLSQF 
CYAHSFFTCWYSRA 


6846 


20747 


A 


6901 


1 


162 


GTSGTRLFTIGKRWQQPKCPPMAELINK 
MW\YTMEHYSVLKRKEIFCTNLTQLL 


6847 


20748 


A 


6902 


120 


313 


DKVL/WRLRQENPLNPGGGGCSEPRSKH 
CTPAWATRAKLYLKKKKNFKGTLSNWYS 
FRRVANTNLQMQ S GSN YMQ I 


6848 


20749 


A 


6903 


340 


234 


IFFCRER\SILPRLVS'ITWPQAILLPRP 
PKVLGLQV 


6849 . 


20750 


A 


6904 : 


1 


327 


GTSGTSRILPWGPGCIRTPELKQSAHLS 
LPKCWDCWNYRHEPPVYSFAMLIIKLFF 
ELHLSVI LWFLLLANPQTQTES KTFSKT 
/PRLVLNSWPQAVLPWSLKALASQA 


6850 


20751 


A 


6905 


532 


359 


RDG/DLVWWPRMVSMPGFKQSSHLGHPK 
CWDYRCEPLCQDFSYS I IT WFP PSREL 
RR 


6S51 


20752 


A 


6906 


270 


3 


GGTTVYPRKTKT/ WPGAVAHACNPNILG 
GRGG W I TRS GL P VS TKNTKI S RAWCCAA 
WLATPEAEQENRLKPGDGDRDHHGQHG 
ETPSLSC 


6852 


20753 


A 


6907 


1 


312 


GTSWKKMLPTLMDDFRG\KTLVEELIPD 
WEIAREVELE\MEHEDVTELLQPHDKT 
WTDEELLLTDEQRWFLEVGSVPVEDAV 
NIVEMKTKDLEYYISLLIKY 


6853 . . 


20754 ... . 


A . 


6908 . 


331 


61 


FRHVGQAG/LQKLLTSSDLPAPASQSAG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 
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NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=lWethionine, 
N=Asparagine, P=Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, ,i =Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ITGMSHH I RPKL I LRKKSMRAKLDSRQT 
RLQPP WEQGLLEEQHE VYNV YMAHVS K 
SSSRKLS 


6854 


20755 


A 


6909 


419 


199 


SAVQDGV \S WCDLGS L \QP PLSRLKQFP 
CLSFPSTWDYKCAPPHPATFCIFIRDGV 
SPCWSDWSQTPGLK 


6855 


20756 


A 


6910 


341 


69 


EAEVGGLLKARSLRPAWAIQQDP I S I LK 
KSFTLGGQGG WI TRSGDRDLRASASQSA 
GTTGVSHRAQ\LIFAFLVERRFHHAGQG 
GLDLIiTL 


6856 


20757 


A 


6911 


174 


356 


KTYLMPLRSSLRPGMVAHACNPSTIjGGQ 
GGWITRSG/ ESRPS P VLVNRLTRCQANT 
VKPCLY 


6857 


20758 


A 


6912 


391 


3 


SPPPPPFPPSFPPFPPQNFFFPPGASSF 
FGPPPP/YFPPPPKKSPPPNPPPPFFFP 
PPLGKQFFSPPPPFFFPPPGFPFPPPPF 
FFFFFFFFFFFFFFFFFFFFCDKKYFIC 
LMICMLFFCKYILNFCRYS 


6858 


20759 


A 


6913 


43 


331 


IRDTYS FS LS APALA PMLGTKG VLLAVA 
SLGSRGASLCVFVCVCLCVRIH\ IGVQA 
SG CVCVCAC / CVCVC VRVCVC VCPHWPA 
SL1LTKDLQCI1PFTV 


6859 


20760 


A 


6914 


250 


11 


TPNLWGLFLVFCFEM/EVHYIAQAGLE/ 
P PGLKP S S S LSLLS S WD YRHVM PHAAQ I 
CIS VFSLAQKGVP PSNWILI THYS I KS 


6860 


20761 


A 


6915 


532 


53 


SQMALHEGFLLLFLKEKEIjHGKFSHPPP 
SAPNSSMSKWFYFSETESHSVAQAGVQ 
WCNLSSLQPPPPGFKQFSRLSIiTSSWDY 
TYLPPHIANFLFLVEMGFLHVGQAGLKL 
PTSGDLPALGSQSVGITGVSHHTR\PRF 
LFKIGPLFIiFSKSMSQDEARI 


6861 


20762 


A 


6916 


2 


373 


WHTPWLATHEAEAGGS LEPRRAKLQLA 
MIVA/NCTPAWVT 


6862 


20763 


A 


6917 


1 


270 


GTRQSPRLSLLSSSCDHRRVSPHPADF/ 
S FFYFYFFCR/DRGFTMLPRQVli I WAQV 
MLLSQ P PKLLGLQ WG VSHGAR P PHCLG 
YSWVYNKP 


6863 


20764 


A 


691 S 


2 


367 


RCAPHCPDNSSFFK/RVKTGLTIFPRQT 
SNSWPQVTLLPWPP/KVLGLQA 


6864 


20765 


A 


6919 


3 


354 


HEVNIVETTAKDLE YS INLVDKTGFEKI 
DSS FKRRSTVSKMPSDS ITCYRE I FCER 
KSQSMLQTSLLSYFKKL.PQPPQPSA/TT 
TVISQQ PSTLRQGP P PAKRLRCAKGLND 
FQHFWL 


6865 


20766 


A 


6920 


366 


41 


STSGEDAVHIVEMTTKGLEP/YTAVPGF 
HPS S ERRS WGNMLSNS VTCYRG I FGER 
KSQC/RQTSLLPYFN/KPQPPQPSATIT 
LI\SQQPSTSRRDPLPPKRLQI/rESSND 
H 


6866 


20767 


A 


6921 


3 


368 


QLLMGLHLEPATLDLLQLKT\TIQGGSH 
SHHITAIP\PS\FTRENTLMFIHIiSPIL 
LLSLNPDI ITGFFLLKKKNKPKNHKNRG 
PSYKRTPEGPLLLLWAQKTLPPKGGLY 
KIAPLRLFGEDS 


6867 


20768 


A 


6922 


3 


91 


FSVLPRLV\QNSWPQVTCLPWHPKVLGL 

QV 


6S68 


20769 


A 


6923 


334 


91 


ERCGKCAHISEIFLSSLDYIYIFVFY\F 
FFLFFFFFFFFFFFFFFFFFFFFFFFFS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
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09/515,1 
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Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glu tamic Acid, F=Phenylalanine, 

K=Ly s ine^L^Lx u ci ne ™M— Methioni n e' 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X = Unknown, * = Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QFKFNTKS KEKMFNKTMKKVQVTF 


6869 


20770 


A 


6924 


375 


37 


TKAGVQWHDHGSLQPQPPRLKPSSCFSL 
PSSWDYRHAPP/RP/ALFLFljFGLVWFG 
LVFLVEIGSCYIAQAGLKLLGSSDPPTL 
VSQSARIIGPTHHAWPTLTTSIQCTIRG 


6870 


20771 


A 


6925 


647 


1054 


CFWKAHIQAGETNNIQEGSERFHFCPAD 
EVYVHSGPAVEDQPQRRSFAL/ SAQAGV 
QWCNLSLLiQPLPPRFKQFSClASLPSSW 
DYHRHVPACLAKFC I FSRDE VCHVGQAR 
LELLTSGDPPASASQSAGITGMNHHV 


6871 


20772 


A 


6926 


3 


204 


dawettgfgmiydsldyakkiiepkhrXl - 

ARHGLYE KKKTS RKQRKERKNRMKKVRG 
TAKANVGAGKKK 


6872 


20773. 


A 


6927 


1 


112 


ptrprtrgvasvlyfttililipt\isl 
ienkilkwa 


6873 


20774 


A 


6928 


1811 


478 


drarsprsera\rrsssrlrqrdpslgr 
rrlrreirpglpeseprpprppaaltad 
qppprrlsesrggggmse \ age \atttt 
tttlpqapteaaaaapqdpapks pvgsg 
apqaaapapaahvagnpggdaapaatg t 
aaaas lataags \edaekkvlatkvlgt 
vkwfnvrngygfinrndtkedvfvhqta 
ikknnprkylrsvgdgetvefdwegek 
gaeaanvtgpdgvpvegsryaadrrryr 

RGYYGRRRGPPRNYAGEEEEEGSGSSEG 
FDPPATDRQFSGARNQLRRPQYRPQYRQ 
RRFPPYHVGQTFDRRSRVLPHPNRIQAG 
EIGEMKDGVPEGAQLQGPVHRNPTYRPR 
YRSRGPPRPRPAPAVGEAEDKENQQATS 
GPNQPSVRRGYRRPYNYRRRPRPPNAPS 
QDGKEAKAGEAPTENPAPPTQQSSAE 


6874 


20775 


A 


6929 


324 


203 


L I FVFLVETGFHHVGQDGLDLL / NLV I R 
PPRPPKVLKLENP 


6875 


20776 


A 


6930 


3 


364 


HEAS LTQTRTTMTHCSRTTGCSTAS GRT 
WTRSRHSCAWCSAALWRAAVASRCPSS / 
IPVTPPQCLSWP/WKVPLQCPPALS 


6876 


20777 


A 


6931 


2 


165 


grvgfcsvaqaglqwhdlsslkplpp\g 
ssdcpasasrivgitstafhflnnvf 


6877 


20778 


A 


6932 


1 


370 


TELRSYSPSNGAISSQCNLCFLGSGYS/ 
RSSTSQVAGITGA/R/HHTQLI FVFLVE 
TB FHHV \ AKAG LE Ij L TS GD P P AS AS Q S V 
GIAGVSHCAWPHVYILKEIMLGNLVRCL 
MSICHHHTESICLFL 


6878 












RRANT PNN P E AP P N / Q KiG*CKKKICKKKKG 
KNPPGGPPPFFKTPPPFKNGGGSPPGKK 
KGGEKKRGGKIFFPPPGGKKPPPQKKKG 
GGGGKKIFFFLFGPGGGGNFFKKKKGPP 
PFPPRKKKKIFPTGGGPKKKKKKKAPPK 
KPRGFYFP 


6879 


20780 


A 


6934 


518 


340 


PKPKNFPS/PSPPKIPPPKKKVFSKKPP 
GGFNNPPHKRKKYNFPPPEKLGPPKEFL 
KRPPL 


6880 


20781 


A 


6935 


396 


299 


WEVC/GVCVCVCVCVCVCIHSLCKSTL 
HRTGV 


6881 


20782 


A 


6936 


486 


272 


PTRAPTRPAPPHLADFLIFCSDR/SLIM 
LPRLVLNSQPQMILPLWHPKWGLQA 


6882 


20783 


A 


6937 . 


1 


264 


YSNLRDRARSCLSLPSSWDYTCSWDYKC 
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nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 
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od 
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NO: in 
USSN 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ASLRIANF/SNFCRERGLAMLL.RI.VSNS 
WAQVILLPQPSKVLRLCLAVSVFFGEKQ 
GQCS 


6883 


20784 


A 


6938 


2 


407 


TTGTCRSVWLTFVFLWTGFHHVGQAGL 
ELL/RPPQPPKVLGLQA 


6884 


207S5 


A 


6939 


422 


65 


LKGPPPSFFFFFFFFFFFFFFFFSP\liP 
Y I LYFDCASNACLFFKNNTWRYIKRI FF 
SPTPFIPFFHEVENVTSLSYMVFQYCFL 
FF 


6885 


20786 


A 


6940 


462 


209 


LSEFWDYRREPPLPAKFAI FMLCRDRGL 
DTLPSLVSS/SWHQAILAPQPPIMLGLQ 
VHSAYKGFCLPVLTFEEVIGNRPLGLLP 
P 


6886 


20787 


A 


6941 


491 


299 


CAEYSPESGPTHASAHAS/ SNVQMVYSR 
ISCKEELLLGRTSPSKNYNMMTVSG 


6887 


20788 


A 


6942 


429 


256 


FSHLSbPSSWDYKHLSSCPANFCIFVQT 
GFHHVGQA\GDPPASASQSVGVTGMSHH 
S 


6888 


20789 


A 


6943 


932 


661 


FKTGSYSV\TRLEGSGEISAHCNL,RFLS 
SSDFFHLSSQVARITGSSQYARLIFVFF 
VEMGFAMLPRVGLKFLGFKPLHPP\RTP 
RVLGLQE 


6S89 


20790 


A 


6944 


381 


55 


PASLPPCSLISDCCASNQRDSVGVGPSE 
PGVGYSLWRRFLSRSEKRNIRVGVTRF 
SRCV/LSPLSLTQKGNSLTPCASQVRQC 
LALLRLAHGACTHWPAPTVWHSLVR 


6890 


20791 


A 


6945 


816 


513 


FFIFI YFFEMESLLPRLECSRMISAHCK 
VCLPRSSSY/PASASQVAKITGA/TRHS 
RLIFCVCVLVERRFHHLSQAGIiEtiL/NIj 
VIHPSRPPKVLRL.QGVTRTA 


6891 


20792 


A 


6946 


383 


234 


LIPSLMDNSEGLKTSVEEVTADWKIVR 
ELELKVKPE/NVTDLLQS\YDKT 


6892 


20793 


A 


6947 


423 


60 


LNPPPPPF\YKPPPKKKKFFSPPPLKFG 
PPTKFFKRPPPFFFFFFFFFFFFWHGTF 
PLNSIiVGPPKVEGWVSWAQLPRAVGPPL 
PNPKWGLNKPPAGGIiNKPIHFPSLNGSG 
EATPPLIPWC 


6893 


20794 


A 


6948 


1 


327 


MRLLAAALLLLLLALYTARVDGSKCKC\ 
SRKE PK IR YSDVKKL \ EMKP K \ YPHCEE 
KMVI ITTKSVSRYRGQEHCLHPKLQSTK 
RFIKWYNAWNEKRRVYEEAQGLRRRIG 


6894 


20795 


A 


6949 


952 


796 


FFLHFLVEMGN FLLVGQAGLELPTSGDP 
PAIASQSAGITGMSHRARPSVSS 


6895 


20796 


A 


6950 


364 


6 


I TTHKNYFLSRTATPPKEGLPQRKKI FL 
FFTP I F I FFVGF I FWSGGGRGFLVFFFV 
FFFFFFFFFFFFFFFFFFFFFFFF\RVK 
TFIGFKVLFFKINVFFFLIRAVPLDYMN 
HALTIFQW 


6896 


20797 


A 


6951 


418 


266 


ISFFFXXXXXFFFXXXFFLLFFFPFFFF 
FFFFFFFFFFFFFFFFFFFFFFN 


6897 


20798 


A 


6952 


459 


7 


PSYLYKPGPSQPNRRRHSVP/HIRSNKD 
PPPPEVMKKWGPPTPQKKKYFPPRKKKL 
GGGGGPIYPPKKKVFFKKKPGGFKKPPK 
KECKKIFFSPPEKRGPPPSIIiKNPPHPFI 
FFFFFFFFFFFFFFFFFFFFFFLIGEKA 
MLLDMGA 


6898 


20799 


A | 6953 | 325 


143 


LPSSWDYRRVPPHSAHFKFFCRN/RGLA 
MLPRLAKPRFPSISIFLNHPQVSEILGA j 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TRLAR 


6899 


20800 


A 


6954 


585 


306 


KTESHSAIRSQCS \GE I SAHCNLRLPGS - 
SNS PAS ASRVAG IKGTHHQVQLI FVFLV 
EMGFHRAGQDGLRSPALVICPPWPPKVL 
GLEAWPALHC 


6900 


20801 


A 


6955 


2 


155 


FLVETGFHHAGQAGLELL ISGDPP TLAK - 

sVdsqdvritglshrawpfldiy 


6901 


20802 


A 


6956 


3 


346 


DAWPDAWGK \RNWPPHQANF YTFVE / TG 
FT/HVGQAGLELLGLSDVPTSSSLDAGI 
TGMSHCSRPRPEISSRDVHTTPGALQET 
WHTPDLiASSIRSLRCRLDCPHNHCPLDK 
DGVTL 


6902 


20803 


A 


6957 


1 


224 


QCGR IRSRKDARKGNVQI I PREGRRRTS 
PRYLFPVTGQETDGWG\MGLPAWLLKTR 
STSPPRSPRLHRRRGATSW 


6903 


20804 


A 


6958 


102 


344 


GLLKPKSLKLQCTI ITPVN\NHCTAAWA 
T 


6904 


20805 


A 


6959 


3 


296 


KEEEEGEGEGGGGGG\EEEEEEEEEEEE 
EEEEEEEEEEHAEFTGHVES ITQCKLLL 
CHLS LRVEAG I THLLCFT VRAVTRSQRL 
LTKRRNGS VGHTS E 


6905 


20806 


A 


6960 


3 


172 


RLGLPKCWDYKREPP/RPGLELLTSSDL 
PALVSQSDGIKDVGHDTEPQDFYTLGQD 
P 


6906 


20807 


A 


6961 


2 


224 


LALLPRLARSGTI IAHCSL\NRSLRLGL 
SDPPASASESTGTTGMSHCSQPAKLLST 
CIRNSVAETKYYLMFKCK 


6907 


20808 


A 


6962 




330 


KGTLRRQVTYKGTLIR \ LVADTLAETFH 
ARKEWDDIFKVLKKHCQPRI LYUAKPS F 
KNEGE S FSQRKC I TTRVARS KMAKGVLH 
LEVKDISTIIKQMKIRSSLVEQIHKW 


6908 


20809 


A 


6963 


1242 


929 


ETGSCSWQAWQWCGHGYLKPQPSHAH 
/DDPPTASS I AGTTEAHYHAQL I FKKFF 
NSLCCPGWSQVSWRILGSSDPPASASQN 
AGITGMSHGAWPQLSIYRKGN 


6909 


20810 


A 


6964 


I 


378 


KFFLQLFLHSFIYEHFLFFSFLFFFFFL 
RPS FAFVAQAGVQWRDLGS PQPLPPRFK 
QFSCLSLLSRWDYRHAPPQSANF\EFLV 
ETGFLHVGQAGLELPTSGDPPTSASQSA 
GITGVSHRTRLAFY 


6910 


20811 


A 


6965 


393 


43 


VSPFPLKNFYFSFTPKTFVGGGGPSGPP 
PQKRFFFHTPNSPFFPFPPF/YKSGPRP 
GFFFTPPPEKGKNFP\PPLRLGPPPVFF 
TRPPPPFFFFFFFERQRTCSFLHFLWR 
PLFRFYL 


6911 


20812 


A 


6966 


225 


2 


TRVECHGVFFGSCNPLLPGLKEFSAPAP 
RGSGNSGPFPPPR/RNSFFFFFSWLLE 
MGFHHGQADLEILTSSDPP 


6912 


20813 


A 


6967 


8 


341 


FVSVPPI LHYI.FKI FFFFLRRSLCLPPR 
LECRGTI S AHGS LGLPGSSDSPS / STKN 
TKLPLRGGTCLGPQLLGLLRKEKTWTLG 
GGSGSYPKSGKRAMALQGGVQNKTCVPH 


6913 


20814 


A 


6968 


29 


362 


DYTCKHPHSSELKKKKKKKKKKKKKKKK" 
KKKKGG P P KKKP / HGGPQ F P PGRQ KKI F 
PFKGGFKKPPRGFFEKNPFFGGGNLGPP 
PPPKNKPPGEKKNFLGGRGQKNSFLSAW 


6914 


20815 


A 


6969 . 


391 


17 


VFFFFFLKKFFPPPDKKLFFPPFPLKIF 
F/SPPNPFFFLGGFSQISPPPKKGFFPK 



9.76 
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amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 
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location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phcnylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, impossible 
nucleotide insertion 














IPPGV/YFSPPFKKKNFFFPPPLNFGPP 
RVFFKGPPPFFFFFFFFFFFFFFFFLGL 
RGNAGDCNGYGDVSYK 


6915 


20816 


A 


6970 


374 


268 


IMLPRL.V3jNSWPQVILLPWP\PKVFRLQ 
VRAPVPG 


6916 


20817 


A 


6971 


614 


315 


FFFGDALSPGLECGGAVLARLQA/CLLG 
SHRSPASASRVAGTCKRPPPRPSGFFVF 
LVEAG LHR VGQDGLN LL /NLV I RPS R P P 
IVLGLQGMSHHARLFYFL 


6917 


20818 


A 


6972 


861 


528 


FETESRSVTQAWQWCNLSSLQP/LPPG 
FKRFFCLSLPSSWDYRRTPPCPANFCIF 
SRDGVLPWWPGWSRTSDLVIRLPRPPQC 
WDYKREPLRLAKSRHPWNHLPEQEVEHF 
K 


6918 


20819 


A 


6973 


274 


119 


P PN \ FVFL VETG FLQ GGVKLS T P GD P P P 
PASRRAG I TGVSHPACPGKE Y I FKN 


6919 


20820 


A 


6974 


370 


194 


KITNFAKLFGYILPYSHYFWCLEKNRNR 
/SLTLLPRLVSNSWAQAFLLLWPPKVLR 
LQA 


6920 


20821 


A 


6975 


2 


239 


ARGPCSSI/SDCCASGEQGSVGVGPAEP 
G/MGYNLIACCLLRPLEKCS I KAGVSRF 
SWYSLSRLPL,T\RKGNPPTPCISQVR 


6921 


20822 


A 


6976 


361 


270 


PQWLVPVIPA/LLETEVGGSLEPRSSRP 
AWAR 


6922 


20823 


A 


6977 


363 


48 


GPPKEMCPYPN\PRPYECNLFGKRVFAD 
VIKFKI LRW\ I ILD YTQVDPKSNDECP Y 
KKQ KRRGHRQ IKRKS QADQGG RDWSGAT 
ARNAKMITSVGEDVRKRKPLAYC 


6923 


20824 


A 


6978 


9 


163 


AGYNFLVCC \ LPRLLE KCS I RMGVS RFS 
R YHL S WL P FAR KGNS PT S CTS WVR 


6924 


20825 


A 


6979 


408 


212 


I ETG FCHVAQDG ) LELiLASDDP PAS AS Q 
SAE I TD VSHRAQPHS AFLPWFS S JCI FAY 
GSVWLVFPHS 


6925 


20826 


A 


6980 


395 


254 


LIFVFLVETRFHRVGQAGLELLTSSDLP 
SIiASG\ ITGVSHHSRPSSAF 


6926 


20827 


A 


6981 


408 


2 


KVSFFFPKGPRGKKKNGAGGFSPFFPPF 
LGGGTKNFFYPP I FSQGGGKK/ QKP PPP 
EKKKNPPNPAFWGPPLVLKNWGGRPGGP 
LDPKVSNFMGAPPFLFGAKKKTFFSPPP 
PKKKKPQKKKGGGRSRSRTSRTRG 


6927 


20828 


A 


6982 




302- 


RKHLPPH \ P VI FVFLEQATLRHVGQAGL 
ELLTSSDHRASASQSAGITGVSHCSLPA 
TSFSTPPFSVAKMLRGTQKNKVPLIGPL 
PQPIRLVAGHYFIYIT 


6928 


20829 


A 


6983 


2 


375 


RGRVGGPVGGPVGRTFI FFQAGSDSARI 
LVLFPYCTLPHPLPGLFFFFFKTGIKSW 
VLLNPSPRFVIFLEKKIFLPQPPFF/HI 
LSPPHPGGAGGGNPTARKIKGLSYRDPK 
EGHKRGWAHTPT P 


6929 


20830 


A 


6984 


1 


146 


PRPANFC/DFLVETRFRHAGQAGLKLST 
SSDLPTLVSQSAGTTGMSHRA 


6930 


20S31 


A 


6985 


2 


95 


LTMLPRLECNDTIPAHCSLNS/SGSSHS 
IjTS TS KVAGTTGVHHHFWIiMF / P LF I IQ 


6931 


20832 


A 


6986 


340 


169 


VCSSGLS S PLLEQHKTNL I FYASGD I CT 
/ANGKSGFNQPQPFLKTFCCTKRIMSCT 
YL. 


6932 


20833 


A 


6987 


346 


218 


PCLTNF/ S I FCRDE / S LTML/ PRLVLNS 
WAQAILLPRPPRLLRLQA | 
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peptide 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valinc, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6933 


20834 


A 


6988 


345 


141 


NRPKCPSMIEWIKKMWHIDTMEYYAAIK 
KDKFMS F \ ET I TLGE VTQE WKTKHRLLS 
LIRGS 


6934 


20835 


A 


6989 


154 


236 


ELISFSRSVERCISSFIKVLFVETDLFT 
LSLS PFFS VLFFFC/RRWGFTMLPTLVL 
NSWPQVILLPWLPKVLRLPT 


6935 


20836 


A 


6990 


2 


156 


FLVE / TGFVHVGQVGLELLTSGDLPASA 
SQSVGI IGVSHRVRPCFLFVCICLV 


6936 


20837 


A 


6991 


310 


96 


KNFFFFLVKKS P / LNVAQGGLKLLGSND 
P PAS VSQSGG I TGVS PRARPLNFYKEAT 
DPNTS VLKGLNI KKKKN 


6937 


20838 


A 


6992 


15 


279 


NLLCFPSISPLIGIFFFFERGFHKVGQA 
GLKLLTSGD / LRAS PSQRARI TAMTRHP 
HPLVCGLLNSTHCDWRGRIPHWGFDLRF 
REGPS 


6938 


20839 


A 


6993 


127 


352 


KliADTPQSRFVLVCV/LIETKYSLCCPG 
GLQLLAS SNP PTS VSQSAT I TDVNHKAE 
SQVFLNLWPKSSKPTNTGL 


6939 


20840 


A 


6994 


323 


119 


SSSEDHKHVP PHSAMFCFF / CLRQSLTM 
LPRLVLNSWAQVILPPSPPKPFLDALKK 
KSSKWMSSLLIT 


6940 


20841 


A 


6995 


230 


1 


FFFETES CSVTQAGVQWCDLGSLQPLP P 
GFKQFSCLS/LPNSWDYRHALPCPANFC 
IFSRDGVSPGWSGWSRIPDLG 


6941 


20842 


A 


6996 


346 


121 


RDICTPMFVTALLIAKIRSPHKCPSVDT 
WIRKMWF\IYTMEYYKAIIKNKILGMSH 
HASLPVFNHMCKLKYLIS 


6942 


20843 


A 


6997 


- 


198 


SNNTWAFREPGITGMCHHAQLIVLLYM 
QFHHVDQDGLDVR /NLVI CPPWP PKKL.G 
IQALNQPSPDI 


6943 


20844 


A 


6998 


183 


31 


MRHLKSSQIQQGTVGHACNPSTLGDHGE 
RITQGQEFKT\GNTVRPCLYKD 


6944 


20845 


A 


6999 


3 


471 


LALSCSGTILAHCNLHLLGPSLPPTSAP 
RAAGTTGVHHHAWLI FVFFGKDGVAQPG 
LKLSFK/VIRPPQLPKVLGLQA 


6945 


20846 


A 


7000 


483 


239 


GNNNI LFLFLFFI ETGSHFVT\RLECQW 
VQSSAHSQPSAPRAQVLIFVFLLQTGFH 
YVG \QAGLQLLTS SDPPHS ALL KVRD 


6946 


20847 


A 


7001 


132 


353 


YAKLGTRGFARGPWPGGFLTVAVWMSF 
AQAGINPFVCIFSNRELRRCFSTTL/PL 
LQKIQVTKGTLLCYMREHL 


6947 


20848 


A 


7002 


2 


399 


2DAQEVELQEGKVHSESDKAITPHSQEE 
FQKQERESAE\SELTES 


6948 


20849 


A 


7003 


2 


135 


IFQFLVEMGFHHVGQAGLELL/NLMIHP 
PRPSKVLGLQAYWHSTRP 


6949 


20850 


A 


7004 


2 


114 


I FQFLVEMGFHHVGQAGLELL/NLMIHP 
PRPSKVLGLQA 


6950 


20851 


A 


7005 


455 




ALFPAWLWLPSSSKGGLKLSHGTTLTLF 
LLPSSTLKDLWPQEQWL/GTCNSQHFGR 
PRKADRLAPGVQNRPGQRGETPSRKKGG 
EGNWGGGGELHIYTYTHTHTYVHTYIHA 
NWTWWCVPWLAFQETEVGGPLEPEVAP 
AWVTVRPCLKKK 


6951 


20852 


A 


7006 


371 


172 


OVILCLSLPGSWDHRHTPPHPA/NFLVE 
TRFQHVDLELLASNDPPASASQSVGITG 
VSHCARPCYDL 


6952 


20853 


A 


7007 


486 


252 


PQTVFFFFFFFFFFFQALFSTFYILHVF 
FFFFPPFFPKFQFFPFP/LSLIKQGKFF 



978 
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NO: of 
nucleotide 
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SEQ ID 
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sequence 
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od 
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beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


I Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
Acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glyc"me, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparngine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=poss/ble 
nucleotide insertion 














FFFFFCYLCNSLFIWTQYFFVNE 


6953 


20854 


A 


7008 


1 


113 


KNFCR / DRSLDTL PRLI SNS W PQVTLL.P 
QPPKVLGLQA 


6954 


20855 


A 


7009 


105 


383 


SLHRPAHPGLLKFGKKAIHS F IHLCRDK 
GDLAILPRLVSNSWPQVILLPWST/NVL 
RLQA 


6955 


20856 


A 


7010 


609 


348 


ESHS VTQAGVQWYDLDSLQPLPPGS \SD 
S PAS AS HVAG I TGMRHHTQL \ I F VLLVE 
TRSHHVSQTKFVLFDTTKFWI FYSGKR 
TRIRL 


6956 


20857 


A 


7011 


382 


233 


DLEIFRGKTFLHHIFPFFFFFFC/RRRG 
LjAMLPKLjVIjNSWPQVILPKCWD 


6957 


20858 


A 


7012 


359 


199 


DLLPSLPPFLPSFIAFFLFETESHSSPR 
LECSGAI PAHCSLGDRARHRLKINK 


6958 


20859 


A 


7014 


3 


274 


CFFF FFETGSHS VTQAGVQWRHL.DS LQ P 
PPPGCKRFSCLS/LPSSWDYRHV/PSSP 
LLVRNSSICLLSCFEMTPELFDGVLAIY 
CCVTNDLKA 


6959 


20860 


A 


7015 


372 


54 


LPSAWEAEMRGPLEPKGLRSQRAEIAP\ 
CTPAWVTEQAYVSNKTNTERPFQKIEHV 
EGLSCTTQKFSSND 


6960 


20861 


A 


7016 


548 


158 


KFDT/GATLFDGRPAVFDEA/DFPSVAV 
YLTGAEYTGEELDSDTWQAELHIEVFLP 
AQVPDSELDAWMESRIYPVMSDIPALSD 
LITSMVASGYDYRRDDDAGLWSSADLTY 
VITYEM 


6961 


20862 


A 


7018 


396 


34 


QKNE I KPTMQLHLTPARMAI I KKI LKNG 
GWHGCGE/METLLHCWQECKLVQPLWKT 
MWR FIiKE P KVEL P FD PA I P VLG I QQS QH 
WESSNPSTGDPAIPPLGIQQSQPRRKVI 
KKRYLHTRL 


6962 


20863 


A 


7020 


2 


241 


FPGPTAARRRQKEDRYEEYDKVLSDKLK 
EAETRAEFAERSLTRLEKS IDDLQDEL \ 
YAHKLKYQAISEELDHALNDMTS I 


6963 


20864 


A 


7021 


347 


3 


NDLKSLTHSVAQARVQWQANISSLQRPP 
PQFKRFSCFSLPSSWDYSLLNCKFYHLH 
FADEEIETLTSQRAQLECKSNMYKDIHN 
TVRSYKLKYKVPQKLERSRNRTKSSTRE 
VRS 


6964 


20865 


A 


7022 


2 


160 


LRED DRRGRG FHHFGQAGLELLTS SD / L 
PTSASQSAGITGVTHRARPDWSFCQF 


6965 


20866 


A 


7023 


3 


248 


DRVSSVAQSSLELLSSSNPHLGFPKCWD 
YRPNPPLPFSSRSLLFLPS FLIFCRDRV 
SMLPKi>ASSFW\VRPPWPSKVLGLQA 


6966 


20867 


A 


7024 


483 


143 


PLNPLKFFFFPKAFKFWGGVGPICSPPK 
RRVLSQKSQVGFYFPPQKEKGYTFPSPG 
KFGPPKEIbKRAPPFFFFFFFFLiRRDK/ 
SLTMLPGLVLNPWPQAVLPLQPPKALGL 
QA 


6967 


20868 


A 


7025 


414 


81 


PHPLKFFFFPETFYFWWGVFSPFSPPGR 
KVPPQKSPGGFLLAPPKGKSFPFPCPFK 
IGPPRGIFKRAPPYFFFFFFFNRRDK/S 
LTMLPGLVLNPWPQAVLPLQPPKALGLQ 
A 


6968 


20869 


A 


7026 


16 


218 


KFAPRCRNSARENTQNQPRKHSDTPFLK 
K/HKINNNNKLAGRGTHLWSQLLGRLRW 
EDHLSPRQSRLQ 


6969 


20870 


A 


7027 


28 


272 


EFVGVGPSEPGAGYNliL\CAFLSPSEKC j 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 
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NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phen>lalnnine, 
G=Glycine, H=Histidine, NIsoleucine, 
K=Lysine, L=Lcucitie, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














S IR VGVTRFS RCHLS PLTLTKKGNS LTP 
CASRVRQCLTLLRLAHGALHPLSSTP 


6970 


20871 


A 


7028 


277 


480 


LGI FHMLSDS FLLLLLFCLFVFKIFFG I 
FCRD\RSCYEVWAMLPRLVSNSWAQVIL 
PPWPLKVLGLQA 


6971 


20872 


A 


7029 


24 


153 


SVWWNSP / RS S GEFRKTGVHDADFEAH I 
IDMLAKASKIEVYRGN 


6972 


20873 


A 


7030 


2 


470 


IETPRPLWWHSAYRGRQASLSCGGLHPV 
R/ASWLLCLPNQAWAMAGAPPPASLLPC 
SLISDCCASNQRDSVGVGPSEPGVGYNL 
IjVRRFLSQSEKRNIRVGVTRFSRCV/LL 
PLSIjTRKGNSLTPWASQVWQCLiALLRFA 

hgarthwpaptvwhslvr 


6973 


20874 


A 


7031 


487 


107 


sppapgvapapppptrvfskkkkkffff 
flkkifffkpggdpggifffpggpppgp 
kkfs pplfpkkggyrvfppppgkis i ff 

FFFFFFFFFF/CRDR/SLAMLPRLiVLNF 
WAQVILLPWS PKVLGLQA 


6974 


20875 


A 


7032 


576 


36 


gytsqmgrpgrgaphfpdraagqrrs s l 
prwggwaealltsqmgwrpgrgaphipd 
dgqpgrgaphlpdeeqlgrgaphlpdee 
rpgrg aphfpdg aagqr cas hp trggra 
evlptsqtgqpgrgaphlpddgrpgrgt 
phlpdg\gpgrgaphipdgaarqrhls p 
prrgncttrivdq 


6975 


20876 


A 


7033 


2 


129 


YGPTHASGAMLRSCAARLRTLGALCLPP 
VGRRL.P\EASRDPS 


6976 


20877 


A 


7034 


1 


287 


RLSLALVARDGVQWCDLGSPQPPPTGFK ~ 
RFSCLSljPSSWDYRHVSPRPANFVFLVD 
TGFLHFGQAGLELPTSGDPPALASQSAG 
I\TGVSHRAWPK 


6977 


20878 


A 


7035 


3 


315 


HASAHASALiFYFIFFETRSRSVAQAEVQ 
WRKLGSLQPPRFKQFSCLSHPYRHAPPH 
FANLVFLVETG FA\ HVGQAGLELPTSSN 
LASQSGGITGVSQRARPGLNF 


6978 


20879 


A 


7036 


430 


304 


VIFYLI FFCKDGV/ LTMLPRLVLNSWDH 
AILPPQPPKVLGLQA 


6979 


20SS0 


A 


7037 


3 


256 


FFFFNRQKDLFGNFQTCQKWHS PSPRPT 
/APHQAPKGNTSPPSLPIHLTLASTCTY 
P I PQAMLENWS VPS PAQAAGEAAL WHPR 
R 


6980 


20881 


A 


7038 


13 


481 


FARLHE FGTSR VI YLIiARJJVSNS WPQ VI ~ 
CLPWPPKL/LGLQA 


6981 


20882 


A 


7039 


455 


255 


SCLSLPSSWDYRRAPRHPANFR\FL.VET 
GFRHVALAGLKYPPALASQNAGITGVSH 
HARPEQFIL.TN 


6982 


20883 


A 


7040 


490 


172 


KCWQG/C/GETGTLVHC/WIGRNCTMMR 
PQWQQAWQVFKKLHRELPSDPAMPLLGV 
QPGGLKIDTQT/RYMYMALLTVKRWKQP 
ECPP VDK/LDKLW Y IHTTE YRVT I KKK 


6983 


20884 


A 


7041 


438 


37 


IFPNLPPKLHYFPQFSPPEKLPKPLLIK 
MAKRQSQGKVFETP KE K/ SRAG PPP KFT 
TGFS AE KNGGPKPGG Y I S QR VQ KKKNNC 
HSRILHSAKLSFKNRSEIKTFLNKQKIjK 
/DFTASRPALQKI LQEAEFHRTD 


6984 


20885 


A 


7042 


292 


463 


IEAISFLPFFFGDSLTGAQWRDLGSLQP 
LPPGLKRFSCLSLPSSELP/CSGRCQS 


6985 


20886 


A 


7043 


59 


257 


LASLVSQSWMKKMWYIYTMKYYAATK\G ' 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 
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NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, 1=Isoleucine, 
KHLysine, L=Leucine, M=Methronine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NEIMSSAGTWMKI.EAIILS KVTQEQKTK 
HHMLSL.RSGS 


6986 


20887 


A 


7044 


279 


14 


RTADTE IGRGS SLCWRYGSLQ I QVSKSH 
NFFFETGSHSATQYGVQWHNLGSPQPQT 
YGLQQSSHCGLPKCWDYRE/LPPSMECS 
GTISAHLSLKLMGSSNPLTVASQSAGIT 
GNS S S RLH D I N YGLQ 


6987 


20888 


A 


7045 


578 


253 


AAASTFSCFFVFFETEPNSVA\RLECSG 
ATSAHHKLCLPGS SNSPVS APQVAG I TD 
AHNHIQLI FVLVETGFHHVGQAGLELQ/ 
NLVISLPWPPKVLGLiQAWSHHAQPFV 


6988 


20889 


A 


7046 


3 


229 


F FKTGS LS VS QAGVQ W/ PNLSS LQLPP P 
RFKQFSCLSFPSSWDYRHPPPHPANFCS 
FSRDRVPPCWSGWSQTPDLR 


6989 


20890 


A 


7047 


367 


151 


LSLLSSWHYGHVPTSPANFFYF/CVDTG 
PRCVSNSWTQAVCLPQLPRVIiGLQA 


6990 


20891 


A 


7048 


370 


141 


CQPPHPANFCIFVEIVFCHVGQTGFKLL 
TSNDLPAWASQSAGIWD\TGVSHHAWPS 
RFNSEVQFDYVSNKNVDFSLS 


6991 


20892 


A 


7049 


104 


361 


VTAPGLEAAFKERVWPGAVARTCGPSTL, 
GGRGRR I \RGQE I KT I LANTLKP CLY 


6992 


20893 


A 


7050 


2 


325 


RFSCLSLPSSWDYRHEPPRLAIFFFFFF 
GIIRKGGFTH\VGRGGFKPWTSGNMPAL 
PSQGVGFPGIiFS PPRPVRGFFYPLPKWG 
GLFLQNGVKFKKGGSGDLFKMEK.K 


6993 


20894 


A 


7051 


14 


208 


AHEIL.SEIENPLEPGWDHRCVSSCPAHF 
FVF/CYRDGGLPML.PKRVLNSWAQAILP 
WAPKVLGLQV 


6994 


20895 


A 


7052 


103 


368 


GNRVHTYMLTALFMTAKKWNKPKC/PVS 
DEQIRQ I WSVHAMEYYSAI KRNEALIC V 
TTWVNLENIMLSESSQKTMGCMLiRRS I Y 
MKGPE 


6995 


20896 


A 


7053 


421 


2 


LFLPPEVARGFPKGRGRASPS PGGFFFG 
PRGGPIFPPP\SFFPTRPWSKGEVFPSP 
TPGKGFRGFFFPPPPKGKGGVFPLiFFKK 
YLGGEGAPQNKGGPVFliLWPPHSGPPGP 
PKNFFFFFFFFETESCYVYPGCSAWQS 
Q 


6996 


20897 


A 


7054 


489 


40 


I PEAPGGPGFKREDS PGANKSGPKALYP 
PKTFHPPWEGRFRSFQKGAPPPPVFFPP 
FFPLPFFFLRQSHS I PQAGVQWRDLGSL 
QPPP PPG FKQFS CVS LPC P P \RQMGFRH 
VGQAGLELIjTSGHPPDSASQSTGITGVS 
HCAEFHRTD 


6997 


20898 


A 


7055 


3 


261 


TGSGSVTQAEVQAEVMAHCSLNLLGSSD 
LSALASRAAGAAGTCHHAWLIFVFFIEM 
G FHYVPRL / GLEL WARVIHPPQ P PRVLG 
LQV 


6998 


20899 


A 


7056 


1198 


1040 


FLWKIALLFYFKICLPQSIiQPLVATnjSS 
QQ PSTS RQD PLPAKRL \ RLAEGSDDR 


6999 


20900 




7057 


271 


33 


LFFFFKMESRSVARLECSGAILAQCNLA 
RLFLVE \QG FHHVGQDGLGLL IHPPWPP 
E VLGLQ AQATPP S P K I TYEAATVM 


7000 


20901 


A 


7058 


2 


194 


CYRRQPPHIiAMNFYFLLNFFCLFFCXKA 
LSLiLARLLLNSQIiQASLPPQPPKVLGLQ 
ACGTTPCP 


7001 


20902 


A 


7059 


3 


272 


NMWKLNNILLNNQWVNGEIKMKIFKNCK 
E \NGNTT YQNLRDAAEAFLKG KF I AVNT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ n> 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Asp.irtic Acid, 
E=Giutamtc Acid, F=Phenylalanine, 
G = Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagtne, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib)e 
nucleotide insertion 














YIKKDYRKTCLRLFYSYHFFNKEKEYTL 
KTMIKV 


7002 


20903 


A 








FWGPHKKNFPLPARGRKLGSFKTA/LPL 
FFFFFSFSLSLFFFSGSCSVS\RLKCSG 
TITAYCSLEFLGSSNPPASFSQAAGTTF 
TFFLFIG 














SCFRLLCLPKEAWAMAGAPPPASLPPCS 
LISDCCASNQCDS/VGVGPSEPGAGYNL 
WRR FLSGS E KRN I RVG VTRFSRCRPS P 
LSLTRKGNSLTPCS S QVRQCLALLRLAH 
GARTH 


7004 


20905 


A 


7062 


60 


324 


DDFVSWQTGVHWCGRGSLKPQPPRLSS 
TSRLSSPSSWDYGRVPF\NTFCRDRVSL 
CFPGWPRTPDLRGSSCPSPKVVKLHTRV 
TAPGQ 














RPGAPG / PDDFARMGPRPKGGAFTLVHP 
MPSGAAFPGLLSEP \ PHRGPRS FAARS P 
TLRYPGRAPS KGSGEDPARP PATGPGPN 
LIQDGLVIjMSLTENVC 


7006 


209O7 


A 


7064 


65 


481 


CLCPAP RGGAYRGRQAS LS CGGLHP VRA 
SRLLCLPKQAWAMAGAPPPASLPPCSLI 
SDCCASNQRDSVGVGPSEPGVGYSLWR 
RFLSRSEKRNIRVGVTRFSRCV/LSPLS 
LTRKGNCLTPCASQVRQCLiALLRLiAHGA 




209O8 










GGLKP / I PGNPGNSLFSQKKKKQPGAGG 
SPRGSPPPGGLGGGIFFGPKGL.GSINPN 
FGPAPPPGGQKKKSLF 


7008 


209O9 


A 


7066 


538 


24S 


FFLKQSCSVAQAGVHWGYLSSL.QTLHPR 
F\KFSCHSLSSSWDYRCAPPRLAN\FVL 
LAE MG FH FHH I AQAGLEL P T SNDL P TS A 
SQNAGITGVSHHT 


7009 


20910 


A 


7067 


2 


127 


VTGQAWHACS PGTLGARAG / W I VWAQQ 
FRTS LG I MVR PCLYL 


7010 


20911 


A 


7068 


379 


217 


KKMFCKNE/SFTIFPRLVLNSWAQTILP 
PWPPKVLGLQVQTTVPGSYS PL.S bYKE 


7011 


20912 


A 


7069 


386 


176 


KRDKVDKWLKKMWCIHIMEYYLAIKKET 
LPFATVTVNLQ\TFILSEISQEQDKYHMI 
SLICGIKMLIYRIRE 


7012 


20913 


A 


7070 . 


30 


408 


NSSELGSVGVGPTEPGTGGNLLVCWLLR 
LWEKCS VW AGVS RFFQ/ CTDYDG FPW VG 
KGNPPTPCASQVRRHPALLRRTLHGLHP 
CASHEKNQVPQLEMQKSSVFCIDLTGSC 
RVELFP FGHLGS KNP 


7013 




A 


7071 


310 




HVPWRSFAPAHCNLGSLQPPPPGFKRFS 
CLSLPS S WD YRCP PPRLAN\ FCSGS LQS 
WLTATSASWVQAILLSQPPE 














I F I FCRV / RGLT I LSRLVWNFWDQVI LL 
PEPPKVLGLQV 


7015 


20916 


A 


7073 


1 


276 


ELWDNFKEPNIYLKSLKERKECVCERGR 
RDRNKTLEDIMAKNFSNWMKI INPQIQE 
AQQTPN\QETGRKLALRHITIKI,LKTND 
KKKILKAV 


7016 


20917 


A 


7074 


362 


133 


GGGGGGGRYSPQKKGTJJPLTPPSNWGPK 
GAPPPRGNFFFFFFFS \ RDGCLAMLPRL 
VLDS W AQVMLLPQ P LKVLKL 


7017 


20918 


A 


7075 


3 


177 


KWSPHHSLPNNNWLYRHMPPCPANVEFF 
CSIAVLPRMVSNSWPGAILSPWPTRMLG 
LQT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla!anine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, JL=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutnmine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


701S 


20919 


A 


7076 


1 


169 


PQLPSSRNYRHVPPCWANFCIFS/RRRC 
FTMLARLFLNSWPQVTRPHWPPKVLGLQ 
A 


7019 


20920 


A 


7077 


402 


231 


psclslrsswdyrcmpphltnyyyyyyy 
yfvemgshsvtrpqdilppwpNpkvlgl 
QV 


7020 


20921 


A 


7078 


422 


316 


FCKDK/SLPMLPKLILNSWVQTTHLPQP 
PKVLGLQA 


7021 


20922 


A 


7079 


58 


293 


DPICTKSKKLKKTQ\TDKEDEITTKKAK 
KAKSKIKIEEDGEEGKVWVEEEVSVKK 
KKDKKKPWLGMVAHTCNPNTLGG 


7022 


20923 


A 


7080 


363 


123 


GVFFRDHARAFSLHTGVCAVHISRCP/H 
VLTHFKEFLFHWFLTIHTHTHTHTHTHT 
HTPSFPELGKAVPMHRIVIVPTNYQH 


7023 


20924 


A 


7081 


3 


181 


YFLL\ FFWDRVLPCHPGWSKGWSQTPGL 
GQSAHLSLPKHWVCRREPPHPAIjFLIiFK 
NTIS 


7024 


20925 


A 


7082 


453 


142 


FFXXFXFXXXXXFFFFFFFFFXPPFFFF 
FFFLKNSFPYFFLFFFFYPLFLPLFFFF 
FPPNFFFFFLFS*NFFFFFFPPPPLFFF 
FFFFLFFFFFFFFFFFFFSF 


7025 


20926 


A 


7083 


268 


92 


KNNLPSSWG\ YKCSPPHKGNRFFFYIKD 
GGGLIMLLRLISNSWAQVILL.SWPSKVI, 
ERQA 


7026 


20927 


A 


7084 


413 


0 


SCSVAQDGVQNDDLGSLQRLPPKNKPFF 
CLS S TPAA\ FPSDWDYR 


7027 


20928 


A 


7085 


400 


125 


KTRSFSWRGGVQL/CI/GSLKPRTPGV 
KSSSPSASPVGRNTGMCPPPQQIFFFLV 
ETGSHYAAQASLKQSSCLGFLILFLSQP 
LGEKEGPTICT 


7028 


20929 


A 


7086 


377 


38 


NPGPGFYFWGPKKKIKPFPPPGVKLVSF 
KRAPPFFF/ CFFFETGSPFVAQTSLELL 
DSHNPPTSASQS AG I TDMSHHARPMHS I 
CMIQLWKSYHRFHFISQGSRDCSSSREE 
T 


7029 


20930 


A 


7087 


1 


151 


SFCRD/KRLIMLPRljVLNSQAQVMLiLPQ 
PPKVLSHHTWPIIKFSFMHFNLF 


7030 


20931 


A 


7088 


405 


111 


PRLVLN\ S ELRRS AHLSLPKCWDYGPE P 
PCPAQMHVLKSGTAAHKVHD FLAS WSHL 
PTHSPFPVLCSKTQISVAFFIQNLRAGQ 
RSGQRHFGKQCGSIH 


7031 


20932 


A 


7089 


1 


309 


KQINKSSLRISGKPPNCECVLHLGKSVN 
KSLLEPHKLASDP/HLCDFLESS\EQVK 
SVKELNCHV\TSLYTLGALESDMAEYF\ 
. NKYTLG PGMLAHAYNSS I LGG 


7032 


20933 


A 


7090 


414 


240 


FSLF/CKNLKKGLTMLPKLVSNFWPQAI 
LLPQPPKVLGL 


7033 


20934 


A 


7091 


460 


328 


RHPPPYLGK/FFCRDR/SLTMLPRLVSN 
SWPQVILSPWPPKVLVL 


7034 


20935 


A 


7092 


388 


199 


KOFSCLSLPSRWDY/SALPPRPAHFCIF 
NRDGFSPYWSGWSLGLPKCSDYRRRIFL 
RAKAHVRE 


7035 


20936 


A 


7093 


18 


271 


MHSFATYIjljVNLHljIiVLCRIiCPLSMYSG 
EKSYILLTKRPMLWF/SSGPFPSWSFSS 
ITLRIFLTCLTLKLSCCSCCCCCYCCCC 
C 


7036 


20937 


A 


7094 | 415 


291 


GDLGS PQAPPPG \ S RHS PAS AS RIDGTT 
GARHIMNFKNLAI K 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^Glutamic Acid, F=Phenylaianine, 
G=Glycine, II=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=V«line, 
W~Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7037 


20938 


A 


7095 


21 


428 


DKGLTVTQARVQW/HDPSSRQPPSPGLK 
RFFRLSLPSNWDHRCAPPRVANYYFFYR 
WGSYHVAQAGLELLASSNTPTSASHSAG 
ITALFLRDTLVRFFMGSFCPPQSFGAAF 
TAQIPQRGRTEELFIFLFLXTDYLFS 


7038 


20939 


A 


7096 


320 


134 


LPL \ PKTTFYLFLNRDR VLLCCPGWS R \ 
HTLDFS 














DYRHVPPFP.ANF/LVEMGFWHVSQAGLE 
IiLGSSDPPNTSQSAGITETSHQAQAYTF 
IFIIY 












259 


PPPRFKRFSGLSLLRRWDYRPPPSCPGN 
FFFFF\CIFVQMGFHLVGQAKMLKLHD 








7099 




275 


VQWYNLDSLQPASSGFKRFSHLRLLSTW 
DYRH\PRPANFCIFSNTKDTFI 


7042 


20943 


A 


7100 


3 


404 


HASGATWRNPVS TRVWWH I P I VPATQEG 
EMGGSLEPRK/LKAAVSHCTPVWVT 


7043 


20944 


A 


7101 


539 


336 


FETESRSFTEAGVQWRDLGSLQPPPPGF 
KR/DSPALASRVTGTTGAHHHAGLIFVF 
LVEAGVSPHWPGWS 


7044 


20945 


A 


7102 


408 


287 


WLILFFIFCRD/RGIiTMLPTLVSNSWAQ 
AILLPQPPKVLGL 


7045 


20946 


A 


7103 


1074 


705 


SFLRWSFTLVAQAGVRWRGLGSLQPLPP 
WFKQFSCLSLLSSWD\YGRPPPRLANYF 
F \VILVETGFHRYWPRMVSNSQPQ/ CDL 

TCTHSPYLQGIVSLG 


7046 


20947 


A 


7104 


227 


55 


FSQHKINIQFYISLSPTHITMSFFLCRD 
/RGLAMLHRL/VIJJSWPHVILPPWPPKV 
LRL - . 














ESSCFGLPKSWGPRKEPQPCLALKWTLQ 


7048 


20949 


A 


7106 


37 


143 


NSFF/ CKRRGLTMLSRLVSNSWPQVILT 
PQPPKLLiEIj 


7049 


20950 


A 


7107 


441 


106 


GAPS PAS LPP CS L I SDCCASNQRDS VGV 
GPS E PGVG YS L WRR FLSRS E KRN I RVG 
VTRFSRCV/LSPLSLTRKGNSLTPCASQ 
VRQCLALIjRXAHGAHTHWPAPTVWHSLV 
R 


7050 


20951 


A 


7108 


413 


207 


CCIQAGPAFFGGPPPPRWGWGGVPPPGY 
/MPSPPVFRVGLLVFRGPPPNWFLPMGL 
QPDGPGLGDPKKFAK 




20952 










PHLPDGAAGQRCPPPGRGGGCVRAGPQL 
PPGRGGWPGGGLPPPPSRTGWLPGGGAP 
HFSDGAVARQRVSSLLRRGGRAETLLTS 
QTGSRPGRGTPHISDDGRPGRD 


7052 


20953 


A 


7110 


408 


13 


PPWGPPGGGPPEARGSNPPGPPWGTPLS 
PKKPKNYSGQGGGPLNPPPPEG/SGPGI 
PFSPEGKGSLNPGGQKGKIGAPSPPPGG 
PKKKPPSQKKKKGKEKKEKVIKGESSQT 
LLFKVSKILSVNFLHLSSLI 


7053, 


20954. 


A 


7111 


290 


175 


LKKNCR/DRSHTMLPRLVLNSWPQTILP 
S QP P KVXiGXjQI 


7054 


20955 


A 


7112 


1136 


552 


I L.FKEMGCE AGQRCSCPVI P I TLGRPRR 
ADHLRCGVRDQPGPTWLKPRLYSNTKIS 
WAWWRVPA\IPA\ IRRPEATRFTFNPRR 
WRLPLNLRSHPLHSHPGRQSKIPIKKER 
KKGQDRKGKGCGQNWGQKQKRRKEGFOCE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MGMGRIGGKNKKEGRKERKEGRKEGNKW 
VWAELGAFPEQGKGI.RRG PS E EA VGVS P 

Li 


7055 


20956 


A 


7113 


570 


199 


LCTiLKQAWLMAGAPPPASLPPCGLISDC 
CASNQQDS VG VGPS E PGWY S LVVRR FL 
S RSDKRN I RVG VTRFSRCV/ L S PL.SLTR 
KENS LT P CASQ VRQ CLALLRL VHGAHTH 
WPAPTVWHSLMR 


7056 


20957 


A 


7114 


466 


284 


RTRGIGWTANRRMKATQPGLTFFHLIHK 
QEGGNVGS QKGQRD \ VTNS AFHNRLHDQ 
TANCL.S 


7057 


20958 


A 


7115 


474 


224 


DSNFEKSSTLGKMLSNSIKCYREIFCER 
KSQLMQQTSMlAFKKXiPQSPQPSATTTLi 
ISQLPPTSRQNPPPAK\RLQLAEGSDGC 


7058 


20959 . 


A 


7116 


377 " 


111 


FXXNFXSFLFFSLSLTRKNWVAHITFFF 
FFFLPTPPFFYFIFL-LIFCFFFFLFYIF 
FFFFFFSFFFFFFFFFFFFFFFFFFFFF 
FFFFL 


7059 


20960 


A 


7117 


417 


148 


VPPCIiANF\NFFVETGSPYVAQAGLELL 
GSSHPPTLASQSSQITRVSHCSLLYFLM 
HSNTS ISFFS FWALVS FLEM P S S CQDG I 
TVLAR 


7060 


20961 


A 


7118 


2 


196 


RVPPPCPA\NFVFFVQTAFCHVAQAGPK 
LLASSNPPASASQSARIAVLSYCPLPSL 
RLRSHLDFF 


7061 


20962 


A 


7119 


2 


417 


EAGGSLEPRNLRLHCA/LVAPMNSRCI P 
AWAT 


7062 


20963 


A 


7120 


801 


450 


GPKFLGPPGVRIN/RRPGGGGLPRVYKK 
NNYFEDRPIVLPAFFSGGGR/GPGPHDK 
HPLPRKKQKNSPGRGGEPPFFPPPGFPP 
GGERKNPL.TPQRKRVHWGQGLAPPFPPS 
LSAKPKLF 


7063 


20964 


A 


7121 


3 


199 


DAWGFALWQAGVQCCDL.SSLQPSP/RL 
ANFVFLAETGFLHVGQAGLKLLTSGNLP 
ALTSQRL.RLQ 


7064 


20965 


A 


7122 


407 


164 


NVAGLTGAYL.DS FFGRDRV/ L.TML.L.RLV 
LNSWAQAILPPWPPKVMGLQACTIEVSS 
IiVAQ IAS KR LSG VQ E S DE AA W I RD C 


7065 


20966 


A 


7123 


33 


192 


RWDFTMLPRLVLGS KRSAHLG LP KCWDY 
KHEP/PCCPGWSWAQRDLPTLASQSAGI 
TSMSHLHQAIPLKFLEQFIRFSILVF 


7066 


20967 


A 


7124 


3 


399 


SIPVDWPSRAPTPRGRESAAHQGN/YPV 
GQRN\RMAGLSPRSFCWREVSFSR/SHS 
C/ICWVQLGKSAVLPQQSRGPGAHEGSR 
RRGGSSPPCPPLQTQLVFSPQELSMSAP 
I HRLS VTL.QGDC IS TGGAPFRCRLA 


7067 


20968 


A 


7125 


1901 


1574 


FFEMESCSVAQAGVQWHDRGSLQPLPPG 
FKQFSCLSLPSSWDYRHLPPHRANFFVF 
LVETGLHH VGQ \ SGLELLTLGD / LL.PAS 
ASQSAGITGVSHHAWAKNPCFLTSIDT 


7068 


20969 


A 


7126 


408 


234 


LPSSWDYRSPPPQANFYIFSRD/MGFTM 
LARLLTS SDP PALASQS ARIAGVSHHAQ 
PVYF 


7069 


20970 


A 


7127 


88 


277 


KLSTEYKKQRPEDSFFLFL/VEIGFRYV 
AQAGLEIiLSSSDPPTLASQSAGITGVSH 
HIQPPFKK 


7070 


20971 


A 


7128 


3 


140 


QR W WCVP ATRKAE AG E S LETE S SR LQG A 
MITP\CSPAWATEQDLAS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7071 


20972 


A 


7129 


1 


212 


PLLFVSPPLLSVSPPLLPFP/EPAASPP 
PSTHSLLPSVQSESLGQFLHPPDPPPJLA 
FPAPLKHQDLHISHL 


7072 


20973 


A 


7130 


361 


3 


RNNISQDIAVIGNDSSNRSGQSKLKIFW 
KGFTILHAIKDFCDS/WEEVKISRLPGV 
WKKLIPTLMDDF/EGFKTSVEEVTADMV 
ETETELERLELEVKPEYGTGLLQSRDKT 
LTDEKLLLMD 


7073 


20974 


A 


7131 


421 


201 


NFCIL.VD/TGFHHVGQVSLELLTPSNPP 
ASASQSSGITGLSDHSQRKSELVLSFNF 
FALFH ILLCVMFLAIDRND 


7074 


20975 


A 


7132 


22] 


529 


L I KFT IHS F YFI YLFI FFSGDGCFAL.S P 
RLESSGVISAHCNIRLPGSSDSPDSASS 
VAGI I / CMEHHTQLiIIjY/ CLVET/EFHH 
VGQGGVDLLTSGDPPASASQSV 


7075 


20976 


A 


7133 


1 


227 


KG IHFS SANDVKKKTVLTWLNSQDPQLF 
RDGLDGWYHRLQKCL\ELDEVYVEK 


7076 


20977 


A 


7134 


1242 


300 


DFVLPDGRVIP\KGIVCLINIIG\THYN 
PN\LiWPTPLEVYDPFRFNP\ENIKERSP 
LAFIPFSAGPRNCIGQAFAMAEMKVVLA 
LTLLHFRILPTHTEPRRKPELILRAEGG 
LWLRVEPLAGTSRGNPRASSRPSLSDPW 
VLRRPLPPPIFLFLLLLTLPLLFSVMAR 
EPPSPLRVEAPGPPEMRTPPAIEATPKG 
TLQPAGGRLRFLSGCVPILHQVAGHRYG 
KDKMGILQHPDGTVLKQLQPPPRGPREL 
E FYNMVYAADCTDGVLLELRQYIiPKYYG 
I W P P PAAPNDLi YL KLENVTH KFNK P C I M 
DVKIGQKKL 


7077 


20978' 


A 


7135 


1 


259 


GTS C I LASMLTRHRLRFSF/ CLFETEFC 
SVAQSGVQWYDHSS LQPQPPGLKLFS PV 
NFPSSWDYRKMPPCLASASILKL.LFTID 
LGFYS 


7078 


20979 


A 


7136 


1 132 


900 


CSGVISAHCKPRLPGSRHSPASASRVAG 
TTGTRHHARLNFC I FFLVEMG FHRVS QD 
GLHLL/NLWCPPRPPKVLELHA 


7079 


20980 


A 


7137 


1 


501 


FFLRWSLTVTRLECSDVISAHCNLHbPG 
SSDSPASAFREAGTT/GHDAQLIFVFLV 
ETG/ FHHWKDGVDLLTS 


7080 


209S1 


A 


7138 


2 


229 


WHSHGLLWSQPPL\YLSLSSSWDySHVS 
PCPAGLCLFLVETGSPYVAQAGLSLRGS 
ASQSTGIIGMSQPAGFWKQIi 


7081 


20982 


A 


7139 


255 


58 


IFGKGFPPFFCQNQGFGVPPPPRAFFFF 
FF/CPAKYYTMLPRLWSNSWAQAIHPP 
WPLKVLGLQA 


7082 


20983 


A 


7140 


422 


274 


TMLPRLVSNS\GLKQSSRLSLPKWWDYR 
YKPPCLVYTFFINRPPEVPKE 


7083 


20984 


A 


7141 


417 


60 


VWRFLEELETELPFDPAIPLLGIHLKEH 
KLLYHKDTC\WNQPKWPSMVDW I KEMWY 
INTMEYYTAIKKINAI FRDPLTSPWDPF 
SWRKFCLPLEIWPASYFLPGIKTRSLDE 
PLHIKGER 


7084 


20985 


A 


7142 


391 


70 


SPPPHPPRKGDYRGGPP/RPGKKREIFF 
FFFFLVDTGFCHVAQAGLKLLASSN/IiP 
TSASQSAGITSVRHYIQPTLLKI IPQFL 
DILLFLKIFHSFLKSFNFKIiGSFY 


7085 


20986 


A 


7143 


405 


304 


FCRDR/SLLMLPRLDLNSWAQGILLPWP 
PKVLGL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ 11) 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7086 


20987 


A 


7144 


3 


159 


HASEKIWYMYTMEHYA\SQKKNKIMSFV 
ATWMRLEVNILSKLMKNEMMSVNGR 


7087 


20988 


A 


7145 


343 


115 


CEFYFLSCIYVFIYFCR/DKGLSLLLRH 
VSNSWPQVTLLPWPPKVSHQAQPEFYFR 
NKQTKIDLSFEETAADLAYYS 


7088 


20989 


A 


7146 


453 


119 


RCWKGCGLiWISLRTT/WREWQLVQPFWK 
TVWRCLRKLKVEPPYEPATPLPGIHLKK 
TKAVAQRDM CT PL F 1 VAFQHHLLKRLF F 
SRCVFLAPLLKIA 


7089 


20990 


A 


7147 


3 


105 


TIEWGEVIS/SSYSSTMANNEGLFSLVA 
RKLSRPL 


7090 


20991 


A 


7148 


295 


23 


FANIFSHSAETLQTRREWGPIFNIL/EK 
NFQ PR I SHPAKLS F ISEGEIKS FPDKQM 
LRNFITTRPALQEFLKKEFMSFPGTWMK 
LETIIIS 


7091 


20992 


A 


7149 


1 


161 


RTRGFFYLDGVRSLAMLP/RLVLNSWPQ 
AI LLP WLPKVLG S QTLAMVAAYTVF V 


7092 


20993 


A 


7150 


1 


219 


DYRCTPPHPAHF/SIFIFCRQR/SLTMF 
LSLVLNSWTQAILLPWPPKLLGLQACTA 
MPI PGVELLWVSGCFVKES 


7093 


20994 


A 


7151 


177 


34 


ELVS FLS FSLSLS FFPSFLPS FLPFLS F 
LPSFNPSF\LPSFLPSFL 


7094 


20995 


A 


7152 


398 


40 


MNDRVLLFHFGWNAVAQSWFTGASN\ FG 
LKQSSHLSLLCSWNYRHTPQCPASLLAY 
VCMYVCMYVCMYVCRD/RGLTMLPWLVS 
NSRPQAI LPPQPS KIMG PLLGAVAHAYN 
TSPKGSQRR 


7095 


20996 


A 


7153 


418 


179 


RNSCTQMRNEALPNSGYFPKLKYYYDF/ 
CQRQDLAILPRLVYNSWVQAILLPWPPK 
VLGLQHE P PCLAVI Q FLKGNLK I V 


7096 


20997 


A 


7154 


767 


537 


QALLCLPGWS \AVAQSWLTAASNSSDPP 
ACLPSSWDYRHVPPRQAEWSFRVSVSSP 
VKRDPGSPWRRMRPCSE 


7097 


20998 


A 


7155 


421 


301 


HAQLFFCRHW/SFTMLPWLVSNSWAQAI 
LLPWPPKVLGLQV 


7098 


20999 


A 


7156 


410 


269 


AGGPQTGSFFFF/ CYERGVSLLPRLVSD 
TWSHVILQPQKPiCVLGLQA 


7099 


21000 


A 


7157 


3 


237 


MQI KA/ TMLYHFTP TNMTVLG FKKNSRG 
WQARGEIGTLVHCWWGCIMMQPLWETVW 
HFLKRLNKLGVWHACSPNYSGG 


7100 


21001 


A 


7158 


435 


211 


L FLVEMG \ FAVLAS LVLTLE \ PRDS PTS 
ASQRARMTGMSHHTQPLFCLFVS KFHLA 
KKLSL I WQNLRRAG I CRLS 


7101 


21002 


A 


7159 


3 


220 


SSSASQSGGITGVSHHARPSERQ\SCSV 
AQAWQWCNYSSLITPTPGLKRSSRLSL 
LSSWDY\GALPHPSVAS 


7102 


21003 


A 


7160 


1 


172 


FRKVLGRTGSNGLCTQVRVE FMDDTSRS 
IMRNVK\SPVREGDVLTLLELELETLRL 
R 


7103 


21004 


A 


7161 


1 


180 


DAAPVLKATARPTRPDRWK/ IDIDSLDI 
HGETPSLQKNTNTFRAWWCI PWPATRE 
ADMF 


7104 


21005 


A 


7162 


3 


108 


KKKKTGRFKLV\MQYNRRFVNWPTFGK 
KKGPNANS 


7105 


21006 


A 


7163 


409 


110 


GQHGLTQSLPLSAPPRGAVPRHAPRPD/ 
SGLSLSPLLPTAFATTGASSANRFVS IG 
PRDGNFLNI PQQSQS WFL 


7106 


21007 


A 


7164 


35 


371 


YFHHLSLLLKFLLASMMVFLTVWRLiLiE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN ' 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucie- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, K=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














LLF F FLGETE P AFVAQ PGR \ QGPNLGSL 
RPHFPSVKHFPGLTPPRAGTSGLDPPSP 
AIFGFFKKKRGSPCGPGRAWTPGPRGGP 
P 


7107 


21008 


A 


7165 


3 


382 


YTGPAWWSMLWLATEEAEVGGSLEPRS 
SKLQ\QP PLRHCTPA WAT 


7108 


21009 


A 


7166 


1 


305 


TAVTVDPYHSVY I EVNARPGACMNGGGL 
LNGEQNSLKRKQLVQTLNKRHGEELAMV 
DSLTLGPKLEEA/ASPPAFARVRLRARQ 
ACRRKGSRIjCWEHLVSKV 


7109 


21010 


A 


7167 


383 


55 


RDLSMVRMKSMLAI ALCFTAIjMGMFNS I 
FYGRWAKLPFPPIjSYIQGLSHRNMMGD 

dttdcsfiflyifctmsir\qniqkivg 
lapsraatkqaggflgppppcgkfs 


7110 


21011 


A 


7168 


465 


166 


S P QGAQ I KG VGFQKLRPWG P \ PG F PRGK 
PFFWFKKKLPPQGGGPLCSRFSGGLSQK 
N\SGPWGE I FPWGQNGPPPS P PGGKETF 
FSKKKKKKKAVDYLNS 


7111 


21012 


A 


7169 


2 


323 


IQGFKTLSEEVSADEVEIVRELGLQAET 
EDVTELLHFNSTACCREIFHERKRQQMQ 
QTSLVSHF\RKLIjQSPQSSAIATINQSQ 
PSTLRQDPMPATRLHLTGGLDNH 


7112 


21013 


A 


7170 


3 


509 


FGTIPLVLCADLNSLPDSGWEYLSTGG 
VETMHKDFKEIiRYNESLTNFSCHGKNGT 
TNGRITHGFKLQSAYESGLMPYTNYTFD 
FKGIIDYIFYSKPQLNTLGILGPLDHHW 
LVENNISGCPHPLIPSDHFSLFAQLEL/ 
YTAF P APS QRHP P S WQEWKHLQRTALI 
HL 


7113 


21014 


A 


7171 


1121 


320 


SSCFQTSSHACFVTEFVPGGDPMMQIHE 
DVFPEPQARFYVACVVLGLQFIjHEKKI I 
YRDLKLDNLLLDAQGFLKIADFGLCKEG 
IGFGDRTSTFCGTPEFIAPEVLTQEAYT 
RAVDWWGLGVLLYEMLVGECPFPGDTEE 
EVFDCIRLHGRPPTPAFLSVQG\VKFIQ 
KLLQKCPEKPLGAGDQNAEEI KVQP FFR 
PTNWQALLARTIQPPFVPTLCGPADLRY 
FEGEFHRAA\ PALTPPAPHSLLTARCX2A 
AFRDFDFVSERFLEP 


7114 


21015 


A 


7172 


355 


3 


GPLLSMLGMNILFLTLFLLLPLSMLEGG 
VCGNF/ I TNC CLE I DDNR KVI EDI TAK I 
QKLAHVPVQ IWKGWS PDTLFGGVJFS S LG 
RFKTLVG I VLVILGVCLTLPCLLPLLVK 
NIQSANV 


7115 


21016 


A 


7173 


184 


2 


EKKVEDDDDDDDFEMESRSVIPAGVQWR 
NlALHPLPSGFKPPSCLSLPRSWEHRCA 
PLHQCI 


7116 


21017 


A 


7174 


1190 


407 


SLKRRPSARGSRPMSMYETGSGQKPYLP 
MGEASRPEESRMRLQPFPAHIGRSALVT 
SSSSLPSFPSTLSWSRDESARRASRLEK 
QNSTPESDYDNTPNDMEPDGMGSSRKGR 
QRSMVWPGDGLVPDTAEPHVAPSPTLPS 
TEDVIRKTEQITKNIQELLRAAQENKHD 
SYI PCSERIHVAVTEMAALFPKKPKSDM 
VRTSLRLLTSSAYRLQSECKKTLPGDPG 
SPTDVQLVTQCAVI PVCVRPFAKAAKQL 
VTITTKENNN 


7117 


21018 


A 


7175 


3 


136 


AIHLASLKVFYRYRVLSL/ LKRLVSNS W 
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PQVII.LPWPPKVLGLQV 


7118 


21019 


A 


7176 


3 


316 


GIVKFFFFFFLTKCRTVTQAGVEWCSLG 
SLQLPPPRFKQFTCLSPPVSWDYRHLPI 
CLANFCI F \ VETGFHHVGQMGLELLTSG 
YPPTSAPPDAEITGVIHRTQP 


7119 


21020 


A 


7177 


529 


217 


LLPRLEYSGMVIAHCGLQLLGSSQATLG 
ACYCAW /HNFKAFFVETKSHYVTQAGLK 
LLASSNPPALASQSVGITGVSHCAQATL 
PFKTVDKGPSFNLFSFTPPLL 


7120 


21021 


A 


7178 


3 


150 


YTLCKS KKD I KS I YH I MT / HLKTHTHTH 
THTHTHTHKPSTTFSYTETFIM 


7121 


21022 


A 


7179 


540 


69 


ACRGAHVQVGDGSALTYVEDGTACGPTM 
LSLiDHliCLPALAFNFSTCTDSGERRICS 
HHGVTAWSLGWREKLTRGDRPLLTS PGP 
ALP I G \ CSNEG KC I CQPDWTGKDCS I HN 
PLPTSPPTGETERYKGPSGTNI I IDS I A 
GAVLVAAIVLDETGWGYK 


7122 


21023 


A 


7180 


452 


279 


CIDSIYQEIYHLSISILSVYLiSI YRSIY 
LSIYLPIFW\SLYLGPRFLRTRDPQLLS 
LT 


7123 


21024 


A 


7181 


501 


183 


HVRCLAFICALIGFPSGYFSIMPWT/V 
DLVG I EHLPNAYG III CVNG I S ALLGPP 
FAGW I YD I TQK YDFS FY I CGLL YMIG I L 
FLLIQPCIRIIEQSRRKYMDGAHV 


7124 


21025 


A 


7182 


446 


245 


IiRVRS LALNQVSS AS R YWDTG VATKLAT 
F/SLFWKDRNLDILPSMVFNSWPQVIFP 
SWPPKVLGLQM 


7125 


21026 


A 


7183 


420 


73 


DSSDDSGQSKLKP/FWKGFTILDAIKNI 
CDSLEDFKISLTGAWKNLIPPLMDDYEG 
FKTRVEEVTADWEIARELELEVEHGLG 
AVAHAYNPNTLRLRSLKPRSLRPGQHSK 
TLSL 


7126 ; 


21027 


A 


7184 


567 


174 


F FLRDRVPS LTQAGVQWQDHS SLQPRPP 
RLKQSSH \ PALLSS WD \ YRHRPLCLAKF 
LIFLQRRGLAI/FARGWSNSWSQ/CNL 
PASVSPSAGITGESHYSQPFLLSDSQTK 
VYKTHRSTISSTITEMGCERKD 


7127 


21028 


A 


.7185 


345 


59 


RNGSSCLWQVIQSLTLLPRLECSGVITA 
QCSLGLVGSSNPPTSASC\VESHCFAQD 
GLELMGSSNPPTLAG I TGMSHCAQPLYD 
LFLYEYGLSVHY 


7128 


21029 


A 


7186 


1 


209 


LSWLEIGSHSVTQAGVQWVHSSSLQP* 
MLGUKP S SR FSLP KC WD YRHEP WCLPGL 
LRLKFDPSVGGVA 


7129 


21030 


A 


7187 


3 


225 


EKTPVS DRG AKGCTE S LVNRRP CFS ALE 
VDETYVP+ EFNAETFTFHADI CTLSDKV 
RHFLKP I ALVI LNHH I F 


7130 


21031 


A 


7188 


1 


377 


ES *RNHNGDEDDSHVRAQGACIEAKHDK 
YMADMDELFSQADDNRKKRDI PD YLRGK 
ISFELMRQPCITPSGITYDRKDIEEHLH 
RVGHFDPVTRSPLTHEQLIPNLAMKEVI 
DAFISENGCVEDY 


7131 


21032 


A 


7189 


1 


359 


SRATDKVQRQGHG*GIHNYSDWENTDWF 
RETKNQSVPLSCCIETVSNGNGSLAHPY 
DLYAEGCEALVGKKLLEFMMKVIWAELA 
FAAI QLL SMLS AC IVMCR KSTDPAYELL 
ITVGTYA 


7132 


21033 


A 


7190 


496 


118 


QNQKFPFEPGTS*LGLYPKKYKLIYHKD 
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PKTCMFTAALLTI EKPWNHHKHLSMADW 
IKKVWYGTVGHACNQSTFLFWFFFEMES 
HAVAQGGVQWLNLCSLQAPAWATERDSS 
QKRPKIDPHKYIVN 


7133 


21034 


A 


7192 


1337 


342 


I S E ATRTFRES IWQFQSS PREDDRNRQI, 
S ASGSRQSHPGPRS TPG PGQGHRS SHQG 
CAPVSSAARPGGPSFRLSEGALSRSQFS 
VLV Y IP* VAAGQ FVICACLG AS S P P S RL * 
GGLGTGHRTPGKEARSWPETSAPPAWHL 
LRSLCLPSRGEKLAQARSQKHLGWKWLS 
KLCPKTKYSWSGPDDSPSVLRDPEAGTT 
RWAPSGNPEEEWLQLKAKYLQNISEVL 
SVLHLLHIKALQLHSTNPFFVNARHLLV 
GQ IS SALTLNNLSQ ILFLLLRFPFLSQV 
ERHSPLQKTGTTETNLVLFQDTVPMNFP 
S S I LL P S KYS CPALDKE AS H F YQ 


7134 


21035 


A 


7193 


469 


130 


SASWDYRRPPPRPSNFLY* *RRGFTILA 
RMVSVS * PRD P P AS ASQS AGCAYS I YLQ 
ERFARLHPACLAPRYMASDPLSSLHHSV 
PPGLHLEHEHWL WHCLLPGQ I CRPGKLW 
Y 


7135 


21036 




7194 


430 


149 


TVWQFLKQLNIELSCDPVILLLGICPRE 
LKTYVHTKTRT*AFIVALCTIVPKQKQL 
KYPSVDEWVIKMWYVHTNDYFSVIKKNK 
LAKHRGSHL 


7136 


21037 


A 


7195 


257 


412 


TATSNKEPLKKVRCLIWRKKDCWARGFT 
ACNPS TLGC*GRW I MRSRDRDHPG 


7137 


21038 


A 


7196 


1 


124 


PLISGWHPCNPSTLGGRGWRIA*VQEF 
ETSLGELAEPCLY 


7138 


21039 


A 


7197 


2 


120 


YGI ILFITSEVFVFAGFF* AFYHSSLGK 
KISKKESVIIY 








7198 


234 


416 


GVHAE VLI SPG PQDRLRALW I P 1 1 FG I 
bFAI L.LVLVF I KKVAKKPTN KVGHP * E P 
GTEF 


7140 


21041 


A 


7199 


372 


3 


VHYPVPQTGSPCCVPHPRGLSTIHPGLH 
S RYTGLGMVAHACNCSTLGGQSGKIA* A 
LEME I SLGNMVR P CLYERTNKQS KPRHM 
GITFGLTFSLTLTDKYSLI ISGFYLPQV 
LISLHPLGPY 


7341 


21042 


A 


7200 


396 


21 


GLKESHKCCAI PACLQRDPLVRVLGKAG 
F*RKSLSISTELQSLFDSPDFSKITGKP 
I KLTQVEHRAGFEWNEDGAGTTPS PGLQ 
PAHLTFPLDYHLNQPFIFVLRDSDTGAIj 
LFIGKILDPRGP 


7142 


21043 


A 


7201 


2 


148 


REPPHHSLANF* KFFVDKGB-TDLPRLVS 
SSWPQWIiPPRPPRELGLQA 


7143 


21044 


A 


7202 


1 


291 


AEGFRMHALVSQAACGTSGYKLSVGYMV 
QGPLTHLMLGLLISLIFLSRGSGRTWAF 
SHS* YKTSDDLPCRNRREVIEFLHYRNL 
HSH I CLS VTKTFI. 










262 


406 


ILARMTS WNS IKHL.ILFLK+ S SHLS I P 
S S WNYRCVLPHQANFLFLFL * R* S FTLM 
PRLVETVLLIQNSPTKIDSSPDDFSGKF 
YQTLN I VFENIRRGNTSNLLHKNSRILI 
PNLTKTLHD 


7145 


21046 


A 


7204 


2 


247 


DrVLERSGQGTAALPRSTSLDRDWAITF 
EQII1ASLL1TEPALVKYFDKPECMMARIT 
SAKNIRMMGKPLT*ASDYEISAMTG 
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nucleotide deletion, \=possible 
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FFFLVETGFTVIjARMVSVS * PRGLPASA 
SQSAGIAGVSHRAQPALRFLMLENRMC 


7147 


21048 


A 


7206 


1 


237 


GKCAHTHTHTHTRDTHRHAHAHTHACTH 
THTYVQGCSLSQWSQDCVRMWACHIFGV 
PRE+ASRYTSSLVYKPGCTGVCG 


7148 


21049 


A 


7207 


2 


239 


KVDSEHKLERSSGGVLGRFKGKGSGGIiN 
LGNFFASRKGYSRTGFDRLNTEGSDQDK 
EDDGSESE*EYS*PMPALEPCSS 


7149 


21050 


A 


7208 


1 


405 


GSRDNMNVILICFPNAPKVSAESVKNDA 
QLDKYLECTAQE I I *NHWEDVPDLFHVM 
RTLPSDNMPSLPPGGELASMRTVIEAVY 
YRLNPYINDDTDSTSTDDMWFNCSSSHG 
VYLDLHRRVQVNFDETFSMHPEL 


7150 


21051 


A 


7209 


2 


761 


QDFGTRFATVLIiFYGYFKCTRTL.KGTCL 
YNAILYKVCSPRNDQPDVCYNPSFFLLP 
IKTGELLGFLVYASREKRS I AI GD* KDD 
KWPPERI IQYYGPAT*AQDSSWGYRI PI 
YMLN* I IRLQAVLEIITNKTGRALTVIiA 
LQE I QMRNA I YKS RkALDYLLAAEGG I C 
GKFNMTNCCLHIDDQGQIVEDIIKITKL. 
AHVPMQIPDIVWKNIVKIPVLFCSVLIT 
GTCSPQSGTPCWLSRSRPSHVDPFRWS 
P 


715) 


21052 


A 


7210 


147 


390 


KTGLSDQMPRAI QLSYSYLS PLLY * * S Y 
KKKKKKKKKKKKKKKKNFFKKGPGN* SF 
WGGEKKSGPGAGLKNPGGGKNTFY 


7152 


21053 


A 


7211 


433 


214 


WGRGNAL.TIHPPWVPFQESETTPSPL*G 
ITLLDNPFSPIPKQPPPPH*SITVPYLL 
SSPQPGFPHPFQAPGLA 


7153 


21054 


A 


7212 


532 


253 


KWINVFHKAAIMSDFSDRSRHSTLKAS 
E*GFTMIDAIKNICDL*KEAKISILTGI 
* KKS I PTLMDDF * GFKTPML* VTANVGF 
QRFSKKFYW 


7154 


21055 


A 


7213 


456 


24 


DAQAFSWYSQGYFNSSQGFS ISLPGDTS 
PVKIKRMGGPLFFS*LLKSTCWACSKTS 
RVPGKVLKTGLPRPR*PVYKDPLGGFYP 
RF 


7155 


21056 


A 


7214 


2 


445 


ILRDLCQRVPTWSGFPSWAMEliLVEKAI 
SSASSPQSPGDALRTVFECISSGIIIiKG 
SPGLLDP CEKDP FDTLATMTDQHRED IT 
SSAQFALRLLAFRQIHKVLGMDPLPQMI 
QRSNIHNNRKRRRDSDGVDGFEAEGK*D 


7156 


21057 


A 


7215 


203 


3 


VRPGWAHACNPSTIjGG*GGRIA*SQEF 
ETNLGNTVRPCRNSGEVIiNRKTVSIRRA 
LRVFS plhta 


7157 


21058 


A 


7216 


1 


285 


GKCAHTHTHTHTHTHTHTKMHTRTHAHT 
HTHM CRGAV* VRGARTVCTCGS AP FLE S 
PGKEPKDTLRPLFVSQAARAFLGEKAFC 
SVYRQREPPPI 


7158 


21059 




7217 


1 


386 


GDSNCE WS VL. YV I IATFG I G VALG I LS W 
TVICMLKRQNGKPKMKSKYKNLDATDQE 
SL*LKPASRAGIKQKGLLLSSSLMHSES 
KLDSDDAIFTWPDREKGKLLHGQNGSVL 
NGQTPLKATS PREE I L 


7159 


21060 


A 


7218 


70 


379 


NILYFNIFFFFK*FIjKFL.WVFFFFFFF* 
VFFFYWFFFFVFFIiCVFFFFFVKIiKI FF 
WGPFFLFFRFFFCLGFFGTFQSWRWFFA 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
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LFPWGLMFLFPLLFFLVFF*LVFF 


7160 


21061 


A 


7219 


1 


208 


PRLAASVS PGDMLGMQVLGLGTVAS ACN 
CSTLGGRGMG I T * G * EFETSL VKKARTC 
FYKRLKRQKSANL 


7161 


21062 


A 


7220 


455 


96 


HYLYGSQGLSRFYFVTQGRVQWHDHGSL 
QPPTPGVKQSSSLSL.PKCWDYKCEPPCL 
ARPCPFFFFL*TSSVDTGLLLNQQPIL+ 
IMTFPYPLLRKSLSFHHQIYEPTSICSH 
LLSFCYT 


7162 


21063 


A 


7221 


201 


319 


I YELWPGPVAHACNFS ILGG * GEQI AQV 
QE FKTS LGHLG 


7163 


21064 


A 


7222 


307 


3 


NRRQAGPFPAKLGRPSPKGGFPNFFKLF 
FFKSSF+KNPKGQGWGFPPLIPGFQGPQ 
VGGSLGAPGLKPPWGTPQNPFSKKKKKK 
RSNCTDTGTTIAHRRQM 


7164 


21065 


A 


7223 


331 


141 


SETITANVSPNL*DANDVPIQCEISPLI 
SYAGEGLERYVADKEFHAPVI IDENGVH 
EWKNGI 


7165 


21066 


A 


7224 


157 


314 


WLPGFRDGGEDGVCHVAQAGPKLRT*GD 
PSIKASQRAGITGLTHRTGPK*KF 


7166 


21067 


A 


7225 


2 


392 


RDVRSLQPIiPPSLK* FSCLSLLSSWDYR 
CAPPTT* LI FILLVEMGFHHIGQAGLEL 
L.TSGDPPASASQSAGVTGVSHRTRPPSF 
VLVNPIPPLLPPVPSPDGPNISSSTTSW 
RGYPEVLRWGLCSFWGCG 


7167 


21068 


A 


7226 


84 


204 


FIFFFFKR*SLTMLPRLVSNSWARVALP 
PQRPKVLGLHV 


7168 


21069 


A 


7227 


3 


307 


DRAAPRGAGRAGAVGPHRTGDWGLAGTP 
AGRGHARH I GRG P KQGRAAGQRGHLHPG 
GLLGRGGNS EP * RKQRTPGS MRRAAPLL 
STGQVWEGPASVGDRVP 


7169 


21070 


A 


7228 


52 


1176 


GSQQPLLHISEAQGDTLiIKKAPAEAAWP 
PLIAFSSGGRLPCSHES*SAAGYKRLDD 
TAAGGPPSPGGSPRSPATAFGLHPVWGG 
QLRPRGGRGLPATGSGEAFQALTQLSRG 
RACTGPPAPGSEDAAPSGCSAARWSP*M 
AGSSSCAIPDSAASCICPQGPPWRW*SR 
APARNRCSASSGRSRC*QLRRRRYIPGL 
LHPVLHEALPDS WGCE TQLGPPLAS EAA 
PPRSRRREEG*RGSGRC**KPSLPRSWS 
LCRNQAEAGFVSILVWLSGLAGAPSRPA 
AAAPQIAPNQHRS PP PRLAPVPAASGSP 
WPWERDLGVLETLNVGNFAQDPNPSACS 
GPQSTPDTYYPVQKPKTGLRLHFHSRPN 
TIVKSVGNMCL 


7170 


21071 


A 


7229 


3 


112 


ASNLGSRGCSEPRPCHCTPAWV*E*DSI 
SKTNKQKN 


7171 


21072 


A 


7230 


103 


299 


AMSQIRNAIPFTIAT*RL*YLEIQLTKE 
AKD I YNENYKI PL KE I RD VTNKWKN I PC 
S*IGAQLCK 


7172 


21073 


A 


7231 


2 


137 


SLTLSLSICFCL*RWGI,SVLPRLISNSW 
PQAIFPPWPPKVLELQV 


7173 


21074 


A 


7232 


3 


273 


LTHIVDGPNFLTTMYQCL,MCVC*QELGY 
QP VTTERCHNVCNDCLQRS FKAQVFS CH 
ACRHDLGQWYIMIPKEIMQTLLDLAFPG 
YSLGR 


7174 


21075 


A 


7233 


338 


136 


RPRWEDCLSPGVQNVCGQHSESPFLQEI 
* KI SQTWWCKPWPS YQE VEGELLE I RG 
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G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possiblc 
nucleotide deletion, \=possible 
nucleotide insertion 














STCFGLPKCWD 


7 175 


21076 






437 


261 


HLGFPECWDYRHEPQHPT*FLEHFM*RR 
GLTTFPRLVWNSWPQAIFPRWSPKVLEL 
QT 










4] 




vtvmeikgkinlvldi ikdiyvegfsty 
YF*RQGLTMLPRLDIjDSWAQAIIjPPQPP 
E * MGLQECNHHAQIjSRYF 


7177 


21078 




7236 


1 


154 


GTRDFSVKTL*ARREWHDIFTGMKEKHF 
YPKKVYALKI PFKNEAETKKVEV 


7178 


21079 


A 


7237 


1 


179 


SCLSLSSS*DCR*PPPCPANFVKSFFRN 
GVLTMLPRLVSNFLPQANLLAQPLSGFI 
ILL 


7179 


21080 




7238 


340 . 


140 


KINWVWWPVPVI PVTPEA* AQKLP* PGK 
LKFPLT* IPPLPSRLGHQSQTSFPKKKK 
KFYILSSDSV 


7180 


21081 


A 


7239 


2 


175 


QFES KPS PAWATEQDPVS KKKKKKKKKK 
KKKMKKKNSS KFKM I P FTKRQ Y * EGNW K 


7181 


21082 


A 


7240 


336 


192 


QSETPSQSNQKFIF*GQGIALLPRLVLN 
SWAQAILPPWPPKMLGLQA 


7182 


21083 


A 


7241 


1 


186 


DRPGQHGEMLSLLIFCFFCRDGGLTMLP 
RLVLKS*AQVILPSRPPKAI,GL.QDTVHC 
SKFDAV 


7183 


21084 


A 


7242 


2 


111 


VKTLNKLGIDGTWLRIIRAI F* KPIANI 
ILNMCSFL 


7184 


21085 


A 


7243 


360 


217 


LPSRGDYKRKPPCL,VNFYFL*R*GLTL1j 
PRL I SNFWAQM I LL PQSPKC 


7185 


21086 




7244 


35 


276 


VQQQRVKQGRKYYS IVLKN KKKLCN YLR 
VGE*MG*RNIGWVQ*VAHTCNPSTLGGR 
GGRII*PWEFETSLCNIG+TCISG 


7186 


21087 


A 


7245 


1 


356 


IQHTFSLL1 FGI KYS YYVLKI VS WIAAV 
AHACNLSTLGGRGGWIT*GHELETNLFF 
AVKPWVYDKNS FP PTLNLRFAKS3HLWE 
PRGKNHGPPETPPLSRVSPPQRFFPGGT 
RPLKKE 


7187 


21088 


A 


7246 


3 


293 


CKCGKAFHNFYS FQNHE ISHTGEMLYDI 
RNVGK*SIFF*HHCQHKTTHRAEKLCEC 
KICREAFSHFGNLKVHEAGRSGSRL* SQ 
HLGWANCLTSGI 


7188 


21089 




7247 


6 


387 


ASIPCLTYPSWEFL* FI*QAGHFSFSKK 
KLFICAFFSNRDEVSLMLPRLGLNSWAQ 
VVLLPWPPKVLGLQVRATTPAKLGIFWG 
FMSVSLGQGVCLCRSLLCPLPLGPGQVH 
SGCSQSFVEKMNKEL 


7189 


21090 


A 


7248 


373 


146 


HLGKKVLI FWPQ YLALLNPPNGGVSR VY 
PRCRPLMVFFFFFFFC*GRGLAMLTRLV 
SNSWSQVI FQLQL P KVLGL 


7190 


21091 


A 


7249 


2 


184 


AI LRG I * KKF ISTLMDDFEGI KT S LE E V ' 

TADWEIAREVELEVEVEPKDVTELLQS 

HDTT 


7191 


21092 






795 


544 


EMKSCSCCPGWVQWRDLGSLQPPPPRFK 
RS P AS AS LVAG I T AACHHARL I F VFL I D 
TGFHYVG*AGLKTPEA*ATAPGPIFLYF 


7192 


21093 


A 


7251 


34 


242 


KNTSLKKKFFWCQKLRNVSLVEEFAFWP 
GAVAHACNPSTLGDRGGRVA *GQE FQTS 
LSNIGRPRLYRKI 


7193 


21094 


A 


7252 


216 


57 


NIQVLKYVLTSYLKKKKCAFRLGPVAHT 
CNPGILGAQGRRIS*AQEFETSLS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7194 


21095 


A 


7253 


126 


366 


INLTECQQYLDKAIFFFFLKKGLFAP*A 
EWQGGSFASLQPKPPRLKKSSHLTLLST 
WD YGGALPCRANFG I F FFWKKGG 


7195 


21096 


A 


7254 


2 . 


213 


RPTWATWQNPASTKKKKNSFFFETEFRT 
VAQAEVQGGNLG*LQSLPPGLKRFSCLK 
MPEAPKGQSCSLIG 


7196 


21097 


A 


7255 


150 


263 


EGVSIiLPRIiACNGAILAHCNLRLPTSSD 
LCAFS * TTTRYYKYSGYCFSHPNLMLI C 
SRFTLLSSHSSLGHLTLFLLiIVCFEKES 
HS C PGWRAMVLS WLTAT FAS RRQATFAP 
SLELSR 


7197 


21098 


A 


7256 


386 


93 


PPGGLPPQSIPSPRAQLKPKSPRGKKRG 
GPKPGGSQPGFWGFFFGPGPPFKRGGGL 
NWVLGAKN* KGVKKNQKKKKGKMPQETK 
RGNRS I DMKFGGS 


7198 


21099 


A 


7257 


153 


369 


KENSRLQGYGPLFSHS*FTNSCPGQVQW 
LTPVILALWEAEVGGSKDQPGQHGKTPR 
LLKIQKFARCGGGRL 


7199 


21100 


A 


7258 


180 


3 


CSTLI * IRKVWLGAVAHAYNPNTLiRGRG 
GR I A * GQVFKTSLGNNVKTCL FLPS PHN 
QQ 


7200 


21101 


A 


7259 


22 


265 


QLRAIPDLKLDPLMEGKEILSRTLLGSG 
TVAPICDPSTLRGQGGWIA* VQEFKTSL 
GNMAKPHLYRKKKLLGQVTKLEYKW 


7201 


21102 


A 


7260 


326 


2 


KSAMQ I CIVDPLNAFCLE I CCHANLHCR 
PSNAFCFLKVDNVRAFVS PS I PQVSL I L 
KNFSWPGVVAHTCNSNTVRGQGERIA*A 
QEFETILGNTGRPCLYFLKNNRKS 


7202 


•21103 


A 


7261 


1 


68 


VNLCFILWLGPGWAHACNPST*GGRGG 
RIA*P*ELETCLSNTA*PGWAHACNPS 
T 


7203 


21104 


A 


7262 


282 


176 


GLKKTFFFIiVRERVLLCYPGWSAW* SQ 
LTAN* TPGLKLFSCLSLPCS WNHRHVAL 
HVAENIC*CKMPWPATWEAEAGEQLEP 
RSSVCSEL 


7204 


21105 


A 


7263 


588 


351 


NNTVCIiLESISSVNLCLPHLQHSPVKLE 
IFLFFFSSDNFPSTHHITVRPLSPS*AI 
IFPVTCTYTSRWPEATKDPQKK 


7205 


21106 


A 


7264 


327 


2 


KI KER INNFI FI CRDRVS I CCPGWNAVV 
TLNSWAQAIRFCLSLMSS*NYRCAPTHQ 
QVTT I FFCRDQ I E IHMVPKYHLGNTLNY 
FFVLFCFVLFLRHPFSAHRNFRLP 


7206 


21107 


A 


7265 


121 


270 


AQEAG FE LLTSGDL.PTSAS QSAE I TGMS 
HRALPSC Y* FYVTLHGLTVACA 


7207 


21108 


A 


7266 


3 


227 


SSQVSLPNNWDYRREPPCLRFVFLFFVE 
TRSCYVASLELLGSGDPTVLAS *SARVI 
DMGHCDWPNLSEILLHVR 


7208 


21109 


A 


7267 


300 


1 


KMLLNT IMCNRE I LCKRKS PRVQQTS FV 
SYFKKLPQPPQPSATTTLISQQPSTSKQ 
DPPAAK * h * FAEGRLLAFPSNNVLYQLR 
YIMCFLPHTGIVHLVN 


7209 


21110 


A 


7268 


29 


344 


ASLLALKKKSGLGAVAHAHNASTSGGRG 
GRIT*AQEFATSLGNMARTCLYKKKKKK 
KKKGKNFTGVGGPQRGAPI FGP WGGGKS 
P APG KPGGPNP F PG KGAPTL F 


7210 


21111 


A 


7269 


3 


209 


LHPLSPGPPAPVAAGIPGPQASR*GPCR 
PDPSPDSASVDASLPGPALSL.WSWGSNR 
GEYGTTPASQIH 



994 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7211 


21112 


A 


7270 


86 


206 


CQ PG WAHAYN P S T WGGRGRQ I T * G QE F 
ETSLGNTARPPL 


7212 


21113 


A 


7271 


45 


274 


GXjGTVAQACDNPSTJjGG * ggw ia* gqef 
KTINNDKSNKWQYLAI IEQINNRMLICK 
NKKELTD I CNNTDE FQKHRE 


7213 


21114 


A 


7272 


1 


122 


VLVETGYLRVGQAGLKLLTPGDPPALAS 
QS AG I TGISHRA* S YMHF* L VETGYLRV 
GQAGLKLLTPGDPPALASQSAGITGISH 
RA 


7214 


21115 


A 


7273 


3 


242 


LLKQLDYRPPHPAKFFFFLEFLVKKGFH 
L.WKGGFKTQTSGGLPG*ASKGVGIPGV 
SHCPGL IG VFNLTLKVAVSLMDL 


7215 


21116 


A 


7274 




169 


LAL.VAQAGGQWHDL.GSLQPLSPSFPQFF 
YLSLPSS*DYRHEPPHSAWFSFFSNYR 


7216 


21 1 17 


A 


7275 


353 


76 


FAHLGLPQCWDYRREPLCAT* KLFLKGK 
* PSKFQVI FSS LGTGHFSNACFS C VMWT 
IVLL.I INFWVTGIDGRTSGSSSVKCGNH 
SIYPMVLL 


7217 


21118 


A 


7276 


6 


87 


RRGLTLLPRLGSNSWPQAI LLPWPPKVL 
KL>*AQTPGIiRQSSCLGLPKC 


7218 


21119 


A 


7277 


1 


203 


FQVKMLCSGWMWLTPVIQALGAVAHTGN 
PS TVGGRGGQ 1 1 *S * * FYTS LDSGNPPA 
SASQSAGITLA 


7219 


21120 


A 


7278 


272 


51 


IFFPVTM* CGGHDVQHHFNPS *NNLSSWS 
SWFMPIIIjPLWEA*AGK*LEPRSLRPAW 
ATKLDSNSKKKIIEIHA 


7220 


21121 


A 


7279 


1 


131 


SRDYRHGPLHPTDFVFFVEMRFHHVAQA 
GLKLLGLSDLPALTS * SRDYRHGPLHPT 
DFVFFVEMRFHHVAQAGLKIiGLSDLPA 
LTS 


7221 


21122 


A 


7280 


411 


36 


FLECVPNFFPPKKKFFFKKNTGQVFFPV 
TFL*KGPALFFFIFPFYGKKFFFLSSAK 
DFPPRVFF*KGPLFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFCFKHFIYKKVHF 
♦SSVHFQPIIFF 


7222 


21123 


A 


7282 


395 


181 


RKHES IHHDSVISVTFNMQSLRLFFFIF 
FILSL'LFRFIFFFSFFLVFFFFFFCFF 
FFFFFFFFFFFFFFFFF 


7223 


21124 


A 


7283 


392 


2 


IKFFGYNV*FNYILKNGLF*QLLI*LIN 
LFILRQGLIFWQVCKVLVFFFFFFIFF 
FFL.FFFFFFFFFFLLFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFLTQNMLFIVK 
FCKDTFKQQNIHLRTRGR 


7224 


21125 


A 


7284 


113 


291 


YVCVCVCVCVCVCVMEY*SAIKRDXIVA 
FWTWMELETIILSEVSQEWETTHHMLS 
LIN 


7225 


21126 


A 


7285 


327 


44 


LIKVSPNFSPPKKKVLPKKPPCTIFPPS 
PF* KTPPEPI FIKPPHKKKNFNS P P PQK 
♦GPPQGFFKSPPPFFFFFFFFFFFFFFF 
LGFSSKLLIR 


7226 


21 127 




7286 


3 


215 


VI L I I FM I * EAFA WKRKVL I VE E P S I NL 
E * LYDGG YYDGLVLTSGLGPY 


7227 


21128 


A 


7287 


367 


119 


LKKAPAPPFLNPPFWGKNFILSPPEKMG 
PPRGFFKRGPPFFFFFFFFFFFFFFFLG 
FFFFFFFFFFIiQYNNMYFH* IL1 INQ 


7228 


21129 


A 


7288 


73 


259 


INKYLQN* KKKKKKKKKKKKKKKKKGGP 
FKKT PRGAQ I S PGVKKKNFPPKGG FKKK 
QAPGPF 1 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I-lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7229 


21130 


A 


7289 


337 


53 


LRSFAPFHLPINNLLTDIHMVIPLPPSS 
LCSHYLFNEVTVTTLFKTTTHLQLGTMA 
HTCNPNTLGG*GGRSA*AQESETSPGNM 
AKPRLYLLNF 


7230 


21131 


A 


7290 


1 


96 


KHHFG FEAAA* YWHFVDW* LFLYVS I Y 
**GS 


7231 


21132 


A 


7291 


288 


2 


SRVEFKGPFMAHCSVTIPGSSYVFTSAS 
* VAGPHGARHHAC* FLF* FLV+ MLLEMG 
S RHWQAGL* RVS S S DP P ARACQ SGGIP 
GVKPKAQSVSV 


7232 


21133 


A 


7292 


121 


233 


YFPKKKKKKKKKKKKKKKKKKKKRKKK* 
KGGGRSRKSYLG 


7233 


21134 


A 


7293 


3 


346 


NPIoNP L.VNLTVS P KRNSSLDTRKKPCR E 
RHREKETLRKKRNQHTEARGRQRENHPE 
TE*ETGQRDRDKQTKKQKDTDIQAYTYT 
DREKRDKKRHRQ I R * AEAER I RDTERQR 
DI 


7234 


21135 


A 


7294 


46 


346 


ETGLKKWGDLSNGTVRAVLNLAQADFCV 
S I PS YHI KHGNTKNNCGGEGKKKILWT 
AKKS E PGWTHACNPN TLGG *GGRVT*G 
QWFKTGLGNIVRPCST 


7235 


21136 


A 


7295 


254 


12 


RKDAENNGSRAE* IKRVKDLVQSKLKFF 
QELEKEGILPNPFYEASITLIPKPDKDT 
SKEDYRPISLMNIDAKILNKILAN 


7236 


21137 


A 


7296 


258 


1 


C I GQ C VLVAVGREL.LC VSRS WCGLLiH VU 
E VGGE ALGRLL WY PWTQRFSE S FGDLS 
TRDAVMGNPKVKAHGK* VLGAFSDGLAL 
liD 


7237 


21138 


A 


7297 


191 


263 


VCSKCEPFIYIMDYYAAIKRNKIMSFVA 
RWVQLEWSELTQEQKTENNMFS* WFG 
FLLLC 


7238 


21139 


A 


7298 


2 


164 


SRVAGITGAHHHAQLIFVFLVETGFHHV 
GQAAFKLLTSNKKARPQ*LHFFYFFK 


7239 


21140 


A 


7299 


307 


8 


VMI I TFWLPHLSGY I EKSTP YECGFDP I 
SADRVPFS IKFFLVAITFLLCDLEKALL 
LPLP * ALQTFNLPL IVMSSLIiLI I ILAL 
SMAYE * LLXGLDCSE 


7240 


21141 


A 


7300 


205 


317 


KSSPACTPSTLGG*GGWFA*VQEFETSL, 
DNMVKPQLY 


7241 


21142 


A 


7301 


140 


22 


FNFLFLYP*NYFFFFFFFFFFFFFFFFL 
FFFAMNINVFY 


7242 


21143 


A 


7302 


382 


151 


SIHRSEFRPRRSHSVSRPESRGIILVYC 
SLYFLGSGDSAPWS * VAGTTGVCPWAS . 
LIFGFFFFFFNRDGVLLSCPG 


7243 


21144 


A 


7303 


318 


17 


LVPTKRPQVENYYPVHTYIWYCTKGKRK 
WAGIGGSCLSSQHFGRPSWQDPLSLGV* 
DQPGQHRETPSLQK* KNWWQEP I VPATQ 
EAEAGEWCEPGRQSLQ 


7244 


21145 


A 


7304 


310 


132 


QW CDLG ATT PANFCI FFVRNGVS P CWSG 
WS * PP DLKLS ARLH FQSVFFNP IISIGL 
FFL 


7245 


21146 


A 


7305 


338 


168 


REPPLPAANFVFFVEMRSHHVRQAGPEP 
PSSSDLPASASQSAGITGVHCHAWP* *T 
YK 


7246 


21147 


A 


7306 


276 


161 


I INAAI KKNEFMSFAGTWMKLEAI I LSK 
LTQELKVK* KLRKFNTDTILLLLLI FKI 
F 


7247 


21148 


A 


7307 | 310 


61 


ITLVYCNFFYFINFNFLKLFDS FVI ILS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K = l>ysine, L. = .Leucine, M— Methionine, 
N=Asparagine, P=Proline, 
Q— Glutamine, R = Arginine, S = Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LKQI YSCTAIVYI1>IIj*AFVQFKFIjCFY 
FLYFKN I FIYKNI FYKI KYV *NMYFYF 


724S 


21149 


A 


730S 


257 


1 


CGRVDLH*AVFFFFKRWSLTVLPSLDLD 
SWAQVILLP*PPE*LGQQSCTTPPSSLS 
TFLYAFLHIPVNKMSCFVGKDRGMFFSY 


7249 


21150 


A 


7309 


2 


181 


RDGFCHTGHTGL* IiLTSGDPPASASQSA 
GITGVSHHARPCLHSLTAFFKNWMVWPG 
QQE 














NLLLDTI * KKKS IRI QE * * QYKS ISKRN 
YNTVS DKS WLGTVAHACNPSTLGGQGGR 
TA* TQEFQTSLDNTVRP CVYY 


7251 


21152 


A 


7311 


2 


193 


FVDFSMHILDMLSLYLSIVICYITLFLS 
I IDNI YVNGPHKI I IPYFYCIKYIFYI Y 
I WFCF WFFET Vl/LC CFSVFT FNDY CS YL 
HYFLLLGVFGLFHFF*FLYLYMVLFLVF 














AFFFFKGP*SEPLTLASSLHWYFPGGHF 


7253 


21154 


A 


7313 


167 


2 


GKPCLLVVMCRDSQKCGEKGTGLKREAL 
RMGK*GQARWLTPVIPALWEIjEAGGSP 


7254 


21155 


A 


7314 


334 


152 


CHVAQAGVQWYNHSSLQP* PPGLR+FSH 
LSLPSGWNHSWTPPHPANFFIMGSCHLP 
LINK 














VSHHARPTFYLS *QVGLTVTS FRNPQ 


7256 


21157 


A 


7316 


155 


9 


QRIKH* FHWLGAVILTCNPSTLGGQGGW 
I TGGQKFETSIAYMATPRLY 


7257 


21158 


A 


7317 


1 


293 


GIPSTWDYWCTSPQPHF*FFFFFF**KR 
DLAML PG G VLNS W I P A I L Q P KG P KVLG I> 
QA WT I L YKG Y YG KEM WHKE I CKGTHD VL 
SKLLTTKRFKRLL 














SNSIiVYLI KPDSVYI FLLFVETVSSHVA 
QVGLELVAS SST P I * ASQS VE I TGVSHH 
ARPKKVT 


7259 


21160 


A 


7319 


169 


265 


GGRMALSPGRGGCSEP*GCHCTPAWETE 
QNPA 


7260 


21161 


A 


7320 


169 


260 


KPQKI PVACDQQFTDKRNCSHGDYECNM 
TTALQPG 


7261 


21162 


A 


7321 


580 


400 


FFFFERESRSVARQPGVQWRDLSSLKS P 

FEFW*RWGFTTI 1 ARIVI J LS*PCDPPTSA 
SQSAGITGVSHCAQP*FFFFERESRSVA 
RQPGVQWRDLS S LKS PPPGFKRF PCLS L 
P S S WD YRRAQP C PANVFE F W 


7262 


21163 


A 


7322 


130 

— 


26 


SPGPLFGLIEPLPSRFKGFFWLNLP*TW 
E YRLAIjTAPIKFFKFLIKTGFPH *G IKL 


7263 


21164 


A 


7323 




"303 


KSSGT I KGRAQS PFLQLVAI EQCQDCFH ' 
FRSQF*YLKKQLQPGMMMHAYNPSYSKG 
*SRRIT*AQEFKSRLGNITKLCCLKINN 
SLIFKITENIE 


7264 


21165 


A 


7324 


194 


8 


DWVEVLKKSTYRPDTVAHACNPSTLiGG* 
GGSIT* GQKFE IRLGNWG PCLKKNLTN 
QLGVGG 


7265 


21166 


A 


7325 


413 


72 


GVPIFLKTLTGKTISSERESKVPLEKGK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


1 Predict- 
ed end 

otide 
locution 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro)ine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophnn, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possiblc 
nucleotide insertion 














ANFRDKRGIPPHHKRVIFAEKKL*KGRS 
PFDDGFQKKLTQPRVSAFKGHPPLQPLY 
AFIKSFREGTHHQDCPFWHLTLLSARRQ 
Q 


7266 


21167 


A 


7326 


394 


177 


NVNHTHTHTHTHTH THTHG WC KS DTG Y S 
SDS*CITPRCIHDPTWPSSSQTVNIiAMG 
LTLAKEVliANMTQTEV 


7267 


21168 


A 


7327 


232 


22 


SRPSGSKKGAPALQQSSCLSFLSSWDYR 
HKPPYLTNFCIFL*SQSFPMLPRLiLSNS 
QPHSNLPXWELQV 


7268 


21169 


A 


7328 


232 


3 


YHRSSGSLISLIQIRLRSKI*APSIPII 
KEWLSSVAHVYKPNTLGGQGGRLT*GQE 
FETSLGNIARPCLHENCLKN 


7269 


21170 


A 


7329 


364 


3 


PKARRATVRRTRVSKAPSNRQCLGVNTH 
VLGHRRPVKGNAKTRWSASESQAHTCNI 
T TLGGQGGL IT*DREFESS LATMQN P VS 
VSGWLQKMFSKQKLYSKFFPNVAYKAFR 
IWLLCARA 


7270 


21171 


A 


7330 


177 


2 


DPHLHKKFKARCGGMTL* SQLLRRIoR * K 
DQLSLGVGGCFELLWCYCTPAWVTERDP 
SC 


7271 


21172 


A 


7331 


413 


40 


RHMYLMFYTYIYVLDIYTYTVFFMYIYI 
P+CVCKYISCLHIMYRFISMYPCVYVYV 
CMCVCNVIHVIVENAGSCLALSALIEME 
GMARDS KMWTFNPDE KMRLSYKKQLEKL 
NFMILCPTWKEF 


7272 


21173 


A 


7332 


147 


49 


KNYIYFCRRGLPMLPRLGLNSRAQEILL 
PQPPM*LiGIjNSRAQEILLPQPPM 


7273 


21174 


A 


7333 


148 


361 


S WLGTVAHAYNPSTLGGRGRQ I T* VQE F 
ETSQSNIARPLLYSVLAYAVQPPLPMVA 
MRKIiWLIPTSRITAL 


7274 


21175 


A 


7334 


305 


2 


CPGGFVTRVHR* YVRLSQPDLRVFTSGA 
MPASAAQSAGITGMSHRARPVCS I I IFL 
NI FKVTCI FVELTI PNQNKQNTFLSSTV 
ASCRILASRPTPIWVL 


7275 


21176 


A 


7335 


1415 


1164 


FFLRQSLLYCPGWSWAQSQLIASS ITQ 
A*AILPPCSWDYSSQHTWLKFFFC*DGF 
SMLPRLVSGSWAQMVLIjPQLPKAQAKL 


7276 


21177 


A 


7336 


1 


206 


GTRADHLMSGFQDQPGGHTETLSLIiKIQ 
RSQAWWHVPVMLPTR* IDARESLERLMT 
IALLKVRNRYIF 


7277 


21178 


A 


7337 


65 


302 


KSSLFYFLKIETGFHYVAQTGL*LiLjSSD 
DPPASVFQSAGITSMSHHTQPKEIjLKST 

rntlee pnrewakarewahrrnt 


7278 


21179 


A 


7338 


208 


38 


npvsakneeyprgtfdlgtqaevqwhdl 
gslqptppglk*fs*lsprv 


7279 


21180 


A 


7339 


2 


198 


aradhprsgvrnhpgqhgetpvstkntk 

N * AWWRVPLVPECWD YI LFYMFTNCMDN 
KLPKWLTLC 


7280 


21181 


A 


7340 


2 


227 


ARGEIMSFAATWMQLEANILSKLTQE* K 
AKYCMFSLLSGAKHWVLMNIKVGTGDTR 
DYLGCSTQAVLKNLGTRAE 


7281 


21182 


A 


7341 


3 


300 


HEGRNYTLTGRDSCTLAASAEKACGACP 
LWGKCDAESSKCVCREASDCEEEGISIC 
VDVNCKEQTMPEC*AVAL*CTVHLIYVT 
INNALCLPQLTLFL 


7282 


21183 


A 


7342 


210 


351 


RGL>GTVAHACNLNTFGG*GRWII+AHEF 
E TSLRNI TKPSLREWPDML 
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SEQ ID 
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SEQ ID 
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peptide 
sequence. 
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od 


SEQ ID 
NO: in 
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09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 
otide 
location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T= Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


7283 


21184 


' A 


7343 


306 


164 


IYMLYFIYITLIYIHTHTHTHLYLDKYI 
YL* CVHCF1.* * AYMI CHQR 


7284 


21 185 




7344 


442 


S 


SQMSKLKTWFRAVAHAWNPSTLGGQGRR 
IT* VREFETSLGNTARPNLYWtJ 


7285 


21186 


A 


7345 


3 


248 


HELSLSLL.LSPFFFKNEKGSR*VAHAGL 
KLLTSSDPPT*ASQSARITGISHCARPL 
LSHFLAYLALPCHQNRLCTGHLPST 


7286 


21187 . 


A 


7346 


2 


246 


HELYQS ILLSPLiFFKNDIGSK* IAHAGL 
QLLT*SDPPT*ASQSARITGISHCARPI. 
LSHCLAYLALTCHQNRLCTGHLPST 


7287 


21188 


A 


7347 


1 


130 


GTRLLKRLTQETRLNPGGRGCREPSLHY 
CTPTWARQ S QTVAHN * KRLTQETRLNPG 
GRGCREPSLHYCTPTWARQSQTVAHN 


7288 


21189 


A 


7348 


516 


288 


F FETE S R F I TQTGVQW CDLG PLQ PLP PG 
FKGFSCLSLLSS * DYRCLPPHPANFCI F 
SRDGVS PC * LGWSQTPDLK 


7289 


21190 




7349 


417 


148 


S T S WVQ AG VQ WHDHGLLL P G P P RLK * S 
SHLSLPHG * DYKH I LS CLANF * NI FVEA 
GFRHTAHGLiEPLGSSDIjPASASOSAGIT 
GMSHLA 


7290 


21 191 










TQKFKTSLSSWDYRRVSAHVANFCIFSK 
DRVSPY* PGWSRTPDPARA 


7291 


21192 


A 


7351 


189 


37 


EAIAVRRI I *NLFFFFFF*RWGLTVLPK 
LVSNSWAQAILSPQPSKVLRLQA 


7292 


21193 


A 


7352 


131 


2 


KSRLGTVAHACNPS TLGGRHGW I T * SRE 
FKTSRANMVKPRPRA 


7293 


21194 


A 


7353 


1 


106 


GTRGQCLTMLPRLVLNYWAQM I LP P WPP 
*VLGLQV 


7294 


21195 


A 


7354 


939 


653 


KGNFIFLRQS FTLVAQAGVQWRDLGSPQ 
PPPPGSK*FSCLSLRSSWDYRHGPPHPA 
NFAFLVEMGFLHVGQAGLELPT*GDPPT 
SASQSAGITGR 


7295 


21196 . 


A 


7355 


270 


26 


EARFLGLLRDIAQLCYPGWSAAV*S*I,T 
AAS TS W AQVKAS TH L.S L PNN * D * ARATM 
PG*LKKYVFF*RQGLiAILPRIiRLiVF 


7296 


21197 


A 


7356 


329 


206 


HHLANF*TFCRDRGLTMLARLNSWPQAI 
LLPQFPKVLGLQM 


7297 


21198 


A 


7357 


128 


253 


MRPDTVAHTCNPSTLGGQGGR I T * THEF 
ETSLGDMMKPYIjYK 


7298 


21199 


A 


7358 


2 


108 


AREGSGGQGCSE P * WRHCTAAWVTEQD P 
VSKTIKK 


'7299 


21200 


A 


7359 


1 


340 


GTRERNWHLA * ALHTHTHTHTHRHTHTH 
THFSFPANS*RDTFVIF*QSLLLEIMTS 
FSPSLGLKCSFSQQPSGLLSSIAGERDS 
RIHGENTVSMCLHNRLKAELRSDVISLR 
V 


7300 


21201 


A 


7360 


385 


1 


SVGPSKPCAGYNLLVCHLLRPLEKRSIR 
VGVT*FSRCHLSPLSLTRKANSLTPCTS 
WMRRCL I LLR LTLG VPHPMSCTHCPTLP 
S EMNLGPQLEMQKS PI FCVAHAGSPRLE 
LFLFGHLGSTPNLPSC 


7301 


21202 


A 


7361 


234 


29 


LSTSPKGSKALKFNVTLLLNIAF* CNSG 
CNSNL* S * QQY * RVI QSHHLSQTLWLTP 
VIPALWEAEAGG 


7302 


21203 


A 


7362 


68 


339 


DRD IAN KP AWQLCQS R ILYLAQLQF I DQ 
GELLPDFRIHYKVI * TCTAWH*RRNRL>I 
DQWNRAD I PDLiNPCQAWRLTP V I LVLWE 
ADVGGS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


1 SEQ ID 
NO: of 
peptide 
sequence 


M 
etii 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


•Predicted 
beginning 
nucleotide 
\oca tion 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

nucle- 

location 
correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


7303 


21204 


A 


7363 


168 


2 


CFYS THAHIFKELRQRPGAVAHAYNLS T 
LGGRGGRLT* GQKFSTS LANMVKPHS C 


7304 


21205 


A 


7364 


399 


62 


QPGQYGKHPVLIKNSKIKPFWGDPPWP 
NAR EG * A * KM VE PGKVR VQSAQ I KALE F 
NLGPKKKVPFKKKNPWLAKGGGLRPEIP 
AFGKAS KGGS PRSR I LRPPGFLP YPAHF 








7365 


289 


76 


VCDVLRESRAQLSRLKCSGII *AHCNVS 
IAGSSNCHASALRVPGSTGVWVESERGD 
VDRFEAYGSRGYVCA 


7306 


21207 


A 


7366 


364 


182 


Q WHDHCJJFKLLGS SNP P TIAP * VAGTAG " 
TCHCAWLI * FLTWKIGFKTSLKI IVLLG 


7307 


2120S 


A 


7367 


155 


2 


SLGTKREGWEKSATVAHACWPSSLGGRG 
RWIT*GPEFPTSMANMMKAHPRA 


7308 


21209 


A 


7358 


342 


184 


LSLPSSWDRRYTPPRLANFFKFFFKR* S 
LTMLPRLYRPFYITTHQTLNSAMTF 












211 


AREPKCL*TDEWINKMQYIDAVEYHLAI 
KLNE I LTHATT WMNLENML I GRS* TQ KA 
TCCVIPFMDSIYV 


7310 


21211 


A 


7370 


190 


344 


YKNVT VKR I LT W P S T VAHACN P S TLG GQ 
GGRFG*GQEFETSLGKIARPCLY 


73 1 1 


2I2I2 


A 


7371 


24 


254 


KPQGIGAHASSQHYGRPRREDHLSLGV* 
DQPGQHGEIPSLQKKKKKGLPGRGGVCL 
KPWFLKI LGEKNVWVLTPKG 


7312 


21213 




7372 


43 


358 


DSNLNYSLFFHGEADLGTTQVLTHPSTT 
AMYFVHYCQSP* ILYGTINT* PPWEKN 
PIH I KTPS PCLQAS TAINLQLSHINCNS 
KATPHPLGYQQTYPPLTVHST 












288 


IHIILAFTSSLLGILVYRSHLIASLLCL 
HGIILSLTI IAPL I PLNTHS LLAP IG P I 
AILVIAA* QAAVGLALLVS I STTYGLNY 
VHNLALLQCCN 




21215 




7374 


335 




VQWNDFGSLQNPPPGVSPFFWLSLPNNL 
GFKGPPPRPGPFLKF* * KPGFS ALTRMV 
S LSRPRDS APLAPPKAGVSRLNPRARAP 
FFFFFVFCFHFFLLSGEKKSLAHKGGI 


7315 


21216 


A 


7375 


399 


91 


RIFFFPPSLKAPPHRQKPTPFPPTKKPL 
VFS I PPI FPPKFLKKNPFSPWEVFLFLS 
FFQKDAFFGPQV*KGNFFFFKGPPPLTQ 
NSPQGVPPKKKKKKKKKKN 


7316 


21217 


A 


7376 


377 


131 


YLYIKFYNFIIILFYIFIFNIF*FNMIF 
NLLFYIFYYILINFNILIILIINFFYFI 
LLIIIJJIILILYIIIYYFIIKNIIKT 


7317 


21218 


A 


7377 


2 


156 


FHHW*AGLELLGSSNPPTWAFQSAETT 
GVSHHAQLPHFILRKQELLSLGL 




21219 




7378 


319 


59 


DTKMNSQDILYSIQSNRT*W*LIKRQ*T 
KQKS APRNGNEVP * THQHSTATTLIS QQ 
PSTSKQDPPAAK* L* FAEGRLLAFPSNN 
VF 


7319 


21220 


A 


7379 


308 


12 


FSPTI SNFNYKSNPS PLIYLFQ * PITQL 
I PTNISTNTPLNL* FYLRLIYSTSISLL 
P I SNNVKI K*QFEHTKPTPFLPTLI ALT 
TLLLPISPPILIIL 


7320 


21221 


A 


7380 


11 


169 


RRDLAMVPRLFSS S PGLK* SLHLGFPEH 
WD YSHE PPSWLLFFLFLITEGYFF 


7321 


21222 


A 


7381 


300 


2 


KFFPPPGFFFKGFFSPL*SHQKKTQIFF 
GEFFWGFKKFFLFFFPPGGFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFLFFFF | 
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nucleotide 
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ino^first 1 
amino acid 
residue of 
peptide 


Predict- 
ed end 
nucle- 

correspon 
ding to 
last amino 

peptide 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 

N-Aspar ^'g^ne L P-Pr ol . 1 ^ =Meth ' 0n,ne ' 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan. Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














FF* INSLFIQNSFLT 


7322 


21223 


A 


7382 


163 


160 


PGTSNDEIQRRNSRNRISPIAER*GLSL 
LPRLEWMGSNEAHSNLELQGSSNPPASA 
SKSTMITGVSHCA*PRLEWMGSNEAHSN 
LELQGSSNPPASASKSTMITGVSHCA 


7323 


21224 


A 


7383 


1 


183 


VCVCVCVCVCVCIKWDSYSAIKKNELVA 
FWTWKRLETIILSEVSQE*NTIYHMSS 
LICGS 


7324 


21225 


A 


7384 


2 


196 


KIQMWPGWAHACYPNTLGGQDGRTA*G 
QEFQTSLGNIARPCLYQQQQQQQQPRCS 
LTYDGDKS 














RNLSSLQPPPPGLK* WSCLSLPSSWDYR 
RPPPCPANCFMLCEFFRTNGVFPCSPGV 
YFEPRN 














AGITGVSHRARPLVLFLRDSLYSGTQAG 
VPWRDLS SLQSRLAEQMRFCFP 


7327 


21228 


A 


7387 


164 


311 


SKQQKGIiSGVAHPCNPS ILGGRGRWIN* 
GQE FETSMASM VXP CLS LLKX 


7328 


21229 


A 


7388 


3 


218 


E IMS FAATWMQIjEANILSTLSHE * I AKY 
GCTYPSSTEQLGYYS 














SDCGLTPIAQTGVHWHHLSSLQP*L*PV3L 
G*FSHLSLPSSWDYKYVPPCPASFLYYF 
LVETGFCHVAQDGLELLDSNNP PSSASQ 
SAGITGMSGCAGLRRF*KLRSWRYLVLL 
AMQVNEQGTSQAGEAVSEADAGQGCCCG 
LC LHR I LiS QQ YAL P 


7330 


21231 


A 


7390 


169 


322 


APGNFKTN*RKNGFWAGNTLGVSVKNMK 
NGWAWWLTSV I PALWE AE AGGS P 


7331 


21232 


A 


7391 


1 


353 


GTSPHYGSFHGHQKDVTGKGSQSRPRER 
VLGSHARKNSLPVHEVKAYLb*KYLiS IL 
RATHILISPDGCWVPIFIAQSYIFLSSC 
GFFLLFFLLLLTDS FCSWHL IIS FLALF 
VI PLY 














IQAGACCAECIPSKGIRWNPSIVEGLHS 
DMSWSLCLFLSGAISVLCNFRLjSGSSDS 
PMSA*QAARTAVIRHHTRLI FCI FSRDR 
ETGFHHVGQENLLMLLLQLFSGLGFS I S 
ATLL TLML»IjV 


7333 


21234 


A 


7393 


362 


176 


ARMECSGAI ♦AGLRGSSDSIASAS *EAG 
YTGKCGHAO*TFVFL>VEVRFHHVGQNAI 
NIRTSL 


7334 


21235 


A 


7394 


2 


304 


ARADCCARNE * GS VGMGPSE PGVG YNLL 
VERFLRPLEKHS IRVGETRFSRCCIjSQL 
CLiARTGNSLTPCTSWVRQCLALiLRLMLG 
ALHPLSCTQRPTSHSE 














PYNPSTLGG *GGQI S * AHKFETSLGNMV 
KPHLS KNKQTNKQTNKQTNI SWSWCTCL 


7336 












RDHFYRESHSLITY ICLENVKSNLjKCGE 
GPLNFTFHLQRKNCWLGHTRSPGGVYHV 
GRTETATGPIKRRNGLGAVAHTSNPSNL 
GGQGGWIT*GQEFKTSLSLPRWLTPVIP 
LV 


7337 


21238 


A 


7397 


2 


242 


RRYSD YPDAYTT * N I LS SVGSF I S LTAV 
ILIIFMI *EA*KKKKKKKKKKKKKGGPF 
KKTLGGPKFNRRMEGKNFSLKGGR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


•Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalnnine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Metl»ionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, "=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7338 


21239 


A 


7398 


1 


289 


TKLLGLYYTLLQASNDFKSPFSIFDGIY 
GSTFFVTTGFHGLHVIIGSTILTI*FIR 
QLIFHFTSKHHFGLEADA*YWHLADARR 
HSESITHQCCSL 


7339 


21240 


A 


7399 


346 


40 


III FTNKI YI KINKRLKLFI I KI LDLY I 
I*IFFFRK*FTFKQ*KF* IININKIYYN 
NIK* * IFIYLFFFFFFFFFFFFLFFFFF 
FFFFFFFFFFFFFFFFFFFLKK 


7340 


21241 


A 


7400 


1 


334 


NS CRRQRLLDLLLL P I TL * LPQL I G Y I E 
KSTRYECGFHPISPARVPFSIKFFLVAI 
TFLLFDLQIALLLPLP*ALQTTNLPLIV 
MSSLLLIIILALNLAYE*LPKGLD*AE 


7341 


21242 


A 


7401 


2 


302 


HTINLLGERYVXiATTY* *STMTLLLTGL 
TVLHTGLYSVYLMTTTQWG*LTPHIINM 
KP*FTR*DTLVFIHLSPILLLSLNPDII 
TGWLVLLSYELYLLT 


7342 


21243 


A 


7402 


57 


338 


TTHNVIGYIEASTAYESGCDPISPARVP 
FSX KFFLVAITFLLFDLEIALLLPr.P* A 
LQTTNLPLIGMSSLLLIIILALSLAYE* 
LQKGLD* AE 


7343 


21244 


A 


7403 


407 


68 


QEELVDPLTTVREQCEQLENCVKAR*RL 
*LCDERVSSRSHSEEDCSEDLFDFLHAR 
DHCVFLILYINLKYMCELT 


7344 


21245 


A 


7404 


494 


154 


SLFIFYLYLIFLF**FFFHYFFYFLLSI 
FSFKLFFFFLLHFFFHMFFFLFFSPPNK 
IFFPKIPPRFFLFPLF**KFFFFLPFKF 
LSPLGFFF*SPPFFFFFFFFFFFFFFFF 
F 


7345 


21246 


A 


7405 




400 


NPGRFLSTSNSSLYERTREIRPTSQSAF 
PRK* YHLKKKKKKKKKKKKKKKKKKKKK 
KKGGPLKKKPRGGQK*TGGEKKKFFPKR 
GGKKKPPGKF*KKTFFWGGKKMGKTPQK 
KINPLGKKKIFKGKRGKTPPL 


7346 


21247 


A 


7406 


2 


257 


RASLCHPGWSAW* SWLIAALTS WLKQF 
SHLNSLKCWDYRHEPPHLA*LRFEHRHS 
GSSICTLNR*ILLSFPILSIRYKNGLIL 
Y 


7347 


21248 


A 


7407 


259 


47 


YVFFFFLFLLFFIRLQSVSPFFFFFFFF 
FFFFFFFFFFFFFFFVK*SFIEMFSFW 
LYLLRQVS I S IAYTLFG 


7348 


21249 


A 


7408 


3 


248 


VTFL*CDVEIAVLIPLP*ALPTTNLALI 
DMSS LLL 1 1 ILALRLAYE* LRGGLD * AE 
EHQATHS AYERAP PL 


7349 


21250 


A 


7409 


2 


350 


RSGMPRRYSDYPDAYTT*NILGSVGAFI 
CLTAG I L I I FM I * E AFAS KRKVL I VEQP 
SINLE*LYGCPPPYHTFEEPVYIKSRHI 
RKESNPPKL.VSSQPHGLHDFFKKKLGTI 
TEKN 


7350 


21251 


A 


7410 


2 ' 


232 


LDQTYAKIHFTIVFIGVDLTLLPQHFLG 
LSGMPRRYSDYPDAYTT*NILSSVGSFI 
ALTAAIL I I FMI * E AFAKKF 


7351 


21252 


A 


7411 


2 


321 


TSLLAVLL 1 1 MTL * LPQLNRY IEKSTPY 
E CGFD P I C P GRVP F S I KFFLVAI T FLLF 
DLETALLLPLP*SLQTTNLPLIAMSSLL 
LI IILDLNLAYE*LQKGLD*AE 


7352 


21253 


A 


7412 


2 


194 


S I EC F LVP I TLLL CDLE I ALLLP L P * AL 
QTSNLPLI VMSSLLLI 1 1 LALNLA YE * L 
QKGLD*AD • - ■ 
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3NSDOC1D: <WO 0164835A2J_> 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cysteinc, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


7353 


21254 


A 


7413 


1 


197 


LYTTYFVTKTLLLTSLFL*GRTAYPRLR" " 
YDQL IHLL * KNFLPLTLALL I * HVS I P I 
TISSIPPQT 


7354 


21255 


A 


7414 


1 


185 


STPLP*ALQTTNLP1jIVMSSLLLIIILA 
IiSLAYE*LQKGLD*AEKKKKGAAVt,KDP 
SGGPS 


7355 


21256 


A 


7415 


347 


228 


LE*LYGCPPPYHTFEEPVYIKSRLKRKE 
SNPPKLVSSQPHGLHDFFKHIKNYRKNK 
RKEGIiASVKENDRGP 


7356 


21257 


A 


7416 


1 


296 


HAYDHASAQLNGYIEQSTPYECGFDPIS 
RARVPFCI KVFLVAI TFLLFDL.EIALLL 
PLP*ALQTANLPL,IGMSSLLLI I ILALS 
LAYE * LHKGLD*AE 


7357 


21258 


A 


7417 


3 


202 


TCS I * HVLVRMT* L I YDLE I ALL * PVP * 
ALQSTNLPLISMTSLLLIIILALSMAYE 
* LQTGLD * AE 


7358 


21259 


A 


7418 


I 


321 


SNTLLALVLITITS*LPQLSGCVGECTP 
YECGICRI LNARALCCMQI FLKAI TFLL 
FDLEIALLLPLP*ALQTTNLPLIDMSSL 
LLI I ILALSLAYE* LQKGLD* AE 


7359 


21260 


A 


7419 


1 


212 


RTRG 1 1 FR PLSQSLY * LLAANLVI LT * I 
GGQPVSYPFTIIGQVAYVLYFTTILILI 
PTICLIENKILKWA 


7360 


21261 


A 


7420 


2 


257 


GRVGGRVGYSIAFFLQAITVLVLDLESA 
LLLALP*SLQTTNLSLIVMSSLLLIIIL 
ALSLAYE * L PLGLD * AE YWGEMMTTTLD 
A 


7361 


21262 


A 


7421 


367 


269 


H CDL KLLG S S DP PAS AS * VAW I TGT CHH 
TKLS 


7362 


21263 


A 


7422 


2 


116 


ALQTTNL PL I VMS SLLLI 1 1 LALRLAYE 
* LQKGLD *AE 


7363 


21264 


A 


7423 


2 


336 


KPECGREGILPCCSSSAWPEGSFRPFQM 
NLGSFLSFFFLFFFFFEKGLFLCPPV*S 
PYGKSNLLEPSASRFKQFFCLGPLNNWD 
YRLFPPFPGNFGVFKKNRVFPFWPSLV 


7364 


21265 


A 


7424 


3 


154 


LFL* IRTAYPRFRYDQLIHLL*ENFLPL 
TVALLI*HVSIPITISSIPPQT 


7365 


21266 


A 


7425 


3 


154 


LFL*IRTAYPRFRYDQLIHLL*ENFLPL 
TLALLI*HV5IFITISSrPPQT 


7366 


21267 




7426 


1 


314 


INTLLTLLLIIITL*LPQLNGYIN*STP 
-YECGFDPISPARVPFSIKFFLIAITFLL 
FDLEIALLLPLP*ALQATNLPLIDMPSL 
LLiIII LALS LAYE * LHQGLY 


7367 


21268 


A 


7427 


1 


331 


I L I INTLLALLL 1 1 1 T F * LPQLNG Y I EK 
STPYECGFDPISPARVPFSIKFFLVAIT 
FLLFDLEIALLLPLP*ALQTTNLPLIVM 
TSLLLIII LALRLDYE * LQPGLD * AD 


7368 


21269 


A 


7428 


2 


373 


SDRNHI IHALVITILLGLYFTLLQASEY 
FESAFTISDGIYGSTFFVATGFHGLHW 
IGSTFLT I CFIRQL I FHPTSKHHFGFEA 
AA* YWH I VDVAGLFLHVS I YRGGS YNHQ 
IIPSVAQESTPA 


7369 


21270 


A 


7429 


378 


63 


PSFFPPFPP*NFFFPPRPLFSPGGFPPF 
FPPPQKGPPPKIPPGFFFSPPFWEKFFF 
PPPPLILPPPRFFFNPPPPFFFFFFFFF 
FFFFFFFFFFSFFFWDFLGSGC 


7370 


21271 


A 


7430 


1 


109 


QTTNLFLIVMSSLLLIIILALSLAYS*L 
QKGLD*AE 



1003 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 

peptide 
sequence 


Predict- 

nucle- 

locntion 
correspon 
ding to 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteinc, D=Aspartic Acid,. 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutarnine, R=Arginine, S=Serine, 
T=Threonine V=Valine 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /^possible 

nucleotide insertion 


7371 


21272 


A 


7431 


358 


290 


FLL VI PNFLFKFL I VI I IS I VPNVS FFH 
IIYILSIFFPFI IFILLLSSFNFILFFF 
FITSFPFFYFFFFFFFFFFFFFFFFFFF 
FFFFSFFFFFFF*FFFTN 














PTRPPTRPHNSLQL*TPGLKQSYPRLPS 
SWD*RHMPSHPANFRKFF*RWHLYCPGW 
S*TPGLKQSYPRLPSSWD " ' 






A 








FSPSLRPSFLPSFLPSFLPS*LPSFLPS 
FLLFLSYSFCPLSFLCLRCIPHSAFFWG 
NPRQVQENCVI VT I YRYLS FTALHQL 


7374 


21275 


A 


7434 


. 163 


1 


SVYVKRIGLGRVAYTCNPSTVGGQDG* I 
T*GQEFETI LAKRRE PL FLAC FL FL I 














NDDNS VE ^G X R LHKG LGT VAHT Y FCP S T L 
GGQGGHIA*AQEFMTSLGHMT*SSLYHK 
LTEYGWRDTASLLMC 














GGDLRHWPSSDSFFLSLSFFF*NKNGGG 
LALLPRLDLNSWSQAILPLKPPNALGLQ 
A 


7377 




A 








FVCLFLFFFFEKESTFQHDLSSLQPPLP 
RFKRFSCLSLPSTWDYRLAPQCLANF* I 
F**KSGFTFG 


7378 


21279 


A 


7438 


1J9 


300 


YCVSLLIFFIFI IYIFILLIFKFLFITL 
NYISFLFFYF*H* FYF*KNFI FVFPFGY 
FPRP 


7379 


21280 


A 


7439 


1 


221 


LKLQHHDPTT I SLVI TRNWKQ P KCPNVQ 
QIVH* S AI KRNELL IHAKTWMNLRGI ML 
SEICQPQKILYLEIPFI 














FSFCCVCVCVCVSLPKHAIIEMLNQTKP 
YVPKGVAYLPEREPFIVPMDPELTA* YE 
DYRTHESAQEPSEPHQLSRRSLPH*PGK 
ATRDHS P PTDS WGKRRTSH 


7381 


21282. 


A 


7441 


3 


384 


VIHL*AGAVIIGSRSKSTNALAHFLRQG 
TPTPVILVLGIIETMNLLIQPVALAERL 
TDN I TAGHLLMHL I VS ATLA I LT I NLT S 
TLIMFTVLILLTILQIAVALMHAYVFAL 
LVCLYLHDDAYDHGG 


7382 


21283 


A 


7442 


.15 


277 


GLAILPRLVPNSWPKVILQVWPPKMLG* " 


7383 


21284 


A 


7443 


21 


694 


NGGLNAHLASASEFDHSGVQLIEREEEI 
CIFYEKINIQEKMKLNGEIEIHLLEEKI 
QFLKMK I AE KQRQ I CVTQKLL PAKRS LD 

GENRARFLPGKDLTEKEMIQKLDKLELQ 
LAKKEEKLLEKDFIYEQVSRLTDRLCSK 
TSGCKQDTLLFTKKMNGYSRRIKNATEK 
MMALCA*LSMKQAPTIELQKEVQGERRP 


7384 


21285 


A 


7444 


141 


428 


DRSLLEAGDGRIAGDFFFF*TGTNFVAQ 
AKVQWG I HS S LHP * PPGLKQS S YLNLLG 
S YHHWLRPHALFFFLKNRGSL FC PGHFG 
TGFRGTTFWGAK 


7385 


21286 


A 


7445 . 


35 


343 


. GYSI IGCPKFLDEKI FDSGHEKERKINS 
LLF*KKKKKKKKKKKKKKKKKRGGLLKK 
TSGGPKIKRGGGIKNFPIKGGVKKTSGE 
FFGKKPFFGGEKFWEILP 


7386 


21287- 


A 


7446 


403 


193 


EYIKQGAE*NNQPKLHAHRKRLTWKETP 
RCIHEG IAPAIMNSDCLVFDTSIAQLFA 
ENGNLGINVTISMC _ . 



1004 



vISDOCID: <WO 0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamtc Acid, FHPhenylalanine, 
G=Glycine, H=Histidine, I=IsoJeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutarnine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


7387 


21288 


A 


7447 


411 


110 


LTDSLINI VYR* KMS Y I S * TNSA* VYLY 
IFCIYIYIYIYIYRRIYSIYRC*YIYYG 
PAHIVLVYSSSYILVYYESHKYGYIMGL 
NILSYIFIYSYFSACS 


7388 


21289 


A 


7448 


1 


380 


PTRPQKLKKNCNGVSNGDLCFFGGGQFL 
IWGFFFFFCILIEGGLNSKNYELKSLIK 
T YGAKKAE I EHTEN* KNRETLLERAAQL 
AIKVTNPNAMLRTVEHDYAMYMHNFWLN 
KLNNCHKVAISCKQ 


7389 


21290 


A 


7449 


200 


437 


VQSNNNKNFKRRRL*PGWAHTCNPSTL 
GGQARRIS *AQELETGLNNLMRPCNYNK 
F I NVLDDVHG VTS P VS Q YYKKVC 


7390 


21291 


A 


7450 


405 


3 


PPRFFGFFFPFYPLKFFFFPKGFNFFGG 
FFPIFFPPKKKFFFKNSPGGFFLPPFLG 
KIFFFLPPFKFGPPRGFF*RAPLFFFFF 
FFFFFFFFFFFFFFFFFFFFFLQRINFT 
ARPGTPLSRAVPLRGALPPSA 


7391 


21292 


A 


7451 




348 


PRVRPRVRYKLQTKRCDLTLPS *GHSHV 
SLRNTTFDAL KS * DTDS LLS P KL.ECHG V 
NTPNCNLRLPSLSDYPASLSREAWIAST 
RLRAWEIFVFLAQMRQDYCMNPGGGGCN 
ELK 


7392 


21293 


A 


7452 




168 


LVHDGHAGLKLLTSGDRPA*AYQSAGIT 
GVRPLGPPTYYYFRTASFMTFICEFATV 


7393 


21294 


A 


7453 


184 


260 


SSRGGSLASRSSR*STLLGL.PKCWDYRC 
MPPSLANFFFFNF* I*NFCFW*RRSLGM 
LPRLWNS*MQAILPLRIRLQGIHFAGT 
QTFSL 


7394 


21295 


A 


7454 


405 


64 


FFFFFNFF* KKKI * DVFAKI FFKI FFFF 
SGLKIFWGGGFQNAPPQKKFFF* KIKSF 
FFLNFFF*KNIFFFVAGWFFLLNFFFR 
GAPLFFFFFFFFFFFFFFF FFFF FFFF F 
FFFFFFFFFFFFF 


7395 


21296 


A 


7455 


368 


2 


SQSFGRPRWENRLGPEQHIKISRYRPAQ 
VHACHPSTLGGRGRWIT*GQEFKGSTHL 
GLP KYWDYMPE PLLLVY IQCFHFRTFCN 
KAAIHCFKPEQLLYNKTSYFKESYKVKA 
ALVACKPTRP 


7396 


21297 


A 


7456 


2 


183 


TERDSVSKNKYIYIHTHTHTHTYICVHG 
VIYVYI*SLLLIYNIAYILIYHVFYILL 
IFKA 


7397 


21298 


A 


7457 


281 


127 


YNFC I FM * RCGFNNVGQSGLKVLLRGYS 
PASASQIAGITGVSQCAQPVAYS 


7398 


21299 


A 


7458 


2 


194 


TSVRKYF* YPFTITDRIYGSTWWATGL 
HGLHVIIGSTFLTICLIRQLIFHFTSYH 
HFGSEAAG*YWHFVEPARLFLYVCIY* * 
GSTWWATGLHGLHVI IGSTFLT I CLI R 
QLI FHFTS YHHFGS EAAG 


7399 


21300 


A 


7459 


22 


319 


GGS VTGAYHR I LDHDRNQ I FLA* GGSVT 
GAYHRILDHDRNQIFLALLISMLLGLYF 
TLLQASKYFECPFT I CDGIYGS T FFVAT 
GFHGLHVI IGSTFLTICWIRQLILHFTS 
RHH FGLEAAA * Y WH FVD W * LFL YVS I Y 
**GSVTGAYHRILDHDRNQIFLALLISM 
LLGLYFTLLQASKYFECPFTICDGIYGS 
TFFVATGFHGLHVI IGSTFLTICWIRQL 
I LHFTSRHH FGLEAAA 


7400 


21301 


A 


7460 


1 


222 


TILLGLYYTLLQA*EYFKATFTIYDGIY 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etti 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
corrcspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid,_ 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T = Tlireonine, V—Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














GSTFFAATGFHGLHDIIGSAFLTICFIR 
QLIFHFTSEHHFGFEAAA*YWHFVDAT* 
LCMYVSI Y* *GSTFFAATGFHGLHDI IG 
SAFIiTICFIRQliI FHFTSEHHFGFEAAA 


7401 


21302 


A 


7461 


3 


246 


LNYS L FFHGEANLGTTQVLTH PS TTAM Y 
FVHYCQPP* ILYGTINT*PPWHKNPIK 
KKKKKKKKKKKKKKKKKKKRGGGF 


7402 


21303 


A 


7462 


2 


155 


SRSRAALIjLPLP*ALQTTNLPLIVMSSL 
LLI I ILALSLAYE * LQKGLD*AE 


7403 


21304 


A ■ 


7463 


} 


136 


PTRPAPSHLLYCKNIKKLFPSGTANEAL 
I IAD I FLRCKKKKFKVK * * VLKASEE * N 
GCKHLHGMH * NSAVCTVFWLKQ I MNS D * 
**KVNTQ*KKKKKKfCKKKKKKKKKKKKK 
KKKKKKKKKKKKKKGGENNK* KNIKKLF 
PSGTANEALI IAD I FLRCKKKKFKVK 


7404 


21305 


A 


7464 


166 


192 


NIN*MWYMHTVECYSDFKRKEILIHATT 
WMNLEDIMVSEISQSQKDKYCMILHIEY 
KLYIHM 


7405 


21306 


A 


7465 


416 


55 


LFFLRLGYFFLPNIRVFFFLSTLFFFFF 
VWGFFFFGRFFFFFPPPQEVCFFFIF*G 
IFFFFLFK*KNMFFFFFFLWGPPPFFFF 
SGAPLFFFFFFFFFFFFFFFFFFFFFLR 
KANTKKMF 


7406 


21307 


A 


7466 


295 


84 


IWGNVTLLQLVSCLFCFLFLH*GFFCCC 
CCLFLFLFVCLFVCWFLNLKPSIKCLLP 
FSLSLGCAEIPSSF 


7407 


21308 


A 


7467 


142 


258 


PGTVAHTCNPSTAGGQGGRIP*GHEFKT 
NLANMVKLHLY 


7408 


21309 


A 


7468 


3 


192 


PSEYFESPFTISDGIYGSTFFVATGFHG 
LHVI IGSTFLTICFIRQLIFHFTSKHHF 
GFEAAA*YWHFVDW*LFLYVSIY*+GS 
TFFVATG FHGLHVI IGSTFLT I CFIRQL 
I FHFTS KHHFG FEAAA 


7409 


21310 


A 


7469 


2 


110 


GRVGKHHFGFEAAA*YWHFVDW*LFLY 
VSIY**GS 


7410 


21311 


A 


7470 


380 


8 


TPQKRKKLKKAGEGVFPPPPKKTTPPPP 
PIFIYFIFFFFFIYFFFFFFFFFQYKSF 
FYSHIMIQRHFLNF*ESFYSCISILQ*L 
FKFFSFLGGLIKL 


7411 


21312 


A 


7471 


1 


107 


PTR PHHFG FEAAA * YWHFVDW * LFLYV 
SIY**GS 


7412 


21313 


A 


7472 


2 


360 


VNTLLALLL IM I TF * LPQLNG YI E KSTP 
YECGLDP I S PARVP FS I KLFLVAI TFLL 
LDLEIALLLPLP*ALQTTNLPLRAMSSL 
LWI I LALGLAYE * LQRGLD* AEWLHSL 
TQPTNNN 


7413 


21314 


A 


7473 


423 


167 


VKVHKCFLPGVAPHAFNPS TLGGRGGR I 
A*AQEFKTSLGNIVRPPSDTCNPIVLGL 
OA 


7414 


21315 


A 


7474 


] 


73 


I KPE * YFLFAYT I LRS VPNKLGG V 


7415 


21316 


A 


7475 


131 


3 


ISWVGAEWLTRVI PAL* EAESGVSRGQE 
IETILANTVKPDAW 


7416 


21317 


A 


7476 


2 


103 


PRVRSTGFSSIAHITRDVNYG* I IRYLH 
ANGAK 


7417 


21318 


A 


7477 


2 


325 


NTLLALLLIIITF*LPQLIGYIEKSTPY 
ECGFDP I S PARVP FS I KFFLVAITFLLF 
DLE I ALLLPLP * ALQTTNLPLI VMSSLL 
L 1 1 1 LALSLAYE * LQKGLTERDT 



1006 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteinc, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, ^Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7418 


21319 


A 


7478 


423 


170 


FFSFFFFFFFFFFFFFEIDKNVYLIM*C 
TNPQEKRNLLFCFLVMVSPVCITPHTYI 
CVCVCVIYIYIYIYIYIYIYPFIFSCCD 


7419 


21320 


A 


7479 


433 


50 


RVFFFFFFTPEKKTFFPQEIYLFFSPFP 
L*NFFFSPKPLFFLGGFAPNFPPPKKVF 
FLKFPPGFFFSPPFKKKFFFFFPRFFFA 
PPRFFF *GPPS FFF FFFFFFFFFFFFFL 
TVKFTRGFRGFCGQI 


7420 


21321 


A 


7480 


401 


337 


FFFFFGKFFFPQEFTVFFPPFPFKIFFF 
SLRLLFFWGGFAQFFPPQK*VFFSKFPR 
WFFFPPLFRKKFFFFFPG * FliAPQGFFL 
KGPPPFFFFFFFFFFFFFFFRPI*VGGR 
VGNNPNPNTNYLPN 


7421 


21322 


A 


7481 


1 


84 


KNFLPLTLALLI*HVSIPITISSIPPQT 


7422 


21323 


A 


7482 


562 


412 


LFSFHVSGTMAHTCNPSTLTG*GRIT*G 
LEFKTSLGNTVRSHRYRKKKIA 


7423 


21324 


A 


7483 


12 


233 


DIFHFTSKHYFGFEAAA*YWHFVDWRIi 
FLYASIY* *GSLL 


7424 


21325 


A 


7484 


385 


56 


GNFPTPKKFNFFYPFYLSIFFFFPKKLK 
FF*EIFPIFPPPKFIjFLPPNFPVLFFFF 
FLFWEFFFSFLFWFFFFPFFFFFFFFFF 
FFFFFFFF*KTTEFYFVNSNFQHTK 


7425 


21326 


A 


7485 


424 


6 


FFFFFSEGSGLIREDPIGAGALYDYGR* 
LVWTG*TLFVGVYIVIEIARGNRLCD 


7426 


21327 


A 


7486 


16 


129 


FXXXRXXLIXKEPSINLE*LYGCPPPYH 
TFEEPVYIKS 


7427 


21328 


A 


7487 


183 


387 


I ASGLD I FIiLiLLLETGSHS VTQAGVQWH 
NHDS LQPRJL.TGLK* S SRRS PRGS WDYRV 
LLCHPGWSAEA 


7428 


21329 


A 


7488 


351 


169 


EKKIFFFSPRGYGPPPFFFLKAPPHIFF 
FFFFFFFFFFFFFFRGGGGDHKNFFFFF 
WGFLKNFPLFFFFPRV+ 1*1 FFFFFFFF 
FFFFFFFFIiGGGGGTIKIFFFFFGGF 


7429 


21330 


A 


7489 


47 


212 


THAS AL FLi * I RTA Y PRFRYDQL I HDL * K 
NFLPLTLAIjLI +HVS I PIT I SS I PPQT 


7430 


21331 


A 


7490 


98 


273 


TSGS CCS AH I LAFKP * AGMATF I TKSL * 
* CHTHTHTHTHTHTHTHYMYI YTHMHTY 
IF 


7431 


21332 


A 


7491 


70 


405 


LYGCPPPYHTFEEPVYI * EAFAS KRKVL 
IVEEPSINLE*LYGCPPPYHTFEEPVYI 
KSRQKRKESNPPKLVSSQPHGLHDFFKK 
KKKKKKKKKKKGGGPI.KKPRGGPNLTGG 
GKKNFFFFRGGEKKPPGGFWKKTLFLGG 
GKLGTPPPRRF 


7432 


21333 


A 


7492 


22 


208 


RSSLPARPTRAVILIIFMI*EAFASKRL 
VLIVEEPSINLE*LYGCPPPYHTFEEPV 
YIKSYV 


7433 


21334 


A 


7493 


2 


204 


KL.PET VKKKKKKKF YS FRS I VMLLLYL F 
S * SLS FFLL.YSLL FFLFLLS FYIYSFLC 
FPLCKTATTEH 


7434 


21335 




7494 


2 


104 


TTLLLSRT*NKIjT*IjTPIjIPSTLI,SI j GG 
LPPLTG 


7435 


21336 


A 


7495 


291 


160 


PRHF* IFVEMGSHYVAQAGLKLVASSNL 
LAWASQS AGVTGVSHGSLLITS CL 


7436 


21337 


A 


7496 


422 


180 


LGAPQGVF*KGPPFSPSSSSSYSSSSSS 
PS*QNIFWXIFLFVCFVLETESVTQVGV 
QWRDLGSLQPLPPGFKRFFTITTH 


7437 


21338 


A 


7497 


244 


296 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFFFFFFFFFFFFFFFFFFFFFFFG 
GNF* FYK*LLGNRDGVCLPPSS YL 


7438 


21339 


A 


7498 


1 


410 


PTRPDQLiIHliL* KNFLPLTLAIiLI *HVS 
IPITISSIPPYT 


7439 


21340 


A 








PQKYTLiIMKICI IPPGKGFRPVEMLAGG 
NENTR CLIEERSY K YHY S L I KLGLQQQC 
IFSSLLYKYNMIYCLiMLCYYIGYIFIFY 
DMYYRQYSVLYYKLYNYVMLI * I YILYL 
FMCI *KYIYFIFM*YKYIFIYLYLY*YK 
YIFYIYLCVYKNIYTLYLCNINIYLYTY 
IYINTNIY 


7440 


21341 


A 


7500 




110 


FTSKHHFGFEAAA* YWHFVDEV*LFL>YV 
SIY**GS 


7441 


21342 


A 


7501 


168 


327 


AFCYKATVIKT1 * Y* YKHRYTEQWSRTK 
NPDINPYICGQMI IFLQLKSIjHKI 


7442 


21343 


A 


7502 


400 


159 


NNFSPLRNLQGFFGPLKFPMGFQAGPPG 
*GFSFFLRSNFFLDVFRGKY*TFGF*DF 
F YC P S PNE VLG E P P LMGGNKPG F P 


7443 


21344 


A 


7503 






TSQKKKKKFGCKTPKKKKKNREKPPKKKR 
GTRGEPFKTGGGKEKKKFFKKKKKKGKG 
PP * PQLVKRGGREKKKKNLWKKKGAF * K 
KKKKKKKNPKKKKKKPGKTPKKKKGDPG 
GTF 


7444 


21345 


A 


7504 


44 


344 


LLASLANIALPPTINLLGELSVLVTTFS 
^SNITLLLTGLNILVTALYSLYI FTTTQ 
WGSLTHHINNIKPSFTRENTLMFIHLSP 
ILLLSLNPDIITGFSS 


7445 


21346 


A 


7505 


2S6 


252 


I *FAFHHVGQAGLEFLTSDLPASASQSA 
RITSVSQHAWPVHNIFYSLL 


7446 


21347 


A 


7506 


3 


213 


RELPGLKRFFLLSLIjSSWD* WRMPSHPA 
TFCIFCRDEVLPCCPAGLLILLVWNLKN 
CCCYHHHRYYYCY 


7447 


21348 


A 


7507 


400 


20 


SLPQKKVSLLKLKSAVQLVMF*DRFSLC 
HPGKRA*GQSGVTTASSLRAR* SSCLTW 
EYRPS P3jPLADFLFW*RRGL.P I * PRWS 
NS *GQVG£.[jHQPSKMLKFSSI.SPCARPL 
FFFFNRSTNLRVQS 


7448 


21349 


A 


7508 


349 


227 


SDFLTS ANPPAS AS KSFG I TGVS HHTR P 
*PRFLKVNLIYSEMEVYN*ASFRHVD*G 
GLELLTL*FTHLGLPKLWDYRREPPHPA 
LTKI FKGKPYIQ 


7449 


21350 


A 


7509 






FLVETGFHHVG*AGLEPLiASSN1jPASAS 
QTAGITGMSHCAQPNPGSLLNKTMLAVA 
TEQWVGYMWPSVFGHITARPEGKRLSDC 
LHSDTWVCWCVCSGLI FVCFETGCRFA 
LQPG 


7450 




A 








DKVCSVTQARVQWHDHGSLLFLSSGLRQ 
SSHLSLLSSWDHRHMPPGMANFFYF* *R 
LRSHYVAQAGLKLLASSNPPTLTCQSAG 
I TGVS HNTW PKALFS VTDLKH 


7451 


21352 


A 


7511 


417 


33 


APPCGGGPPKRIFGPSRKRGKGPPPPKR 
FF*KNKPPKGGGKKSPPPPKNFFPKKKN 
LGPKKKKSGFWGRFAPPRAPGGGVFNFP 
F KAGG E KF F S RGE I NP PRG AGKKKGF FL 
GKKNTKKKTPKKKDL 


7452 


21353 


A 


7512 


81 


296 

G 


CLREGASHSAAQAGLHWHDHSSL>HP*NP 
GbRGS SRLNLLS I WDYRCVP PSIANFES 
CLLKIILGSPFPKNL 
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nucle- 
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last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptbphan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 


7453 


21354 


A 


7513 


412 


279 


KKYTQGLG WAHAYNPSTLGSQGRR IP* 
AQFETSLGNTGRPCLH 


7454 


2)355 


A 


7514 


74 


126 


KI *DWVPKGAPLiQGPVF*NSTFRPRVRN 
RGPPWPRPGPTVWKG*NKKN*PAPRGSN 
PPFFFPGNPGPFKLRGWPGSS * GSFQNC 
KKAQAG*APT*KPGPPPPQNKV 


7455 


21356 


A 


7515 


332 


20 


GERQVGSYFFMNTERLFGMTKKF*KWIV 
SMVAQHSSSISGVWVFRGQELAFPLSPD 
WQVDYESYTWRKLDPGSEETQTLVREYF 
S WEGAFQHVG KAFNQG KI FK 


7456 


21357 


A 


7516 


1 


167 


RTRGQRFTVIARI VLIS *HCDLPASASQ 
SAGITGVSHCTRPGL.SFLYTPTAKHST 


7457 


21358 


A 


7517 


2 


223 


GRMEEMGSHYVA*AGLEFLGASDLPA*A 
PQRAKI LLLLGGHKPGLTI LPRLSSNSW 
PHVKWPRWPLKALGLRG 


7458 


21359 


A 


7518 


151 


476 


SLGKFAVS FS FS FLK * RRALiAVTjPRIjCS 
NSWP*A\TLI,PWIjPRVGIAGMSYHTQPDY 
H FQARAVAE I LGLVSTQHSS VTGL PVEN 
SFRAGHGGPRLQSQHFGRLRWADC 


7459 


21360 


A 


7519 


3 


134 


CSFRLGAVAHICNVITLGGRGGRIA*AQ 
EFETCLDNIARPCLY 


7460 


21361 


A 


7520 


14 


219 


APSIHGFGFLYTYIHTHTHTHVYMYI* V 
YIHTH I ESI KCQPLTQLQSNKKANR I VF 
RTQFMLTCYFMR 


7461 


21362 


A 


7521 


134 


334 


RKHSC-IFFNFCFYLSLETRSHYVASTG 
LDLLASSNPPALASQRARITGMSHHAPP 
VLTRFKKSLF 


7462 


21363 


A 


7522 


406 


145 


FLKTGFSPFGPFVF*TPPPGFPPPPPPP 
IFGVPGGGPFAPPLFFFFPQTKHFPPFL 
NPPSPPPQRGFFRGFFIiFPPAFFLKTPP 
FPF 


7463 


21364 


A 


7523 


3 


382 


HPQGEGGRLTSLE*NTSLRLKKKKKKKK 
KRGGP FKKNP WGAQI * PGKKKKNFFLKR 
GAKKNHLGNFGKKP YFWGGKKLAKP PKK 
N*TFKGKKKFLRGKGGKKTPKPWLLK2F 
FSGFYLKKIFPPGP 


7464 


21365 


A 


7524 


319 


106 


GTRSCSVT*AGVQWHDHSSLQP*TPWLK 
RS S WLRNRWD YKHE PPCLNN FY FFCRDG 
GLLCHPGWSQS PELK 


7465 


21366 


A 


7525 


433 


228 


MQPYNWEVNSSSQLSLSSARQRVTHVLT 
GNFFIK* ERWDLPVL.PRLVS KS WPQVI F 
PPWPSKVLKLQV 


7466 


21367 


A 


7526 


26 


351 


CRSLVFSLPKPGL.RFVSILHIQVFSHFD 
FFFLTGNTQK* RK* TPF I FPA* LFKTPA 
PKPGMVAHTCSPSYSGGLDGRIA*AQEF 
RTSLGNKTNPSLEGGRKKKKPSGS 


7467 


21368 


A 


7527 


438 


62 


QLSKIHRPPAGGRFFKSGGGFKPFCPVG 
LiPPFGKKANQPTPKNPPKKQKPSGGGLS 
PMGKPP*FPCSMPSDSPGSVSFSRGGVA 
P APKD YCP I V* AAL I PSKKKKKS KPG * V 
KKGKPNLYS I YNK 


7468 


21369 


A 


7528 




132 


FYYL*RRGFPVLPRLVLNSWLQVIIiLPW 
PPKVLN * AAHHPHG 


7469 


21370 


A 


7529 


2 


628 


FFFLPGWNRVLLMGRGLGEANSREGRG 
PSQPRGMHGQLEVRGGRSQGDGWHGTLS 
PPWGRIYRGAPPTFAAPQAPKPFRQX.LP 
M*LPGRPSSCLLPETPALSPLPSAEWGG 
FKLLTGTSETPGPSGSPSVGSANAQAAG 
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M 
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Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspnrtic Acid, 
E=Glut£imic Acid, F=Phcnylalaninej 
G=Glycine, 11=11 istidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophnn, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ATT I VARGHT PCTRGPI.DTHAHS SLR WH 
AHTRPP PRPLAHEHGTHTHAHPHTPGAW 
VGEGP FQGWGRA 


7470 


21371 


A 


7530 


63 


205 


GRPTRPRTRGVKILi*QFEHTKPTPGIjPT 
LIALTTLLLPISPFILIIL 


7471 


21372 


A 


7531 


390 


56 


KTFFFFNLIAFTEAKSVLLFLLLFFETG 
SRSSAQPGVQWHHQSSLQPQPL*LKGSS 
CLSLLSN*DYRCAPPCQ*SPCLSLLKCW 
DYKREPPHPAKESHMQQSYLIYSLYIA 


7472 


21373 


A 


7532 


543 


335 


LAAYGGTCIjQS*LFSRLRWEDHLSPGVQ . 
GCKELQLQHCS PAWATE * DPVFKGEKKE 
RVLVKVFS 


7473 


21374 


A 


7533 


3 


145 


WCDHGSLEPRSPGLKQSSCLSLPSNWDH 
SSEPLHTTSMWNF* IFKFLNLK 


7474 


21375 


A 


7534 


386 


106 


SWVTDEITMEI *ECVLNDNSDTPYQNLW 
DTEKWLRAKFTALNACMKKSERAQIRH 
TCVHTHTHTHTHTHTGSKCLLRKYLLLT 
IQQQNIRIN 


7475 


21376 


A 


7535 


398 


2 


SLQPLPPRVKCFSSPSPPRRWDYRGAPP 
RGGKFFFFGKKEFPPVGQGGF * FLPPGV 
LAPPPPQKGRISGGSPRPRPFFFFFKIR 
ISYPILAKQLKLCFLKKKKKSIQAIKLY 
HHQKNPITKSSVFWESQPRA 


7476 


21377 


A 


7536 


2 


344 


HSKCVPAEWTMAYYIRYLSISSLLHIL 
ENIVHFFFHFLY*GPNNLFFFYFAGTRL 
I QRS WK * PMI I + * VHVQ I KREGQ I FNK* 
I FCMIGRGG* ITCGQEFETSLVNMVKPC 
L.Y 


7477 


21378 


A 


7537 


116 


224 


LHTHTHTHTHTHTHTHTQFNKbP * LF I P 
CGHIPRRY 


7478 


21379 


A 


7538 


1 


383 


WHERTHSRI I ILFQGLjETLLPLI AF* * L 
LASLANLVLPPTINLLGELFVLVTTFS* 
SNITLLLTGLNILVTALYSLYIFTTTQ+ 
GSLTHHINNIKPLFTRENTLMFIHLSPI 
LLLSLNPDIITGFSS 


7479 


21380 


A 


7539 


31 


332 


DNYLSSYSSSGSWMRELILASQGIRWEP 
IVDRTPSHHRTHTHTHTHTHTHTHTHTV 
IiFRLGPWTHAREPRGHICGN*IRNQSP* 
KNPRTRGKNLRTPHGW 


7480 


21381 


A 


7540 


199 


15 


NNVQ I K* QFEHTKPTPFLPTL I ALTTLL 
LPISPFILIIL 


7481 


21382 


A 


7541 


337 


157 


AEIVPLNSSLGNRVRLH*MESYSVAQAG 
VQWHDLGSVQPLPPGFKRFSCFSLiIjSSW 
DSRLYFRCHD 


7482 


21383 


A 


7542 


28 


313 


RARIGS DQCLS AGTQ WENKSQ I S GDFM 
I'JNDFRS * S VTRVGVRWRNHGSL* PWIPG 
IjR* SSGFGIiPNCWDYRDRNEWGGKKEV 
TCPTVNLQKSQ 


7483 


21384 


A 


7543 


493 


180 


GGFRGSWLVPRDGGQGVGFCSLPPPPSR 
VKQFFCLR FPRR WGGREGS PRPSYFFFP 
*EKPRFFFLGRWLNFLGQGVGPPWPSQ 
RVGFPGLT PGPRAS FFFNL I 


7484 


21385 


A 


7544 


403 


426 


FF*KFFITFHTLFFFFFFFFFFFFFFFF 
FFFFFFFFFLAPPPGGFFFSFFYQKIFL 
KFPGQPEKFALPFGGGFFF*GKFPQFFF 
CFLVFFFFFFLFLENGVSLYCPGRSRTP 
GLKLSSRLLASLRAGITGVSH 


7485 


21386 


A | 7545 


3 


335 


RHYSDYPDGYST*NILSSEDSLIS1jTAA ' 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibte 
nucleotide deletion, \=possib)e 
nucleotide insertion 














LLMVFMI * EAFAYRRTGLI A* EPSMNLE 
*LYGCPPPYHTFEEPGYIKSRLKRKESN 
P PKL VSSQPHGLHD FFKKKKNRGGP F 


7486 


21387 


A 


7546 


424 


140 


RRVKFVGGGAPQGPPPKRGVLPKIPREK 
VFP p p PG * NGP PGPAFKTP PKKKKN I S S 
PPPGNWAPPGGLLKGPPFFFFFFFFFFF 
FFFFFLNPLK 


7487 


21388 


A 


7547 


2 


348 


FCHVAQSGLiKLIiASSDPPTJjASQSARIT 
GMSHHGGLK* GE FLG PSSNKGALLLWEA 
TWLQSFMKSASLFVKPRPTFVYFVFAFV 
GYRD I YFTASSL I KVI QEGLKYTMKCAP 
LTL 


7488 


21389 


A 


7548 


2 


90 


RKQA YTHTNTHTHTHX-HTH * KKTG IHAH 
KHTHTHTHTHTLKAEKQI EGGK 


7489 


21390 


A 


7549 


256 


405 


CELYSGKEMELVFGLFIFTGVTRLETFY 
KKECS S FFHSQSRGHKLQP S I LNEWWH I 
RNPLGIVTHFLLL*KTGVAEYKNSLNVVH 
HPFFLSYAVSFLLQGWSFAMLSKLGSNS 
*VQARCLPQSPKVLGLKESPQERTVNVS 
S IRGKKWS WYLDYLFSQGLQGLKLFI R K 
SVHHSS I PRAEGINCNHQY 


7490 


21391 


A 


7550 


2 


362 


ILIMNTLLALLLIIITF*LPQLNGYVEK 
STPYECG FDP I S PARVPLC I KF FIAAIT 
FLLYELE I ALLLPLP * ALQTTNLPLIGM 
SSLLLI 1 1 IiALSLAYE * LQKGLD * AELK 
EEQKTLQC 


7491 


21392 


A 


7551 




317 


TTFDNSALLFFWDGGEIEFHSVSQAGVQ 
WHNLCS L*PQPPGF KQKL I L PQ I VKENV 
SKISSQLLFSRVNINISPSEQCITPPSL 
QLARIGIFYAAKIHLTKGLRG 


7492 


21393 


A 


7552 


479 


183 


YLFKVHTTASHSNGACTGPTVGDIIVSS 
TL* SMTEQVTMPtiASAVSDGTVPSVRTA 
SRGSEQAAESMVSSEHIEILEHAGELVI 
ASPEGQLEVQTVIV 


7493 


21394 


A 


7553 


454 


105 


XPPWGNKIRPPLFPGRKVPPPRF*NPVR 
GSPPKK*FVFPVGRVLGTGVTPliSPFLK 
TTPLLLWD P PLSQPPGGVKP ENS LYPG K 
KRFR* PKFPPCPPAWGTKRKPPLQKKKK 
RQVL 


7494 


21395 


A. 


7554 


439 


1 


LLKRCVRKDS PP PQNKI FFF I LKKFVFF 
FAPY*VRKFYFLTAHFGKRPPQIYIFGP 
PPVFLI FCCFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFNKQVFI 
ERLLCARHCFRPPPEEKTKNKIEAANPN 
SSGEER 


7495 


21396 


A 


7555 


323 


82 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFR* *AYL 
VRSPLLFIFVFSYF*IjI*HVYHRYSVYI 
ELDVLLTGSKIIIil 


7496 


21397 


A 


7556 


194 


412 


KMYRTSLGHL.VEAKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKDSRGGW* KKFGGG 
HNTRGVK I I FFFS LGG 


7497 


21398 


A 


7557 


414 


219 


KGFKKLFPPPVF*FFLCPPPIFLKGFP* 
GRKPPPPKNPQGGPPFFFFQTPPPLFFF 
FFFFFFFF 


7498 


21399 


A 


7558 


377 


25 


THVGGVFWGKFF FSRR VFFFFYHL IQ I L 
SPPPQKRPPPKGTPEG*IFPIjFKEINFF 
FF*DFFFAPPPFFFFFFLFFFFFFFFFF 
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FFFFFFFFFFFFFFFFFFFFFLKVKTGV 
YCVPPIAVILYR 


7499 


21400 


A 


7559 


386 


2 


FFFFSFGEIFPPPGISGFFPPFPL*NFF 
FPLRPLFFLGGGAPFFPPPKKGFFPKIP 
RGVFFPPPIjREKFFFSPPP*IWAPPGFF 
LKGPPLFFFFFFFFFFFFFFYRKARTKP 
I CL WGDVS PS KH FQC I 


7500 


21401 


A 


7560 


1 


396 


KICHR ILRPI KKVK+EKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKGGGAS 
LKI PSGGPQLPRGGEKNPPPFLGGEKKP 
PG VFLKKHLFLG GGHFGPPPPKKNSPWG 
EKKNFWGGGGKKKQNIPGGKKIPSHKKK 


7501 


21402 


A 


7561 


1 


390 


NYVFKEFSDKLGAVAYTCNPSTL.GG*GL. 
WIT*GQEFETSLGNMVKPYLYQKYQKKK 
KKKKKKKKKGGGPFKKTLGGAQFNGGGE 
KKFFFFLGGVKKPPRGFLGKNPFLGGGK 
MGAPPPPKIKALGEKKNF 


7502 


21403 


A 


7562 


406 


182 


FVFFFFFXXXFXXXXFXFXXXF*TXXFF 
FFXXXXFFFFFFFFVCFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFF 


7503 


21404 


A 


7563 


2 


113 


FHFTSKHHFGFEAAA*YWHFVBW*LFL 
YVSIY**GS 


7504 


21405 


A 


7564 


313 


97 


SPTYPCYIRNSAQSFSLIAKSFCSHPHT 
YHHDPHLEKSLTYCPQSPS*PIIPPVTC 
TYINRWPEATEEPQNK 


7505 


21406 


A 


7565 


3 


144 


DAWVFVATGFHGLHVIIGSTFLTICFIR 
QLI FHFTSKHHFGFEAAA* YWHFVDGV* 
LFLYVSI Y**GSTFLTICFIRQLIFHFT 
SKHHFGFEAAA 


7506 


21407 


A 


7566 


1 


130 


F IRQL I FHFTS KHHFGFEAAA * YWHFVD 
W*LFLYVSIY**GS 


7507 


21408 


A 


7567 


434 


1 


PDLNKVIFFPHRKKKEPYKCHIVGKNLR 
NEMY S I QDTNS L PRRER *RKTRTCI I FP 
HKIKTSEHSFDIKEKH*NYVTLCKVLRK 
LSTNQRQS*IFLSTNKNDTLGRARWLTP 
AI PAL WE AEAGG S * GQE I ET I LANKVKP 
RGRV 


7508 


21409 


A 


7568 


441 


86 


NFSFREKGGEFVPPPPLiKI FFFPPPPNF 
FGGGGPPSPPPQKKVFFPKPPRGFFFPP 
PKKKKNFFPPPQKIGPPPSFF*TPPPPF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFL 
MFVCLY 


7509 


21410 


A 


7569 




155 


PTR PDLRS LRFEAKS PNSRRTLHKPGVT 
I *EAFASKRKVLIVEEPSINliE*LYDYG 
FFYYKIKEPVYFYSNQKKE 


7510 


2141 1 


A 


7570 


434 


1 


SPTEAGGKNFPKKNPGRKTPPKKKKNIF 
FSPLTPKKFFFPPRG * ILGGGGGPNCPP 
PKKGVFPKNPQGVFNTPPKKKKKKFSPP 
GKNGGPPGVFLKGPPPFFFFFFFFFFFF 
FFLRQSHTLLPRLECSGA 


7511 


21412 


A 


7571. 


493 


196 


S SLDCS * SLQGS I * FLTLMMLAVDFVDV 
LY*V*RVLYVLVQVAITKCHIVSGLNNR 
HSFLTAVKARKSKIKVPADLVPGEVSLP 
SLFS 


7512 


21413 


A 


7572 


1 


239- 


KGLT I LDAIKNI CDS * KGVKI L I FSEVW 
KKLVPTLTNDFERFNTSVEEVTAHWET 
ARELELEWPQDVTELPQSHDQT 


7513 


21414 


A 


7573. 


391 


32 


FFPQPSPPGVKGEFCLNLPNNWGFRAGP 



1012 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteinc, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G/utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stt>p codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














PP PGYFVFLVNKGVFPVGPPGFELPS PS 
*VAPPASQKGGVSGVSPCPRVWVPFFKK 
KKKLRGVEQRCTMHLLVYSKNVPVKPQK 
KSSIGSL 


7514 


21415 


A 


7574 


2 


383 


PRVRS TAFS S I AH I TRDVNYG * IIRYLH 
ANGAKKKKKKKKKKKKKKRGGALKKKPG 
GGQKKGGGEKKNFFFKGGEKKKPGGIFE 
KKTFFGGGENGAHPPQKKKTPGGKKKIL 
RGKGGKKPLYPWGEK 


7515 


21416 


A 


7575 


402 


69 


KIFFKNSPLWKISHPPTYRGFFPPFPP* 
NFFFliPGPYFFLGGCSHLGPPPKKGFFQ 
KIPPGLFLSPPFKKKIYliFPPRIFLiAPP 
GVFLKAPPPFFFFFFFFFFFFFFFFF 


7516 


21417 


A 


7576 


1 


119 


LI FHFTSKHHFGFEAAA* YWHFVNW* L 
FLYVSIY**GS 


7517 


21418 


A 


7577 


219 


69 


■ NDISANTAIKKFWKLPKCPTTEGWLNT* 
LYIHLVEYYATLKVCGDLYVR 


7518 


21419 


A 


7578 


3 


287 


HASAQLNGYIEKSTPYERGFDPISPARV 
PFFIKFLLVAITFLLLELEIALLLPLP* 
ALQTTNL PL I VMPS LLLS I I LALRLAYE 
*LQKGLE*AE 


7519 


21420 


A 


7579 


10 


125 


ALQTTNL.PLWMSSLLLII ILT*ALQTT 
NLPLWMSSLLLI 1 1 LTL I LAYEGLQ KG 
LN*AE 


7520 


21421 


A 


7580 


3 


299 


DAWVRLLDFDF* VELPTEARIRI I ITSQ 
DVLHS * ADPTLGLKTDAI PGRLNQTTFT 
ATR PGVY YGQCS E I CGANHS FMP I ALEL 
VPLKI FE IGPVFTL 


7521 


21422 


A 


7581 


3 


116 


AFAS KRTGLGVEEPSLTLE * LGGGPPP * 
HTLEEPVNI 


7522 


21423 


A 


7582 


398 


2 


RVFPAPKNSLQIFPPLFFFWGSPWKFFF 
TPFPHFVFWFGNFKFNPFFAFLFRLPFP 
GEKIPLVFFFFFTD*NFVENFFFFLSLL 
FFFGFFFFFNM*S*LF*SPANFNV*VTH 
LFIFMVFFHPDFWSFGPILW 


7523 


21424 


A 


7583 


288 


9 


RGSKNLEKNSFPYKVNNHGNGSPLFPHL 
FFSPRGVGPFFFFFETRSRSVIQAGVQW 
HDLSSPPGLK*SSHLSIQSSWDY*RAPP ' 
HPANFVYF 


7524 


21425 


A 


7584 


1 


124 


RDGGFTMLPRPILLTGLKQSSCISLPK* 
WDYRCEPLHLASC 


7525 


21426 


A 


7585 


339 


76 


THFGHNG IS VS PKKKKI PRGGGP P PLFP 
PLPRVGGGDSL*PRKGGFN*TKPAPFPS 
SWGKKKELPFSKKKKKKKRKKGSISTPF 
SRIE 


7526 


21427 


A 


7586 


205 


87 


SISLSLS LSHTHTHTHTHTHTLSHPRC V 
PQPIGLPISKWAKVKELIT*R 


7527 


21428 


A 


7587 


353 


256 


PHPSVTTLVTSQQYQDPPPAKRL*LAEG 
SDDC 


7528 


21429 


A 


7588 


159 


195 


QICRHRWHINFLMSMGSIMITPPSFLF 
LNFL* SYVAQAGLKHLSQSDPPASASQS 
AEIRGMSHNAWPHF* YFVELVHFIYINK 
FVGIELFILTS 


7529 


21430 


A 


7589 


76 


205 


RKKPEPEKTCFDNITGNTYPCLSTYEHD' 
SD*YYVNRT*LCAER 


7530 


21431 


A 


7590 


218 


20 


LSLYLASHLWLKPPHPLEWAPSKSSLRL 
GTVAS P CNPSTLGGRGGWVT * GQESETS 
LANMAKPCFY 



1013 



JSDOCID: <WO 0164835A2_I_> 
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PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
LSSN 
09/515,1 

26' 


.Predicted 
beginning 

: nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutnmic Acid, FHPhenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7531 


21432 


A 


7591 


142 


12 


QIDYYYYIjFI*RKGI j TLMLRLVSNSWLH 
IILLPWPPKVLGLQM 


7532 


21433 


A 


7592 


2 


224 


QIIEALLITILLGHYFTLLQA*EYSKAP 
FTISDG I YGSTFFV.TTGFHGLHG I IGST 
FLTI C YNPQLI FHFTS EHKFGLQAAP * Y 
WLFVDVARLiFVPAS I Y* *GSLFYPPTSL 
RVLQGSLHHFRRDIjRbNIFCNHRLiPRTP 
RHYWLNVPHYLLQPPTNISLYLRT 


7533 


21434 


A 


7593 


1 


323 


HAYGNTLVA*LWTMTF*LPLVDGYIKK 
FTPYEGGFEPKSAARVLFSIKVFLVAIT 
FLLSDLQSALLLPLP*AVQTTNLPLIVM 
AALLLDI IIiALSLAYA*LLKGLE 


7534 


21435 


A 


7594 


1 


104 


RACVCVCGVCVCVLCLLKMCVCVCWCV 
CVFCAC * KMCVCVC WCVCVFCAC 


7535 .. 


21436 


A 


7595 


383 


43 


VFISETLCEHVAWSQTTIESLGLGTVAH 
TLNASTLRG*GRMIA*GQ*SKTRRGNMD 
SISSHSQTIAQSPHLLGRHQPKWCMCVQ 
AAEAVLKGQVETDASFQIQKIPINSLIN 
L 


7536 


21437 


A 


7596. 


2 


331 


WPPTGITPLNPLEGPLLNTGGLLA*GVS 
MT*AHHSLGENNRNQVIQALVITILLGL 
YFTLL.RASEYFESPFTISDGVDRSTFFV 
TTGFHGLHVIIGSTLLSIWFIRQLICNF 
TSKHHFGFKAVA* YWLNFTKYLVHPPTN 
M 


7537 


21438 


A 


7597 


384 


23 


LFKTKQYRFLLPPFPLKIFFFPLKA*IF 
LGGFSHYFPPPKKGFPPKSPPWF* IPPP 
YRKKHFCSHTPKNLAPRWFF* KRPPHFF 
IFFF* * RWGLAMLPRLFSNS *AQAVLLL 
QSPEVLGL 


7538 


21439 


A 


7598 


401 


189 


R VTjP C FAGW S*TPGLKCPT CLGL * KCWD 
YRRE P P CL I R F YL FVYFIjRQGIiAMLPS L 
VLNSWP*VTLLP*SP 


7539 


21440 


A 


7599 


81 


368 


KCVIYPFLSITLGKAKYDFFFFIiEKELA 
FFPQGEMQGGNLG*LKPPPLR*RDFSCL 
ALPRGGNGRGAPPSPTNFGFLGGNGVYP 
SGPGGFETPDLK 


7540 


21441 


A 


7600 


264 


25 


AVEHPQLRLFCSHY*NNIVEERLGVLAR 
TCNPCPLEDKAAWIT*RHEFETSIANMV 
IFCLDIGSKSFLSMRFSRCLLWKL 


7541 


21442 


A 


7601 


1 


196 


WEPSLVGETNVNSFNQKYINWPGAVAHT 
YNAGTLGGQGGW I T * GQELE I TLSKKVK 
PSTYFTNTR 


7542 


21443 


A 


7602 


315 


82 


TCTQVF IALFITAKKFKLICPLTDNWIS 
RVWYVHTMKHISAI KRKAVIHATMWVKL 
ENIILSEIKQSQKATGCMISFI 


7543 


21444 


A 


7603 


334 


118 


PKI KTEGGP PPKKNAGGMYK* KKFILVL 
AAPPHPPFGVFSVFSFYSFWFFRFFFFF 
FFFFFFFFFFFFFFLR 


7544 


21445 


A 


7604 


3 


288 


DAYTT*NIL,SSVGSFISLTAVILIIFMI 
* EASS S SS S P PGGLRPRAFP VRP PGAGL 
VF*VSVPPDLPQCSLAGLQVLRFMEr J AR 
LAPHIRWIIQ 


7545 


21446 


A 


7605 ' 


354 


33 


EGRARYTRVPQRPRFHLLGIFFAPQLGK 
KKKFS F S KKKKKKNQLWLiGAVS HANNLS 
TKGG*GRGSRFT* A* EFRTSQKL.GTKGN 
MVKPISTNTFKNERGGAELSQL 


7546 


21447 


A | 7606 . 


401 


161 


YLVSHLLQIILPPALFWIFFF*DRVX,L 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide, 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophnn, Y=Tyrosine, 
X=Unknovro, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














CGPGWSAWWSYCSLCIiLGSNSSPCQSL 
PSSWDYRHLLWRFLMPRG I LCVYV 


7547 


21448 


A 


7607 


354 


50 


SGAQDPAGQHGQTPFSQKVPKKVYPAWW 
CMWPATQEA*LKKSLEPWRRRFLIiREA 
FFVYLG VWL.SNGE PPPLGG GGS VFLEAL 
EKSVGFSSLiGFGWVFFF 


7548 


21449 


A 


7608 


370 


115 


TIKNICNSWEVKIPRFTGVWKKIjIPATIj 
MDDLEGFKTLVEEPTEDW* ITGELEVE 
PE^^/IEFIlQSHDQTS^DEGLIlLMDEQRR 
W 


7549 


21450 


A 


7609 


120 


358 


TTAVYLGHYCQSP+ I LNGTVNT* PPWH 
KDPILFLTPFPCFHASSAMNLQLSHISC 
NSKATPHPbGYQQTYPPLTVHMT 


7550 


21451 


A 


7610 


3 


164 


HE * AS I P * NTHTHTHTHTHTHTHTHFYK 
EENLSHLWNI SC I FLGAHKGKKKMN 


7551 


21452. 


A 


7611 


85 


324 


FRFFAFFFF*KKISFCPQAGGQGGILPS 
LDPPPPGLKKFSGLTPPRSWN*GPCPPP 
GLIGVFLKKRGFPIjVGREGPNIjWT 


7552 


21453 


A 


7612 


268 


317 


F LENC FNPGGGGCS QP KLG P CP PAWGAK 
* ASLS KKKKNKNKN* LSYKAPSYGYKGK 


7553 


21454 


A 


7613 


101 


339 


AVPLTMVKIHALWKRVWRFHNKSKL*LP 
CDPAISLVSMYPKEMKSVCLKDVCSPRL 
ITGPLTIAGMWNPPNRSSMDYSG 


7554 


21455 


A 


7614 


366 


208 


LELLTSGDLPASASQSAGITGISHRARP 
GTLFF*AVNGGGNQVFLFLRVXjNGL 


7555 


21456 


A 


7615 


30 


328 


NYCLDFHGETELGTTHFLTHPSTRAMVF 
E PYCQAL * I L+GVINT * PTVLHRNRSHI 
KAACLCLiQARTWNLiQISHINWKY*TTP 
HPL*YQQTYPALEYI 


7556 


21457 


A 


7616 


39 


159 


TPGLKQFFHLGLPRCWDYRHEPPRPCPN 
MYIiI*ISF*CV 


7557 


21458 


A 


7617 


290 


91 


KMSFLIFFRDMASLCCPGWT*TPGLKQS 
SCFHLPRSWDHRCTSLHLAVKTFIiYYFL 
KM FY YQNVWH 


7558 


21459 


A 


7618 


37 


246 


TQQEL VY I Y R Y I DTH THTHTHTHTHTHS 
HTHYWI YTWKHTL.KGPKLF* EKTKMGST 
P I ISWKERLSLiYRS 


7559 


21460 


A 


7619 


23 


190 


IPPQPCPFLLSLFCT*IDMSCNMCACVC 
VCVCVYVCVICDMVLLPFCFCKLECSK 


7560 


21461 


A 


7620 


367 


2 


FFSPKALIFLGGGGPIFPPPKKSFFSKI 
PPGGFFPLPGFLKPPPPPLF*NPPLKKK 
NFFFPPPENWAPPRVFFKGPPPFFFFFF 
FFFFFFFWSSRFLIFPQFFFLNTJKIGKK 
KIFRHLGAG 


7561 


21462 


A 


7621 


6 


25 


S VI LQS I I I F I ATTNLLGLL PHS LTPTT 
QLSINLAMAI PL* AGAE I IG FRS KI KNA 
LAHFLPQGTPTPL I P I LVI I ET I S LLI Q 
PIALAERIiTANITAGHLLMHLIGSTTLA 
ISTI*LLYYNP 


7562 


21463 


A 


7622 


383 


67 


AFLNPGKKPFPPGFKRFFCPPPPKKWGK 
NRGPPPRGKIFFFFFFVKTKFFHVGPGG 
F* ILTSGDPPGQQGGLGGFFFFFQIGGT 
GDLNIiSHKSPMPFTINTHNKG 


7563 


21464 


A 


7623 


372 


35 


TGGFWGVFFFFFFKQGFPFKSTGGNLGP 
KNPCPKGPRVSPPPPPF*NRGPQGGPPP 
PGKKFPFFFFFFFIETGFPHVALAGLEL 
PGSGNLPTSVSQSAGIPGISTWPNGVFQ 


7564 


21465 


A 


7624 


287 


6 


LS G VEWVDHMIN I CL FFKKXiP P CV * KWL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 

NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E= Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidinej l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YQPVAVAHACNPSTLGGRDRWIPGGQEF 
KTTCLSLPKRWAHRREPLYLAFCPFYNG 
IGCLLIIGF 


7565 


21466 


A 


7625 


271 


277 


NRGKGKTGSLGILEKNPFFGGGPFWANP 
PP*GliKFWGGVGPKRPPPKfCRVFFQNSQ 
GTSFPLPPVLKSGPGLVLKKAPQKEKNI 
TFQPPGKFGPPRGSLKRAPPFFFFFFFS 
LRKSSVIEK 


7566 


21467 


A 


7626 


2 


269 


LLGELSVLVC* FPPKHTTFLLTGLS ILF 
TGLYSLYILTTTQWGSLTHHIINLKPLF 
TPKNTLMF I HLYPI LLLS LNPD 1 1 KKKK 
KGGAF 


7567 


21468 


A 


7627 


397 


164 


FPPPKKVFFQKNPKKKFSPPRVF*TPPP 
PPFFLPPPKKKKFFFSPPP* IWPPPGIF 
*KPPPLFFFFFFFFFFFFFFK 


7568 


21469 


A 


7628 


3 


324 


DGMNVSTL YGATGSHGLHD I MGSALLT I 
WY IRQLILKFTSKHHFG FEAAA* Y WHFV 
DVD * LFLCGS I Y * * GSPGGPKFSRGGKG 
IAVPHWGDD 


7569 


21470 


A 


7629 


2 


246 


SGLGTTLS*KNISVLLTGLKILVTGLYS 
VYI FTATQWGS LTHHINKI KP * LTRENT 
LMFIHLCP I LLLS LNPD IS YPGFTS 


7570 


21471 


A 


7630 


112 


290 


LISGKGQWTQMRLPVLVTLFTDSFSK*K 
NPSTLGEQCGRIT*GQELETSLGNIVRP 
HLY 


7571 


21472 


A 


7631 


248 


2 


KVPASGKRPE PHWYP YHAVPGCGR I PAT 
AI WTQKWYQGFPPALS PRLE CSGAISAH 
CYFGFPGPSDSPASPS*LAGTTGTRP 


7572 


21473 


A 


7632 


383 


106 


GWGQCFKPAI PALERPNPQECFRPGVLN 
QP * PQNKS PFFFFFFKKKKKYI YI CMW 
HAYNPSYSGG*GGRIT*AQKFKTSLNNT 
VRPHLYKK 


7573 


21474 


A 


7633 


435 


133 


YACLGLPKCWDSRHEPPGLANFCIFSKN 
RGFPISDRWINSWPRVFG*PKPPKMGG 
FQTLVQGASLGFLLIFLKTGFPFFSQHE 
EQGWDTSSFQHRFFRGK 


7574 


21475 


A 


7634 


1 


209 


NAYRIVILCQKLFPLLSGKMKFKKTRCW 
LGMVAHACSPSTLGG*GGRIA*AQEFGT 
SLGS I LRPCHCKK 


7575 


21476 


A 


7635 


117 


495 


STSFIDKVQLRHSFLFFSLFFETKSCSL 
TQTGVQWHDFGSL*PPPPRFK*FSHLGP 
LSSWDYRHAPPRPASFFVFLVEMRFHHV 
GQAGLELLTSGDPPASASQSAGITGMSH 
HAIiAHLYFSSGKS 


7576 


21477 


A 


7636 


1 


241 


LPGSSDSPTSASQVAGITGMCHHAQLFF 
FLAF*LIFNF*FFFGNSFFFVPQAGGHW 
GIGG* LNPPPPGLKRFSCLTLWGG 


7577 


21478 


A 


7637 


3 


93 


EAEAGES LEPGDRGCSKLRS CHYTPAWQ 
QSVTLVSKKETKNTKTSEVPFGVLNLLY 
KVSINILIFKEIFYQAPLILV*DNPDSS 
TLS SALQ PGQQS KTVSNKTKQKTVS * PG 
MRV 


7578 


21479 


A 


7638 


46 


228 


GIPHI QPS KS VFNNVHGPGTVAHTRNPN 
TLEY*GGSIT*GHEFETSLGNIVRLPPP 
SLQSI 


7579 


21480 


A 


7639 


1212 


1023 


KNHSNE W I KKM* Y I CTME Y * AI RRNE I M 
AFAAT*MKLETIILSEVTQEWKTKYCMF 
SLVSGS 



1016 



_0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ W 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


iVI 
cth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence <A=Alanirie 
C=Cysteine, D=Aspnrtic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=JLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


7580 


21481 


A 


7640 


479 


318 


AEFLNLGDGGCSEPR*HHCTPAWATRAK 
FHLKRKKK 


7581 


21482 


A 


7641 


470 


172 


RNIRPRENQYPVFMVAFFLIAKNW*QPK 
CPSSEEAMNKLRCIHTTAVTiHSSEEEQP 
TDTHNNSDLRS I APSWRSQTRAAAHRMI 
PWHSGKGKTPSTENA 


7582 


21483 


A 


7642 


2 • 


201 


AGAPPPAWLPPCRLISDOASNQRDSVG 
VGPS EPYATVRKYLEIjLLCQMHHDMCTY 
RFSIRIVLNL 


7583 


21484 


A 


7643 


521 


385 


ggspleth*ytsqgggipytdltghhps 
qgriqeapklthl 


7584 


21485 


A 


7644 


331 


3 


dlvpkkggggkkttprikkfppkgedff 
fftphyfwfwvffwggafffffffff 
ffffffffffffffffffffft*dyyis 
f*eerfnkfyrcqstlvqightsnl 


7585 


21486 


A 


7645 


88 


367 


hcs flllgkkkkkkkkkkkkkkkkkkkk 
kkkktkkneyknllggapp* kppsgaly 
kyrrenfipfsfervskypprdlckknl 

FWGGLNIPS 


7586 


21487 


A 


7646 


378 


37 


FFFPXFFFFFSPPPXFFFLFXFFXXPFF 
XXSFFSLPPLPLXXPPPPPPLFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFNI*TI 
VL 


7587 


21488 


A 


7647 


34 


293 


TGIPGVISAIFMI+EAFVSKRKVLIGKK 
PSINLE*LYGCPPPYHTFEEPVYIKSTQ 
KRKESNPPKLGSSQPHGLHDFFKEKKKI 
IK 


7588 


21489 


A 


7648 


22 


106 


KNFLPLiTLTLL* KNFLPLTLTLLI * YGS 
IPITISSIPPQT 


7589 


21490 


A 


7649 


1 


231 


FDRT I S KD I S K * MVLVS KKKKKKQK KKK 
KKKKKKKKKKKKGGGPFKKTPGGAPKKP 
GEEKI I FFFLGGGKKNTWGFF 


7590 


21491 


A 


7650 


79 


286 


WSRCLVSLRNPCI IKTCS*AKKKKKKK 
KKKKKKKKKKKWGGAFLKIPWGGPIFPG 
GGKFFFFFFGGGY 


7591 


21492 


A 


7651 


57 


375 


SGKSEFHRVPQWPGTGADACNPSTLGGR 
GRWIT*GQEFETSPQGDPISTKKKKKKR 
GALLKDSLGGPNLPGFGNLKVFS FRGGI 
LKPTWEFWEGTFILGGEKIGPN 


7592 


21493 


A 


7652 | 


3 


357 


IAFLLALS KI THAS I P I PVS S PSKSPRS 
KGTKKKKKKKKKKKKRGGPLKKTL.GGPK 
INGGKKKKIFFFKGGEKKTPGGILEKKL 
FLGGGKMGPNPPKKI KPLGEKKNF * GEK 
GEKKP 


7593 


21494 


A 


7653 


113 


378 


MGAFNLGPFLLMGGGNPSGPNGHWVLGA 
GSYLGPGKTPPKKSQKGGPPPDL*GRGG 
QCSPGTKPGGGGEKTGLAPPFTKSPLGF 
LQKK 


7594 


21495 


A 


7654 


341 


147 


NFFFSLKPFIFFGGFCPIFPPPKKSFFS 
KIPPWFFSPPF*EKIFFFPPPLNFAPP 
RVFFKGPP 


7595 


21496 


A 


7655 


395 


3 


FFFFFFFFFFFFRGKSE*FVIiFPVSPAP 
SLRG 


7596 


21497 


A 


7656 


2 


162 


ES KGYTHTHTHTH I YTHI YTHTYTPREH 
* PKKADVAMLILNKLFFKPRCVALL 


7597 


21498 


A 


7657 


333 


28 


LEFFFNFFFSYFLFFS*RVWGPERPPPP 
LKKAV* KFFLFDIFLFFFFFIFGLGFFF 
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SEQ JD 
JVO: of 
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sequence 


SEQ ID 
NO: of 
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sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last simino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystcinc, D=Aspnrtic Acid, 
E=Glutamic Acid, F=Phenylalhnine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vnline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, "=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFLEHIRLLMFVTFSSHR 


7598 


21499 


A 


7658 


1 


137 


HSSLGDRVRPCLKKKKKKKEKKKKKKKK 
KKKKKKKKKKKKGGGLF*EKKKKKKKKK 
KKKKKKKKKKKGGGLF 


7599 


21500 


A 


7659 


236 


32 


IIFLI KFLYF* FI F I * F* FFLTFFYFKR 
LKILFFFFFFFFFFFFFFFFFF*KNFIK 
NIIYFFIFYLFYYLKFFIYIKTFLFLKI 


7600 


21501 


A 


7660 


301 


65 


KAPPKIFWFFFFVFFFFFVSFYIFFFFF 
FLFFFFFFFFFFFFFFFFFFFFFFFWKS 
LAAT * FLRG FETF * LAYI LKTT 


7601 


21502 


A 


7661 


70 


273 


KHSPPHKPASD*NTELTIKKKKKKKKKK 
KKKKKKKKKKKKKKKKKGGGAL I KNPWG 
GPIHRGGEKFFFSFLRRD 


7602 


21503 


A 


7662 


1 


353 


ILIINTLLALLLI IITFGLPQLSGCLQK 
STPYEGGSDPISPARGRFSLKGPLQAMT 
FLL*ELEIAIxL*PI>P*ALQTTNLPL>IDM 
GCLLLN 1 1 LCLS LAYE * LP * GLDCS RYH 
SQRIQ 


7603 


21504 


A 


7663 


1 


131 


FI RQLI FHFTSKHHFGFEAAA* YWHFVD 
VA*LFLYVSIY* *GS 


7604 


21505 


A 


7664 


2 


192 


PLSQSLY*LLAADLLILT* IGGQPVSYP 
FTIIGQVASVLYFTTILILIPTISLIEN 
KILKWAY 


7605 


21506 


A 


7665 


356 


2 


TFFFLSSPGGSCGSPTPARPKKNPPPWG 
GGLPLFSRRGALFPKNFFWGGYPFFFLF 
*KKPPFFPPPGPQSPVTSPKDWPPLRM 
PPPRPHVRPLGLPKKS FSSPRHEKQVKK 
TKKRAA 


7606 


21507 


A 


7666 


312 


2 


GFFFFFFFFFFFL*DRVLLLLPRLECNG 
VISAH 


7607 


21508 


A 


.7667 


I 


233 


FWLSSNAEFDSANSCECLEVQRMIQDQD 
S FPTYH YFDM YVC I YV* * RS LAVS PGLV 
SNSWPQAILQPQPPQSLGLQE 


7608 


21509 


A 


7668 


174 


295 


IFFLPF CGEHSLAVLPRLVLNS WTQAIL 
PPLPP*ILGLQA 


7609 


21510 


A 


7669 


183 


330 


NTKF KL Y W S P VAMAHACN PRTHRGG W IT* 
CEEFETTLANMVNMVKPCLY 


7610 


21511 


A 


7670 


3 


319 


TSNTLLALLLIIITF*LPQLNGYIEKST 
PYESGFDPISPARVPFYIKFFLVAITFL 
LFDLEIALLLPLP*AVQTTNLPLIVMSS 
LLL.1 1 1 LPLALAYE * LQKGLD 


7611 


21512 


A 


7671 


1 


313 


ARGERERERERERERERERERESGGGGP 
TQTDCKGGRNT*RGRE I YRESE * DDRPP 
FLPTYRVNLQRPVGLRRLKGAGDKTFCL 
I LTLARDYVW PDYRMKRADHM 


7612 


21513 


A 


7672 


3 


91 


TRRERERDR*REREREREREIFREKNSQ 
S 


7613 


21514 


A 


7673 


1 


115 


DELANLFI I * KAGFPVLPRLVLNSWLQV 
ILLPWPPRLN 


7614 


21515 


A 


7674 


3 


232 


TRRERERERERERERERERERERERERE 
RYRE*DRERERERERDCVWGGGVISLSR 
AVAIiSGAPAVGRQTRERISR 


7615 


21516 


A 


7675 


1 


234 


ARGERERERERERERERERERERERERE 
RE I ERERERLTQRERE RERGRGVWHAPL 
AIERG* KPQGGFWWKEERVKKEPLGKLL 


7616 


21517 


A 


7676 


1 


262 


ARGERERERERERERERERERERERERE 
RERERERERERERERARAPQNL* ERWAH 
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NO: of 
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SEQ ID 
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od 


SEQ ID 
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26 


Predicted 
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nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


1 Predict- 
ed end 

otide 

correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K— Lysine, L^Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argiriine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno>vn, * v =Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TPPAPGVSLSWREHTRGGPPWGKKEGGW 
GRLGHPPPSILRAKGG 


7617 


21518 




7677 


214 


2 


QQASVAQAGVKWPDLCSLQSMPPRFK* F 
SCLSLLSSWDYGPPPPCAGLHIWRRNVT 
LALQGLPTTSLALV 


7618 


21519 


A 


7678 


3 


341 


HERHELEELIKTLFFFFFFRKKNHLCKP 
GGKKRGPIWFLKKFFSPPQKKPPPPPPF 
LREIGAMAPQPGGGLLI *NKTPFRPRGQ 
GGPKPPTPGKPNPPPPPKGNFPL.*GPNP 
QPPGSPTPPPPQKVIFPFEGHQA 


7619 


21520 


A 


7679 


■ 


357 


GTR I NTLLALLL I I I TF * LPQLTGY I KK 
STPYECG FDP I S PAR VPFS I KFFLVAI T 
FLLLDLEIAIjLLPIjP*ALQTTNLPIjIDM 
SSLLLIIILTLSLAYE*LQKGLD*TDFY 
SVGGEAN 


7620 


21521 


A 


7680 


365 


1 


PDASQ* HCTNGQSNRLLNLLI KALLTTA 
KIWIQPKCPPVDARIKKMQ* IPRMEHPS 
AIKKEILPSVATWMSLEDTVPSRISQAQ 
KDKYHTVSCMWNLRMLNAEPEGRPWPR 
GVQGWGAARA 


7621 


21522 




7681 i 


387 


3 


FFFKKEFCPRKKAKK*NWGPGNLGPRG* 
KNFPPPPPQKRGKKGGPPOGGEILGFL* 
KKGFPPMGRGGSKLWPLGFSPLNPPKRG 
GKKGEPPPPPQRGKPGGPPPKGLFFFFF 
FFLRQSLTLSFRLLV 


7622 


21523 


A 


7682 


107 


290 


ELNKRWGLGAVAHACNPST* * ELNKRWG 
LGAVAHACNPSTI>GG*GRWIT*ALEFKI 
NLDNKARSHLKI KKQHQQQKNSGLDA 


7623 


21524 


A 


7683 




348 


GTRERKPTWLWYHRERGMERDCSCVPGS 
SGISESRVWVQVGMEVYPAI I YLCLEPK 
YLVISEFSPAFRIWFLGLSWAHTCNPS 
N LGGQGGR I T * S QE FKTNLAN I VR PHLF 
FFLF 














GGPSIFPRLALIjGGGGPGGPPLPRGNFS 
FFNFFVKKGVFFPPGVFPFLPPGVLPPP 
PPFWAPG*PRPPNFFLiGAPGFFPFFFIjG 

ggffpvapggv 














TSE I YKKQLNRKKPNNPI * KQS S K 














FISKTAFQPGPWE 


7627 


21528 


A 


7687 


459 


298 


LSLP S S WNHMCALPHPASFFL * RQGIiAL 
LPRIiVSKSWAQVMLLPWPSKVLGLQV 














FPHHNVHEVIVRPLSPRYPIISHVTCTY 
RSRWP*ASEGSQKK 


7629 


21530 


A 


7689 


284 


163 


GTVAHTCN1«STLRGRGGWIT * GQE IDTS 
PAWPPKVLGLQA 


7630 


21531 


A 


7690 


83 


255 


KVDYVSIKSEFF*DRVSLYHPS*GTVAR 
S*LTTTPNPGIKQSTCPTVPSSWDYRHV 
LPLPTRTIFSVPTQSLATILI * ITLNLY 
INLGKIHFL 


7631 


21532 


A 


7691 


3 


193 


HERLDPADFS FNFFSVDVGLAMFPRLVL 
NS * HQ VI Lli WPLT VLD YRHELLR PAWG 
FYYLNFT 


7632 


21533 


A 


7692 


1 


364 


NFKSFFP * LFYLYHLLFFGFGLFLFFLI 
KLGSSLFINFSKKS VL * FTNWI YFS VLY 
FTDFCFIFIISFFGFSLIYFIIAVFPLK 
FIGFILSFI* ITVFYGQAQWIiTPVIPIL 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysleine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=IIistidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WVAEVRGSL 


7633 


21534 


A 


7693 


370 


151 


GKKYSVASPGKLWPPQGFFKTAPPFFFF 
FFLK*RCGFTMLPRLVLNPWPQGILPPQ. 
P P KVLGLQ VLRHH I WLR 


7634 


21535 


A 


7694 


3 


373 


WPGYTL.NQAYAK IHFTIIFI GANLTFFP 
QHFLGLSGMPRRYSDYPDAYTT*NILSS 
AGS F I S I TA V I L 1 1 FM I * E AFAS KRKVL 
IVEEPSQSAGITGVSHCARAEYLFIDRR 
DGLSLCWPGWT 


7635 


21536 


A 


7695 


406 


394 


KFF*KKFLFPPTPTFLLEACLFFFLPPI 
KGFLFKYIAPGIKTPPPKKEKFFSSIiKV 
LFSPPYFFFIPPPPIFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFRFEG 
ECWRL*W*FYFYFFKESPHCIP 


7636 


21537 


A 


7696 


2 


334 


ILINNTFLALLLLI ITFWVPQLKGY I KK 
STP YECGFDP I S PARVPFS I KFFLAAI T 
FLLFDLEI ALLLPLP * ALQTTNLPL I VM 
ASLLLI IIIjALSLAYE*LQKGLiD*AE 


7637 


21538 


A 


7697 


2 


95 


LNLTIYIILTTTAFLLLNLNSSTTTLLL 
SRT*NKLT*LTI YIILTTTAFLLLNLNS 
STTTLLLSRT 


7638 


21539 


A 


7698 


423 


326 


RQCLTMLPRLVSNS *SQVILLPWPLKVL 
GSQA 


7639 


2 1540 


A 


7699 


400 


32 


FHKEYESYFFSPNQPQFFFFFPHR*NFY 
VGVLiKKQPPKKKFFFLJLiFTPERFFFPFL 
♦KKNIFFFP* YFFFPLVIFFL*PPPLFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFSSNSFPFY 


7640 


21541 


A 


7700 


125 


407 


FINFSSTFWKPTTCNMQKHTPIARTKD 
LCITI FFFFLEKNFLFVPQVGGQGG I LG 
*LiKII>LRG*KQFSCFTLiPGRWTNGGIPP 
PPKNFL.KKFF 


7641 


21542 


A 


7701 


50 


225 


PWNMVKMSCWLGKVAHVCNPSTLiGGRGR 
RIA* AQEFETS * VTQ * DPVS VI KKRKKN 
FK 


7642 


21543 


A 


7702 


2 


146 


EIALLLPLP*ALQTTNL,PLIVMSSLIjIiI 
I ILALSLAYE* LQKGLD*AE 


7643 


21544 


A 


7703 


I 


93 


ILIINTLLALLLIilMTMGLPQPKGYIKK 
STP*ECGFDPIFPARVPFSIKIFLIAIT 
FLLFDLETALLLPLP*ALQTTNIiPLIVM 
ASLLLII ILSLSLAYE*IiQKGLD*ALLL 
IiIMTMGIjPQPKGYIKKSTP 


7644 


21545 


A 


7704 


2 


112 


GRVGKHHFGFEAAA* YWHFVDVG* LFLY 
VSIY*+GS 


7645 


21546 


A 


7705 


1 


325 


TAGQFLPKLSILLSYNPAITFLGIYPKI 
LKTYVYMKTCTWMFIAALFIWQTWKQQ 
SKLWYIQTIKYYSVLK*NELSSYENTSK 
KLRCILLRERSQYKKPPYFL.IPTM 


7646 


21547 


A 


7706 


2 


169 


SRSRAGT1AI*TINLPCTIjMIFTILIVL 
TI LE I AG AL IQAYVFTLWS LYLHDST 


7647 


21548 


A 


7707 


263 


346 


MLINVPLGLLFVGVI LS KES PS VDQGGV 
QWINLFSMQPPPTGFK* FSCLKA* ASDN 
LSPHEQYRLALSFLKLTL 


7648 


21549 


A 


7708 


343 


163 


PKEF* *RQGFFHVGQAGFKIiLRSGNPPP 
SASQNGKITGVNPLAWQTNNSI PPMAP 


7649 


21550 


A 


7709 


459 


144 


FWPGASSHAFDPTTLGGRGGRIA*AHEF 
KTSLGNIVRPPSDTCNPKVLGLLA 


7650 


21551 


A 


7710 


2 .. 


83 


NFLPLTLAL3 J I*YVSIPITISSIPPQT 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
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residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7651 


21552 


A 


7711 


19 


181 


KYVHTKACT*MFISSLFIITKNWKQPIF 
SSIGE*VNKT*YIHTMEYLLFRNKRK 


7652 


21553 


A 


7712 


392 


49 


SDIFNSDIFSHY* CSVF*SYLiAYNRYL.S 
FL * LFLEFI YSYP * IDHAFHCFLHPVS S 
FWLDFFLFIFSFFLFSYFFSYFYF*FYF 
*F I FNF I FNLFF I LFYF * FS FFSKCLKF 
PC 


7653 


21554 


A 


7713 


334 


208 


FFF*EWVLTTLPRLVLNPQAPAILLPWP 
PKVLGFTGVSHIKI 


7654 


21555 


A 


7714 


2 


408 


WNGMEWNGLKWNRLERYGMQWNGLEWNR 
TE*TRMVWNAMEWTRIELNGLEWNGMEC 
NGMHLNGLEWNGMERNGTDPEWNGTDPE 
WNGTDPEWNGTEWNGMEWNEVDSNGMEW 
NGIDSNEMGWTRMEWNGMESTRV 


7655 


21556 


A 


7715 


362 


223 


WWFMPIIPTLWEAEVGG*LEPRSSKPA 
WATWQDAI FKKKFKHQH P 


7656 


21557 


A 


7716 


2 


335 


LiIVIINTLiIjALLLIIITF*LPQIjNGYIE 
KSTP YECGFDP I S PARVP FS I KI FLVAI 
TFLLFDLEIALIiLPLP*ALQTTNLPLIA 
M S SLLIi 1 1 1 LALS LAYE * LQKGUD * AE 


7657 


21558 


A 


7717 


2 


142 


SITL*I>PQLNGYIEKTTPYECGFEPICP 
ARVPLSIKFFLEAITFLLFDLEIALLLP 
LP*ALQTDDLPLIVMTSLLLIIILTLSL 
AYE*LQKGSD*AYMPRPRPFVHKILLRS 
YYLLII 


7658 


21559 


A 


7718 


6 


92 


FFEAAA*YWHFVNW*LFL,YVSIY**GS 


7659 


21560 


A 


7719 


417 


164 


PLEPL.ISGRGLPQIAPPPKKGSPPKSPR 
WFFFPPP*KKKNYFPPPPKILPPPGFF* 
NPPPPFFFFFFFFFFFIFFYFFKTALPlj 


7660 


21561 


A 


7720 


25 


401 


THNS * DPNVJD* I PHYA* P* TSTVKKKKK 














TGGGKIIFFFFMGGKKKPLGDFLKKNFF 
LGGGNLGKHP PKKLSLQKKKKKFKGRGG 
KKTPLCRRGKKFS 


7661 


21562 


A 


7721 


388 


2 


APPFFFYFFFFIFFFYFFLGVGFSFK*N 
SKVFWISNFLKKILRVFICSLEKGINPF 
LEKCFYMFFFFVAFIIRLNFSLQAKFFL 
LNFAIjLPFPEIFFFFPFFFFFPQIEDFW 
HPYIEQVYQLSLFSLS 


7662 


21563 


A 


7722 


389 


71 


FFLPHQKQVFFPPPPFKIFFFFSRVFFF 
WGGWAQKPPPPKKFFFLKTPPGFFLPPP 
*KKKFFFFSPRFFIjPPPGFFFKPPPPNF 
FFFFFFFFFFFFFFFFFFFFLN 


7663 


21564 


A 


7723 


77 


220 


T P RRGWAYWIiM PAY PAILE A* AGGLLE P 
KSLRSTWPTWRNS rSPKRK 


7664 


21565 


A 


7724 


16 


258 


ISSFVFKRRSFAMFPRLVLNSWPRWLP 
PRPPE * LGLQACTTTFSFNFFDG IFLRP 
LKEDVTARDQRE I LYVFALF YYGGA 


7665 


21566 


A 


7725 


353 


111 


IFGAGPRFVFKAGGQGANSRSLKPFPPG 
NKWSFPFSPSKKWEGRG*PPRPPFFFFF 
FFIREGVSLCCQGWSQIPGLKQSS 


7666 


21567 


A 


7726 


2 


185 


MSMGHTRLSSAWTGKPPLSVEDDFEKLI 
WE I SGGKLEAE I DLD PGKDEDDLLLELS 
*MIDT 


7667 


21568 


A 


7727 


346 


143 


SQAFIiYL.ESFLLLLLLY*FLFFEMESHS 
VTQDRVQWRHLSSLQPLPPRFQQPSRLT 
LPSSWEAEVGG 


7668 


21569 


A 


7728 


411 


221 


NLRPPGSSYSPASAS* IAGIPGARHHTQ 
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amino acid 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla)anine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LI FAFLAE I LKNLAKL YL I PR KPN I Y YL 
PNCYINI 


7669 


21570 


A 


7729 


2 


303 


AAAS TNMLLFFLGRHL WE WLGHM VGL I 
FTFKKLPNQAGRRGSHL* FQHFGRQRWE 
DHLRSRVQDHPEQHS KTPSSLQKNKMSI 
ESNLLNIHKLFSTGRM 


7670 


21571 


A 


7730 


67 


65 


SICEYCFVSSVFVLKRLFSVMQAFSLEG 
CVSCQS * AFLQHYLFKKLYFLY * WGLTM 
LSRLVLNTWPQVIHLPWPPKVLGL+AFC 
GNEA 


7671 


21572 


A 


7731 


354 


238 


QS I LDDVAMVLD * QAEVF I VKMWRLL I Y 
VHEAKKIGLVK 


7672 


21573 


A 


7732 


33 


186 


RGALAVLSRLI LTPGLKES S CLDLPKGW 
D YRWE P PRPGCFF * LMVLVLS F 


7673 


21574 


A 


7733 


1 


181 


VNAGADCSSIGGVPFLQHKKCHGKDYEP 
RGITTLERSYVEETTEHLVSKSK*PLRA 
QINL 


7674 


21575 


A 


7734 


336 


80 


HKLKE PPGVFPVFP FKNLE FGPGPPKFF 
FKKAFSPILSFFFGKIFKIPRFRGENFA 
P*NFGKNS + KPRFS PPPPKKKGFFFFFF 
F 


7675 


21576 


A 


7735 


2 


163 


TPVSTGTPVPTLT*VPSPIIFPVSEKWA 
GCLHLCLNFTCTELRLLTSLLTI RS 


7676 


21577 


A 


7736 


87 


258 


KAPSVCLFSALLMLLRMSARTSVCTVRP 
LSPS*AVISPVTCTYTSRWPEATEDSQK 


7677 


21578 


A 


7737 


1 


103 


LDSKG I YMTLQPS KLEPKLEANVE IREH 
MLENS S RF * RDLYDSAAFKARTKARS KC 
RDKRAHVGEFF 


7678 


21579 


A 


7738 


193 


300 


GGLPPPPFFFFFFLKKRGFPWVTRGGPN 
PPPLEMGGPTPPKGWNYGGGPPP*FHPF 
GGVGPPISKGGGLGPPRVTQGKPLFFKK 
KKKKNGGGGKPPYSPFLGGVKQKKGVNP 
KGGGS KKPKFP P P P PPGGKKKKP FLKKK 
KKKK 


7679 


21580 


A 


7739 


150 


17 


GRVAQVWWLMHAI PAL* KAEAGRS LEPR 
SWTPDWGTWHL PISTK 


7680 


21581 


A 


7740 


1 


358 


SPPRPPPPPPPPPKEKFKKLENPPPPPK 
NPQKQNPPPKTKKKSPTTTPKKKPKIGT 
PPQTPKKINCS*KKKNNPPENKKGLNYP 
NKPNHPHAPPKEPKQREKPLVPTKKKKG 
PSPKKKK 


7681 


21582 


A 


7741 


235 


147 


F*FFLGDEISLCCPGWSLTPGLEQASCP 
GLPRRWNYRCEPPHSVKTIF . 


7682 


21583 


A 


7742 


131 


12 


AAST YGQ * FKI CGA ILRLMPI VI E FI PL 
KILQIGPVFTL 


7683 


21584 


A 


7743 


1 


184 


NQYPWCFCNHMSMGKESKTLNRSGMVAH 
TCWPSTLGGQGGRIT*A*GSKSSLGDKV 
RPHLY 


7684 


21585 


A 


7744 


347 


202 


SLFVETGYHYVD*AYLELLASGSPPASA 
SQSFGISSVSLCTQPMFQFE 


7685 


21586 


A 


7745 


138 


263 


KKRKTYIHTKTCM*MFIAALFIRVRKWK 
QS KCHSADEW INKF * Y I KI CLS I HQLND 
IWIASTF 


7686 


21587 


A 


7746 


358 


169 


AAAVAERTQNTEKTEDLVLGLWIKKVIY 
WPGWAHVCNPITLGGRGGWIT*GQEIE 
TSLPTRR 


7687 


21588 


A 


7747 


78 


346 


AI QMLRNQVKIKLPKS PVEI I PFKI VPQ 
FQVQKIKSQNNFLLLFQESQIFFATSTG | 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutnmic Acid, F=Phenyl»lanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K = Lysine, JL^Leucine, M— Methionine, 
N=Asparagine, P=Proline, 
Q = Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SVRPLSPS*AI ISPVTCTYTSRWPEVTE 
KSQKK . 














KKKARHGI/GAVAHACNPSTIjGGQGGR I T 
*VQEANIVRTCL>Y 


7689 


21590 


A 


7749 


314 


52 


LNNVDLQFFLGPRKKKRNLNDSSLFVSA 
EEVRLMYCFNECLAILHYF* FSNKFI VFV 
* KFGHLLDENMGS KFDN I GMNAMANKDN 
ASK 














VHTHTHTH'i'KTHTQIPSLPLNLEGPVGY 
IoLS CGS LHLKNL W * KCMRKERKK 


7691 


21592 


A 


7751 


3 


238 


FLIQTG FGHVSQAG WELLR * GNLPALAF 
QSAGITGMSYLCVAESLYLLPPFLKILC 
SSLLTLSPYDTWKVAIjLCSYTV 


7692 


21593 


A 


7752 


2 


247 


NSDIFDIYIIQYIFLIYLiIYI IQYIYLI 
*VFLI IFFKFTLiK*EFIFISLiSISFRIjA 
FIVYRDVILLFIjFFFS*NFVGDSIS 


7693 


21594 


A 


7753 


359 


2 


tlkirpi^ylplffffff^kkifffpp 
gggkgv*sgfylmgrgtfslffkrfflf 
yppkegglrgsppppsffffffkkrgfp 

FLARGV* KKGPQGTPLPWLPKSLGLRG 


7694 


21595 


A 


7754 


3 


174 


CSVIQAGGQWCGHSSLQP*IPGINQSSY 
LSFPSSWDYRHATMPS*FLFFIEIKMLQ 












219 


IWPLSHVHSNPLTHSPSQVDQGt.FTT.LI_, 
PCLSGYMYVQFPLAEMPFISSLLLENAC 
LSLCEGSEAQKPLLWEGLSYLPPLNQLP 
LCL* LCVCISGGVCRCLMC I LGVCVCMH 
VCVSVCPCTHTLAQETKGC 


7696 


21597 


A 


7756 


2 


195 


VALMADGAIDTESNDYGAFMPLGIERGL 
DRIWEMPELWLRPNEFDCMTD*PCIQHA 
PSVSCGLA 


7697 


21598 


A 


7757 


4 


280 


DHIVDMITPSFTRRTIAVF*DLNLYIVI 
RGH ITSILKPN KNL KLW Y I YT I E YY S AL 
KGNEILIHATIWINLENMQDEINQTQKD 
IYCTISLI 


7698 


21599 


A 


7758 


352 


107 


FLTRASRGHNGETIVTSWPGAVAHVCNP 
SSLGGRRRWIT*GQEFKTSliANMVITFF 


7699 


21600 


A 
— 


7759 


92 


4 


RPS PRRGGWTA* GQE FKTS LGNMVKP CL 


~7700 


2jgQj 




~7760 


~~92 


— 


RPRRRRGGWTA*GQEFKTSLGNMVKPCL 
Y 


7701 


21602 


A 


7761 


328 


124 


PLFSFLFFPFFFFFFFFFSRRDGVLPCC 
PGWS * TPGLKLSSSLCLPKCWNYKHEPL 
HTAHHNF 












167 


VDS TD KR PG AVAHNYNPS T VG S QGGQ I T 
*GKFNPSI_ANTVKLFKKKKKRKRGALF 


7703 


21604 


A 


7763 


309 


226 


RPRRRLTLSPRL* CSGAISAHCKLRLPG 


7704 


21605 


A 


7764 


349 


7 


QVFCFLFFVCFSS I FMTFNE * QGLKVTS 
GISSNVY*FLLIWILGKITYQDIGRCFS 
EYGS PEQHNLG I VRNASSRLGWAHTCN 

LY 


7705 


21606 


A 


7765 


43 


339 


FFFFLFFFFFFIIFLLLSFFF*DHFFVS 
LLFISNFVFLFPIFIFTVFQAHLFICLL 
FFPICMFLNLFFPNFNYYFYPFPLFFYL 
LFFFFG* FFNHLFYL 


7706 


21607 


A 


7766 


352 


102 


HTSCLIIKSSLSKMNISCVSSSISFFF* 
FILIEIGSR*WQAGLQLLGSTDPPASA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
cth 
otl 


SEQ ID 
NO: in 
VSSN 
09/515,1 
26 


. Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SQSARITGLSHHAQLSSTSLVVENVEW 


7707 


21608 


A 


7767 


346 


90 


TPPSSFFSFPPFPFFSFIPFRFFFFSNR 
DRVLVCYLGWS*TPELK*SSQFGLFKCW 
DFRCGPPCLTSTAFYISITVNFSPSNCY 
S 


7708 


21609 


A 








AASTGVGPSEPSAGYYLI.VCRLLRSLEK 
CSIRVGVT*FSRCHLSPLSLTRKGNSLT 
PCASRVR 


7709 


21610 


A 


7769 


5 


272 


AVYHPINESCVLNIGKDSCLLY*LFEKK 
KKKKKKKKNKKKKKKKNKHYE * FNTFPG 
VARKINLFLCIQKVDAGGGTLWGGGPPI 
FFFFL 


7710 


21611 


A 


7770 


198 


33 


QR P QS RQ CRAW LGM VAHAYN P G T LGG * D 
RRTT*GQEFKTSLGNIVRPYLYEKMF 


771 1 


21612 


A 


7771 


97 


410 


A*WLMPVTPIIWEVKWRLLEPGSLRPS 
WATWRKP ... 


7712 


21613 


A 


7772 


5 


415 


ILCVYLHFVGTHP FVHL* YMH FLPLTLA 
LLI*YVSIPITISSIPPYT 














WRVTSGHSG * AAAYAT * Y I LS AEGS F F 
PLTEELLI+FMKREAFT*KR*VLIIHEP 
SINLD*LYGCPTPYHTFPDPVYLNSKRR 
RRD SHP P KLASRL PHVLHD FFKKKKGGR 


7714 


21615 


A 


7774 


398 


122 


SPPPPFPSSPSFFLPPPSSSFFLFFLFF 
FFFFFFFFFFFFFFFFFFFFCRD*VSLC 
CPA*SQTPGLK* YS YHGLPKFWNYWHEP 
PCLAYFL 


7715 


21616 


A 


7775 


1 


336 


FNFLLI I IIEMESRSVSAHCKLSLPGSS 
NS S AS AS R VAGS RG AHHHT WL F FFFFF F 
F * KKKGAPPGGQGG FLTLKKKNP P PRA P 
QRGGINGGTPRPQNGGGKFFPWGEKKKF 


7716 


21617 


A 


7776 


1456 


1232 


FAIESHCVTQAGVQWCNLASLQPSPTEF 
K*FSCLGLPSSWDYRCVPPHPANFYIFS 
RDRVSPCWPGWSKTPDLK 


7717 


21618 


A 


7777 


400 


2 


NTEFPYGPPISLLGM*P*EMETYIPRKP 
CTQMFLEVLFTIAKK* KKPKRS * LING* 
I LVYPLMEYYSATKRFDVMIHTKNVDAI 
GN 1 1 LSE RRQM * KTH I VYS I HRAS CTAE 
VRI KVS YKRAATW I KS I LI A 


7718 


21619 


A 


7778 


360 


218 


PYLANF*NLL*RWHLNMLPRPDSNSWPQ 
VILSLWPPRVLG*QRGGRVEQRGARGNW 
MSKCPEGGDCLCWVAGA 


7719 


21620 


A 


7779 


1 


372 


FEVRS CS VAHAG VQ WHDHRGFE FLGS S D 
DPSVSAS CI SRITGASHCTQG* L * LLTK 
VL*VSAFCLSVNTLRLTSLLKSDTFLGS 
RSKLNS LGESLQTY I S PSRHFIQLLCFR 
S LFPR FLVKHKK 


7720 


21621 


A 


7780. 


372 


219 


ATSLLSYFKKLPQPPQSSATTTIISQ*S 
STSRQDLPPAKRLELTEGSDDP 


7721 


21622 


A 


7781 


358 


145 


IKNIHDSWEDVKISTLTRV*KELIPVXiT 
DNSEGLKTSM*EVTAEWEIARELELEV 
EPKDVLVQFHDTINR 


7722 


21623 


A 


7782 


317 


146 


GRVDCKCPFYRFQMLLVRLGMVAHVCNP 
S TLGGRGGW I P*GQEFKASVSRMPRPHL 
Y 


7723 


21624 


A 


7783 


13 


186 


DRVS ITQTS WAHPPTSAS *VAWTTGMHH 
HT W LNF V F FVEMG FHHVGQ VG IRLPLLS 

WK - ... 1 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalnnine. 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7724 


21625 


A 


7784 


339 


221 


KWDIiAMLARLVSNSWLQGSSYLSL*NCW 
DYGSEPLCPAL 


7725 


21626 


A 


7785 


355 


3 


FGNSGR VDL F F F F F FLDRVS L> C CPG CS A " 

VTNLEVHHSALQPQTPGLKPSSHIiSLLS 

SWDYRRVPPHPS*GLF**GR*YSITQLR 

FLNLHLESGLRSLLPKPVASTVRIRTQS 

GLGLK 


7726 


21627 


A 


7786 


1 


201 


QQFSHVFRLLFQFSFSCQEPPSSHIQPG 
VCTRNTKVSQAW * CVPVI PANREAETGE 
SLEPGRRSTRP 


7727 


21628 


A 


7787 


270 


1 


RP R RHL VPCVLMLG PKCPGLPHT LTT L P 
YTLTTPSPTRSLGPPECK*SFHLSFPSS 
WDYRRMPQRLANFCRGRILLCYPDWSSS 
PGLKQS 


7728 


21629 


A 


7788 


229 


2 


DG WVR I NW AQ PGT VAHA YN I S NXiGGRG R 
RIT*GQEFKTSLTNMEKPCLYSDHEVRS 
LRPARPTWRNP I STKNTKM 


7729 


21630 


A 


7789 


118 


263 


CSKSYYWPDAWHTCNPSTLGGQDGWIA 
*AQEFKTSLGNIARPPISVF 


7730 


21631 


A 


7790 


246 


359 


TFIFSETRSHSVTQAGV*WHDHGSLQP* 
PPWAQVILSC 


7731 


21632 


A 


7791 


70 


102 


AA*SRLTATSLSRVQAIPLPQPPEQLGG 
QGGRIS 


7732 ' 


21633 


A 


7792 


7 


312 


FLDFQLRLHSNSYCEDKGVFDSEIYEIC 
IVXiICKK*RKFLNQKKKKKKKEKPGLI * 
KKKKNSQKKWFDFPWLPPNPETKNSLFL 
PKEFLiWLiKTKPPFPLITjT 


7733 


21634 


A 


7793 


1 


341 


I KPE * YLLFAYTI LRS VPNKLGGVIALL 
LSILILAIIPILHISKRRSIIFRPLSRS 
LY*LLAADLLILT* IGGQPARYPFTIIG 
QVASVLYFTTILILIPTISLSENKVLLW 
A 


7734 


21635 


A 


7794 


1 


143 


YGSTFFVATGFHGLHVIIGSTFLTICFI 
RQLI FHFTSKHHFGFEAAA*YWHFVDW 
* LFLYVS I Y* *GSTFFVATGFHGLHVI I 
GSTFLT I CFIRQLI FHFTS KHHFGFEAA 
A 


7735 


21636 


A 


7795 


2 


257 


KWAI IEEFTKNNSLI IPTIIATITLLNL 
YFYLRLI YSTSITLLPISNNVKIK*QFE 
HTKPTPFLPTL IALTTLLL P I S P FI L I I 
L 


7736 


21637 


A 


7796 


22 


326 


RDASDCSFQNLPVPIjWVEK*MVFLLTKK 
KKKKKKKKKKKKKKKKKGGG P FKKNSWG 
AKI I PGKKKI I FFFLEGQKKKLWGFFKK 
KPLFWGGXKRPNPPKKN 


7737 


21638 


A 


7797 


506 


281 


RGAN*NRSGCGKRHEERERERERERERE 
RERERERERERESPRPPKRQRERErEIQ 
TLSLTVS IiAPPPTCVF 


7738 


21639 


A 


7798 


435 


336 


MRSNI HF FSHTHTHTHTHTKT F * ILKQH 
TFSK 


7739 


21640 


A 


7799 


411 


106 


RNPPPLFFSPPLQRGGFPPIYWGPPRFF 
PPPPL*KTPPKLKIGAPPKKKKPPPPPG 
EKMVSF*PPPPFFFFFFFLRRPFPLFSP 
GGGPWGHFRGPPPPPPGV 


7740 


21641 


A 


7800 


2 


296 


FVPSTANWCFFLWSVF*TGGSNYFFVHR 
SYSQAGVDLIFIRIiANTISNLSFI* *RG 
GLAMLPRLVLETRAQTILHSWPPKVLGL 
QVWATVPS FQFLKN 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 

last S amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalnninei 
G=Glycine, H— Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


7741 


21642 


A 


7801 


344 


105 


PGFFFFPPPGKRGFFPKPFFLGTPGFFP 
PPLF*NPAP*FFFWAPKKKIFSPPPPGK 
KIFLLKGPPLFFFFFFFFFFFFF 


7742 


21643 


A 


7802 


68 


216 


VHT I AKI WKQPQC PSMDD * I KNMR YL.YT 
VE*YSVLKEREILLYWGHYAM 


7743 


21644 


A 


7803 


407 


3 


GFIH*FPLFSGYTLDQTYAKIHFTIIFI 
G VNIAFFPQH FLGLS GMPRR YS DYPDAY 
PT*NILSSVGSFISLTAVILIIFMI*EA 
FASKRKVLIVEEPSINLEWLYGCPPPYH 
TFEEAVYI KSRQKRKESNPPMY 


7744 


21645 


A 


7804 


2 


156 


THRRI IIL.SQGL*TLLPLIAF* +LLASL 
ANKKKKKKKRGGPFKKFPWGAKI 


7745 


21646 


A 


7805 


2 


309 


THSRI IILSQGLQTLLPLIAF* *LLASL 
ANKKKKKKKGGAPLKKI PGGAKI KRGME 
IKNFSQKGGEKKTQRGIFGKKPYLGGGQ 
NWEKLPKKFKGLKGKKNF 


7746 


21647 


A 


7806 


110 


362 


TLSDLERLLMKAVSHFLMI DLLEKC * CV 
LKNQS S KKKKKKKKKKKKKKKKKKKKKK 
KKKKKGGPLiIjKNSWGGPNFPGGEKIFFF 
FFRGGF 


7747 


21648 


A 


7807 


295 


108 


KQRGFVFFFFEMESRSWQARWQWCNL 
SSL* PPPLGFKQFS CLRIVYRKDSLSFN 
KPRLIiP 














GSTFFVATGFHGLHVIIGSTFLTICFIR 
QLIFHFTS KHHFGFEAAA* YWHFVDW* 
LFLYVS I Y* *GSTFFVATGFHGLHVI IG 
STFLT ICFIRQLI FHFTS KHH FG FEAAA 


7749 


21650 


A 


7809 


390 


2 


PASLLHCG * ISDCCASNE*GSVGLGPSE 
PGAGYNLLVCHLLRPIjEKCS IRVGVTRF 
SRCCLSPLPLDRKGNSLTPCTSQVRQCL 
ALLQLTLGALHPVS CTHCP T I SGEMNP V 
S QLEMQKS P I FCVTHAG 


7750 


21651 


A 


7810 


_ 


157 


FLHFGQAALELLTSGDPPASAS *SAGIT 
G VSHRAQLVCTF I T I YVFLKNSS Y 


7751 


21652 


A 


7811 


368 


44 


QNFPPPKKRLPPQPPLFFFLVPS FKGEG 
FFFFLRGFFFFSSAQRALL* FIYFIFFF 
FFFSFFFFFLYFS I S PDSKGDTHHDLLL 
GVSWWTRSLPLWIARHNMHKWGW 


7752 


21653 


A 


7812 


400 


80 


PQNSFSPPGIGGFFPPFPL*NFFFSPKA 
FFFLGGFSPFFPPPKKSFFFKI PPGFFF 
SPPLKKKIFFFPPPFFLAPPRFFFKAPP 
PFFFFFFFFFFFFFFFKYFRHI 


7753 


21654 


A 


7813 


119 


230 


SEEFETSLiRCIVIPSIj*KKKKKKKKKKK 
KfCKKKKKKKKKKK 


7754 


21655 


A 


7814 


3 


412 


HE QEHL * LLLP * * PLAI I * V I S TLAETN 
RTPFDLAEGETELDSGFNIEYAAGPFAL 
LFIGEYTDI I IRNTLTTTI FLGTTYDAL 
SPELYTTYFVTKTLLLTSLFL*MRTGYP 
RFRYDQLIHLL*KNFLPLSIjALLI 


7755 


21656 


A 


7815 


1 


183 


LRLERLSEFSTRRERERERERERERERD 
TRIDIYIVSQKRKKYIV*IRRNIFVYHA 
EFSKR 


7756 


21657 


A 


7816 


484 


97 


QPRTPDLK* FACLGLPKCWDYRHKPPCL 
ASDGNHS * S S PGLLViSS AS VS P SGHMS 
PSQQTSPWVSEESLLLGRVPGFLYFPPV 
DAPGPGAWLAHWQAVLKLLGSSDPPVL 
PSQSAGFAGLSNCAWPW 


7757 


21658 


A 


7817 


2 


219 


ADRLRNS * E A* ERERERERERERERE PY | 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possiblc 
nucleotide insertion 














PPSHIYTDRNRH*MRHEERTL,FLWGSGE 
VRDAAP WVSH I PNTAL 


7758 


21659 


A 


7818 


190 


292 


NSHLIFPWTKVFFMS*CTYNL)VLPGSEK 
KFYSHA 


7759 


21660 


A 


7819 


498 


139 


RRAATPAPAAAEPPPPLQRP * PPSEPPD 
TQRDARGRRRGEDPGNS P FH PNPDRQ P S 
RALCSTPRMHRLWIGPAFFiiMTSLSVSG 
AVI PRNGGPGGVSSGPCLLQLLCGQAGS 
STIRKIPS 


7760 


21661 


A 


7820 


489 


330 


ELGFLHVPQAGIiELLS* SNPPASASLPT 
SWDYRHDHRTGHHPGIYDSKMCIF 


7761 


21662 


A 


7821 


490 


260 


FFIiXXNXFXFFFFXXFXXPXFXXXFXFF 
FFXXXFFFXFFFFFFXFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFLY 


7762 


21663 




7822 


493 


482 


VGQAGLKRLTSGDL.PASASQSAVITGVS 
HRARP IMYFRYVQ * AKGSHV* RWY 


7763 


21664 


A 


7823 


456 


121 


ASFFPIQYKGLGGATPPQEGGCG*GIPI 
KWGYKRRPGGPHGGSKRPPTQ*KKSYFI 
NVLVLFYLRDKGLAI PTL.VSNSWAQVIL 
PPWPPKVLGLQTRVTVPGQNILKEKMFF 


7764 


21665 


A 


7824 


294 


457 


LCIGFVILISYFNIMENWYCRPGTVAHT 
CNPSTLGG*GRQITRSRDRDHPGQHG 


7765 


21666 


A 


7825 


26 


235 


S VW WNS QTGNKVKFTI S W * I D KQMRC I H 
LMEYYSAI KRNEVLINATTT W INLNT I K 
LiSKRSQTLKSNTLi 


7766 


21667 


A 


7826 


30 


303 


SYVSWEFPGSSGPTLFSPWHQTPRTYL 
NSVASAINI/TQCPH*PEKKKKKKKKKKK 
KKKKKKKKKKKKKKGGAFKKKGP* KTPK 
KTPGGF 


7767 


21668 


A 


7827 


464 


21 


REPPHPAPADILIIjHFWPIjE1>*ELISW 
LNTRCWICDDTPRKLTCQPMPRIiLLQT 
EPL*VLFSNLGSFGSIjSKS *MGLVNFFC 
KGTDSRSFSLFVAVFCLFL*RWGLAMIiH 
RLVLNCWPQVILPPPPPKLLGLQVEATT 
HEFHGTE 


7768 


21669 


A 


7828 


372 


205 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFIKKFFI 


7769 


21670 


A 


7829 


463 


313 


DIYPERRNS KIMS FAATWMKLEAI ILK* 
LKNRKPNIVCTHL 


7770 


21671 


A 


7830 


449 


266 


WQMRFLHVGQAGL * LLPSGDPRASQVQ 
SAWI TGVSHHAQRS I FFKKRNNDCDRPD 
NTECT 


7771 


21672 


A 


7831 


480 


148 


IELRLSHGAAEFHRPREWYGIFKVLKER 
DFYPRIIYPAKIILRHEGEMKTFPDKQT 
LRHFINTRPVLQEMLKKVHQSEIKGC* * 
ATNNHLKVKKKKN S LG W 


7772 


21673 


A 


7832 


23 


131 


QRGNSKGYHLKMIQQEEIRKX.EEEKNQL 
EGEIIHFYKMKAASEALQTQLSTDTKKD 
KHGKKQ*FL 


7773 


21674 


A 


7833 


16 


339 


NTDTLGSLMAFCRDGLAMI.PRL.FLNTGL 
KRSSCPDL PNC WD YRHGPPYLASFVLLK 
* TLS I LLLPYYHKMHTCVFMVLCTNFCG 
GVCPGMELLCLKVYGYLSLKHTFH 


7774 


21675 


A 


7834 


355 


70 


KKTPRGFSGVFLGPFYKKGPLFFFFFFF 
FFFFFFFFFFFFFFFLFLQFFILCQQFL 
S*STEKTVHSKYILSFPFHCLIFCPFIL 
LCSTFCNGAHY 


7775 


21676 


A 


7835 


494 


282 


PENGMIQGGGACSEPR**HCTPAWAKER 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ JD 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Gtycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Try pto p h a n , Y=Tyros i n e, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DSVSNAKTIQNKKNLGN*HS I STAKAKN 
HIS* YKSELRKKM 


7776 


21677 


A 


7836 


305 


185 


LVIETCLTVRPIiSPS*AIISPVTCTYTF 
RWPEVTEESQKK 


7777 


21678 


A 


7837 


179 


191 


LGCRKYIiFYLNGGliKCLYI KCVCVCVCI 
YIGVYIGMCVYICIYIYLMAS*NQVTSF 
LSFYVFLLNYR * I PVKVITVL FNLRMTG 
LP 


7778 


21679 


A 


7838 


466 


343 


LGRLRQENHLPPEGRGCSEPRSRPCTPA 
WETEEDAVSKINK* INK 


7779 


21680 


A 


7839 


451 


89 


LRDTR * SG VI CLPKQAWAMVGA PPPAS L 
PPCSLISDCCASNQRDSVGVGPSQPGGG 
YNLLVRRFLSLSEKRS I RVGVNRFS R CH 
LSPLSLTRKGNSLTPCASRVRQCLALLR 
LAHGARTH 


7780 


21681 


A 


7840 


442 


164 


AHTNQFSQCIKKSVTPDLMEEMYKKAHA 
AI*ENPVYEKRPKKEVKKNRWNCPKMSL 
AQ K KHQ VA* KKAS S LRAQE QAAE S 


7781 


21682 


A 


7841 


494 


359 


ICMLPRLVLNS*PQVILLPWPPKVLGLQ 
A 


7782 


21683 


A 


7842 


577 


224 


IFFFFNKIFLFRPGLNLMGDIWVPSTPP 
LQG*TNFPPQPSQEVGLQKAPPPPGLIL 
VFLSRQGFPQLGRVGFHFPPPGDPPPRP 
FKKI WSHPGGGKS P PKKKKKKE I AEVLG 
AFRLR 


7783 


21684 


A 


7843 


404 


56 


HTLSTNVCSSYLNLDFFSLRGRGLIMLP 
RLALNSWAQEILLPQPPE*LQLIjRRLRQ 
ENLLS PGVQGQPGQHNETPTS 


7784 


21685 


A 


7844 


1 


323 


INTLLALLLI I ITF*LFQGIGYINNSTP 
YEGG FDPI STAR VP FCIKFFLEAITFLL 
FDLE I ALLLPLP *ALQTTNLPLI AMASL 
LLI I ILALSLAYE*LQKGLD*AE 


7785 


21686 


A 


7845 


419 




FLFFFFXFXXYVLSCGIY*YWCNLFSS* 
QYFFFYVMAFRI FLFTFLFLHFYYNMF* 
CTY FYLS CL * FE Y I CSH IIHYFI YFLHL 
ENF 


7786 


21687 


A 


7846 


475 


124 


FFFFFQPPQKEGGPPPPPKNFFFFFTRI 
FFFGVFFFFFPPQKNFFFF*KTKGVFFF 
PFF* * KIFFFFPGVFFFFPVFFFLSPPP 
SSSFFFFFFFFFFFFFFFFFFFFFFFFF 
FFLKRGQAS 


7787 


21688 


A 


7847 


6 


189 


LIDTVSPSVAIAGVQWCNHSSPQP*PPS 
IiKRSSRLAIjPRFWDYRCAPLCKAHSFNS 
NHEK 


7788 


21689 


A 


7848 


20 


249 


KINFIWELTCSNTVHTFYVYGFDKCIL 
PTTQLFFFL*ETGFHSVARLECSGVIMA 
HCSLDILGSSKRPANFLNFL 


7789 


21690 


A 


7849 


101 


398 


LFQKKKKKKKKKKKKKGGPLKKKPGGGK 
NKGGEKKKIFFLKGGEKKNPRGNFKKKT 
FFGGGKKGEKPPKKKKSPEEKKKF*RGK 
GGKKSLICWVEKNLG 


7790 


21691 


A 


7850 


87 


226 


GSLS P I MLVL.P FL I HLL* KNFLPLTLAL 
LI*YVSIPITISSIPPQT 


7791 


21692 


A 


7851 


2 


440 


GAIIRRLLH*FPLFSGYTLHQTYAKRHF 
T I I F I G VNLT FL PQHFLGL S GMPRR YS D 
YPDAYTT*NILSSVGSFISLTAAVLTTL 
MS * EAFASKRKVLLVEEPS I DLERLYGC 
P PPYHTFEEPVYI I SRPRRKES WPAKLV 



1028 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


1 SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
locution 
correspon 
ding to 
last Amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidinc, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamirte, R=Arginine, S=Scrine, 
T=Threonine, V=Valine, 
W=Tryptop)ian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=posstble 
nucleotide insertion 














SSQPHG 


7792 


21693 


A 


7852 


1 


80 


S S LLL 1 1 1 LALSLAYE * LQKGLD * AE 


7793 


21694 


A 


7853 


5 


135 


ATFLYFS*K*SLSMLPRLVLNSWTQAIL 
PPWTLKVGLCFFDPL 


7794 


21695 


A 


7854 


38 


419 


FVMMPLHSSLGEGVRPYLKKKKKKPPWE 
GGSPPKNFKNQIFA*NQKPPFFFFIPPN 
KKWGS P PGGGAS PL * S HPSGG PGG PNNK 
VKISKPPCPPGGTPVFTKTPTFT*PWGP 
I PEKEGRKTVCPRTH 


7795 


21696 


A 


7855 


398 


61 


FFSPRPRGGGFPPPPPKNFFFSPPTIFF 
GGGVWNFFPPPKMVFFLKISPWFFFPL 
IRKKIFNLS PGSFWSPQGFFF*GPPPKF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFF 


7796 


21697 


A 


7856 


1 


121 


LDASGRLPPSPKPIKIKNYNRP*EPK*T 
KICSLHSLPPQS 


7797 


21698 


A 


7857 


1 


332 


TPPIARPAENSARGAEGRRR* I XALQVC 
NKKWP PTKLHC I KKKKKKKKKKKKKKKK 
KKKKKKKKKKKLWGGGGGKNFFKKGGKK 
KLGGG FKKRGRKKKK 


7798 


21699 


A 


7858 


3 


191 


SPRHLTRAAG I RHEGQTGLKLLTSGNLP 
AS AS * S AG I TGMSHR I GPRMLDLLTLVS 
WKQHGL 


7799 


21700 


A 


7859 


503 


62 


FFFFLNCCLSGLRHLIKLFEPQFLHL*N 
GGMVLS 1 1 LS I KS PLRS PSLS YS WL * * P 
REKIKWWGGQVRWFMPINPRTLGGGGGQ 
IA*SQEFKSSLDNMVRPYLSEPP 


7800 


21701 


A 


7860 


355 


86 


YIKG*QMVGNRGASQAGMTGYGMPRLIL 


7801 


21702 


A 


7861 


336 


225 


PQLAGPVKTVRKVYKKEKARVI PEEKKN 
FKAFAS FRKGRANGRVFGIRAKKAKEAA 
KQDV* KKPPI PLAGGS PTRE P PSQAHLF 
PQNPPRGNPP 


7802 


21703 


A 


7862 


465 


170 


GGPPPKTHPFGGERGGKYHRGKKKPPGH 
QRVNPPS PKKKKKNWGRGKNP FTPLIGG 
GQAKKP PQPGRGECP* KKVPPNWSVLGK 
KKNFFPKKKKKKKK 


7803 


21704 


A 


7863 


495 


275 


RSFFFFFFFFFFFFFFFFLVARK>YYFL 
ATKKKKKKKKKKKKKKKK 


7804 


21705 


A 


7864 


493 


70 


PLTASISLPVNV)RYIAPHRVVKIK*KSV 
Y*KYLA* CLAQSKCPKSVCSHLSSP*LH 
FVPPPNPAEAMWTARQC* CIRGLGGKGT 
LCVAHTYNPSCSGG*GRRIV*AQEFKAS 
LGNSE PL I LKKERKKESQAWWL I PI I PA 
F 


7805 


21706 


A 


7865 


67 


385 


VARTTGMQHHAWLIFFIFIIFVETGSHY 
VAQTSLEFLASSDPFTSVPQSTWITGVS 
YSWS*PGIAHTCNPSALGDRGERIT*GQ 
EFKRSLGNIVRPCLYKNNKNKKN 


7806 


21707 


A 


7866 


454 


0 


LSFFFFPLGDRVTLCCPG*SAVARSQLT 
VALNSWAQAILLPQLHKHVPPCLANF*N 
FF*RQRYTMLHRLVLNSWAQVILQPWPT 
CPTAAQA 


7807 


21708 


A 


7867 


1619 


698 


P AT S SS S S S S SS S S AAAA VAAAATAAS L 
PPCRPPLRPPQPLAPHEVATLAPWRG*K 
PS FTSSGKRWLTSEPQTSGSCASCL* SM 
RASSPSSG*SKKKPPLPAEAAASVAACA 
VCWRVRAPPYVAATTAYTMAVMGWMAFP 
WE V I WTRWRMMSQAI RPLQTWTNS VTAP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K = L»ysine, L» = Leucine, M = Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














S 


7808 


21709 


A 


7868 


52 


187 


LDFN*RRCLTKLPRLVLNSWAQVILPPW 
PPKLLGLQV 


7809 


21710 


A 


7869 


33 


373 


AGVQWRDLGLLQPPPPGFKRFSCLSLPS 
G WDYRHAP PHLANFWLVEMG FHHVGQA 
GLKLPT*GDPPTLAYQSAGITGMSHRAR 
PGISVFHLKLHSFDPFKIIFRPGAETMP 

s 


7810 


21711 


A 


7870 


100 


119 


VFVEIVFPEGLLFSPCSFHLCGFRIHLK 
AEF* FFVETGSMLPRLVSNS WAQVILPP 
WPPWL*LQA 














LLMLPRLVLNSWAQVTLPPQPPKVLGL* 
AYI 


7812 


21713 


A 


7872 


449 


33 


FSFFFFFFLGKKDLFFSPRGGGGGANSA 
FFPPPNFGKKGSAPPPPPLKSV*NPAAP 
TGVFFFFCGGGA 














PCPSLFFFFFLRQSLTLWPRLECSGVIS 
AHCKLCL PSSWD Y K * HCP Y LATGAPGLL 
SQSELSYVTL*LQESRPLCIPVDFMTFL 
PSLCFQSPNSTTL 


7814 


21715 


A 


7874 


342 


716 


STGIFLFVCF+DGVSTLAQAGVQWPDLG 
SLQPSPPRLK*FSCLSLASSWDCRHVPP 
LLANYLYF* *REGFTVLAMMVSFS * PCD 
PPTSASQSAGITGVSHCAHPGMFFIjNWP 
LKSCPSESGAAV 


7815 


21716 


A " 


7875 




215 


P PPARCH * PP * * TLVHAVE FKSHHYSL I 
SSTQGHKHCGRPQGPLPRKTRDLCSLVY 
LLTFPSLLSYDPAKSLSARNTQE 


7816 


21717 


A 


7876 


492 


176 


PCGGRGAGGNWGWPPWHLRASASSRWAW 
ALQPHTRSSQPAGPTGPRQ*GAQHLGQQ 
LLCS I S HQ ALAAP P QGRAQD LQ PAM PE P 
SPRPPLPWAAQ 


7817 


21718 


A 


7877 


281 


439 


TWSIDL*HMAYFLFFSFLFFAFLSFSLL 
S FSLLFFAFLFL* VPS FSLFQRQS L 


7818 


21719 


A 


7878 


507 


338 


LLRRLK*EGHLSPGGRGCSKPRLHHCIP 
HWATE*EPFSKTINK*N*S*MKMKMKIN 


7819 


21720 


A 


7879 


55 


298 


PPCLANFLKFFL*RQSLPMLSRLFSNSW 
PQAVPLPRPPKRNQLYLYFGCCSQLCNP 
QHLCS KQHT FKILFDI EKHFRLREH 


7820 


21721 


A 


7880 


145 


292 


YFGGKQIYFIFCRDRISLC*PVLLTSLV ~ 
SNSWPEAILPSQPPKVLGLQA 


7821 


21722 


A 


78S1 


3 


203 


LSLJjPR1jECSGTFTAH*SFKIjLGSRDSP 
TSAS * IAKTTGMRHHAQLRFFFFVPGGV 
CVCVCGCVYV 


7822 


21723 


A 


7882 


324 


211 


AWWLMPVIPTL*EAKMGGSLEARSLRPV 
WAHNETPFLF 














QHFGRLRQVDHLRLGVHYQRGQHGETPS 
LLK* INVYI YMT 


7824 


21725 


A 


7884 


367 


141 


KCWDYRREPPRLAQGLEFLFNFCFEIVT 
EPPCPALAYF*NFL*RWSLATLPRLVLN 
SWPQAILPRWPPKALGLQE 


7825 


21726 


A 


7885 


471 


215 


RGMCVSH I FLCVGAR YNI SSARESLLKE 
FTECTERKHTHTRAHTHARARTHTHTHT 
HTHTHTRV*NSRSQPYCRVHACSPAYLG 
G 


7826 


21727 


A 


7886 


462 


204 


RHSS*LGLPKCWDYRHEPPHLACISSF* 
RKSDISLSDELGTKCHKWWPQWKSRAG 
S VGRD PCSGHFILAG FGQLGA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


IVI 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, II=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 


7827 


21728 


A 


7887 


409 


152 


PVFLAXjERPNQQDFLRPGVLDQP*QNNK 
IS FFFFFFKKKKKYI YICMVVHAYNPS Y 
SGG*GGRIT*AQKFKTSLNNTVRPHIjYK 
K 


7828 


21729 


A 


7888 


1 


152 


GVHVCCDCKC I C VCLC WGVHECLC VC V 
SK*VHMCHREGTIFGNISWEGD 


7829 


21730 


A 


7889 


301 


399 


EI INI VFFFYFFIiPLLFFFFWGGRFKKQ 
LNPEFLTNSWKKNHKTIGGKNPMGG*KK 
AITNPQKKGRKRSFSF*NKAQPTQGKNG 
QNSQGSKAVGALTAH 


7830 


21731 


A 


7890 


478 


142 


KFSKKKKRGVLNFYKKKAFFSKPPKPVF 
KKKKLLKNPFFKAKKQFFFKTPTRPLFR 
EKFFPGPPKKKKKRPPGLFKNN*KGPPW 
PREKKRVKKKKKKVKSHYYNNHMPISVH 


7831 


21732 


A 


7891 


497 


320 


PPWKFEFEFWFSP*LHKYYPPYVCVCVC 
VCVCVCVFDFRFFKPAPSPQSHFGINLS 
FLL 


7832 


21733 


A 


7892 


217 


23 


IPPFKKFF* EMGSHAVT* AR WQWHNHA 
SLQPETCGLKGFSHLTLPNSWDCRSVPS 
SSARRQSH 


7833 


21734 


A 


7893 


652 


470 


SSSSSSSSSSSSSSSSSNSSSSSQFERK 
TRSAPQVEGKGPNFCSLEAPPPGIiSLFS 
CLNF*RS WDPGPPP * PHATPKTRAGGDL 
LCATHHRARHCTPGLRFPLFLAL 


7834 


21735 


A 


7894 


25 


178 


GIL.FFWRDGG1.VILHRLVSNFPVSSDSP 
I SASQS WDYRCDPPCPACPLI FW* LLKH 
S F * FF WRDGGL VI LHRLVSNFP VS SDS P 
ISASQSWDYRCDPPCPACPLIFW 


7835 


21736 


A 


7895 


104 


451 


IhS FMTT WMEAEDI MLSEI s qtqtdr yc 
MILLVESKNVNLRPGMMAHAYNPSTLGG 
* GER I S * GQEFETS LVNI ARHYFYKRN F 
KIKSKFKKIVRKSQKKNVRRWLLEARNK 
GRYG 


7836 


21737 


A 


7896 


510 


335 


SCIVSLHRSWDLWHVPPFLANF* IFRKG 
GVLPCCPGWQIYTFQIjQIPFLSITPVAF 
SG 


7837 


21738 


A 


7897 


37 


260 


AGNSQNQGGGACSEPRWRHCTPA*ATER 
ESVSKNXQTKFLCSVPYFESTHFHCTNE 
ATGAWGVE KVALGHAAS W 


7838 


21739 


A 


7898 


35 


466 


THTCGGIRKKLTNKRENQK*SQQKPSAP ' 

KEPEVKNKQKKTKNKPKKKKKTPQIQPT 

KECPFRNPNLRVFTFFVTTQNKTPTRGRK 

TFQKVKLFPPNPCLGNKPQTLLPRGGLQ 

KKKKKIPFFYSHPPPLAQKKKATPWQNG 

LGFT 


7839 


21740 


A 


7S99 


494 


149 


PARLLSLHSYEVKSVLPKEEGLLNFFVQ 
SVTAHPTSCIGI.EE I *LLD 


7840 


21741 


A 


7900 


494 


114 


GAEGGPLSKKKKKNLLPDYHTGPMSLSL 
YLWPFLSNGHTLTLMCAHTHTHTHTHTN 
H*LSRVGLiPVLKPGQYQANGNRWSP*YN 
I KE * PRKTRVMPVTPQLRNDGPMLTFWC 
SSRNSFAMHVLGKL 


7841 


21742 


A 


7901 


19 


454 


VWCNSSVS IHTHTHTHTHTHTHTHTRSW 
VGWHMPIVLSTQEAEAGRLLGSGSLRIQ 
RAMIVPL.HSSLGNAARPCV*HTGSCCVS 
QAGVQWHNHSSLYPQTPGPKQSSSLSLL 
S R * D Y RHVP PHP AS CVC VRVCVCVCVCV 
CVYGDRG ITPH 



1031 



ISDOCID: <WO 01 64835A2_L> 



WO 01/64835 



PCT/USOi/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7842 


21743 


A 


7902 


3 


393 


DAWDPLNPLVNLFVSPKRNSSLDTRKKP 
CRESKKFNTHSRPKSSHQLRKRSSSTPT 
T*KIPNI*LNSSTPSRAKKKRKKKPKKK 
KK* KNS QKKKKPMRKTKKKKGGGLLKK I 
LGGAKPFGGRKEKIFFF 


7843 


21744 


A 


7903 


424 


139 


LFFFFFFFF*G-»GFVIFCYVGQVGL*LL 
TSGDLHISASQSAEIRGVSHHAPPLSLI 
WFISQCVGILNHTVHLQYIQCSFVHHF 
LINQEKIDIIK 


7844 


21745 


A 


7904 


1 


181 


SQ*LY*LLAANLFILT* IGGQPGSYPFT 
IIGQGASGLYLTTILILIPTICLIESKI 
LKWA 


7845 


21746 


A 


7905 


3 


281 


HAYAHAS ATFFVS TCFHGLHD IMGSAFL 
TMCFIRQLIFHFrSKHHFGFQAAA*YWH 
FVDW* LFLYVS I Y* *GAVLKEPWGGQS 


7846 


21747 


A 


7906 


1 


84 


PTRPLTMLPRLVSTPGLKQSSQLSLPKC 
*DYRCEPACTAYSF*LTMLPRLVSTPGL 
KQSSQLSLPKC 


7847 


21748 


A 


7907 


3 


392 


DPDNYTLANPLI TLPHI KPE * YFLFAYT ' 
I LRS VPNKLGGRLALLLS I LI LAI I P I L 
HISKQQSIIFRPLSQSLY*LLAADLLFL 
T* I GGG PRS Y P FI Li I GQVAFVLYFTTI L 
ILIPTISLTGDRILKWA 


7848 


21749 


A 


7908 


2 . 


231 


GCVEKGTLSHC*WECKLVQPLWKTVWGF 
LKELKIDLiPFDPAIPLLGI YPE ENKFTL, 
GPLHSVGRSSRKLPGYSPRV 


7849 


21750 


A 


7909 


529 


297 


KIFIGAPVFCPPPHFFFLIPPF*GVEKI 
FFSFFVLLPFGFFFLiFLGLFFFFFKIiFF 
FFFFFFFKI PFFFSKKVG VGK 


7850 


21751 


A 


7910 


410 


61 


FNNQAMQWFPPFPLKIFFFP*RV*FLE 
GGWSNLSPPPK*GPSPKFPTRVL*GPP* 
GKN*NFGFPG*NWAPHRFF*RAAPFFFF 
C*DR VLLPRLVSNS *AQVIHPPWPPKVL 
GLQA 


7851 


21752 


A 


7911 


512 


314 


ANFC * LF IETGFPH IGQAGLEL.L.TSNNP 
PALASQIAGTTGVNLRPGQDFSSYHSEN 
PLVYGSLYIA 


7852 


21753 


A 


7912 


1 


264 


SSTTTLLLSRT*NKLT*LTPLIPSTLLS 
LRGLPPLTGFLPKWAKKKKKKKKKKKKK 
KRGGGGWKKKPKKGGGGETSSKKDTFFK 
KGGP 


7853 


21754 


A 


7913 


416 


129 


FFFFFFFAPPPKIKTPPFFFKREVYIY* 
ALSVFQSVCKALYIHFRIISSQYPC*LG 
FFFFFFFFFFFFFL.KGSLDLLSAYCVPG 
SVQGALHTLSH 


7854 


21755 


A 


7914 


397 




IFSNLFFPPQKKMFFPPPPLKYFFFSFT 
ALFFFLVFFPFFPPPKKIFFFNNPPKFF 
FYPS * KKKFFFFI P VYFLAP P EFFF* AP 
PPFFFFFFFFFFFFFFFFFFFFFFFFFF 
FSVSKQPLFCLKI 


7855 


21756 


A 


7915 


396 


86 


IFLSPRK*GFFSPPPP*KFFFSPKPLIF 
FGGFFPKFPPPKKNFFFQNPPGVFFYPP 
FKKKKFFFPPPL,NFGPPRVFF*RPPPFF 
FFFFFFFFFFFFFFYLWVC 


7856 


21757 


A 


7916 


351 


112 


SFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFPI*KYVFIYISLSLIPFCKFLK 
GRECDIHFCDIQHSYSKYLGHFKCSTND 
RQMNK 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7857 


21758 


A 


7917 


445 - 


2. 


FFFFFFFFFFFFFFLSYF*LIHYAEGIG 
ASPCYIMLGYNFSSFPCGTISIAPGFNF 
YRLYFISHASADAWVDPRFLDRYRRG IN 


7858 


21759 


A 


791 S 


2 


407 


PPPPQVYFNIWRTIFFGEGLLHIFPPQK 
RFC F YKTYTNY I KTLFIKKKN I FS LAH I 
KMVPPRSIY*TPPP1iFFFFFFFFLjFFFF 
FFFFFFFFFFFFFFFFFL 


7859 


21760 


A 


7919 


107 


11 


KMWY I YTLECYS ALKKKKE I LLFDTTL I 
NLKDIMLSKISQHRKHTHT*FKT 


7860 


21761 


A 


7920 


1 


70 


FFFF*AAESCSRIKCSGTIIAQCNPELP 
GLSDPPTSAT*VAGTKGVCHHTQLRFI F 
IV*LSGRILLQDQVQWHNHSSV 


7861 


21762 


A 


7921 


123 


260 


G I QARKE PG AVAHAY KLS TLGG * G G W I T 
*GQELETSLVNMVKPCL 


7862 


21763 


A 


7922 


184 


399 


LHLLMRVLKKKKKMGG PLYR S PRGAKVN 
PALQRLDSLLIGSRILSNLGIFGKAPHS 
WGG*AFFAATGLHGLHVIIGSTFLTIGF 
IRQLIFHFTSKHHFGFKADD* YWHFVDV 
A*LFLYDSIY**GSRILSNLGIFGKAPH 
SWGGTKWDNPPPDMKSS 


7863 


21764 


A 


7923 


2 


407 


GRVGMI * EAFAS KRKVLI VEE PS INL.E * 

IjYGCP ph plhtsnkkp phtkkkqkkkkk 
kkkkkkkkkkxkkkkgggllkklwggpk 
ntggkkknffffwggkkknlgdflkknl 
fwgggkigptppqkknpigekkf 


7864 


21765 


A 


7924 


1 


398 


PTRPPTRPTSSRSRAPPFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFVSAALFFGKKKNFFFFF* FFRG 
*GFYFFPPLYFFICGGGGVFFFFFFAGW 
NIFFIPPKKNGIFKTLRGKGG 


7865 


21766 


A 


7925 


2 


335 


NLTFFPQHFLGLSGMPRRYSDYPDAYTT " 
*NILSSVGSFISLTAGILIIFMI*EAFA 
S KRKVLI VEEPSINLE*LYGCPPPYHTF 
EEPSI KKKKKKKKKKKKKKKKKKKKKG 


7866 


21767 


A 


7926 


1 


328 


RTRGRTRGESNPSLRIQNSPCHLSHPIL 
KKKKKKKKKKKKKKKKKKKGGPLKKNPG 
GAKI * PGPKKKNLFFKGGL.KKTPLGNFE 
KKPFFGGGKKRKKPPQKKKTLKEKKKF 


7867 


21768 


A 


7927 


27 


226 


LSR* KM * NKS HIYSFE KKKKKKKKKKKK 
KKKKKKKKKKKKKKQKKENRPKNGDSEN 
GHPKSKIFWL 


7868 


21769 


A 


7928 


398 


2 


GGGGGPGGRGRP* *NPHKKEGRDPPHPK 
KKFFPRRRKNRGGGGREKSPPQKKKAPQ 
KKPGRGFKRAPQKKKKTPPPPRKKGPPP 
KNFKKGAPPFFFFFFFFFFFFFFFFFFF 
FFLG I IiAR TTTTTNDMKNHR 


7869 


21770 


A 


7929 


3 


156 


HASAHASAPVFV* S VLMTAVLLLLSLPV 
LAAGVTI LLADRILNTTFFDFA 


7870 


21771 


A 


7930 


405 


249 


AS AHAS AP LFV * S VL I TAVX.LVLS LP VL 
AAGIT I LLTDRNLNTTF FDPA 


7871 


21772 


A 


7931 


921 . 


699 


DHPGQHSEILSLQKMKTMSQCGGAHL*S 
QVLRRLRREDCLSPGSQSCSEL*SHHCT 
LACETE * DLVFQKKKKF 


7872 


21773 


A 


7932 


423 


259 


HDLGSLQPLPPGFR*SSYLCLPSSWDYR 
CEPLRPAQRRGILMAATSRILTRSKGW 


7873 


21774 


A 


7933 


395 


35 


PRVKRFSCFWLPSKWGPRPRAPWPGYFC 
FFC*NGGSPFLPGGSWTLAPNYSPPPAP | 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN. 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenyla janine, . 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

T=Thrconine, V=Valine, 

X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QNGLVQALRP PGP PTFPFL I FPTKAPWS 
WPFFGGKKKKKLSPKGHSERHLHHKYIR 
LLSQHGGM 


7874 


21775 


A 


7934 


1 


118 


NF* I FCRDWGLALS PRQLSNS WPQVI LP 
PQPPKVLGLQA 














KKKTGLIKISTLCFVKNTFKKTKWCAPE 
RKKKVP I FLSYKRPVFKI YKKLFPFCRK 
KIKKKIPPPGFTSFSCLSLPSSWDYRHP 
PPCPVNFFCIYNKNGVSPG+PG 


7876 


21777 


A 


7936 


3 


131 


GFHR VGQ VGL * LLTIiGD PPAS AS QSAG I 


7877 


21778 


A 


7937 


334 


146 


HPQGLNI *NKMVSSSGAKHSSRLSLPKW 
WDCRQEPPCPVIMFLKRGINHALYSPSR 
KLLHFL • 


7878 


21779 


A 


7938 


2 


371 


S EPGAG YNLL VGR F * S PS E KCS I G VG VT 

CLALLWLVQGVLHPLSCTHCLALPSEMN 
PVPQMEMQKSPIFCIADAGSCRPELFLF 
GHLGSSPQSKL 


7879 












RLNLGGGGCCEP * S CHCTPAWVTKRNS I 
SKK 


7880 


21781 


A 


7940 


1 


191 


PTRPSQTPGLK* FSHLSLPKS WDYRCEP 
GLYSSRW 


7881 


21782 


A 


7941 


127 


383 


SKDCVRIVLLRAQAKAGSYRTVNWCRPG 
AVAHTCNHS TXjGS RGGR IA * GHE FKTS L» 
GNIVI LPLYKKNKKKKKRGARLKEPNLT 
P 


7882 












KNILPGYFCFFLREKPPPPGLESLFSKK 
K.KKKKLARHGVAPLLGRMR* EDRLRPGV 
QGCNELWWCHCTPAWATEQNP 


7883 


21784 


A 


7943 


3 


225 


EHGSLYPPTPGLKQSSYLSLLSR*DYR* 
AAPCPANFFI FNFL * REDLTMLLRLVSN 
SWTQAVLLPPKVLGLLV 


7884 


21785 


A 


7944 


2 


232 


T L LPDNLS QT I YPN KKKKK KKKKK KKKK 
KRGGPFKKKKF*TRGGGKKNFFFRAPKI 
FFGGRVLKKGGGKKPGEPTN 


7885 


21786 


A 


7945 


394 


41 


WCRWLETWGAGGSGAVPPPPFFPPHSSS 
WDLiALQGSGASSPFSPGAVAHACNPSTL 
GGQGGWIT* VQEFETSLANMMKPHLFSK 
KGPVTAVAHPAKAHRAMWCTPTHMHTQH 
VLHVL 


7886 












FFFFFSTGLFVYWSPTH*SLPVPRIFFF 
S YLWQTEVE TMLG TE 


7887 


21788 


A 


7947 


391 


212 


KEMQ IMPAMR * HLTP VRMV I TKKS KAKC 
WQGCEEKGTLAHSWWKCKGTAETGKLAG 
F 


7888 


21789 


A 


7948 


106 


301 


LQLPITQLVWHSIYFMKKIRYRLGVVAH 
TYNPSTLGGQGGRI A* GQVFKTSLSNTA 
RQKKKKKKG • _ 


7889 


21790 


A 


7949 


342 


208 


GLG AGTH T YN P STLDN *GGWITSGQE FD 
TSLANMVKHHWQTGQF 


7890 


21791 


A 


7950 


391 


21 


RQG FPMLPRL I SNS * P KGS AHLS LP KCW 
E YR S EP P S LAT WYVTLS RHFCS TLLLGQ 
GRSKFHMKWESRLEGLLAKVSSLRSSYC 
SSLRILGLEILLSPLLFFLETGSGSVSQ 
AGVQPLCPGLK . . - .. 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


IVI 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location . 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Is61eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine. V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


7891 


21792 


A 


7951 


146 


355 


ENIRQTQNEDWI KSSNETVFCKN 1 1 1 S 
LG WAHACNP S PL * GQGRRI T * AME F + T 
SLDNIARPCLYQK 


7892 


21793 


A 


7952 


289 


327 


KG I PPLPPRLKDKGPFFFFFFLKRESLF 
LP*FWKNGCRGPPLFFFFFLRDSVSLLP 
RLVSNSWP*VILLPQPPKVLRLQA 


7893 


21794 


A 


7953 


S61 


586 


DR VR S VAQARLQWHDLGS LQPPPPGF*Q 
FS CLS LLSS WD YRRMHTWP F VF* WRWG F 
TCVAQAGLDLLSSDNPPALASQSARI IG 
ESCCVLLK 


7894 


21795 


A 


7954 


402 


52 


PRKVS I S * PHDLPAS ASQNAG I TGVSHR 
TR PS FN FS Y I ETAI TETRKS LQ PS PGMN 
*SLPSGMVGFKPHSFIHSANFCEALAVQ 
QTLGEVPGHAPLSPPRQEPRVCPKKVCL 
KTALK 


7895 


21796 


A 


7955 


259 


3 


YNTKVLS FPI LL* TFLKHLWLGAVARAF 
NPSTLGGGGGRIA*GQEFKTSLGNIERC 
RLYKNLKLFLESGLFLMAWRGAKRVRTR 
G 


7896 


21797 


A 


7956 


171 


386 


LPIRLFSLVFTTIEKSRPGMVAHTCNLS 
TLEG*SGWIT*AQEFETSLGNMAKLSLP 
KIQESASLITRSQNK 


7897 


21798 


A 


7957 


397 


283 


PQPLPPGLKQFSCLSLPKC*DYRRDSAR 
PAKKNPTFNK 


7898 


21799 


A 


7958 


169 


324 


GTHRVEMCSQWLGWAHACDPSPLGGQV 
RKIT*AQEFGTSLGNIGRPCLYIK 


7899 


21800 


A 


7959 


290 


397 


DCSSCCS PLKGR*RG* AQWLTPVI PALW 
EAEAGGS 


7900 


21801 


A 


7960 


496 


298 


LSKTGSLYVAQSWYWLFTGTIMVHDSL 
KLLCSSSPPVSAS * VAGTTGAMRCCEVTj 
TTFQPDVYRL 


7901 


21802 


A 


7961 


404 


174 


PGPQKGS FF* KKKKKKKE IDHILSHKAN 
LNKC * R I KI I * TMLS DQNE I KLE I I I KR 
QLLKETLFLEMKKYTIRKLS 


7902 


21803 


A 


7962 


25 


99 


PGLFLSS*HQVILLPWPPKVLGLQL 


7903 


21804 


A 


7963 


436 


106 


GGGKSPPPGNWAPPGNPGEPLFF*KKKK 
KNPGGGGGRFFP P P PGGRKKKGVYPGGG 
GFYLTNFPPPPPPWGKKKNFFFKKKKKK 
KKKKGRIAICYIILFITSINFKKAFI 


7904 


21805 


A 


7964 


212 


395 


ERERKERDRDGEREREREEREEGRKKHG 
GR*GGREGERHRPRGAEDPDRAGRATDK 
RTGAL 


7905 


21806 


A 


7965 


344 


345 


FSDCYKRSSCASWTWYTRLAHHQPPARP 
GAAPS PRKRWVD FA VNRNFPSRRHPCAT 
PRPAPGATGRPSRVPKSTAFTQMLTQPR 
DPHLPLPIPPLSQPPPSP*PAPLVMR 


7906 


21807 


A 


7966 


397 


298 


RGFTMLPRLVLNSRTQVI*LPWPPTVLG 
SLSIF 


7907 


21808 


A 


7967 


92 


268 


I KMSLCPNFFFFFKKQNVGPRAGAYPCN 
PTILGGQGRKIT*GQDLKTTLVNMGKPH 
FY 


7908 


21809 


A 


7968 


257 


107 


RTHTHTHTHTHTHTHTHTHTHTCENSKT 
TKNXANNCS * LESQRNNGKCTYAKF 


7909 


21810 


A 


7969 


413 


12 


PPPPPLLTPWPCFFPQIL*PVGHSAPPL 
FK*LKIPRPFFFPPPPKGIGAPPPKIFF 
PPGFSLPPFVKLPPGEKILFPPPKKKKY 
PPPPPQS*SF*PPPPPPFFFFFFSTSAK 
KSKKNFLFIFSPFFNPKYYNT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, I~lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7910 


21811 


A 


7970 


399 


212 


TGVQWSNHSSLQSQTPGLKRLFLLILLS 
IWDYGCMPPYLSFKW* FCTKGGSCYVAE 
ADVNIL 


791 1 


21812 


A 


7971 


584 


424 


RPRRENRLNPGGGGCSEPRSRYCTPAWV 
TE*NSVSNRNKTNQNPIFFSCKQFD 


7912 


21813 


A 


7972 


105 


1 


PSMPQLSHL* NG I LGRVRWLTPVI PAL W 
EAETGRS 


7913 


218)4 


A 


7973 


363 


28 


S S SLGGHPQL W FHTLKLRPVTVSYA* N K 
LCDK IEKKLFFT I ALHNKFLRKKLTRDV 
KILFHNK*IKKEGPNEF*NILCLWIERI 
NINKVPYPSKVMYRSNTLPIEVAVFKSQ 


7914 


21815 


A 


7974 


404 


250 


FFFFW*R*GFTMLPRLVLNSWTQAIHPP 
ALASQSAGITGMSHHAQPLS I I F 


7915 


21816 


A 


7975 


2 


60 


FS CLGL P FC WD YRHAP PHLAN FCRDGVL 
NY*PQVIHLPQLPKVIjGL*ACTTTPG 


7916 


21817 


A 


7976 


325 


81 


KTIHSLLFGQFFLLQPLLPSPIPHAAPP 
TPHLI FLLLLFCF* FFK* RQGKQGLTTL 
PKliISNSYSQVTLLiPQPPKVLGFQA 


7917 


21818 


A 


7977 


406 


282 


QWRDLGSLQPPTPWFK* FSSLSLPNSWE 
TKAGRSREPGGGD 


7918 


21819 


A 


7978 


382 


221 


DC I I S AS YLQKNFFFFLGDKVLLCHPG W 
SAVA * S * LTVTS KRVHFYS SEISLY 


7919 


21820 


A 


7979 


423 


29 


FLW * RRGFTS LPRLVSNFWAQG I CLPWL 
PKVLGLQA 


7920 


21821 


A 


7980 


3 


238 


SLAFFVETGSHFVA*AGLELLSSSNSPA 
LASQSAE ITGVSHHTQP E * GYSHM * PQY 
PYQLRFNSDTI ILSNVLFVPNF 


7921 


21822 


A 


7981 


324 


128 


WLSVS P YTFE LSLL VGYE FVI HDV I * TL " 
IFVWWVSIiFCFLWFFLVCMVFWFFC 
CCFLSVFCI ■ 


7922 


21823 


A 


7982 


1 


86 


NLTMLPRLVSNS WAQAI CLSWPPKVLGL 
* AQA I CLS W P P KVLGL 


7923 


21824 


A 


7983 


1 


137 


RPANFC I FFVETE FCHVAQAGLELLG S G 
DPPTSASQGAGTTGMSH*D*PANFCIFF 
VETEFCHVAQAGLELLGSGDPPTSASQG 
AGTTGMSH 


7924 


21825 


A 


7984 


2 


277 


PRVRSVQRVIYQYV*NLQIHVPFDSTNL 
GIYLTD I LPHVWNDNMYKVFAAVS FVI A 
KY *NPCLTRRQ CNKLWP IHAME YYVTI K 
KNEDGRA 


7925 


21826 


A 


7985 


211 


42 


GPQKWPRGPGENLGGFFFFFFFCRDDGL 
TMLPRLALNS * VHAI LP PQ PR KVPGL WV 


7926 


21827 


A 


7986 


409 


183 


LLRLK* SSHLS LPKW WDYRCE PPCPAI F 
FKRKQILKESFPGENTCCHLFNSLEVLK 
NKRRTCYNFPNTLAVLKNM 


7927 


21828 


A 


7987 


2 


194 


RFFETQSRLLQPPPPQLK*SSHLSLQSS 
WDYRHVPPCLANFFLFWYFLQGQGFAFL 
LKLKFLFK 


7928 


21829 


A 


7988 


483 


209 


S S PSLGNFC I P S SGGVS PCWPG WFQT PD 
LVIWPPGPPSLVFFFFFFRDEGLSMLLR 
LTLNF*AQVILLPQLPE*LGLQADRQVP 
LYPAEVF 


7929 


21830 


A 


7989 


164 


29 


KFWFWLGTVAHACNPSILGSQGRRIA*A " 
QELEPSLGNEGRPCLYK 


7930 


21831 


A 


7990 


3 


474 


PTRAPLRAPFETIVQEEEFSKHHFGFEA 
AA*YWHFVDW*LFLYVSIY* *GSPPPP 
PKILI KKNGAETKKFFF 


7931 


21832 


A. 


7991 


469 


323 


VETGLHHVAQDGLELPTSGDS PASASQN 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=Phcnylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K- Lysine, L-Leucine, M-Methionine, 
IV — Asparagine, P — Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IG I TGVSHHA* PDFTFLEQF 


7932 


21833 


A 


7992 


469 


13 


KGDTVSSSDSSPSCGGLFPVGAS*LLCL 
FTQALAMVGARRPFSMLPCSLISDCCAS 
NERGSVGMGPSKPGVGYSLLVCRWLRPL 
EKRTI KVGVTR FSR CRLSQLPLARKGNS 
LTPCVFWVRHCFGSHSVGCTHCPAPAVR 
QAPVK 


7933 


21834 


A 


7993 


15 


439 


I ASGR PFS IKFFLVAITFLLFDLEIALL 
LPLP+ALQTTNLPLIVMSSLLLIIIIJUj 
SIAYE * LQKGLD * TEPNKKKKNKKKKQE 
KKKKKKKTKKTKGGGLLKK I LGGAP I L W 
GGKNMFFFFWGGEKKTPGGFLGENLFLG 
GGK 


7934 


21835 




7994 


392 


186 


YFYI INNFFFFKKIFI**I IFFFLKKFL 
*YSPKKVFFLFF* IFFFFFPSSSLFLFF 
SSSSPFFFFFFFFFFFFFFFFFLRG 


7935 


21836 




7995 


1 1 


3S6 


TWEVEVAVS * DHAI ALQPGQQE *NAISK 
KKMTGRMFIEVSFI IASSWKQLKCS*TG 
E *MKS LWYIHI IE Y YtiTI KENKVLEHE V 
S RLTL I DTGRERS KTKEYTL YNP IYTN1. 
RIAKLSGCGGSHL 


7936 


21837 


A 


7996 


3 


186 


DS FCF * RQSCS I TQAGVQWGD RGSL*PQ 
S PGliKR S SCFS LP KHWDHRDE P LHP AG V 
AVLN 


7937 


21838 


A 


7997 


316 


649 


RDHSFFFEMESRSVAQAGVQWHDLGSLQ 
ALS SGFM PFSCLS LPSSWDYRHPPPCPA 
NFLYFLVEMGFQHVGQNGLDLLTS * S TR 
LGLPKWLGITGVSHWHPAQDGGFHHVG 


7938 


21839 


A 


7998 


468 


336 


RRGFTMLARMVS IS * PSDMPASASQSAG 
TTGVNHHIRIHRSLL 


7939 


21840 




7999 


464 


283 


TL CGG I RPTN3 MKGNLLYS KS I GLNV S Ii 
LPKNPRRNIQNNV* PNTWALQPSHVDI 


7940 


21841 


A 


8000 


46 


487 


RQGRLS LQKFLL P F VQLCPAHRGGVYRG 
RQASLSCGGLHPVRVSQPLCLPT*VSAM 
AAAPHPASLLPCSLISDCCASSEQGSVG 
VGPSKPGAGYNLLVCHLLRLLEKRSIRV 
GVSQFSRCHLSWLPIiARKGNSRTPCTFR 
VRRCLiAIi 


7941 


21842 


A 


8001 


295 


1 


TQPWGTKRKLPLKKKKKNSGDREWRWLC 
NS VT I L NATTMYTS E WLRPGTVTHTCNP 

YKKILKISPLWWQT 


7942 


21843 


A 


8002 


144 


317 


ELFYLKEMV*RPGPVAYACFPSTLiGDRG 
GRIT*AQEFRTDLGNIARPCVYLRKKKK 


7943 


21844 


A 


8003 


69 


291 


HCVWTMI FIVIKYFYLNPQKNSMRYYYS 
HFMDKETDVQSD * MTC* RLGAVAHACNP 
STLGGRIA^GQEFRTSL 


7944 


21845 


A 


8004 


3 


176 


IVAHSSLTPGLKQFSCFSL,PSS*VYRNM 
LPRLANY* FFGTNRVSLSCQGWS *TLAS 
LLG S S S S PAS ACQVAKS TGTC YHAWL 1 1 
DFLVQTGSRCLAKAGHELCSCL 


7945 


21846 


A 


8005 


419 


178 


NPPPKKLFLSSSSSSPLPHLNWGTPGFF 
PPPPF*NPPPEFNFGAPKKKKKLSPPRA 
EKLVPFKGPPPFFFFFFF 


7946 


21847 


A 


8006 


429 


225 


QPLPPRFK* FFCLSLPRR*GHRHRPPHP 
FNFFF F * YFLR WS F VLVAQAGVQ CMDLS 
I YLNE FWLLDF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenytalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T= Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7947 


21848 


A 


8007 


756 


224 


PPRHQPQHVPPPLLHFSTPSPAPPKSHA 
AAPNSNPPQAHAPPAPPAPRAHSPSPPP 
RGATPPFPPPPFSFIAAVGRAPIFPREP 
LLPSSSSPPPPP*IGEPPGFSPPPLL,KT 
RPPK*N*GPPKKKNFFPPPRPKNWFL*K 
GPPPFFFFPFFFLRQSFTLVAQAGVHKT 
RLHLKKKSN 


7948 


21849 


A 


8008 


170 


559 


SSHLSPQSSWDYRSGMVAHTCNPSMLGG 
QDGRLT * AQKFQTSLGNI VRCHFYFKKY 
FKNK I FFYS PF INTNPK KKKKKGGAVLK 
NQSLRPRAGKVLVFLWGHIjIQFRGPRFK 
NGGTGKPGGLPNLiIjAPR 


7949 


21850 


A 


8009 


1 


662 


RTRRTRSWWYMPVT.PLSHYGEAGESLGP 
* KWMVD* APIAPLHSTLGDKMTLPDI KK 
KNGETVANTS PLSTTAKPAKLFLKAKKD 
QLMRJDLFLPKTPKKPQMSTGLDARSKRW 
LKI I WRRHGIWPLKNIGPTEDVQASAHG 
GVEENMTSDIEIPEAKHDHRPTEDVQVS 
AHGGVEENITSDIEISEAKHDHHLVEDL 
SESLSVCLEDFMTIGSQWKPICLS 


7950 


21851 


A 


8010 


377 


2 


NFFFFFPPFWGFFFFLGPGFFKVFF* IF 
PFFFPRNPPFF*FFLGVPLFFKK*IFFF 
PFS IRFCFFPKKFFFFLILFFFF* FPFF 
FFFVYVFDPFFS F T.LYI FFFVYFSLP FT 
VLI I SL I YHFAPA 


7951 


21852 


A 


8011 


1 


363 


PTRPLF* *RGCFTMLPRLILNSWPQVIC 
P PR P P KVLGLQT 


7952 


21853 


A 


8012 


278 


39 


FXXXFFLSPPXFLI FFFFFFFFFFNFFF 
FFFFPPFFFFFFFFFFFFFFFFFFFFFF 
FH*DQ* FSNFDFTP FTFTLI SN 1 1 


7953 


21854 


A 


8013 


79 


412 


MNECIL.GKGGKDGGFFEQCENLGLC*FL 
KVP * DS D I WLG AVAHAYN P S TLEG QGE R 
ST*AQEFKTRLGNMTRPCLYQKHTHKKK 
TKKKPHPGLGWGANGPTYLGGVGTKIT 


7954 


21855 


A 


8014 


277 


19 


FGNIHNFFLiIGSRSVTQAGGQWCNHSSL 
*PQTPVLKQSSHL.SLPSSWEIEAAVSQV 
R I TALQPGRQS KTL PGNSGPVLRADARR 
SI 


7955 


21856 


A 


8015 


I 


130 


AR I VS I S * PCDL PASASQS AG I TG I SHH 
TQLAHNTLKASPTLF 


7956 


21857 


A 


8016 


4 


417 


RLM P PHIiANFL F F * RHG FT YVAQ VGL KL 
LGSNDPSASTSQSTGITDVSHCTQPHLL 
KSSHSTFPLKTLDSMATNCSWDKVHYLP 
TACRALLPCPLPSACFPSLSQVPSVFTC 
CVPATIiTITIiLPLLRAFEHVTLCPIiF 


7957 


21858 


A 


8017 


388 


158 


CVTWASNCINWICFPLS*INSHLCNEII 
VLI*GEIYNVCISIYLCIYMCVCIYISV 
CVCVC I YVH* TPYL IHKCL I 


7958 


21859 


A 


8018 


154 


382 


G I KRPGAVAHNCNPSTLGG *GKWIG * AQ 
EFKTSPGNMVKPQIYLKKKKKKGGAVLK 
DPSLRPRAGKVI VFFWGPL I 


7959 


21860 


A 


8019 


396 


237 


FFFFFFW*G*GFTMLPRLVSNSWAQVIH 
LPWPPKALGLQVRTMVPGLRLANF 


7960 


21861 


A 


8020 


412 


168 


GNLLGSPJjPGNVCKNKPFILF*KKK*SL 
AVLPKLVSNFWP*MIL,PPWPPKI*GVQA 
RAPPPSQILKKKKSSHFYNSHVNSL. 


7961 


21862 


A 


8021 


1 


124 


TRFLHVSQAGL*LLTSGDLPASASQSAD 
YIFIISPAKIIFL 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G»utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, tt=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7962 


21863 


A . 


8022 


3 


200 


HLS P YFYFFKDRVALCHPGWS AWQ * PI 
VTSIFWSSC*KLTSVVRRWRPSWLTR*N 


7963 


21864 


A 


8023 


3 


132 


FLFFFQF*FMRLGLAMLPMIjGMNSWA*V 
HjLPQPPEWLGLQA 


7964 


21865 


A 


8024 


3 


187 


VGQAGLEP PTSGDLPASASQS AE I TRMS 
HRAQTK*DFIGSFSEEFS 


7965 


21866 


A 


8025 


405 


231 


SDKW I KKMRYIHTTEYY + ALKRKE I LTH 
ATTWXiNLENI L.LSEI SQS PKHRYYVIPL 
T 


7966 


21867 


A 


8026 


175 


382 


GKRIFPPYPPQEGGKTGPPQTPGLFFFF 
LKKK*PGGIjGGARFSPLLGGVGRKNSFT 
LKGGGAINLKGPPSL.PPGGKRWAPPQKK 
KKKGQAWWLMPVI PALWEAE AGRS 


7967 


21868 


A 


8027 


83 


374 


GDQVWWLMHACNPSTLGGQCGRITWDRE 
FETSIiANX FFP I S TKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKAS SGGGVS * K 
KNFSPRAGKEIFF 


7968 


21869 


A 


8028 


382 


148 


S PHCS LDLVGLKQSSHLSLPST * DHRC V 
PPRLA.I FSFAFLFFWEATTTTTKATMLP 
RLVLNS*AQAILLiPQPLKVXiRL 


7969 


21870 


A 


S029 


88 


208 


S GS V * ENNPLS WAW WLQS 1 1 PATQEAE V 
GRLLEPRSSRSA 


7970 


21871 


A 


8030 


1 


283 


NKSRIKKAIKMTEKMKEKIEMMKLMNQL 
PNGEHHAMIQMMMKIEMKGRMKIKE * DK 
ESDKDDRKDEREDRDDETDEPTPERRTS 
RDDSDDDEDRDEGKNEDKRKDDSKDDDE 
ADEDTDQDDYDPMDAHKAADEDDDEDED 
EVAE QMTKKKKKKKKKKKKKK* * KDDRK 
DEREDRDDETDEPTPERRTSRDDSDDDE 
DRDEGKNEDKRKDDSKDDDEADEDTDQD 
DYDPMDAHKAADEDDDEDEDEVAE 


7971 


21872 


A 


8031 


2 


139 


LQTLLPLIAF* ♦LLASLANLALPPTINL 
LGELSAQFAVIGMYILY 


7972 


21873 


A 


8032 


80 


324 


YLHFYFFVR * KLMGLDTVAHTYNPS TUG 
DPERRIA*A*EFKTSLGNMVKPCIYKKK 
KKKKKKKKKKKKKKKKKKKKKKKKK 


7973 


21874 


A 


8033 


299 


158 


GNQPGASAHTCNPSTLVGQGGRII*AQE 
FKAS LGNMVKH YLYQKHKN 


7974 


21875 


A 


8034 


416 


60 


AQWLTSIIPPL*EAEVGGFFEPRSLSPA 
WATE 


7975 


21876 


A 


8035 


398 


271 


FYFL*DEVSLLPRLVSNSWTPAII J PPWP 
PKVPGLRHEPPRPA 


7976 


21877 


A 


8036 


385 


151 


FLYFL * RLGLTLLPQLVSN I SSSDPPT * 
ASE S AGI TGVSNHAHLRIC I LTRLPGNS 
SH *NVRRAGLGDTQSSALI PPG 


7977 


21878 


A 


8037 


3 


406 


PAS AS * VAG I TGTHHHAQL I FV FLVETG 
FLFFFFWAVFLFFKTKNGPPGGQLFFFA 
APPALGGGDFGGQGATFSPKGLGVLGET 
RGGAPAPKKLGAKKE PSHLLGGGAQNLP 
KPRGQKGFGFYFYFLARDFFIjG 


7978 


21879 


A 


8038 


428 


20 


LANQ*WKPPSNPWETKISPKKKKGPPGQ 
GPPPI/TPPLWEAPKARKLGSPWLPR*NP 
LFPQKPKKFFGFGGGPPYSPLPQKLSPK 
NGVTPEVGPSHKPKFPPSPPPLGQK*TP 
FPQKKKPKRLKRGMFLHGSTFQSL 


7979 


21880 


A 


8039 


3 


194 


QNKI SQVWWYTPVILATWEAEGGESVEP 
GRQRLW*AKTAPLHYSQDEKYLKASVDS | 
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last amino 
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peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutnmic Acid, F=Phenylftlnnine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YCLISCH 


7980 


2I88I 


A 


8040 


415 


96 


LEPL*KYFPILDAIKNICDSWEEVKIST 
LTGV*KKLIPMLMDNFEAFKTSMEKVTA 
D WE IGELELE VEPAD VTELLQS CAKLE 
GMRS CFLWMSKESGFLRW I YS W 


7981 


21882 


A 


8041 


413 


254 


PGFKQFSCLGFPKR*DYRHKPPCPASKY 
FKINYLRNIYRSLILKN 


7982 


21883 


A 


8042 


409 


102 


GFFPFGPFFF*PPPPGVFPPPPPPNFWF 
*GFGPPPPPLF*FFFRGGPKKIWFFPFF 
LWPPLFFFFFPFWPKKTPPKKARSLKIF 
FFLGPKKIPNPFFFFFFF 


7983 


21884 


A 


8043 


414 


153 


GRGWATNFYFFLI ILK* FFFIFKKFFFF 
HLFFFFFFFFFFFFFFFFFFFFFFYKIF 
LFTLVLTCPQTSCREAEPFDHKVCSVLK 
ML 


7984 


21885 


A 


8044 


401 


292 


NYLPPQ*KKKYYPQGPFRKPIKKSPPPC 
FFFFFFFFPSFPPFLNIPPPQKGKFPPK 
KYFFNPPLFPPLFFLKPPPLFFF*GPKK 
KNLIFHPPSKKICPFKRGPP*FFFFFFF 
F 


7985 


21886 


A 


8045 


370 


14 


TPPPTKTPFFFLYPPPWREKREGFF*KA 
NMSPPVWKKKIPPLPPYIYL*KGGSRYL 
AKLSLNLKGSIYLSFCFPPLPPSLPLSL 
FERWGLTILPKLVLNSWAQMILLPWLPK 
VLGLKV 


7986 


21887 


A 


8046 


395 


268 


PFFFF*ETGSCSIVQVGVQWLFTGTILV 
LISVGVLTCSISDL 


7987 


21888 


A 


8047 


379 


98 


GPAHNSPPLGGESETPLGYPGKPRFF*K 
NKKINPARGPGPWPPPPQG*AGKRPLP 
PRRGFQLTQNGPPPPFPGGKKKPPFQKK 
KKKREPAIC 


7988 


21889 


A 


8048 


369 


177 


RNTRGGPTFFFFFFFFK*NLARLSRLEC 
HGT I S AHCNFRLLGSSDS LPLTSND IGQ 
INKSLFA 


7989 


21890 


A 


8049 


1 


128 


ALGLVAHAYNPS TLGSQRG KI A* GRE FG 
TSLGHRARPCLYKK 


7990 


21S91 


A 


8050 


166 


163 


PVHQGEQTTQDKCL,R*STHLGLPKC*DY 
RRE PPCQAS KLVPTG I LADS KHS KCQTM 
DSCFSNQLYKQR 


7991 


21892 


A 


8051 


402 


245 


YFKSVCQAQWLNTYNPSTLGG*DGWITC 
GQEFRLPWPSGITGASHHAQPHFF 


7992 


21893 


A 


8052 


404 


268 


QQLWRLRQRDCLNLGCGGCSEPRSCYCT 
PA*ATEPDPVSNKQINK 


7993 


21894 


A 


8053 


1 


154 


GFLRVGQAGLELLTLRDLPTSASHSAGI 
AGVI YRAWPGEA YL FSG * WLRFT 


7994 


21895 


A 


8054 


406 


150 


PKKKKNPCPPPEVKFDSLKRAPLFFFFL 
EESCFVTQAGLECSGAILAHCNLCLVGS 
SDSPLSL * RWVGNSHF I FTSLLSVS PRS 
N 


7995 


21896 


A 


8055 


1 


125 


TAVILI I FMI * EAFASKRKVLI VEEPSI 
NLE*LYGCPPPTH 


7996 


21897 


A 


8056 


400 


260 


HMSPHLAFFFSFLFL*RLDMLPRLVSNS 
WPQAFLQLLLPQVLGLQV 


7997 


21898 


A 


8057 


396 


263 


FLQPGTVAHACNPSTFRGGLIT+AQEFK 
TS LGNKVRP CLYKKQY 


7998 


21899 


A 


8058 


402 


211 


FSCWWCTKPKAAG*FPAPWGPPFLNNPP 
GAGTNFPGPPRSGGFGSPPPPWPGPRRP 
CPNGIFFC 
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peptide 

sequence 


Predict- 
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nucle- 
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sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
C=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7999 


21900 


A 


8059 


-j 


279 


VLLYHPG WSA WQS WLMQ* TPSNADI WR 
IARGFKRSSHFSLLSSWDHRHVTPCIiAN 
FFLIL* IWGLAMLSRLLPLRLRKPVLLD 
*APSPCGLI 


8000 


21901 


A 


8060 


408 


315 


SLTMLPRIVFNS*AQAILLPWPPKVLGL 


8001 


21902 


A 


8061 


198 


2 


R I QHVE SNCHLE VEGGWAQ WLTPS T LGG 
RGRMIS*AQEFETRLGNTVRPCFYKNLiK. 
SQPGMWRI 


8002 


21903 




8062 


3 


294 


GLSPTAHPITAFTSQPLPSGLSVCRLFL 
WAMCEQCPHLP I RPCCPR I I GPECALYT 
TSCLLHQLKKREGF* KKKKKKKKKKKKK 
KKKIF* WGGGKK 


8003 


21904 


A 


8063 


197 


184 


IHDFLFFF*GRVSLCCPGWSAAAQSRLT 
AL* PGELKETFYL* I RRS WDHKHAP PL>P 
APF*LFWVTLIITPSTHEVSSAVCSLLY 
VKSEQKCQ 


8004 


21905 


A 


8064 


3 


373 


TSGSLAKRPADCLSAQPTTPSSLCGCS P 
VR VC I TVPRCTHARAHTHTHTHTHTHTH 
SNHG * APGAL Y I QELQRGKVSHVP E KEK 
SDPETA*TKGG*HVTNPKGRQRSGEQEE 
HRICVGRCCSG 


8005 


21906 


A 


8065 


532 


106 


ENKPIHNLSINKYIGKILRDKALLYCPG 
WRAVMQS*L.PVASNSWAPAILLCPGF1,S 
TWDYKRVPPQLVKIFSKICKKNKPTNItiR 
PKCHTRNQD I YKRGKVGFLCS YCPPALL 
LLLSIjLLiLIjLLLFSFSTRDSASGAGCSS 


8006 


23907 


A 


8066 


411 


130 


PKAQLI FLRR + SLTVFPR WSNSWPQAI 
LPPWPPKVLGPQARAMVPSPKLVLESIL 
RIPTSVIFFPLNLCSNIKLAEIKEGCSA 
AFL FNQRFAN 


8007 


21908 


A 


8067 


3 


32 


DAWADAWGSLTDEWVEKIWYIHAMQYYS 
QRDDYCMIPHI *GRTRGVH 


8008 


21909 


A 


8068 


410 


116 


STFNIQTWPGTVAHTYNPNTLEGQAGRI 
T*GQELETSLGNITRPNFLLFCQACPGQ 
GLSTVRMCGSLHYFVLPLRPLLKCLDNR 
KSPFYHPVILPCYE 


8009 


21910 


A 


8069 


158 


289 


DLCSSLRNIVRPQLYKK*KISQVRKKKK 
KKKKKKKKKKKKKKKKKKKK 


8010 


21911 


A 


8070 


388 


1 


APTPPCFLFKPFVLGGPPFGKAFFQKFP 
GGGFFPSISGPKPPPPRNPSLISKGKRV 
PSPQSPIMVFPGAF*RPPPFPLMVG*KI 
KIFLPPPPPFKTPPALiSPFFFFEAEFRS 
CWPGWNAVMRSRLSATS 


801 1 


21912 


— 


8071 


15 


336 


KLDKRYDRTGS PGTTHASGYL I EPL VAE 
ASYELILSIiAFFFFERRGLAIiLPRLEGS 
GVI VAHCTLKLLGSGGPPASAS * VAGTA 
GMNHHEGHEPGLFEKQNSGFIF 


— - 


2 1913 




8Q72 


396 


~274 


PHLHTWESSL*RQGLAMLSRLVSNSWPQ 
VILSPWPPKWVL 


8013 


21914 


A 


8073 


189 


54 


NKPLGPGMWQACNPTPLGGQGQWIP*A 
PEFKISLGNLAKPCFY 


8014 


21915 


A 


8074 


3 


257 


HAFVLIjFETRSCLiSLRIjESSGAMIAYCR 
LEKLGSGDPLTS ASQVAGTKGM * DDAWL 
RQVSNSWPQVILRLSHPHMFLFQFSLET 


8015 


21916 


A 


8075 


1 


119 


TLPATWEAGAGGLL,EPKSLRV*CVFTAT 
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SEQ ID 
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sequence 


M 
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nucleotide 

location 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalnnine, 
G=Glycine, H=Histidine, I=!solcucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VNSSRTPAWAA 


80)6 


21917 


A 


8076 


232 


266 


LPYCSGQWCLGLPIIQGVQWHDHGSLQP 
RLPGL,NQSSCLSLPKCWDYGREP*YIiA* 
YQARYHGSLARQNVFNI RPGI MVHARS P 
STLGG 


8017 


21918 


A 


8077 


390 


143 


LGGFLVFCPPAKKGFFPNPINLGTPRFF 
PPPPF*KPAPEINFGGPKKKKIPSPPPG 
VKFDPLKRPPPFFFFFFFFFFIRLLV 


8018 


21919 


A 


8078 


424 


144 


LKNYI FFTFIi *NRDRVSLCCPRIiVLNFW 
AQE I LLPWP PKVLGLQA 


8019 


21920 


A 


8079 


182 


45 


VHKAGMVAHAYNPSTLRGRGGWIT*GQE ' 
FETSLASLDNMVKPRLC 


8020 


21921 


A 


8080 


405 


163 


YFLEEMGFHHADQAGLELLTTGDPPGGM 
SHCTWLNLNVNLI QKHPHRN I QNNV * PH 
ISGPVKIiTHEGNHHNWHMSTQKHSP 


8021 


21922 


A 


8081 


39 


240 


QS KTVS KKKR K 1 1 FCRYRVS * S PGFKQS 
SHLGLPKGWNYRREPPHPASTFSQSFPV 
KDHFCSGVSS 


8022 


21923 


A 


8082 


2 


263 


DCCVS SE *GS VGVGLS E PGAG YNLLVFR ' 
LLRPLEKRSIRVGGSRFSRYYLSWLPLA 
RKGMSPTPCTSQVRQCPTLLLGLHPL.CD 
KPQ 


8023 


21924 


A 


8083 


1 


248 


SVGVGPSEPGA* YNLLVCCLiLRPLEQHS 
IRGGVSQFSKYRQSWLPLARKGNSPTP* 
ASCGMRGPALLWLTLLGLHPLSNKPQ 


8024 


21925 


A 


8084 


3 


255 


HLSLPSSWDYKRMPPHLANFFFFFLGKK " 

GVLP*GQTGLEL*N*KNPPA*PPKGAGV 

KGVTHRPKP*NGGKGRFEKDTPGGPFL 


8025 


21926 


A 


8085 


409 


145 


LRGPYKKNFSFQAPGRKIGSFKKAPPFF 
FFFFF*GQSLCCPVGLSTGVTIAHCSI>E 
LLTS S D P AS S AS R VAGTTG ACH CS * VL Q 
TFTF 


8026 


21927 


A 


8086 


199 


26 


GFPPPPNKTEIF*RPPDFLNFFFLVEMN " 

LTSLSRJjVSNPWAQAVLLPWPPKALGPQ 

A 


8027 


21928 


A 


8087 


222 


3 


SVGITGV3HHVLWYFTLKGFLGRGAHSC ' 

NNSALGGRGGWIT*AQEFETSLGSVARH 

HLYKKYKNYPVAWRL 


8028 


21929 


A 


80S8 


415 


197 


TILCFYHQNNSVFSSNPFLSL* IFIF*D 
RVS FCHPGWSAWQS WLTVFP I LCATVK 
YFTCQQYI IMI FALNS 


8029 


21930 


A 


8089 


297 


71 


FIFPENKNRPE WAHACNTLRGRGGRI T 
*GQEFEKSLSNIARPHLYKTKTQNILSK 
YIQYLGFQLTLQIDLANIY 


8030 


21931 


A 


8090 


185 


415 


PFGLKQSPHLTLLSS*DYKHTPPCRVNF 
FFFLGGFGFFPKAGFNLGN*RNFLASPS 
KRAGMAGINNLAGPIFRFFKK 


8031 


21932 


A 


8091 


2 


125 


RKQASKKERKE * KK* RKKERKKERKKER 
KKERKEKERKEEP 


8032 


21933 


A 


8092 


8 


412 


KGPQGTThTCIFDhSYSSShSHVISQShC 
VAFPLPECIFPHKPLCPACHFFFFQSWG 
IiALtiPKREYRGKIGVNGNLELIjG*NNPP 
SGTSKIAEPAELFHHTHARPFNWIIKTD 
FPYIAQTGF*LRPSRNPSTLEV • • 


8033 


21934 


A 


8093 


249 


143 


NIFAATWMRLETI ILSEVTQG*KTKHQM " 
FSLICGS 


8034 


21935 


A 


8094 


3 


280 


PERWD YRHEPPGP AS FT * LLLFVFNL * R 
KGLYVGQRGRQLLASSSLPAPASLSAGI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Vnline, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, i= Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IRMSHCTPAWVTVRPCY* INK* INGKDT 
IRQNLKKM 


8035 


21936 


A 


8095 


404 


16 


QMGFCLFFLPSSKEVFFPPIFFFYRESF 
FLFHFFLDPRQY*LYFIIKKNIFLPSPF 
IFFLFIRVSPYIFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFWGSMEL 
LSKAHQCISSHGSG 


8036 


21937 


A 


8096 


358 


200 


KSYFIFSLIIFSHYFFLLFLKHLLATYY 
CFLIDCTLFFSFYCPFCCLFVLLSGRFP 
RSCL*TFH*RLYFSFPIFNVQKLF*LSD 
ICFCCIYLFIYLFIYFETVFCSCCLGWS 
MMA* FNGMFKDR I 


8037 


21938 


A 


8097 


2 


132 


KEEKKERKE* KKRKKERKKEKREKKRKR 
EGGRKEGKEGKNS S T 


8038 


21939 


A 


8098 


48 


331 


PGTWHACLGGEDILKWQTKGRGRRAGSH 
RTSWIEPLETVFLELEAGGRLESPGKNY 
SRLGAVAHICIASTLGGRGRQIT*GQEF 
DTSLADFCST 


8039 


21940 


A 


8099 


7 


373 


NGRLRRGFAMLARLVLNF * PRDPPT * AS 
QS AG I TGMSH HAR PTIESFIHY CWE CKM 
E*PLCKLILYFHLFIFKRQKLGVHWCHH 
SLLQPPTPGLQRSFRLTPPSGWDYRRGP 
PCPANI*LII 


8040 


21941 


A 


8100 


116 


432 


YLRNLFYQRQ J AFSPAILNLiviCSAL*ESH 
TVAQAGVQWHNLGSLQPLPPRFKRFSCL 
SLPSSNIFAPLSYFFRNSSFTNISPSEI 
GPQITDPLFIKKKIKVSVMVE 


8041 


21942 


A 


8101 


361 


138 


FNLLGRKKKNFGGLKEGESRRPPNPGPQ 
FFFFFLFFFFFS G * R WGLTML PRLVLNS 
WAQVI LLP WPPKS LGLEV 


8042 


21943 


A 


8102 


1 


199 


PTRPFVFLMEMEFYHIGQTALELLTSGN ' 
LPASAS QSAG I TG VS DR*LA* YLFFFHE 
LFCYFALLLI 


8043 


21944 


A 


8103 


412 


46 


KGGSPPPPPPPKIGRKGPPNPGDPFFFF 
SPFFFRGF*GSPFPPK*KVLLVGFLKPK 
PPFPLGWGNPPPPPVGGFSPPQSLQFFF 
FKKVFGWVFFFPLFFFFFFFSPFFLFFS 
ETESHHIAQA 


8044 


21945 




8104 


192 


2 


EETELSFFIIKYYLPRNQKVCKRL*RRL 
G PVAHACTPS PLGGQREW I TRSG VQDQP 
GQDGETP 


8045 


21946 


A 


8105 


373 


183 


GGPF*FQYILILFDNFFFFSLSFPFSQD 
FFFLSSFFFFFFFFFFIINYFYFF 


8046 


21947 


A 


8106 


373 


281 


SFKTAPPFFFFFFW*RLSLTMLPRMVSN 
SWAQWLLPQPPKVLRL*ATQRKKHFRQ 
EQI 


8047 


21948 


A 


8107 


3 


114 


FLIFVHIGPNYVAQAGLELLSSDDPPAL 
ASQCAGITGMNHHARPYYLIj*TLNIPLP 
YDPEIFLLGIYPR*MKTYVHTKACT*FF 
ITPLFVIAKTWKQPK* PSVLGLQA 


8048 


21949 


A 


8108 


416 


243 


PGVFPPPPF*NPPPEIIFWAPKKKKYFP 
PPPPLNFVFFKGPPPFFFFFFFFFFFFF 
L 


8049 


21950 


A 


8109 


2 


211 


SVTQAGVQWHVSLLQPRLPGLEQSSCLG 
L**CWDCGHEPWHLALVTTSKHTESCSL 
LCI PTATFLGLVT 


8050 


21951 


A 


8110 


172 


28 


HWCFITSFFGFYFL*RQGLTKSPRLVP 
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USSN 
09/515,1 
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nucleotide 
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correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last nmino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, ' 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valinc, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NSWV* VI LLPWP PKVLGLQA 


8051 . 


21952 


A 


8111 


2 


225 


QKSAKNFYHLS I YLS I YLS I Y PS I YHLS 
I YLS 1 1 YLSSTYE * V 1 1 LEQKWNS I KKG 
CL*CTIQTNCL*QLKE*VIRLIPKL*YK 
IKLPTCWSVHLiNWFTLCNISWYLKYIAM 
**RKLVCFA*FWYNSNKLLITIKRIGH 


8052 


21953 


A 


8112 


392 


221 


PFPKRESPPDKVNPPPPDIFGGGKTKKK 
TLANN*FPPKEKEFFVFFFKKEDFF*NI 
VGNFKTPSKI*AVCF*NPVFKKKKGAPK 
KKSSFKNF 


8053 


21954 


A 


8113 


369 


112 


IFFSTIYYFXHPFFFPPPLFFTPPPLFF 
FFLKKKKFFFFFFFXXXFFFFFXXXXFF 
FFFFFFFFFFFFFFFFFFFFFLAF*HVY 
SF 


8054 


21955 


A 


8114 


418 


248 


KYYLPTPGY*NIiILiliNGPPFFFFFFF*D 
RVSPCCPRLP * TPRLTQSS CLSLPRSWD 
HRCMPLRLVTF * FTFKLSHQS LRTLLTY 
TLAGTTLG 


8055 


21956 


A 


8115 


229 


78 


SFKGAAPFFFFF* *RQNLTMS PKLVSNS 
RPQAVLP P WPPKVLGLHRGGWI T 


8056 


21957 


A 


8116 


356 


66 


PLFPFFYIKNFKKKFFFFFQYFFFFPPF 
PKHYSHKDIFFFFI IKFFPSSSFSFPPF 
FALSSFFFFFFFFFFFFFFFFHA* FPVF 
FLYKI PMLYPQL 


8057 


21958 


A 


8117 


212 


81 


ERX.SHCSLN£,PDSSNPPASAS*VAGTSG 
THQLVMWHAGS PRF 


8058 


21959 


A 


8118 


367 


2 


GVFFFFFSPAKKGVFFHPFFFLGPRFFP 
PPPFFFPPLWFYFWGP*KKY^ FPHPRGY 
EFFFF+GGPPHFFFFFFFFFFFFFFFFF 
FFFYAEGIGVSPCYIMLGYNFSSFPCGT 
ISPRVRPRV 


8059 


21960 


A 


SI 19 


38 


215 


FFKSAFHHTRXIFNFFFFLEKTGSHYVA 
QAGVELLG* SSPTTSAFEIGGITGLFLS 
FKN 


8060 


21961 


A 


8120 


266 


2 


PQIFLFIFETRSHSITQVGVQWCDHSSL 
QPQNPGLKQSSLSS *DYRHTLTAPG* FC 
FL*RCGLAMFPMLVLSSWPQAVLKIHPC 
CGMC 


8061 


21962 


A 


8121 


3 


332 


DAWADAWGHVPPCPANFVFLV* TGFLHI 
GQAGLELPTIiSDPPASASQSVFIYFLET 
E SHS PAWETERDS VS QKKKKRGGRFKG S 
NFTSAGLQRFI F FMGP PKLI SRAGV 


8062 


21963 


A 


8122 


378 


66 


LFIPLPIREEVFPCIPG*NWAPPGVLYN 
GRPFFFFLLETVFLHAGQAGLELLGSSD 
LPVSASQSVGITGMSHRVQPKLRIFVNI 
SFPFEIFCEI MLLVPLAQP 


8063 


21964 


A 


8123 


2 


316 


I S MLIaALLLiR I ITFRLPQLNGYVEE FTP 
YECGFNPISPGRAPLFIEHFLVDITFLL 
FDVQ I ALLL PLP * ALQTGNLPL I VM PSL 
LL I I ILALRLDYERLQNGSD 


8064 


21965 


A 


8124 


331 


14 


ITSILLIIITF*LPQ] J NGYI*KSTPYEC 
GFDPISPARVPFSIKFFLVAITFLLFDL 
EIALLLPLP*ALQTTNLPLIGMSSLLLI 
IILALSIiAYE*LQKGLD*AE. 


8065 


21966 


A 


8125 


2 


228 


IIITF*LPQLNGYIEKSTPYECGFDPIS 
PARVPFSIKFFLVAITFLLFDLEIALLL 
PLP*ALQTTNLPLIW1SSL 


8066 


21967 


A 


8126 


451 


200 


KA* WNRFAIIjNVTKYTCESQEEVKI PTF 
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peptide 
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Predict- 
ed end 
nucle- 
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correspon 

Inst amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^Clutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K— Lysine, E— Leucine, A'I =: IVletliionine, 
N=Asparagine, P=Proline, 
Q = Glutamine, JR— Arginine, S == Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AGVWKKLI PAHTDDFVHFKTSVQEVATD 
VAKIAS LLE LE VE S ED VTE * LQSQDKT 


8067 


21968 


A 


8127 


2 


396 


LIVIINTLLALLLIIITP+LPQLSGYIE 
KSTPYECGFDPISPARVPFSIKFFLVAI 
TFLLSDLEIALLLPLP*ALQTTNLPLIG 
MSSLLLI I ILALSLAYE*LQKGLD*TES 
GWEKLVKKEKADKLKGGRP 


8068 


21969 


A 


8128 


1 


350 


GTRGLQTLLPLIAF* *LLASLANLALPP 
TINLLGELSVLVTTFS*SNITLLLTGLN 
I LVTAL YS L Y I F TT TQ * GS LTHH I UN I K 
PSFTRENTLMFIHLSPILLLSLNPDIIT 
GFSS 














LTCFSEHRPPPRANFFFSRKTKAPFFLP 
GGFQ I PGPNPFAP P I FPNGGP I PGLDPQ 
PGPLNFF*KSFFPDPKFFFLFFPPPI *N 
SGVFLPRNPVFPKTPNFLFFFSPFFPGD 
GVLLCHPGRLPRA 














GIRHGFAMLPRLVLNSWPQAILLPWPPK 
LLG*QAGVELLASSNPPALASQTAGMTG 
VSTRPPVLLPTLGTHLCLDACLGSSL 


8071 


21972 


A 


8131 


396 


5 


KYRDGPPPPGKFFFFFF*DGFSIjCCPGW 
SQTSCPK*SSCLPEG*DYRHEPPHMTLS 
S S Y * CSQMQKCINKNCLGS CWAHACNXi 
SPLGGRRITWGKKFETSLDNIVKNRLKK 
KKKKIYQGGGDHPCIFTN 


8072 


21973 


A 


8132 


347 


34 


KAGPV I PRGPQE F FF FFFPQDG I LLCF P 
GWRAVAKS * VTATSNS WAQAITGMLPPC 
PANF* KIS*RQS ISMFPRLVPNSWPQVI 
LPPQPPK*LGLHVCLVPSLA 


8073 


21974 


A 


8133 


102 


270 


D YRRTP PHIjANFFYFL * RQ I LTVLPMLV 
SNSWVQAILLLQDFFSVQLKMGFLSVP 


8074 


2 J 975 


A 


8134 


338 


47 


FLSTQKESSPLHTHTHTHTHTHSHLLTH 
HHGRGQERGKRGR * GG * WRTGTHS PHG P 
HCVRLRKSRPSGELGAAQGGCSLIQPKI 
EDPGQSGGPREVP 


8075 


21976 


A 


8135 


68 


467 


SLLNRI PFCLCLNSFDCVFLLGLLFEAG 
FCS VAQAGVWWYEHGSLQP * PRGLK* S F 
HLSLPKFWDYRNEPPHPPFSCFKALNFK 
DYCDSGTVPCPFMKELSMLFTHALTESF 
VLFIVMSKLITTKHLTIISNV 


8076 


21977 


A 








FHHVGQAGIQLLTSGDPPA*ASQSAKIT 
GVSHRARPQTSFLNKLPSLR 


8077 


21978 


A 


8137 


58 


222 


I YG YEGRGS VAGS LSS L * SATTDSDLD Y 
DYLQNWGPRFKKLADLYGSKDTFDDDS 


8078 


21979 


A 


8138 


345 


196 


KGDQAQWLRPGMVAHI CNLSTLEGQGGR 
IT*AQKFETSLSNTLRPPISFK 


8079 


21980 


A 


8139 


500 


355 


FHHVGQAGIQLLTSGDPPA*ASQSAKIT 
GVSHRARPQTSFLNKLPSLR 












231 


HASADAWGRNEWPCAVAHAHNPTTSGG* 
GGWIA*AQEFEFETSLCKIVKPHLSKKK 
KKKKNFSPGWGSPPCTPLF 


8081 


21982 


A 


8141 


416 


26 


ARIFPNNSFWPGQGGFPFPPRPLEGPGG 
VIP*CQGVQARPAPPGEPRLFPKPQKIV 
GGGGEPPLFPPPERAGPQKCRPFGEGGF 
N* PKPRP CP PGWGAQGGFVSKKKKKRNE 
DSKIPLIGKYPVGIRNMA 


8082 


21983 


A 


8142 


413 


35 


RGAGGGGSPPGPRYPSQQHLLKARTPTP 
FARSRGSFPGVRGSGWRGGLGG*SRERP | 
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peptide 
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Predict- 
ed end 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E— Glutamic Acid, F=Phenylalanine, 
C=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argininc, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LPSLPPRALPRSGASSGPLDGGAQGPGR 
RE RW AFLGG I RADARGCRAVTS PEPYRL 
PGAGAARARR 


8083 


21984 


A 


8143 


166 


368 


EN I LWAAMWLDCHPSTLGG- 1 - GGRI T* G 
QEFKTNIiANIERPHLFFFFFFKRDSLFA 
PRAEGGGPILF 


8084 


2 J 985 


A 


8144 


1700 


621 


DLRDHSVLQQGPACLTRCGEPILSHSEG 
SKIGSERESGEHSRHPEAPPPSVEVGSR 
RS * GQDS * G FG PGNGQDTRG PL SHAAPN 
WTPDPAAAVAAGGTSLPPRGGGGI *RGR 
SHTARSGARVGSANAHARGGSGFSRHSG 
SHGCLRNQSPRLGKAGALCGSAWVSGL+ 
GSGLTSPRGRRERLRMPGVAKGRPR*GN 
PRRGRGKQHG I PASGLPGRCWLGI KMST 
LQPRTRAPEELFVPQESSGSTPEKMLVS 
FHGSSLRNEATPRYSQQEEAGNGRWQQS 
LSLERWPPWTSHPLGTPPLMPVAVARCC 
ILPGLWPLLSPPSGSASPVSQGPGCLSL 
WPNAFKKDDFPVRQGDTSWRSVS 


S085 


21986 


A 


8145 


390 


38 


NTTTFNIYFFYFFFFSLFIYFFFFFFFF 
FFFFFFFFFIFILFIYSLLVFGVWVLGL 
GLALALLAKLFLVNSLCRRYRG * SLLYY 
AWL*FFIFPLRYYIYCARFQFLSPILYL 
GKWFG 


8086 


21987 


A 


8146 


20 


187 


KLTTS TGRNPGRPTRPNF* VF * RDGDLI 
MLPRLVSNSWPLVNLLPWPPKILGLQM 


8087 


2198S 


A 


8147 


417 


64 


VRPRVRPRVRPRIRKKKKNKKKKKKKKK 
KKKKKWGGGF*DFWGGEKGPPPLQKNDF 
LLFWGFFLKSFFFFCGGLFFWGGPFFFP 
PPPNFLGVGGKKKTFRLLCGPFSAFFGG 
EKN 


8088 


21989 


A 


8148 


351 


187 


RLGTFYLSNILRFYLS ITHGP I PYIGRL 
rSQNLPLLLLDLT*LEKLLPKKSQTYN 


8089 


21990 


A 


8149 


465 


60 


PPHFWGN PKKKNSQ FFW AKKKKK IFLGP 
* KMVAKKGWFWGRKKGFQRGGKKG* KG 
GQL F YPVLS QVLGGGQNHG P PGKAGFGE 
NFFFFGQGLGWAISRWLRTPLFLAGGGE 
DPKKNFKPPPKIKTLGAGVKKK 


8090 


21991 


A 


8150 


1 


175 


WDYRHVPPYLANVFFVIFRGDKVSLCCP 
V*GPTTELKRSSCLGFP*GWDYRREPPH 
LA 


S091 


21992 


A 


8151 


59 


35 


RWNSRPRRPGLKFLTSGDLPASASGSAV 
ITDVSHHAWPETFFR*RA 


8092 


21993 


A 


8152 


269 


159 


ARVEVSKKYIYIHTHTHTHTHTHIHTHT 
HTYTTKKLPS*LFQVCKMEKFGRLWQ 


8093 


21994 


A 


8153 


1 


256 


PGPPPRFFFFFFFFFFFFFFFFF*TLKI 
RFI WRKQKTHNPKDG ILHQD I SQ VISLQ 
ITNHFHRRYFCTNFTCIQERDINQSLFL 
F 


8094 


21995 


A 


8154 


252 


41 


LIIPVLICIVFIVLQFKIFSVISLNKGL 
FRNHFF IF* * R WGLTML PR L VSNS KAQ V 
I LPPWPPKVLGLQV 


8095 


21996 


A 


8155 


21 


443 


HELSLI * FI STLAETNRTPFDLVDGESE 
L VSGFS IEYAAGPFALFFIGEYTNI I I I 
NTLTTTI FLGTTYDALSPELYTTYFVTK 
TLLLTCLVL* IRTAYPRFRYDQLIHLL* 
KNFLPLTLALLI*YVSIPITISSIPPQT 


8096 


21997 


A j 8156 


2 | 205 


LLHLM KHFNLE I QKAQ * TR INLKRS ISR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HVI INLSKAKHKG* ILKAVREKMLIMYK 
KPLVKLQVISN 


8097 


21998 


A 


8157 


2 


205 


LLHLMKHFNLEIQKAQ*TRINLKRSISR 
HVI INLSfCAKHKG* ILKAVREKMLIMYK 
KPLVKLQVISN 


8098 


21999 


A 


8158 


2 


222 


VYTWGTGKARVFPGEF*LPVWIPTRHLK 
FYNEP IRDANESASAETENPQS SIIHPQ 
DEQNGD I RRTDKVTI HQ 


8099 


22000 


A 


8159 


1 


297 


GFHHVGQSGLELLTSGDPLASTS *SAGI 
TGMSHHAWPLNCSYGDSFSWLXCPELTV 
ACWWEEGWRKLLMENEQGREAHQKLESV 
INFYIKDSMTKRKYK 


8100 


22001 


A 


8160 


138 


437 


NEYDHFSIIKERRDFIVCVFFFFFSKKS 
LT VGTQGGPS YLE PPFLGNKE FS CLT FL 
GS WDNGGATTRLG * FFVFFRKKGVSP CG 
PGWS*TPDLRKTPPSA 


8101 


22002 


A 


8161 


330 


148 


QQSKTPFFFFFFKKKKKYIYICMWHAY 
NPSYSGG*GGRIT*AQKFKTSLNNTVRP 
HLYKK 


8102 


22003 


A 


SI62 


391 


255 


RPAPADF*FFFFYRLGLAVLPRLVWNSW 
PQATLLPQPLRVLGL.QI 


8103 


22004 


A 


8163 


131 


3 


HKVSLCYPG* SALMQS * LTVASNSWAQA 
I LQP * P PE * LGLQA 


8104 


22005 


A 


8164 


97 


440 


NKGSLCPPGGRPGALSHPTARSRGS PGP 
TPQKTGNNGPG PPT P VNLE FFEKRG FTP 
VGQGGLKLRT * RS P P PS APQKAG ITGG S 
RRPGPIQKKFLKTLLGETKWMAKLIKTY 
LK 


8105 


22006 


A 


8165 


541 


295 


FSQQKKKI FISTNLSHQES KIYICKEME 
SHS VIRLLEGSGAI MNHCS LKL PS SS DP 
PASVS * SAGI IGVSQHLANYTFKKI F 


8106 


22007 


A 


8166 


440 


259 


PPQKFFFFFFFVKTGFRHW*AGFRLLD 
S SD P PPLAFQSVE ITGMSP SARPMFGVF 
HSLR 


8107 


22008 


A 


8167 


23 


405 


FRI CHLLRVHLLVY F I P SFTDNHLLSTS 
AMFLYSWDW I AQNVS GPFLG 1 1 FRTFF 
FFFLKGEFFFAPQVGGQGGNLG*LNPPP 
PGLKGFSGLTPPGGGNSGGGPPNKPNFW 
I FKKGGGSTLWPR I I 


8108 


22009 


A 


8168 


23 


388 


PYFSCALPRHHPPTTNLGPKLGMGAPWG 
QPSVAHHTLLFFFFFLKRDFCFVPRVEK 
LWLEESSLNPPLLGLKEFSCLTLWRTGN 
NGGPPPPPVIFCF*KKGGVYPWGPGGGE 
TPALKETPGL 


8109 


22010 


A 


8169 


612 


498 


NIHFSLS FSHTHRHMHTHTHTHAHPTY I 
RTP*DNVPQ 


8110 


22011 


A 


8170 


2 


288 


VNLTFFPQHFLGLSGMPRRYSDYPDAYT 
T*NILSSVGSFISLTAVILIIFMI*EAF 
ASKRKVLIVEEPSIKLE*LYIKKKKKKK 
KKKKKKKKKKN 


8111 


22012 


A 


8171 


381 


101 


QKSLQQPSNIATILEEVQVIFLGSPYHL 
QRWEMRLTKWLRGLGAVAHACNPSTLGG 
MGGWIA*AQELETSGDPPPSSNRQLFTG 
EHNNDNKLI 


8112 


22013 


A 


8172 


136 


360 


TKKHFGLLK* *TKKHFGLLKKKKKKKKK 
KKKKKKKKKKKGGGAFKKKKFFPRGGGK 
I FFFFGAPKNKFGGGVLKTGGGKKPGVT 
KK 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

location 
correspon 
ding to 

acid 

residue of 
peptide 

sequence 


Amino 3cid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine V=Valine 
YV=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 


81 13 


220J4 


A 








EKVSVFFX1KMKCSGIII.LPCRLTLPGST " 
DSPASVCRMPAI AGPRRHA* LVLVETGF 
RCVGRAGLQPLTASDPPTSAS PGAAIAD 
GVSFTQCSMVPRIRVQ 


8114 


22015 










GLTVLPGLiVSNSWAQVILIjSWPPE*LRP 
QAYATGAGFLL 


8115 


22016 


A 


8175 


48 


269 


FYVILCLRDRVLLCCLGWS *TLGLKXSS 
YLSLLSSWDLQHVPPHSASCLTF*VKAC 
FFLLKYLKRLKI YAKIM 


8116 


22017 


A 


8176 


1 


338 


PTRP INTLLALLL I IITF* LPQLNGY I E 

TFLLFDLEIAIiLLPLP*ALQTTNI J PL,IA 
MSSLLLI IUjALGLAYE*LQKGID*AEF 


TTn 


22p]g 





"8177 


~426 


~730 


QSLDPPGNMVTLSLQN*KNPPP*GGAVF 
PPPGRLGGENPPGGSKEQKRRPAPQGG* 
PRQPPSPKKKNADSEFSIFNLQPALLCK 
QYCIVLHALPNKILF 


8118 


22019 


A 


8178 


396 


244 


KIHLLYICMCI YIYVISRHYILiR*HTHA 
HTYTHTHTHTHTHTPKNTKYTLW 


8119 


22020 


A 


8179 




119 


TRPGNWPGWAHFYDPSTLGA*GELIT* 
GQEFETSSSLY 


8120 


22021 


A 


8180 


2 


80 


FLPLTLALLI*HVSIPITISSIPPQT 


8121 


22022 


A 


8181 


438 


333 


S PLGRLRHYNCLNWGGKGCN * PKS CPC P 
PAWGTK 


8122 


22023 


A 


8182 


393 


19 


I FFFFFP PPS PAKFFFI FKKKRFP PLWG 
GVF*SPPPGFQKTPPPQKVGFSRVFPPP 
PPKKFFFGGGKYFFFKEPPPPPPPIL*K 
GGAKISPPPPPGKKKNPFFLKGGKKKKT 
PFIFFFPPICKIKN 














LTVASTS WA* RI LHCSLLGS WE YRHTPP 
CL*NRNPQLjTVKWRVKDHNNKKSHR 


8124 


22025 


A 


8184 


154 


361 


WHNHDPLQPQTHGLKRSSHLGLPSIWHY 
RHVPPRLANFLNLSFRQGLAMLPTPSVC 
HLLVFLS *HIYWTFGYLS*SVCSHLjLiNF 
L** WHNHDPLQPQTHGLKRSSHLGLPS I 
WHYRHVPPRLANFLNLSFRQGLAMLPTP 
SVCHLLVFLSETLGFK 


8125 


22026 


A 


8185 


228 


337 


RPGMVAHACIPGILGGRGGWIT*GEKFE 
TSLANMVK 


8126 


22027 


A 


8186 


314 


127 


DIQPKNEGEGEEEQELWLGMTAHAYTPS 
TLGD*GGRIA*VQEFETCLGSLARPPSL 
QKNFKN 


8127 


22028 


A 


8187 


10 


420 


TQEAEVAVN WDHATALQ PGRQSETPSPE 
KKKKKRGPPRWGENFFPTPGGF*KDGGN 
KKTGFPPFKRDGPPPPKNWTPWVFQKRG 

PPPPIFKKNNPPEKKPHRGFNFFAF 


8128 


22029 




8188 


398 


1 


lkkkrpqtpfspkiqf*kffppfpfkks 
PFPKNLLKKKRAPGGPIFFFFFYPGIjFK 
NPSPL*MGSPPPPLFNPKN*KKPPLLKK 
GPGF*WPFFQKKKPPFFFFFFWRQGFTV 
LPRHTPSCLVNFCIFCRDGI 


8129 


22030 


A 


8189 


30 


153 


AWYVRREDHVSLGG*GCGKPWLCHGTPA 
WVTEENPKGKIK 


8130 


22031 


A 


8190 


206 


45 


HPSQNSLSKN*ERVSVCCPGWSAVAGSQ 
LTAAS TS QAQ VI LL P Q P P E * LGLQ A 


8131 


22032 


A 


8191 


3 


228 


TCVCLCVCVYICDVCVGCVCVYIDEYIQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IYRYIHTYIDIYTHI * IYIHTYIDTHMW 
VCVYI Y I YI YR CMG VRET 


8132 


22033 


A 


8192 


41 


416 


I CRVKE I ASRPLLE THSLS LVELiS * FCL 
TAQ * RND FLALKHLTAVL YNL I LHSSG F 
WSSLKSHFIFIAL* YYL* *FI *SN*RRD 
LAVLPRLVSNS WPQVI LPPQPPKVLE I R 
YEPPSLAblLFFN 


8133 


22034 


A 


8193 


2 


183 


KRLS I WQKKVEGEGFQGMDVGPVS VAHA 
CVSS SLEGQGGKIA* GQEFET I LSNIVR 
PHLY 


8134 


22035 


A 


8194 


2 


465 


AGVQWHDLGSLQSPPPGFK*FSSLSLPS 
SWDYRSAPPCPADFCIFRRDWAGWGVGC 
GFHRVARAGLELLTSGDPPASASPGAGI 
AGMSHRTQRGQDNFASWRRGGPQMSPSP 
PSCLFLQNTFSLKPSWVAPYPHASALGG 
GEYGGPPSPQPHLE 


8135 


22036 


A 


8195 


397 


149 


L I CLTDLiP I RFLS LliALVFTVFNTKVKV 
KDLIKLGWAHTCNLNTLGG*GGRIA*A 
HEFETSLGNIVRAVIPVCCFSVILSR 


8136 


22037 


A 


8196 


355 


95 


KKIMPTLCLKIPS* YRCEEIIQL.SLQSS 
CDHRHMPPHPTNPF*RQHLSMLRSL,VLN 
SRPQAVLPHGPPKVLGLQAPATAPGILS 
LF 


8137 


22038 


A 


8197 


382 


112 


KTDFGKKGSPSSPPPFFFFKKSFAPSPR 
GEKKAP1.KSQRGLCPPKPKDPPPPGSG* 
TQDKGPPPPGQPIFFFFLKKGVCLFSRG 
GSKTFI 


8138 


22039 


A 


8198 


32 


361 


AS RIG F I LQEFL PVR VWTTNPS AGADG Y 
NFLEKQKGTEKTS CSVAQAGAQWCHRNL 
1i*P*TSELKQSSCLSIjIiSSWDYRMYFVL 
LGKQVTLLIGKRALSVYPRDWSIjC 


8139 


22040 


A 


8199 


1 


314 


intllallliiitf*lpqlsgyiekstp 

YECGFDPISPARVPFSIKFFLAAITFLL 
FDLKNALLLPLP * ALQTTNLPL I GMPSL 
LL I I I LAL S LAYE + LQ KGLY 


8140 


22041 


A 


8200 


3 


109 


FTS KHHFGFEAAA* YWHFVD W* LFLYV 
SIY**GS 


8141 


22042 


A 


8201 


3 


216 


D AW AAVGHHHTRL I F * I FVE I G S LCVSQ 
AGLEILDSSDPPSSASQSAGIRGMSHGA 
GQEHLFHRPIGKIK 


8142 


22043 


A 


8202 


1 


267 


RIHTGE KPYKCEECGTGFKGPSTLTTHK 
FFVYCREVAVLLKNCYSHLYPH* IIVNG 
SEADKREMCLLCIFFHPLAEEQENVENK 
RKYNP 


S143 


22044 


A 


8203 


422 


92 


PLVFALPPKIGFAPTTPSSSPPPPPLFF 
FGPPVFPPPPFFNPPPFFFFSPPPKKKF 
PSPPPRGKIFPF*TPPPPFFFFFFFFFF 
FFFFFFFL FLYYLF ILC 


8144 


22045 


A 


8204 


445 


260 


VCSPPLFIWVAPGFSPPPLFKNPPPNFF 
FGAPKKKNFFPPPPA*NFFFLKGPPLFF 
FFFFF 


8145 


22046 


A 


8205 


103 


248 


GPGWAHACNPSSLGG*GQWIA*AHEFK 
TSLSNAVRP I SAKKKKGGPF 


8146 


22047 


A 


8206 


407 


197 


QL I FKFL *EQASHYVAQAGLELLGSGNP 
PASTSQSVKVTGMSHGAWLLAPTLRSTS 
LNYNSLSVLLLPRP 


8147 


22048 


A 


8207 


1 


142 


S CS VAQARV* FRDLGSLQPPPPRFXPFS 
CTPAWETVRLiHLiRKKKKKN 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, .M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8148 


22049 


A 


8208 


2 


370 


PEVFDYPN*FKNL*LHKTCTQ*L*AALF 
VI TKTWNQGCPLTGE W INKLE Y I HTMDY 
LAI K I NELANHKMM * RTL KC I LL S ER S * 
SRKATYYMISIIRHPRKGKTIEIVQNSS 
CQGLGSRQKGL 


8149 


22050 


A 


8209 


363 


108 


KKPRVEACLSPGVGRERAKRER*KRERE 
RERERERERENEL*KLSLADLCIKNG*R 
EFFKEKRYVKRYAKILRVSGRKQKYGNV 


8150 


22051 


A 


8210 


386 


254 


LNPGGGVCSEPRLCPCTPAWVIR*DSVS 
EKG KKKRKG P ENS LTATNF * ALS CNLD Y 
LRCSMLKMQRQRKRKERTEVRFWQLHHN 
LTLGFV*LIVYQRE*RRTGWFVAKYMLY 
TFS * PGSFPRLRKCVEHVFGYEPACS PL 
FPL I HNQLHKAKC 


8151 


22052 


A 


8211 


3 


131 


AHLFIAWFTEYFKPTVITYCSKKKIPFK 
MLLL I DS APS YPRALMEM YKGMNWFMH 
DNLTFI LWPMDQ* VI FTS * S YYLRN* FC 
TIAAIDSDSSDGSGQSNLKIF*KGFTTL 
DAIKNICDS*EL 


8152 


22053 


A 


8212 


86 


286 


WSIWECTVISVQLFHKPKTSFKNTSYWS 
GVMAYTNNLS TLGS QDPQ I T * GQE FETS 
LANMVKSCLY 


8153 


22054 


A 


8213 


393 


344 


SFPFSLPSKWGFKNLVPSPGYFLFFLNK 
GFRYFGQGGFKILNSNYPPPLASQKGGI 
SGISPWPRALVFF*WRNLGPN*RTIVSP 
NYGLK 


8154 


22055 


A 


8214 


3 


313 


QGLVLWHSHGLLQP*PPGLRRSSCFSLP 
CSWEHRCTPPHLANFLILCVCVCVCVET 
GSHFVAQAGLELLGSSDPPLSISQKSWD 
YKV* AI I PRAKQNYLML FY 


8155 


22056 


A 


8215 


165 


348 


RPCPLFPEDLLTFSREGPGTMAHACNLS 
TLGGQGGGIA* GQE FETTLGNI VRPHLY 
FYLKK 


8156 


22057 


A 


8216 


379 


218 


VFLVETGFLHVGQAGLKLPTSGDPPASA 
SVSAGlrGVSQKCRF*YNVFIFQKSSLL 
NALPICSLTMLLGNLFSMFMTPKFFTCS 


8157 


22058 


A 


8217 


32 


276 


RYLPIIAALFTIAKI*KQPKSSLLDKWI 
KKLWC I YTME YYS ALKVI LT WTTWMNL 
VGFIPGDINHIQKAKYCMISLICGI 


8158 


22059 


A 


8218 


171 


2 


KGVFGLGRLFFFLDRVWLCLPEWSTWQ 
SRLSATSAS*IQAILPPQPPEELGPLCP 


8159 


22060 


A 


8219 


530 


73 


TKPTVGSNSRSYINQTQYFPSTLTFGGL 
MRLT I YH FFFLLVY YNPRVMVTPAYM * L 
LILFYSSSIRDLLIAKQPCLPNEGFSPH 
AC* TNATLLF P YS I * YLGLPS FHLIFFV 
CFSFYFL*R*GLTMPPMLVLSSWPQVIL 
PPQPPKVLGLQA 


8160 


22061 


A 


8220 


350 


135 


FFFFSQTGVAQNRVQ*RDHNSLQP*PPG 
LRQSSCLTLPKCWDYRCKPLCLASWIYF 
KDKVYTS TPVHTQEY 


8161 


22062 


A 


8221 


185 


331 


PGMVAHAYNPNTLGG*GGKLT*AQEFET 
SLGNINSVPKKRRKRRKEMM 


8162 


22063 


A 


8222 


73 


232 


YCSFKFI TLTCLS WLGTVAHTCNPSTVE 
GGGGRIT*GWEFTTSLGNIVRSCLY 


8163 


22064 


A 


8223 


418 


211 


AMPS LLTYLLTF I FCRKKKI FFYWGE FT 
MLS *LVLNS WTQVILPPWPAKVLVLPRL 
ALFYSLLVLFPSF 


8164 


22065 


A 


8224 


227 


33 


ESHSTTQAGVQWHDHGSLQPQPPGMSLY 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalnnine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














CH*CLRFWSWRPAEKPGWCSSLRFMELE 
SWSGASDAP 


8165 


22066 


A 


8225 


244 


287 


TDSPTSAFRSAGITSISHHTLPKSTF* I 
FCRAGGLAMLPRLVLNSWPQVILPSQPP 
KALGSQA 


8166 


22067 


A 


8226 


53 


235 


NNKWQG WAHTCNPTTLGG* GEWIT* GQ 
EFETSPANMAKIPPIA/MFAFCFEETDNK 
SHTK 


8167 


22068 


A 


8227 


275 


25 


GVLMFNFKFCI *QNSPFMLPGWLNSWI 
PAILPPQRPKVX.GIiQAWTILYKWYYGKE 
MWHKE I CEGTHDVLSKLLTTRRFKRHL 


8168 


22069 


A 


8228 


386 


211 


QLKLPPIjKGNKLRPP* *KFFFFFFFQRQ 
GLTILPRLVSSSWHQMIFPPQPPEVLGL 
QA 


8169 


22070 


A 


8229 


1 


117 


VFILFYF*DGIAMI,PRIATNSWAQAIL1 J 
P SAP KVQGLQA 


8170 


22071 


A 


8230 


356 


26 


WHVHGSQQP*PPGL)KQSFCFSLPSSWDE 
RCAPPHPINYFQICRDRVYKRWANMVIS 
PNPLVLISQSAGITSKIPCPGSEISTIF 
IFSKQLLLVRGVYPDMLVLQLGKTCI 


8171 


22072 


A 


8231 


3 


188 


FPSLRPA* IFFYIYFFFLKNKVFLCHPG 
WTQLLDPNWTQAAGLKQSSHLSLLSSLN 
FRGSA 


8172 


22073 


A 


8232 


2 


253 


KLDYINCGTNLAQKCLFCLFGSLFFQR* 
GLTLLLRLFMNSWPQAILLP* PLTVLGL 
QATMLGSQVPVFDLGITLCRCRFGKLE 


8173 


22074 


A 


8233 


373 


65 


PEKAGS CLELAFGLQMQHPPWASTLPAA 
LTD FGLATVPNHVS Q FL KSHTHTHTHTH 
THTHTCTRT1iIjV1jFIj*EAIjTTTPLLRSI 
CKHRESACSLSLTWPLCC 


8174 


22075 


A 


8234 


390 


46 


RVFKPG*NPPLVKAPAPPLFFGGV*GRS 
PKKKNPQGPGGGGPPPGFHPFFIiTPFFP 
IFLRREPPPFFFRGGLGPPPPPFKTPFF 
FFFWVKGGKFLPQKKKKKFFQS ISTRS V 
IHK 


8175 


22076 


A 


8235 


387 


216 


QKLGVGL,VPPGRVIFFFFFFFCC*GGGIi 
TMLPRL.VSNSWPLAILLPQPPKVLGLQV 


8176 


22077 


A 


8236 


1 


201 


RFFFTIPPQCFPQFILSF*RRGLAVLPR 
LVLNPWPQAILPPQPPELLGLQAHPTTL. 
GPVFYS YC VRA 


8177 


22078 


A 


8237 


399 


229 


SVGGGGIiTMLPSWSQTPGLK*SSHLSIiI 
KCWDYRPEPPHLAHFSLLNLQIiYCHTKli 
S 


8178 


22079 


A 


8238 


192 


335 


SHTLPHSSNFSYFW*RQGFAMLPRLVLN 
SWARAVCLPQSPKVLGLHA 


8179 


22080 


A 


8239 


442 


254 


GRDRVLLFCPGWSQTLGLKRSSCLGLPK 
HWDYRHEPPHPAENIKFCVWPTLYFYWP 
ALVQINL?*QLQGTECVACMC 


8180 


22081 


A 


8240 


381 


180 


VLKLFFCSHRGPTMLPRLVSNSGLK*SS 
HLGLPNCWD*RCEPWHPASLCLLTSTFW 
PPTLICNIHK 


8181 


22082 


A 


8241 


388 


260 


NAIPFYITKCCNPSTLGG*GG* IT*GQE 
FENCLANMVKPCLY 


8182 


22083 


A 


8242 


1 


145 


GFTILVRL.VLNS* PLDPPALASQGAGIT 
G VSHCAQL I YVFQRS LQLLW 


8183 


22084 


A 


8243 


1 


173 


GLTVSQAGVQWCDHSSL*SQTLGLKQSS 
SLSLPS SWDYKCKPQHQADRFFLMEFFQ 
M 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

location 
correspon 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phen>lalanine, 

K=L.ysine, U=Leucine, M=Methionine, 
N=Asparagine, P = Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


81S4 


22085 


A 


8244 


202 


327 


TLFITFIFF*R*GLAVLPRLVLNSWPQV 
ILLLWPPRVLGLHV . . ... 


8185 


22086 


A 


8245 


42 


289 


KEKRS I ILFI FFCRDGVLLWLPRLDLNS 
WPKTIFPPWPPK*LALQAHTTIPRGGGS 
IILICIRDVKRERKQYAKSKNPEGET 


8186 












LSLLSS *DYRHVPPCPANFCI S VETRFC 
HVGQ AG LELLAS NN P PAP ASQ S TG I TGV 
S PCTRPRVYVFLLW I LANGPS * RAETLV 
ISTLATNQEDADSSLAPGLGSGHCF 


8187 


22088 


A 


8247 






MCSPVFSPKQKNPLFLNFFKKEFLFFFP 
GGRQGGHFGFLAPPLPGLNKFLVLPFPG 
NGDTGKGPPARGNFFFFFFFG I FSRDFV 
SPC*PGWSLIPDPR 


8188 


22089 


A 


8248 


392 


237 


HYGQASLAPLTSGDLPASAPQSAEVTGV " 
SHRAWPTVLI * S FS * KFNWV 


8189 


22090 


A 


8249 


153 


386 


FFPPGWPKPSLLEPPPPWLKGFPPPTLL 
GGGD*KRPPPPPGNFCFFKKNGVS PL WE 
GWF*TSNLGNPLPSPPKRVGIT 


8190 


22091 


A 


8250 


387 


166 


IETILDAIKNIHDSWEEVKISTLTRVWK " 

N*IPTPMDNFEGFKTSLEEVTADVMGIA 

RELEVEPEHVIELLHDIS 


8191 


22092 


A 


S251 


405 


266 


LFAIVKR*NHPKCLLIYARINKMWHISI 
GWNIISLISKGMKFSYMFLTWMNLEVIM 
LSEISHSQKVT*YMVPLA*GHLEKLNCL 
PIKQQTPIPSSSSP 


8192 


22093 


A 


8252 


379 




PGANYACNPSTLGGRSGWIA*AQELETS 
LSNMTKPQPCSTPQCFLVNQLLLQNLYP 
YFP 


8193 


22094 


A 


8253 


78 


311 


LTLYFHTRKKKGLK* IFYFI *SILSQKD 
LKVGTGPGAGAHTYNPRTWGGWKGLQAG 
XRIT*AQEFETSLDNTVKTCL 


8194 


22095 


A 


8254 


303 


81 


TPCAENFKLRKRFHHI ISLQRKMNYLIS 
FLYF* *SHSLTILPRLVSNSWPQAILLP 
LPPKVPGLQVSATLPGL 


8195 


22096 


A 




2 




MGYNLVCHLLRPLEKRSSIRVGVT*FSG 
CPPSPLPLARKGYSLTPCASQVRQCLAL 
LWLTLGGLHPLS CPHCPTSPSVMNPVPQ 
LEMQKSPVFCVSHAGSCRLELFLFGHLG 
TQSSLDS FFKKVI 


8196 


22097 


A 


8256 


70 


356 


LIIAYKLTYDLSWKMFYVYLRKIYTLiH 
WVECS I YTCWV * FVYSM VS WNR IDS PGK 
KKTN I YDQL I FNKG AKSTKLRKNS LFNK 
WGQDKISTYKK 


8197 


22098 


A 


8257 


403 


85 


PFCPWFPPVFIPPPLRKSGTPPGPGPEF 
PGPGFSNGGNPRPLKKSPKFS PGWGRGP 
LFP PS RGV * AGNFLGPGRWRG P WAKFVP 
LPSCLGHKARFFFPKKKRRKIK 


8198 


22099 


A 


8258 






YCEGDLGSYRPGVSKCMEFSPRRDIRIM 
P WPGAVCYAYNLSTLGGRDGWIA+ A* EF 
GTSLAY I VKPHLYTHTHTHTHTHTM Y 


8199 


22100 


A 


8259 


2 


189 


IQCVCVTKSHFCHPGWSAVAQSWLTAAS 
TS*A*VDFSHLSLLNSWD*RRVTPRLVN 
LILFPS • 


8200 


22101 


A 


8260 


453 


128 


FFFFFFLIETVSHDVP*AGLELLSSSHP 
LAS AFQSAE ITGVSHHAQPSS VS WL I HS 

S . 


8201 


22102 


A 


8261 


420 


79 


KNPTPFFFFFFLIETVSHDVP*AGLELL " 
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SEQ ID 
NO: of 

sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phcnylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibfe 
nucleotide insertion 














SSSHPLASAFQSAE ITGVSHHAQPSSVS 
WLIHSS 


8202 


22103 


A 


8262 


1 


262 


NSRTERLGSWQSHFCHKEMEAHFWQSG 
VQWCDLGSLRPLPPGFK* FSCPILLSGW 
DYSQCRPCRTCSLIFMHSRSPSQPSEKK 
WTC 


8203 


22104 


A 


8263 


321 


100 


S ATKNYL.LSRQS F I GK *NHGLGAVAHA Y 
NSS TS RG LGGQ IA * A* E FETS * GNMMR P 
HCYRKYKNLARHSTYS PR 


8204 


22105 


A 


8264 


145 


275 


KPRDYSFFF*WKRDLAMLPRLASNSWTH 
AILPPWP PKVLRLQG 


8205 


22106 


A 


8265 


73 


351 


SLCYRS FARCDRPKKGQH FS KS YLQDAQ 
WARTLLSTIAQNNSAEHHLKTTDW*GAV 
AHACNPS TPQGRGGRI A * VQE I ETSQGN 
TGRPHLYKK 


8206 


22107 


A 


8266 


382 


232 


LVETG FCH VGQAGL KLLTSGD P P ASAS Q 
SAGITGVSHCAWI J *T*YS*DFQ 


8207 


22108 


A 


8267 


397 


130 


I GQAGLKLLTSGDLPTSASQTAGI TGMS 
HRAWPTFSLIAR*NASFQVGHFSPKYQV 
GTIiAKSLKCHQVRSDTLEFILCCRIIYE 
TLGKF 


820S 


22109 


A 


8268 


309 


97 


DTWYRTRRKDHCTITETRIQSVRPLSPR 
* AI TS RVTCTYTS R * PEANEEPQKITFR 
HCDLFLPTLTDQLTL 


8209 


22110 


A 


8269 




224 


DSSGIPGSPTRPPTRPKKQKLPCDPTVS 
LLGIC I YDQRQ * SVFQRD I CHSIIMFITA 
LiFT I VN I WKQT * H PLMDE 


8210 


22111 


A 


8270 


374 


2 


TSWESARKPPPIHSHTLKGFFFFLFMFF 
FF*NKICKDGSLTMIiPRWLNSWAEGIL 
LPRSPKWGIQELATQRRAS ILLCNPTY 
ACGIFQHAAA7NVKLLMKQCPFCPAAPTK 
TQNFFPRV 


8211 


22112 


A 


8271 


1 


379 


PTS AS QVAGTTGTHHHI VLF F FFG I FE K 
KGVFPFSPGGF*TPKPKPPPGFGPKRGG 
KKGGGPPPPARGMGSGVFYPPPQKPFFV 
LIGGQILFKGRTIjDFKKRAPPFPPFSLF 
LPKHTPS FWIKWDP 


8212 


22113 


A 


8272 


80 


401 


awgmrfclllyscqylisdiahfskaff 
fwkrepppapqvkgqgpnls * wnpmlqg 

♦RGFSGLTPKGGGNYGPPPPPPLIFFFL. 
RKNGFFPGWPGWSKTPDLRKPPC 


8213 


22114 


A 


8273 


405 


95 


PPKRFWGSSKGFPQREKKLGSPTFPKKK 
KDFIYSYLSKSLKKE* INRPSAAAHTCN 
PSTLGGQGGRIT*GQRFKISLSNIGRPH 
LYEKI KNVKKLKI KKPNKL 


8214 


22115 


A 


8274 


167 


52 


DREQRPGAMAHTCNPS TLGG * GGW I TR F 
LiANMVKLRLY 


8215 


22116 


A 


8275 


276 


23 


KYVSSRPGTVAHS CKPS PLDGQGEWI A* 
VRKFKTILGNMMKPHLLFYKKLTFKNVK 
C I FKKYCATESRYVTKEKQKDRKKGTKE 


8216 


22117 


A 


8276 


1 


114 


FTMLARM VW I S * PRDP PASASQS AGTTG 
VSHHARPGL.S 


8217 


22118 


A 


8277 


2 


87 


TMLLRIjFIjNS*AQAVILPWPPAVLGLQG 


8218 


22119 


A 


8278 


357 


347 


SEGLALYQLEAFTVCDLRERGFCRGRSL 
WPLITWAWRGGVFLLVQFQEVCCRIALG 
SLSPDPVIiliPESGGPTSASQSAGI IGVR 
DRAQPNCVFEIGSYSVTQAGVQ*HNHGS 
L*P*P 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Pred icted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

last amino 
acid 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, , 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q— Glutamine, R = Arginine, S=Serine, 
T=Threoninc, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 

nucleotide insertion 


8219 


22120 


A 


8279 


371 


125 


FFPKKNEPPG*PPPKK*RKPGKFILAPS 
PPFFGGGEFYKLKRGVLKQKPGPPGVPK 
ILKTPPRLGKNKPISPPPGQKKQNFF 


8220 


22121 


A 


8280 


377 


63 


PPVFPPPWGGQGGGFP *ARGLRTPWPTR 
ENPVFF*NPKN*PGGGGPF*IPPPWPG* 
ALKFPLPPRGGVPLTKNIPPPPRFSHKR 
RVFFPKNLKKKKNPCLIESL 


8221 


22122 


A 


8281 


2 


284 


LPK*VKNFVHAKICT*MFVAALFVITSS 
WKQPRYYSSIEKWINKLWHISAMEYYSS 
IjKR* ELSSHEKT* KNFKCGSLSEKSQSE 
KDTWCLIPVI - • 


8222 


22123 


A 


8282 


318 


40 


APRLSLVVFMVVFIKRLDSSPGSRITWT 
NCFENHLCVCVCDRFLLCQAREQWRVLG 
SPQSPSPKFKRFSCLSLPSSWDSRCKTP 
*PANFCGLC 


8223 


22124 


A 


8283 


30 


343 


CYQKAS HAT EKS FMR * RVKPCS KLCCC Y 
KILPQPPQPSAVITLISQQNKKKKKKKK 
KKKKKKKKKKKKKDGGGALKKKKI FS PR 
GGRIFFFFFWAKKNNLGGGL 


8224 


22125 


A 


8284 


163 


392 


FWL YNFS VI YSIS DPCWYKEQLCKS VF 
SFLSS*SLLDEPNPNSPANSQAAQLYQE 
NKREYEKRVSAIVEQSWNDS 














KKKLILAKPGWNLDSLKRAPPFFFFFF 
RHGVPLCCPGWSKTPEPK* SSCLSLWSS 
WDYRREPLCLAYS PVLNITHLSSPNVKL 

IM 


8226 


22127 


A 


8286 


1 


322 


TNTLLALLLF 1 1 TF * LPQLNGYVEKS TP 
YE CG LDP I SPARVPFS I KFFLGAITFLL 
FDLEIALLLPLP*ALQTTNLPLIGMSAL 
LLI I ILALRLAYE * LLKGLD* AE 


8227 


22128 


A 


8287 


395 


181 


HGSLHPGTPGLQQFSRLS LPS S WGYRHV 
PPCIiANF* IFCRGGVSLCCLGSVEGLEC 
QARLPVLVRAELGLD 


8228 


22129 


A 


S288 


3 


246 


SSTLSFPSG*DYRHVPPCPANFFFFWIL 
SKGAKITGGTPGMGLINLANGRPP. . 


8229 


22130 


A 


8289 


138 


421 


LLSSVFFFFERDFCFVPQAGGQWGNLG* 

PLIFCFFLKKKRVSPRGRGRANFLDLGT 
PPPGPPKERE . . . 


8230 


22131 


A 


8290 


141 


375 


RTFFFFFKTEGYFVAQAKGQGGDHGSLQ 
PQSPGLKGSSLLRRWDYGSMPPCPSIFF 


8231 


22132 


A 


8291 


407 


209 


IGGPPGFSPPPVFKNRPPNLIFGPPKKK 
KNFPPPGGKNWFF*KGPPPFFFFFFFFF 
FFFFFYYTL 


8232 


22133 


A 


8292 


86 


433 


P PXPNKL I EXGGAF * KRHFFPPPRGRVR 
FFFFGPQKKTPAPGQNPGGGGKPWGSPL 
FSPGEKNXPPTHPXGGPVLGAPPPPPGI 
VLI + KKTGLPLFVGRAGTNPNLGGPPAP 
PPKG 


8233 


22134 


A 


8293 


147 


342 


L VKFE YTVXj P KNR YRLG WAHAYNS S S L 
GGQGGRI T* A* EFKTSLGNI LRSHLYYK 
KRKEKKLN 


8234 


22135 


A 


8294 


6 


152 


CCGHAPPHPANFYSL * RRSLAMLPRLVS 
NSWTQAILLP*PLKVLALQV 


8235 


22136 


A 


8295 


18 


205 


IPTRPPTRPIPGRLNHFTFTATRPGVYY. . 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


1 SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GQCSEI*GANHSFMPIVLELIPLKIFEI 
GPVFTL 


8236 


22137 


A 


8296 


1 


199 


YLYIYLSILLSIYLSIYLSIYLASYLAT 
*LAI *KEIYYEEYVLVMTEAEIFHILPS 
PSWRPRKVMV 


8237 


22138 


A 


S297 


1 


318 


PTRPPTRLLSRPIKIQIALHKFSLCPEY 
LQ * ADYLF* SLW I RPDAWPHACNLS IL 
GG*GGRIA*AQEFEISLGNIVRPCLYCI 
IKNKIKKKKPLQI IAFFLCFKD 


8238 


22139 


A 


829S 


416 


304 


GFLHVGQAGLKLLISGHLPA*ASQNVGI 
TGMSHHARL 


8239 


22140 


A 


8299 


352 


31 


EMI LKAAREKK+ I PYNEPLI CLTVDFLV 
ETLQARREWYDIFKVLKKKNKTKNPFYT 
KIEYLVKLSFKYEKVTKTLLDP*Q*LRD 
FINTRHANRNTSNRRKRPI IKHK 


8240 


22141 


A 


8300 


382 


266 


RWDFVMLPKLVSNS*AQAILHVGLPKCW 
DYRREPPHLA 


8241 


22142 


A 


8301 


2 


273 


PLTRGPSSASDALSSGGPYHPSECCFTY 
TTYKIPRQRIMDYYETNSQCSKPGIV*V 
VHTHHTGGRGSQQGLLEGSREWWWNGDP 
QRTSQV 


8242 


22143 


A 


8302 


77 


430 


GMGLSMHRTHFQYLAHIHPCVHVHTHTH 
THTHTHTHTHTLCM*HTEIHKSVRYGTE 
HAQDPFSIFSPYTPMCACTHTHTHTHTH 
THTHTHSVYVTHKVCEETCLSFFYPESG 
TISHAQEILKRCLENKTHLGVWCFTSLL 
SVANIVFFSYKRQVAAARGGLFFSLSQA 
E 


8243 


22144 


A 


8303 


137 


396 


S S VKTMNYLKP WHMVADRC * LFQCL PS 
KKKKKKKKKKKKKKGGPLKKKKI LTRGG 
GGIFFFFGAPKKISPGRV*KKKKKKKKK 
KKKKGGPLKKKKNFNPGGGGNFFFFWGP 
KKN I AG ARLKKPGGG KNPRS P 


8244 


22145 


A 


8304 


329 


144 


NGPPFFFFFF*DGVSLCCPGWTAVARSR 
LTFAVFFTLNCSWGIHSELQHWFQDYL 
KLKTF 


8245 


22146 


A 


8305 


404 


157 


RTYNPSTLRGRSGQIT*GQDFDTSLGDR 
VGPHLRKRKRNYYSLVECFYRIKEKYPE 
SHKKAMKMLFQLKPDFLHI FQHIATN 


8246 


22147 


A 


8306 


394 


294 


SLCFLTPPRDFNLGAL I KNFYS PTPGWV 
SCVLSKGPPFFFFFFFKDGVLLSCPH*S 
QNPFVFALEWSGTPELK*SSHPGLPKAW 
DYTYE PPHQAQVELLRY*RS FPVLLLDG 
ICIFHSFFSPRGSLIP 


8247 


22148 


A 


8307 


280 


2 


WDSHVTFFTSVI TVMLFS I QYVKQLEN 
IKSWPGIVARACNPSALGGRDG*AQEFD 
S S PGNTAKPCLPQKNKTKQKTKNKVGGT 
LETRSWRL 


8248 


22149 


A 


8308 


136 


330 


MS YQRKS VNKYKSNNACEEVTTWARHSG 
SCLRLYNPSTLVGSGGKIP*TQEFETSL 
SNIARPYLY 


8249 


22150 


A 


8309 


3 


118 


IFWLVQMGFHHVGQASLEL*PQVIHPP 
QPPKVLGLPA 


8250 


22151 


A 


8310 


2 


366 


TRVDPRVRVRSTHRNLCLPGSSHASTSA 
S*VAGTTSACHHARLIFSFSFFFLGFSE 
KTGFNFFAQGGFEPLGEREPPPFGPPGF 
WGGGGGGTFKRPRRWKFQKDI I PPLVSN 
RGDQPEPFF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutnmine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


8251 


22152 


A 


8311 


253 


446 


SKTFQARGGVTEMGFRHVGQTGLEILTS 
GDPPASACQAAGITGVSHKARLRI *YFI 
RIFILMTC 


8252 


22153 


A 


8312 


430 


88 


GVGR I Q I S WSP WLPKPKNS PGS P PQ * VG 
APGGPPPPGPYFFFFFQKKGFPLFSGRV 
*GSGSKNFFVFPPPKNWGTSPPPPPEGR 
GFFFFFFFLSRDGASPCCPPRPPKVLGL 
QV 


8253 


22154 


A 


8313 


400 


323 


HRLMPPHPIDFFLFFVETGSCYVAQGAC 
E LLG S GNL PGS AS * NAR I TDVNHHAQ P * 
KSSSGPGTQQC 


8254 


22155 


A 


8314 


4 


421 


GGRIARTQEAKVAVSQQCTTALQPGRHS 
KTPSQKTKKKKSPGGPPGPPPGRFFFFF 
KT * KKG P FLKKP PQRGGD PRG K KKRG GG 
LWKKPGGXLGGPTKKKTRGGGANPPPLW 
GKGGGPKGAPRAFPFTKKKPPPSGRNQ 


8255 


22156 


A 


8315 


213 


32 


RCPGTVGHGCHPSPIjGGQRGRIP*GC;EF 
ETSIiANMVKPYFFLNSKKKTKNHPTKKN 
SQAW 


8256 


22157 


A 


8316 


422 


153 


AATIiYTHKTPHCIYFFLDRNIiLCCPGWS 
QTPGLK*AFCLSLPSSWDYRHKPPRPVT 
SPIFRIKSYDVIPPPYLE*NPMMLYLPS 
YCHHI 


8257 


22158 


A 


8317 


1 


284 


ERQDWESRLEAMECAFHLEKSVNQSLLE 
LHQLAMEKGDPQLCDFLESHFLNQQVKA 
I KKLG DY Ii S NLCKT * APEAG LAE Y L>FD K 
LTLGGSEEDT 


8258 


22159 


A 


8318 


187 


37 


TRDFFWPGWAHA*NCSTLGGQGRRIA* 
AQKFESKTLGNRVSHCLYKKFKN 


8259 


22160 


A 


8319 


2 


250 


KYEQSLQEVWDCVKRPNLRI IGVPEEEE 
KSKSLENI FGGI IEENFPGLARDLiDIQ I 
REAQRTPEKSIAKRSSSRHIVIRLSK*E 
VWDCVKRPNLRI IGVPEEEEKS KSLENI 
FGGIIEENFPGLARDLDIQIREAQRTPE 
KSIAKRSSSRHIVIRLSK 


8260 


22161 


A 


8320 


371 


196 


NNFSILFFFFETRSH*VAQAGLVIjIjGSS 
NPPCSASRVAGVTGVHQRYQQDDFKNFR 
HN 


8261 


22162 


A 


8321 


3 


235 


FISVS FKYS KH I PNNMHLKFLVL FLNFF 
FRDGFWLRSG*SVVAILRCGHSSLQPLT 
GLKQSSHLSFQGSWDYRRAPH 


8262 


22163 


A 


8322 


1 


395 


LSVPLLNTSLIiLiASGVSIT*AHHRLIES 
NRNQI IQALLITIIiLGLYFTLLQASEYF 
ESPFTISDGIYGSTFFVATGFHGLHGII 
GSTFLTICFIRQLIFHFTSKHHFGFEAA 
A* YWHFVDED*LFLYVS I Y 


8263 


22164 


A 


8323 


322 


83 


MHQSFLVAKVKKGKHGRWFYSCLMI KSA 
PVXiSVTHNTLLCWPGAWHACNPSTLGG 
QGR WMA * AQ E FET S LDNVGKPH P 


8264 


22165 


A 


8324 


12 


249 


GFFLPPPPGRGEKRGPPPPPLFFWFFFK 
KGEFPLGGGNFLTLEPPRLAPPKGGK*G 
GD PPP PGFFFFLKKKPKP LLKG 


8265 


22166 


A 


8325 


139 


335 


EN I LW AAT WLDCH P S TLGG * GGR I T* G 
QEFKTNLANIERPHLFFFFFLKRNFFFA 
PRVEGRGPV 


8266 


22167 


A 


8326 


1 


246 . 


FRRGLAVAPRVLIiCYPGWRAMVQS * LTA 
ASNSWAQAIHLSLPKCWNYKHTPPYPTC 
+NSACWDESSFPLLVSAMEVCGQPHS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
iocation 
correspon 
ding to 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=JLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8267 


22168 


A 


8327 


422 


107 


IYYVRQQRKKRGLFFPFFRFPLPPGGGQ 
WGHFWFLQPPPRGV*RFFFLNLPRNRPW 
DFKGLPPPPAYFAFFLKKGVFPPGQGDF 
*ILAPKGPPTSTSQKGWGFKD 


8268 


22169 


A 


8328 




161 


FRG I RWVCHVLFF YLS FFLRQGLAVLPR 
LI LNS WAQATLLPQLPE * LGLQTS S 


8269 


22170 


A - 


8329 


422 


103 


PPYSPPFGGPGGGVPKVRESGPPGPPQG 
NPFFFKKPKNNPGGGGGPLI PPSSGG * A 
KGMGLSPRPRLPLS*NWPPPPPPGGQNK 
LPFP KKKKKERKKEKFNFEHRG 


8270 


22171 


A 


8330 


46 


422 


STPFPLASQSAGITGVSHRARRSSSFQC 
TQIPLFPETLPNLGFGSRHGRGFSSQVL 
PAQSSSFDLDLISVLGY*MLSHISLGPN 
RALGFIPPHLSKWENGSTLRSIVLSQIE 
GLSSTLSFPHPVS 


8271 


22172 


A 


8331 


429 


112 


FS HLGS S PRQG FTMLARL WNS * P CD P P 
TS ASQS AG I TGVSHHAQPHAVFY IRL PD 
SQVLCHQMPYWPTAHYPHRW I FSLVLQK 
NLSVPPPLWPIQSQPKASILLN 


8272 


22173 


A 


8332 


291 


19 


KTPFPPKI*KFPPNFLKFTPGCGGPPPP 
FLKKFVSPGNPPPKKIFFFFFPPPPPFF 
FFFF FLRRGSL I S IAQDG I QWHYHGS LQ 
PQLPGIjR 


8273 


22174 


A 


8333 


307 


123 


KYFRGGGGGGR* FPPLGGVKQKKRGNPG 
GGGSKKPKLGPSPPTRGKKKNFFSPKKK 
KKKKK 


8274 


22175 


A 


8334 


3 


169 


HASADMCHQAQLLLiLLLVETGLCHDGRA 
GFGS*SQEIHPPRPPKVLGLQHFGLIY 


8275 


22176 


A 


8335 


1 


208 


LCVCVCVCVCVCVCVLFC*WRKGLALSP 
RGECSGTNMAHCSLNFSGSSNPSVSPSH 
VAETTGVRP I IjG 


8276 


22177 


A 


8336 


1 


352 


ILFYFIFYYFSFCFEMGSRSVSQAGVQC 
PDLS SLQPLPPRFKRFFCLTFPS SRDYR 
CVPPHloAKFCI IRRR*AAIGQAGIEL.LT 
LSDP PAS AS * S ARNTGVSHCSGPSF INF 
LNWQI 


8277 


22178 


A 


8337 


420 


272 


KAPCQFSGEKTVFPKHCKNS WVPFRKKQ 
QQKKNLCPCLVLYTS INSQ * TMGLN I KA 
RTVN IS* KY KED P * TGKK* I NKS CP W TG 
KKKKFLARTQKTIAIWDTLINWTSPKLK 
MS TKQRHHLKMNR * PKWLLFFVS 


8278 


22179 


A 


833 S 


427 


202 


LI IYFCHQSVQKKGVLPLLQEREGWAEA 
SLKISNI SLSHTHTHTHACAHTHTHTHT 
HTSS *NG IKTWITIKSFA 


8279 


22180 


A 


8339 


2 


164 


TPGHGISLCRPGWSAVAPSRLTATSASW 
VQTILLPWPPE*LGLQESRLLRDHLC 


8280 


22181 


A 


8340 


422 


1 


PPGPPGGKTKRAPPPSFFKGPNPLLKGK 
GPGAKPPPKKGVFFFPPKKVKGPQKNFG 
GTQI SQPNKRGA* KFTPFFL *KPPQKKK 
GPPPGEKIFKGPKF*KFPSPGFFFFFLK 
QGLTLS PRLERSGANTAHCSLKLLSSSD 


8281 


22182 


A 


8341 


1 


456 


RTRGAVSQGHAIALQPGQQSETPSQKKK 
KKNPAPGGKGKIGGTAKKKGRNFLDANI 
LGPNSRPGGAFFGG*TAPPGKKRGSPGP 
IFGGGKGGPKKKI LSF*KAGGGAKFFPS 
P VGTPFFRGF+ KGKI LLKKKKFS KPGGG 
NPENLS PTNFFF 


8282 


22183 


A 


8342 


407 


2 


RQFILGAPGVFPPPGF*KPPPKIIFWGP 
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SEQ ID 
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Predicted 
beginning 
nucleotide 
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Predict- 
ed end 


Amino acid sequence (A=Alnninc 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L=Leucine, M=JVlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 

W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKNFFPPPRGEKFFSFKGAPPLFFFFF 
FFFFFFSPFFFGALFFFFFTILFKTKFG 
VFS CKGI PPKFFKRSRP PNGAGRLPLLG 
VGPLPGVPQGGEAERKISQGSFQ 


8283 


22184 


A 


8343 


441 




PPPYILGGPRVFSPPPFFKTPPQIFFLG 
PQKKKNSPPPPGKKIFFF*RAPPPFFFF 
FFFFFFFPPFFFGALFFFFFTILFKTKF 
WISWKGIQPKLFKRSRPPNGGGRLPLL. 
GGGPPPGVPRGGEAERKFSRGSFQRTGG 
SAPEFRA 


8284 


22185 


A 


8344 


466 


323 


FGVAGTQNPGRNPRVPSFRGGPPPGVPQ 
GRNPPPPWFFPLNPPKHFLTP*VFPFIiE 
P P I E WVGPG P P * PS YGFSS P VARK 


8285 


22186 


A 


8345 


657 


347 


LFFWFFLRQSFTLVAQAGVQ* CDLGSLQ 
PPPPGFK*FSRLNLiPSSLDDRCPAPCPA 
NFVLLVETGFHHVGQAGLELLTSGDPPA 
SASQSAGI TGVS I SAQPAQ 


8286 


22187 


A 


8346 






DCS RGP P PHLAN FGLLE KTGSNH VDQAG 
LKPLSLRDPPASAS*SPGITGGSHRAQF 
♦FIFKRGMAPKGF 


8287 


22188 


A 


8347 


9 


255 


IPGPTHPSDFFLiYEIDLPFKN*KGFPPP 
PRGGGKNPPFPPPPFFFS* PPWKKKITF 
QTPL TG VFGAPKNFNFKGGFKK IG F 


8288 


22189 


A 


8348 


79 


312 


NAAGFEVGISKA*ALETLQIITREGHSN 
KPRYAVTFESHEMGTVLHLVEQEHTVGY 
ITASCSAIJCC* IESGGFLNDYI 


8289 


22190 


A 


8349 


1 


369 


FFFFLKRDRVSPLSSRVKYKWHDYSSLQ 
LQTPGLQ*SSCLRFLSSQDYRHIPPCPT 
NF* KCFVEMGSCCVTQGWSQTPGAQAVL 
PPWASQRHWGLQGVRPLCSLLLSNCPHS 
RLVWSWLFSGC 


8290 


22191 


A 


8350 


434 


324 


I,PILSPVWWHMPWPGTREAEA*EQ*SL 
EPGRRRLQ 


8291, 


22192 


A 


8351 


3 


253 


YM CS KFFFLE CS GAI SAHCHLELPGS S D 
PPPPT*NF*VIGTTGTCHHEWLLFELFG 
KMKPHYIAQVGFKLLERNFLKTKKVFP 


8292 


22193 


A 


8352 


3 


1431 


SPRGGIPSLKQVYSSLTTNSRPSQHLF* 
EEALQRAREEEEKRKEITSHFQSTLTDI 
QGQIEQQSERNMRLCQENTELAEKLKS I 
IDQYELREEHLDKI FKHRELQQKLVDAK 
LEQAQEMMKEAEERHKREKEYLLNQAAE 
WKLQAKVLKEQETVLQAQLTLYSGRFEE 
FQSTLTKSNEVFATFKQEMDKTTKPCMKK 
LEKDTATWKARFENTCNKALLDM I EEKAL 
RAKE YE CFVMKI GRXiENLCRAiiQEERNE 
LHKKIRDAEISEKDDQSQHNSDEEPESN 
VSVDQE I DAEE VNSVQTAVKNLATAFM I 
IHHPESTPHQSKETQPEIGSSQESADAA 
LKEPEQPPLIPSRDSESPLPPLTPQAEA 
EGGS DAE P PSKAS NS PAGLGAETQCEGL 
PVGAQADQASWKPEAEASGQAPQAPTEA 
SLQKMEADVPAPACAAEEHVAAMVPACE 
PSRQPPRAAAEELPVGASAGPQPRNVA 


8293 


22194 


A 


8353 


376 


65 


I HCGDHVQE VPKL KVCALHMS S WACS Q I ' 
PKAGDKILTFDQLTLDTLKGCGTILLSG 
PHKGQEVYWHFSKALGTQHSHTKPCVHS 
RDQKFKHIRG*WASQGYKN 


8294 


22195 


A 


8354 


459 


85 


LEEVESL.ERVLEKKMIESEELKSKQTRF 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Gtutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LEEIKNQDKLNKSLKEEAMLQKQSCEEL 
KS DLNTKNELL KQ KTI ELTRA CQ KQ YEL 
E*ELAFYKIDAKFEPLNYYPSEYA£IDK 
APDESPYIGKSRY 


8295 


22196 


A 


8355 


231 


158 


P FF I FYL YQSND I ARGLERGLE PEKV I G 
ATDSCGDLiMFLMKW* ESVAPMTFSGSRP 
LSKPRAISLLWYK 


8296 


22197 


A 


8356 


2 


164 


I HRGVENQMAGEQGWDKDVS VYWVGAVA 
HACNPNTLGGQGRWIT*GQEFETNLP 


8297 


22198 


A 


8357 


23 


291 


CTHTRAQAHAHTHHLFYYFLYI FNFFEE 
TGSH* I VQACLELPASSDPPASFSQSAG 
ITGMSHHAWPTSCILKVKMALCSINCSA 
ACFLY 


8298 


22199 


A 


8358 


1 


307 


FFFFEIESCSFAQDGVQWCDLGSLQPPP 
PGFKQFS CLSLLS S WDYRHVP PRLANFC 
IFSRDGFHHVGQAGLQLLTSNDPPT*AS 
QSAGITGVSPRAQPCPAF 


8299 


22200 


A 


8359 


75 


266 


FVKTRNL.FLTVLEAGKSN I KARAG* VSG 
EEHGLCLQDGALLPLEGRNAVSSHPRGQ 
NMMRAKC 


8300 


22201 


A 


8360 


1 


295 


NTWVFCFVLLFRDRVSLCCSVRSTVASS 
*LIVNMNLSFPSS*AYRHVSSHLDFFFF 
FKIQRILILPRLISNFWPQEILLWLPQG 
LGLQPRPQVLVGNS 


8301 


2.2202 


A 


8361 


376 


293 


MGFHHVSQAGLKLLTSGDLPASASQSAR 
I TGAGHHAQPD F * HSGDS WQAP PEPRDQ 
G 


8302 


22203 


A 


8362 


375 


142 


PQEEFLRFATDVGEYRAVTELERPVSES 
WNTQKDLL*QKRGRVSTYCRHTYGVGES . 
FSVQRRGEHVGGRGLGPCELE I 


8303 


22204 


A 


8363 


36 


363 


QLSFNPIilGKKVKLRTVFIGKQFLiFLLG 
GGVREAGAHSGAQATVRWHNLSSHAKRP 
S CLSLPS CDHRHLLPHPANFCI FWTD* I 
PPCRSGWVQTPELKQFTHLGIiPKR 


8304 


22205 


A 


8364 


455 


235 


GAQARAPPRGIIFFFFW*RKGFCMLPRP 
VS KF WG * WPLP W P P KRVGLD S WAPP P R 
DFIFFSFFVKKRVCILW 


8305 


22206 


A 


8365 


432 


175 


STDQVSVAMGPS* PGAGYNLLVCRWLRP 
LKKCS IRMGVS* FSRYCLSWLPLDKKGN 
SLTPCASRVRRCTVMLRLMLRGLHPLSD 
KP 


8306 


22207 


A 


8366 


383 


83 


RRLIVFICYFSKK*GLPLLPRIiVSNFWP 
QVILVPWPPKVLGL * ALKKLWLDMVTHA 
YNPSTLGGQGTRIT 


8307 


22208 


A 


8367 


27 


304 


FFGFILTLKKSSN* CTLRFCFLS IGKVR 
IHTLGNILSRRGYGERCSLPHCLWESIC 
L P VQSN* A1.HKKVQMHI PFDQA I P SRG I 
LPTDTGHK 


8308 


22209 


A 


8368 


165 


204 


QNI PIAVSFFFFFLKRVS FCPPG* RAGG 
QS LF I GALTFQV * G I LI PYPPGNWELRG 
PPPNLTNFF*FLNLRFLLGPINLHYVHD 
KTSPLLYLFFFFF 


8309 


22210 


A 


8369 


400 


201 


KERPLPFRLIGSPFQGTWPRFFSKGEVK 
NSNFSKY*PPPFINPAPRVCFEGIEVIT 
KFFQVRPFFP 


8310 


22211 


A 


8370 


21 


408 


L I P LST YVC VRVCE C VC VCVC VNI * G YA 
CNTLLIACNILQDVPFTSPTKDKIFSGD 
FLTLVGKK 
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NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
■^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8311 


22212 


A 


8371 


1 


94 


NTLGHEN*LNSGFGGCSEPRSCHCTPTW 
ATE 


8312 


22213 


A 


8372 


546 


972 


PIHSLYGR*PSFCLGLALFKTLQVMHFL 
KDS VRNLKSSNLKVGLN FRSPQI QAT I S 
. PPLQPKTKTYEEDPKSKPKKEEKNMEVN 
MKEWILRYAEQQNEEEKNENSKSLEEEE 
KFDPNERYLHLAAKLLDAKEQAATFKLE 
KK 


8313 


22214 


A 


8373 


502 


178 


QHFWRLRQKDSFALGG*KQPGQKGENPF 
FQKKAKNILG* WDQPWPTTQKGGGGES 
L* PGRTKS PPCTSGWGKKQNPVLKKKKN 
PQKTNCIPLCIYHIFLINRKTNTV 


8314 


22215 


A 


8374 


362 


112 


YRMSVNPQVRAKRVSQRTNTPPNFFF*D 
RVLLCR P G * ST WQS QLT VAS I S W AKA I 
LLPPQPRQAKYFIjRLVKCSSEKGERIE 


8315 


22216 


A 


8375 


411 


21 


vlgiyptlvspspglkmtrvklpsgskk 
vi s s snrp vfg waagg * fd kp i lkagr 
asskykakrncwprvrgvamnpvehpfg 
ggilqrigkpsairi yapagrkvgliav 
rrsgrlrgantvqeken 


8316 


22217 


A 


8376 


450 


108 


AEGGRGPRK* *LRSCRPVDGSAGQSPGP 
LRTE S L.ETPGCS PQGWQELE P PPRRPQ 
WWPGPPGHRDRRQHRRPGARSRQQGPES 
SH*QRSGESRPPAVPRPPCPLPWKKGVL 
E 


8317 


22218 


A 


8377 


25 


306 


EQQPPPVAVHQSGPPGAPPEASPSHPGS 
SEKPQPVPPPRAPLVPQPKP*LSPPHHP 
AAGIjIjAVHAIiAGGPGTVQRGLPGRAARA 
VSVRVPAARC 


8318 


22219 


A 


8378 


376 


149 


PAHCSLLiIPGFRGSSCIjSLPCNWYPGRA 
PPRVGNFFFFF*EMESCYVAMLPSLVLT 
SWALAILPVLPPKLPGLQA 


8319 


22220 


A 


8379 


98 


374 


SVPPPVGTYEGDAAPTGGGG2 J PAI J GR*G 
AAGGHVQQGHPGPHRGHAAPKPSAQGGP 
GHSRECLEPSGGAAGGEREPPRAGGGQS 
WMRAPSMT 


8320 


22221 


A 


8380 


2 


807 


RPQCCVRAPRAGAVSGVPKLWSVP*SGR 
SAPHPTRS * VRSWSRWLRTQS WQTSRTR 
RRSESKVRVSYQAWTASPPPAASASARP 
A*RTSSRSYSSSSTCCSWLWPSSWSTGP 
SQTFVRNSSTLSCLCLTRKWIAMMPQV1, 
PCTPVRPS CSAVSTI TRSFLL * QAIiASR 
VT*IAPPRGSTTRTPSPIRL*NLP*LSR 
GPGK*KSGSWSSSSSA*TRWRTSAPLI 
TSSSLLSRSSCKAQTG*ASCRPVRVPIP 
AGSSLGASAPGSRCHW 


8321 


22222 


A 


8381 


104 


327 


CIAFPEMLGIKYEVGKCPSGLKSLKILK 
NTHTHTHTHTHTHTHTHTHTHALYLIRE 
GRKK* CVCMRKTDSTCVS 


8322 


22223 


A 


8382 


400 


270 


KPKLE VRASGQVQWLS PI I PALEEAKAG 
G*LEPRSLRHFMVCK 


8323 


22224 


A 


8383 


178 


588 


ADIjLSGAPGPPGGSQDNANSSEHSRACA 
PAAPGP * WLQPWLCP * WTQSPLEARGS * 
NPGVSHSSRIPGIPGAAGTSRPSGALPP 
QPAGHGGS SGELHVARGCACGRPLWSHR 
PGCGQLLRVTAADSHRCASRGAAPM 


8324 


22225 


A 


8384 


423 


78 


QGQELAELTGVTVESPEEPQRLLiVS*GG 
TV P CLAC S LHRT L * TVGHGS AS S FLAQG 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylnlanine, 
G=Glycinc, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T = Xhreonine, V = Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DMEPLGDRGLQGTEQLH'GGDRASLQDI 
CFLLPVLSLL.CPRSLDL.HRHRVGVTGFT 
ED 








8385 


434 


176 


FADDSRFDFLLLV*CAMIiMLIREQLiLiEG 
HFTVNMRLMQDY P I TDVCQ I LQKAKELQ 
DSK* PGGKRPMFGREASRPCAVAPGTHR 


8326 


22227 


A 


8386 


2 


305 


RLWEDPPGRNTPPGAR* PGPCGEGGHI I 
REEQACGWPRGHSGPWRHPQAGPLQLGQ 
WRGKGAGQGARRSPQGSLESRPPTLSAA 
P PAIH I CS VGG VRLHCY 


8327 


22228 


A 


8387 


331 


91 


EMRSHSVTQAGVQWRDYGSU5TRPPGLS 
FNVNWDWVGL*GDPSSPPMLLPSS * ARP 
GGGWGPGGEGHVS KEAAGRLQGPP 














IHVRLiWSQIjFRRIjRWEDIjLSTGGGSCSE 
P * S CHCTVTWAFFLES CLRCAHWYVTL 
LTALLGSLP CANPRQTAVS LPQAQLV 


8329 


22230 


A 


8389 


388 


176 


YWDSQ* CMPSFAVPVESACICAIGHNFS 
KSVNSVIAVCVDGSSHKYVFTHDGRCHR 
EAFDVYRDI CDDDA 


8330 


22231 


A 


8390 


445 


330 


PPRFTPFSCLSLRSSWDYR*PPPRPDNF 
LSLYSVLHET 


8331 


22232 


A 


8391 


559 


142 


LRTLP VTLGKSLALS VPS F P YVGGNW I T 
NREPSAKGRGSGPNRGPGHAASCPARTH 
SLRHSPAGGVMLTQPRPILiILiPETQHPS 
ESAVARTDISKARRWLHSRSPWPH*DET 
SAGAERPGIRCYIGHPAT 


8332 


22233 


A 


8392 


104 


400 


YYE I FKKEMYS YYHILHFFFFFSFFFFE 
TKPNS VNGPEGKGPILG * LKPWLLG * KH 
FCLTLPSGGNYRHVLPPPVNGFFFFFFF 


8333 


22234 


A 


8393 


40S 


169 


DQVWIKDWNIGSLiRPR*KGPQ'ri ILTTP 
TAVKIEGIPAWIQHSQVKPAAPETWEVR 
PSLDNPCKVTLKKMTSPAPVTLRS 


8334 


22235 


A 


8394 


i 


269 


YCCPLFSSKALTQENSPYSSFRLVNPPG 
LSLHPEGEGGKWINERGRELGPSAGPLL 
LFLHFAEAGRRQPPDWADSEADIiQQVRH 


8335 


22236 


A 


8395 


2 


204 


KDCKVNKEVERVLREFHQAGKP IGCDAS 
TSLPAPAPWWSWNAALCCPGLSCVKCWQ 
GESRTCVGEHQ 


8336 


22237 


A 


8396 


3 


285 


KVTVKYDRKELRKRLNLEEWILEQLTRL 
YDCQEEEIPELEIDVDELLDMESDDARA 
ARVKELLVDCYKPTEAFISGLLiDKIRGM 
QKLSTPQKK 


8337 


22238 


A 


8397 


1 


192 


ETGFHHAGQDGLELLTSGDCPS LASQSA 
RITGVSHCTRAQLLTEQEYFKNQNHLPS 
APFYKRAF 


8338 


22239 


A 


8398 


8 


241 


LNPFLYAFLDPRFRQACTSMLCCGQSRS 
SGTSHS IRGEKS AS YS SGHS QG PGPNMG 
KGGEQMLiEKSI PYSQETLWD 


8339 


22240 


A 


8399 


2 


240 


SCLNPFLYAFLEPRFRQACTSMLCCGQS 
RSAGTLHSSRGEKSASYSSGHSQGPGPN 
MGKGGEQMHEKS I PYSQETLWD 


8340 


22241 


A 


8400 


10 


272 


CNWQNAYLTWRCAQDCEDYFAERLYRS 
MKGAGTDEETLIRI IVTRAEVDLLGIKP 
KFHQKYQKS LS DMVRS DTCGDFRKLLVA 
LiLH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucte- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, 1= Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Trypropha/i, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


834] 


22242 


A 


8401 


47 


248 


trgi fffflntrshsvaqakglghdhss 
IjKPQ p prlkqs fnls llstwnyggllph 
salwkaeaaks 


8342 


22243 


A 


8403 


497 


303 


TVEASVS YLL YVAM VMQLP WG KS PAS GA 
ESHRQGCGGSWPGGGMEEGRGAERGGGC 
LLPQAliLY 


8343 


22244 


A 


8404 


3 


235 


LTLIiAHCRMMAAAHMGGPPMMPMMGPNT 
PGMMPVGPAPGMRPPLGGHMPMIPGYPM 
MRSPARLMMVPSQPRMTRPDR 


8344 


22245 


A 


8405 


3 


246 


LVFSVDVGLLAHTRQRGLKRSSGMSSLL 
GKIGAKKQKMSTLEKSKLDWESFKEEEG 
IGEELAIHNRGKEGWIKGAQPLGQ 


8345 


22246 


A 


8406 


1739 


804 


WEPDVSGSAAWLASAAFGMATFSGPAGP 
I LS IiN PQ ED VE FQKE VAQVRKR I TQR KK 
QEQLTPGWYVRHLPNLLDETQIFSYFS 
QFGTVTRFRLS RS KRTGNS KG YAFVE FE 
SEDVAKIVAETMNNYLFGERLLECHFMP 
PEKVHKELFKDWNIPFKQPSYPSVKRYN 
RNRTLTQKLRMEERFKKKERLLRKKLAK 
KGIDYDFPSLILQKTESISKTNRQTSTK 
GQVLRKKKKKVSGTLDTPEKTVDSQGPT 
PVCTPTFLiERRKSQVAELNDDDKDDE I V 
FKQPISCVKEEIQETQTPTHSRKKRRRS 
SNQ . 


8346 


22247 


A 


8407 


2 


241 


CLRFPSSPAMGLLRSGTKLIFRRRPKQK 
EAGLSQSHDDIiSNATATPSVRMKAGSFS 
RRL I KR F YFKS KP KANGNP S P QL 


8347 


22248 


A 


8408 


285 


186 


KQGLTSLPRLVSNSWAQAFLLLWPPKVL 
RLQA 


8348 


22249 


A 


8409 


1 


246 


RHENREELQVIADLCIKYDTLCISDEEF 
MNGVEYIWKGPCFCLTTFTLYLKGDIFP 
RPPSGCLNLWIVLNPMYPVIHLITSC 


8349 


22250 


A 


8410 


1 


102 


AGS YTiiH 1 1 KGDDGTRGVTGR FTFTLHR 
IQTSEE 


8350 


22251 


A 


8411 


877 


647 


FFFETESHSVTQAGVKWGDLGSLQSLAP 
GP KKFS CLNLPS SRD YRS P P P PLANFCT 
FS KNGVS P CWPGRS STPDRR 


8351 


22252 


A 


8412 


1 


130 


RMUSIDKTLRTDIGGNFPKNGWAAIATHS 
FEFAQFDNFLEEATR 


8352 


22253 


A 


8413 


97 


401 


CNPENGLTASALGRRMCIATCKAPRTLP 
DSGDTASCRFPAVPRPPHSRRSSGSGHL 
PGRPRCPAIiPGLEWSNPPGPPTSGYLFP 
TFSTPAAHSHQKTLLGI 


8353 


22254 


A 


8414 


348 


516 


VSAYGFITEGHERFSDHYYDTSWKRLIF ' 
Y I NHDFKLEREVWKRLiHDEG 1 1 RLYQRL 


8354 


22255 


A 


8415 


714 


410 


LGVFRSALHGSLWLLLRSFPQKSPNPLA 
L L LFLQ CWTAYQ CLL I ADQHCR TRK Y FL 
CLASGI PCVSHVWVHDSCHANQLQNYRN 
YLLPAGYSLEEQRILDW 


8355 


22256 


A 


8416 


2 


102 


GHDPQDRLLiAQDSEHNHSDRNWQRNRPW 
PKDSY 


8356 


22257 


A 


8417 


1 


309 


FYFENALSKSNKP IHTI ILNPHVHLVGD 
DAACIAYIRLTQYMDGSGMPKTMQSEET 
RVWHRRDGKWQNVHFHRSGSPTVPIKPP 
C I PNGKENFSGGTS LWQNI 


8357 


22258 


A 


8418 


3 


242 


ARALTNAASHVDDMPKALSALNDLHAHK 
IjRIjDPVNIjKLLSHC1.LVTLVDF.I.PDEFT 
PVEHACLDNVLAS VS TVIiTFKYR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


I Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methiontne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Slop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8358 


22259 


A 


8419 


711 


296 


NVANSDGLIASLWKEYGKADARWVYFDP 
TIVSVEILTVALDGSLALFLIYAIVKEK 
YYRHFLQI TLCV CEL YGCWMTFLPEWLT 
RSFNIiNTSNWLYCWLYLFFFNGVWVLI P 
GLLLWQS WLELKKMHQKETS S VKKFQ 


8359 


22260 


A 


8420 


3 


97 


FWSSLCNGLIAAQLLF'YWNAKPPHKQK 
KAQ 


S3 60 


22261 


A 


8421 


1 


279 


G CG I ADLAMSAI FWFQSLLTD ILLLICT 
CAYI PSLAPNLLDRNKTGL.LG I FWKCAT 
IVERKSPYVAVCCILMAFSILLIQKLVK 
MPQCICHNI 


8361 


22262 


A 


8422 


1 


228 


RHEVFIELNHIKKCNTVRGVFVLEEFVP 
E I KE WSHKYKT PMAHE I C YS VL CLF S Y 
VAAVHSSEEDLRTPPRPVSS 


8362 


22263 


A 


8423 


3 


182 


SEDTGEEQWTAEFINRGEYEIDIAGYR 
FQAKAKLYPVASLFTQKRRKDDMELSDL 
HGK 


8363 


22264 


A 


8424 


1 


70 


VQFVFDAVTDVI I KNNLKDCGLF 


8364 


22265 


A 


8425 




151 


ETTASSCTPASLESRRCCAPCRMPRTGF 
FGSSPLWRPSGSRSLKPGFQQ 


8365 


22266 


A 


8426 


1 


187 


RGRVGPGGERX.VPGVPGAEAQQPAGDGV 
RAGPLQARPPAPVGVSQGRCQAAGAAAG 
PPRPDG 


8366 


22267 


A 


8427 


520 


423 


GKYQLQSQENFEAFMKAIGECWTERQSC 
GSHQ 


8367 


22268 


A 


8428 


1 


627 


GTSGTRGVTGYFTFTbYLETPKPS ISSS 
NLNPREAMETVILTCDPETPDTSYQWWM 
NGQSLPMTHRFQLSETNRTLFLFGVTKY 
TAGPYECE I RNSGSASRSDPVTLNLLHG 
PDLPRIHPSYTNYRSGDNLYLSCFANSN 
PPAQYSWTINGKFQQSGQNLFIPQITTK 
HSGLYVCSVRNSATGEESSTSLTVKVSA 
STRIGLLPLLNPT 


S368 


22269 


A 


8429 


3 


390 


ILGCNILRVEYSLVICVSVPGSKKVILD 
LPbVIGSRSGLSSRTSSMASRTSSEMSW 
VDLNIPDTPEAPPCYMDVIPEDHRLESP 
TTPLLDDMDGSQDS P I FMYAPEFKFMPP 
PT YTE VD P C I LNNNVQ 


8369 


22270 


A 


8430 


3 


208 


NGTHVIILLCLKTCGTWNVANDMIVAS 
NLiGTGVPNQTPVSSGDLLIRSNGLLIPG 
TCEITRLYTISE 


8370 


22271 


A 


8431 


3 


563 


liPTSRVDPRVRtiDRMKKDQEEEEDQGPP 
CPRLSRELPEWEPEDLQDSLDRWYSTP 
FSYPELPDSCQPYGSCFYSLEEEHVGFS 
LDVDEIEKYQEGEEDQKPPCPRLNEVLM 
EAEEPEVLQDSLDRCYSTTSTYFQLHAS 
FQQYRSAFYS FEEQDVSLALDVDNRFFT 
IiTVIRHHIAFQMGVI FPH 


8371 


22272 


A 


8432 


2 


119 


DGDN I LVTTF I YI KSVTE LNGD I I TNAS 
WRCSERVAPSA 


8372 


22273 


A 


8433 


192 


1285 


AGVLSIIEETDSEGLQTKWENQTYDER 
LEINDSEEVASIYTPTPRHQGLPRSAHL 
PNKAMADNSSDECEEENNKEKKKTSQLT 
PQRGFSENEDDDDDDDDSSETDSDSDDD 
DEEHGAPLEGAYDPAD YEHLPVS AE I KE 
LFQVNQLRCVPTDVLDHKLKPFI PDFI P 
AVGDIDAFLJCVPRPDGKPDNLGLLVLDE 
P S TKQSD P TVIjSLWLTENS KQHN I TQHM 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nucle- 

tocation 
correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H— Histidine, I=Isoleucine 
K=Lysine, L=Leucine, MKMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 

X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 














KVKSLEDAEKNPKAIDTWIESISELHRS 
KPPATVHYTRPMPDIDTLMQEWSPEFEE 
LLX3KVS LiPTAE I DCS LAEY I DMI CA I LD 
IPVYKSRIQSLHLLFSLYSEFKNSQHFK 
AIAEGKKAFTPSSNSTSQAGDMETLTFS 


8373 


22274 


A 


8434 






TIjQADHFNTRIjS CGDAAQTliWARTGY LG 
FVRRTELTAATGERHDALYWGALDETL 
ELRGLRYHPIDIETSVSRIHRSIAEW 


8374 


2.2275 


A 


8435 


1 


225 


QTFNLEGSQIYEDSIVLQSVFKSARQKI 
AIEEESEDESNEEEDEDDEEYHEWKRYD 
RLGENMCLNC I WNGGGY IV 


8375 


22276 


A 


8436 






CGCGIAGLAMSAI FNFQSLLTVILL.LIC 
TCAYIRSLAPSLLDRNKTGLLGIFWKCA 
RIGERKSPYVAVCCIVMAFSILFIQ 


8376 


22277 


A 


8437 


1 


317 


G PKPL VRTS REPGKDVTTSG YSS VS TAC 
PTSSVDGGLGALPQPTSVLSLDSDSHTQ 
PCHHOARKSCliQWRPPSPPESTVSQQQV 
KRINLC I HSEEEDMNLGLVRL 


S377 


22278 


A 


8438 


80 


213 


PHLSFNAG ITT I KVN IRNANS LGGGFHC 
WTCDARRRGTLQSYLD 


8378 


22279 


A 


8439 


3 


767 


HEDNIKQLKEMKFTYLINYIQDEINTIF 
NDY I PYVFKXLKENLCLiNLHKFNEFIQN 
ELQEASQELQQIHQYIMALREEYFDPS I 
VGWTVKYYELEEKIVSblKNLLVALKDF 

E YLS I LTDPDG KGNE KI AELS ATAQE I I 
KSQAIATKKIISDYHQQFRYKLQDFSDQ 
LiSDY YEKF I AES KRL I DLS IQN YHTFL I 
YITELLKKLQSTTVMNPYMKIiAPGELTI 
IL 


8379 


22280 


A 


8440 


103 


354 


NGCECDFLFLFLFYFFETESRSVAQAGV 
QWHCLGS PQPS S PR FKRFS CLS TLGGRG 
GWIMMSGVQDQPGQHGETPFLMQAGLKT 


8380 


22281 


A 


8441 


3 


160 


KLYPLKIVFGMNGRVWVKGKTVQQTLIL 
ANVLEACELMTIiDQRILMFILAES 


8381 


22282 


A 


8442 


459 


3 


CGGLH P VRAS WLLCL PKQAKAMAGAP P P 

GAGYNL LVRRFLSLS E KRS I RVRVTRFS 
RCHLSPIiSLiTRKGNSLTPCTSRVRQCLA 
LLiRLAHGALHPLSCAHCLALPSEMTPVP 
QMENAEITRLL 


8382 


22283 


A 


8443 


2 




RQGNMTAALQATLLNPPINTKSQAVNDR 
ARGIVLWVLISFKANDIEKAVQSLDKNG 
VDLLMNYMYKGFES PSDNSNAMLLQWHE 
KALAAEGVTSIVRVSTALIPA 


8383 


22284 


A 


8444 


2 


109 


VTPFPMSCDLQGDCACRNPQAQEHSRKD 
LRGYSHG 


8384 


22285 . 


A 


8445 


1 


69 


LVbNSWPQVIRRPWPPKWGLQV 


8385 


22286 


A 


8446 


1 


163 


PSEKHNIWVGVTQFSRCRLSPLSLTRKG 
NSLIPCASRVRQCLTLLRLVHGACTH 


8386 


22287 


A 


8447 


1 


147 


GQAGLELLMSGDRPTSASQSAGITGVSH 
RARPSS I SF I LELRGS VRKKF 


8387 


22288 


A 


8448 


914 


729 


GLTMDTQKDVQPPKQQPMIYICGECHTE 
NEIKSRDPIRCRECGYRIMYKKRTKRLV 
VFDAR 


S388 


22289 


A 


8449 


134 


392 


TCPPPPEPPSPLTCAVLVPQTRRWRGbG 
S L FRRACCVALPLQLLLLLFIiLLLFLLP 
IREEDRSCTLANNFARSFTIjMLRYNGPP ■ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny}»lanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PT 


8389 


22290 


A 


8450 


37 


248 


AKITPLLQPGRQEQDS I SNKKI LRCCLT 
MLPRLASDSSGFKRSSHFSLPTSWDYRH 
SPPPLASVPFLLMC 


8390 


22291 


A 


8451 


367 


165 


NFFVFLVETGFCHAGQARLQLUASCDLP 
VLALQSAGITGVSHCARPISTIiKDTIKT 
SHFVPLKISGK 


8391 


22292 


A 


8452 


2 


282 


GKRMAGGPEMMQLRLDGNRLY I TTSMFS 
AWDKHLYPDLIREGSAMLHVDVDTALGG 
LKLNP Y FLVD FWKE PLGPSLAHSIiRYPG 
GDCSSDIWI 


8392 


22293 


A 


8453 


4 


163 


HHDFHAQSLLANIERIAILVFAACEAAA 
VLSLLiVSISNTYGLiDYLHNIjKLiLQC 


8393 


22294 


A 


8454 


374 


197 


TPE PKQS SHLGLS KCWDYRHEP PHQARN 
STFDSNHSSLAAEDKLLGYFGLSQWQPG 
YQP 


8394 


22295 


A 


8455 


343 


161 


MGMGPSVPGTGYNLLVCRLLRPLEKCSI 
RVGVSQFSKYHLSRLPLPRKGNSPTPCA 
FWVR 


8395 


22296 


A 


8456 


1 


193 


MGFLHVGQAGLKLPTSGDPPTSASQSAG 
ITGMSHRVQPYYVLS ICQTLHSVCICLL 
IILLTTVL 


8396 


22297 


A 


8457 


342 


263 


WLRP WPALWEAEAGGSLELRS SRPA 


8397 


22298 


A 


8458 


101 


303 


KAAKYbFQRLHFCFLNLKKDNMLLFNHR 
MENFKELFGPGTVAHTCNPSTLGGRGGW 
ITRSADRDHPG 


8398 


22299 




8459 


J 


140 


YLKHLCMYVCKYLRQGLLMLPRLVLNSW 
AQRILLPWLPKVLGLQV 


8399 


22300 




8460 


327 


248 


FPRLVLNSWPKAILMPWLCKVliGLQA 


8400 


22301 


A 


8461 


327 


79 


SIPFILAFFIYLSFFFKETGSQSVTHAG 
VPG 1 1 1 AQCGLQLLGS GDP PDSASLWG 
TTGTYPTPILSVFPNMPSFFILLCMLF 


8401 


22302 


A 


8462 


365 


273 


MGFQHVGQVGLRWPQVICLPWPLKVLGL 
QA 


8402 


22303 


A 


8463 


375 


217 


QAGLKLLTSDDLPASASQSAG I TGVSHH " 
AQPRHHFFKTRFIYIFFSSPSSVLS 


8403 


22304 


A 


8464 


238 


354 


RGKC F VQ AFL Y KE CSGGQARW LT P V I P E 
LWEAEVGRWIT 


8404 


22305 


A 


8465 


367 


248 


FQMRIHKWlilDLHSPSEIKQITSISIEP 
GVEVEVTIADI 


8405 


22306 


A 


8466 


341 


153 


NMILGWAQWVTPVIPAIGEAHWGGLLYP 
RSSKPAWATQQNLLFTKKKKNIYIYIYI 
YIYERY 


8406 


22307 


A 


8467 


368 


101 


VEMGFRHVGQAGLELLTSSDPPASVSQS 
AGS S TSLPTADLLTPGWVG VYGLVDKNP 
VTL I AMQRLPA VLE KKS VAR KNS ENK I V 
LKRGQ 


8407 


22308 


A 


8468 


362 


248 


HVGQAGLALLASGDPPTSASQSARIAGV 
SHRARPEAPL 


8408 


22309 




8469 


384 


196 


S ARLGLPKFWD YRRE PPCPADDML I LMV 
GILETFSLIFTATMESRNYCVYFKMRNK 
HIISKTS 


8409 


22310 


A 


8470 


386 


300 


LAML P RL VPNS WS QWLSPLP PNMLGLQ 
V 


8410 


22311 


A 


8471 


380 


178 


PKKNKFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFMYIYIF 
A I VYQTLACNNKFLQMER VTRKLRQC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


.Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E-Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophnn, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


84J1 


22312 


A 


8472 


3 


154 


FLVDMGIHHVSQAGL.KI.IiTSGDLF.ASAS 
QSPIFFFLFFFFEKQILAQAGG 


8412 


22313 


A 


8473 


3 


162 


GLYHVGQAGLKPLTSGDPLASASQSVRI 
TGVSHRTRPKI I FKL I FFLFFLVA 


8413 


22314 


A 


8474 


266 


107 


LCLFLIEMRFPYWQAGVKLLSSSHLPS 
SASQSDRITGVSHCFWPEVFKRGIF 


8414 


22315 


A 


8475 


361 


169 


RLPGPSNPPISASLAAGITGARHHAQLI 
FVFLVEMGLCQAGKHNFEKYS RVL I YS P 
IVSFTHFI 


8415 


22316 


A 


8476 


I 


154 


KKL I PTLMDHGAGFKSS VKE ITTAWEM 
AREVELEVEPEDGNELLQSRGKT 


8416 


22317 


A 


8477 


3 


193 


SHYQSLISSNHGHKHCGRPQGPLPRKTI 
DLCSLDYQLTFPPLLTHDPVKSPSVRNT 
QELSLIP 


8417 


22318 


A 


8478 


408 


103 


TGSRLTLSSRVECSGVITAHCNLELPGS 
SDP LAS ASQ VAETTGMHHHVWLI FLGIF 
LVKMASHYVAQDDLKLLGSTDPPVSASQ 
SVGITGISHHAWATICYK 


8418 


22319 


A 


8479 


59 


166 


NVNTCLCLHTHTHTHTHTHTHTHNYFKE 
TPIHQYI 


8419 


22320 


A 


8480 


394 


253 


FLLLLLESGFCHVAQAGLKLLSSCDLPA 
LASQSAEISGVSHCAQPLA 


8420 


22321 


A 


8481 


379 


184 


CFSLPKCWDFRGVPISPGDFWNFIIKLG 
VCPFCQGGFELLAPNDVAASGFQNAGIS 
GINFWAWAL 


8421 


22322 


A 


8482 


367 


170 


CGLSSGLGGRSGRESERKKERERERDRE 
KKKEGEGERKRERERESVSSYKDTNSMG 
SGPHPYSLI 


8422 


22323 


A 


8483 


324 


136 


QRSSCLSLPSSWDYRRMPPHLANFCIFC 
RDGVL P CCPS WSQMTTS FLS LNFL ICLV 
GIILGRG 


8423 


22324 


A 


8484 


1 


57 


GERRYGTCIYQGRLWAFCC 


8424 


22325 


A 


8485 


35 


339 


GWLLRFFWWQWKWGNNLNVSLIDGWLSY 
DTLGYNG ILN CHMY I LTEGDSQKKKKKK 
KKKKKKKK3CKKIDPAGNSKI YNRKI FKT 
PRGFFKWALGPLLILFF 


8425 


22326 


A 


8486 


330 


66 


RPGVSRFNPWPLIFFLRRVFLCRPGWNA 
WQSLLTKISASQVQAIFLGGGGCSEPR 
SCHCTPAWATTAKLCFKKKNGKNFTDIM 
FSK 


8426 


22327 


A 


8487 


3 


209 


CLSLLRSWDAQAPPPHPANFFLMFNFCR 
VGVLLCCPGLASSDPPPLVSQSARITGM 
SHCTWTIFLSFK 


8427 


22328 


A 


8488 


3 


216 


GQS I TVS CTATNSD VGSSNL VS WYQS SN 
LVSWYQQHPGQAPKLIIYEDNKRLSSEK 
KKKKKKKKKKKKKK 


8428 


22329 


A 


8489 


326 


62 


PAYFS IGSFFLFLFFFKTECHSVTRLEC 
SGAISAHCKLRFPGSSNSPASGGGGRSE 
PKSHHFTLAWTTRAKLHLNKNKIKTVKE 
FLSS 


8429 


22330 


A 


8490 


2 


156 


SSAGIGRTGCFIATRIGCQQLKARGEVD 
ILGIVCQLRLDRWVCGCKQGQQR 


8430 


22331 


A 


8491 


399 


254 


KNNKRNKTINNKITNKLIIKLKNISFLY 
FFFFFFYFFFFFFFFFFFFF 


8431 


22332 


A 


8492 


2 


130 


FRRVGQAGLELLTSGDLPAS AS QGAG I T 
GVSHRAQPTFLKSC 


8432 


22333 


A 


8493 


3 


94 


HLPAEFTPAVHASLDKFLASVSTVLTSK 
YR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8433 


22334 


A 


8494 


327 


70 


KSGFSNVGQAGLKILTSSDLPVSASQSA 
GITGMSHCTWPLYPYFGLGSRSFHSIEE 
HCWYQIiAPFPSCRNPFVIVPFFVHRFYIj 


8434 


22335 


A 


8495 


97 


219 


PGTVAHACNPGVLGGQGGWMTSGQEFQT 
SLASAAKPRSTRP 


8435 


22336 


A 


8496 


3 


191 


GIRRGEILEVIEFTSNEEMLCRDPKGKY 
GYVPRTALLPLETEVYDDVDFCDPLENQ 
PLPLGR 


8436 


22337 


A 


8497 


144 


304 


SYFYEEILFSTVIKKNAIVS FTATWVEL 
EAIILSELTQEQKTKYHMFSLVSGS 


8437 


22338 


A 


8498 


2 


90 


LTILVLiAMWKVGFFKRNRPPLEEDDEEG 
E 


8438 


22339 


A - 


8499 


1 


156 


PSSQDYMHAPPHLANFLNKFFVKWCLAM 
LPRLMSNWPQTILLSPPKALGLHV 


8439 


22340 


A 


8500 


67 


319 


KLMS FGPTLLSARCVSTGQI GMNELNLN 
I DAMSVAATINNE IMSFARTWMKLKAIT 
LIILTLEHKTKYCMCSLISGSLRMTTQG 


8440 


22341 




8501 


3 


299 


ETGSHCVDQGGVQWH I S AHCS LDLLGSS 
DPPASASRWGITSAHHPL.LWSISSLPP 
LLLHPHNLLNLLLLLLPSLSIVLNLHIL 
PPLIiPPLRLIjCSPC 










56 




NNSAKKKKKKKJCKKKKKKKKKKKKKKGG 
AAF 










486 


348 


TGFHRVAQAGLELLS S GNP P ALASQNGG 
I TG VSHHAQPAE YN FKK 


8443 


22344 


A 


8504 


158 


403 


RIFFFFETVGFVAQDGVQGRGAILQPRL 
FELKRFSCLSLPSSWDYKRAQPNPNDFF 
KRQMAFDPKIQRISLGSKKKKNHPIB 


8444 


22345 


A 


8505 


3 


78 


DWHIAYVLLYGPRRVEIMEEESEQ 


8445 


22346 


A 


8506 


164 


291 


TRFCCCWLFWWFCVLCVCVAVFPPWCI 
L FF I WL FFFFF F F 


8446 


22347 




8507 


372 


69 


CAPLPPLKFFFPPRGFIFRGGGSPIFPP 
PKKGVLPKNPPKGFFPPPPFLKPAPGGA 
FNYPPFGGKIFFSPPRGKWAPPGGFLKG 
APPFFFFFFFFFFLFF 


8447 


22348 


A 


8508 


282 


1 


SFQHVAQAGLKLLGSRDPYTLAAQSAGI 
TGVSHHARPVLYFSSVLFSMEFIiSTMSA 
VILPCICSFCVSLAVPSSCILQQPLPPT 
CTYIKTETCI 


8448 


22349 




8509 


400 


45 


TPPVSSLFFLPLKE I I FFCPPFRQSLPP 
PFFFFPPPLFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFKYLSLPFTL 
SKEKNK 


8449 


22350 


A 


8510 


241 


419 


TPLGGWEKKKKKKKKKKKKKKKKKKKKK 
KKKKNKKKKKKGGGFLKKPLGGANFIGA 


8450 


22351 


A 


8511 


317 


28 


QSLSPSPQTVYFIYLI IYAVFFTTLTII 
LIFSPS FRMELKFL LRCYFFAP FYAFL F 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFSQPGAQTTAGRAGV 


8451 


22352 


A 


8512 


124 


292 


MCCYIMSLLKVICTIIKEFSTKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKI KK 
KGGVPL 


8452 


22353 


A 


8513 


1247 


910 


QAQLGDIGTSCYTKSGMILCRNDYIRLF 
GNSGAAG ACGQS I PAS E LVMRAQGNVYK 
LKCFTCSTCRNRLVPGDRFHYINGSLFC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
ocl 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


1 Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysir»e, L^Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EHDRPTALINGHLNSLQSNPLL.PDQKVC 


8453 


22354 


A 


8514 


99 


262 


MNKRFLLNKKKKKKKKKKKKKKKKKKGG 
PFKKNPGGAQIFPGGEKKI FFLKGGL 


8454 


22355 


A 


8515 


85 


2 


GWGWAQWLTP VI PVX.WEAEAGGSRGQE 


8455 


22356 


A 


8516 


244 


423 


VFAVFNFLGWVMDMLINLAWI ILQCAC 
ISNHHVVHLEYIQFLFANYTS I KLGKKP 
SKSV 


8456 


22357 


A 


8517 


3 


114 


KINGEDKQKILDKCNEIINWLDKNQVCV 
FFFFFFPI 


8457 


22358 


A 


8518 


413 


91 


GGVRGKISFRGGGFLKKKKKKI.L1KKKKK 
WGGGPKEKKKRGGFFFFFLLNFFFWGGV 
KKKKKKKKKKKKKKKKKKKKKKKKKRRK 
GRKEGREGRRKEGEGRQEINMTA 


8458 


22359 


A 


8519 


1 


163 


SRLPGRSRRRSRGRRRRRRRRRRRRRRR 
SRVtilFPWDQNLDIiNIVYIIEDVDAA 


8459 


22360 


A 


8520 


191 


21 


RFTTHVDAWATVTVFLIESVFLHVGQAC 
LELLPSDDLPASASENARVTGVSHRAWL 


8460 


22361 


A 


8521 


2 


144 


AAATSQAWWCMPWPATWEAEVGASLKP 
GRLQLPLTMIALVHL.SYAI 


8461 


22362 


A 


8522 


90 


8 


KPTPFLPTLIARTTbLLPISPLILIIL 


8462 


22363 


A 


8523 


305 


146 


VFRIHSCGIRGSVDVKITDGLLVIRRIE 
NVPPGPNNKNKNPYAI FQSSSIESQ 


8463 


22364 


A 


8524 


176 


2 


LAWPFLFLPKCWDYRHEPLQPASLTTFV 
REWDKIGWAWWLTPVIPALWEAEVGRSQ 
GQ 


8464 


22365 


A 


8525 


2 


200 


CLIPS S WD YRHMPPRPANFCIFFFFLGD 
HVSQVGLKLLTSKDPLAWTSQRAGITGF 
SHHTKPQGFF 


8465 


22366 


A 


8526 


2 


217 


SFSLLFPPSFSL.LL.PPLASLLLPPLPSF 
SLLFPPSFSLLSPPSFSLLLPPSFSILL 
HPTFSHLYPPSPSFI 


8466 


22367 


A 


8527 


2 


186 


I RGLRP LE KRG I R VG VSR F S R YHLS RL S 
FARKGNSPTPCASRVRRCPSLLHGLHPL 
SDKPQ 


8467 


22368 


A 


8528 


3 


153 


CLGFLSGWDCRC I P P CLANL KKL F VEMG 
SLFWPQWIIjPPWPLKVLGLQV 


8468 


22369 


A 


8529 


2 


388 


RTAVHTF I LVLQVCVCRWHLS LAALTPA 
HLVCSSHR XLS YLS IRGSWDHRCMTTTP 
GL VI S I FFVEM KYR YAVHAGH ILLGSSY 
PASAYQSAEITGVSHWAWPTDSLWQKPH 
MVI WC I S CFLNVLIGTM 


8469 


22370 


A 


8530 


293 


64 


YRIS IKVSKWFKGWVSSFWI FFFFVFFF 
SFFFVFFCFFMFFLKQYLKFIINIIIIL 
TQALYICTTSKICYSFLYFR 


8470 


22371 


A 


8531 


2 


250 


S CLGLPS S WDYRGTLPCLANFCI FNRDG 
VSLCCQGWSQTPGSQVIDEAVYSIYAYY 
ILALLIVYVAQWEVWLKHFCCILKYT 


8471 


22372 


A 


8532 


2 


201 


FHHVGQTGLELLS S GDLPAS ASQCAG I T 
GVSHCTWPSILYRMNFKNLSAVFIAWVI 
EGNYTNKTCI 


8472 


22373 


A 


8533 


319 


200 


TFTFFLQRWGLTMLPRLVSNSWAQAICL 
LQPPKVLELWA 


8473 


22374 


A 


8534 


160 


297 


KTETIiALVAHACNPSTLGG IGRQVS SAH 
EFKTILGNMTKPILYTKL 


8474 


22375 


A 


8535 


333 


228 


PNRGHY I AI VKSHDF WLLFDDD I VE VS S 
FLISYF 


8475 


22376 


A | 8536 . 


3 


325 


PPSSLLPCSWILDCCASNERDSVGVGPS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspnrtic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EPGAGYNLLVCCFLSLLEKRS IRVGVTR 
FSRCRLSPLSLTRKGNSLTPCASQVRRC 
LALLRLAHSALH PLS CS YCLALP 


S476 


22377 


A 


8537 


216 


5 


KLGFLVNLVEFQVKFFFLFFFLEIILIE 
MRSKYVAQAGLELLGSGDLPASASQSAE 
I TGMSHHAL PNY S F 


8477 


22378 


A 


8538 


136 


22 


RPRRRLVLNPWPQAILLPWPPKVLGSQV 
WAT CGRRPAS 


8478 


22379 


A 


8539 


136 


22 


RPRRRLVLNPWPQAILLPWPPKVLGSQV 
WATCGRREAS 


8479 


22380 


A 


8540 


8 


319 


YIYMSMYMLIYVCIYVSIYICVCIYINS 
ISFNTYMI IYMLI YVYVYMCVYICVCVY 
IYTNSISFNTHITPIKHSDVHTVSTTRC 
NLGGRGFCHTWPLPHLAGW 


8480 


22381 


A 


8541 


331 


156 


SVNGLLSLPACLPVCLPACLPSFLPSFL 
PS FLS FFLLLLLLLQKP VNS L I FSQSLS 
IS 


8481 


22382 


A 


8542 


72 


1 


KIGQARWLAPVI PALWEAEAGGSP 


8482 


22383 


A 


8543 


2 


83 


QAGVQWRDLGSLQALPPGFMPFSYLGG 


8483 


22384 


A 


8544 


293 


2 


KTVWHYLVNAPEVEIQAI YSQETCKSMY 
QETCTRIFIAILFI IAPNWKQRQDTMNP 
GNKENGKVATLDSDGFLVRFQSLMRSNC 
PLYPFNKFPILLK 


8484 


22385 


A 


8545 


314 


122 


LVETGFLHVGRAGLELPTWSDPPTSASQ 
GAG I AAVSHRGQP VDFKNN ISTQIQGRP 
IICNYKTF 


8485 


22386 


A 


8546 


316 


123 


LVKTGFLNVGRAGLE I PTWSDP PTWAS Q 
GAG I AWSHRGQ P VDFKNS I S TQIQGR P 
IICNYKTF 


8486 


22387 


A 


8547 


3 


247 


GIMATERLANYTGGIYAEYQDTTYINHV 
VSVAGWG I SDGTEYWI VRNS WGEPWVRC 
FHFLKVI PRKKTS W YS SRLNG S VYV 


8487 


22388 


A 


8548 


289 


25 


QRREKFFKNGPPIFFRGPFGFPQIFVSF 
PFFFPKIGVFFVFFFRFSLVLKGDFFSN 
FLKINCFFFFFFFFFFFFFFFFFFFIKN 
RPCL 


8488 


22389 


A 


8549 


289 


2 


KKKIGETLCKLKLCKKFLDVTPKAQFIK 
QKKEKL INWQFIKI KNFRS P KALLRRGK 
GKPGTGRKRNANHISDKGPLCRIQKKNS 
RGRAQWLTLW 


8489 


22390 


A 


8550 


290 


187 


RQENPLNLGGGGCSEPRWCHCTAAWVTE 
QDSINK 


8490 


22391 


A 


8551 


293 


183 


RDEGRG I S YMLPRLVLNSWPQAI LLPQP 
PKVLELQA 


8491 


22392 


A 


8552 


364 


87 


MLPLDFFLGPSLDFCPPFCESGSGGIKV 
PESTTPRASAFLPPGAANLHHILQLVGA 
PRVPPGFHHVGQDGLDLVDLVIRLPRPP 
KVLGLQGG 


8492 


22393 


A 


8553 


57 


225 


KWQKWTPWAGGVAHACNLSTLGGRGGKI 
ACAEEFKTILENIVRPCLSKNKWISFS 


8493 


22394 


A 


8554 


326 


2 


KS KRGYSEPLVFS I CSARAPRSHS VSQA 
TVYGHAHASTQPFPPRLNRSSYLSLLSS 
GDYRHDPP PLAY ICI S VIGNDVEHLFLC 
LLSTCISSVNRSHVYSTLVPIFGP 


8494 


22395 


A 


8555 


372 


148 


YICFLRSSVRQILFTPDCSDCSMWVPCN 
IRLPGSTHWCASASQGAGTTGARHHARL 
I FFVWFFCI FS I FSRENH 


8495 


22396 


A 


8556 


422 


269 


AADQER LHT YQLNY YH FCKR WGLTTFPR 
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SEQID 
NO: of 
nucleotide 


1 SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


^Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

nucle- 

otide 

location 

cofrespon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenj (alanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Ti-yptophan, Y= Tyrosine, 
X=Unknown, *=Stop codon, /=possibie 
nucleotide deletion, V=possible 
nucleotide insertion 














LVSSSWVQAILPTWPHKVLRLQA 


8496 


22397 


A 


8557 


669 


407 


AGVKWRNLSTLQPPPRRFKRFSHLSLPS 
SWDYRCPLPCPANYCIFLVEKEFRHVGK 
AGLELLTSGDPPAWVSLIAGXTGVSHRA 
HPD 


8497 


22398 


A 


8558 


195 


352 


LGLQAGTTASGYSTVILGGMLKPRSWAW 
WPWPATREAEVGGSLEPGRSKLQ 


8498 


22399 


A 


8559 


345 


219 


MLARMIS ISLPCDPPASASQSAGITGMS 
HHGWLNLFKSIHL 


8499 


22400 


A 


8560 


137 


2 


VXYCLPTRSL I KTVRPLS PSQATASP VT 
CTYTPRWPEVTEESOKN 


8500 


22401 


A 


8561 


324 


193 


RSSGNLPASAAQSARITGVSHRTRPKCI 
I S KE YVAENNHL KS TS 


8501 


22402 


A 


8562 


326 


121 


HNHTHTHTKNHKKHQQPQHTHTYHTQTD 
IYVLQAASQKSLILISSLAV 


8502 


22403 


A 


8563 


3 


85 


HEKTHTHTHGHTHTHRHTHTHTLCARM 


8503 


22404 


A 


8564 


2 


153 


ARDRHIPPCLTNFSVFWRDGGLAMLPRL 
VSKLWAQAVHPPWLLKVLRLQA 


8504 


22405 


A 


8565 


2 


177 


ARVGFHYVGQAGLGRLTSGDPPASASQS 
AG I TGMSHRARPMS SLTLH I LQDTWRAY 
AG 


S505 


22406 


A 


8566 


353 


212 


AGFRHVGRAGLELVTQDGPRTSACQSSG 
ITGMSHCAHPEKSTAFILF 


8506 


22407 


A 


8567 


76 


191 


ELIFCRDGGLTVFPKLISNTWPYAVLPL 
WPPKVLGLQV 


8507 


22408 


A 


8568 


1 


142 


GTRGFLHVGQAGLELPTSGDPPALASQG 
AQMTGVS HCASQKKE FLEW 


850S 


22409 


A 


8569 


250 


95 


SASRAPLPLPPSHIiPLRAAGLSPLCPPR 
LVS S YRPHVILL PL P PKVLGLQ V 


8509 


22410 


A 


8570 


3 


226 


HEHTHTHTHTHTHTHTHGHTHTHTHTLV 
LVYSLCEHIWRFIVISRARVCCIYVWF 
FECGYADRRVADKWLWIH 


8510 


22411 


A 


8571 


"j 


216 


GTRGLGVGVSHLVSKNTHTHTHTHTHTH 
THTHTATASQLSVFKIWAFQRSL.DKGLT 
YISNSYCSSKFYLYNH 


8511 


22412 


A 


8572 


356 


268 


LTMLARLVLNFRPQVIPQPWPPKMLGLQ 
A 


8512 


22413 


A 


8573 


12 


175 


RWGFHPVGQAGLELLTSSDLPTLASQKA 
EITGMSHCARPSFFSFFLSFFFWKHL 


8513 


22414 


A 


8574 


1 


163 


GTRTG FHHVGQADLELLTSGDPPVSASQ 
SVGITGVSHRAWPVGLCIIALISNEY 


8514 


22415 


A 


8575 


344 


176 


EFLVETGFRHIGQAGFELLTLGDQSTSA 
SQNTEITGVSHPAPGLAIIFKSGNQAFF 


8515 


22416 


A 


8576 


2 


179 


ARGILCHSGTFKLDSRPSMVAHTCNPST 
LGSQGGRIACGQEIDTSLLKVARPRLYG 
QIF 


8516 


22417 


A 


8577 


1 


100 


GTRGFTMLPRWSDSWAQVILLPWPPKV 
LELQM 


8517 


22418 


A 


8578 




114 


HERHE ILS I YLT I YLS I YLSIYLS I YL I 
YLPTITRI 


8518 


22419 


A 


8579 


1 , 


264 


GTRPCRIELYRWESLAKAQETSGEEIS 
KFYLPNCNKNGFYHSRQCETSMDGEAGL 
CWRVYPWNGKRIPGSPEITGDPNCHIYF 
NVQN 


8519 


22420 


A 


8580 


1 


245 


GTSGTSG I LGFYLEREDRLLQI RTDTIT 
LSHGYSTFSLNRCDSRHHMNRMFQMLYQ 
SPFLGYIQLSHEVLAFGISYIYNTE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lystne, L=Leucine, M=IYIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophnn, Y=Tyrosinc, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


S520 


22421 


A 


8581 


1531 


1457 


QARNEriARRLWDVSCDI..IiGLPID 


8521 


22422 


A 


8582 


191 


2 


TITSYFSPTPAPGSHYATFCFYKFDYSR 
YFIKVELWEAETGGSRGQEIETILtANTV 
KPRLSRA 


8522 


22423 


A 


8584 


2 


212 


ARAG I Y PKE Y KS F YYKDTCMRMFI AGL F 
TIPKTWTQSKCPSWIDWIKKMYRQGPIL 
SPMLECSGEVSGFK 


8523 


22424 




8585 


2 


192 


ARGNS PASS S RVAG I TGAHHHARL IFVF 
FLFVRFFFSFLKTGFKLVAHSNFQPLTL 
FAPLHFK 


8524 


22425 


A 


8586 


277 


66 


PLDFRLGHKRRLPFPKKKKKESKFVCVC 
VCVCVCVCFKRQCLVTLPRLVSNSWAQA 
ILLSWPPKILALQV 


8525 


22426 


A 


8587 


395 


286 


PHLGLPKCWDYRREPPCLAEWLKKYVW 
VTYFS FRR 


8526 


22427 


A 


8588 


307 


177 


RYPSTINYSILNRDKIKPKLDRMVYKAR 
PRVMCLPWPPKVLGL 


8527 


22428 


A 


8589 


167 


349 


RNLRELGEFFFFFFWFFFFFFFFVGKFS 
VFLLVGNLFLFFGGGIFGLSRVMCFKEF 


8528 


22429 


A 


8590 


375 


240 


LLSRFSWFCFVFRQGLTMLPRLVLNCWA 
RG I IiL P WS P KVLG LQA 


8529 


22430 


A 


8591 


347 


215 


ETG FPH VGQAGL KLLASSDP PAS AS OS A 
QITGMSRRAQPSAKRF 


8530 


22431 


A 


8592 


340 


114 


HRTAHCS LYLSGS S DPATS S S QVTGTTG 
KDQHTWLLFKYFCRARVS KINKYKS INK 
FYFLNTQTTPSEYGKASPG 




22432 








253 


LRLCCGLSCSTAKKKKKKKKKKPKKKKI 
KKKNLHPKRGGFEEILFFFPKGGKNLF 


8532 


22433 


A 


8594 


3 


135 


GFCHVGQVGLELLTSGDPPALASQNAEI 
TGVSHRAWPPVLFFF 


8533 


22434 




8595 


2 


274 


PRVRT I S LLG I LVYRSHL ISSLLiCLEGI 
I LS L F I I ATL I TLNTHSLLAN I VP IA I L 
VFAACEAAVGLALLiVSISNTYGLDYVHN 
LNLLQC 


8534 


22435 


A 


8596 


3 


316 


SIPYSWGEKEGIPCMAPPQIQGTSRLND 
FTAIiSLHLNTHTHTHTHTHTHEIQHLPP 
QG I TALILNSLLRHYCPS LAFPS LS P KQ 


8535 


22436 


A 


8597 


385 


291 


KAPPLFFFFFFFFFFFFFFFFFFFFKPV 
KKILGV 


8536 


22437 


A 


8598 


275 


369 


GVQVLKLLTSGDLPALASQSAE ITGVSH 
RAR 


8537 


22438 




8599 


98 


344 


VHYEEFMCVCVCVCVCVCVCWPRVRGVD 
MNPVEHPFGGGTHQHIGKPCT I RRNASA 
GRKVR.LMAALWTVRIGGTKTVTI)RKT 


8538 


22439 


A 


8600 


328 


195 


THVPPSLAKFVFLVKMGFLHIGQAGLDL 
STSGDPPALASHVFKR 


8539 


22440 


A 


8601 


2 


214 


HVDWMGTFAI LS ELHHDMLHVDPENLR 
LLDSVLWCELANHFGTDFTPPVQAAYQK 
WADVANALAHMYH 


8540 


22441 


A 


8602 


89 


181 


GLRHENRLNSGGGGCSEPKLCHCTPAWA 
TE 


8541 


22442 


A 


8603 


340 


57 


KEGHQEMEQAEDGGHGRKAPGTS PWTLG 
ELPKSARGHTGLFVSYRGCRTQFRFKKV 
LFGGDRNRGLAMLPRLVLNSWPQEILLP 
QPSKVQGLQK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
cth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspnrtic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G— Glycine, H— Histidine, I— IsoJeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q = Glutamine, R=Arginine, S— Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \ = possible 
nucleotide insertion 


8542 


22443 


A 


8604 


330 


132 


TGFLHVGQSGLKLLTSGDLPTSAFQNAG 
ITGGSHCTRPFSLFHSYLVLFYGCSSSQ 
VSLSILARM 


8543 


22444 


A 


8605 


546 


279 


RPGPTVSPRLECICMIMAHCSLDFPGSG 
VS P TS AS RVARTTS THHHAWLi I FYS S VE 
TGSHHVAQAGLELLGSSNPPISASQSAG 
ITAL . . 


8544 


22445 


A 


8606 


3 


249 


NRFNLGSGGCSEPRSHHCTPSWATEQET 
CITYliEYCFPPVSHTNLVYFKPLNVTIR 
SDYCILGLHASSIGGSDSLIIDCWD 


8545 


22446 


A 


8607 


159 


324 


VIQYRTNILHLHLCEVLVFWFLPYFFLK 
KTYTGIGQMRWLTPVIPALWEAEAGGS 


8546 


22447 


A 


8608 


334 


179 


TGFPHVGQSGLKLLTSGDLPTSGSQSAG 
ITGVSHCTWLEVIYFLNIS I YPS 


8547 


22448 


A 


8609 


278 


202 


QQPSTLRQDPPPAKRLHLSEGSDGH 


8548 


22449 


A 


86J0 


236 


66 


AAS TFFIiQKI KRHLiLSG T VAHACNPS Th 
GGRG IRI SRGRE FKTSLGNMVRPRVYKK 


8549 


22450 


A 


861 1 


29 


177 


GKQVMAIjHCFI FFHFFFRRSLAMLPRIjV 
SNSWVQEILPPWPLKVLELQA 


8550 


22451 


A 


8612 


1 


215 


HTSRTL.FVHLRLKNFNKYLIEWGLSMLP 
RLVLNSWAQAVIjPPWPSKVLGITGMSHC 
TWLYLHLSTDLLKLC 


8551 


22452 


A 


8613 


3 


91 


LRHGNCLDPGEGGCSEPRSCHCTPAWVT 
E 


8552 


22453 


A 


8614 


2 


167 


FTISLLGILVDRSLTHHINNIKPSFTRE 
NTLMFIHLSPILSTRKQLAIQHLSRLE 


8553 


22454 


A 


8615 


2 


281 


FCILVETGFHCVAHAGLQLLSSGNPPAS 
ASQSVGITGMSHCTSPNLSLLTQSKSFC 
LLIQNHNPYAFINSQLTFSILFLLSYFI 
FYDSLFLLF 


8554 


22455 


A 


8616 


282 


182 


GQAWWLTPVIPALWEAEVGGLLEPRSSR 
PAWST 


8555 


22456 


A 


8617 


379 


207 


SNOTTKKYINFFFVEMRSHYVAQAGLKF 
LGSSNPPTSASQSASITGISHYVWPISP 
S 


8556 


22457 


A 


8618 


705 


483 


DKSFALVARAGVQWHHLGSLQPLPPGFK 
QFSYLSIiPSSWDYRHAPPRPANFCILSR 
DGVSPCWPGWSRTPDLR 


8557 


22458 


A 


8619 


359 


220 


RQSLTMLiPRLVSNSWAQAILPPWLSRLL 
GLQAJLAP I PGKS YNSKTP 


8558 


22459 


A 


8620 


3 


186 


HEVSWVKRKQDEWIEFDDDTVSIVAPED 
ILRLSAGGDCHIAYVLLYGPRRVQILEE 
ESEQ 


8559 


22460 


A 


8621 


1 


116 


GTSFCRDGGLTVLAQLGSSFWPQRILPP 
WPPKWELQA 


8560 


22461 


A 


8622 


369 


188 


MYS LEMG FI GNGHAGLELLASGDLS ALA 
SESAGITCMSLRYRTRFSFKRATCGLDL 
SLQS 


8561 


22462 


A 


8623 


202 


22 


YILGFYFMPVICYQERKKDKKNWEVTTI 
MSFVGICMELGAI ILSKLMQEQKTKYKI 
FSQV 


8562 


22463 


A 


8624 


387 


315 


SNLGGGGCSELRLCHCTPAWATE 


8563 


22464 


A 


8625 


2 


135 


TRDLCSLVYLLTFP PLLSHDPAKS PSVR 
NTQELS I KKKKKGRPF 


8564 


22465 


A 


8626 


3 


232 


HASAFEEPVYIKSRQKRKESNPPKLVSS 
QPHGLKKKKKKKKKKKKKRGRGLKKKKK 
KHNKKFKKGQKKYNFGRGAF . 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspnrtic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H^Histidine, I-Isoleucine, 
K=Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vnline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8565 


22466 


A 


8627 


97 


373 


QHGFLKKKKKKKKKKKKKGGPIiL.KK.PIjG 
GPNPTGDGKKKPF P KKGG IKKPPGEFLK 
KNLFLGGKKNGKNPPKKLNPLGEKKNFK 
GKKGKNPP 


8566 


22467 


A 


8628 


2 


188 


RYGCPPPYHTFEEPVYIKSRQKRKEWNP 
PKLVSSQPHGLHDFFKKKKKKGAAVLKD 
PSGGPS 


8567 


22468 


A 


8629 


287 


68 


GEEGRTQKGERNGGGGSERKEGEEHEAR 
RTGRGEGGGGEAKGR PRS ARRRGRGS RG 
SKSQTPSRHLPAHI itn 


8568 


22469 


A 


8630 


364 


136 


VSLLLGLEYSGTITAHYSLNFLLFIVET 
GSHYVAQTGLKLLGSSNPPTSDSQTFEI 
TGVSHGAQPKVQFWSKNLDT 


8569 


22470 


A 


8631 


355 


70 


LFPPRFFPFFSPLSPLKFFFSPKGFNFF 
REFFPIFSPPKKRVL.SKNS PGGFYKPPL 
RGKI FTFPPPVKFGPPRGLFKGPPPFFF 
FFLFFFFFLVI 


8570 


22471 


A 


8632 


2 


266 


AFTISLLGILVYRSHLISSLLCLEGIIL 
SLVI I ATVITLNTHSLLANIVPIAILVF 
AACEAAVGLALLGS I SNTYGLDYGHNLN 
LLQC 


8571 


22472 


A 


8633 


1 


186 


LTHH INNIKPS FTRENTLMF IHLS PILL 
LKKKKKKKKKKKKKKKKKKKKGGALKKK 
PGGGQN 


8572 


22473 


A 


8634 




80 


VLEIAVALIQAYVFTLLVSLYLRDNT 


8573 


22474 


A 


8635 


713 


333 
_ 


EGPPPPRSKKKGTQGGGKDPPS PPEPKT 
PGNPQGRKTGPPPPGRPYTGPRPPGSGP 
TRGGGSRSSSSNTNAPGEKIFFSKNPGR 
KIFPPRAILVFFSPFPLKNFFFSLRLLI 
FLGGCAPFFPPPK 


8574 


22475 


A 


8636 


389 




FIFSFLFLIIFFFLNINFFFYIFSSLSF 
FFFLNFNLLLKHTPPIILFFFSLSPIFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFGTKKELY 
LLRARGCKQYKNQKL I WYLTF 


8575 


22476 


A 


8637 


2 


192 


VRTLGLRTDAIPGRLYQTTFTATRPGVY '" 

YGQWYEICGANHSFMPTVLELLALTIFE 

IGPEFTL 


8576 


22477 


A 


8638 


1 


87 


PTRPLE I AVAL I QAYVFTLLVS VYLHDN 
T 


8577 


22478 


A 


8639 


2 


132 


INLASTLI IFTILILLTILEIAVALIQA 
YGVTLLVSLYLHDNT 


8578 


22479 


A 


8640 


2 


113 


LFFIAEYTNIIIINTLTTTIFLGTTYDA 
LSPELYTTY 


8579 


22480 


A 


8641 


375 


22 


FIFISPLAYLFFLPFPFFFIPPLCFFFF 
I PLFFLFLSFLFSL I FFLFSL I LIFPPS 
LFFFFPPPFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFCWLPHRVMXAKVGISWSKKI 


8580 


22 81 


A 


86 2 


180 


34 


WFISDSLWLGMVAHTCNPSTLVGQGGWI 
TCGRSRPSVPKRTPSQTCAPA 


8581 


22482 


A 


8643 


338 


133 


NLPLGLFYSLQKKKKKPFLRPGTLGAPK 
GNFQGAPP L FGKFAF F FFFF F FP FFFFF 
FFFYGFFFFLFK 


8582 


22483 


A 


8644 


156 


287 


LTYPI KPPLLCSQKKKKKKKKKKKKKKK 
KKKKKGGAFKKNPWGAQI 


8583 


22484 


A 


8645 


1 | 325 


LHHCHTHNFVjSTKWCFIRKKKKKGKKKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
' sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanint 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, ' 
K=Lysine, L=Leucine, M=!VIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKRBKKKKKKKKKKKGGGPLKKPPGGPK 
FSGGGKKKIFPQRGGYKKPPGGFLGKTL 
FLGGEKMGENPPKKIKPLGEKKIF 


8584 


22485 


A 


8646 


370 


3 


PVRASRLLCLPKQAWAMAGAPPPASLPP 
CSL I S DCCASNQRD S VG AGP S E PG AG YN 
LVMHCFLSPSEKHSIWVGVTRFSRCCPS 
PLSLTRKGNSLTPCASQVRQCLALLRLV 
QGAGTHRTRG 


8585 


22486 


A 


8647 


341 


88 


KKKKIFSPPIKRGPPSVYFICPPPPFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 














FFFFFFFFFFFFFRMGCDRWHGELYNKI 
PIWHKLCGDL 


8586 


22487 


A 


8648 


1 


108 


PTRPRTRGVASVLYFTTILILIPTISLI 
ENKILKWA 


8587 


22488 


A 


8649 


3 


96 


EICGANHSFMPIVLELIPLKIFEIGPVF 
TL 


8588 


22489 


A 


8650 


457 


297 


SPSFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFLFYFALF 
FIYLIIKDNYMFQC 


8589 


22490 


A 


8651 


84 


352 


YE KTDVKI I S I LFNTD Y MLE YNVLH I LG 
QIKMLLISKKKKKKKKKKKKKKKKKINK 
FFWGGLILTLLGIVLYSPLEALKISPSF 
GFLEK 


8590 


22491 


A 


8652 


2 


111 


NNI KPS FTRENTLMF I HLS P I LLLSLNP 
DIITGFSS 


8591 


22492 


A 


8653 


1 


305 


RQQQQQQQLRNLRDFLLVYNRMTELCFQ 
RCVP S LHHRALDAEEEACLHS CAGKL I H 
SNHRLMAAYVQLMPALVQRRIADYEAAS 
AVPGVAAEQPGVSPSGS 


8592 


22493 


A 


8654 


129 


268 


LSEVLYLFPPKKKRLIILCLFFRNPGSS 
NLQKITKEPIIDYFDVQD 


8593 


22494 


A 


8655 


2S1 


117 


CI ITLS FINTLQAYI I WLWFSEVCLYV 
SLSIYLSIYLSIYVFEYRYIKISFHL 


8594 


22495 


A 


8656 


3 


177 


DLHAHKLGVDPGNLNLLSHCLLETLAGH 
LPGEFTPAGHAFLDKFLDFVSTVLTYKY 
R 


8595 


22496 


A 


8657 


3 


270 


RRRGRAHCSLDLLGSRNPPASASRVAVT 
TGTRHLAWIilWGGFLFFVLRDVLVRFH 
ATDKDI PKTGQFTKERGLMDL.QFHMAGK 
ASQS 


. S596 


22497 


A 


8658 


107 


11 


Q FEHTK P TP FL PTL I ALTTLLL P I S P F I 
LIIL 


8597 


22498 


A 


8659 


21 


136 


VPIEHDVPVPNSRAELLKMFGIDRDAIA 
QAVRGL.ITKA 


8598 


22499 


A 


8660 


1 


395 


LVTALYSLYIFTTTQWGSLTHHISNLEK 
KKKGKGKKKKRGGALFKGI FGGSHFFGV 
WELLFFFLKGGIKKTILGFLGKTLFFGG 
GLLGAPLPRKIKGLREKKNFKGELGVKN 
RVFFFFGNFSSLGVYLKKY 


8599 


22500 


A . 


8661 


1 


255 


RTRGRTRGRTRGLTRGKKKKKKIMKKKK 
KKKKKKKKGGAFLKNPWGGPI FWGLPKF 
YFLPNKGS FFNLIGDFLKRPFFWGGAYF 
G 


8600 


22501 


A 


8662 


136 


358 


FHI VKGVS I I KFKCVILKKKKKKKKKKK 
KKKKKKKKKKGKKLKKKKGGKKKKGGGK 
NKI FFWVGFFYKRVWGYF 


8601 


22502 


A 


8663 


198 


368 


LINLVQFTCILGTLLGILFIYLFLRRAI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, I^=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glntamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TMLARLVLNS W PQAVLLPQ P PKVLGl.QA 


8602 


22503 




8664 


407 


82 


KFLPHLGFPPFKRFFIKKEPFQWFFFLF 
GFFFFRRSFFLLPRVQCNGAISGPCNLC 
LNVKVRLS FKKKKKKCWD YTFE PLCPAL 
HFSFINKTEDCLFVCFTVYWPHGF 


8603 


22504 


A 


8665 


166 


293 


KI IAKHNIARTNPYTFCIMKKKKKKKKK 
KKKKKKKKKKKKKKGGAL 


8604 


22505 


A 


8666 


1 


164 


RAKAKRRNTTVS CRMRHLKI VYRR FRYR 
LYVG I I IGPCGLNETLVFTCKKITVP 


8605 


22506 


A 


8667 


295 


85 


YIYMSHICTLSLSLSIYKRVKCFCFFKT 
WSHS VAQ AGVQWRDLGS LQP PP PKW I LC 
ISVFFTVLFVMKKN 


8606 


22507 


A 


8668 


2 


131 


I N L P S TL I I FT I L I LLT I L E I A VAL I QA 
YVFTLLVSLYLHDNT 


8607 


22508 


A 


8669 


423 


107 


HCALPKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKK 


8608 


22509 


A 


8670 


432 


301 


SQILGRFSQKQHLSLGGGGCSHPRSCHC 
TPAWARGRLiRLKKF I K 


8609 


22510 


A 


8671 


1 


106 


LKLLTSSDLPASASQVAEITGMSHCARP 
LSRIYMC 


8610 


22511 


A 


8672 


383 


252 


CPANFVFSVEKGFLHVGQAGLELPTSGD 
PPGCWFCFLQQQLHW 


8611 


22512 


A 


8673 


560 


135 


SSWGMDSIPTSSNMEETQQKSNL.ELLHI 
S LLL I ES RLE PVR FLRSTFTNNLVYDTS 
DSDEYHLLKDLEEGIQMLMGRVEDGSHL 
TGQTLKQTYSKFDTNSHNHDALLKNYGL 
LPCFRKDMDKVETFLRMVQFRSVEGSCG 
F 


8612 


22513 


A 


8674 


2 


326 


SLLMCVHRCECVCMRACLCAGVCMCIAS 
CLGLPMNWECYTWRVLVFHQFQDEELH 
DTVDLETIPLERQPRDVQHPVSTRILYL 
HVYFVAVTLTL I R I LQLWTEAFSP 


8613 


22514 


A 


8675 


171 


272 


TQCCLLKAEKKKPKKKPKKKKKKKKKKK 
KGGPF 


8614 


22515 


A 


8676 


60 


409 


LVWLESHEDVGPLIKDSKQEKKKKKKKK 
KKKKKKKKKKKRGGAFKKKPGGGQNLRG 
GEKKNFFFLRGEKKNPRGNFGKKTFFWG 
GENWAKPPPKKKSPEGKKKILKGKGGKK 
YLNFLGE 


8615 


22516 


A 


8677 


106 


286 


L FWNKEGNNW KKKKKKKKKKKKKKKKKK 
KKKKKKKWGGALLKI SGGGP I FPGGGKN 
FFFFFGGGF 


8616 


22517 


A 


8678 


1 


97 


IIGQVASVLYFTTILiILIPTISLIENKI 
I>KWA 


8617 


22518 


A 


8679 


1 


141 


NEHQPGQHGEMPSLLKIQKISWAWWSMP 
VIPATQEAEMEKESQISIM 


8618 


22519 


A 


8680 


499 


140 


GVFFFSRRKRCGVSPPPPPKPVFFPLPG 
FFFWWGGCPSAPPPPGGGFPPNPPRGFF 
YPPPLKGNFFFPPPAGGGPPRGFFWAPP 
PPFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFS 


8619 


22520 


A 


8681 


60 


224 


KQTKYIKEITKHICTWQGRYHDHEGGFP 
RVKLIHCTPDMLTPVISPDVGNSTTV 


8620 


22521 


A 


8682 


252 


383 


SQXKKKKKKKKKKKKKKKKKKKXXGGGL 
LKNFWGGPNLTGGGKI 


8621 


22522 


A 


8683 


405 


145 


KFFFSGGCFFFWGPPGKNVPPPKKMFFF 
LIPPPKKKKPPPPKKIFFPPPGVF.FPPP 
PPFFFAPPPFFFFFFFFFFFFFFFFFFF 
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C=Cysteine, D=Aspartic Acid, 
E=Gtutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, " 
K=Lysine, L=Leucine, M=Methionine, 
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nucleotide deletion, \=possible 
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FFF 


8622 


22523 


A 


8684 


1 


406 


LYYFLTKSTTTYIiAVPQPFPPTPSTPSR 
AKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKRGGGPKKKTRKGGGGQKKKKGR 
GPEKNRGGGETTPGEKKKKKKKGGGGKK 
KKKKSGGGKKNGGKPGRGGGGGR 


8623 


22524 


A 


8685 


391 


265 


GRPFFFFLDGVSLLLPRLECSGVILAHC 
NLCLPGPSDSPSSA 


8624 


22525 


A 


8686 


2 


177 


DTALYSLY I FTTTQWGSLTHHINN I KPS 
FTRENTLMCIHLSPILLL.SLNPDI ITGF 
SS 


8625 


22526 


A 


8687 


479 


366 


GLPKCWDYRREPPHPAANFCIFSRDGVS " 
PCWPGWSCL 


8626 


22527 


A 


8688 


35 


161 


SNP P SSAFQVAGTTGVRHHDQL.TF FFFF 
FLGGTGQILCRGGD 


8627 


22528 


A 


8689 


1 


122 


THHINNIKPSFTRENTLMFIHLSPILLL 
SLNPDIITGFSS 


8628 


22529 


A 


8690 


2 


152 


NI ILAFTISLLGILVYRSHLISSLLCLE 
GIILSLFIIATLIYPTPSFSCY 


8629 


22530 


A 


8691 


541 


85 


IFLRQFLSFRLEYSGVIKAHCSLNLLGS 
SDPPASAFKVAGTTGIRHHAWLIFVFFF 
AETGFHYVAQGGLELLNLRWLPATAFQS 
AGI IGISHHDLLGIQITTWLLQKLSYLQ 
KKICKDQETGLLHAFHLHPYPYPMPSSP 
QLLQGPDILTNG 


8630 


22531 


A 


8692 


122 


38 


GRVDHSFMPIVLELIPLKIFEIGPVFTL 


8631 


22532 


A 


8693 


276 


77 


GVSPCEPVFFFILGRCNPPAWGVPKGRD 
FFVliSFFFWFYILLFFFFFFFFYFFLF 
SFYCFNGSNK 


8632 


22533 


A 


8694 


3 


70 


FMPIVLELIPLKIFEIGPVFTL 


8633 


22534 


A 


8695 


230 


3 


GSGIPSHPGHPSLPYHLEREWLAKTGIR 
DTSNSVHLLELCLRSQRHGRARWLTPVI 
PAL WEAEAGGSR R QE I EPI 


8634 


22535 


A 


8696 


401 


240 


PPPPPPPCGSEPRPPHPSLSPPGSREMF 
VLALSQESQMTFFFFQLRKKLHLYL 


8635 


22536 


A 


8697 


103 


313 


EKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKGGGGPKKKKFEAIFGKTP 
GGVSQGGGPPPLNFFFLNPGPPFF 


8636 


22537 


A 


8698 


354 


110 


KKGTLFEVSESFYTTEGVSAKINRIRSI 
VS VNKVARCIGE I FTDAVQS AFQKEVGG 
VSDSMVHKYEYAGCVDVKAVCSHGL 


8637 


22538 


A 


8699 




164 


FLVETGFLHDGQAGLKLLISGDPPASAS 
QSAEITGMSPHACNSSTLGGRGGRIT 


8638 


22539 


A 


8700 


191 


371 


PPDSRWAIQGGHLELTPTTRQDLAQAPR 
RKLALLEGSFQGQAQWLTPVI PALWEAE 
ASGS 


8639 


22540 


A 


8701 


2 


170 


ARAGL YHVGQ AG LK P LTSGD P LAS AS QS 
VRITGVSHRTRPKIIFKLIFFLFFFFFF 


8640 


22541 


A 


8702 


31 


229 


MEKYNVHPHSGILRSHEKEQAAL.FTIAK 
RWRQPSYPSIDEWINIMWSSHTVEYYTA 
MKRNKQLYSQ 


8641 


22542 


A 


8703 


3 


126 


LQELRDPTLTFRLL.GSPRPVWETRPVD 
DPTAPSNLYIQE 


8642 


22543 


A 


8704 


438 


31 


SEPCAGYNLLVCRFIiSPLEKHNIRVGVT 
RFSRCHLS PLSLTRKGNSLTP CTSRVRQ 
CLALLWLAHSALHPLSCMHCLALPSEMN 
PVPQMEMQKSPIFCIAHAGSCTPELFLF 
GHLGSTPLTHLLSLRINVFLRTSE • 
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C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H^Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


8643 


22544 


A 


8705 


429 


319 


PGAI IAHRSLNLPGSSDPPTSASQVAGN 
TGVSYCARP 


8644 


22545 


A 


8706 


426 


245 


RAATPSP I KLTKKKKKKKKKKKKKKKKK 
KKKKKKKKKKRGAPLKKTLGGPQ INRGK 
KKIFFFF 


8645 


22546 


A 


8707 


1 


108 


KHPRGGSFGYCFALAWVAFPLALVSGI I 
YIHLRKRE 


8646 


22547 


A 


8708 


3 


146 


QIEGHTI S ALGDGAAWPVQGi. I RHFRP E 
liEERMQRFAQQHQARQAAS 


8647 


22548 


A 


8709 


2 


S3 


LVLEIAVALIQAYVFTLLVSLYIjHDNT 


8648 


22549 


A 


8710 


1 


81 


RPLSKTVRFNVLKVTKAAGTKKQFQKF 


8649 


22550 


A 


8711 


156 


4 


RPLSFHPGGKKRLFLKKKKKKLCAVAHA 
CNPS TLGGRGGR I MRSGDRDHPG 


8650 


22551 


A 


8712 


440 


233 


TPSYFVLLVERGFSMLPRLVSLTSGDPP 
AS AFRSAGITGMSHHAQPRTGRS KAYLP 
FLEEPDLKHIFPF 


8651 


22552 


A 


8713 


92 


302 


AMHPTMGPQDQGHTGGLYAHTPLHPHSH 
THANTLIHSH I YAHAHSHTI PATCPKG P 
TQPYLCLLSPHIEL 


8652 


22553 


A 


8714 


2 ■ 


166 


KKINSFFSQGLTQAKVHGGRPSSLGPQT 
PVLKRSSCLSFPSSWDYRHEPLYPAN 


8653 


22554 


A 


8715 


1 


129 


GFRHVGQADLELLTSGDLPASASQIVRI 
TGVSPCAQHEFFKSN 


8654 


22555 


A 


8716 


381 


105 


RRSLLHS VLNGAQAG VQWRDLGS LQPPP 
PSSLPWPPKVPRLQPIiPGRHPVWEVRSV 
SARPP I VWDVRS PSAWLPSL.ES ESRLCL 
AAIPSGK 


8655 


22556 


A 


8717 


385 


125 


FFFFLPPAFLLSGXFWFFFFYI FSFFFF 
FFFLGSPPPPIFFFFFFFLNKIFFFFFF 
FFFFFNNLFFFFFFFFFFFYFFFFFFFF 
FFF 


8656 


22557 


A 


8718 


409 


264 


KTDQRWS STSSS KI MS QSQVS KGVDFES 
S EDDDDDP FMNPS S LRRNRR 


8657 


22558 


A 


8719 


3 


269 


KKHVKRVliSHLKAS CPEELLHFSSWPQL 
LELQLMGGGSHTHMDQHTHTHMHPHIYT 
HAHTHTVVAKLNELPWVEMNLLSLGAFQ 
QRIiP 


8658 


22559 


A 


8720 


430 


141 


LFFFliSFFEDGVLLCHPGWSWAQSQLT 
ATSTS KKKKKKKAT P PGI PKFL I GKSGK 
PPRVXiLIGHWAPFKFFFFNPRLAFF 


8659 


22560 


A 


S721 


386 


290 


RWGLTMLPRLVLNYWLQAILLLWPPKVS 
GLQA 


8660 


22561 


A 


8722 


3 


157 


HEVFLVETVFLHVGQAGIiELPTSGDLPT 
S ALWEAEWGGLLEDRS S GPAWET 


8661 


22562 


A 


8723 


493 


186 


FS PQGGE KRAI LGLRNL.CP P G VKD FS AR 
PPKEVGtJEGRVPKTKLFFVFLKKKGFPL 
IGRGGFKSRPWESPPRPPQKVGVQRLNP 
PPGPFFFFDLIWPNRNGS 


8662 


22563 


A 


8724 


3 


186 


GGGTPPFPGVWGKGNGGGTPPAKTIiPPF 
PSPLFFFSPCPS PGEGGSREPL.LS TPLA 
CICI 


8663 


22564 


A 


8725 


370 


67 


FAPPKNF YKGAP PIFFIFLFFILFIYFF 
FFFFIFFKFLVFLLETGFHHVSQEDLDL 
VIHLPQPPKWGLQAWATTPGQFFVFLI 
ETGLHPVNQDGHNLL.TL 


8664 


22565 


A 


8726 


966 


652 


LGSLKPPPTGFKRFSCLSLPSSWNYRHA" 
PPCPANFVFLEETGFLHVGQVSLELLTS 
GDPPTIASQSAGITGMSHRDWLGQHLIV 
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sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucinc, 
K=Lysine, L=Leucine, M=Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NHCYLFYQGHIKTTRSLLPPP 


8665 


22566 


A 


8727 


85 


425 


HMHSFAHEL.HTFLHTHTNSHTNSQNQSP 
TYTSKSHTHSYMRFTDSHDSQMQQTHRH 
SLLHMTQLYTCTHTHTHTRTVIPLHLHS 
VKGWLNTKVALGGRTSHGRESHIAGRLL 
A 


8666 


22567 


A 


8728 


393 


124 


LGSGEPQliFQSPPSGGPPGPKSHWNSWG 
NANPPVPPYIIESAPPPPHFPFGAQSKY 
SGGQAARLCPCPPFFLKKKKKKAPGSGT 
GTFSFQ 


8667 


22568 


A 


8729 


25 


217 


AVQPIRVQWRRSLQSPQEQIILAPSLAK " 

VDMEMTQLiTQENADFATRDRYHHSSIjVM 

REQLMPHY 


8668 


22569 


A 


8730 


1 


611 


PGIFYSALLSLDTSILNQLCFIMHRYRK 
NLTAAKKNE LVQKTKSEFN FS S KTYQE F 
NHYLTSMVGCLWTSKPFGKG IYIDPEIL 
EKTGVAEYKNSLNWHHPSFLSYAVSFL 
LQTSSSSSSSSSSSSSSSSSSSSSSSSS 
SSSSSPSPEERTVWVSSIRGKKWSWYLD 
YLFSQGLQGLKLFIRSSVHHSS I PRAEG 
INCNNQY 


8669 


22570 


A 


8731 


J 20 


378 


VEKTQAS I LGCKCS TPRAALG FRERTLP 
LRRCPCCGPPEAQMELEKRALGSQAQWL 
MPVFPALWEAEVGESLEARSFETSLGNI 
A 


8670 


22571 


A 


8732 


2 


68 


VYVCVCVCVCVCMCVCRAKVGM 


8671 


22572 


A 


8733 


382 


258 


NYHSSLHL.ETPGLKQSSCLSLPKSWDYR 
HESPCPALIFNSL, 


8672 


22573 


A 


8734 


479 


326 


PPLYGFFFFFFFWGGGFFFFFFFYFFFF 
FFFFFFFFFFFFFFFFFFFQTTF 


8673 


22574 


A 


8735 


422 


2 


GRRPPGIjYFFFFPPGKKNFQGRGGFLFF 
FPPKGFFFFFFPIGFFFFSSELGKDWPP 
PKKKVFSQRFPFFFFPPPLFLFLFFFFF 
FFFFFFFFFFFFFFFSFFFFFFFFFFFF 
FFFFFFFNKQVFIERliLCARHCFRPPAS 


8674 


22575 


A 


8736 


7 


75 


IAVALI QAYVFTLLVSIjYIjHDNT 


8675 


22576 


A 


8737 


2 


176 


NFGLLAETGFLYVGHAGLEVRSSGDLPA 
SASQRAGI TGVSTAPGFNRYFYKQTI YK 
YG 


8676 


22577 


A 


8738 


408 


111 


GSSPPPRAGGENFLKKNAGGKNFPGGEG 
GGGFYPLYPKKFFFSPKGFIFWGGGGGK 
WPPPKKGGFSKKPQKVFFTPPQKKKKFF 
WPPGGNWGPPKNF 


8677 


22578 


A 


8739 


5 


249 


YEGL.KKLHMQAS P FQRGH PVNHKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
EGALGGKFLAQTS FKGEKQKVFI ICY 


8678 


22579 


A 


8740 


1 


226 


F FL RQS LS VTQAG VQ WCDLS S LQ LLAFQ 
GS SNS PTS ASQVAG I TGVHHHIQL. I F I S 
LVETGFHHVAQAGLELNLQ 


8679 


22580 


A 


8741 


4 


96 


DGLIMLPRIATSNSWPQGILPPWPSKMLG ' ' 
LQV 


8680 


22581 


A 


8742 


19 


112 


FGRILLVREKKKKKKKKKKKKKKKKKKK 
GGGPL 


8681 


22582 


A 


8743 


362 


234 


KMDQKCLSEVLQRW F p cccccccc cc cc 
CCFSLPFIFEPSRS 


8682 


22583 


A 


8744 


3 


419 


LTLRRFQLNLTALTKKKKKKKKNLKKKK 
KKKKKKKKKKKKGGPLFKNSLGGPIFPG 
GGKFIFFFFWGGFLKPPGDFLKKTFFLG 
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W=Tryptophan, Y=Tyrosine, 
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nucleotide deletion, \=possible 
nucleotide insertion 














GEIFGPPPPKNLTPLGKKKI FKGVGGKN 
PPLFLRRKKFFSGGFFKKIFSPGLGF 


8683 


22584 


A 


8745 


444 


1 


KKPSKEILYPENSRFFSPLSPLKFFFSP 
KGFNFWGGGGPLCPPPKERFFSKNPPGG 
FFSPPLKEKIFFFPPPVNLGPPRDLFKG 
APPFFFFFFFFFLWEMFYWLSLSSGSQR 
LPQQVPTVEPSELGGMGAYFVSKRSTYL, 
QRGWRPER 


8684 


22585 


A 


8746 


104 


222 


NKTFCLLKKKKKKKKKKKKKKKKKKKNK 
KKKKKNWGGAL 


8685 


22586 


A 


8747 


403 


201 


MEKYWWQPHSGILYSHEKEQAALFPIA 
KRWRQPSCLSIDEWRNIMWSSIiTVEYYT 
AMKRNKQL.YSQ 


8686 


22587 


A 


8748 




90 


RTRGAVYAALERMGLDGCVEDLRSRLQR 
GP 


8687 


22588 


A 


8749 


444 


85 


SPPPPGLFFFFFFPKKKTSPPPTKKGFF 
SPPPPQKFFFFLKPFFFLGGLGPNFPPP 
KKNFFSKNPPRFFFFPPFKKKNFFFPPP 
FFFAPPKFFFLTPPPLFFFFFFFFFFFS 
FFFFFFFL 


8688 


22589 


A 


8750 


2 


309 


EFFPPYWEFLKINACMFSPEKKKKKKKK 
KKKKKKKKKRGPPLKKTPGGPQIPRGGK 
KKIPPLKGGQKKPPRGFLEKNPLFGGGP 
FGPPPPPKINPPEKKKNF 


8689 


22590 


A 


8751 


3 


76 


TASVSEGGGLQGITMKDSDEEEEG 


8690 


22591 


A 


8752 


433 


145 


SFFFFFFFFFSFFFFGKKSSFFTPL 


8691 


22592 


A 


8753 




124 


GHLIiMHLIGSATLiAISTINLPSTLIIFT 
ILIKKKKKGRPF 


8692 


22593 


A 


8754 


80 


201 


F I NANS KKKKKKKKKKKKKKKKKKKKKK 
KKKKGGGLLKKTPGGAQF 


8693 


22594 


A 


8755 


415 


! 


IYFPTPEKFGPPKETLKKGAPFFFFFKQ 
KFPFFGPGGKQKGGFGSLQTLPPGKKKI 
SRPNPPRKRGFKDAPPPPGKFVFLKKKG 
VFPGGQGGFKTPTPRDLAPPAPLNPGAIi 
FFFFLRQS LSLS LRLE CSGT I LAHAS 


8694 


22595 


A 


8756 


3 


136 


FTATR PGVY YG Q CS E I CGANHS FMP I VL 
ELIPLKIFEIGPVFTL 


8695 


22596 


A • 


8757 


80 

_ 


326 


KKFSLGSQGGRAGVFPRPPLGKLQTTAL 
NSGAKGT PPFL F PE PRKRGG P P PAPGWE 
GIFCPHFFPMGQKGEGGLWHTGEGPH 


8696 


22597 


A 


8758 




164 


SLISSTQGHKQCRRPQGPLPRKTRDPCS 
HVYLLTFP PLLS YD PAKS PS LRNTQE 


8697 


22598 


A 


8759 


431 


79. 


FFFFFIFIiKKVSTQSPPPQFYFFFFLKT 
FFLiFQPLIjKEGGSWGGRGDKPPPIiAPFK 
KWGVFFFFFSFFCCFVFFFFPLFFFFFL 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFPRCVXiTRL 


8698 


22599 


A 


8760 


60 


194 


ARPTCPATAVTQKKKKKKKKKKKKKKKK 
KKKKKKGGGVLKKKQGGRKY 


8699 


22600 


A 


8761 


383 


38 


CASMTFPKKKKKKKKKKKKKKKKKKKKK 
KKKK 


8700 


22601 


A 


8762 


399 


164 


HI TVKSLLVPMDDP PKKKKKKKKKKKKK 
KKKKGGGFKKNLWGGQKLTGEKKXI FFF 
LKGGKKKPLGIF 


8701 


22602 


A 


8763 


353 


48 


NFFFKVFFFLKDFSHKGAPPKKGGPPKK 
TPPRKKFFFLKIKPLFFIAFFFFWSLFG 
FFLIFSLFFFFFFFFFFFFFFFFFFFFF 
FFIVFTITliMHFKIYIjH 
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peptide 

sequence 


Predict- 
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ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8702 


22603 


A 


8764 


2 


93 


FAMLPGLVSNS W PEA I CLPW P PKLLGLQ 
MM 


8703 


22604 


A 


8765 


398 


63 


PPPPVFFFFSGGFFFWGGGGQKSPPRER 
CFFFKMHPGFFYIPPFVGKKVFFVLEGV 
VCPLGFFLSGGAFFFFFFFFFFFFFFFF 
FFFFFFGFFFFFFFFFFFFGFPSVSHL 


8704 


22605 


A 


8766 


3 


83 


ILE I AGALIQAYVFTLLVSLYLHDNT 


8705 


22606 


A 


8767 


450 


306 


QKKKKKRRNKPEGLGTVAHACNPSTLGG 
RGRWITQPGQQEGNFISKY 


8706 


22607 


A 


8768 


1 


111 


WSLALVAQAGAPGFKQFSCLGLPKSWDY 
RHEPPCPGL 


8707 


22608 


A 


8769 


385 


261 


FLAEMGFCRVGQAG FKLLNSGD P PAS AS 
QSAGITGVIAPVL 


8708 


22609 


A 


8770 


3 


293 


SLGSKPLGLLSLSPVKWFRIFGKERNKV 
WGKDGGTDRNQSSSAFKHLQRGDSDPKQ 
NKI KACS SKFYLRRCVKRS FL.L.I KKKKK 
KKKKKKJCKKKKKKKK 


8709 


22610 


A 


8771 


2 


281 


CVCVCVCARVY I Y I VYVY I C I YTYMCVY 
MYI CIYIHVYIHVYTCVYVYIRIHMYI C 
ICVCIYTCIYTYIYTHTYKHSVYYLHNF 
YINPKLLQS 


87i0 


22611 


A 


8772 


295 


161 


THTHTHTHTHTHTHTHTKTHTTCSLPPS 
LAQLLGNCCI KAAI S I LY I L 


8711 


22612 


A 


8773 


1 


318 


FFFFVRWSFTLVAQARVQWRDLGSLQPL 
PPGFKQFSCLSLLSSWDFRHTPPCLANF 
VFLVETGFLEVGQAGLELPTSGDLPALA 
SRGAGITGVSHHPQPPLCFLFL 


8712 


22613 


A 


8774 


3 


70 


FMPIVLELIPIiKIFEIGPVFTL 


8713 


22614 


A 


8775 


369 


191 


TYKINKIGWAWWCAPLVPAAWKAEVEGL 
LEPGSLRPASVIQQDPHLKKGTYSRVLT 
THL 


8714 


22615 


A 


8776 


2 


124 


GFRHVGRAGLEFLTSGDPLASAFQSAG I 
TGMNHHVQPIVE 


8715 


22616 


A 


8777 


] 


162 


liKYYTADENGKTSRLLPQRPSDECGAGV 
FMASHFDRHYCGKCCLTYCFIKPEDQ 


8716 


22617 


A 


8778 


346 


138 


YNSPPYKEKTIPLQARVNFGPPRDSLKR 
PPFFFFFFKRQSLTMLPRLVIiNSWAHGI 
LLPWPPKVQVLQA 


8717 


22618 


A 


8779 


2 


117 


LNLDTTAVQVRNYPRIRESYKVSFLSAli 
EEYTKKLNTQ 


8718 


22619 


A 


8780 


2 


97 


WDLTMLPRLVSNSWAQVI LPSW P PKVLG 
LLA 


8719 


22620 


A 


8781 


403 


276 


IISTVFQRGSCPIPSRKEVCSERPRRLQ 
KDSLTGFPHTVYKH 


8720 


22621 


A 


8782 


2 


119 


QAGL ELLT S SDL PAS ATQS AG I TGMS Q R 
ARPH FQCKNVP 


8721 


22622 


A 


8783 


245 


101 


ELQFKMRSGWRRSQPISWGLTLVPRLIS 
NSWAQVILLTWPPKVLGIQA 


8722 


22623 


A 


8784 


2 


124 


AKLEKKKKKTGRAKRRMQYNRRFVNWP 
TFGKKKGPNANS 


8723 


22624 


A 


8785 


3 


355 


VGPS E PGAG YNLLVCRLLR PLE KHG I R V 
GVTGFSRCHLSWLPLARKGNSPTPCASW 
VRHHLALLRLTVCGLHPLSCTHCLTSPS 
EMNPVPQLEMQKS PVFCVAHAGSCRLEL 
FLFGH 


8724 


22625 


A 


8786 


238 


9 


P VKLGAG P POAP F W LRKWAPL I KPFFKG 
PIGSLQIPLFFFLRQSLTLPPRLECSAT 
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N=Asparagine, P=Proline, 
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X=Unknown, *=Stop cod on, /=possible 
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WLTES STSWVHAI LLPQPPE 


8725 


22626 


A 


8787 


336 


10 


CLS LPS S WDHRRL P PH PAS F C I S 


8726 


22627 


A 


8788 


1 


147 


I FYLLETGFKY VGQAGLELLTSGDS PAS 
ASQSAGI TGVNHHALPE I HVF 


8727 


22628 


A 


8789 


3 


130 


DLEEGIQTLMGRLEDGSRRTGQILKLDH 
SSEFSKTRELYPVF 


8728 


22629 


A 


8790 


206 


3 


VPPSGP I KKGDKKKNLYLI FL I KNVPNK 
KLRKGVSPLAIPFFFFFFVFLLETGFHH 
VGQAGLELLTS 


8729 


22630 


A 


8791 


124 


390 


RAGAQSNLMEPPPSGVKAIFFPNFPKKW 
GNGTPPPAPLI FGGLKKKGVSPCGPGGS 
EPPAPGGPSPLAPPKGGNNGRDRLPPPP 
DKFLN 


8730 


22631 


A 


8793 


2 


301 


HEERERERERERERERERERERERERER 
EH ERERDAL.FA F FP PRG ARP P PS I ERG V 
FVGAGSVREIYPTLSIFFFSPPAHRGRE 
LRSRCVSHSLHLISRA 


S731 


22632 


A 


8794 


1 


285 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERGAPPP 
PLF FLGRGGALRAHAI S P P P F LGAP P Y I 
YIFFHMRARPPHRYLCAQRETRPAL 


8732 


22633 


A 


8795 


I 


321 


ARGERERERERERERERERERERERVSR 
RIPRGDLRDQFPHPLARSRDSFCGSADH 
FGRGVFNKVSWTDPPTHRVTSSLGGGV 
ERDLLTLSGGGTYAPWKNMCATGEDQ 


8733 


22634 


A 


8796 


2 


140 


HEERERERERERERERERERERERERER 
ERERERERERERESLPPPAGCAKGAGWG 
E 


8734 


22635 


A 


8797 


2 


223 


SARERERERERERERERERERERERVSR 
RNLSRGGGGVPPPLQNVRIHSGGPARGT 
L.HHI KKKTSLTDVGLiAQ 


8735 


22636 


A 


8798 


1 


176 


ARGEREREREREREREREREREREKISF 
LGGGRGTQFAIERGVIQFCGRRYTTGRV 
RN 


8736 


22637 


A 


8799 


335 


27 


PPTKFFFFFFLVEMGFCCIAQGGLELLS 
SSSLPASAYQS ART IGVSHHAWSVF 


8737 


22638 


A 


8800 


342 


191 


STSIiSLPKCWDYRREPPCPANLSYFFKD 
PFSKCNPRLiRYWGLGLQHRNFG 


8738 


22639 


A 


8801 


104 


332 


AVPLTMVKIQPLWKRVWRFHNKSKLELP 
CDPAISLLSMYPKEMKSLCQRDVCTPRL 
STGPLTIAGMWNPPKRSSMD 


8739 


22640 


A 


8802 


563 


258 


FFLNLETRSWYVAQARVQWLFTGTVTVH 
YSL.DLLASSHPP I SASGVAG I TDACCAQ 
LTLPFLKIjGLLGLGGGGVGYPSYFGLEG 
FHNDCSRVRVLCMWSYQ 


8740 


22641 


A 


8803 


2 


199 


ARGLLVRRFLSPSEKRSIRVGVTRFSRC 
RPSPLSLTRKGNSLTPCASQVRQCIiALL 
RLAHGACTH 


8741 


22642 


A 


8804 


266 


108 


KLWSKCVALPPSFSSSCSSHIGCACFP 
FTFHYDCKFPQASQAMLPEQAVEP 


8742 


22643 




8805 


1 


140 


KWWFKRPGVYYGQCSEICGANHSFMPI 
VLEL I PLKI FE I GP VFTL 


8743 


22644 


A 


8806 




90 


SSAAAEENNDEKKEEAEKTEDDMEFGLF 
D 


8744 


22645 


A 


8807 


229 


34 


KKKIFFFFPGFFWPPQKFFFKAPPPFFF 
FFFFFFFFFFFFFFLDLFIYLI ILSYTK 
LISIRAAAPI 


8745 


22646 


A 


8808 


43 


173 


NADSGHAQWLTPGMPALWEAEAGGSQDQ 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lso leucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
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EIETILAGAGKPRLY 


8746 


22647 


A 


8809 


259 


134 


I YM I FKNKFFNRDRGLPMLHRLVLNW AQ 
VILLSWPPKVLGL 


8747 


22648 


A 


8810 


391 


193 


VLPKI PGIQFPPPCRFKKRPRPRFKKPP 
PKRKKISFSNPPKIWPPQGYFKRGPPRL 
FYFFFFWIL 


8748 


22649 


A 


8811 


304 


3 


TTPPTNIFFCFLFFIFIFLWYFIFIFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
QCF I GHDDQKMM VHTHTQTSSHLLQGGS 
NPQGKGQS PGWVLWCA 


8749 


22650 


A" 


8812 


395 


268 


FLVETGFHHVGQAGLiELPASSDL.PALTS 
QSAGITGVSHHGWS 


8750 


22651 


A 


8813 


782 


1174 


LSRLFYFCVLFCLYMKTTQLPYFRGLVC 
LFVLRQGLTLSSRLECSGMIAACCSLNV 
PGS TDS PTS AS R WR TTG VRHH TQL I F V 
CFVEMWFHYVAQAGL.EPLGS S S P PALS S 
QGAG I TDVSHHTPLEbCF 


8751 


22652 


A 


8814 


95 


236 


ATMPGLKNIYFLKNKDKGLTML.PSLVLK 
S WARVI LL.P W P P KVLGLQT 


8752 


22653 


A 


8815 


381 


58 


SCLLFPPFWGARGGGPPRAGGSNPPGPP 
GLTPFPPKKPKNYWARGGGP FI P PPREG 
WAGGFFLPRRGRVPLAPGSNNL.ELGPLP 
S P PGYQKKTPFS kkkkkekkkrkr 


S753 


22654 


A 


8817 


441 


158 


FFFFLRTDGYLTMLPRLVSNSWPQAILL 
PQPPKMLGL 


8754 


22655 


A 


8818 


415 


342 


RLVLDSWAQEILLPWPPKVLGLQV 


8755 


22656 


A 


8819 


400 


171 


N FF FP PG VKFLGG GGPQFPPPQ KRG F FQ 
KTPGGFFKPPPKKKKKFFPPPPKIGPPQ 
GIFKKAPPFFFFFFFFFFFF 


8756 


22657 


A 


8820 


2 


268 


IN I I LAFT I S LLG I LVYRSHLMS SLLCL 
EGI ILSLFI I ATL I TLNTHSLLANIVPI 
AILVFAACEKKKKGGAVLKDPWGGQSLR 
VLiAR 


8757 


22658 


A 


8821 


— 


258 


LCLPNQAWAMAGSPPPASLLPCSblSDC 
CASNQRDSVGVGPSEPGVGYSLWRRFL 
S PSEKRS IRVGVTRFSSWVRWLRTVI PA 
T 


8758 


22659 


A 


8822 


250 


456 


GGGDKFGLIETFPPGL.KPFFL.LNr J LSGW 
ELGPLAPPPFKFCFFKGRGFPFLPRFVF 
VANLLLTCCKKRD 


8759 . 


22660 


A 


8823 


492 


376 


QENCLNPGGRICSELRSCLCTPAWATER 
ACLKKQNKTNQ 


8760 


22661 


A 


8824 


J 


116 


IiTPLPPSAPPSVDDNLKTPPEWVCSLPF 
HPQRMIISRN 


8761 


22662 


A 


8825 


2 


191 


FL.VETGFHHFGRAGLKLLNSGGALTSAF 
QRAGITGVITGVSHCFEVNRVWTGTQYF 
LCYILKS 


8762 


22663 


A 


8826 


393 


75 


PRFFPPPPPKIFFFPPPPFFFLGGFPPI 
PPPPKNFFFPKPPPGFFFPPPLKKKIFF 
PPPPFFSPPPNFFFPPPPPFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFSI 


8763 


22664 • 


A 


8827 


81 


380 


KI ITKHNIARTNPYTFCIMNYLKKKKKK 
KKKKKKKKKKKKKKKKKKKKKGGGPFLK 
NPGGGQNNPGGKKNFFPFFGGGKKNPPG 
FFKKKPFFGGGKFG AP P PPKN 


8764 


22665 


A 


S828 


354 


13 


TKPKTPPIiTKGPRRGFIiAPRVPGGGGPI 
TPPPQNFLGEKGTLARKKVFPPGRPAPK 
RPPRKKNGFEMRPLKKNGPGPWKFLGFF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D— Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IANPPGFGPKKJCKPENQTHKKKCQRRKP " 
RF 


8765 


22666 


A 


8829 


1 


215 


KHSTTHSHCPRTI KLKKKKKKKKKKKKK 
KKKKKKKKKKKKKKGGPPFKKTLGGPPF 
SPGGKKKFFFFLGGL 


8766 


22667 


A 


8830 


1 


127 


1 1, FFWQRRGLALL.HRLVSNS WAQVI LP P 
RPPRVLGLQVIYFK 


8767 


22668 


A 


8831 


264 


55 


FFFXFFXFFFFFFFFFLFVFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFF WCS CS 
YVLIFIQPFIKSFH 


8768 


22669 


A 


8832 


399 


85 


PPQKNFFFPQAPKISGGGGPQIAPPQKK 
GS FQKTPGGFFI P PQKKKKNYFPPPGKM 
GPPQGFFKRPPPLFFFFFFFFFFFFFFH 
NRHLFKVKHLQNS FHLVQLRL 


8769 


22670 




8833 


47 


288 


NCLYRTKKKKKKKKKKPPYKKKTLRGPN 
FPPAGPPAPLPLSGGEKKPSRGLLRRPP 
TLGGAAKGPPPPPKLTPLRKKKIF 


8770 


22671 


A 


8834 


1 


146 


INIILAFTISLLGILVYRSHLISSLLCL 
EGI ILSLFI IATLI TPSTLC 


8771 


22672 


A 


8835 


2 


79 


EDPKTSPKPKI IQTRRPGliPPSVSN 


8772 


22673 


A 


8836 


I 


156 


PTRPTriTPILLTLFLITQLKILNTNYH 
LPPSPKPIKKKKKKKKKKKKGGGP 


8773 


22674 


A 


8837 


152 


292 


AKTFNFYKVEFINVFFNGI.CLDI I KNTL 
PKKKKKKKKKKKKKKKKKKK 


8774 


22675 


A 


8838 


1 


122 


VASNS IHPKAKENTAPHTHTHTHTHTUT 
HTHTHTHLFMA IFK 


8775 


22676 


A 


8839 


2 


106 


HVGQAGLELLTSGDLPTSASQSVGITGV 
SHRAWP 


8776 


22677 


A 


8840 


364 


224 


APLCGRQICDCDCTYPSPHTYIHTHTHT 
HTHTHTHTHS 


8777 


22678 


A 


8841 


266 


1 


NSLSVEFLILFPPYNYSPKQCICSHQNC 
LNFVJVI KCVNTPHPNFLNLRMPLLAKRG 
AHACDPNTFGGGGRWITRSVDRDHPGQH 
NETP 


8778 


22679 


A 


8842 


343 


52 


KTGFRHVGQPGELIASSDLPASASQSAG 
I TG VS HRVR PGLH I LDNS S FLDTS FAD I 
FSVLYIiKAG IAS LLHI Y I HFLLPLRDLL 
LVSSLTVSFPTAV 


8779 


22680 


A 


8843 


256 


35 


LFIFSLSLQYIFCVCGCVCVFMSLCFLV 
CAWGHVQLHVCS CVRMCVCENDVCVCGW 
VQMCFCIKNFQKEVYQI 


8780 


22681 


A 


8844 


333 


11 


PPIKKKAPPPKGRAFFFFFLKKKKGGPP 
PPQKKTGGGGPKKRGGVKKPPPGFFPGF 
FGPLFFWGPPFLPPFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFNFISTKNLF 


8781 


22682 


A 


8845 


15 


288 


RCGLTVLPMLVSDCWAQVILLAQPPRVL 
GLQAGASASTPITSVNCSQGAI YPEVGT 
TGSTI IAAASSGGEFDSVLQNDICKCFL 
TQQFHF 


8782 


22683 


A 


8846 


1 


397 


ECAHHTQLIFLFL.IESSLHHVGQAGLKL 
PISSDPPTSASQSAGTTAVSHHA 


8783 


22684 


A 


8847 


1 


179 


ARERERERERERERERERERERERERER 
EREREREREREREREREREREKKKLGGG 
GGGPQFP IGGGLTQGGGGG 


8784 


22685 


A 


8848 


2 


233 


HEERERERDRERQRERERERERDRERER 
ERETFFSSSGGGPKETTLFFETGAPKEG 
YQNMHARLIPSAVPFKKRGRG 


8785 


22686 


A 


8849 


2 


180 


HEERERERERERERERERERERERERER 
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sequence 


Amino acid sequence (A- Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Gtutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon, /=possibJe 
nucleotide deletion, \— possible 
nucleotide insertion 














ERERERERERERERERDFVFYRWGEGHI 
DRQ YCS PLCPG WCHCPM 


8786 


22687 


A 


8850 


2 


79 


PEERERERERERERERERERERERERER 
ERERE 


8787 


,22688 


A 


8851 


1 


178 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERARL.FARGEK 
KKHTGGGPPLFYYLYI 


8788 


22689 


A 


8852 


3 


76 


TRRERERERERERERERERERERERESP 
A 


8789 


22690 


A 


8853 


5 


227 


EERERERERERERERERERERERERERE 
RERDFFFFFLEKKYLFSPIGELKGRGGG 
SLTPLLTGKWNPCPPPPH 


8790 


22691 


A 


8854 


1 


189 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RHSRLFPRESVCVARAGGASSPPRG 


8791 


22692 


A 


8855 


2 


271 


HEERERERERERERERERERERERERER 
EREREEPPRGEKYCIRVPPQKILSPACD 
S LLYVTHNLCVSHS T PRGER YVALFNSR 
ETRCF 


8792 


22693 


A 


8856 


2 


309 


HEQQQQQQQLRNLRDFLLVYNRMTELCF 
QRCVPSLHHRALDAEEEACLHSCAGKLI 
HSNHRIiMAAYVQLMPAIjVQRR I ANYEAA 
SAVPRVAAEQPGVS PSGS 


8793 


22694 


A 


8857 


44 


203 


GYSTCVGMHAHTHIHTDTHTHAHTYIHQ 
CLIQIGLCTFATCTSQVNKSSMAEL 


8794 


22695 


A 


8858 


329 


182 


IIFVCLVERGLHYVGQAGPELKASDDLP 
ALASQSAGTSGVSHCVSSKMS 


8795 


22696 


A 


8859 


1 


175 . 


GLTLPPRLGCHATITAHYSLNLPGSSNP 
PTS AS QVAGTTHS RP CPE I TQTALQRG P 
HD 


8796 


22697 


. A 


8860 


3 


350 


HEAIGLRSNI KNGLDHFLPLGTPTPLI P 
I LAM I ET I L.LL VQP I ALAVR LTANT TAG 
HLLMRLNGS ATLAI STI SL P STL I I FT I 
LILLT I LEIAGAL I QAYVFT LLVG LYLR 
DNA 


S797 


22698 


A 


8861 


139 


3 


TFKEAD I KGGGPPHLAS FC I FGRDGVS P 
CWPGWSRTPDLKRSTRA 


8798 


22699 


A 


8862 


3 


134 


SPTPPPSS KPSSI PRKSSVDLNQVSMLS 
PAALS PASS SQRHES 


8799 


22700 


A 


8863 


346 


219 


RLVPPCLDNFLNFCRYSCLTMLPGLVSN 
SWAQVIFLPQPPKD 


8800 


22701 


A 


8864 


1 


147 


GTRFFFERGIDHVGQAGIQLLTLGDPPT 
PASQRAEITGVSHHTWPNFIF 


8801 


22702 


A 


8865 


267 


132 


KKKFWPFFFFFLRDGVSLLLRLVSNCWA 
QEILLPWPPKLLGLQV 


8802 


22703 


A 


8866 


343 


215 


LAQGLKTPIPKRVPPHTQKKKFCSERPT 
RIQTDRSCWVSPLS 


8803 


22704 


A 


8867 






GTSKKIDKLDYIELKCFCTAKETISRVK 
RPVEWEKI FANS S PE KGLIS RINKE AKK 
IiNSSAIAAHNCNPSTLGGQVRRIT 


8804 


22705 


A 


8868 


1 


219 


GTRR P T W AT WQN PAS T KKKKNS F F FET E 
SRTVPQAGGQGGNLGSLQSLPPGLKRFS 
CLSMPGAPKGQSGSLMG 


8805 


22706 


A 


8869 


1 


181 


ATGG I LLLLDWS LAYE S KHLLEGAKSE 
SAEELKKRAQELEGKLNFLTKIHEMLQP 
GQDQ 


8806 | 22707 


A 


S870 


1 . 


255 . . 


AAAPPNAPGGPPGPQPAPSAAAPPPPAH 
AIjGGMDAEIj i DEEALiTS LELELGLHRVR 
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peptide 
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Predict- 
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correspon 
ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycinc, HNHistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EL1PELFL1GQSEFDCFSDLGSAPPAGSVS 
C 


8807 


22708 


A 


8871 


383 


233 


ATTFPNPGFLFVRLFVLVQWPDHEPPGF 
KQSSHFSLPSSWDYRHPPPPPG 


8808 


22709 


A 


8872 


211 


294 


AXKKKKKKKKKKKKKKKKKKKKNI I WGG 


8809 


22710 


A 


8873 


398 


100 


QKERPPGFPPFKPGPPQKRGGKGAPPKD 
GEHYFFWVLEKSTGFPPKTKRVYYPSPV 
YPHPPAPQKEGFPGGSPPPRPIASPFGG 
KKKEKKSYIEKENLF 


8810 


22711 


A 


8874 


3 


214 


DAWVLGILVYRSHLISSLLCLEGIILSL 
FI IATL I TLNTHSLLEKKKKKKKKKKKK 
KGGAFKKNPGGAQ I 


8811 


22712 


A 


8875 


2 


254 


PRVRPEGRNRTPQSWFQANPMACMTFSK 
KKKKKKKKKKKKKKKKKKKRGGGFIKKM 
GGGKNFREKEKKNFLLIRGKIKKTGGDF 


8812 


22713 


A 


8876 


2 


135 


TINLPSTLIIFTILILLTILEIAVALIQ 
AYVFTLLVSLYLHDNT 


8813 


22714 


A 


8877 


2 


296 


1ARGAEVL.GYGSHSRGRVPGALVGQGAG 
RLFTEHPGSSPATLAIYPTPEGTSVAVS 
ISAPPKARSRPYPPSRSCHNLSLAGSSV 
LVPPGPVHRWVGRP 


8814 


22715 


A 


8878 


164 


29 


RKIGPARWLTPVIPALWETEAGGSRGQE 
I E I I LANT VKPGACE VL 


8815 


22716 


A 


8879 


381 


219 


GEKMTEEEVHMLVAGHEDSNGCINYEGK 
R 


8816 


22717 


A 


8880 


2 


156 


ANGNSFATRLSNIFVIGNGNKPWISLPR 
GKGIRLTVAEERDKRLATKQSSG 


8817 


22718 


A 


8881 


3 


402 


SGFNIEYAAGPFALFFIAEYTNI HINT 
LTTT I FLGTTYDAliS PEL YTT YFKKKKK 
KGGRPFFFFFFLKKKKIPPPKKKNFGKK 
GETLKGRGGAPI FQPKKILSPPKKKKKK 
RGGAFKKS LGGAKFNGGGRN 


8818 


22719 


A 


8882 


1. 


406 


RYSTPSEGEVGERYSTPPGETLERYSTP 
PGETLERYSTPPGETLERYSTP PGETLE 
RYSTPPGETLERYSTPPGEALERYSIPT 
GGPNPTGTFKTYPSKKKKGGGRFKEPLG 
G PRFTGVGRVKVFSLWGGVLNAR 


8819 


22720 


A 


8883 


484 


307 


KKILNRRVR WLTPVI PTLWETEAGSSRG 
QEMET I LGNTVKP PASASQSAE I KGMR I 
IIF 


8820 


22721 


A 


8884 


1 


131 


GSLTHHINNIKPSFTRENTLMVIHLSPI 
VliLSLNPDIITGFSS 


8821 


22722 


A 


8885 


406 


106 


KGGAGPKIAPPKKKAFSPIPPPFFWPPP 
VFLKGPPPPS PFNFFPP P VFFFRGPLFF 
FFFFFFFCETWLPFRPSNLVFTLPHVF 
AWLIPIDQYLFISHSC 


8822 


22723 


A 


8886 


425 


325 


SSTHTHTHTHTHTHTERSHLLAFEPSAG 
ELWMAM 


8823 


22724 


A 


8887 


394 


278 


VCVCVCMCVYICIYPCWLPCWHMCI.ELY 
KVFKG IMKG V 


8824 


22725 


A 


8888 


2 


86 


YSRTRAAALIQAYVFTLLVSLYLHDSAY 


8825 


22726 


A 


8889 


400 


62 


FLYLKSFPPPFERGVFSPLPPQKFFFSP 
KALNFLGGGGPKFPP P KKRFFFKYP PRV 
FLYPPHKKKNYIFPPPDDFGPPRVFFKA 
PPPHFFFFFFFFFFFFFFFFFTRLEWVQ 


8826 


22727 


A 


8890 


3 


100 


SLEPGTSGLKGFSCLTLPSSEDYRHEPP 
CPAN 
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1 SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8827 


22728 


A 


8891 


32 


142 


VRPTRPGQVASVLYFTTILILIPTISLI 
ENKILKWA 


8828 


22729 


A 


8892 


3 


382 


TPHNPLSRPSYEPKPASTPSRAPKKKKK 
KNPKKKKKKKKK.KKKKKKKRGGALLKKT 
LGGPKLPGGKKKNFFFFRGGKKKPPGDF 
t.KKTLFIiGGGNLAPPPPKKKNPWGKKKI 
FWGEGGKKNFFFFG 


8829 


22730 


A 


8893 


368 


199 


KKKRI S PPSRKRAPQGDFLRPTPP I KNL 
YYYLSFFFLIIiYIFFFFFFFFFFFIFFF 


8830 


22731 


A 


8894 


3 


136 


TINLPSTLI I FTILILQTI LE I AGALIQ 
AYVFTLLVSLYLHDNT 


8831 


22732 


A 


8895 


1 


144 


IATINLPSTLIIFTILILLTILEIAGAL 
IQAYG FTLLVS LYLHD Y P YN 


8832 


22733 


A 


8896 


153 


375 


PQKKKKKKKKKKKKKKKKKKKKKWGGGF 
LKNSGGAPIFGGGGKFFFFFLGGGFKTP 
RGFFGKNPFFWGGKKKKN 


8833 


22734 


A 


8897 


332 


400 


WLTPVI PXLWDAEAGGSFEVRSS 


8834 


22735 


A 


8898 


215 


359 


TELVPNAVLFCPLPCSSGHHLLCAIHAK 
RVTIMPKDIQLARRIRGERA 


8835 


22736 


A 


8899 


414 


73 


PRLLCFHKKAWAMAGALPPAIjLiPPYSLiI 
SDCCASNQRDSVGVGPSEPGVGYNLWR 
RFLSLSEKRS IRVGVTRFSRCCPSPLSL. 
TRKGNSLTPCTSRLRQCLPLLWLAHGAR 
TH 


8836 


22 737 


A 


8900 


374 


88 


VSPPS PLKI FFS PKAFNFWGG VGPHFPP 
PQKRVLPQKPPGGFFSPPLKKKKFSFPP 
PVKLGPPKGS FKGPPPFFFFFFFFFFGL 
DFMYTGSSNVW 


8837 . 


22738 


A 


8901 


433 


56 


PEGPPPPGGFYPGKRGFFPTPLWFGGP 
KRDPPPRGKKNPSWRSGGKQTPRFPRDP' 
LLKAFFPGGPPQRLGKAQGPFWFFFGPK 
KGKTQTGPPFFFKKX.GDPWGPKKKPPPG 
GAPRLGPPKKKFL 


8838 


22739 


A 


8902 


3 


73 


IAVALIQAYVFTLLVSLYLHDNT 


8839 


22740 


A 


8903 


3 


149 


PNLS YI IGKDTWVEHWPEEDECQDEENQ 
KQCQDLGAFTESMWFGCPN 


8840 


22741 


A 


8904 


351 


137 


FPPEFFAAPKKSGIIjFFSKTWMEVEVLF 
LSQLMLGQKTQYCMFSLINGRLIIKAPG 
SLKGDHHPPLGLLEG 


8841 


22742 


A 


8905 


142 


5 


KIFLQKKKKKKGLGTVAHACNPSTLGGQ 
GGWIMRSRDQDHPGQQW 


8842 


22743 


A 


8906 


367 


244 


HVPQARAKFFC I FSRDGVS PYWPGWSRT 
PDLMICLLFIVEL 


8843 


22744 


A 


8907 


146 


248 


S Q I LG RLRQ EN CLNLGNG G CS E PRW CHF 
LiAWAKE 


8844 


22745 


A 


8908 


2 


197 


AAASTHHARLFLCVCLFAYLLVEMEFHH 
VAQAGTELLDS SNP P PLAS QNAGI IGMS 
HHTQPQYIC 


8845 


22746 


A 


8909 


1 


127 


KKTRLERAQWIjTPVI PAFWNAEAGTLLE 
ARSLRRLPWPPNVL 


8846 


22747 


A 


8910 


368 


280 


L.TMLLRLVLNSWAQAILLTWPPICVLGLQ ' 
A 


8847 


22748 


A 


8911 


355 


211 


QQQNLHLFWPGTVAHTRNSSTLGGQGMR 
I TGAYE FETTLGNTARPHL Y 


8848 


22749 


A 


8912 


325 


190 


KKKKKKGGGGPQKKKKNPP P P PTS KKKK 
KKKKKKKKKKKKKKKKKKKKK 


"8849 


22750 


A 


8913 


1 


411 


VNVPAPRGGAYRGRQASFSSGGLHPVPA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=G)ycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib)e 
nucleotide insertion 














YRPLCLPNQTTNSAMAGAPPLASLPPCS 
LI SDCCASNKRDSMGVGPS ETGTGYNLL 
VCHFLS PLE KCS IRVG VTR FS RCRLS PL 
SLTKKGNSLTPCTSRVRRCLALLRL 


8850 


22751 


A 


8914 


258 


9 


GAIPCLHCETNPSHISSTQELQTPEPQL 
PGVPLELPP PGS C FKCRKSGHWAKE C PQ 
PGIPPKPCPICAGPHWKLDCPTGPRL 


8851 


22752 


A 


8915 


326 


42 


EKIHRKYFCYSFFVFLVEMGFYFVGQAS 
LKLLASSDLPASPSQSARVKAMSHCVRP 
VLVI FKNKILQNTENSTKIYVYPPPNNQ 
ILTFCHFCFK 


8852 


22753 


A 


8917 


3 


106 


GLELPTSGDLPASASQNAGITGISHHAR 
PIMTYS 


8853 


22754 


A 


8918 


320 


171 


AASTTDGSYKCLCLPGYVPSDKFNYCTP 
LNTALNLEKCPFGLPHLSGSS 


8854 


22755 


A 


8919 


1 


201 


VKPSDRYHLMPIFTAAYPHQNSTYNVPV 
STRMVMAEEFKQGKRLSLCSDLYRKDLR 
TIVDPWSCAT 


8855 


22756 


A 


8920 


3 


182 


LLRPLEKRGIRVGVSRFSRYHLSRLPFA 
RKGNSPTPCASRVRRCPSLLHGLHPLSD 
KPQ 


8856 


22757 


A 


8921 


142 


1 


SCHPGWSAWRPQLLRRLRYENCLSLGG 
SGCSELGSCHCTPAWTTEG 


8857 


22758 


A 


8922 


1 


117 


LFFNYAWGWSTTLLSRLSLNSWVHVILT 
PWPPKELGLQA 


8858 


22759 


A 


8923 


357 


121 


LGAFSDGLAHLDNLKGTFATLSELHCDK 
LHVDLENFRLLGNVLVCVLAHHFGKE FT 
PPVQAAYQKWAGVANAIiAHKYH 


8859 


22760 


A 


8924 


343 


133 


LFFNFNFLETRLREWLIFVFSVETGFPI 
KRAGLKLQASSDPPARGSQSAEISGMSH 
YAWPCKSNRAFKCS 


8860 


22761 


A 


8925 


321 


44 


PDSGGS PAS ASQVAGTTGACHDAWLM F I 
FNMDININIPFSAKRDKLSMDKINEI IF 
SLL I KR I SS FQI M Y FLCS SR I ESRKRWL 
KQTGRKKE 


8861 


22762 


A 


8926 


335 


106 


HFVFL IETG FHR VGHAGLELL TS TDP PA 
LAYKSSGITGVSYRSFTKLHLCACVCLC 
IYTYLGLVLSFTPGSWEKKT 


8862 


22763 


A 


8927 


2 


160 


LSRVPPRSANLSISVFAEKRCHFTMLPR 
PISNSWAQGILLPSQAPKVLGLQA 


8863 


22764 


A 


8928 


3 


140 


RV I TEEE KNFKAFASLRMARANARL FG I 
RAKRAKEAAEQDVEKKK 


8864 


22765 


A 


8929 


126 


59 


NSGGGGCNELRS CHCTPAWATE 


8865 


22766 


A 


8930 


3 


104 


KNMAC YCRI PACI AG ERR YGTCI YQGRL 
WAFCC 


8866 


22767 


A 


8931 


311 


112 


MENYLSKMQQELEKNITRELKEAAAELE 
SGS IASPLGSTDESNLNQDLVWKASREY 
VQVLKKNYMI 


8867 


22768 


A 


8932 


345 


134 


GGGSSPWFPPFW R PR RADS LR VG VLNP P 
GPGGGAP FFLKN PL I GWGGGARLWFQL I 
LLRVRPENSFDPGR 


8868 


22769 


A 


8933 


262 


39 


RPRRPHPGNFFFFFNSGSHYVAQAGLEL 
LDSSDPTLPSQSAGTTGMSYQAWPLDHT 
FENRDCVLLHCSSPVSNT 


8869 


22770 


A 


8934 


84 


22 


TPAWVWWLTPVI PALWE AKAG 


8870 


22771 


A 


8935 


277 


111 


ER I INHAAGSHGVSG I FMKYDLS S LMVT 
VIEEHMPFWQFFVRLCGIVGGIFSSTG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQ ID 
NO: in 
USSN ' 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Clu famine R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8871 


22772 


A 


8936 


2 


106 


DS AI PRRLECLKEDVQRQQERE KELQHR 
YAEFVF 


8872 


22773 


A 


8937 


1 


193 


LMWDFSPSGLDGAFQRGYYWYYNKYINV 
KKGSISGFTMVLAGYMLFIYCLSYKELK 
HERLCKYH 


8873 


22774 


A 


8938 


240 


69 


DSGNQLKRSFSGRARWLTP VI PALWEAE 
VGESLEVRNIIYILQNMSNIGNIHLIIK 


8874 


22775 


A 


8939 


329 


177 


RWNLAMLPRLVLNS WAQAI CPSWPLKML 
GLEAEPSCPARTNFKVTSPRLH 


8875 


22776 


A 


8940 


358 


120 


HPSNFFVFLVF.MGFHHVGQPGLELFPAR 
YVPTLAFRSGGITGISHCAWPKEWALPR 
KAS PGLSGRRGS PHCATCSS KAW 


8876 


22777 


A 


8941 


333 


104 


PHVYSNGHICLS ILTEDWS PALS VQS VC 
LSI I SMLSSCKE KRRPPDNS FYVRTCNK 
NPKKTKWWYHGKQFEFVITF 


8877 


22778 


A 


8942 




241 


APLVAFTQVNLEDKGGLSKLVEAIRINF 
NDRYDEICHHWGGNVLGPKSGTRIAKLK 
KAKARELATNW I KCTLLS FLY I KR 


8878 


22779 


A 


8943 


1 


240 


APLVAFTQVNLEDKGGLSKLVEAIRINY 
NDRYDE I CHHWGGNVLGPKS VTRI AKLK 
KAKAKELATNW I KCTLLS FLY I KR 


8879 


22780 


A 


8944 


306 


150 


AASTLCPRLECSGAIMAHCSLNL1L1GSGD 
PPASASQVGGARGSLEPGRWRLQ 


8880 


22781 


A 


8945 . 


284 


2 


GKSRRSPLDLCTSGFTGSTHFTLIICWV 
PRYVATCPPIGLNFVFIKTWWSHYIVQA 
DLKLMVSSNPPVSASQSAGNTGMSHHTW 
LENRLLTTKK 


8881 


22782 


A 


8946 




226 


WSFALFAQAGLEFLTSGDPPASASLSAG 
I TGM S HHAQPAVS LNS I FS AS TFCATS S 
YCAKMELKELKFMLIVDAA 


8882 


22783 


A 


8947 


363 


96 


NTAPGYMPLLSIPFLVLSKAVFLFLLGM 
FTAALLI IAQRWEQPKCASTDEWINKMW 
HMHTMEHYSTCCGVHAAVNISIQISEWN 
CWEW 


8883 


22784 


A 


8948 


1 


208 


GRVGQPGLRLLTSGDLPASASQSAGVAG 
VSHRAQPTPALNCVLTCWVLQHYIKHNH 
GKRSEYFTIFNDM 


8884 


22785 


A 


8949 


69 


298 


SRGHQRVADAALLSFPDVI CLGLMLWGS 
SSIVCILHRHKRRVQHIHRTSVSPTSSP 
ESGATKTILLEQERTRSPRS 


8885 


22786 


A 


8951 


2 


201 


LYDLLNMRSISLPLPSLVCIHSHTHEHT 
YTHIHTHAHTYTHTHSHWVLSLKQKKMV 
KHRPRLSVWI 


8886 


22787 


A 


8952 


1 


285 


GASPPASLPPCSLISDCCASNQRDSVGV 
G PS EPGAG YNL WRRVLS PS E KRS IR VG 
VTRFSRCRPSPLSLTRKGNSLTPCASRV 
RQCLTLLWLVH 


8887 


22788 


A 


8953 


341 


256 


LGLGGGGCSEPRWHHRTPAWVTERDFIS 
L 


8888 


22789 


A 


8954 


227 


43 


ILFFI GKHTHTHTHTHTHTHTRAKCLFP 
PARFPTSPNSTTNCTFPTEPAANPRGLV 
RAGLG 


8889 


22790 


A 


8955 


310 


79 


RPRRPGLMERVNVFIFSILRESFYKIRI 
TNQAQCCMPIVPVTQETEAGRSLEPKSL 
RLAWAI QQNS I S KKNNNNNFF 


8890 


22791 


A 


8956 


374 


198 


FFFFLPNRHGLAMLPRMVLHWVQAVLPQ 
PPKVLGLQG 


8891 


22792 


A 


8957 


138 


10 


RPRRPRLCQAQWLMPIIPALWESVAGGL 
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SEQ JD 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A- Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=fsoleucine, 
K=Lysine, E=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutnminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LEARSLRPAWFFFNT 


8892 


22793 




8958 


354 


216 


AGPRVFPGGEGGFRPPPPGGSPPQGPPK 
GGESGGKPPPPAHWVFF 


8893 


22794 


A 


8959 


364 


223 


LQAARSKKKKKKKKKKKKKKKKKRGGGK 


8894 


22795 


A 


8960 


259 


11 


FEALKDPSNNLQSLFSFIVTVLTVAHVY 
EKLSTLTSLAARRGLAMLPRLASNSRTQ 
TILIiPWPS KVLRL QARATE PGLGLLL 


8895 


22796 


A 


8961 


334 


213 


AASTKNFERMEWGLTTLPMLVLNSWPQV 
ILLPWPPESMRL 


8896 


22797 


A 


8962 


260 


3 


KGGFFPIPFIGIKPGFFFSPALIFPKVF 
I FSQNFFVFLKKI FFFFFVETVSSHWK 
AGLELLASSNPPAFTSQSAGMTGMSHHA 
W 


8897 


22798 


A 


8963 


425 


248 


ADEWIHKMRH IHM I EYYLAI KRNEMLIH 
ATAGMHIiENILSEKDHMYBS I YMKCLAW 
ANL 


8898 


22799 


A 


8964 


1 


118 


GKLMELHGEGSSSGKATGDETGAKVERA 
DGYEPPVQESV 


8899 


22800 


A 


8965 


2 


]30 


GRVGPRMALNSWAQVIHLPWPPKVIjGIjQ 
VAMKRKIMILSTSY 


8900 


22801 


A 


8966 


119 


364 


LGWYLIEKIQICCSPCLGADPFQQMISP 
L.RDGSLTVLPKLVLNSWPQVFLPPQTRK 
VLGLQAEVTAPGHI I FMI IMINTFK 


8901 


22802 


A 


8967 


238 


333 


TRCLRLLPRLVLNS WTQALLLPS PPKVL 
GLQV 


8902 


22803 


A 


8968 


297 


169 


NPPPPHIYKFFFIFNIFFFFFFFFFFFF 
FFFFFFFFMSKILIF 


8903 


22804 


A 


8969 


2 


212 


RVRPRDRPLVRVRVGRNKDGAETTPSPG 
LLPAHLTFPLDYHLNQPFIFGLRDTDTG 
ALLFIGKNMDPRGP 


8904 


22805 


A 


8970 


242 


357 


TLDI IKS FKKKKKKKKKKKKKKKKKKKK 
KKKKKK KKKKKKKKKKKKKKKK 1 1 


8905 


22806 


A 


8971 


191 


265 


ILMPXKKKKKKKKKKKKKKKKKXKR 


8906 


22807 


A 


8972 


398 


97 


MFLYNRILWFFSPHPPLRFFFSPRATIF 
LGGGAPHFPPPKKGFFSKKPPGVFLSPP 
QKEKIFSFPPPLFIiAPPKDFFKSPPPFF 
FFFFFFFFVSTGSSVS 


8907 


22808 


A 


8973 


370 


265 


YNSSLPSSWDYRKVPPCPDNFCI FSRGW 
SQTPDLR 


8908 


22809 


A 


8974 


3 


119 


GLKXLTSSDPPASASKTAGITGVSHHTQ 
TIFVFSSHKS 


8909 


22810 


A 


8975 


1 


110 


VAQAGFKLVSSANLPTLASQSARVTGVS 
HRARSMKQ 


8910 


22811 


A 


8976 


3 


152 


WNQPKYPSTGEWMEKMWYIYTKLVQPMA 
HGPHVAQDGFEGSPTQIHKLS 


8911 


22812 


A 


8977 


375 


193 


FQDDPLGSTSNISFFKRWSPAMVPRLIS 
NS WPQE I LLPQTPKVLG I TGMSHCTWPG 
GHISI 


8912 


22813 


A 


8978 


2 


395 


RAALPTQ AS AMAGAP P PASLP P CS LS S D 
CCASNERESVGVGPSEPGAGYNLLVCYL 
LRLLEKRSIRVGVTRFSRCHLSPLSLTR 
KGNSLTP CAS RVRQCLALLWLTLS AL YP 
LS CTHCQTLPS EMNLVPQb 


8913 


22814 


A 


8979 


120 


19 


ARVQWHDHGSTATLNSWPQVILLPWPPK 
VLGLQA 


8914 


22815 


A 


8980 


204 


48 


LNRDKGLTMLARFVRNSWPQAI PLPWPP 
KVLGLQALATAPCQMS FLRAPLS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


IM 
eth 
od 


SEQ U> 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alaninc 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V-possible 
nucleotide insertion 


8915 


22816 


A 


S9S1 


335 


221 


FFFLFLFFCMITSHCSVNVLGASDSPTS 
VSHIAGTIGT 


8916 


22817 


A 


8982 


360 


150 


LSHMPLLFVSLYPDHRAFMQVIFFFFET 
ESPS VTRLECSGAI LAHCNVCLPGS SDS 
PASASYAGYSFYL 


8917 


22818 


A 


8983 


346 


116 


GQGRGVETESCSVTQAGVQWCDHHSLLS 
L ELLG S S DPS TS AS P VTG TTGGHYHTS S 
HFLFSVSSSYFPTPVCLAYR 


8918 


22819 


A 


8984 


350 


263 


LPMLPRLVSNAWPQAIFLLWPPKVLELQ 
A 


8919 


22820 


A 


8985 


3 


159 


EEYNLLVCRLLSPLEKCSIGVRVTQFSR " 
CHVS PLS LTRKGNSLTPCAS WVR 


8920 


22821 


A 


8986 


2 


253 


KHWGELKNDLS I CGTLHS VDRYUSTI KL 
TNISVIDSEKYPHMLSVKNCFIQGSWR 
YMQLPADEVHKQLLQDAARKEPLQQKQ 


8921 


22822 


A 


8987 


2 


115 


GAR I VGHLTHALKQGE YGKAS I CNGGGG 
ASAMLIQKL 


8922 


22823 


A 


8988 


3 


159 


GFYHIGQAGLKLLMSGDLPASASQGAGI 
TGLSHCAQAGIFFFLPYDIFTVF 


8923 


22824 


A 


8989 


1 


123 f SHYVAQAGPKLLGSSNPPASASRSAGIT 
GVSHHAWPPSLFL 


8924 


22825 


A 


8990 


462 


303 


LARTPSRPTRPPTRPPTRPPTRSPTRPP " 
FMLSVDCliLLY 


8925 


22826 


A 


8991 


382 


178 


PPFFFYFFFFSFFFFFFFFFQKGFKGFF 
PQKFFFFFFGGKPVKNFQKFFLRAKGKG 
GEKKPPQNKKVF 


8926 


22827 . 


A 


8992 


532 


432 


RDAGLTMbPRLVSNS WAQVLLLPGPPKV 
LTLQV 


8927 


22828 


A 


8993 


418 


172 


KKGKKKKKNIFLIFFNYFFFLIFFFIFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFGGAWSLFRGGDSGEVKGTDSMS 
RQCFQGF 


8928 


22829 


A 


8994 


2 


722 


AVRLNISYPPQNLTMTVFQGDGTASTTL 
RNGSALSVLEGQSIjHLiVCAVDSNPPARL 
SWTWGSLTLSPSQSSNLGVLELPRVHVK 
DEGEFTCRAQNPLGSQHISLSLSLQNEY 
TGKMRP I S G VMLGAFGGAGATALVFLS F 
CI IFWVRSCRKKSARPAVGVGDTGMED 
ANAVRGS AS QG P L I E S P ADDS P PHHAP P 
ALATPS PE EGE I Q YAS LS FH KAR PQ YPQ 
EQEAIGYEYSEINIPK 


8929 


22S30 


A 


8995 


498 


323 


EPKAELNEGDIAWHIKREEGWFKGTLQ 
RNAKTGLLPGSFVENI 


8930 


22831 


A 


8996 


21 


511 


LI IDGVEAYAXiNASGWNI IVFDPKGWA 
LFRSFKAVKSKLDTRRGSNSELETAVKD 
LGKAVSYKGMYGDVAIWYSGQYVENGV 
KKNFLPDNTMVLGNTQARGLRTYGCIQD 
ADAQREGINASARYPKNWVTTGDPAREF 
TM I QSAPLMLLADPDE FVSVQLA 


8931 


22832 


A 


8997 


178 


306 


gainainwppfnnvgmgktlgfpnEmal 
kqipfspvgektkrr 


8932 


22833 


A 


8998 


351 


211 


SIQSLRMQPVPCYISGRHHHYHSHHHIH 
HHHYSSYFLKYPQEYLRPL 


8933 


22834 


A 


8999 


1 


381 


FRLFKFALMELiCS I EG I SDQPLFFVFGF 
FFFGKEAS FGPQGGREGGKFKLMEPLAK 
GVKGISCPKPPHPCLNPPKPKRGGESSP 
PEGPKKTGAPGPAKKGLFFLTKGKNPWP 
G PGKTRKKAVFPKN F 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8934 


22835 


A 


9000 


173 


399 


IKSEWAAKKKKKKKKKKKKKKKKKKKKK 
KKI FFFGGGRFFFFWGGDYFWGGGGVKK 
KGGGKKNPGPQKKNGGGKK 


8935 


22836 


A 


9001 


2 


81 


E FTPAVHAS LD KFIiAS VSTVLTS KY R 


8936 


22837 


A 


9002 


3 


835 


DAVRANDDLKENIAIVERRNNLLQAELE 
ELRAVVEQTERSRKLAEQELIETSERVQ 
LLHSQNTSL INQKKKMDADLSQLQTE VE 
EAVQECRNAEEKAKKAITDAAMMAEELK 
KEQDTS AHLERMKKNMEQT I KDLQHRLD 
EAEQ I ALKGGKKQLQKLEAR VRELENEL 
EAEQKRNAKS VKGMRKSERRI KELTYQT 
EEDRKNLLRliQDbVDKLOLKVKAYKRQA 
E E A E EQ ANTNL S KFR KVQH E LiD EAE E RA 
DIAE SQVNKLRAKS RD I GTKGLNE E 


8937 


22838 


A 


9003 


429 


280 


RGFRGHKFLEPPGFPGFPPIMGPPSPPF 
GSPPQKNFPLQPKPPLGFKPPP 


8938 


22839 


A 


9004 


2 


108 


LS AYQGTPLPAN I LDWQALNYE IRGYVI 
IKPLVWV 


8939 


22840 


A 


9005 


3 


158 


SSTQGHKHCGRPQGPLPRKTRDLCSPVY 
LLTFS PLLS YDP VTS PS PRNTQE 


8940 


22841 


A 


9006 


3 


370 


LLLTGLNRLTTDLYSLYI FTTTQWGSLT 
HHINNIRPSFTRENTLMFIHLCPILLLS 
LSPDIITGIFSYTRHYWSSTYLETNER 
GGSYKNRLGGPESKGIjRIKDDSIjCWGPIj 
HDTSAYYENK 


8941 


22842 


A 


9007 


363 


76 


KLFNVGGGGLDKPPPPKKFFFFKKNGLF 
FFYPLKXKKNFFFPRELLWGPPRFFIKT 
PPPYFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFVMIS 


8942 


22843 




9008 


1 


87 


KMLKKPKFELGKLMELHGEGSSSGKATG 
D 


8943 


22844 




9009 


420 


3 


RFYFPKPRKRGGGGVFFFFPPKKKFFFL 
NPPPFFFFPPPKKKKNIFFPPRKGGRGS 
KVQTMGPPLFFFFFFFFFFFFFLFFFFF 
FFFFFFFFFFFFFFFFFFFLRETSSNNV 
LiAYQ I ENS LLKLDLQRQFCL PL PGRV 


8944 


22845 


A 


9010 


2 


74 


LHRLGGPEAGLGEYLFERLTLKHD 


8945 


22846 


A 


9011 


33 


132 


IDPRCRNSARAPLSLAGPHPGMGDASNH 


8946 


22847 


A 


9012 


83 


607 


RRSGTPCGYNVTSQNGTIYSPGFPDEYP 
I LKD C I WLi I TVP P GHG VY I N F TL LQTEA 
VNDYIAVWDGPDQNS PQLGVFSGNTALE 
TAYSSTNQVLLKFHSDFSNGGFFVLNFH 
AFQLKKCQPPPAVPQAEMLTEDDDFEIA 
ITGEGKPTRMPQSRKKSCHIISVLiISAK 
LNRAFL 


8947 


22848 




9013 


483 


364 


PPPPPPFLNPAPGIFFPPPLGGIGARPP 
PPS FFFFFFFFF 


8948 


22849 




9014 


74 


180 


LLSTYVGRLSARPKLKAFLASPEYVNLP 
INGNGKQ 


8949 


22850 






488 


317 


HVGQAGLVLLTSGDPPTSASQSTGITGV 
SHRAQPLRSFSFVLSLLQKRIVLC 


8950 


22851 


A 


9016 


149 


13 


DERGGLQAVAHTCNPSTLGGRGGWITRL 
GDGDHPSCRILAGDASQ 


8951 


22852 


A 


9017 


20 


107 


FDSTADSDQVNP1 QGLASKWDYEKNEWK 
K 


8952 


22853 


A 


9018 


448 


239 


VDHQQVI WNRERI SNS QNG IVKEI KGAD 
TFIFGHTPAVKPLKFAKTQMYIDTGAVFC 



1091 



NSDOCID: <WO 0164835A2_I_> 



WO 0 1/64835 



PCT/USOl/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ LD 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylnlanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
M=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GNLTL I Q VQGEGA 


8953 


22854 


A 


9019 


3 


209 


CPPLSPG CTNSARAE PSVHAL PDALNUL 
RTHEGSGDGPSSSVDWNRPEDVYPQGIY 
VISAPSIYAREA 


8954 


22855 


A 


9020 


504 


20 


NYAQMS YS TARAS ANES WAY FMG RRKFV 
ASRQASQMFLCWLEEAIVRRVVTLPSKA 
RFSFQEARSAWGNCDWIGSGRMAIDGLK 
E VQE AVML I EAGLS TYEKE CAKRGDD YQ 
EIFAQQVRETMERRAAGLKPPAWAAAAF 
ESGLRQSTEEEKSDSRPAAGS 


8955 


22856 


A 


9021 


3 


168 


LPRLEAAARIRHEERERERERERERERE 
RERERERERERERERERAPPPTLCET 


8956 


22857 


A 


9022 


2 


89 


LCSLVYLLTFPPLLSCDPAKSPSARNTQ 
E 


8957 


22858 


A 


9023 


508 


371 


FFCRDKVIiATLPRLLANAWPQVI FLPQP 
PKVXiRLQPCTTPSliKDL 


8958 


22859 


A 


9024 


2 


170 


RTHS AGRP YQCNQCEKAFRHS S S LTVHK 
RTHVGRETIRNGSLPLSMSHPYCGPLAN 


8959 


22860 


A 


9025 


3 


185 


PPAATPQGPSPINSPPTKKKAKNKNRTK 
KTKKKKKPPPSKKKGPQKTPKKTMGGPL 
KTTT 


8960 


22861 


A 


9026 


104 


319 


LVFLSKYTPPLLVSLWIHGLSLLSFLPS 
LPSFLPSLLTYLLPSFLPSFLTSVINPL 
PSLFFLVFKLSQFWT 


8961 


22862 


A 


9027 


476 


364 


SLKRLTSGDLPASASQSAVITGVSHRAR 
PIIYFKYVQ 


8962 


22863 


A 


9028 


36 


154 


FLVGQAGLKLLTSGDLPASGSQSARITA 
VSHRTWRTHYF 


8963 


22864 


A 


9029 


2043 


1237 


WLiRTRVAPALPERLGACTQLGPVLPCSQ 
PYWCRQCPEYRRQAAQPPHCPAPEGEP 
GAPQALGDAPSTSVSLTTAVQDYVCPLQ 
GSHALCTCCFQPMPDRRAEREQDPRVAP 
QQCAVCLQPFCHLYWGCTRTGCYGCLAP 
FCELNLGDKCLDGVLNNNSYESDILKNY 
liATRGLTWKNMLTESLMALQRGVFLLSD 
YRVTGDTVLCYCCGLRSFRELTYQYRQN 
I PASELPVAVTSRPDCYWGRNCRTQVKA 
HHAMKFNHICEQTRFKN 


8964 


22865 


A 


9030 


15 


389 


IVDHYPCGGIPTSGSVTQAGAQWHNLGP 
LQPR P LGLSGSSCLS I. ICS WGHRHAPPH 
IiARGH F SS E VEQLRSRLQHGAVF VKQR K 
GTPEDPRQGLRTLISSCLLHLVALiALWF 
SLPFHGPRIHMQ 


8965 


22866 


A 


9031 


3 


243 


RHSLAFNRFSCLSLPSNWDCRHPPSCPA 
KFCTFVEMEFHHVGQAGLELLTSGDLPT 
LASQSAGITGVSHHAWTRCCCCF 


8966 


22867 


A 


9032 


1 


245 


PTRPAGIAICICMCMKNHRATRVGILRT 
THINTVSSYPGPPPYGHDHEMEYCADLP 
PPYSPTPQGPAQRSPPPPYPGNARK 


8967 


22868 


A 


9033 


46 


124 


QE LMTHG AKS PDGTRVHNAFL FVTTV 


8968 


22869 


A 


9034 


235 


435 


KKKKKKKKKKKKKKKRRPFKLKGPFKPP 
LKPQGGLFEAPPFGGPPPPGFFFAGGGA 
PPRGFFKKKGK 


8969 


22870 


A 


9035 


333 


70 


CPIYQIPILKDQVNLSSISSPFSHLIKD 
LWKTLFISYITELI FVCFVEMMFHHVAQ 
AGLELLGSSNPPTSTFQSARITGISHHA 
CLNI | 


8970 


22871 


A 


9036 


297 


411 


NKTMSFAATWMQLEVILSKLMQEQKTKY | 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: Of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
2 


Predicted 

beginning 

nucleotide 

location 

correspond 

amino acid 
residue of 
peptide 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=VaJine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, / = possible 
nucleotide deletion, V=possible 
nucleotide insertion 














CMSSLISGS 










461 


167 


NKNYPLLGGPPFGSPSPENLNGGFPLTP 
GGGGPINPNSPPPPPPGGRRPPFLKKKK 
KKKKEKETNTYILQDTCDRCSAVLPIMQ 
VKI VM I CTj I MHRL F 


8972 


22873 


A 


9038 


466 


292 


NGCLLEPASWNSLANIFIFCREEGLTML 
PRPQVILPPLPPKVLRLLPPPLSS 




22874 




9039 


34 


124 


GIRNIKFSSLSTFIAHKKYYCSSHAAEH 
VK 










54 


239 


RRRRKLRRWGLAMLPRbVSNS WHQA I LW 
P PKVLR LQACATTPGLTRRWLHGQI WS P 
GAGSV 












223 


P WARVCVCVCVS VCV CACVRVCARAS E S 
QTCAAV P S LGPAGAGR P ALGC CAVCEDE 
VGSCPHARPL.PRAAVDKAAGRRGLCF 














SPGWAHSYNPSTLiDGRGSHITRVRKFM 
TCLATEGAV 


8977 


22878 


A 


9043 


144 


360 


IHSHWGYWLDVSLIAAHLAINPDCFMYV 
Y I FSQCTDKLTEKEFI EGTLAN KEILRIj 
IQFEPQKVKEKMKNA 












263 


RWNSDGKGSLRPAWATWPDPVSIKNTQN 
THTTHPTPHSPHTPHGQQPFLSPTPFPW 
PVSSPSPFPPFSHTCLiSCLLPPGA 


8979 


22880 


A 


9045 


461 


287 


PPSLANIYLFIYLFIGRDGVSLFPRLVS 
NCWAQAI LLiPWLPKVLGLQGE P PNLALK 
SL 


89S0 


22881 


A 


9046 


4S5 


6 


NNKNS WAW WRVPVL.PATREAGAGE SLES 
GRRRLR 


8981 


22882 


A 


9047 


475 


307 


LFLRDGSLTVLPRLVSNSWPQAILLPWL 
LKGLGLQARATLHMACSSCINSTQNILiQ 


8982 


22883 


A 


9048 


483 


338 


LREGERQREQERERERERERERDRERAH 
TTAFTQLLFQYVAI ILLPCY 


8983 


22884 


A 


9049 


680 


303 


NDL.LFGSLYVKQFAASGTGRVTLPEPSE 
EQWCLRCFQDLFGDDHNKNGFKMFDSS 
NYHGQDLLFKDATVRAVPVGEKTTYRGW 
LGLDYVAALEGMSSQQCSGAGRTGPPCR 


8984 


22885 


A 


9050 


26 


165 


AFGYHRVGQAGLELLTSGDPFASASQSA 
G I TGVSHHARSQYFFVYV 


8985 


22886 


A 


9051 


461 


83 


HKINLPSHKKKKKKKKKKKKKKKKKKKK 
K 














MAAPPKKKKKKKKKKKKKKKKKKKKI 


8987 


22888 


A 


9053 


531 


404 


S VTQAAMQWQ KHN S LQGS TLE FKQ S S LL 
SLPTTWDYKHADLS 


8988 


22889 


A 


9054 


66 


416 


KKKTNKIKMGGGPLPFKISPPFLKKTLS 
WKKGAPKTPPLEMAPFFFPQGGKKNS PF 
KGGKIFFFFFPKKTFIKGPPGGEHTPPL 
GGGPQIFSLPPPKGVPQKNPPPLLRGGL 
FFTI 






ft 








I IoA VNGE T IjE G V TH E QAV AJ L KHQ RG T V 
TLTVLS 


8990 


22891 


A 


9056 


3 


106 


LKRWGIAMLTGLVLNSWAQVILPSWPPE 
VLGLQV 


8991 


22892 


A 


9057 


1 


299 


PTRPINIILAFTISIiLGILVYRSHLISS 
LLCLEGI ILSLFI IATRI YPIAPPLPPL 
EPEKKKKKKKKKKKKKKKKKKHKKNTKK 
RK I KAKR KW1.GGGGL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ TD 
NO: in 
USSN 
09/515,1 


Predicted 
beginning 
nucleotide 
location 


Predict- 
nucle- 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


K=L*ysine, L^Leucinc, M=iWethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y= Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8992 


22893 


A 


9058 


482 


346 


LTSSKKKKKKKKKKKKKKKKKKKKKK 


8993 


22894 


A 


9059 


2 




YFLFAYTILRSVPNKLGGVLALLLS ILK 
KKKRGG P FKRTLGG PKFNRGGQGN I FFL 
MGGGLKPNLGILGRNLSLGGGKNWPNLP 
PKIKGFRENKNF 














PGKIWPPQKIFKKPPPPFFFFFFFFFFF 
PKFFFFFFFFFFFFFFSIRNPPSSYTKW 
LRSLDVCHLLPPITPQVPGPTPVLLFST 
LPDCNHSLL 


8995 


22896 


A 


9061 


2 


274 


FT I S LLG I LV YRSHL I S S LLCLEG I I hS 
LFIIATLITLNTHSLLANIVPIAILVFA 
ACEAAVGLALLVKKKKKKGR PFKRILGG 
PQINPG 


8996 


22897 


A 


.9062 


42 


255 


QEFRTRRERERERERERERERESSGWR 
AKFRSNFSAGAIAHTIRVMLPPSGIYSN 
EKSINKCGFALLFF 


8997 












YALRNSARAGDPHGQLGRRDGAVGGRGW 
LMPRE ALHPW WANVQAWAGS PGLFLGAP 
WPQGGVWLPWLCVSMSASSSSINGLPPQ 
KKKKKKKKKKKKKKKKFWGGGPGPQSPF 
WGGLKKFGGRFLKRGGGEN 


8998 


22899 


A 


9064 


6 


306 


ECEHSTTYSHFAGGGGPVMVGGDADARS 
KALLGVCVGSGTEAYVLVLDPHYWGTPK 
SPSELQAAGWVGWQEVSAAFDPNSFYNL 
CLTSLSSQQQQRTLD 


8999 


22900 


A 


9065 


463 


126 


AHYQLVYTCKVCGTRSSKRISKLAYHQG 
WIVTCPGCQNHHI IADNLGWFSDLNGK 
RNIEE ILTARGEQVHRVAGEGALELVLE 
AAGAPTSTAAPEAGEDEGPPSPGKTEPS 


9000 


22901 


A 


9066 


23 


244 


S CHAG IRHEERERERERERERERERERE 
REREREI YFLALFSQKGRVHGFFKTPRG 
GQKRELLSNRVNTLCTE 


9001 


22902 


A 


9068 


396 


143 


KDIQNFGQAWWLTPVIPALWEAEAGGSR 
GQEIETILVNMVKPHLSVLLAKNSSSHC 
PMCWLAEIiQWYDLCS FQKI FFG 1 1 LI PG 














pdeffeltvddvrrrlaqlkserkrl.ee 
aplvtkafreaq i kekler ypkvalrvl 
fpdryvlqgffrpsetvgdlrdfvrshl 

GNPELSFYLFITPPKTVLDDHTQTLFQG 
EP LPGRS GCNLGAEE PAGVYLE PGLLEH 
AISPSAADVLVARYMSRAAGSPSPLPAP 
DPAPKSEPAAEEGALVPPEPIPGTAQPV 
KRSLG KVP KWLKL PAS KR 


9003 


22904 


A 


9070 


20 


427 


DGTRRERERERERERERERERLYLTEDT 
LHSLSLEGARSGASVCVQSVYPTGASAP 
HS I INGHKRVLVL C VET EHT F P LAE KDR 
AFLGDTSVFVGAAPVGGANTGASAGGAP 
PSRGEPKEESGESHEDMGFGLFV 


9004 


22905 


A 


9071 


297 


126 


CSIYDKRIPTFGLGEGVSERQRLTLLLP 
CLPPALTETPTSSDWQSTEATPTLTNSS 


9005 


22906 


A 


9072 


117 


286 


WLRGSEKMFQVQEINEQYLQCILIKGI 
FLVKKKKKKKKKKKKKKKKKKPRPEYIK 


9006 


22907 


A 


9073 


3 


302 


PPMETLRQWTTIVHSQEEEEEHFEVIKD 
EVKWARKHGQPGTPVAIATQLPPRTSA 
AFPAQQQPLQPLAPTTVLSVSADWNALL 
QRPNKYIWGLQYLEM 
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1 SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


1 M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=V"aline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9007 


22908 


A 


9074 


437 


79 


NQKREEEEERRKKRRRRRRKRRRRKKKK 
KILRQKKKKRRKAGSREKKKILRHLPEK 
NKRSKKHYELINALSRLQDTRLIYQKMH 
FYILAKNNWKIKLKIQYHLQALENKVPR 
HKWETYA 


9008 


22909 


A 


9075 


51 


185 


CRADFMF FHDK I I V YLCSLVYLLTFP P L 
LSYDPAKS PSARNTQE 


9009 


22910 


A 


9076 


452 


310 


QFSRCHLSSLSLDRKGNSLTPCTSWVRR 
CLALLRLTLGALHLLSDNPQ 


9010 


22911 


A 


9077 


438 


326 


LLHCWWDCKLVQPLWKSVWRFLRDLELE 
IPFDPTLHF 


9011 


22912 


A 


9078 


37 


169 


GIPGFRHAGQRRLQLiLiITGDLPTSASHS 
AGITGTSDRARPPSTF 


9012 


22913 


A 


9079 


1 


408 


RKGRLSKEEIERMVQEAEKYKAEDEVQR 
ERVSAKNALESYAFNMKSAVEDEGLKGK 
ISEADKKKVLDKCQEVISWLDANTLiAEK 
DEFEHKRKELEQVCNPIISGLYQGAGGP 
GPGGFGAQGPKGGSGSGPTIEEVD 


9013 


22914 


A 


9080 


31 


168 


VRRNPRLEHTVTITAYCSLELLGSSNLP 
ASASQVARTTGLQLLNLQ 


9014 


22915 


A 


9081 


460 


319 


YFFVEMGSHYIAWAVLELLGSSNPPASA 
SQSAEFTDAHKPTDCRPLR 


9015 


22916 


A 


9082 


465 


326 


PCVAEFCTHTHTHTHTHTHTFKCVGSHL 
HTNND 


9016 


22917 


A 


9083 


33 


269 


PGNSRPISSVRGIHPPCLANFCFLNVFV 
EMGSHYWIoAGLILGSSDLSAIaAVQSGR 
VTGMSHCAWPEIFFLLENWIIL 


9017 


22918 


A 


9084 


47 


300 


DL I TSL FLLKQS S LATMN I OTP FKN I Y F 
Dlill KTLFLRFFI FCFI YFYFLFFSYGG 
FFSFFFFFLKGSSFL.PPRLNGGAKIWVN 


9018 


22919 


A 


9085 


437 


220 


NIAIKSLSLSLSLSLCVCMCVCVSACVC 
MCLSLSLFSPSSLSHSLFPFSTPNMDIS 
HLPILSTQENMLLLLQ 


9019 


22920 


A 


9086 


227 


23 


DCPKAN ILGLTP RLVSNSW PQVILTP W P 
P KGQS LLCRQGEGDRI WTWHP WWLLHPP 
PQARFPPRTHFH 


9020 


22921 


A 


9087 


206 


326 


PHWAQWPTPVIPALWEAEAGKSQGQEFE 
TS LiANTVKPRL Y 


9021 


22922 


A 


9088 


364 


123 


SRWLPLKAPGEDTSLPLPASGGSRCSL 
AWGSIPPISASIPTWPSVLCPCEIFPLL 
FLLYGDCNLGPTLI PDDLIMRSLP 


9022 


22923 


A 


9089 


373 


205 


MPPRLANFCYFSRDWVSPYWSGWSQTPC 
WDYRREPPFPPCVRTSYSVQYCACYAGL 


9023 


22924 


A 


9090 


283 


2 


REFSFSAWLAFKIFWKLFSPPFWRVRP 
EDWFI PGGCGPNE PNFP PCPPGVGKKGE 
PRFKKKKKKNFISSQAQWLMPVIPAFWE 
AKAGGSLEP 


9024 


22925 


A 


9091 


16 


261 


CPEDLLVGQNVKVEDSDMGDPDRRPMCM 
FACFYLLVYRRCLAMLPRLYLNSWAQVI 
LLPQPPASLELLQAHATTHPALCLSF 


9025 


22926 


A 


9092 


17 


109 


RQGLAMLPRLISNSWTQANLLPWPPKEIj 
GL 


9026 


22927 


A 


9093 


282 


114 


KKGGKGALLVFLKTGRKPFFFFLRQGLA 
MIiLRLVLNSWAQAILIiPWPPKVX,GLQA 


9027 


22928 


A 


9094 


205 


337 


NANLHGRMQRLTPCLPEIWEAETGGSQG 
QEIETIIANMVKPCLY 


9028 . 1 22929 


A 


9095 


62 


287 


EHSCAKLFCFAFESGPYSVAQAGVQWHD 
HSLiLQSQTPGLKQSSCLSFPASAGFYKE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparngine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophnn, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SPSFYKDKSCHHLGKHSGA 


9029 


22930 


A 


9096 


170 


9 


AASTGYCIGRFCTKLKIQSGGQVRWLSP 
WE KREAEVGGLLEARSSRP I WATW 


9030 


22931 


A 


9097 


2 


104 


RRPGGRAEPRARVRGDRDPPVLLRHYPH 
IFEECG 


9031 


22932 


A 


9098 


350 


142 


LCLS C P KGGGYRAMAPWRGKFCS FGKNG 
GFPMLGRWLKWPQMGPLPQPPKRVGFQ 
GLSPPPGPPPFLV 


9032 


22933 


A 


9099 


494 


342 


YFYFYFVEMVSHHVTQAGLELIASSNPP 
ALISQHAVITSGSHHTQPKLYF 


9033 


22934 


A 


9100 


30 


232 


GGIMLCHGRCACFPFAFCHDCKFPKVFQ 
PCFLYSLQNCSFLYKIPSLRSCRLQLAE 
LLS I I YLENPL 


9034 


22935 


A 


9101 


26 


171 


RNSRRliRHKNRLSLGGGGCSEPRSCQCT 
PAWQTARLSLKNKKQQQKKP 


9035 


22936 


A 


9102 


486 


239 


HHTCPHQVCVCVCVCVYRVCVCVCVCVY 
IYMYMCIYIYIYIYICIYICREREKERE 
REMW FHHVAQAGLELLGSRDLPAPAS 


9036 


22937 


A 


9103 


487 


344 


TGFHHVGQAG I ELLTSSDLPASASQSAR 
ITGVSHHAQPLFMI FCAKC 


9037 


22938 


A 


9J04 


470 


221 


TPGLKRYTCLSLPKCWDYRCEPHARQGA 
FLCAFSPLSSQQPWGKRPHFHFSYETNK 
VERGKRLPLRSPSTEQGFDFRCVLFQT 


9038 


22939 


A 


9105 


468 


259 


YVFFCRDGGLTISSRLVLNSCPQQSPCL 
SLPKCWDYRHEPPCPSCCSNLL.LMFHFT 
NVFLIDSSCSHFTP 


9039 


22940 


A 


9106 


466 


329 


RVAGTTGTHHHAWL INI PSVETGFHHVG 
PQVIQPPRPPKALGLQA 


9040 


22941 


A 


9107 


409 


306 


AGFELLASSDLPASASQSAEITGVSHHA 
RPSLIF 


9041 


22942 


A 


9108 


417 


269 


MWAQADMESSKEPGLFNWI INDSLVQ 
AYPELKEALYEEIKKAQRRGA 


9042 


22943 


A 


9109 


533 


399 


RGSLRHETRLNAEARRCSELNSRQCTPA 
WATERDAVSKKTKQKA 


9043 


22944 


A 


9110 


16 


239 


LVHFEVRWNSHEVSLTADSRKSHLIFPQ 
PNYADTLINQESYEKSEPLLITQDLLET 
KGE PRQLQVSFFP PKREE 


9044 


22945 


A 


9111 


68 


219 


EPPHLANFLQRLVFTMLLTLILNSWPGV 
ILPPGPPTVLGLQDKL.PVLLPS 


9045 


22946 


A 


9112 


192 


57 


DRVPLCPPDWSAVARSWLTAASVSQVKG 
ILSPQPTECAPPGNST 


9046 


22947 


A 


9113 


21 


201 


TI VS WE FMNLGGG TCGEPR WCHCTP DW 
GTERDSVSGEKKEDKDVARRHWVGSFYN 
GPQ 


9047 


22948 


A 


9114 


2 


144 


TFTIDRCAKDLFVAKQVGTKLSVNEPLS 
FSVESI LKRPSSAI TRVSQ 


9048 


22949 


A 


9115 


156 


316 


MILNKKKKKKKKKKKKKKKKKKKKKKKK 
KGGGPFKKTLiGGAKITPGWKNYFFFFMG 
GV 


9049 


22950 


A 


9116 


1 


84 


TKPTPFLPTLITLTTLLiLPISPFILjIIL 


9050 


22951 


A 


9117 


1 


176 


HLNLGGGVCSEPRSCHCTPVWATEILSQ 
KKKKKNGFKFWE PRPQKEGGGGRKKPPG 
VP 


9051 


22952 


A 


9118 


1 


184 


AAAGYPGQQQNFHSVREMFESQRIGLNN 
S P VNGNS S CQMAF P S S Q S LYRTS G AFVY 
DCSKF 


9052 


22953 


A 


9119 


69 


374 


1 IFSKKKKKKKKKKKKKKKKGGPPKKKP 



1096 



JSDOCID: <WO 016483SA2_I_> 



WO. 01/64835 



PCT/USO.1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 
beginning 
nucleotide 
location 

ingto first 
amino acid 

peptide 
sequence 


Predict- 

correspon 
ding to 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyJalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 

N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine V=Valine 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WGGP KI FVLRRVLS FFIRGE KKNTFGDF 
LKXTFFWGGEKLGKPPPKKKTPWGKKKI 
FKGERGKKPPFLGGKKK 


9053 


22954 


A 


9120 


405 


93 


EFSPLLL.TTNQKRGFFPPSPPKIFFFPK 

FFSPPKEKKYNLPPPGKFCPPQGFFFPP 
PPLFFFSFFPFFFFRCQIK 


9054 


22955 


A 


9121 


2 


380 


WDYRCPPPHPTNFCIFSGDGVSPHWPGW 
SWPQVICLPQPPKVLLTPWVRDRTRDLE 
GGDLW YGACLGACLHLGPQAASFGHG I 1j 
GPASLLCCIPKHVHMDTLSPGVKLQSCD 
TGVALRVGEKREGS 


9055 


22956 


A 


9122 


2 


534 


DMIREQKIYHKYLAQRREEEKAQEKEFD 
RILEEDKAKKLAEKDKELRLEKEARRQL 
VDEVMCTRKLQVQEKLQREAKEQEERAM 
EQKHINESIiKELNCEEKENFARRQRIiAQ 
EYRKQLiQMQXAYQQQSQEAEKEEKRREF 
EAG VAANKMCLDKVQE VLS THQ VLAQN I 
HPMPQGMPQ 


9056 


22957 


A 


9123 


389 


46 


GPNFLPPPIFGFFSPFSPLKFFFSPKTL 
IFLGGFFPIFPPPKKSFFPKNSPGVFFC 
PPFKKKNFLFPSPLNFAPPRVFFKPPPP 
FFFFFFFFFFFFNFYRLYFIWVNGLAKV 


9057 


22958 . 


A 


9124 


422 


59 


AQQKKKKKKKKKKKKKKKKKKKKKKKKK 

K 














QWCDGNLLQTLTLGLKRSPHFSLPSR 


9059 


22960 


A 


9126 


175 


89 


KPPPHFFFFFFFFFFFFYFFFFFFFIFC 


9060 


22961 


A 


9127 


368 


250 


CFNYLSRDRGLPLL.PGL.VLKSWPQAILL 
PQPPKMVGVQG 














FFSPKRLNFGGGGGPKMPPPKKKVSPQK 
PHPGFFSPPQKKKKKFFPPPVKMGPPKS 


9062 


22963 


A 


9129 


396 


13 


FPPPPPPIFFFSPRRFFFRGGGSVALSP 

PPFFCPPPQILFIiAPPPYFFFFFFFFLF 
VFFFLFFFFLLPLLFLGKTS FLLLLFLF 
I YLE TGS CS VAWARVQ 


9063 


22964 


A 


9130 


55 


180 


LLYNKLI FTLKKKKKKKKKKKKKKKKKK 
KKKKKEKYEKTKAY 


9064 


22965 


A 


9131 


2 


95 


SFVHMEHEKAVLLLKSFQNTVDLGIQRE 


9065 


22966 


A 


9132 


172 


404 


KKKKKKNKKKKKKKKKKKKKKKNWGGG F 
FKNSWGGPKKRGGGKI FFFSLLGGKKKT 
LGVFLEKPLFLGGRKMAPPPP 


9066 


22967 


A 


9133 


1 


125 


QPVSYPFTIIGQVAYELYFTTIIilLIPT 


9067 


22968 


A 


9134 


1 


120 


QPVSYPFTIIGQVASVLYFTTILILIPT 


9068 


22969 


A 


9135 


7 


269 


I ATGQ P RLQ KLTLD PT LLLKR F LLT KKK 














KKPPGGPKTYPVKKKI FFFFKGKI XKPP 
WGI 


9069 


22970 


A 


9136 


1 


64 


LVS I S NT YGLD YVHNliNLLQ C 


9070 


22971 


A 


9137 


400 


259 


SQPUIRLRHKNILNPGGGHYSELGSCHC 
TPAWDTEQDSIHSWLL.SSV 


9071 


22972 


A 


9138 


3 


310 


ECFPKSPNMNGHCNLLVQSTSPSISTMP ' 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted- 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylnlanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CPKYVALEKVFLLNVLWFFYLKYRDNRP 
GSVAHTCNPSTLGGRGGWITGAQEFETS 
LGNIARPSLI KRKKYKLF 


9072 


22973 


A 


9139 


34 


142 


NCKKLNTHTHTLTHSHTHMLTHS DT FTH 
THTHTKYK 


9073 


22974 


A 


9140 


146 




PPPPPPEGVSLCGPGWSAVARFRLTASS 
ASRVHAILPPQPPQQLPTRP 


9074 


22975 


A 


9141 


3 


102 


P PAS AS QS AG VTGMSHHAQP I E PLRCS S 
HRTT 


9075 


22976 


A 


9142 


245 


51 


NREGDLLCCPRWSQTFGLKQSCLGFPKC 
WE YR RG PPHLA Y I MSMRKNI Ii I QM I FYY 
NNPCLLYS 


9076 


22977 


A 


9143 


2 


108 


GAKCVSDRI KHAFLiIKEQKI I VKVLKAQ 
AGSQKAK 


9077 


22978 


A 


9144 


3 


200 


MGFLHVGQAGLKLPTSGDLPASASQSAG 
ITGVSHRAWPVSGFFCSEAKECFWEEQL. 
INLDKNSQK 


9078 


22979 


A 


9145 






WFFLGFPLKGKLGPPKMFFRAPRVYSPF 
FPPFFFWPPPPRIIFPPGFFKFFVFLAG 
PIW 


9079 


22980 


A 


9146 


387 


220 


PLFFVGCFFFFFFFFYYFFFFFFFFFFF 
FFFFFFFFFFFLGGCERNFI IRKNINN 


9080 


22981 


A 








VINFKHILIKTVIQSLFFFERGSCLPPR 
LECNSMIMAHCSLNLLGSSHPPASASQV 
AGTTGAYQHTQL I FI F FAETGS CYVAQA 
GL.QLLGS SNPP P WPPEVLGLQV 


9081 


22982 


A 


9148 


3 


141 


GGGGCCEPRWSHYTPAWATSAKLCLKKK 
KKKRFLEKSCLTPLLKA 


9082 


22983 


A 


9149 


413 


240 


FIFCRDGGLTMLPGLTSNTWSQVILPPQ 
PPKVLGLQAYHADELPHLATWGVLYGLT 
V 


9083 


22984 


A 


9150 


395 


193 


LLFGP CLLNL I TQ FVSCHLLD I KLQMTL 
SEGYFPL.NIQESPFYRAPLDDPSVRHAR 
GEILPISLGPG 


9084 


22985 


A 


9151 


1 


77 


LV AA I VXiGG TG WG F KN I RRGRS GGA 


9085 


22986 


A 


9152 


365 


181 


IVDRPDPCRDADAQREVFTLSARYPKIW 
VPPGDPAREFPFIQSAPVMLIiAAPDELV 
SAQLA 


9086 


22987 


A 


9153 


373 


251 


FKKIFFCRDRALNMSPRLVSNSRTQAIL 
PPQPLKVTjRLQA 


9087 


22988 


A 


9154 


204 


95 


DYHWSQPGMVAHACNPSTLRGRGRWITR 
SGDRDHPG 


9088 


22989 


A 


9155 


44 


280 


AELFGTLMREDGLSLGGGGCKEPKSCHC 
NPAWATEPDCLRKKRKESPYLLWTPPD 
SSYSPFFFLLLPAPDNNDYFLS 


9089 


22990 


A 


9156 


2 


117 


ERLRHENCLNLKGRGCSEPGWCHCTPAW 
ETKQDSISKK 


9090 


22991 


A 


9157 


3 


207 


HASAHASADAWDYRRVPPCPANFCILYV 
SHVGKAGLNLLTSSDLPTSASQSAGITG 
VSHHARTPTYV 


9091 


22992 


A 


9158 


117 


267 


NRVLLCCPDWPQTPGLKGSSLSLLSSWD 
YRH I P P RLAARS S YLGPG F YH 


9092 


22993 


A 


9159 


415 


167 


LFFFPIKNXFFFSPPLFFFFFFFPPPPX 
XXFFFFFFPFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFLELKQNF 


9093 


22994 


A 


9160 


217 


41 


SSQIHRDRNRMVRQGAVAHACNPSTFGG 
KGEQ I TRAQEFKTSLANMVKPRLYYKPC 
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NO: of 
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sequence 


SEQ ID 
NO: of 
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M 
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od 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Asp»rtic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine. 
T=Threonine, V=V fl Iine, 
W=Tryptophan, Y=Tyrosinc, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LY 


9094 


22995 


A 


9161 


341 


164 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFYVCF 
VSVELKKWQNYSFDVRIILRVEP 


9095 


22996 


A 


9162 


190 


402 


IHTGKHPKCYRNSYSVLFLASKKKKKKK 
KKKKKKKGGGFKKKKNLNRGGGEI I FFF 
WGPKKI FGGGGLKKG 


9096 


22997 


A 


9163 


108 


209 


LS VKRCAKNPKKGKVAPG I KE I RTRS HQ 
DSEDI 


9097 


22998 


A 


9164 


2 


145 


KHCGRPRPQGPLPRKTRDLCSLVYLI/TF 
PPLLS YDPAKS PSARNTQE 


9098 


22999 


A 


9165 


170 


402 


ISRCHFPSLILSYVKITKTIKAPKKSKK 
KKKKKKKKKKKKKKKKKWGGGPFKKKNF 
LGGGGGE IFF FLGAPKKKRGG 


9099 


23000 


A 


9166 


399 


113 


WRLRWEDCLSLGGRGCSEPGSCYCTPMW 
VTEQDQS SGYEGFQRPSSLS KART PSVA 
AERSMCQ PACVCNLS VEHLSLYS CLLAQ 
QGKALYKLLNK 


9100 


23001 


A 


9167 


3 


138 


SWITGVCHYAWLI FAFLVETGFHRVGQA 
DLELVTSGDATTLRSH 


9101 


23002 


A 


9168 


134 


301 


SfCLTQSHSVTQAGVQRLNVHSLRPPPPR 
FKRFSCLSLPSSWDYRNLETVLQEETH 


9102 


23003 


A 


9169 


201 


1 


GATILRFKTKKKNVFFKKKKKKKKRKKK 
GKGWSGTESRWNLSGEGSSGLRRRKLQA 
PAGPQEFEKLA 


9103 


23004 


A 


9170 


2 


191 


FVPQPLPLGFKRFSCLSLLSGWDYRCTP 
PHLANFCGFFFLKFFFFFFKKKNFFSPR 
GRGGALI 


9104 


23005 


A 


9171 


2 


191 


GGTESCFVAQAAVQGCDLGSLQSPPTGF 
S ETAS QTVDPDKVKD PGKLRTM FVAEL P 
KTIVTDS 


9105 


23006 


A 


9172 


3 


143 


SWFLYVGQAGLELLTSGDPPALASHSAG 
I TGVSHRARLQTG I FRAV 


9106 


23007 


A 


9173 


361 


174 


VLGI I LSMSETE I FNTKCLDQSGTVAHT 
YNPSTLGGQGGRVIRAQEFETSLGNTGR 
PHLFKI 


9107 


23008 


A 


9174 


493 


351 


LVETGFNHVGQAGLEL.LTSSDPPASASQ 
SAGITGVSHLiALVLISRKSC 


9108 


23009 


A 


9175 


1 


115 


FRKIWYIYTMEYCIAVKKKEMLSFTITW 
MNIjEDFMLVR 


9109 


23010 


A 


9176 


286 


96 


PLFFFNRWGLAICLGWSAVAIHRRDHSA 
LQSQTPGLKRSSCLS LLSSWDYRRVPPC 
LANESHY 


9110 


23011 


A 


9177 


399 


237 


RNTKWL.FFLRRSLTLSTGLECSGAISAH 
CNLRLPGSSDSLANGIFKDLALLDM 


9111 


23012 


A 


9178 


427 


101 


GRTGRQGRGAPHISHDGRPGRDAPRFPD 
VMAAGKRRS SLPRWDGGRAETLLTFQTG 
QPGRGAPHIPDDGRPGGDAPHFPDGWA 
GQRLQSQHFGRPRQAAGRWRL 


9112 


23013 


A 


9179 


193 


41 


IVFLFIYLFCFFVGTGPCYVAQAGLKIjL 
VSSGPLASAPQHVG I TGMSHHT 


9113 


23014 


A 


9180 


292 


119 


KITKINKQ I LNLS YTLDQMDLTDI CRI F 
HPTATEYRFFSRAHGTFFRSGTKNSAWH 
I 


9114 


23015 


A 


9181 


3 


279 


ALSPPYSSECCKLGSFNRELSSPSPGGQ 
KSEIKVSAGPRPLPRVQGDPSPPLPASG 
GPRCSWACGQITAVSASGVTHPSPPLLH 
VCCVCLI 



1099 



NSDOCID: <WO J)164835A2J_> 



WO 01/64835 



PCT/US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN- 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G = Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Scrine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9115 


23016 


A 


9182 


402 


238 


QMQWHNLWPLQSLPPEVTPLFCLNLPNN 
WGYRWPPPPPTNFFFLKGVSLGRPGSV 


9116 


23017 


A 


9183 


3 


207 


EFHHVGQDGLELLTSGDPPASASESAGI 
TGVSHRAQQACRFLHFFEGTCFLSQELS 
YLKLGRDTIIC 


9117 


23018 


A 


9184 




106 


YPHYVKSIVASTFI ISLFPTTIFMCLDQ 
EVIISN 


9118 


23019 


A 


9J85 


110 


248 


GVLVGS PLRTR FHSQIiQAEGAEEIGWT 
WMGDGVLPVCWLEVDV 


9119 


23020 


A 


9186 


925 


1132 


KCPSAWAAHPTSEIRISQGGSQAAWLV 
LFFLLKNPGDVI PALWEAEAGGSQGQQF 
ETSLANMAKPRLY 


9120 


23021 


A 


9187 


399 


292 


ASQVAGTTGAHHHARLIFFVFIjVETGFH 
HVSQDENF 


9121 


23022 


A 


9188 


7 


82 


PCDPPASVSQSAGITGVSHHAWQXF 


9122 


23023 


A 


9189 


1 


117 


LPISLPSFLPSFLPSLLPSFLPSSFLPF 
FFPSSVTPYLK 


9123 


23024 










NKAFRIKAFVSPSNKKFSSFFFFLNEMK 
SHYVDQAGLELLSSGNPS I IoASQNAKI T 
GVSHHVRP 


9124 


23025 


A 


9191 


2 


282 


LI F YF VDNMLVRAGLELLS S SDLPALAP 
KVLRFTGVSHHTRPELYFQRLLAPGTLL 
INSQAAGTNFPIGLSRIKPTQSGHGMLQ 
EVWGTSGLR 


9125 


23026 


A 


9192 


44 


136 


CLCVCVCVCVCVCVSCKQTLMCCFLENF 
DV 


9126 


23027 


A 


9193 


469 


347 


PCPANFVFLVETGFLHVGHAGLKLLISG 
DLPNSDQKFYVN 


9127 


23028 


A 


9194 


427 


238 


PPARPTAFPYIEFFVEMVFRYFGKAGLK 
VLISGDLCASASPGVGITGVSHHAQPK 


9128 


23029 


A 


9195 


34 


178 


SLVHWEFERGRAEWLTPVIPAIiWEAEA 
GGSRGQEMETILANTDFWTG 


9129 


23030 


A 


9196 


22 


338 


CGPYSMWWNSIKRERKEGKKRKKKRRRK 
EQKERERKERKERREGGREGKERGRREG 
KKRKKNKRKKVGL I GS PCI VFTWQAAYC 
EESGGVWPGGETEHTQEGLVQ 


9130 


23031 


A 


9197 


3 


91 


QGLiSLGGGGCSEPGSCHCTPAWATERDS 
I 


9131 


23032 


A 


9198 




203 


RTRGNKS I DDTSN FDDFPESDI LQ P VPN 
TTEPDYKSKDWVFLNYTYKRFEGLTQRG 
S I PTYMKAGKL 


9132 


23033 


A 


9199 


2 


144 


PTRTMPCCCCCCCCCCCCCRCLFMTGVL 
FTCCHNOFENHSGCLPRQI 


9133 


23034 


A 


9200 


1 


99 


YAAKE L VHDRYRPT I EMERRRGLW WLVP 
RLSLE 


9134 


23035 


A 


9201 


3 


149 


LNAVQRMEWQLKVQELDPAGHKSLCVNE 
VPSFYVPMVDVNDDFVLLPA 


9135 


23036 


A 


9202 


284 


21 


KGPPLFFFFF FLRQGLVLWHRLE YSGMT 
TAHCNLELLVSSDPPTLSSQNAETTGVS 
HCAQPASLILSKPYIGFLHQIQKSCLPV 
TFL, 


9136 


23037 


A 


9203 


1 


77 


LRIiVLNSWRQAILIiLWPPKVLGLQA 


9137 


23038 


A 


9204 


529 


340 


GVLLWRQAEAGELSLGGGGCSELKVRHC 
TPAWATRAETPSQKINELIKSMAVELTD 
DFIT 


9138 


23039 


A 


9205 


413 


241 


HPPPPELKLFSCISLISRWCKRHRPPCP 
YVFVLLAETGFHVGQAGHKHTYCCVSLP 



1100 



JSDOCID: <WO 0164S35A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glu tannine, R=Arginine, S=Scrine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














T 


9139 


23040 


A 


9206 


33 


152 


QLiQGLESWGMERCAIjPGYPVVFTNLiCKY 
RSWIEETMRDK 


9140 


23041 


A 


9207 


70 


402 


HVDWEALGNDPRLVNLGICKLKASGIHE 
WVHTRTCAHTHMHTHTHTCTHCRSPRW 
VT.NLG CT FEL PG AL KNARA Y PH PNATKP 
MTSESVGIRPTRQVFKAPRAEQHCSTA 


9141 


23042 


A 


9208 


46 


262 


GG I P PAAVRGG RS WLRDVLS RLAAAGRR 
RRGTRRGRGRAGGAGGGRDRVGSRREPA 
AARTAAEGERAVRGKC 


9142 


23043 


A 


9209 


15 


206 


PIEPLSERTNSAGAPPPASLPPCSLISD 
CCASNQRD S VG VGP S EPGDS DMQTSLWA 
KVPVSKD 


9143 


23044 


A 


9210 


475 


110 


NLIAPHLLVSLEKHS IRMGVAQFSRCHIi 
SQLPLARKENSLTPCASGVRRCLALLRL 
MLGGLHPLSCTHCPTGPSEMNLVSQLEM 
QKS P VFCVAHAGS CRLELFLFGHLPL 


9144 


23045 


A 


9211 


420 


132 


PPPPNYFGGAGFFPPPPFFLRPPPPFFF 
LPPKKKKIFPPRGPKKIFFLKAPPPSFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
LVRSHFILTIM 


9145 


23046 


A 


9212 


353 


87 


FFFXKQXFFFPSLFFFSPPXXFSFYPF* 
YLKYPLLPPSFFLFFFTPPPPXFFFFFF 
FFFLFPPPFFFFFFFFFFFFFFFFFFFF 
FFFFD 


9146 


23047 


A 


9213 


2 


276 


FIQLIiKRRLCELGYEVLPHPSYSHDLLP 
TNYHFFKHLDNFLQRKFFHNQQDAESAF 
QEFAESQSMEFYATGINTLISPWQKCVD 
CSVSYFD 


9147 


23048 


A 


9214 


402 


276 


LSCLSLPSSWDYRCPPPKPGYFFYFFLF 
LVKTGFPHLAQAG 


9148 


23049 


A 


9215 


386 


37 


GFKKRVCS PFFFFFFFFFFFFFFFGREG 
FFLQKKKVTCNNPSSNKLLTLGLQGKKE 
IQPALMKKNSISQS 


9149 


23050 


A 


9216 


1 


142 


KDFRVQVWYNHSSLQPQPPGFKQSSRLS 
LPKYWDYRHEYLHCIQLAP 


9150 


23051 


A - 


9217 


402 


181 


FFFRXFFFFXXFFXXXXFFFFFXXXXXX 
XXXXXXXXXXFFFFXFFFFFFFFFFFXM 
FQQEK1.LFIFYVKIWTF 


9151 


23052 


A 


9218 


2 


87 


SGSLIFGLIYICMCVCVCVCVCVCVYPC 


9152 


23053 


A 


9219 


420 


271 


YYCI FRETGFHHVGQAGLELLTSGEPPA 
S ASQSAG I TATAPS PFLTFLLV 


9153 


23054 


A 


9220 


139 


413 


SGYHS KTLFQKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKRGGGAFKKK 
IFFRGGGGNFFFFIiGPQKNIGGGGLKKR 
GGGKKPPPPQIFFFVG 


9154 


23055 


A 


9221 


134 


236 


LHRWGPAMLPRIILNTWPQVILLPRPPK 
VL.ELQA 


9155 


23056 


A 


9222 


364 


280 


STHLSLPKYWDYRHEPPCLAKKDAQSFS 


9156 


23057 


A 


9223 


1 


147 


TLLPMLESSGTIMAHCSLiNYLGSSDPST 
SASQSARITSVSHCTQPPLPF 


9157 


23058 


A 


9224 


1 


208 


AGAPPPASLPPCSLISDCCASNQRDSVG 
VGPSEPGAGYHLWRRLSQSEKRS IRVG 
VTRFSSLLLSSHA 


9158 


23059 


A 


9225 


1 


145 


FQHVGQAGPKLLTSGDLPASASQSAGIT 
VSNRVRPKTKNFYTGNTRPN 


9159 


23060 


A 


9226 


396 


210 


KCWD YRHEP P Y PALTNLLF I EMGSYGVA 
RAGLKIjLGSSNPPISASQSAGVPGIjSHH 



1.101 



JSDOCID: <WO 0164835A2 I > 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alahine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AQPIF 


9160 


23061 


A 


9227 


2 


154 


FHHFGQAGLELLTSSDLPTSASQSAEIT 
GMSHHTQPRMVFFSNFSSTAQC 


9161 


23062 


A 


9228 


1 


238 


FFLLALG I F I LHAI YLS I YL.S IHPPSMH 
PSLSIYLSIYLSIYLSIYLSISLSLSLS 
IYLSIYLSIYLSIYLSDFLSSAR 


9162 


23063 


A 


9229 


3 


65 


RGDLITEPFLPKLLiTKVKGS 


9163 


23064 


A 


9230 


25 


188 


YFRTAWNQAWW YVP I VSATWEAEAGGLb 
EPTSSRLRCAVIAPVNSYCTSTWAVS 


9164 


23065 


A 


9231 


44 


232 


YDLKFNNHWFYFLFETEFHSLSKVECN 
GAISAHHNLCLCDGARLHHRRGGKKTKK 
QKTLPS 


9165 


23066 


A 


9232 


3 


195 


FFVGLGS PYWQTGLELLCSGNPPASAS 
LNAR I TGMSHQVQPNTGRLS FYLGS S I Y 
GSIHAMI 


9166 


23067 


A 


9233 


380 


244 


SVPLAEMGFHHVGQASPELLPPVIHSSQ 
PPKAIGLQARATAPSUH 


9167 


23068 


A 


9234 


11 


101 


LLLINTLMFIHLSPILLLSLNPDIITGF 
EG 


9168 


23069 


A 


9235 


411 


226 


FFFFFATNSQGGPTSFWGPRNNAQVAGP 
LRNKIFTPVGSPKKIFPSWETPVF 


9169 


23070 


A 


9236 


404 


262 


QGIAMLPRLVSNSWTQAILLPWPPNQSA 
GGSGVSHHTQPIFISENIH 


9170 


23071 


A 


9237 


369 


152 


RAFFVSI WRS FHVSNS YMT VTVQMYHNL 
SIPLPDPHFFFRGGGLiAMLPRLiVSNPWP 
QAILPPQPPKVLGLQA 


9173 


23072 


A 


9238 


389 


190 


RPKKKKKFFPPPGGKGVFFKGAPPPYFli 
NFIFFFFFFFFFFFFFFFLIYTHNFLIL 
FLNCDKLYIT 


9172 


23073 


A 


9239 


330 


3 


GGMEPPFFPPTGAQWGYFGSLQNSPPEF 
KQFLCLRLPNVLEYKSIPPCPTNFCSFY 
KNGIFPCWPGWPFKFQKKKKISWVWWCA 
IiAEAGGSLEPRWRWRLQSAKMAPL 


9173 


23074 


A 


9240 


122 


248 


KKKKKRGGLTPVI PPLWEAKGGGSRGQK 
IQTILAGGGKAPLY 


9174 


. 23075 


A 


9241 


2 


286 


FFVQTEFHHVAQAALQLVSSRDPPTSAS 
QSVDITGVSHCAQPVIVSCDDQVTLVQR 
RQVTVTSLIAFCQVAYPKGAKSQWPQSF 
FFSSPFLFSL 


9175 


23076 


A 


9242 


2 


95 


HMKKCSSSLAIREMLIKTTMRFFFFKLI 
TCY 


9176 


23077 


A 


9243 


377 


293 


PLSPRLVLSSWAQMILLPLPPKVLGSQA 


9177 


23078 


A 


9244 


422 


147 


KKGGTQGFSPPRVFKRGPRNFFFGPPKK 
KKNPPPPRGKIFFFLKGPPSFFFFFFFF 
YFFFFFFFFFFTHKYFLLGDLLLIYHLS 
KKNYSFF 


9178 


23079 


A 


9245 


402 


208 


TIGRCHHTQLNLKFFVEMGSHYWQAGL 
KL.LDS TNPPSSAFQSAGISGVNPYAWTE 
IMFLYSIPI 


9179 


23080 


A 


9246 


314 


168 


EPYHLH FLPPHHY YYCRWGL I VLPMLVS 
NSWAQVILLPWPPKVQGLQV 


9180 


23081 


A 


9247 


3 


320 


WQAGL KLL S S CDQ P AS AS Q S AVI TG VN 
HHARPL I FLYEMDYGSRTVL I IRKWDDN 
EQSNRTI FIiKKDELTGNRSACLKYTPMD 
I NVNS GLGWRYRFGMHHQCRW 


9181 


23082 


A 


9248 


22 


303 


KGSYFSFTYEAQTESCSIARLECRGAIS 
AHCNLRLLDSSNS PASASRVAETTDHKT 



1102 



ISDOCID: <WO 016483SA2_L= 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

Otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteinc, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LVPERSCFTPKERNVIQKGQEESEEMSN 
HLFTWLPILH 


9182 


23083 


A 


9249 


13 


174 


LGLKVRHRLWL I FVLSVETEFHHLGQAG 
LELLTSGNPSAGITGVSHCARPTSHS 


9183 


23084 


A 


9250 


3 


141 


HVGQTVLRLLTSGDPPTSASESAGITGV 
SHRPRQIFISLFTICIR 


9184 


23085 


A 


9251 


56 


262 


LRIPLTFGGWKGEEEDRAVMHLRLCGD 
DSPMSESLHFIKLPRTVLVDEDARPEKK 
KKKKKKKKKKKKK 


9185 


23086 


A 


9252 


401 


275 


PADLFIFIWEDGGLTMSPRLVMNSWSQA 
TLQPWTPKVLGLQV 


9186 


23087 


A 


9253 


411 


258 


PKVFKYRCDPPRLAWFLFFVQSLLPRLG 
LNSWAQAFLPPWPPRVLGLQA 


9187 


23088 


A 


9255 


391 


167 


GLKI LTSGDLPASASQSAGNTGMSHCAR 
PLMS FKI RYFL FT I KMKLKLLWPVWSA 
GVHPPHFSCPIVITNKPRI 


9188 


23089 


A 


9256 


9 


255 


FFCRYGGLAVLPRLVSNS WTQAI LS PW P 
LKVLGLQACTHHFFLMVNSNAARI KTI E 
MLLS ED VNFF VALL CPS F T I SH FHL 


9189 


23090 


A 


9257 


487 


42 


RYFIFLFYFIFKTETGSYYVAQARLRIiL, 
GS S NP P ALAFQ S AR I TAVS HRAQLTC I K 
YTFLFTHIP 


9190 


23091 


A 


9258 


1 


243 


ILSSGGSFISLTAVILIIFMIGEAFASK 
RKVLIVEEPS INLEWLYGCPPPYHTFEE 
PQPKKSEIYSKCFFFSVFVHFGGWG 


9191 


23092 


A 


9259 


411 


2 


FFFFSPPVSLSLCVLESSPPPFFFFSPP 
PKKGFFSPPFFFFSPRVFPPPVFFFPPP 
PFFFFPPLKKNFSPPPPGKIFFFFFRPP 
PFFFFFFFLFFFFFRPFFFFSFFFLRWE 
EVLYFIMCHHISDAYPPNVDMNKE 


9192 


23093 


A 


9260 


417 


219 


SVEMGFLHVGQASIKLPTSGDPSASGSQ 
SAGITGLSHCARPHSQLSIVCCYSVPVQ 
CLHSLAARY 


9193 


23094 


A 


9261 


408 


172 


CPNH FRKWGHRGCP P PRVF FFVGRDG VY 
VAQAGLELLDLSNS PASASQSMGTTGMS 
HLGQPLLFILNDVIAVIPLQGSF 


9194 


23095 


A 


9262 


401 


132 


HQDNKPAPPGFFFFSPPPKKGFFSKPFY 
FVSPRFFPFPPFLNPAPIjFYFWGPKKKI 
FFFPPPAIKFFFFKRAPPLFFFFFFFFF 


9195 


23096 


A 


9263 


393 


220 


LFFDNKI PFFPPGGS KGGDFGPHTNPPP 
GVKPIFRPHPPEEGGPKGPPPRPREFFL 
GF 


9196 


23097 


A 


9264 


206 


438 


FPILKTHFRLGTVAHTYNPTTLGGPGGR 
NTCSKFKTSLGTWGDPFFHKSENCLGWG 
VPNCGPCFWGAQQKSAWSEGL 


9197 


23098 


A 


9265 


333 


48 


ALLAGPTPFFFLGKKKTRFGNRALFFFF 
FFFLNE VLLCNP G W IMWA I HR CDHG TL 
QPPTPGLKRSSCLSLSSSWNYKLKPLHP 
AGVRFLDFVS 


9198 


23099 


A 


9266 


3 


275 


TGFHH I GQAGLKLLiTLGDP PLSASHSAG 
I TGVSHHA WPMHLES YWR VQYPFMPQVL 
YFWAPSGAQHCAQ I I ENAI PTLLSRPQE 
EPASEA 


9199 


23100 


A 


9267 


49 


226 


HGPE PCMDADAQREG INVSAR YPKNWES 
TVDPAREFTM I YSAPLMLLADPDE FAS V 
QLA 


9200 


23101 


A 


9268 


159 


236 


PRLVXiISWPQAILPPWPIKWGliOA 


9201 


23102 


A 


9269 




208 


IFNYCVFSSGVAIAYGIYKQDLPALKEK 



1103 



_0164835A2_I_> 



WO 01/64835 



PCT/USO 1/0492 7 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 

ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cysteinc, D=Aspartic Acid, 
E=Gtutamic Acid, ^Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 

Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














PRGWFVDMGHSAYQVSACAFNRGQLKV 
SIYFFSRRLLWW 


9202 




A 








IFNFCVFSSWALAYGIYKQDLGALEEK 
PRNWFVDMGHSAYQVSVCAFNRGK1.KV 
SIYFFSRRLLWW 


9203 


23104 


A 


9271 


379 


263 


ESCSVGRLECSGAVPAHCNLCLCLPDSS 
GSPASSSRLC 














RKKERKKERKKERKKEDGATADGISEAE 
VIRVHPYPVGFGVLSEGDIWTEDMRAGR 


9205 


23106 


A 


9273 


405 


246 


PTKKNKYLHAPPLFHFFFIIPNPNFFFF 
FFFFFFNFFFFFKTKGCEFLYFSY 


9206 


23107 


A 


9274 


2 


227 


LVETGFRHIAQAGLQLLSSSYPPTSASQ 
SVGITGVSHQPGPSFHFERKSLGLKTLL 
LCVKVGVRACFAL F FQG S S 


9207 


23108 


A 


9275 


70 


304 


NLFLKASTYPSEHLYHLPPTPIVYIRKR 
LDAMAHACMPGT LGDRGG W I ART P DL KC 
SAYRGLSKCWDYRHEPPRPVLV 














TVFYHVGQAALELLTSGGPPALASESAG 
ITGVSHHAQPFLLILVISLFPCNQSFHI 
SPYRFSFTQLNHGFNTVLYPLVINISEI 
FF 


9209 


23110 


A 


9277 


406 


330 


PRLVLNSWTQTILLPWPPKVLGLDV 


9210 


23111 


A 


9278 




160 


ELLTSSDPPASASQSARITGMSHHTRLK 
YIVYRVSRKSKNIPLKFUGIQSFP 














GLTVFPRLVWNSWAQMIIjPPWPPKMLGL 
QV 














LGETGFHHVGRAGLELLNKVMHP D WAPR 
VLGLQA 














AHYKLiLLKIKLPGQAQWLTPVI PTLWEA 
EAGRS 














TGDPAREFTMIHTAPLMLLADPDEIESV 
QLAQS W P FGA IAS LGRS P CR KRN 


9215 


23116 


A 


9283 


404 


277 


IFFVETRSHHIAQAGLELLGSSDPPYFF 
QVFTEMSLSQRALF 


9216 




A 








H YVAQ AS LKLLYSGDP PAS AFQSASIAG 
VSHCTWPTGSLNKIQNNIHSRISSM 


9217 


23118 


A 


9286 


394 


231 


AFFFFFFFGERGFHHVARTVLKLLDSSN 
PPAWSQSAGITDVSHDFLISLYRIY 


9218 


23119 


A 


9287 


420 


311 


LNFCRDGLAMLPRLVLNSWAQAVLSPQP 
PKVMGLHA 














LMFKALLFSCHYFPAMVTFKFSLS ISGV 
VRTSGYIMLFSYLCLLKTM 


9220 


23121 


A 


9289 


270 


23 


TPEQCRG I KLFIYS L I LYVRDGVLLCCS 
GWPWAIHKCDHSTLQHQIPSLKQASCL 
S LLS S WE YR YAP LHLAS ANFL KVT W 


9221 


23122 


A 


9290 


273 


46 


HLLNM PLFTE I WTERLFLFY I LCFVFLV 
ETGFFYITQAGLKLLTSSDPPAYASQSA 
E I IGTS HCTRP I C YVNVTNI 


9222 


23123 


A 


9291 


424 


230 


CLPFTHSLSLSLSLSLSHTHTHTHTHTH 
TIYMNK 


9223 


23124 


A 


9292 


2 


146 


NLFLVETRSHHVARASFKLLSSSNPPTS 
ASQSSEITGASHRTWPEVFF • 


9224 


23125 


A 


9293 


393 


202 


GRERERKREREREKERKKERKKERKPER 
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SEQ ID 
ISO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E-Clutamic Acid, F=Pbcnylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PSGSLTYCVREG ITAANLGKRRHKAS KT 
LISHKNKN 


9225 


23126 


A 


9294 


252 


193 


GFCCCCCCCYCCCSDKRHPG 


9226 


23127 


A 


9295 


409 


269 


SQYFGRLRRVDCLiSSGVQDQPGQQGETP 
SLLKYKI YIHISWAGWHV 


9227 


23128 


A 


9296 


2 


213 


FCCGIjLIHLIYCKTEYSGQSTVAIHRCN 
HSVLQPQTLGFKQFSHLSLLSSWDYRYS 
PLC PAY I FNLS I G Y 


9228 


23129 


A 


9297 


169 


408 


RFARHMNSPHEERLPPPAGIRVRPPPQP 
AASLGPPRMAPS I SRRFQMTPYYKHNDF 
AGQAQ WLMP WPALWEAEAGGS S E 


9229 


23130 


A 


9298 


199 


51 


LHPPPPSLSLSLSHTHTHTHSHTHTHTH 
TLWLGRPGPAQDWLVWGNLT 


9230 


23131 


A 


9299 


1 


176 


MGSHSIAQARMQWCNQSSLQPQPPGLKQ 
PSPFSLPSSWDYRHVPPCLASIFFLKLN 
YH 


9231 


23132 


A 


9300 


412 


209 


MGFHHVGQSGLELiLTSGDPPTSASQSAG 
ITGVSHRNWPHCGFNLVFLMPDDVKHLF 
MCILAICVSSL 


9232 


23133 


A 


9301 


195 


347 


KIKGFRPGATAKAGNPSALGHQGGRTAI 
AWEFTISMSNMGRPCLYRKYKN 


9233 


23134 


A 


9302 


386 


190 


GFLFFFPPPKKEVFPHPFILGFPGFFPP 
PPFFKRAPEFFFWGPKKKKNSPPPRALN 
FFFFKGAPP 


9234 


23135 


A 


9303 


27 


173 


S FQ L WGHLK VPR I R P KRADH LRS G VQDQ 
PDQHGKTLS LLKKYK ISRAW 


9235 


23136 


A 


9304 


374 


121 


QWCDHGSLQPQTPGLKQSSCFSLPKRWD 
YRREPPHPTSLSPYIVAWPCTLCPPPLS 
All WfCHL I HQ KCDG VMVLQ VPG RTHNVT L 
T 


9236 


23137 


A 


9305 


70 


196 


PQVIKPGQAGLELLTSGDLPASASKSAG 
ITGVSHHAQPISHF 


9237 


23138 


A 


9306 


378 


256 


QKHVSICCPGWSQTFGLQESSCLGLPKS 
WDYRRQPQPCLTL 


9238 


23139 


A 


9307 


206 


3 


PCHLGWSAWQSWLTAASNSWPHAILPP 
Q P PKVLGLQASLKFLKAGI TQVTFS DYN 
KILNKEIKRGR 


9239 


23140 


A 


9308 


338 


161 


HFFCKKIYIYIYVYIHTYIYIYTHIHIYT 
HTYTYIYRYIHIYIHTYIYMYVYIYIPS 
LPF 


9240 


23141 


A 


9309 


503 


288 


FFLNLDKSQEWWLMPVI PTLREAEAGGS 
LEVRSSRPASSDPPTSASQSAGIIGMRH 
HARP WLRENGR I QN 


9241 


23142 


A 


9310 


418 


309 


FFCKNGISICCLPKLVSNSWPQAILQPQ 
PPKVQGTL 


9242 


23143 


A 


9311 


442 


275 


VSPPPI,GGGFI,SPPFKPPPFFFNLKGGP 
PFPTIPFLIIKGKKFCPPPPFFFPGLF 


9243 


23144 


A 


9312 


2 


95 


HENRLSLGGGGCSELRLCHCTEAWATER 
DFI 


9244 


23145 


A 


9313 


318 


154 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFI 
F I LF I YSLLYWC VGVLGS 


9245 


23146 


A 


9314 


388 


199 


EQPKCVHRIKQIWHICALEYYAAFKKKK 
VLP YAVI Q I KLEDTMLS E I NQS WGKYCM 
IPLIGGI 


9246 j 


23147 


A 


9315 


2 


97 


STPGLKRSSHISLPKYWDYRRGPPGPAS 
ILF \ 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
corrcspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanirie 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9247 


23148 


A 


9316 


3 


235 


TMWNAEGI P I RTTLDNSTTVQ YACLLH 
HLTMKAKSTVRDIDPQNDLTFLRIRSKK 
HEIMVAPDKEYLLIVIQNPCE 


9248 


23149 


A 


9317 


253 


71 


LFFFFLFFVDTKFHHLAQDGFQL.LSSSN 
SPTSASQSGGITGRSHQAWPYVHLLTLQ 
KLNY 


9249 


23150 


A 


9318 


425 


95 


CLLFCPKKKKKKKKKKKKKKKKKKKKKK 
KKKGGPIKKKKKQP 


9250 


23151 


A 


9319 


25 


214 


VRIDDRPE PCMDADAQREG I NSSARY P K 
NWVTTGDPARECGMIHSAPLMLUADPDE 
FVSVQLA 


9251 


23152 


A 


9320 


50 


231 


GHRPEPGRDADAQREGI IASARYPKNWV 
TTGD PARE CTR I QS APLMLLADPDE FVS 
VQLA 


9252 


23153 


A 


9321 


401 


226 


OWL FTNMI I GHCS LELLGPSNP P I SASR 
LAATTGTRHHAQPS KFRNLKGWLCAKRA 
RT 


9253 


23154 


A 


9322 


410 


159 


LGSLQLPPPGSKRFSRLSLPSSWDYRGC 
SELRLRHCTPAWVTEQDSVPPKKKKQIF 
RPFPFSYGESLGVGLRHLCFSKPSRRF 


9254 


23155 


A 


9323 


310 


25 


KWGGLFFFSPPKKRGFFKQVFFCVPPVF 
FFPPFFKTGPGDIFLGPLKKKIIFPPPG 
GKKFFFKRGPPPFFFFFFFFFFFFFFFF 
FSEVECVCLCR 


9255 


23156 


A 


9324 


406 


289 


PGFYHAGQAGLKLLTSGNPSASASQSAR 
ITSISQRTRPH 


9256 ' 


23157 


A 


9325 




116 


LHSVLLPFFYFCVCVCVCVCVCVCVKHK 
YVRAGGAPGR 


9257 


23158 


A 


9326 




158 


F I FL FT VGG LTG I VLANS S LD I VTjHD T Y 
Y WAHFH YVL.S IGAVFA 1 1 GG F IH 


9258 


23159 


A 


9327 


3 


107 


RGCSEPRLRHCTPAWVTEGDCVSKKKKK 
VPEAWG 


9259 


23160 


A 


9328 


257 


2 


GFFSNISASVI1PPPISRKNLFFPFRVY 
FGPPPKTFJCGPPPFFFFFFLS PRLECSG 
VIMAQCSLTLPGSSDIiPLSASCVAMTVS 

r 


9260 


23161 


A 


9329 


66 


384 


GNLCGWMKSMAWDNPWRKADWGQIRES 
LECQPNTGMLKKKKKKKKKKKKKKKKKK 
KRGGPFLKISWGGPILGGRGKIFFFFFG 
GGILTPRGVFWKSPFFWGGKKK 


9261 


23162 


A 


9330 


1 


98 


IIGQVASVLYFTTILIIjIPTISLIENKI 
LKWA 


9262 


23163 


A 


9331 


33 


412 


NLFYVMAEILHTHIQLCCQVKLHLSVPR 
PSLPLLCCWNFSLVLHEHSQPASEKKKK 
KKKKNKKKKKKKKKKRGGVFKKKPGGAP 
FCGGGENFFFFFFGGGLKKPRGVFGKNT 
FFWGGEKRPPPPPY 


9263 


23164 


A 


9332 


401 


311 


PTRPVDLLVDDSVKESLIGYGPTEDSKE 
FS 


9264 


23165 


A 


9333 


5730 


452 


TRSAPRAASWLEDPREVRSVCLSATFFT 
LSL.QKPRLIjLFS PS WHLGVPLSVGVQL 
QDVPRGQ WKGS VFLRNPSRNNVP CS PK 
VDFTLSSERDFALLSLQVPLKDAKSCGL 
HQLLRGPEVQLVAHSPWLKDSLSRTTNI 
QGINiLFSSRRGHLFLQTDQPIYNPGQR 
VRYRVFALDQKMRPSTDTITVMVENSHG 
LRVRKKE VYMPS S I FQDDFVI PDISEPG 
TWKISARFSDGLESNSSTQFEVKKYVLP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteinc, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NFEVKITPGKPYILTVPGHLDEMQLDIQ 
ARYIYGKPVQGVAYVRFGLLDEDGKKTF 
FRGLESQTKLVNGQSHI SLSKAEFQDAL 
EKLNMGITDLQGIiRLYVAAAI IEYPGGE 
MEEAELTSWYFVSSPFSLDLSKTKRHLV 
PGAP FLLQ ALVREMS GS PASG I PVKVS A 
TVSS PGS VPEVQDI QQNTDGSGQVS I P I 
I I PQTI SELQLS VS AGS PHPAI ARLTVA 
APPSGGPGFLSIERPDSRPPRVGDTLNL 
NLRAVGSGATFSHYYYMILSRGQIVFMN 
REPKRTLTSVSVFVDHHLAPS FYFVAFY 
YHGDHPVANSLRVDVQAGACEGKLELSV 
DGAKQYRNGESVKLHLETDSLAL.VALGA 
LDTALYAAGSKSHKPLNMGKVFEAMNSY 
DLGCGPGGGDSALQVFQAAGLAFSDGDQ 
WTLSRKRLSCPKEKTTRKKRNVNFQKAI 
NEKLGQYASPTAKRCCQDGVTRL.PMMRS 
CEQRAAR VQQ LDCREPFLSCCQ FAE SLR 
KKSRDKGQAGLQRALEILQEEDIjIDEDD 
IPVRS FFPENWLWRVETVDRFQ I LTLWL 
PDSLTTWEIHGLSLSKTKGLCVATPVQL 
RVFREFHIjHLRLPMSVRRFEQLELRPVL 
YNYLDKNLTVSVHVS PVEGLCLAGGGGL 
AQQVLVPAGSARPVAFSWPTAAAAVSL 
KWARGSFEFPVGDAVSKVLQISKEGAI 
HREELVYELNPLDHRGRTLEIPGNSDPN 
MIPDGDFNSYVRVTASDPLDTLGSEGAIi 
SPGGVAS LLRLPRGCGEQTMI YLAPTlaA 
ASRYIiDKTEQWSTIjPPETKDHAVDLIQK 
GYMRIQQFRKADGSYAAWLSRDSSTWLT 
AFVLKVLSLAQEQVGGSPEKLQETSNWL 
LSQQQADGSFQDPCPVLDRSMQGGLVGN 
DETVALTAFVTI ALHHGLAVFQDEGAE P 
LKQRVEAS I SKANS FLGE KAS AGLLG AH 
AAAITAYALTLTKAPVDLLGVAHNNLMA 
MAQETGDNLYWGSVTGSQSNAVSPTPAP 
RNPSDPMPQAPALW I ETTAYALLHLLLH 
EGKAEMADQAAAWLTRQGS FQGGFRS TQ 
DTVIALDALSAYWIASHTTEERGLNVTL 
SSTGRNGFKSHALQLNNRQIRGLEEELQ 
FSLGSKINVKVGGNSKGTLKVLRTYNVL 
DMKNTTCQDLQ I E VT VKGHVE YTMEANE 
DYEDYEYDELPAKDDPDAPLQPVTPLQL 
FEGRRNRRRREAPKWEEQESRVHYTVC 
rWRNGKVGLSGMAIADVTLLSGFHALRA 
DLEKLTS LS DRYVS HFETEG PHVLLYFD 
SVPTSRECVGFEAVQEVPVGLVQPASAT 
LYDYYNPERRCSVFYGAPSKSRLLATLC 
SAEVCQCAEGKCPRQRRALERGLQDEDG 
YRMKFACYYPRVEYGFQVKVLREDSRAA 
FRLFETKI TQVLHFTKDVKAAANQMRNF 
LVRASCRLRLEPGKEYLIMGMGLDGATY 
DLEGHPQYLLDSNS W I EEMPS ERLCRST 
RQRAACAQLNDFLQEYGTQGCQV 


9265 


23166 


A 


9334 


46 


324 


SKYLPPVITFQHEIWVGTNIQTISDWVL 
I CCPGWS AVTQSRLAAASTSWAQAMSS S 
S CLGLS S S C YYRHTPP CLANFVAFVETK 
LYTMLPGLS . 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyialanine, 
G— Glycine, H— Histidine, I^Isoleucine, 
K=Lysine, L=Leucine, M=Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9266 


23167 


A 


9335 


338 


178 


EVDIAVSHRRKQAFNLCLLKCKPHNNCL 
NLEGGGS S ELKLCYCI PAWATDQSL 


9267 


23168 


A 


9336 


396 


254 


KLSTDCTSKEI INRHPTEWEKIFANYAP 
NKGPTSTIHSSCLMFPYLF 


9268 


23169 


A 


9337 


454 


359 


SSWDYRHAPPCPANFVFLVETIFIQGEI 
GLL 


9269 


23170 


A 


9338 


540 


328 


I KKRGAPRRGVLFFFTRKKKFFS YYKKS 
TKERVSFLFVAPGCIQGGKKKKPPTKKK 
KKKKKKKKKKKKKKK 


9270 


23171 


A 


9339 


2) 


209 


AEEFPDFKLYYKANQNTYRGQWNRTETS 
EITPDIYLGIQLVSNSWPQVILLPWPPK 
VLGLQA 


9271 


23172 


A 


9340 


345 


2 


KIFFFLIYILFFFNFFFFFFFFFFFFFF 
FFFFFLENRWFbMENFIKGRGHVKGPLG 
QARDHDISGRRWWRREQCRASQLPGVRR 
SLLQVQAGGPVH I R I SGLQDARQGPR VA 
GA 


9272 


23173 


A 


9341 


15 


252 


LERVI RQSAALTDS FFNF I LL I LFIYLF 
FETGSPCVSQAGLKLLGSRDPPVSASQS 
ARI IGVSHRAQPALTDFPS ISF 


9273 


23174 


A 


9342 


484 


276 


RHA P P CQ AN F FVQ TR FCHVAQS GDKL I R 
SRYLFASAFQSVRITCMSHCARPLYFLM 
FKRLQYliKSPTKS 


9274 


23175 


A 


9343 


464 


326 


LVETGPHHVGQAGPELLAPCDPPAPASQ 
S AE TTGMSHRAR P P ANFV 


9275 


23176 


A 


9344 


486 


280 


SFSHSHSCSLiPSSWDYRCTSSHLANFFF 
KRWGFTILPRLILNSWPQVILLPQSHRH 
EPLCLETLKTLNV 


9276 


23177 


A 


9345 


276 




DRDFTMLPRLiVLNSWPKTILiLPIjPPKIL 
RLQS 


9277 


23178 


A 


9346 


45 


143 


RWNSRQENCLNLGGGDCSEPSLCHCPPA 
WATE 


9278 


23179 


A 


9347 


229 


73 


PPPHMIFFFFFFLFFFYFFFFFFFFFFF 
FFFFFFFFSRLQVSVYCSSVLNKNVD 


9279 


23180 


A 


9348 


391 


139 


I PSLKRSSHLSLAKCWDYRHKPPCQVTF 
S FLNVR I YATKFFL S TPLAAWS KYEtTYLV 
FLLS FS S KY YLTP I S FLFLTHNVS SSIF 


9280 


23181 


A 


9349 


53 


191 


IYVTYIWYSEVSHAYNSSTLGGQGGRIT 
GAQEFKTSLVNIAKPLLY 


9281 


23182 


A 


9350 


373 


235 


EEGELLEPGVQGCSELSWCHCTPAWGTE 
QDHCVSYNNSNTHNNVNY 


9282 


23183 


A 


9351 


476 


82 


YLATYLPTYLSIYLPTYLPIYLPIYLYL 
GSHYIAQAGLKFLGWVGHNGSCL 


9283 


23184 


A 


9352 


286 


142 


ELQFKMRFGWRHSQTISWGLSLLPRLIS 
NS WAQV ILLTWPP KVLG I QA 


9284 


23185 


A 


9353 


3 


102 


FFETESHSVARLEYSGAISAHCTPVIKL 
YLYT 


9285 


23186 


A 


9354 


3 


270 


TAGAPPRPANFCIFSRDGVSPCWPGWSR 
TPGLRCDPPASASQLLGRLRHENCIiNLG 
GEGCTGAILAHCTSTIjRAQAILQPQPSC 
PAPQ 


9286 


23187 


A 


9355 


1 


112 


GEGSAHCNLCLPGSSNSPASTSQVAGIK" 
GVSHRARPN 


9287 


23188 


A 


9356 


1 


63 


VALIQAYVFTLLVSLYLHDNP 


9288 


23189 


A 


9357 


244 


140 


QRRHTHTHSHTYTHTHTHIPNKKIPILT 
ILKCIA 


9289 


23190 


A 


9358 


464 


181 


LGAGGMTGILFSLGASMVJjGGVAQMLAP" 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireoninc, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KARTR R I Q TTDNGKQN I"Y FSS LDNMVAQ 
GNVLP VLYGEMR VG S RWSQE I S TADEG 
DGGQVWIGR 


9290 


23191 


A 


9359 


11 


469 


IDLRCRFRHEERERERERERERERERER 
EREREREREREREREGRAPPPFljFFGGG 
GPPARAHALSPPHLGRPPLYTYFFSYVR 
APPTER S ARTQRE ARRALKS VC WRERE 
THSLPRARCVCSHPTGCGGAPGARVSLS 
QSAHCFIiCVCGAG 


9291 


23192 


A 


9360 


22 


172 


GLSHEERERERERERERERERERERDTV 
CPLFCPTGGFILREARVKSGCLCF 


9292 


23193 


A 


9361 


15 


345 


PPNRPPAEFRTRHLISSLLCLEGIILSL 
F 1 1 ATL I TLNTHS LLANI VP I AI LVFAA 
CEAAVGLALLVS I SNTYGLDYVHNQKKK 
KKTWGGGRAPNS P YGGRNNNPRPAV 


9293 


23194 


A 


9362 


61 


298 


SCDSPLQECCIHTHTHTHTHTHTHTHTH 
IiFSDIKTHTWREWALSLFLLTEEEGGGT 
LFQGTLIFGHTLFWDKPLWQKEF 


9294 


23195 


A 


9363 


430 


2 


GGAGPPPPCWTQGPGVPPPTLSPRVNGP 
SLCSDFPSPKFYKEKRKKKGNKTPKRKG 
KTLIiPTETIVLHVSPGCESKPGAGSADR 
RAPG VRDGG S DAS S S GTRAS E LG DGP VR 
VNFVLGDPGEAGRGFSMLGHPPGTSGPR 
TS 


9295 


23196 


A 


9364 


432 


291 


GITGLRHHSQLIFLFLVETGFHHVGQAG 
LELLIHQHRPSKVLRLQV 


9296 


23197 


A 


9365 


343 


507 ■ 


GQGLTLVPQVGVQWGNLSSLQLLPPG1.K 
QSTCLSLPKCWDYKCGLPPPASFCFVL 


9297 


23198 


A 


9366 


417 


128 


MGWWPGRGAPPFIiDEMAAGQRRSSLSRV 
GSQAEGLLTSHTKGGPAETLLTSQTGWR 
LGRGCNLGSLEGQGRRLGGGGCNE PSSR 
HCTPAWAPLSTE 


9298 


23199 


A 


9367 


500 


285 


SLPRSWDYRRAPPRPANFLFLVEMGFHH 
VGKAVLELLTSGVELLTSGDLPASASQS 
AGITGVSHRAWQIRE 


9299 


23200 


A 


9368 


1 


229 


IVDL.TTWGIPTHTIHKYTHHIHKYIHTH 
AYTHHTHRHKHI YTY I HTDMVWL.SPHPN 
L.ILNFSSHNPHVSWEGPNGR 


9300 


23201 


A 


9369 


451 


88 


KQTNIYTCIYMYVYI YTCIYTYVYIYTR 
T YT CI YVC VY TC I YVC VYTC I YVC VY TC 
IYVCVYTCIYVCVYIRVYTCVCIYVYIR 
VCVYTCIYVYVYIRVYTYICIYTCISTY 
IQCIYTHIYKVTKCSG 


9301 


23202 


A 


9370 


1 


208 


ADET I EQRGG I PKFVHDNYVI RRTAVDR 
G I PLLTNFQETKLFAEAVQKSRKVDS KS 
LFHYRQYSAGKAA 


9302 


23203 


A 


9371 


354 


202 


PAAQREG I TASARYS KKWVTPGDPAREF 
TMIQSAPLMLLADPDE FVSVQLA 


9303 


23204 


A 


9372 


1 


336 


KSLIKYKGYQVAPAELESILLQHPNIFD 
AGVAGLPDDDAGELPAAVWLEHGKTMT 
E KE I VD YVASQVTTAKKLRGG WFVDEV 
P KGLTGKLDARKIRE I L I KAKKGGKS KL 


9304 


23205 


A 


9373 


2 


187 


NILVTALYSLYIFTTTQWGSLTHHINNI 
KPS FTRENTLMFIHLSPILLLSLNPDII 
TGFSS 


9305 


23206 


A 


9374 


96 


240 


ACWLMPVI PALS EAE AGRLVQPRRLRAA 
WATWQDLISTKKKKKGGPF 


9306 


23207 


A 


9375 


3 


293 


STISAFTTSTTSATTTSTTSATTTSTIS 
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NSDOCID: <WO 0164835A2_L> 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alaninc 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=Plienylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, i^=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














APTTSTTLSPTTSTTSTTITSTTSAPIS 
SNKLHKKKKKKKKQKKKKKRKNDDTKQD 
TITKIHSGGGSL 


9307 


23208 


A 


9376 


69 


160 


EHTHTHTHTHSHTHTHSHLPERFFCVLN 
SL 


9308 


23209 


A 


9377 


336 


199 


EGITASARYSKNWVTTGDPAREFPMIQS 
APLMLLADPDEFVSVQLA 


9309 


23210 


A 


9378 


321 


128 


EPGFTMWASLAPTSSL.QVIHRLGLPKCW 
DHRREPPHPAYHSLYESGPSVQNGEEKG 
INPIIKMG 


9310 


23211 


A 


9379 


440 


286 


S SDPS AS AS KSAGFTGMSHHTQLLTS YS 
EGECLSYLVFLVLV 


9311 


23212 


A 


9380 


392 


154 


KGSSPLSPPISI1AGGKPPPPRKFFL1NFF 
FFFNRDRVSPCLPRLVSNSWPQAILPPQ 
PHKALGLQAYATTASFIFLIVTV 


9312 


23213 


A 


9381 


403 


178 


GRRHEPCRDADAHLEGIIFSAFYPKNWE 
TTGDGAREWTMSNSAPLMLLADPDEFVS 
VQLA 


9313 


23214 


A 


9382 


303 


206 


DPVSTKICSLQPPPPGFKRFSCLSLPSS 
WDHR 


9314 


23215 


A 


9383 


215 


116 


SNNFWCCCCCCCCCCLRWSLTLSPRLH 
SCGTM 


9315 


23216 


A 


9384 


2 


285 


ARHLLMQLSGGATL>AISTIDLPSTL>I I F 
TLLILLTILEIAVALIQAYDITLLGSLY 
LGDNT YKKKKRGAAFE ETKFT SAGMQS K 
NFFLGTLNWR 


9316 


23217 


A 


9385 


12 


123 


GYENFLLPTLHTQTHAHTHTHTHTHTHI 
HSKDPMEY 


9317 


23218 


A 


9386 


3 


230 


KCWGYRCEPLRLAPLRCFFTPGGISLTS 
GPALLSNASALAEVRGQQLiCPPTPFLPT 
LIALTTLLLPISPFILIIL 


9318 


23219 


A 


9387 


590 


396 


ARVQWPDLGSLiQPPPPGFKRFSCLiSLLN 
SWDYKCAPPCRADFCIFSRDGVLSCWPG 
WSQTPYLK 


9319 


23220 


A 


9388 


224 


79 


GLGPVGHACPTLWEGEAGGSRGQE I EAL 
PANTVKPRFLKRKKKGKKKG 


9320 


23221 


A 


9389 


249 


72 


LVYLITVKKILFPVHIRTFNKIDHMIGH 
KTCLNKLQMTE I LQ S VFSDYNR I KLiEAS 
NW 


9321 


23222 


A 


9390 


400 


254 


LRQPGSTNSPVSASRVAGITGTHHHTQI 
I FVFLVEMWHACNPS YSGG 


9322 


23223 


A 


9391 


3 


64 


PPQKSDPPYKLVPVHLDSIL 


9323 


23224 


A 


9392 


266 


106 


L.TANFLFL.VETGFPHVTRVGLEI/LASSD 
LPTLASQRAGITGVNHLQNFWPGNF 


9324 


23225 


A 


9393 


430 


348 


FGKPQPPPKGKSPPPRGNFPPQKAPKG 


9325 


23226 


A 


9394 


4S1 


311 


LSLSEKRS IRVGVTRFSRCCPSPLSLTG 
KGNSLTPGVSQEIHSPPWFGIAYSVCTH 


9326 


23227 


A 


9395 


3 


136 


VFPRLVLNSWAQMILLCWPPKVLGLQAC 
TTRPSHFLPLLNS KYK 


9327 


23228 


A 


9396 


143 


347 


NFVFLVETRFHHVGQAGLTL.LTSSDPAS 
QSAGITRVRPEILTLKRYHFPSGILGTE 
GMVIEMAYLL.KV 


9328 


23229 


A 


9397 


2 


231 


liELLTSSDLPASASQSAGITRGSHHARP 
SFTFRCVIHLEFNFMRGAREERLHFYCR 
WVIRRLTTFGRGYPSPAASD 


9329 


23230 


A 


9398 


383 


200 


QHYL.NRRLSLAQWWMPWPATQEAEAGG 
LLEPRSSSLQCAMMLPVNSHCTLAWATH 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

Uig "to* first* 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsolcucinc, 
li = L*ysine, L—Leucine, jVf = Methion/ne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














REPHL 




2323 1 




9399 


3 


416 


GFLH VGQAGLKLLTSGDLPTS AS QSAG I 
I GVS HRAGCL KSIYLTFP TRKFF VQM S I 
CDFKVEIEGKIGFISVPISLQSILHIVN 
NFFFFLETGSFPPGWRAWEKFGLIAPLiA 
SGVKGIFLiPLPRENLGWRPSPIPPA 














FGYPQVFPLiPPFLKPGPGIFFLiGPLKKN 
FPFPPPGLKIFSFKRAPPFFFFVRKGLI 
MLLRL VSNSWAQV I I PPPPPEVLGLQA 


9332 


23233 


A 


9401 


324 


154 


FFFFFFKXXXXXLLFFFFFFFL.FYIFFF 
FFFXFXFXFFFFFFFFFFFFFFFFFFFF 


9333 


23234 


A 


9402 


413 


312 


PKLVLNSWLQALL.LPRPPKVL.GLQARAT * 
ITQPK 


9334 


23235 


A 


9403 


363 


3 


FKKKNFCLGGAGWFFFFYDPPPKKKI F 
YFPSKKIYFLSPASSSFFFFIiCFPFFIF 
FFFFFFFFFFLENRWFLMENFIKGRGHV 
KGPLGQARDHDISGRRWWRREQCRASQL 
PGVRPRV 


9335 


23236 


A 


9404 


401 


82 


SELPHDPINPTTGYIYPKKMKSVCFTDI 
CTPMFTTALFSIAKIWNQPKCL.SKRITI 
FMFLVTYSQLFCTHYPILPYPFFPAPLS 
S SHN YHYTLYLYEN I LLDST YE! 


9336 


23237 


A 


9405 


382 


219 


WARWHGFLiVPTSPGGEAGGSLiEPGKRRV 
QLTQISPIjPSSLGGKVGLPLKKXKNN 


9337 


23238 




9406 


3 


195 


YLYSLVETGFHHVGQAGLELLISRDPSA 
LASESAEITGMSHSIQPKSAFYIQIPET 
FWQPAFK 


9338 


23239 


A 


9407 


371 


1 


FRRLRLKNRLNLGNGGCSEPRLiCHCI PA 
WETRAKIiGRKKKEERKREKGTKMYNKMY 
KMYLELNLTCPEPAS PKQMTRQLE VLS S 
NPSPGTFQFICKKQFKFSCCFAXiRVSI Y 
LLMSVFEYCSC 


9339 


23240 


A 


9408 


326 


229 


RQGLTMLPRLVSNSWAQVILPPWPPKVL 


— — 





— 





— 


— — 

31 


LESASFP PGKVAHTCNPNTLRGCAGWIA 
WAQKFKTSLGNMAKPCL1YKIK 


9341 


23242 


A 


9410 


367 


256 


HPGWEWWLVSVIPTLREAKVGRSLEPRS 
LRLA.WDTWG 


9342 


23243 


A 


9411 


2 


192 


VFL VEMG FLHVGQ AGL ELP I S GD P PA S Q 
S AG I TG VS HHAQALT FLN I P I N P SH P P K 
ACCSKQL 














KNCLNLGGGGCSELRLCHCTLTWATE 


9344 


23245 


A 


9413 


371 


301 


EDHPRMVILRLLRYIVRLVWRMH 


9345 


23246 


A 


9414 


486 


337 


AGLELLGSSNLPALTSHIAEITGMCHHT 
WLAYI FVCAFGWVLiLiIjDLCLAF 


9346 


23247 


A 


9415 


378 


254 


IVAHCSLDLLGSSDPIiTSASRVAGPTGM 
CHHVWLIVVFFVC 


9347 


23248 


A 


9416 


160 


309 


WRYSSNEFIiLFLLFYLFKSNFTVFCCC 
YCCCCCCWRQSLTLPPKLECGV 


9348 


23249 


A 


9417 


162 


339 


RPSSFCEFLHDSVPELALPMKARFEKSR 
RSQVSWL.MPVIPTFWEAKVGRYQARSLI 
PA 


9349 


23250 


A 


9418 


1 


147 


GGGGCSEPRSCHCTPAWATRVKLGLFKK 
KKKINFLFVPPFIFLKSKTGV 


9350 


23251 


A 


9419 


283 


126 


KMMYKG QAQWL I P 1 1 PAFWEAKERGLLE 
VRSLRPAWATEQDS TATKNKNKNF . 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
Inst amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro!ine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, "^Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9351 


23252 


A 


9420 


133 


3 


ELMFSWAWWIMPVIPALWEAEAGGSRGQ 
EFKTSLPNMVKPRLN 


9352 


23253 


A 


9421 


3 


133 


LRGLYSGAGIIRFEAKKWNALAIjWAWDI 
AGDNCAICRNHIMDL 


9353 


23254 


A 


9422 


409 


291 


ILILGGGSCSELRSCHCTPAWATRAKHR 
LKNKSPINVY 


9354 


23255 


A 


9423 


I 


210 


RF FHLGLS KH WDYRCES PHAGTRNE VL I 
YAT I WMNSE N I VPNE I I QTQ RTN I VS F Y 
LYQVGRIGKFIDRR 


9355 


23256 


A 


9424 


479 


306 


GVPTNQPWSNRVKTPSPTKNIKISWPWW 
CMPCNPSCSEGQGRRIALNPGRWRLQ 


9356 


23257 


A 


9425 


1 


209 


GFFLFQGGGHHAGQAGLELLTSGDLPAS 
ASQS AG I TGAG PNQG I FVALES PMES AA 
L.LHLHWLSSPCS 


9357 


23258 


A 


9426 


628 


433 


RQGVAHLPGMECSGTITTHCSLELLGSS 
DPPASASQSSRLAWSHCAQPSIiLFSVI. 
CLKPIFCCA 


9358 


23259 


A 


9427 


417 


125 


FLRQSLTSSPRLEYNGAIPAHCNLCLPG 
SSDSSIIASQSIRITGVNKCTWPDAAVL 
RQNFFFSEKPQFYSEGFQQTHI IKSNLL 
YXiKSTKCTCYPYL 


9359 


23260 


A 


9428 


366 


256 


SFFYRDRSLATLPRLGSNAWSQAIJLPPG 
PPKVLRLLA 


9360 


23261 


A 


9429 


3 


140 


GFCHVGQAALKLLI S S DLPTS PSQSAGI 
TGVSHCARLTRLFSSHE 


9361 


23262 


A 


9430 


368 


3 


VRASQLliCLPTQASAMAGTPPPASLPPC 
SSISDCCASSEQGSMGVGPSEPGAGYNL 
WCS S LRTLEKCS I RVGVPQFSRCHLSQ 
LPLARKGNSPTPCASWVRRCLTLLQLML 
RGLYPLSCT 


9362 


23263 


A 


9431 


3 


131 


LTMLPRLVSNSWAQGSSHLSPPKCWDYR 
CEPLHSGLKDI FLE 


9363 


23264 


A 


9432 


384 


274 


LSFAMLPRLVLNSWAQESAHLGLPKCWD 
YRHEPLGLA 


9364 


23265 


A 


9433 


431 


273 


PPPLGYFFIVKMEFPPWQDGFKLLNSS 
DCSTSAS PSGRI SGVNHSTRPR I F 


9365 


23266 


A 


9434 


211 


473 


CVCTFFS FLSMHI FFLFIiVGEAESHCVT 
EAGVQWHDLGSWQPPPPRLKRFSCLSLP 
SSWDNRRVPPYPANFCIFSRTRVWSCWP 
GWS 


9366 


23267 


A 


9435 


391 


188 


LPGSSDSPASASRVAGTTGGCRHTVLLG 
FEFKDNI RWIKLVE KAHTQ YGN I YI CVI 
KYMPLFYPYHH 


9367 


23268 


A 


9436 


331 


115 


SRDRGFAMLPRLVIiNSWTQAICSLLDSQ " 

SAHLSLPKCWNYRREPLCPASGEILSFI 

KPSSSKSFHNSTKKRQ 


9368 


23269 


A 


9437 


3 


109 


IYGSYSLPACFFPSFLPSFLPSFLPSFP 
PFFLSFF 


9369 


23270 


A 


9438 


2 


167 


ECNGAISGHCNLCLSGSSDSPAPAWATR 
VKLRLIKKKKKKKNQ I LFFYP PRKTLF 


9370 


23271 


A 


9439 


648 


310 


VCPCI LRH PAFLFAKS GNPLE SSLLDHS 
SPTFKLTDRENLRHVKCTSHSQS LPTSG 
FQPGLPDSTLCAKYPTRS PKLS PAGWVG 
GYRAP VGRRG PGS RNS WRLHWT P I TS VG 
D 


9371 


23272 


A 


9440 


1 


170 


YEIWFLDVGQAGLELLTSGDLSASGSQS 
AGTTGMSHHARPLKLFTISVSSNKKVKC 


9372 


23273 


A" " 


9441 


133 


244 


LFGYQTRGIAMLLKLDSNSWPQAVLPPQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unkno>vn, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PPKVLGLLA 


9373 


23274 


A 


9442 


365 


176 


RGSMDtWPSESCTGYNLLMCPLLRPLEK 
RS I RVGVTRFSRCHLSQLPLARKGNSLT 
PCASQVR 


9374 


23275 


A 


9443 


368 


231 


VE AG F CH I AQAG LT FLS S TD P S ALAS RS 
ARITGVSHHARPI FLKE 


9375 


23276 


A 


9444 


88 


194 


KMLEAKKKIEQQGGFTFENKGVLSAFNF 
GTVPSNN 


9376 


23277 


A 


9445 


3 


349 


IRSGSYACYFCYPRECYALFHCYSMIVS 
YGPYYFMNIDQRHESRLSTIYIFKFEIG 
SKHVAQAGLELLCSSSPPAFTSQSAGIT 
SVSHHTRPQKIFNVIYSCDLHFHPGIEG 
PQR 


9377 


23278 


A 


9446 


3 


141 


HAECSSMMIAHCELELLGSTDPPASASQ 
SAETTGISYHAQPYSTF 


9378 


23279 


A 


9447 


2 


282 


ICPTRSRFPPRVLKNGRKKDRLECSGAI 
SANCSLCLPGSSDSPASASRVAGSTVAE 
EDS TRRAVRHQGS S RSAPSGRGS WAG KQ 
ERADRAQQS 


9379 


23280 


A 


944S 


307 


430 


GLAWWLTPGIPILWEAEVGGSLEVEFET 
SLASMEKPSLYYK 


9380 


23281 


A 


9449 


2 


253 


IQCVCVCVCVCVCVCLSVCLHFIAILGC 
SPWGSRLPCSVSSPSGRGPGGEDLSPLP 
PTMSVSPEADFPPLAAAAPGNIMTAIT 


9381 


23282 


A 


9450 


1 


166 


NTGGGGEEIRVFSSCFAEDRLQKGKDRP 
GAVAHACNRSTLGGRGGWITRSQIFFC 


9382 


23283 


A 


9451 


1 


123 


NTRGLVMLPRLVSNSWAQVILQPLPPKV 
LGLQAS VTVH INN 


9383 


23284 


A 


9452 


396 


291 


S LCI KRVCACACVCVCVCVSGAKLRKLV 
LDFMRS 


9384 


23285 


A 


9453 


1 


147 


FAIRARFSNTYTKIGTIHTYIYIYTYIY 
TYIYTHTHIYILSQVLMEYEF 


9385 


23286 


A 


9454 


394 


235 


TRSCYI AQSWQWLFTGTLI IHYNLQLL 
SSRDT P VS ASQYLGLQAHTTTPDS C 


9386 


23287 


A 


9455 


1 


85 


TNPVATLLEWQDALCFIPVGSAAPQGSP 


9387 


23288 


A 


9456 


339 


165 


KTHWWG KKS FLKNPFFFFFFFETKSRS V 
TRLECSGVILAPKQKTKKQNDKTTINKT 
I 


9388 


23289 


A 


9457 


3 


209 


LLILFSIKMRNRPDSPALLSTQLPPIFI 
YIHTKIYLYI YTHIYIFIYIYLYIYLFA 
YIHIYVSLFPLK 


9389 


23290 


A 


9458 


374 


156 


QAGLELLTSGDPPISASQSARITGLSQR 
AQLSELTLTVFTTRHCTCLCAHFLYYTV 
FLEGRNDDEFISIPST 


9390 


23291 


A 


9459 


287 


3 


EAGLLWRKATHKYAHTHTHTHTHTHTHI 
RGLQNMINCKYITYQEIMTLAQAKNKGV 
IPDSFPFTSS PPLTYP PALSFHFQKHI P 
NKTTTYHGHC 


9391 


23292 


A 


9460 


3 


72 


KTELKNKFTNI LES S FLMNNKS 


9392 


23293 


A 


9461 


327 


92 


ISLFLVWGALFFPKKI I YPPKJCKSPLFF 
PPPPPQENFNPPSFLAPPVFPPQENPKF 
AGGRKIFCPPPPVFFFYFFFFF 


9393 


23294 


A 


9462 


3 


173 


ACRGDSGG PLVTRDLKDT WYL IGIVSWG 
DNCGQ KD KPG VYTQVT Y YRN W IAS KTG I 


9394 


23295 


A 


9463 


258 


39 


LKVILPSDPSSIFVCLFLENESRSVTQA 
GVQWHDLGFKRFSCLSLLSSWDYRRVPP 
PPAKFCI FRKHGVSPC 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 

tastaminb 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon,/=possible 

nucleotide insertion 


9395 


23296 


A 


9464 


104 


336 


NHVSQCEEIFLFDEYYIFSRAVASMFWV 
DAELGSDIYLDGKLKNSFLLSFLGFFFF 
FFLGKKVFFFSQNGGLGRPGQ 


9396 


23297 


A 


9465 


410 


0 


LNLGGGGCSEPRLCPCTPAWATE 


9397 


23298 


A 


9466 


254 


61 


KKKKDAFFTI PLYPSSQNLFAFTQMDPD 
TNISQQLTWTVLPQGFRHSPPTFLDKPF 
KQTSINFA 


9398 


23299 


A 


9467 


403 


255 


PAPQLGGPPMMPMMAPPSSWMMPMGCAP 
AMRLPIGGDI IWL.CVPTQISP 


9399 


23300 


A 


9468 


1 


1428 


PARFKDLNIiAGTAEVGLAGYFMDHTVAF 
RDLP VRMVCS STCYRAETNTGQE PRGLY 
RVHHFTKVEMFGVTGPGLEQSSQLLEEF 
LSLQMEILTELGLHFRVLDMPTQELGLP 
AYRKFDIEAWMPGRGRFGEVTSASNCTD 
FQSRRLHIMFQTEAGELQFAHTVNATAC 
AVPRLLIALLESNQQKDGSVLVPPALQS 
YLGTDRITAPTHVPLQ Y I GPNQPRKPGL, 
PGQPAGPRVGYKALQPWPAQRLGPVGRP 
PCRPEGQPLSCCCCGCSCWGRLTRCLAE 
WTEVLTVGQVQRGPCTALLHKELCGTE 
PHGCASTEEKGLLLGDFKKQEAGKMRSS 
QEVRDEEEEEVAERTHKSEVQEQAIRMQ 

LWQRHLENGGDL.QKRVAEKASDKETAQF 
QAEEKGVRVIjGGDRSLiWQGAERGGGERR 
EDKPHHHHHHHQPEAEPRQEKEEASERE 


9400 . 


23301 


A 


9469 


265 


170 


GTESHSVTQAGGQWLNLSSLQPLPPGFK 


~9401 


„ Q2 


— 


Q . 


~7RQ 


___ 


RDLELASFLSFFPSFIjPSFFPSFLPSFL 
PSFLSFSFLiISFFPLPSFFSFISFSLSL 
FFE 


9402 


23303 


A 


9471 


67 


210 


TVTTGQAL.THAYNPSTLGGRGGQITRGQ ' 
DFETTLVPGDLNFSRVLASS 


9403 


23304 


A 


9472 


2 


186 


TSLWRQAKKNVEKKTDKYTEVLKTHGLL 
VCTQKSCSFLKNKNKAWIiGKQANAPARK 
VEIRC 


9404 


23305 


A 








FLSSFliFFFFFFFFFFFFFFFFFTV 


9405 


23306 


A 


9474 


430 


120 


ASQKKKKKNKKKKKKKKKKKKKGGAL 


9406 


23307 


A 


9475 


1 


150 


P TR P WQAGF KVLS S GN P P ALVS Q C FR I 
TGVSHRARPHLLNLRAPNLPVL 














PPPLFFNFGPFFLVEARLHHVGQAGLKL 
PITS DQ PDSAS QS AG FTGVS TRAQPTL I 
TLY 














LPWSPKMLGLQARATVSGLNKRYS 


9409 


23310 


A 


9478 


43 S 


277 


KRAPFFFFCRDRGLTMLPRLISNSWAQV 
ILLPWSP KMLGLQARATVSGLNNRYS 


9410 


23311 


A 


9479 


399 


279 


PPPPVFKNPPPKKNKGAPKKKNISPPPG 
RKKYFFFKGPP 














VSCRDEGLTVLPRLVLNSRPQAI LLPQL 
PKXJliDHR CE P PHIANHG 


9412 


23313 


A 


9481 


2 


176 


HAPPHPAN FLF FEE KE S CYVTQVS I KLL 
DSSNPPAliTSQSAG I TGENQHGHPMNCL 
SG 


9413 


23314 


A 


9482 


2 


178 


HAPPHPANFIiFFVEKESCYVTQVSIKLl) 
DSSNPPAliTSQSAG I TGVSHHGQPMNCL 
SG 
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SEQ ID 
NO: of 
nucleotide 
sequen ce 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
Od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9414 


23315 


A 


9483 


2 


151 


GRVGEGPELVSLLVL.DQHPVGTSDHRDA 
TQLGFQPLMSDLGSTGSTEGA 


9415 


23316 


A 


9484 


420 


325 


FFVFLLETGFHHVSQDGIiDLVICPPRPP 
NTFV 


9416 


23317 


A 


9485 


400 


104 


LFFFLFFFFFFFFFFFFWNPPKTFFKKV 
FGLNFPTLLKLKFNPPPPGPPKKKKKEL 


9417 


23318 


A 


9486 


117 


280 


S STNQ ANN CFF FFKKRGLA IL PRG ALN P 
WAKG I LLPWPPKGLKLRGGAI GPGQK 


9418 


23319 


A 


9487 


76 


353 


VIjWNTLE FCLI YLDIiLFLLFFNKEAL I R 
KKKKKKKKKKKRGGNKKEKKKKTPWQEK 
AGLFLGDTI IYSRGAVLKNSGGETPRHT 
PSGGLWPR 


9419 


23320 


A 


9488 


3 


122 


HASEDNVFLHCSHSDLGYVLLAGTASLQ 
DFSFFQKSPPP 


9420 


23321 


A 


9489 


1 


155 


PTRPTVCSGAMLAHCNLCLPGSSESLlAS 
ASQVAGTTACTSTPDPFFFFFSL 


9421 


23322 


A 


9490 


3 


212 


ETGFHHVGLSGFKLLTSGDLPTSASQSA 
GITGMCHCVWPQFAYFCLLLVYL.DHLKF 
TIIMLVLCLLFYD 


9422 


23323 


A 


9491 


2 


163 


ETLFRHVGQVS fe PLTSGDLP ASASQS A 
GITGVSAMPSWMIKFFIAFYSCINC 


9423 


23324 


A 


9492 


474 


296 


SQPAGTADCLLPLPPGSCFPTI pcpaal 
VPLPSFPSARLWPPQSLHQLLPLPGTCS 
SCT 


9424 


23325 


A 


9493 


7 


151 


LCAVEFVYMKP PAHEEAKAPS RGFWRY 
APWTASSSEKGRDDGMGPRA 


9425 


23326 


A 


9494 


52 


363 


TTAPS LS V FL FTFTG ELFVY W WVGVTLE 
TSFISFFLFFFSFLEKDFSFSPAGRPWP 
NFGLIQPLASGVGGISRFNPPSRWELRA 
LPPPPGGLGFFFWKKGGPPG 


9426 


23327 


A 


9495 


383 


118 


WKPRPWVPPGAGVQTPPGPQGETPLFL.K 
KKKKRGRGGPLFGPPSFGGGGGEIPLTP 
APKGQEIYFPPPPPPPGGKKKSLFQKKK 
KRIR 


9427 


23328 


A 


9496 


554 


419 


FFFETKPCSVSQAGVLWRDFGSLQPPPS 
QVQRDSPCPQASQVAWD 


9428 


23329 


A 


9497 


428 


281 


GSYVQLTFKYFS I Y I YVHTHLHVYLYTH 
I YVYVH I C I YKEG Y I TGSVLA 


9429 


23330 


A 


9498 


144 


16 


KVKKKVSQCPG I VAHNCNPSTLGGRGGR 
IAWGQEFETLSLQKN 


9430 


23331 


A 


9499 


2 


155 


NLALLGSSDPPVSASPVSGIRGVHHHAQ 
LIFFFFFGKRAWFNPPGGRKGEE 


9431 


23332 


A 


9500 


2 


184 


F I YGRDS LTMLPRLVIoNS W PQAI L PPR P 
PKVT.,GFRCEPPHLASLjILFYRDIL.FTI,G 
LKSV 


9432 


23333 


A 


9501 


465 


153 


PHKVFLNKG PFET pqgelsrgewktr p p 

pprersrpfhrkggfgsppklrqgppfv 
ffswppwcpppkkgarpqqkgffyfppp 
ffnppfffffclffifeiv 


9433 


23334 


A 


9502 


429 


11 


CRTLLGADCVCVC VCVCVC VC VKAFPP I 
SYSCHLVPSDLPLLGVQFKCWVHFGLSF 
LTCPFPSPPWPGGAQPGSCYWGIiGCLCL 
FACKLGARLTEQTLGI PASGPGTWATPL 
GSCAS VLRRLiPGGS ERDTDLKVS VLSG 


9434 


23335 


A 


9503 


436 


202 


LPGGYI FQGEGVEKTKFSPLLRENNFFS 
NFFPPKKNNGGGPRIPPPPPPPLFKKGV 
F FI PKFPAPPR P KKKFPKKKKK 


9435 


23336 


A 


9504 


3 


112 j 


S FDLL S S S DP P ALAS QS AG I TG VS HS HG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 

acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K^Eysine, E— Leucine, Wl— Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vatine, 
W=Tryptophan, Y=Tyrosine, 
X=l)nkno«n, *=Stop codon, /=possible 

nucleotide insertion 














QKVYICNM 


9436 


23337 


A 


9505 


426 


208 


VGHNLLVPRFLSPSEKRSIQVGVTRSSR 
CRLSPLSIiTRKGNSIiTPFASRVRQCLTIj 
LRLARAPTDLRPLSGTP 


9437 


23338 


A 


9506 


441 


189 


CPANFFVFLREMGFHHVGQAGLLTSSDP 
PASASQS AG I TGLS HCARPT VGSDVLSL 
PSGQCTSALTGFMLGSTTVHSNAAARA 


9438 


23339 


A 


9507 


117 


278 


NKALLCHPGWNHI KKVKGRLGMETHTCN 
PTTLGGQGQEFENSLGNIGRPHLYK 


9439 


23340 


A 


9508 


3 


126 


FFVEMGFRCVGRAGLRLLTSGGLPVLAS 
QGAGITGVSHCI 


9440 


23341 


A 


9509 


421 


134 


FLVERRSHHAAQASLEILSSSDSPTSAS 
QCAG I TGMSHCARPKYTSFFRIjHRHMPM 
ATPTKLPPTLASAS PQVKVCLYPLLPST 
SKGEIFTLQLY 














PQPPTFYPSFLPSTPEPPAFYPRLPPST 
PASRLLPQPPAFYPSACHLLPQPPTFYP 
SLGNRARLHL 


9442 


23343 


A 


9511 


414 


204 


ERDRKKERKKEGRKERERERKKEKKRKK 
KKERRKEGRKEGRKEASKGQRVWIMVD 
T 














FHHVG WAGLGLLTSGDL P ASAS QSAR I T 
GMSHHARHRHGFDNLSLCALAPSPTPLW 
RRHRRRGHFTVPEK 


9444 


23345 


A 


9513 


390 


265 


VFLVEMG FHHVGQ AGLE LLTSGDLPPLE 
SQSAGITGVSQQFS 














I G FR H VGHAG LD F L P LG D L P S LAS QN AR 
ITGVSHRAWPAMAFVNCHGAGGSVAMRT 
TRGHSCRHLGIGGFWPASLLQPVLSAGS 
L 


9446 


23347 


A 


9515 


126 


11 


RNLDFFLSLGIiAMLSRLVNS WPQVI FQA 
WPPKALRFQV 


9447 


23348 


A 


9516 


2 




FLHVGQ AG FVL PTSDDFRPTHTPQSSGI 
TGVSHRTRPLRCYFLRLNNI PLYVYAYY 
ILFIHSVISGYVGGCTAHYCHPRKW 


9448 


23349 


A 


9517 


403 


57 


PEGALVPQKKKKKKKKKKKKKKKKKKKK 
NIKI 


9449 


23350 


A 


9518 


406 


140 


PHYNHTFVYNGVRLEDLQHMCLELTVWD 
REPIiASNDFLVGVRLGVGTGE I PSPILT 
CLKASCLFYSYYHSRPPTQSSSVMTWF 
FPFS 


9450 


23351 


A 


9519 


3 


75 


G I E WMVNC WRNVHR P PR QRD1T 


9451 


23352 


A 


9520 


183 


6 


LLTHLILTSALIGEIMVAPILQKNSGSE 
RECRL PRLVXiNS W PQGILLP WP P KVLG S 
QV 


9452 












LQCLQKSGKLEPGMTYTKLIDADVNVGN 
ITSVQFIWKKHLFEDSQNKLGAEMVINT 
SGKYGYKS TFCSQD IMGPN I LQNLKPC 


9453 


23354 


A 


9522 


3 


329 


PSLMFPWLKIiSFPGRPRGRQTYSRYQTL 
ELKKEFLFNPYWTRKRRMEVFHALGLTE 
RQGKIWFQNRRMKGKKENTRDNLPEAQN 
EEKVEEKGIEEKEKEEEEKEENRD 


9454 


23355 


A 


9523 


1 


156 


AKTSMNVNE I FMAI AKKLPKNEPQNPGA 
NSARGRGVDLTEPTQPTRNQCCSN 


9455 


23356 


A 


9524 


1 


272 


AKNSARALLGILGYRSHLISSLLCLEGI 
ILSLVIIATlilTLNTHSLIiANIGPIAIL 
VFAACEAAVGLALLVS I SNT YGLDYVHN 
LNLLQC • 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argtnine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9456 


23357 


A 


9525 


1 


. 62 


GNDAWVCTNDMFRLALHNFG 


9457 


23358 


A 


9526 


1 


111 


GDI IHNENMRQGRDKYKTLRQIRQGNTK 
QRIDEFEAL 


9458 


23359 


A 


9527 


427 


340 


IPIGNIMSCDSHSHHELVMSAQLNAIFH 
D 


9459 


23360 


A 


9528 


439 


27 


PRRTDKWLTVEM I QDGHQVS LL.SGELTV 
DQRAS I I QRFRDGKEKVLIPTNVCARG I 
DVKQVTIVVTFDLPVKQGEEPDYETYLR 
RIGRTGRFGKKGLAFNMIEVDEL.PSPMK 
IQALFDGRNKQLNAEVPCI QAAMED 


9460 


23361 


A 


9529 


1 


236 


RE VL I TTNVCARG I DVKQVT I WN FDLP 
VKQGEEPDYETYIiHSIGRTGRFGKKGLA 
FNMIEVDELPSLMKIQDHFRKS 


9461 


23362 


A 


9530 


437 


240 


QQVSSARPEPEAGEVSPPVGAGVNSNSW 
AFKYGPGNPKQSGPEPKKQTQVSFLLRR 
KGEASQPRQ 


9462 


23363 


A 


9531 


413 


260 


LFDQWKVASPSGLRTSTVRYGDPEKLK 
LEE SLRQQAE KEG S ALS VR I SNV 


9463 


23364 


A 


9532 


431 


260 


ASSGQMFSVKQQAPPVRDLPPFGISLFG 
GQI YDNI S PRP PQG I VAP PGGRGN I PNL 

G 


9464 


23365 


A 


9533 


2 


134 


I HGVC VHMCVCACVCVYACVCVGGYDGR 
IFLALSSTVPSMVNIM 


9465 


23366 


A 


9534 


419 


53 


LLLLTNGDRQTQREKIEACACQSYFDAV 
WGGEQREEKPAPSIFYYCCNLLGVQPG 
DCVMVGDTLETD I QGGLNAGLKATVW IN 
KNGIVPLKSSPVPHYMVSSVLELPALLQ 
SIDCKVRMST 


9466 


23367 


A 


9535 


439 


109 


NCIVI IDSTPYRQWYESHYALPL.GRKKG 
AKVTPEEEEILYKNRSKKIQKKYDERKK 
NAKISSLVEEQFQQGKVLACIASRPGQC 
APADAYVIEGKELEFSLRKIKARKGK 


9467 


23368 


A 


9536 


1 


185 


NTTSSHHFTIPCL.VLYSCTSTHTHIHTH 
THTHTHTHVRVFSYIPSQDVKEGCIPAP 
SVFSW 


9468 


23369 


A 


9537 


168 


280 


NPNS IRLTSCCCCCCCCCWKSPYNVYNR 
LLiRTGFTFP 


9469 


23370 


A 


9538 


142 


324 


KIVKMGFPHAGQAGLELLTSGDTPGSTS 
QS AG I TGVSHRSRARS IFLYPYLPSVYL 
PWRRV 


9470 


23371 


A 


9539 


3 


139 


YIGGMGQAWWLMPGKPRQNDYLS PGVQD 
QPGQHSETPSLQKGFKN 


9471 


23372 


A 


9540 


383 


318 


MSTQNNIQSYP IGEDEESESD 


9472 


23373 


A 


9541 


372 


207 


YGLNMCRQCFRQYAKDIGFI KYGVCRRT 
PCCLGREGCGGYFFRERQTDRSSLEN 


9473 


23374 


A 


9542 


2 


360 


ARGDDDHRSSDRKDGVLEEQIERLQEKV 
ESAHSEQKNLFLVI FQRVIMILTEHLVR 
CETDGTSVLTPWYKNCIERLQQIFLQHH 
Q I IQQYMVTLENLLFTAELDPH I LAVFQ 
QFCALQA 


9474 


23375 


A 


9543 


379 


75 


TNLQQFAMSHGSAQVKGHGKKVADALTN 
AVAh-V.DDMPNTLSALSDLHAHKLRVDPV 
NFKLISHCLLVTLAAHLPAEFTPAVHAS 
L.DEFIAS VS TVLTS KYR 


9475 


23376 


A 


9544 


380 


56 


RDLSMVRMKSMFAIGFCFTALMGMFNSI 
FDGRWAKLPFTPLSYIQGLSHRNLLGD 
DTTDCSF I FLYI LCTMS IRQNIQKILGL 
APSRAATKQAGGFLGPPPPSGKFS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

locntion 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, ]Y1=Mcthioniiie, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Ar<»inine, S=Serine 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 


9476 


23377 


A 


9545 


304 


92 


SKSRVTNNTLRSTRKDTMITLNTHSLLA 
NIVPIAI I VFTACEAAVGLALLVS ISNP 
YGLDYVHNIjNIjLQC 


9477 


23378 


A 


9546 


394 


108 


KEIDFSLVDVTADDAGNYSCMYYQTKSP 
FW AS EPS DQL E I L VTVP PGTTSSNYSLG 
NFVRLGLAAV I WI MGAFL VE AWYS RNV 
SPGESEAFKPE 














YHLHRNQ S F C P TVNLDKLRTL FS E QTR L. 
NAAKNKTGTAPI IDAARSVYYKVLGKGK 
LPKQPVIVKAKFFGSLIFCFRIWILID 


9479 


23380 


A 


9548 


326 


188 


NFLRMGTVAHASNPSILGGL.GGRIAGAQ 
EFETSLGNITRPCLYFKK 


9480 


23381 


A 


9549 


381 


152 


VTL I S LS LLLQ WIGVbLW I ARLNLNE 
VEKQWRLNQLNWAATILVFFTWINVFI 
TAFGAHKTGFLAARASRNPL 


9481 


23382 


A 


9550 


170 


65 


IHRSKNKWKFHLKDGIMKLNGRDYIFSK 
AIGDAEW 


9482 


23383 


A 


9551 


544 


361 


RRLRQENRLNPEGGGCSKPRSCHCTPAW 
ATEQDS SSKEKKE RKNI.KGNTVKS VPLP 
LTSHA 


9483 


23384 


A 


9552 


369 


181 


GVLGATSGDMQIHTFGSMIIGSITGMVS 
HQQKKSY 


9484 


23385 


A 


9553 


1 


516 


S LVHWE FGQAI ARRLKP FGVQRFL Y TG 
RQPRPEEAAE FQAEFVS TPELAAQSD F I 
WACSLiTPATEGLjCNKDFFQKMKETAVF 
INISRGDWNQDDLYQALASGKIAAAGL 
DVTSPEPLPTNHPLLTLKNCVILPHIGS 
ATHRTRNTMS L LAANNXiLAG LiRG E PMP S 
ELKL 


9485 


23386 


A 


9554 


386 


56 


IiFPACVIjRWQFASDGGDIGFGVFIjKTKM 
GEQQSAREMTEVLPSQRYNAHMVPEDGS 
LTCLQAGVYVLRFDNTYSRMHAKKL.SYT 
VEVXjLPDKASEETLQSLKAMRPSPTQ 


9486 


23387 


A 


9555 


375 


150 


I WGKTS FHVS I PL.S LFFPGDVL YDLL.Q 
HILKQSKPRILFSPFFHPGNSIHTQPEV 
I LHQTHE EGTGRGFS FLA 


9487 


23388 


A 


9556 


3 


354 


CLPTQASAMVDTPPPALLPPCSLISDCC 
ASNEQGSVGIGPSEPGAGYNFLVCRLL.R 
PLEKHS IRVGVTQFSRCCLSPPSIjTRKG 
NSLTPCTSRVRRCLALLQLTLSELPHCP 
VPSV 














PIKVEDPEPVKKPPKEQRS IKEM^FITC 
DEFNGVPSYMKSRLTYNQINDVIKEINK 
AVISKYKILHQPKKSMNSVTRNLYHRFI 
DEETKDTKGRYFIVEADIKEFTTLKADK 
KFHVLLNILRHCRRLSEVRGGGLTRYVI 
T 


9489 


23390 


A 


955S 


338 




NFKKKKKRRTKKKE I RTEEKGRGANKKT 
FRG PKKKETRG SNRS P EGKKNHGAKKEG 
ERRPRKKKKKKKKIARYDFTCL 


9490 


23391 


A 


9559 


1 


152 


NTCVC VC VYTG FCHVAQVAH ELSSSNLiP 
SSASQSAGITGISHHAWPGFGF 


9491 


23392 


A 


9560 


517 


238 


SGPGESVPHPWSAVRTRPAKKPGSQCLL 
QSAFPLWCS SRRTTWAEERKLNTETFGV 
SGRFL.RGRSSRGGFRGGRGNGTPRRNPT 
SHRAGTGRV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
elh 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9492 


23393 


A 


9561 


709 


280 


NTSPALTAPCGRPRGIRPMQCWARVPAP 
SSAC PAWQLP S LCLRSTVPAPHPS PAAH 
QPCELSQHHRGGLEPALPGTRQRLCSRT 
WHRCVRFbPGAQEAGELGPAACATGRLP 
VSHSTDMEQAPGLTSLHQEPHSTARGGT 
ATH 


9493 


23394 


A 


9562 


3 


130 


YRGFCHVGQAGLELLTSGDRITSASQSA 
EITGVSHRAQPQMS 


9494 


23395 


A 


9563 


409 


160 


NPGVKRS P CL.SLSS SGGYW SMP PCLAY Y 
KKKKRRFLKDGGLTMLPRIiASNSCAQQF 
SHFGLPQSWDYRHEPLHIAQFFLFSLL 


9495 


23396 


A 


9564 


1 


241 


GICLGSNIGSGWMGTSSLCNPYSNTLNQ 
NLLSTTKPFPVPSVGTNCGIEPWDSGVT 
SGLGKMI1FL.YYYYLLL1FSQMTPQS 


9496 


23397 


A 


9565 


2 


215 


IHWMNGNMDGWMDGWMGSCVRAVTGPGT 
SSPHTSTCGHAVCVCVCVCVYTCAHVFL 
ALFGVEKNNYFLWKA 


9497 


23398 


A 


9566 


244 


117 


DHQHLNWS VF ISFS FKVI KARKNKKG I L 
NPDSSMETSPDFFF 


9498 


23399 


A 


9567 


414 


77 


FGGVPPGTLVPPTWLPPVRVPPPPAFCL 
GELSPPPFFWRKGLAPPLLNFFFSPGQR 
GFPSPGGKGPRPLPRPAGFPGFVAKPPF 
RPPGFSSTPSDCGGLPFFFFFFFKKKVF 


9499 


23400 


A 


9568 


418 


198 


PPVSLTEVIVSKSHKAELVFWNMPGTLE 
HPEGDNNYMEF I E VL.TEALERVLLVRGG 
GSEVITIYSYPTLNARA 


9500 


23401 


A 


9569 


549 


415 


RQENRLNPGGRGCS EPRSCCC I AAWATE 
GDCLKKINNNKPKNEQK 


9501 


23402 


A 


9570 


546 


1 


KLECRGAITAHCSLNLPGPGDPPTSATQ 
AAGTTGTCH 


9502 


23403 


A 


9571 


415 


215 


SRLRKKI Y I Y I YMC VCVYVY I C VYI C I Y 
VCVYICIYVCICVYIC1YVYI CVYMYI Y 
MHEFITFFSI 


9503 


23404 


A 


9572 


374 


138 


LiAASAA I QELFKR I S EQFTPM FRRKAFL 
H W YTGEGMDEME FAE AESNMNGL VS E Y Q 
Q YQDATAEE EGE FEE E AEE EVA 


9504 


23405 


A 


9573 


370 


233 


GS P P VRS VHQSGVS LS GTQVDEGVRSGS 
KRMVAPPGGRYNITSLS 


9505 


23406 


A 


9574 


3 


399 


YSSPGPIALALRDPCAHGGPSPNSSLSP 
AQGHPARAQQGPQPQQGPPSHLGSDLRP 
HVAHTRGAIjHG PA PAPAS PAP GGSG SGV 
AASMCSAPRPGRAGGTLGSVLTAHHPAS 
AGCAHRDAYVCKAPGRAVF 


9506 


23407 


A 


9575 


314 


48 


SILPHSDPERCFSIQPEEGTIHTAAPLD 
REARAWHNIjTVLATEIiDSSAQASRVQVA 
IQTLDENDNAPQLAEPYDTFVCDSAAPG 
QVSN 


9507 


23408 


A 


9576 


505 


264 


IFWEMEFHHFGQAGLRLLTSGDMPASA 
SQSAGVTDMSHCTRPDTPFNFKELPGAV 
VTFLLCGMQVGRDELEDDWMGAWS 


9508 


23409 


A 


9577 


508 


314 


PATWKGCLSNFSSYL.TKPADFIEHQVLS 
WE QVPDG FF I FNE RFKS FTVWLNNVAE 
FVCKYKLL 


9509 


23410 


A 


9578 


512 


1 


RMREAIFSPGRLPRGAAPLRGPAGTLVP 
WMPTPGECWLSLSVSAAGEKPYKCSVCE 
SAFNRKDKLKRHMLIHEPFKKYKCPFST 
HTGCSKEFNRPDKLKAHILSHSGMKLHK 
CALCSKSFSRRAHLAEHQRAHTGNYKFR 
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WO 01/64835 PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO. in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

Iocafion 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


1 Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G = Glycine, H— Ilistidine, I— Isoleucine, 
K=Lysine, L=Leucinc, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CAGCAKGFSRHKYLKDHRCRLGPQKDKD 
PV 


9510 


23411 


A 


9579 


3 


487 


WDLIYVGRKRMQVEHPEKAVPRVRNLVE 
ADYSyWTLAYVISLQGARKIiLAAEPLSK 
MLPVDEFLPVMFDKHPVSEYKAHFSLRN 
LHAFS VE P LL I YPTH YTGDDG YVSDTET 
SWWNWEHVKTDWDRAKSQKMREQQALS 
REAKNSDVLQS PLDSAARDEL 


9511 


23412 


A 


9580 


1 


840 


SKYLYDEGEEEEVFNDEWGAAGGVAVPT 
LNALE RG FLS AMD WHL YTD PRE I FE VL S 
WLESCVAEQQGRWRGWYTYTDLCVLLEQ 
PTWQLALGSLCQRLVKIjSCIjLAVAYVS s 
VAIAVASVAVIHQSLGLSCTPTPGPPDL 
GXjTSRCLLEPCIPSVPQCLPSLANVSSC 

legsmglrslwgsliasltppplpppdp 
papptllhnchlcqklqrdsptchaclh 
pnrtvptalsspwyhtyglappwpwspv 
plslpqpqqcslfsvmeiariiksfvfpg 


9512 


23413 


A 


9581 


2 


158 


IHAGFISVSVLDNI I YICMCVCVCVCVC 
AHMCAC I S E H I THAP R3JPEVF YT P 


9513 


23414 


A 


9582 


3 


144 


YTAGVISVSVLDNI I YICMCVCVCVCVC 
AHMSACFSEHITHAPMLP 


9514 


23415 


A 


9583 


503 


3 


GKCVCLSAVWGSLLSAFRGSLSVCCLGQ 
FVCLSAVRGSFSVCCPGQRVCLSAVRGS 
VSVCIiSAVQGNVSVCLSAVRGSLFVCLL 
SGAVCLS FCLLSGAVCHLCRAPTPCL.SR 
SLAHRRCSVNVCEMTQLISVSYFFCWGW 
WRHRNNIHIiQKCRLRILMAYTGQGPP 


9515 


23416 


.A 


9584 


2 


1358 


AGLDS LHKFQVKI EKEKVYVRAS KQALQ 
LQRRTKVMAKCISPSAGYSSSTNVLIVG 
AETLRQEGFSDRIVLCTLDRHLSYDRPK 
LSKSLDTQPEQLALRPKEFFRAYGIEVL 
TEAQWTVDVRTKKWFKDGFKLEYSKL 
LLAPGSSPKTLSCKGKEVENVFTIRTPE 
DANRWRLARGRNVWVGAGFLGMEVAA 
YliTEKAHSVSWELEETPFRRFLGERVG 
RALiMKMFENNRVKFYMQTEVSELRGQEG 
KLKEWLKS S KWRADVC WG IGAVPAT 
GFLRQSG IGLDSRG F I PVNKMMQTNVPG 
VFAAGDAVTFPLAWRNNRKVNI PHWQMA 
HAQGRVAAQNMLAQEAEMSTVPYLWTAM 
FGKSLRYAGYGEGFDDVI IQGDLEELKF 
VAFYTKGDE VI AVASMNYDP I VS KVAEV 
LASGRAIRKREVELFVLHSKTGDMSWLT 
GKGS 


9516 


23417 


A 


9585 


360 


239 


Q CRKGLGKL VS FGGLG I WS LIDVLLI GV 
GYVGPVHGSSYV 


9517 


23418 


A 


9586 


160 


1156 


MPRLTFAPKGWPHPPTSLHPGQVTDQTT 
WWLFQELPTPSENSMPPGLSTPTASQEG 
AGPVPDPSQPTRRQIRLSSPERQRLSSL 
NLTPDPEMEPPPKPPRSCSALARHALES 
SFVGWGLPVQSPCALVAMEKEEKESPFS 
SEEEEEDVPLDSDVEQALQTFAKTSGTM 
NNYPTWRRTLLRRAKEEEMECRFCKAQTI 
QRRLNE I EAALRELE AEGVKLELALRRQ 
SSSPEQQKKIiWVGQLLQLVDKKNSLVAE 
EAELMITVQELNLEEKQWQL.DQELRGYM 
NREENLKTAADRQAEDQVLRKLVDLVNQ 
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JSDOCID: <WO 0164835A2J_> 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino ncid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RDALIRFQEERRLSELALGTGAQG 


9518 


23419 


A 


9587 


419 


130 


RRDPHLPRIMGRTRIiNRKNAETRINAQLi 
PLTDKGRMARHVXiDNSGEWSVTKRQVIL 
LHTELERSLEYLPLRCGVLTGLAAIASL 
LYMLTHYLLPYA 


9519 


23420 


A 


9588 


2 


399 


FSCEFNMEANQCPLWEPSYPDLVINVG 
EVTLGEENRKKLQKIQRDQEKERVMRAA 
CALLNSGGGVI RMAKKVEHPVEMGLDLE 
QSLREIiIQSSDLQAFFETKQQGRCFYIF 
VKSWSSGPFPEDRSVKPRHV 


9520 


23421 


A 


9589 


19 


205 


LY CLVEWHDL CSLQ PL.S PG FRQ FSCLNL 
LSSWDYRHMPPRPTIKLSSYSSIKLIYY 
LFMLVL, 


9521 


23422 


A 


9590 


500 


407 


KLTNEVRSVKRSQLFPESFMCKEKLGNT 
Nil 


9522 


23423 


A 


9591 


727 


236 


RRSRGL.EGREALAX.CPGDGSHLDCRRTD 
SSFSFMAFFFTFMAQLVISIIQAVGIPG 
WGVCGWI ATI S FFGTNIGSAWMLI PTV 
MFTVMAVFS F I ALSMVHKFYRGSGGS FS 
KAQEEWTTGAWKNPHVQQAAQNAAMGAA 
QGAMNQPQTQYSATPNYTYSNEM 


9523 


23424 


A 


9592 


381 


278 


HPSVYKVASGLKEGLSLFGILNRCHCKW 
GE KliLR 


9524 


23425 


A 


9593 


412 


156 


SQRCLSLGCHEHLANAYAI I ICDNGFSA 
LLGPPLPGWIYDIAQKYDFSFYICGLLY 
MIGILFLL.IQPCIRIIEQSRRKYMDGAH 
V 


9525 


23426 


A 


9594 


263 


50 


KHAAP PAS LS LS IijjLHHGQ KRACFP FAF 
CRDCQLLEGSPAMIiPVQPAKLIjVLIjEQV 
CLLCALLIPPSGSRI 


9526 


23427 


A 


9595 


432 


309 


GTFSERGPPLEPRSQTVTVDFCQEMTDK 
CTTDEQPRKDYT 


9527 


23428 


A 


9596 


457 


29 


TLLPGWITAQASEGEIGASLPSSIjPPIiIj 
YLPSWRCSPSGALTHPAACPTGQRSSSA 
LLPSLPQAPPGFCGMTGLRPLPSPSVGC 
CSAPSPQLLCPTQLPPAPESSEGGCSES 
RCVANVKYTRDLGDFLEKCTNGEASALE 
YP 


9528 


23429 


A 


9597 


864 


407 


PSRRLSPTGRTTHGSSAACAPPPPRLER 
RSRTSRPPWGPPHSASARTQRACTTWYS 
RSWACSCSWITSRPSWMTTKRTCMIFST 
I PTTPRTALWRGLSbWKDAWLLTRLKLA 
PSSPTSMPPTTTCTACSSTWMTCGSPAR 
WASTPMPRSSTT 


9529 


23430 


A 


9598 


421 


145 


LTQHNGDAAASLTVAEQYVSAFSKLiAKD 
PNTILLPSNPGDVTSMVAQAMGVYGALT 
KAPVPGTPDSLSSGSSRDVQGTDASLDE 
ELDRVKMS 


9530 


23431 


A 


9599 


1013 


604 


PGRPTRPDICLLLERMEEQVKNVMKTFR ' 
EELYNIEKAFEVERQELLASNKKKWEQA 
LQAHNAKELEYLNNRMKKVEDYEKQLNR 
QRI WDCEEYNM I KI KLEQDVQ I LEQQLQ 
QRKAIYQL.NQEKLEYNLAGAEEER 


9531 


23432 


A 


9600 


409 


191 


HHVGQAGLELMTS S DLPSLASQGAG I TG 
VSHCARTVYGLS IVFFSSHRGFASVRTP 
PSNENFPDRPISIAHP 


9532 


23433 


A 


9601 


367 




PPRYSPPLGGVRGGVPPGPGVLAPPFPK 
GKPPFFFKNPNLPRPGGGAPYS PFFGGV 
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JSOOCID: <WO 0164835A2J_> 
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PCT/US01/O4927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutnmic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GRGISFNPGGQGSPGQKKPPGPPRRGTK 
GFFSKKKKKESTKLPVTSHTTHCWFLPE 
ALDLELDPLY 


9533 


23434 


A 


9602 


1722 


394 


GWNGSWNDNLVDTSPLKRDPLQDICRRY 
MEDLKKI CF YRELNS KTTL KFVHTS FHG 
VGHDYVQLAFKVFGFKPPIPVPEQKDPD 
PDFSTVKCPNPEEGESVLELSLRLAEKE 
NARWLATDPDADRIiAAAELQENGCWKV 
FTGNELAAliFGWWMFDCWKKNKSRNADV 
KNVYMLATTVSSKILPCAIALKEGFHFEE 
TLPGFKWIGSRI IDLLENGKEVXiFAFEE 
SIGFLCGTSVLDKDGVSAAVWAEMASY 
LETMNITLKQQLVKVYEKYGYHISKTSY 
FLCYEPPTIKSI FERLRNFDS PKEYPKF 
CGTFAILHVRDVTTGYDSSQPNKKSVLP 
VSKNSQMITFTFQNGCVATLRTSGTEPK 
I KYYAEM CAS PDQS DTALLE EELjKKI. I D 
ALIEN FLQPS KNGTGSGRS CLG VP PNT V 
MTLCGAYGNRATRRNCHTLEPCG 


9534 


23435 


A 


9603 


14 


356 


DFVERTQYTHTQTHTHTKSHAVRLSPRP 
VQSRPEASQTGPTRTKPALGPAHPSPRG 
APCPDDPGSLRMLGYGSRASHNSRRGRW 
GHGRCKDQRAMGPHSHMKEAALPPGALS 


9535 


23436 


A 


9604 


477 


264 


LVETGFLCVGQAGLELLTSGDPPTSASQ 
S AG I TG VS HRT WAALS T FTVL CNYH Y L F 
LEVFHHPKiiKLSPH 


9536 


23437 


A 


9605 


1 


378 


EGINFSDNIaRRCWMVGMPFPNIRSAEL 
QEKMAYliDQTLPRAPGQAPPGKALVENL 
CMKAVNQS IGRAI RHQKDFAS I VLLDQR . 
Y AR P P VIjAKL P A W I RAR VE V KAT FG P A I 
AAVQKFHREKS AS S 


9537 


23438 


A 


9606 


417 


254 


MVSLTQELCPVAMRVAEGHNKMLSNVAE 
RVTVPRNFIRGALLEQAGQDIQNKLE 


9538 


23439 


A 


9607 


404 


272 


PPKFLLESGP I RWDWWSLGALMYDMLT 
GAVGAQLKAAC I IGLC 


9539 


23440 


A 


9608 


409 


205 


HMNGSLGSGDIDGLPKNSLNNISGISNP 
PGTPRDDGELGGNFLHS FQNDNYS PS S L 
PCSWSLFYSLPF 


9540 


23441 


A 


9609 


449 


263 


VWICPCDP PAS AS QS AGI TGLSHHLPKC 
WDYRLE P PRPAHC FCFYS FTMRNRNLL S 
LVKYSR 


9541 


23442 


A 


96.10 


366 


75 


TPGLKRSSCLSLMSRWGYRIEKKRTIIR 
S WEAI KEQDGREVDWEYFYGLLFTSEN 
LNLVHIVCHKKTTHKLTCDSSSIYYPQT 
RLKRKQPVRKRQ 


9542 


23443 


A 


9611 


1 


801 


PGYCGSWVFTCGALRQLSGGRDLSRGAR ~ 

MGNS AXiRAHVETAQ KTG VFQ L KDRGLTE 

FPADLQKLTSNLRTIDLSNNKIESLPPL 

L I GKFTLLKSLS LNNWKLTVL PDE I CNL 

KKLETLSLNNNHLRELPSTFGQLSALKT 

LSLSGNQLGALPPQLCSLRHIiDVMDLSK 

NQIRSIPDSVGELQVIELNLNQNQISQI 

SVKISCCPRLKILRIjEENCLELSMLPQS 

I LSDS Q I CLLAVEGNLFE I KKLRELEGY 

DKYMERFTATKKKFA 


9543 


23444 


A 


9612 


179 


329 


KGLAFCPPPGRGGGPPPPLWPPPPSYKK 
IFGLKTPKSGALSPPPPPPGNF 
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SDOCID: <WO 0164835A2_I_> 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
corrcspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparngine, P=Proline, 
Q=Glutamine t R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
\V=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9544 


23445 


A 


9613 


410 


298 


PRFTPFSCLSLRSSWDYRCPPPRPANFL 
SLSSVIiHGT 


9545 


23446 


A 


9614 


1 


130 


NPGSHCVAQAGLQLLSSGNPSASASQSA 
RITGVSHCAWPNTTI 


9546 


23447 


A 


9615 


1 


146 


NTWRLHMVPRLVSNFWAPGILLSWPPKI 
LGLQ KRATTMPGPE I YTS I N 


9547 


23448 


A 


9616 


552 


26 


RLSIRNLL-THAFFAEDTGLRVELAEEDD 
CSNSSLAIRLWVEDPKKLKGKRKDNEAI 
EFSFNLETDTPEEVAYEMVKSGFFHESD 
CKAVAKS IRDRVTP I KKTREKKPAGCLK 
ERRDSQCKSMGNVFPQPQNTALPLAPAQ 
QTGAE CEATEVD QHVRQQL I QRE PQQH C 
SSVTGDL 


9548 


23449 


A 


9617 


420 


48 


FSQILRASYEFDSPFWDDISESAKDFIR 
HLLERDPQKRFTCQQALRHIiW I SGDTAF 
DRDI LGSVS EQ I R KNFARTHW KRAFNAT 
S FLRHIRKLGO. I PEGEGAS EQGMARHSH 
SGLRAGQPPKW 


9549 


23450 


A 


9618 


190 


285 


RQGLTMLPWLVSNSWPQWLLPWPLKVL 
GLQV 


9550 


23451 


A 


9619 


392 


308 


S P IKP KI PLS APRKNTNSVKYRL.KFR FG 


9551 


23452 


A 


9620 


166 


23 


FCNPIPSSLPSSLPSFISPFLPCFLTPF 
LPCFLPPFLPFLSFMTLTF 


9552 


23453 


A 


9621 


49 


374 


DRRGIRIMAAALFVPLGFSLLGTHGPSG 
AAGTVFTTA* YLGS KI LLTCS LNDSATQ 
VTGHRWLKGRWLKEDS\IjPGRKPEFNV 
GPHPPNGDKSSFVFPPPSPCPRLTF 


9553 


23454 


A 


9622 


85 


472 


SHVFPPLRLTLTLHSMAVPETRPNHTIY 
INNLNDKI KKDELKKSLYAI FSQFGHNL 
D I LVS RILNMMG QAFVI FKEVS S ATNS L 
RSMQGFPFYDKPMRIQYAKTYSYIIAKN 
KGTFLS C / DRKL I K * TPQ 


9554 


23455 


A 


9623 


170 


2 


IFFFFCFERGSCSVA*AGVQWRSYSLL* 
S*TPG\SSNTASATSVAGTTGL\HHHAE 


9555 


23456 


A 


9624 


2 


469 


RRLCS DRGLQRS LGS MS AAVTAGKLARA 
PADPG KAG VPGVAAPGAPAAAP PAKE I P 
EVLEDPRSRRRYVRGRFLGKGGFAKCFE 
IL.DADTKEEFAGKI VPKSLLLKPYHR* K 
MFMEI FIHRSLGHQHVL*FHGYF * EHDF 
MSEELEL.\ CRPRS FLF 


9556 


23457 


A 


9625 


661 


1393 


AS PPPS QTTP AAPGCS AGEVS LA \ GLCL 
QERRLVDVAGFS I FIPSRYLDHPQPSKA 
EQDASIPPGTHEALIiQTALSPPPPPTRP 
VS PPQKAKEAPNTQAQPISDDEAS / RWG 
RNPG*CS*SGSPHQWPLPHLP*GRL*A* 
PSPCCPG*VG/PVHSPQARRRPITTSIS 
QRSWKFWPKPTVFRETSGGPWAMPRPSM 
PSRASISLSPRTRSRKQPRPLTLGCCVW 
HWHTDGERATCGD VRRAHHS PR 


9557 


23458 


A 


9626 


1 


286 


PTMAKLIALTLiMGMEIiAIjFWNHQASYQT 
RLNALREVQPA*LPNCNLVKGF* TGSED 
F\EILPNGLTFISSGLENPWI *RLDP\N 
SPGKMLLMDLNE 


9558 


23459 


A 


9627 


173 


491 


EGPLPLESSSNWGQADLDKKSREL.LWKT 
VYLL*LNIQLPYNPVNPILGINPRETHT 
C\HQEMCTRI FIATLFTKAKA* K* PROP 
*AREW/IK*/IWNIHIIEHY 


9559 


23460 


A 


9628 


351 j 1 


VXiGDAIQSRRGSSRKAGSYIJLSRSP/CS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
seq tience 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

location 
corresport 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspariic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, JL=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EGGRGGCWAWNLGTMSSVASGHGAWVPR 
ELTS*AWPEPGPAGQREGRPPPTPCPPT 
CAHGPPAQP * PRGKAAYPG / CLGLQWPD 
*SAFNP 


9560 


23461 


A 


9629 


599 


11 


KMWR* *H*AKSQIRNAIPFTIAHK/RDL 
KSLKIQLTKEAKDIYNENYKIPL/NEIR 
DVTNKWKNIP\CS* IGYLYSTYKYKCIC 
LVNIYL 


9561 


23462 


A 


9630 


226 


1 


KCDNLGYSTFFFFFFPFFFFEIGSYFVA 
Q VGV * WHN YS NS/DASAFQVAGTTGVCH 
HAQLI FNLFVEVGSHYVAQA 


9562 


23463 


A 


9631 


241 


1 


VSGCPLPRG*RPPPQPQSPPPSTPGTSD 
HPCTPCR/PALRR*SP**LPCLDLPPAS 
GPPSTSGNS TGTMQP T * PKLNSSSS 


9563 


23464 


A 


9632 


201 


1027 


MTLTPAPGQGQQSSQCLWRRPL/ PLPPG 
PDSEEPGSGAAG/PGSSL+SPDLPPAAP 
PKP *TSS * EMNTVGSTQEGSGLAQRTTQ 
HCP VP PS T / RVD I LGTCLQP FPALCL.TS 
APWLVRMPPSQRSSREIP*RLEAF/SLR 
TPIPAHSAQPSHQEKEP/PSPR*DHGD* 
PP P * PPKPTHQTXiAGQAPRSGQGLARPT 
LPYPPSPLRVTPRICRVHV*GHLRRPRG 
PPPSSARRPTWDWRVGLPWEDLEWLCR 
GPAWLLARTSAPFSSLPTHACTGKKTTM 


9564 


23465 


A 


9633 


103 


379 


YHCITIIVLKCIFFFFFLKPCLILSPRL 
E ARG 1 1 LGH * NLCLT\S QbLRRL IRKSP 
LTLEGGVCSDPRSPHCSPAWPTKGNFFS 
KGKKKILD 


9565 


23466 


A 


9634 


155 


405 


VPAFRSVTNISISVSFFFFETEPGFVTQ ' 

GTTGPRH/HC*ALIFLVKTGFPHVAKTW 
F 


9566 


23467 


A 


9635 


4 


423 


G AMRGDR CRGRGGR FGS RGG P RGR FRP F 
VPHI PLDFYLCEMAFPRVKPAPDETSFS 
EAJLLNRNQDLDPNSGEQASILSliVTKMT 
YE I DNITVAPGTLDVL I EERQQGGP YI K 
GTMTT - * RH / SVADLTVI LQILTTWESVA 
S 


9567 


23468 


A 


9637 


2 


492 


GSRAIGFGLAHELERHL.IKIWLASIVR* 
DIA*LMNWSG\SHKD1AGKYRPILEKAI 
QLSGSEQLQAFKAFEESMVNENVSLVIS 
RQMLTDFCTHLjPNLPDSTVKEIYHFTIjE 
KIQPIVIS FEEQVAS IRQHFAS I YEKEE 
D WRNAAQ VLVG I PL * TGQKQYNVD YKLE 
TYLKIARLYLEDDDPVPAY 


9568 


23469 


A 


9638 






VVQGLGLVMGSPSRRLQTKPVITCFKSV 
LLIYTFIFWITGVILLADGIWGKVSLEN 
YFSLLKDKATIVPFI \LI ASGSAI I LFG 
I LLCIFS V* HYPV* LLLVTFLI * LLVFL 
NKWWISPHFLILTFNYLLFDIIIDNSS 


9569 


23470 


A 


9639 


125 


451 


IYRKPLKSIRRVGGLHLNKIFFFFLKQI 
WGWARGLMPC*HFGRLKKGDCLTLGIGD 
/SRG/RYDHDPALQPRQQSKTLSLKINK 
IGWAQWIjMPGIPAPWEG*AGRS FGAGI 


9570 


23471 


A 


9640 


670 


927 


GHVLLPRRAGCAQPGTGPARAAPAAVQR '" 
LPGGGVQPSGPGGRAGQGQGLRGT * AGG 
NPGTGF/GGPHASALLPPSEAPGSLLPS 
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beginning 

nucleotide 
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ing to first 

amino acid 

peptide 
sequence 


Preclict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=lVIethionine, 
!\"=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoph»n, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QPR 


9571 


23472 


A 


9641 


115 


460 


ISWIFLII CVGFFGGGGGARVPLCHLS P 
RLECSSAIRARCGFDFPGSSDP*ASAS* 
VAGTT/G/SCHHVQLIF 


9572 


23473 


A 


9642 


1 


431 


AMRAEVIiVYCCRGALASGCVQAFRARY\ 
WVL 1 1 VDVPDNEQANAS 1 1 VKLTDS FTE 
QADQLTAEVGKIjLGE*KVDAILCVAGGW 
AWGNAKFKSLFKNCDLMRKQS I WTWTI F 
SHLATMHLKEGGLLTLAVAMAVLDGTPG 
MIGD 


9573 


23474 


A 


9643 


154 


408 


I IMNAQS VEEDS I L I I PTPDEE EKI LR V 
KL * EDSDGKYGS KI P WNHLPEPENFRLL 
FKHVGYLD* SGLY\E I LTKF* I LCVLCY 

Y 


9574 


23475 


A 


9644 


3 


403 


YSQFFGGPKLEDPIRPGVFSQPGEHRGH 
PFQ IF* KVGLGNGARPGFPVLGKVKCKD 
NLKL.GG QGCS RVEMHPRTS AWAI ERDPV 
LKKKKRQR\EHQCSNEQKHTN 


9575 


23476 


A 


9646 


3 


399 


ARAM VLS PAD KTWGKAAWGKVGAHAGEY 
GAEALERMFLCFLTTKTYFPHFDLSHGS 
AQVKGHGKKGAD S LTNAVAHVDDMPS AL 
VALSDLHAHKLEVGPVNCKLLRH*LLVT 
LGG\HFPAWLTPGGIAFLEKF 


9576 


23477 


A 


9647 


22 


407 


APS AWGMCHFTEEDKAT ITS LWGKV \NV 
EDAGGETLGRLLWYPWTQRFSDSFGNIi 
SCASAIMGIPKVKAHGKKALTSLGDAI * 
HLDDL*GTFAQLSELHCDKLHVDPENFK 
LLGNVLVTVLAIHFGKE 


9577 


23478 


A 


9648 


243 


2 


QIATTTLGPGSEGVGKRDLGSSSRHHSQ 
VES VKH\ PRDLAQDA* GHGSAPRPLCP P 
PAQSWHCLRGPCIiAPHHPTLCEPSR 


9578 


23479 


A 


9649 


283 


196 


SYFI*IITTTHSLI\ISIIPL,LFFNQIN 
NNLFSCSPTFSSDPLTTPLLILTT*LI,P 
LTIMASQRHLSSEPLSRKKLYLSILISL 
QISLI ITFTATELI IFYIFFETTLIPTL 
AIITR*GKPTPLIQ 


9579 


23480 


A 


9650 


10 


354 


QL,RVITRLTWNN*TSA*SRGVSYP*AH 
\SSDALHL*RELTEQRFIVSVNCAIAHC 
RHQSASEDHVKLRNEVTEFAQTWDADES 
AENCDKSLHTLFGDKICTVATLRETYGE 
MAD 


9580 


23481 


A 


9651 


2 


404 


AFGTTKWVTyiSVLFLCSSAYSRGVLRL 
DAHKSEAAHRFKDLGEENFKAIALIAFA 
QYLQQCPF\EDHVDLVNEVTEFAKTCVA 
DESAENWDKI \LHTLFGY* LCAVACFRE 
T \ YGE TAAC* AKHE PVEDE \ CFLR 


9581 


23482 


A 


9652 


83 


396 


NIPGVGGGGVPIPLVWKYNISHTQR*RE 
PYIQS LRSGNWS ERMSHMVAR * RQRHHP 
YYI \LLQAQLSMA\SDPANVMMDSVITA 
GVLVA/ SRRARRISKiiGHRI 


9582 


23483 


A 


9653 


67 


954 


REGNHNTERNCKRP PQDTG PTQ/RPGPP 
PPEVPWQDASSAPTPAAAP VGKLLPS * G 
PPGSAS CQGR *TLGS VHWL * GS PLPSPS 
AGGRALPYGPAQGQSQGHWENTGRGTPL 
PGSRET*GHELTTGVEHQRHKAPREPQA 
GQPRLCPWVQSLPPT\ARSPSWSQGERP 


9583 


23484 


A 


9654 




252 


RPPQLLASDSLPPSRPPSFPPSIiLLSLR 
PPLSRSLCIiSHYLCPPL,SLIQQLTSFPD 
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nucleotide 

correspond 
ing to first 
amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 

acid 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Asp:irtic Acid, 
E=Clutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vatine 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














P\PPSLLSSLPPSLSSAPPF/YVHSVSP 
TISAPLYP * YNS*PHFPIP FPPRKVQHL 
ARPSPKTARPPWTIRRI 


9584 


23485 


A 


9655 


1 


394 


SLRIYCFVIEVICFLYLL/FIMYSIS*G 
I*LLMNFTILCIGY*LLQYFFFVWKSF 
YFNFIRIPFILFLYLICLLVF*LIFFDL 
SYF/MCLRLI IFNLFY^FLLYSICLWN 
MVLLLISKLF* F*FCGRFALIQ 


9585 


23486 


A 








FPETGSHCAVQGGVQWCGRGSLQS*TPG 
/SQS * VAG I TGMCHHAS * LEKFFF VEMG 


9586 


23487 


A 


9657 






KTSMMNSIVPHISIPTLNVNGLNVPLKR 
YR IAE * I KIHQPS I CCLQETHPTHKDSH 
KLKVKGWE* IFHANG\QGVAILISDK/T 
DFKATTVK/RDKEGHYI I I KGLVQQENV 

NANT 


9587 


23488 


A 


9658 


179 


373 


VHSWLLWGSQSFVLFRPLTD*MRLTH/ I 
MKGNLLESKSTSLSVNLIQKHPHRNIQN 
V/AQNIWVSWNS 


9588 


23489 


A 


9659 


103 


373 


LKHVLRLCLQSERKLHYLFNIDQWGKFY 
TWDDLYECISSKFTTHFFFSFL/FFSF* 

HAPTSAS 


9589 


23490 


A 








GFLDMT*T/AQVANEKIG* WDFIRNLKL 
TCIGHYHEVKKTNPW 


9590 


23491 


A 


9661 


276 


2 


PRIGKLPTNMIEPLIIKEMLINLAMRFH 
LS PVR MAI / 1 KI KKSNRCW * GCREKG I \ 
IYHYWWECKLVQPIj/WKSSLK/FIK*IjP 
IDLSFVPAIPLL 


9591 


23492 


A 


9662 


239 


356 


I C I KRRK/ WSGT VAHTCNPS S LGGQGGR 
IA*VQEFQPGQQ 


9592 


23493 


A 


9663 






RHELMTPHAFGAMKRVTFISLLFLFNLiA 
YSRGV I SRDAHRSEDAHP I KDLV* ETFK 
ALVMIAFAQHLQQRPFEDHV*LLNE\VT 
EFAKTCVAEESAENCDKSLHTLFGDK* C 
TV 


9593 




A 








KKJ. PFGRPRRVDHLiRSGVRDQPGLHDET 
PS\LKIQKLAGQGGICL*SQLLRRLR*E 
NHLNPGSRGCSEPRSCHCTPAWATE*DS 
ISKKKNTKIS 


9594 


23495 


A 


9665 


1 


355 


ITDL YSMFHFG * VDLATI TS LWGKGNEQ 

SASAIMGNPIVKAHGKKVLTSLGDAIMH 
LDD / LKGTFAQLSELHCDTLHVDPENFH 
ENPCDGS 


9595 


23496 


A 


9666 


364 


3 


APN I P PSVTRI PALIAFRSQCLCGSCPL 
PLATEPVGLLLSMSFQAAPCFTWWVTEA 
WLTGISWPPDFLISL.LCI/HINYNAVIF 
RKYRPRVGAVAHICNPSTLGG*GGWIA* 
AQEFETSLGN 


9596 


23497 


A 


9667 


1 


357 


PISNAMRHLGQEETATITSLWGKVNVED 
AG * ETLGRLL VI YP WTHRLFDS FGNLS S 
ASAIMGHP+V\KAHVKKALTSLGDAIKH 
LDDL KGTFAQL I ELHCDKLF VDPENFKL 
LGNVLAT 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

1 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alaninc 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine. 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


9597 


23498 


A 


9668 


24 


369 


APRPDAMGHFTEEDKATITSLWGKGNVE 
DAGGETLARLLGVYPWTQRLFDS FGNLS 
SASAIMGNPKVKAHGKKVLTSLGDAI *H 
LDDLKGTFAQ\LS elh* dklhvdpenfk 
LLGK 


9598 


23499 


A 


9669 


12 


365 


LLiLMDGERK* FLETDSAPCEDAMNTVEM 
TTKNIiE*SINIjVDKAVA/ sgferigfnf 
EKS STLGKMPSNS I ACCRE I FHERKS /S 
MWQTSFLSYFKKLRQA/PPAATTLNDHQ 
PSALRQDPSP 


9599 


23500 


A 


9670 


346 


2 


YPYDPWSLINNMFFLFKRFQNFFLKEGT 
LTFPFKGFRGDLSLFFWPPYKLFNLKSF 
L/ CKFLEI*RYFFG*IIFFFPFFFFFFli 
RQS LTLSPRLECNGT I SAHCNLCLPD * S 
NSP 


9600 


23501 


A 


9671 


121 


364 


HKKKTAGRVQWPLLVIPALWEAKEADCL 
SSGVQGQPGQHSKTPSLP* PSKSWDYRC 
TPPHLASFCNFFSFFFGRDGVLLCCPGW 
P*TPELKQSA/CLGLPKCWDY/RARATA 
PGLLFF\ FMLKYVLVKPAVNI ANAN* KE 
LSEPDVKNEICFV 


9601 


23502 


A 


9672 


437 ■ 


223 


EME/FSLLLPGLECNGTCLAH*NLRLLG 
SSDFPASASPVAGITGVRLHAQL.ELYFL 
NLLiGFFFI ISLWYLS 


9602 


23503 


A 


9673 


100 


338 


KSRPSAVANLTPVIPALWEAEAGGSPEL 
KQFSLSLLSSRDHRFATPC/RG*FFIFF 
VEMEFHCVAQCGFELL.DSSSPPTSA 


9603 


23504 


A 


9674 


161 


1 


S FLWKLCLRGAPS CMRCQLA\ LL* DVSQ 
LGY/ SGVRDPLEEAVCPFSDLKLRAGR 


9604 


23505 


A 


9675 


304 


1 


VPPLASQLGDKRRI FFQKKKKKKKNKDK 
IHIIISIILKKFDKI*YSLIIK\TL*KL 
GME*TYLNI IKVI YDRPTAS I ILSGEKL 
KSFPLKSGR*QECPLL 


9605 


23506 


A 


9676 


111 


346 


SKEGSKEGSLEIWGFL.VLAFPFPFPFPF 
SFFFF\ *RQAGRPRLECRSMITAHCNLE 
LLGSSDP PAS AS \Q VAGTTG PTS P 


9606 


23507 


A 


9677 


3 


349 


ARAGRI IKDLWLGFFFFFFFFYPKKTPL 
FTRDKGGGKLQTAKKNNPHPVKREKKKP 
PPGGTRGGKNPPGNPFWGFGPQRFFFGG 
KTFGGGAHPPDPTLG/PSPQS*KRPFGK 
RGPPP 


9607 


23508 


A 


9678 


38 


368 


QASLLKESESERKSPTSLTLSQKLGMIR 
LCEEGMSKAKAD*RLGLLH/QVSQLVNT 
KEKLFKEIKSAISVNTLMIRK/RNSLTA 
DMDKVWAWIEDLSSQNNPLSQSLIHSK 
A 


9608 


23509 


A 


9679 


187 


345 


PFILFFETESYSVA*AEMQWCDLiGSbQT 
PPPG\SSNPPVSALWEARAGGSRSQ 


9609 


23510 


A 


9680 


192 


468 


ERTSLFT IHLLKLLTITNSAAMSNHV* V 
• FV*T*IFLSJJGQMPKSVTAGSNGQSIjFR 

ffk\ncqtlfhggcamlh/ shtqlnaip 
lfrnlwqhls 


9610 


23511 


A 


9681 


141 


367 


GSWAAS PCSCTMDDFAKES FTWDYVLL 
ENCPNMGDYWAPQFMTDNYVRVTQLNW 
DGVGTQ* KDYI / SSERNL 


9611 


23512 


A 


9682 


193 


3 


IFGERR* ILKI YKLTGHVGLRL*SQILR 
RI /RliNLGGGGCSEPRSCHCTPWATER 
DSISEKKEK 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NIjVHFEEHFPSIjFTPSPSLCHIF/CSRD 

RVSPY*PGWSRSPDL/RYLP/CLDLPKC ' 

WDYRRESPCPASSRFIjMNFIjYF*-NTTLW 

Q*KTI PLLF*SGDC/ S*FF* YNFLAHTS 

STMVNSGISRNSRPVFKLADKAPSHSPL 

SILAFGVEFF/NLGSLQPQPLKLK*ASH 

LNCLSSWDNRHAPPCPDNFFFNIL,*RWG 

FPILSRMVLNSW/PLCSLSTSASQSSG/ 

ITGVSHCAGPILVF 


9613 


23514 


A 


9684 


69 


358 


ILIVKVFFPHTQKPNFVLRLSLQSIQQM 
L.LPERKWTHAQKTCVSSENTERCHEKQV 
ITSHWLGMVAHS CNPSTLGGGGGWIT * G 
QVFE T /S LANTVKP 


9614 


23515 


A 


9685 


66 


344 


IGEKLLIiTDKHRNWVLETE*APGKDSVN 
IDEMT/TR/DLEYYIN\TDKAVTGFERT 
DFNFKSSIRGQSL* YSII/CAKGSQQKQ 
QISLVPYFKKLPQ 














PKIQNGHCKTLPKNPSQSDTKTFPLFTS 
GSSPQPHKLSFCFSAGPTLPSKTQLKIH 
L\WQGTVPHAYNPSTLGSRGGRIT*GQE 
FKTIQ 














DRYYYKSHFKPGAEAHAC*LVLAS FFMC 
YRAIMINK*HLYHRKLi/VYQVIRH*KYR 
D / 1 * T HVTLTTTL * DRYYY KSHFKPGAE 

MA 


9617 


23518 


A 


9688 


318 


46 


KL YALNDMASNTCKE YDNLI QYYRYKI Y 
KRLiIQHD*VEFIPEME/NWFNI */rksi 
NVIHYIHSLWEEKNYMI ISLDVADVSDK 
IKLTFLIF 


9618 


23519 


A 


9689 


168 


3 


GNSNTM IFFTI L YL.FFLYFF / LFF I FDM 
KSHSVARLECSGVILAHCNV* LPGSSNS 


9619 


23520 


A 


9690 


192 


3 


WHKASLSNPQPAGCMQPRMALHEAQHTF 
VNFL KTLiWAG \ P VAHAYNPNTLRGRGGR 
IT*GQEFK 


9620 


23521 


A 


9691 


388 


1 


CRSAGVCWRSTPDPVCLGITSSGCRTAE 
I VACS FLWKLHSRGAPARCQPELSCMRC 

AELENGARRSTALFRASRQDSLSLLKIiR 
PQLPLLPGALSQIEGI 


9621 


23522 


A 


9692 


226 


3 


WFLLPQVKLAIMTPRLYFPNLPFFFFF* 
SSPTSASRLTGTI ILQHHAE 


9622 


23523 


A 


9693 


305 


1 


NHAITTVN* FGLIRHLVTKAAFNSGEVD 
IV/SIKDPLIALNYTVYMFHHDSTHGKF 


9623 


23524 


A 


9694 


320 


1 


PRDP PAS ASQSAG I TG I SHQAWPDM I LY " 
I ENPKDS S KNPLGLINKYSKVAGYKINT 
QKSAAFL*TNNYLKN*P /MRTIPFTIAT 
KKKYLETYLTMEVKDLYTENYKM 


9624 


23525 


A 


9695 


263 


1 


VRILLFSESLALSPRLECSGVISAHCTL 
CLPGSRGALSIiLSRVDY/Y*VFIiVKAGF 
RHV/GQAGLEFLSSGDPTVPSLPDFWDY 
RCKPR 


9625 


23526 


A 


9696 


262 


359 


IRKIHKIWSGWAHTCNPSTLGGR*TKA 
HEGSLYCLRVGKI VSNKVGTRS * FFRTQ 
K*V/HLFFS* VYHINGSH*SRTSL* IRK 
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Predicted 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Aspnrngine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IHKIWSGWAHTCNPSTLGGRVGWIT*G 
Q 


9626 


23527 


A 


9697 


166 


473 


TQDTSQPWKNIWISFGSLKVFENVDIHK 
LYLELDNRHRSWMTDILLFFYF*DGSFT 
VT\RLECNGAI LAHCNLHLPGS SNSPAS 
AS * VAEVGGTLEPRRLRLQ 


9627 


23528 


A 


9698 


456 


59 


LFLWKLHLRGAPGYVRCRLA\LLGGASQ 
LGYSGVRDPLEEAVCPFSDLKLHAGRTT 
TLFKAVRYGHLCLQKFLLPFVRLCPAPT 
GGVFTVRQASMSCDGLLIV*\VSERH*L 
PKPQQFWELVPKFVRPGLKSL 


9628 


23529 


A 


9699 


4 


375 


VFCSSVTIVSEIAFLMWLSAQLLLVYRN 
ASDFCMSSLYPETSLTKVTQGQAL/H*N 
G*LT**EQYWVKTDQKAVI/CQQSTGSS 
GQSSQAREIKGPGAVAHTCNPSTLGGRG 
RWIT*GQEFETSIiA 


9629 


23530 


A 


9700 


239 


361 


RNDTFFFFETESNSVA\RLQCTGVTLAH 
*NLCLPGSSNSPA 


9630 


23531 


A 


9701 


169 


1 


I F YWKAI FFETESHS VAYAGVQ * SNLGS 
LQLPSS\GSSDSPISASQEAGTTDAHH 


963! 


23532 


A 


9702 


37 


348 


HAKNFDVSLLPYTKTHEHFVKEDIWIAN 
KHVKRCSATLVTKEIPPQS/TLKWLKH* 
PPC*QGNGRTGSPIP/C/WWECKIVQLW 
KIV*PFPKKLSIYL/PY/DPPILLLGI 


9632 


23533 


A 


9703 


16 


339 


ARLNT F AMAAKC * S E RKS / RN / S LTLNH 
KLQM I KL I EEGM LKAVNGQ KLRL L YQ / T 
SQFANAKDELLMD I ISATPATV* MIG * Q 
NSLIADL>EKVRW*LDDQ/TSHNIHLSH 


9633 


23534 


A 


9704 


276 


2 


GISRGGLSKPPFFP+DFPLDYLGQSPFC 
KVKRNS/RFRTTKSPLDFPIPPLFFFFF 
EKVSQAI SAHCKLHLPG* CHS PAS ASRV 
AGTTGTRH 


9634 


23535 


A 


9705 


352 


2 


cnsrsdffqlmmncflrwavsprvecs/ 
gvitahcnlnlpdsvdpptsts *vagtt 
ghtpNhtwliiiiiivagiflflfflfs 
ffgrdgvspccpswsqtsefrqsaclsl 

PKCWDY 


9635 


23536 


A 


9706 


6 


351 


I WNS RPRRPLGGQGGR I QDW P KLHGKTR 
SL/LKKLQKLARRGRASLWSQVLKRLRL 
TWVDCLSLGG*GCSQP*SH/IHCSPAW 


9636 


23537 


A 


9707 


76 


359 


CFLFVCLDFVIFLAYFCN*YLFPFLISH 
TFFFFLKKGLIFAPGVNLRDQNKFYFTL 
FDVFFNI YYC*IFFIF/ CLVLLFFIFFF 
FFYCFVMRFIVL 


9637 


23538 


A 


9708 


342 


3 


GEELLLMDEQSKWFLE\TPGENATNIVD 
LTTKDLEYSISWDKAAAGFERIGSN*N 
STLELKTVN/SVPHAR* IFCKRIiQSVWQ 
TS LLP I LRN\ LP * S PHPLVSAML 1 1 QQS 
STLK 


9638 


23539 


A 


9709 


211 


350 


RQDIALLPRLE*SGTITTH/CDPQLSGS 
SEPPTSAS LVAGTTG ACHYV 


9639 


23540 


A 


9710 


253 


3 


VKCFSREHMLIRRSAIGGKKHIEKGTLI 
MSFWWYVSQSEH/NLCNHYAY\SHIKK 
VWLGTVADVYNP S TLGGQGGRI T * GQEF 
E 


9640 


23541 


A 


9711 


80 


324 


LITLTMASMCSNERKSHTSLTSNQKLEM 
SKPTEEGMLKAKTS * KLGLFHQT/ SQLV 
KTKKKFLEE I KSTTPLNAPLMRK* NS L 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 

last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alsinine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

T=Threonine,V=ValTne', ne ' 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 


9641 


23542 


A 


9712 


246 


3 


qlakhgllfflffemgsh+lrlecsl\t~ 

ALLSKISAHCNLNIjPGP\SDPPTSASQV 
AGTTGACIjLAQIjIFKNFWETGSHYVA 


9642 


23543 


A 


9713 


2 


359 


AAASTTIKEQDMLKFYFLFYFFETESHP 
VTQ VG VQ WCNL CS IjQP P P P \ G P S DS P A W 
VTEQDSVSKE*KRKKLIKINVYLQVTTH 
P*HSPQHIVRVLSVLN*LNEKINRRVTK 
SSSKEKDR 


9643 


23544 


A 


9714 


278 


53 


KLFRGLNNHRKFQTRCGGLFFFLFIYF* 
DGA/YALSLLECSGTIMAHHS LDILGSS 
HPPISASLAA\GWRTCSPSC 


9644 


23545 


A 


9715 


166 


402 


TLSLKKNENISPAWWHVPWSGVWKAEA 
R*SLVPRSLRL\HCTPAWA 


9645 












VDCI IMRRSIAKSPRRKCRGTISARCKL 
RLSG\SGHSPASAS *EAGVTGVGHDPR 


9646 


23547 


A 


9717 


1 


553 


TS I RLFFLLS I FFFSRARNR I LFKIYCN 
GVRAKGRGGGWGTCRGHRNTMTWPGPQL 
LLSWGRGMKRQDQGWSWGGRGEGDTGCI 
PPPPGSTSRPWTPRSPWPLRLHLFSASG 
PPGSSSCSP*LFPLTDSQVGRCSGPDPN 
PPYHLMKVQPLPALLLPLLKPHRFPSLG 
Q\PDPPPSSPPSP 


9647 










364 — ■ 


PPNR*RNPLIELTNHSLI YLPTPS\G JS 
A* *NFGSLLGACLILQITTGLFIiAMHYS 
PIRLQLAFSS IAHITRDVNYG* I IRYLH 
ANGASIFFICLFIjHIGRGLYYGSFIjYSQ 
T*NIGI ILLLAT 


9648 


23549 


A 


9719 


82 


351 


GEALEPELPMPALSH/'HAATLIAPLPTT 
PFVGTKVTIVGQAQW/HQAL'SQHFERP 
RRVDYLHLFSRNGQNPISTKNTRISWAW 
WRVPVIPA 


9649 


23550 


A 


9720 


179 


365 


FSLFIiASLKAQIIDSKPFFSNTLENI *K 
YEVINFFLYI \ I KLHFMNLTSLCFYCHS 
VCTILL 


9650 












pippgnesspptaqedmqaankh/ irry 
stslairet/ql/ktttrepyapitmat 

TETSANTTCWREYGESGSYCWVGCKTLR 
PLCKTVWQFLKKPSM*LAYDPAITLF 


9651 


23552 


A 


9722 


1 


344 


PLPTQNWPQGQSFLHYPVLLPAVSVLWS 
S FFFFFWKQILLbP PRLEG/NGQNSG * W 
KFPIiPGPSDFSCPSFQTSGNYGPPQQAR 
AIFWKFKIKTGFHGVTREGLNFLTSGSA 
PLCS 


9652 












VTHA F 1 LFYFE TESHS V * WHDLG S LKS 
PPPG\SSNS PASAS Q VAGTTGALH 


9653 


23554 


A 


9724 


76 


337 


YLS I YLS I YLA I YVS I I YL P I YH P S I S I 
YVSSIYSINHLPLYYFSTHLSSTHSSI Y 
LLS ISILS INYLLLS I SLS I YL.S I * LS I 
YDSSIHLSIHLSH1jYIYIjI*YLSIYLS I 
YLA I YVS I \ T I YLS I I HLS LSMYHLFIL 
S IIYLCI I FLPI Y/LSTHSS IYLLS IS I 
L.S INY / HTS I YLS I YL FVY/ HL Y I YHP T 


9654 


23555 


A 


9725 


249 


436 


ELiAG* IQDHP/GQHHKTLSLQQIKNLAR 
RGGRHL*SELFRLRQEDHLTPGVRGCSE 
L*SHHSTP 


9655 


23556 


A 


9726 


177 


3 


HLN** FSNLIFFETESPSVT\RGI ISAH 
RNPRLPGSSDSPTSASRVAGTTDT 


9656 


23557 


A 


9727 


272 . . 


233 


RKNQRIYQIARKRLNEMARISPLRSMI I 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LNVSGLNFPLKRCRLAEWTEKKK\DPII 
CCL/ QKTHFTG/ KDIYRLKI KGWKKIFH 
ANGSQ*Q 


9657 


23558 


A 


9728 


368 


380 


RKGQRISEIAIKRLNKMARISPLISIII 
LNVSGLNFPLKRYRRAEWTEKKK\DPI I 
CCL/QKTHFTG/ KDIYRLKI KGWKKIFH 
TNGSQ*RR 


965S 


23559 


A 


9729 


294 


2 


KGNLSPKKKKNLKNFKVTPQNFGLKDQK 
TQGKPTL.KVSQNQKKPKFK*NLKGKRPL 
KLFPKKKKLNPSRSWFYEKKKK\lNKLD 
GTLVRQINKKRKNQ 


9659 


23560 


A 


9730 


7 


284 


SQGLGRLRRENYL* FETESHSVARTGVQ 
WRNLSSLQSPPPG\SSNSPASASQVPVI 
T 


9660 


23561 


A 


9731 


1 


322 


RSLKKKKRREKKGKNQPGQHGKTLFLPK 
TPKI * KI KKI YPG/RGGTHLNSQLLRNIi 
RKENHLNSGGKGCNDPKLGHCIPAQMTK 
TP FKKKGGAKQNGKKLLQL I YG I K 


9661 


23562 


A 


9732 


168 


2 


GFYRKSLSCLSVWI CFKDFFFFFEMVSR 
SVA*AGVQWHDLGSMRPPPPG\SGDCPA 


9662 


23563 


A 


9733 


43 


317 


LKLQPRDTHCIWLSTTSPFIYFFFETES 
CSVAQA*MQWHDLGSLQPPSPG\SGDSP 
AS ASCTGMHHHTW LMKC INS FDVPDM P L 
GLGLNSHI 


9663 


23564 


A 


9734 


8 


296 


NWS ACSLNGLVKGRLS C LTLNQ I L * M I 
TLTTERFLKAERGQKLGLLHQTA/QWN 
/AKE I FL.NEVSSATP VYEG 1 1 RKLJNS L I 
ATMEKACWWIEDQT 


9664 


23565 


A 


9735 


3 


300 


TRPSNSPAALSKCSSRRKT/R/TCLTLN 
QKLEM I KIiSKEG I LKAERGQKLGLLHQT 
A/QWN/AKEKFLKEVRSATPVYEGI IR 
KQNSL I ANME KA * WW I EDQ 


9665 


23566 


A 


9736 


352 


1 


PFFYLLNFPTFFKGPPLIHFFLESFLEN 
FGFQGAFFKLQNFFAFLFSLLNGPFLEN 
PFYFKF?INC\KFLGQNTFYCPYLGFF* 
RSFFFFFFFFETESRPVA\RLECSGAIS 
AHCKLG 


9666 


23567 


A 


9737 


130 


3 


KIFFFFFFFFLRQSLiSVAQAGVQWC/N* 
LGS LQP P PAGLKR FL.V 


9667 


23568 


A 


9738 


167 


3 


ISRSVFCFFFETESCSVTQV*EQWRNIG 
SRRPQPP\GSSNYCASACRVARFTGLV 


9668 


23569 


A 


9739 


140 


419 


INVKYLSRHFFKGNIQSPNKHRKAIREI 
QIS\+TDNNKCW*GCGEVQYFIHCCMCE 
MVHSLWKIVW*LLTGL/HIELSDDPAIL 
LGGIPPEEMKT 


9669 


23570 


A 


9740 


93 


371 


KLCNGYYQESKREPTERN* I FANHKTKG 
CISKTYKELQLNNKNS/NPTQKWAKDLD 
ISPQTMKQTAKNQVKRCSTSLVIREMQI 
YTKSIPLHIH 


9670 


23571 


A 


9741 


1 


288 


GTRDHLR S GVRDQ \ PGQHGETR SLLKIQ 
N*GCGHGGSCLSSQ\L,TWRI.RQVNCL:S 


9671 


23572 


A 


9742 


296 


563 


FIKHLNSYMLGQNLATKLSEICKLPFYI 
EK* IQAGYGGTCL* SQLLRRLRHDNCLN 
LGGGGCS E PKI \ HC CTPAWATDGGS VS K 
KKKSRR 


9672 


23573 


A 


9743 


32 


400 


DALVPHWS S YS P I CbDLNVLGI YLFSDR 
VSLLVPRLECSDAI MAQHS LDLPRLR * S 
S HLS LPRNWLM FVI LVETG FCHVA* DGR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheriylalanine, 
G=Glycine, H==H!stidine, I=Isoleucine 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P= Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X = Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














KLLGSSNPPALA/FLKC/WRNMRKSQL* 
PLLLGLLSADHSD 














REGGLFISETLLFL.CPAPSCS/HFLTPA 
Q*SYF*KNVSWLGAVGDTCNPSTLGGRG 
GWI \ RGQE I KT I LANMV 














PMVNKHVKRCSASLVIRAEQIKTIMRSH 
CTPNS * M * KTDNTKS W/ *GCGATGTP IH 
C/WMKYKMVQPLWKKAQQFLi\KHIHL>PY 
DPINLLLGIS*EK*NHLC/YKKTYVRIF 
IAALF 


9675 


23576 


A 


9746 


257 


1 


VHLRLS FPSFLLP I L.SPSSQGDLVCCFL 
MVF*RDGVSLPIPAGVQWLDRSSL+P/P 
SPGSSS PPASAS * VAGTTGMHLHAEPYA 
SC 


9676 


23577 


A 


9747 


148 


3 


IKLCEQFHK*TIHVliNHK/I./WPGMVAH 
AYNPSTLGGRGEWIT*GQJEFDSC 


9677 


23578 


A 


9748 


769 


1 


YPQLFKIAKKKKKSNVPIKS IMVSQYGH 
*" ENKKWVXiNHKGATljQ/EGKGNFGDTFK 
VTLKDKIAVAVKTRQERLPQELKLKFLL 
EAKI LKQ YNHPSNVKSIGVHTQRRPI YV 
IKELVPGGDFLS FQRKKNELKL \ VKFS L 
DADSGWCISSKKCIHRDLAVRNCL/VGE 

PIKWTAPEALNYGRYHSESDARSFGILL 
WETFGLG\VCP*PGMTNQQPQEQVERRY 
WMSVSWQC 


9678 


23579 


A 


9749 


607 


986 


SFSSSSSPEFISYGCDUjPMRDPHVFIT 
HPAFNLPLFLALGMFFSLFSPSPIjFIYF 
L,PPPPRPPC*HSPP\PPPPLTNSI j FVFL 
PPPPGVPSSPPNTSPPHLPVPRPPPSPP 
TSTHLSSPPPPSPTH 


9679 


23580 


A 


9750 


313 


339 


AR+GLSCSG*S*TPGFMQSSCLNLPKCW 
DYRC/RATVPSLCFVF 


9680 


23581 


A 


9751 


213 


453 


NLAWVDEINLFFFPLFLIVTLASKFCS 
/HLLVGR* FIQSCHYFLFFIFDTESRSV 
T \ RAEMS ATVLAHYNL.CLPGS PDSSA 


9681 


23582 










PLSSPRLPFPLFDPSYIFLISISSIRFL 
S ppcY FLCVFLS I S WGFSLVSQDGSDLL 
S S * S AFLGLPKCWDYRL, * P PHPAWRELS 
FFFS/SFFYLR*GSLSPSLECSGMIIGH 
CSLDILVSSARA 


9682 












ARAG FSGGDWNVGMQ S TASGLS L FHC F I 
S*GELFIILYILLKCYITL*IN*LMFGS 
FYFRDLQHFRLP/TSIVGKKRPGTVAHA 
YNPNTLRGRGGWITQGQEFETTLANMVK 


9683 


23584 


A 


9754 


467 


120 


SQLFGRPRQTDHLKSGV*DQPG\QHCET 
PSPLKIQKLAGRGWV/CAPIGPRI.L,GRP 
R* ENGIiNSGGEVCSEPRSRHCTPAWVRE 
RDCLKNQNKTKSGVLFHSPHQCFLISNT 
SRNLL 


9684 


23585 


A 


9755 


137 


368 


DGVYLWTHRPYCGLGSLNFGSVI IVLP* 
VKAYGWMVLTSLGDAIQPtADPECSF\G 
QL.RELRCDMLHVDPEDFRLLGK 


9685 


23586 


A 


9756 


172 


267 


sksgnkpndyhlmngsvrwcirtv*qi:l 
krltte lp ygpava flgvy arem kayvh 

MKTYSLQKMCT*IFIAALFGIVKKWK*A 
K*LSPDEWISK/M/WCIRTVEYYLTIRE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L-Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


9686 


23587 


A 


9757 


56 


318 


QFWGFSFFFFFFCFIDTETCS VSQAGVQ 
WHNQSWLTA\P*SDPPAS\AS*VAETIG 
MCHHDQLIFFFFKQALCPRGWSAMAQSQ 
LTSSS 


9687 


23588 


A 


9758 


179 


1 


KDTDRVSRGNTQDTLKPPPPG\SSDSPT 
SAS * VARI I GVHHNTWL I FC I L.VEMRFH 
PRA 


9688 


23589 


A 


9759 


219 


208 


NFCNFQMPLYQKILFWVRVSLLSPRLEC 
SGTIITYCSL/RTPKLK*SS*LSLGVQG 
CSKL 


9689 


23590 


A 


9760 


133 


371 


AMAyQLYRNTTLGNSLQESLDELIQXLR 
NRVNFRGSLNTYRFCDNVRTFVLNDVEF 
REVTELI*VDKVKIVSL\DGKQTGF 


9690 


23591 


A 


9761 


813 


3 


CGEGWAAGQDQGSRMGSCIjPVPRAGRQL. 
GSLFPAFQVYCSSCGTQRSSQSQWHKTE 
RSLPGNNPAPQDS/ASAPKYQTPGPVLG 
VQSPLNQQSCHTALSPEASSSAFFHTTP 
ATLLHQARTLSLTQGSRPTTWPVLLKMP 
A*RPRESRH\GR*EWTKRSQARSQEAIN 
ARPRKALGLSDPAPSLHPGSLSDSPKFS 
LGAPKAPPLPASSPTAQETQQSHISQQP 
TATGFLiGARLCPSPQHICQRGGFPGVTP 
LLLLGEWGVGEHS FPDVLV 


9691 


23592 


A 


9762 


96 


377 


RYHTNMAAQI PESDQI KQTG FHH VGQAG 
LELLTS \ FKE FLGTYNKLTET * FLDCAK 
DFTTRERKS EETTLSEHCLP KYSTMT \ H 
RISLT\FTDNHI 


9692 


23593 


A 


9763 


164 


399 


TDEELLLRDEQ / RKWFLKI ES TAGADAV 
NI VEMTTGDLE * CINLVDKVAARLERTE 
TNFE/RCSTVRQ/ML.SKSIACCTE 


9693 


23594 


A 


9764 


225 


25 


KCKITMNST* IE/TVFKDYYEHLYVCKL 
KNLEKTDKVLT I HNFLRLNQ * E PEVLTR 
AKMYKI YNDIE 


9694 


23595 


A 


9765 


128 


218 


FFKD1.PD CF P KWS HCFTLLLAVYEG FTF 
ST*IiVIREMQIKTP\I*YHFSPNRMVIM 
KKSN I NKCWQGRGESKAL I HC * * ECKTV 
* PLWKTVWQ I LKKLI IG 


9695 


23596 


A 


9766 


194 


551 


KNFFFLEMEF/SVLLPRLECNGVISAHR 
RLRLPLS S YS PASS SQVAGDYRACTTTA 
G*ILYF**RQGFHHVGQAGLELPTSGDP 
PASASQSAGITGVSHCPQLKKS ILHETP 
KGLTGVTS 


9696 


23597 


A 


9767 


238 


3 


SFLWKLCPRGAPACLRCLSA\LLGGVS * 
SG YMGVRD PLE E AVCPFS ELE HHAERTT 
ALFRAVPQGCLSLQKLSAAFCSC 


9697 


23598 


A 


9768 


343 


1 


RGAHHSRGDCVRFTGFCAPIPVXiNHPKP 
LFPFFV*KSPIERSSLiLGPPLKRSKNRF 
PFLR/VPRFFKNHPEFFFKKIFL.FPPFF 
FFFEMESHSLA\RLESHGAILAHCNLSL 
PGSC 


9698 


23599 


A 


9769 


191 


2 


FRINH*TSH/LRGQKKKKKTNSKAIGRK 
QITKIRA*LNETQPPESIQRINET*S*F 
SGKIKLI 


9699 


23600 


A 


9770 


165 


2 


GKPPNCFYILNFKKKKGQDVFNRT/WLG 
WAHACNASTLGSQGGWIT*AQECKTS 


9700 


23601 


A 


9771 


97 


912 


VILSTGCSSGPLPGWQSVLHHSLTRCPF 
SFFLSSPTPEME/FSLLLTRLECNGEIS 
AHCNLLLLGSSNSPASASGVAGNTGI /C 
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SEQ ID 
NO: of 
nucleotide 
sequence ' 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G— Glycine, H— Hisitdine, 1— Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /=possible 

nucleotide insertion 














GLHAC * AS * FLY FHSVETG FHHVGSGLG 
LEL,LTSDD\RPASASQGAGITGVTPRV* 
PLNSFSRHGLSCFFKETRVLSCCPSWT* 
TPGLK*SSCLNS* E/C*GYRHTPAHPAQ 
DMVFIFITY/CPFFVEPLL/VYIPRFLF 
LRW S FTL I VQ AGVQ WRNLG SLQPPPG\H 
KQSETPSQI *INKY*LiLN*LITFFFSIK 


9701 












I ILYPPL.LIRTS I YLISN\RL \VTTQQ* 
LIKLTSKLMITIHNSIGRS*SLILGSLI 
I I I ATTNLLGLLP YS FTPTTQLS INLAM 
AIPL*AGAV 


9702 


23603 


A 


9773 


38 


385 


YLI II.DSPEKGLICGCLCNCMHSSLLDH 
LNI»T*SS*MCLYANIHIYVAISCIYFI + 

f*si/yifii*yfhysiayvyiy/iyiy 
mythahihmci 


9703 


23604 


A 


9774 


13 


1350 


DRVSLLLLRLECNGSISAHRNLGKLGSS 
DYPASASQVAGTIGVHHHTRPTFALFLV 
ETGFHHVGQAGLELPTSGDPPPlAASQS 
AGITGVSHCTWPHLSTTGKILSSGTICP 
GIWGQRWWAFLIL\SLPWL*LWHPCLSS 
IMCLSLFFFLCQ/RCKPUSD 


9704 


23605 


A 


9775 


376 


1 


KYFELMYIPVIAICLFNSEVFLNK*QFF 
NLTSSSGIQNLNRLIISNEVESIIKSLP 
TVKNLGPDGIAATF\TYKEQbTLILKLF 
QKHEEARILSNLTSETSITLIliRKQQPK 
KVPNEHTVKILV 


9705 


23606 


A 


9776 


192 


2 


NVYQPPPVFLYPPGVAGGV*PF*SPPVF 
RVK/LGF*GSLFFFFFFFETKSRSVTRL 
ECSGVILA 


9706 


23607 


A 


9777 


77 


350 


WLRKFGSRNSGKLTLYLWVYYIiQTSFFF 

G*RDFPTLTS+VL\GPRVILEFGFLEKT 
GFSHVPQL 


9707 ' 


23608 


A 


9778 


133 


325 


PRWVRFPYLGDCTPRVSEMLGLQGWPS 
S S PFFVCFET* S CS VAQAGVQ / WWCDLG 
SLQPLPLG 


9708 


23609 


A 


9779 


145 


362 


RLAFGSCSTCFPLT*TVI J YYIiI*FNIFF 
AWTTERVDAAANSRSMN 


9709 


23610 


A 


9780 


260 


1 


QKQNQEKKRKKT FRNIKHQRDYNEQ* YG 
/NKLDNPEDMNKF*ETRYT* /PNLHQEE 

DGFV 


9710 


23611 


A 


9781 


230 


1 


SSDSKTGSSWLVACRFFFFLETGSGSV 
TQAEVQWHDHSSLQPPPPQPPK* /PGTT 
G A* S TLAS * NAE ITGVS HHAR 


9711 


23612 


A 


9782 


20 


322 


S QHFGNPKGNPKI FRNLE INF FRLRNAD 
HMI ISTDAQKVFDKIQHL.FI I * TS *QT * 
TERIFLNLI KAS * KKPTANI ILNG* RIjN 
I\FPKTGETKHIC 


9712 


23613 


A 


9783 


158 


379 


LVKIICNSFQYISFFFFFFLETKFCFFP 
QVEVQGGNFG*INLPLPGLNHFS \ASAS 
REPEIPGPPHHPG* ILVF 


9713 


23614 


A 


97S5. 


54 


312 


KVDKSTKIGRNQSKMAKNSKNQNASSPP " 
KDHNSSPARKQNWTENELDELTEVGFRR 
WVITNSSKLKEHV* PN/ GKETKNLEKRL 

DQ ... 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

locntion 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylnlanine, 
G=Glycine, H=Histidine, I=Isoteucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q-Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9714 


23615 


A 


9786 


13 


613 


PG AG WARQHHG V* DQ PGQHG \ KRPSLLK 
IEKIiAGHGGRCLLS PAIGGGLKQGESLE 
TGGRDCSEPRLCHC\MPAWGNRNKTLSQ 
/ QNKTR 


9715 


23616 


A 


9787 


794 


164 


FS FLFCFFFLRWS LTLSPSLECS CVTS P 
HCN1.QLPGLSHS PATVS + VAGT I GTCHY 
TKLS * FF\FDDSL*APFLGWDTETYYP 
DPPSSKGLLPHSGQHGQQHPPAISRAWL 
SC*VLPHLRS/STSLGQP/RIW*LSKVE 
V*RGFTFSPTQNTPTNNNCS IAPGRVD+ 
SLLID*LQWCDHSSLjQPQTPVLK\HPPT 
LASQSAG I TG I SHHTR PD 


9716 


23617 


A 


9788 


256 


380 


YI FRQAKTQGLP P CALFR EAVGENTP * V 
FAC/PKNVLFYVHP+LFG* I * IYQLKIT 
F * N YEGMVT / F VF R FL FETR SHLSPRLE 
CSGTITTHCSLELVGSTNSSISAS*VDG 
NTGVRHNASC 


9717 


23618 


A 








TKINSKGNKELNVRAKTIKLLEENIDRN 
LCDLGLGNCFLD*YCIL*TKINSKGNKE 
LNVRAKT I KLLE EN I DRNLCDLGLGNCF 
LDMTPNAQTTKGR\ IDKLDFIKI * NFWL 
DT 


9718 


23619 


A 


9790 


288 


2 


Q VWQE * AE TES AI H C W WER KA I Q LL * KM 
RWQFLKMTNV\ELPYGSAIPL.VGIHGRE 
LER*ST*NIMFTAPLFVIGK/T*NLFKW 
PSPYEQITKMHSC 


9719 


23620 


A 


9791 


142 


361 


PPGEGKRVPFTTEPSPRIPSKAKSVSQG 
DTCTPMFTAALFTIARMWNRSKCPSFEE 
* IKKMR C I TT/MECYSA 


9720 


23621 


A 


9792 


305 


346 


TYEYTDYGGLI L * * L> I TFYDHAVIj 1 1 FL. 
ICFLFL/YALFLTLTTELTKTTGGDAQE 
IETV+TILPGIILGLIALPSLRRLYITD 
DAPDASLTIKSIGHQWY*TYEYTDYGGL 
ILNS 


9721 


23622 


A 


9793 


2 


317 


SRDRPRVRDR*LFSTNHKDIGTRYLLFF 
A*AGVLGTALiSLLIRAELGQPGNLLGND 
HI YYVIVTAIAFVI I FFIDLPILI \SSF 
G\SVYVI)1iIIjGVPCTAVFLHSS 


9722 


23623 


A 


9794 


384 


1182 


rigkikg/lcflfv*wflkfffkmefl,l 
prlecngki\hcnlllmgssnsptsasq 
vagitgmc* f\vfliemrffhvgqaglk 
lltlgd/pprppkvlgi 


9723 


23624 


A 


9795 


205 


1 


GLQIKCTMRYHFAPTKMAITYFFLKKGK 
G\NNKCW*GS/GTEIGILIHCWWECGMV 
QLLWKTADAWADAW 


9724 


23625 


A 


9796 


307 


348 


QSARL*EAVCPFSDLQLRTGRTTAIiFKA 
VRQGHLSLQRLLLSF\VCLYPAPRGGAY 
RGRQAS LS CGGLH P VRAS RLLCLP KQAW 
AMVGAPTPASLPPCSS I SHCCASNQRDS 
VGYDPSSP 


9725 


23626 


A 


9797 


193 


381 


ILLI*IHILLSMISSPFFFETGSHSS/V 
SRLECSGTASAHCSliDLPGSGGSPTSAF 
*VAGTTGA 


9726 


23627 


A 


9799 


153 


359 


FGTYMCFADS YQLS YLFWGTFS WLE EE 
GV*WCICGSLQPQPPRVK*/FLPSLSLL 
SSWDYRCAPSC 


9727 


23628 


A 


9800 


2 


396 


ARAARAARELEELIKIFFFFFFFWGQIL 
ALMPKGGGQGGIIiTYPNPPLPG *NNFPG 



1135 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


1 M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LTPPRTGINGLGPPGRVKFGIFKKKGGF 
PPGARGV*NPGPRGTPPPGFPKGR\DKA 
PHPGPLi*TFEKVPRPPNFPQ 


9728 


23629 


A 


9801 


256 


347 


LYILLLNSLK*LVEKLR*NWLASLIIR 
YKVTTKRF+*FSKPAQSEELQLLTSAPG 
LNF*RYIISFFFWIQSFALVA*VGVQWC 
DQGS PQPIiPPGFKR FS / CRS WDYRHE P P 
R 


9729 


23630 


A 


9802 


109 


401 


HLRRPCPTPPFALRTSGDQPLSPEARPD 
SGLLHSPLTSPRGLNGQCPPRGSTLNFN 
P+ASIPASPFPSGQPQASQEPPPVASEH 
PPDT+GDPAQPPP\RPQDFWGPATVPRS 
QAGQWPSPLPSDFSKGAQWPVPPTR/PP 
PSTSTQSRWT. 


9730 


23631 


A 


9803 


371 


2 


PGFPLFSFPEGNGPSKRQTGD* IRCLF* 
DGKVWE CS PPKKKKKTKKRKKAVI F/CV 
PVQTKCI WEGGEETLVGDV* V* P* \GS 
FKKWAMFPEK/DCLCTLYEASFKTKES 
RRVDGFVCVRVGT 


9731 


23632 


A 


9804 


188 


1 


FLFFFFSET*SQSVTHAGVQGLEQSLPP 
G\SGNSRASASQVAGITSRRKHSWIIFV 
FFVETG 


9732 


23633 


A 


9805 


112 


383 


VFINIRVFRLFTFSFFFKRGFNLSPRVK 
CNG * TKGH C I PDLPG VR * S F P FS LLTNW 
DYRCAPPRPNKFFF/ SFETGSHSVTQA 


9733 


23634 


A 


9806 


599 


237 


FRDRSLTLLPRLHCHCVITAHCSLKIiLG 
SSELPQASPLSSWDYRHSPPCLANFLFF 
VETRSHY/ASRNSLGSSNPT*A/FPKCW 
DWQV*ATAPSLAYWFSEAEKLILRLIiAL 
KVSDENICTNF 


9734 


23635 


A 


9807 


16S 


2 


PQQDVFFFFLETGSYSVTKAEVQWCDHG 
SVQL+ PPG\STDPPTSASQEATGTHHRA 


9735 


23636 


A 


9808 


225 


3 


GPLGYLPPYFPQPLISRG*NPSDFLKIS 
RVFYGT/HYKFEFLFLFFFETESRSVA\ 
RLECSGVISAHCKLCLPGS 


9736 


23637 


A 


9809 


541 


3 


RQLTGINSGRRQFQPQMSNYVIPYMMDM 
I ILKTKKSNNKYWQGCEKTELL IHCLWE 
CKMVQLVWKTV*QFLNRSDIEFPHDPGI 
PPL/GYKRKKMKTCLPKKLCVLVEP/RM 
FTAALFE VAK * * KQ/PQTP I TR * INTMW 
RIYTMEYYSAIPRKTPLIHPIA* LDTES 
II*TEKIQSQETTYY 


9737 


23638 


A 


9810 


213 


1 


FLCVDFVPCNFAELKLALVGLFVCLLFG 
SVTQAGVQWCNL\GSL*PPPPGLSDLPT 
S AF * VAGT TGVSHH 


9738 


23639 


A 


9811 


54 


388 


PARPLPRQ/WDERPNQQPTKKKKKRGRN 
LPTKKKTQNYSHQNPVAQRNPGGKTNQK 
KPKTNPKKKKGGAL* KKTQRGPKQTGGR 
K*KISPNKGGK*NPCGKLLEKNPFSGGE 
K 


9739 


23640 


A 


9812 


429 


24 


RSKGAEFPICPCRSSKFSVLQQRQVRIV 
FQHPELGESVAR/CRYSQLLGRLRQ*SH 
LiNWGGRGS SEPRS CHCI PAW ATRASS I F 
CNFQAS S VEVRRSARKKLFSD I LKRHNT 
INWRVSGLLLVDSYFGRLATPVRTQ 


9740 


23641 


A 


9813 


149 


350 


CHFPCFHTTL/CE*EPGEDVTGPIGTKF 
FPAPGRTDGHTHTHTHTHTHTHTHSDPH 
HLCVWNTL.IHFV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylatanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9741 


23642 


A 


9814 


66 


401 


PENKKTVYRKP I ATI I FNCERLNALPLK 
LETRP*YLLY*LFNIMLEVLASWRQEK 
EINLS IFT*DIVIK/KKIPKNLEKKXiLK 
LLSPFSKITGYKISIQKLTSSFLYYGFT 
V 


9742 


23643 


A 


9815 


82 


1 


APPPARFFFFFFF*EKFSSFPPGEVQRG 
VFCPCNT/CFP*G*KIFFL\KPIjKKVGF 
WPPPPGPKNFFYF*KKGGFPFWARGFFI 
PTPFVPPLHPFKKWGVKRG*APPPARFF 
FFFFFL)*GRVSLCHPGWNA 


9743 


23644 


A 


9816 


256 




QHFP*LPIPHIPIQT*QHFPQTRSPFQG 
PPLCQKPFSSPWGSKWFILLSFFFFF* 
DRVSLCHPGWNAVM*SQLTAASNTW\VK 
*SSHLSLLSS*DYRCVLS 


9744 


23645 


A 


9817 


53 


411 


TIYCSNYVYNFVSQHFRSLYS*KNCGLC 
L* FHL>FVLiQS * NFMQFHM * FCLK I * VDP 
IITFEV*FTYI*FF*TYSSVSFTDAYNF 
VKPHR/VYRGPGAVAHI CNPSTLGGQGG 
WIT*GQEFK 


9745 


23646 


A 


9818 


100 


417 


DEHSEKQDEPLQRPRSKLLPFPGTPPSS 

FDMKERLCLLPMIjECRGMISDHCSLI/R 
PG* DKPPASAYQVGETTGTYRRA 


9746 


23647 


A 


9819 


136 


2 


IYMGMQPGAVAHACNPSTLGGQGG/WWI 
T*GQEFKTRIiANMVKPC 


9747 


23648 


A 


9820 


257 


426 


I VPGLFLGTGS CS FTHAGVQ * SNCSSVQ 
S * TPGP \ SDP P AS ASRVAGTTGAHHKAW 
L 


9748 


23649 


A 


9821 


151 


416 


LHKLCLMVMESGTAKTPSLSPIFNSVGL 
ILLLLEYS IDS LC * SVLCCYKEIPE AG * 
FI KKRGL/ WLG WAHACHLSTLGGRGGW 
ITRSGN 


9749 


23650 


A 


9822 


50 


513 


RGDPRVRPRVRI IKLSEEGMWKAEIGRK 
LDLLHHT ISQWNS KEKFLKEMKSATSV 
NTRMI RE * NSL I ADMEKVLW* DQTSHN 
I FLVQNI I KSKGIGLFKLMRAKRDEES / 
AEGKLDAS KGRFVRLK/ E KRSHLHNMKV 
QDEAS VD I ETTAS YLKV 


9750 


23651 


A 


9823 


48 


406 


LIIWYTLTTRWIGHSAVSYPLLRPPYS 
LRHN I E I RP I NNPTMAS K CS S E RKS IiT S 
LTFHL KIKMIKLSE EGVS KAKTG* KLGL 
L/R/QTVS QWNAM KKYL KE I KS AT P VN 
T*MIRKQNNIi 


9751 


23652 


A 


9824 


143 


444 


WNKKNNKNELADMILWGCGDKTIFMRKL 
TRDETKYGIPQLALRGHSHFVRDGVI \N 
LDGQVDLS\G*WDGTLKLWDLTSGSTTR 
RFVNK/TKDVLSADFS 


9752 


23653 


A 


9825 


297 


2 


HRS PTKETKI IAS I KKKTMI TKDLNRQF 
SKQDKHIiTSEYMKRCPTS/IiREM*AKTI 
MR YHLTP I RRAP I KNKNK/NXS KCW * GC 
GETEILiVHCW*ECKMVP 


9753 


23654 


A 


9826 


175 


400 


GGKGGKNFSLKGGEKKTNLG I FGKKP I F 
GGGTNGANPPPKIKGSKEKKNF* VFFPP 
FPFKNFFFP *NL* FLGGGWPHLS PPQK+ 
V/CFPKIPKLVFFSPPLREKFFPPLPP* 
NLiGPPGF/ SFWGPPLFFFFFFFF*RDKF 
SFYHPGWKAVIAAHFSLELLGSSNPPTS 


9754 


23655 


A 


9827 


64 


622 


MFSFFIjFDLiIIjNHLiFFCFV*MFSFF1jFD 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


1 SEQ ED 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














IilLNHLFFCFVIRQQFSFLFF/ ISFFFS 
FFHCNVNTPELCDFSVCIHSDNSFRFLH 
LF/NF*HMTLCISELR*TLKQSCYSYTK 
IC**FYSVQLSFLTILY**CKS*QALSI 
ALQNEFLSV\QYHKNMNYITLTIEFSRN 
KFLS VTFLLFS I FSHNFG I S AS ISIFIL 
HVY I KNNP IGFFK 


9755 


23656 




9828 


298 


1 


MRNCLTDDERKWFPEMASTPGEDAVNTV 
EMTAKDLiE / Y * * Y INLVDKAAS DFEM I D 
SNFERCSTV/NKMLSNSTACYRENFHER 
*GQSVQQTFVRVGERV 


9756 


23657 


A 


9829 


361 


1 


FLTGTQWGK/DSPFNTWC*DH/WNIHRN 
LDLYL IPYI KINLKQLTG PNLRAKTI KL 
PEQNIGENLCDIi*L,SRERYSTKSITRIR 
KL/DTLGFIKI KNKCISKDTIRKR*ATD 
WEKI FANHVLKG 


9757 


23658 


A 


9830 


402 


2 


RFHHRFSFVLFGPFAKVAFPIGAKNFIS 
NHYWAGVFPFWEKKNKIDSFLPPFSKNP 
ILGELKA* F*NFLF*GFQIFF*SLFFCP 
FGGK* IPGVFSFFFFFLVFFEIGSHSVP 
\RLKCSGTITAHCSIjDLLGSSN 


9758 


23659 


A 


9831 


2749 


3215 


FCQ * I KM YVMC I CI YNFRRVYLCI YTYL 
HTHTFTHTHKKTPTQ I PEKDS /QCSLSD 
LKGHSL 


9759 


23660 


A 


9832 


3 


386 


KLRLGQLNTSAMAAKC*SERKS/R/THL 
TLNQKLEMIKLSEEGMLKAVIGQKLGLL 
YQ/TS Q WNAKEELL KDIKSATPVTA * M 
IGKQNSLIVDLEICV*W*IEDQ/TSHNI 
SLSQSLIQN 


9760 


23661 




9833 




370 


RRCRWPPDPSRTVGRQIGKLVTHRPTVF 
QERGCPFPLTRQAGSHHGGGAFAQVISP 
TKS I S PCGRGGSRL* SQHFGRPMQVDKL 
R/ LGVQDQ / RWP I CGQ YGETPA\ LLKL P 
KISWAWWLAPVIPA 


9761 


23662 




9834 


125 


409 


GQENRETWKIVHLV*VLYT*HIKRLNCF 
SHFI*CYQPTASQAHVHHDSNDSSTHV\ 
N*NSRWPGTVAHACNPSILEG*GGWIT* 
GQEFETSLANM 














PKPQKMGTF*PPMEGS*MGPPHWEKMGP 
FLKKKEGHPFFFFFFEMESRSVT \ RLEC 
SGA/TISAHCNLCLPGSSNS 


9763 


23664 


A 


9836 


377 


515 


FILFLRQVLTLLPRLEYSGAIVAHCSLX 
AF*G*SDPPASASRV 


9764 


23665 


A 


9837 


274 


1 


AGEWHDQICVFKIPLMKNGLHGARVGQE 
S WKERNQQV* DRCQKQNLSDS * I QT*R/ 
M/WPGTVAHAYN/PSTLGGQGEWIT*GQ 
EFETSIiANMVK 


9765 


23666 


A 


9838 


553 


114 


GAGVKTHPGHKGKTLFFFKFFFQKLPGV 
GGKTLPSPLSRGLGRENSFHPGGKGSNK 
QSSPPSFPG/WGKKGGGLPFQKKKKKEK 
RKKGKKNGVAPSE PPLKYSN/ WQATWG* 
RKLLNDTMRVQPTKSRMQKILQDKGPVN 
GIFTKTGRL 


9766 


23667 


A 


9839 


36 


434 


LPFPQCVTEFIIVLISWCYIREDACKNL 
KHVAITIINVCIYICVCVCPSTLYVYMY 
MLP /kls dtlltln* vtlvemls s fqcl 

MFWYHGHI *K*HEIiDWAHACNPSTMGG 
QGGW I T * CQE YKTS LANMVKPR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparaginc, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9767 


23668 


A 


9840 


268 


1 


RFPPFFCFLTILFWAPGWFSNFIjGVRA 
PQPP * IFFPI* KKPS PLI FFFFFFEKES 
RS VS \ RLE YSGTI SAHCKLCVSGSQHS P 


9768 


23669 


A 


9841 


463 


86 


LYMHEFISGLFLLFCWSl/WSVLCQYKN 
C FDDKVYG LDYGNTS QVFT * NS Kh I KL Y 
TLTMYSFLCINQTTIKPSEMIQATKANI 
DK*NYMKLENFCTAKETVNKMKRQLKKW 
EXVFSIHIPQEVNI 


9769 


23670 


A 


9842 


37 


400 


VHSPPFKFWSLYVSVLKFPFCSLYFKFL 
ALYSI FLQMCQHLLV* AFFF* KRGLNS V 
T\RLECRGAISPNCNLCL/£GSSDNWAS 
LPRKAEAGNSFT/CR*-SQVAGNPIPSIS 
TRLNQRKTPFES 


9770 


23671 


A 


9843 


42 


407 


NCLSFLITFFFFFLiFENKVSFCPQG*G/ 
RGG P F W PHG TL P P RG * GNNP P * L PGRGE 
*RGAPPPPGYFWLFGKKRGFPL\GQGGS 
KPPALKEPPPLGPPKGG\NYKRNPPPPP 
GNFF * L P PYQW 


9771 


23672 


A 


9844 


113 


379 


LGPWATSXjRGRFLGYQLiQQSKKH*NYP 
TVR*GCGEMRTSVHCW*ECKTA*PL*KT 
VWQFLKKIiNTGLPFDAETPb/LAETPKE 
LKA 


9772 


23673 


A 


9845 


284 


528 


PL.PKTMEIMLDKKQIQTIFLFKFKMGHK 
I AETTRN I KNAFG P GTANE \ K CTAQ WW F 
KKFCKA*E\SLEDKEA\*GHPSEVDTN 


9773 


23674 


A 


9846 


888 


1295 


LEQGCNLFLHKHS*LGEVFFFVCFRRSF 
TL,VAQAGVKWRDLGSli/HKLPSSL.SI.PS 
SWDYRPPLPRLA/NFFVFLVEMGF\TVL 
ARMVS IS* PRDPPTSASQSAG 1 1 GVSHR 
AGPVAG I LMFCRWNI SNKPKGAVFKKK 


9774 


23675 


A 


9847 


88 


387 


AYRMKI IDRISLSLYTAALWVTLFLFLF 
LFFFFKTEFCFAPQAGGQ/WGQFKLMDP 
NPPPPG*K\DFLVSHPRDLGIKGAPPQC 
GQNFVFYKEKGGWLLTAT 


9775 


23676 


A 


9848 


298 


438 


KIPRGAPNFPGGGKIFSPPLGGPIKTPR 
GALEKKTLF*VRGPWPGFVKKESPRPKK 
LG*QIFWPPGKKPNPPMGG/SLDPPPPL 
ILSRPDPP*KKKKKJCKKKKKGGGPLKNP 
PGGPKFPRGGKNFFPPFRGAYKNTPGGS 
*EKNPFLGGGKKRKQP 


9776 


23677 


A 


9849 


251 


498 


AIIKKMESKKFW*RHGEIGTLIHCYWEF 
KMV* P FWKTV*Q FLK/ D/LNMGL P FDS A 
FQLPD I CLGELKTYVHTENCTQMFMAAL 
F 


9777 


23678 


A 


9850 


478 


37 


E SRNKHSHLWS INFQKGLR* LNRGKQPL 
NK* CRHNS I STCKRMKLDPYLTPYI KI \ 
KSKWTKDLNVRAKTI KSLjEENIEVNLHD 
LG*GNDFLDMTPKAQTTK*NID*LDIIK 
I *NFCGSKDTINKVRRQSTE*EKIFANH 
VADKLK 


9778 


23679 


A 


9851 


2 


378 


RLEGLFLCALFCS I AI CMFFFFFFFF*K 
KRGPFGFQGGDPGGNFRQLDPLPPGI KQ 
F WGP \ P PRKAGKTGGG P P PGANLG F WGK 
KRVPHGGQKGFKP/RNPRGSPRPGPPKG 
GVAFVFRPKP PGLEQ 


9779 


23680 




9852 


229 


3 


FGP YKI FFKKKGAPCPLENPPVF\HQKL 
GLGFFFFSFFFF*DGVSLFLLRLKCSAA | 
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26 
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beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threoninc, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ILAHYKFLLPG * SDSAASAS 


9780 


23681 


A 


9853 


108 


420' 


GPRVCVRLSLRDDSDLLATEDSLAHTEG 
DPVTAS SHSQAPS iiSVHPGAALEGVCWG 
TGAIAGTPEPQKPPQLTPGGRPA\PIAP 
GLTWEPPSHPPPQ*RSPPPPP 


9781 


23682 


A 


9854 


2 


256 


FF I FPLRYY I YCAR FQFLSPI LYLGK* K 
MD* * RRVFQEKWEQAYFFVEV\ S PMCL I 
CNQTLS VSKE YN * R CHYETWHGENFD * F 
TEKMHDENLLHLTTRQP 


9782 


23683 


A 


9855 


323 


3 


IIQMANKHRKRH\QSQ* * VRVMQKKI IM 
IHRYIPQWLKTSKSDMIKCCSGYGTTEI 
FTHYL,WECKLLHLLWRMVWL\ ILLKFKT 
DTPHDPAI PLLGTYPAECMHMYPRA 


9783 


23684 


A 


9856 


28 


385 


DGVTQAGTQWGS KFTAA* TA* YQGS I PN 
AWSFFSPPPVLKTPPPPQKK/YPPPKKK 
KIPPPKKK 


9784 


23685 


A 


9857 


436 


518 


AANRLENI YRHL I YDKD / EHYRI VGKDD 
LFNKWCWVNWI /ASNKEKNLDSYLIPHT 
K/ LNSR* I IQLDVKD* I I KLLEDNLYLH 
DLGDRQKFLGR I / PVFT I KKKIGKFD PM 
LKLSTFVHQKTPC*RLNSA 


9785 


23686 


A 


9858 


153 


492 


RliAGSDPGVADVSVQLQRQEKSPCPSWK 
AVS QAE S S S S S AG VSLVL FRLLADWMRP 
IPIKEENKL\SQSTDLNFKFIQKHCHRN 
TQNSV* PGGVGPPGTCGPFRRQSSP 


9786 


23687 


A 


9859 


2 


419 


TTGKLQVSHKKSTYSHFSKKPTHTTNKD 
MKRYSPSL/AIREMQSKTTG\MRYHFTP 
KYG\NNKCWLGCGETETL I YGW * E CKMV 
QPLWKAVWHFLK* LNIES / P I Y * *NYS * 
YISYSWRKTCITIYDPVILLLGIYPREV 
KTH 


9787 


23688 


A 


9860 


88 


419 


TFFFFFGFWFLKTKPLFVPQLFEGPGPI 
LG*WNPPPRD*KNFS/GPNPPGGGE*RA 
QPPPPGYFLFFKKK\GVSPWGGGGSKPP 
TPGNPPPGPPKIRVIRGGPPPPGGRKHF 


9788 


23689 


A 


9861 


301 


401 


KRA*GGQEPIKVCPFFYNRKVGNGSPPK 
KFKKGRRAAFFGLRANKRGLF VKKGK K I 
WVGKVGEPIT/DPFKEFGGR\LPEKDGP 
LV . 


9789 


23690 


A 


9862 


220 


415 


KIMGGAQI FRGGGGFFFFLEGWEKNNFG 
VSFRRKFFFGGGVFLPPPP*HKKNYFSS 
QRQYISLGGGGRKTPPPKKNFLLKDTPK 
LFFSHPSKKKKNPPPPRKIWAPPMIF* I 
PPPPIFFFFFFFFFFFFFFFFFKN*P/S 
FFFNFKKPIFKTFLSPPFKVFPPPKKKK 
KKNP P I S YRRWPLAI 


9790 


23691 


A 


9863 


116 


366 


GQEFETSPAKFCFETESCSIA*AWQWH 
DLS S LQ PLPPG \ SGD S PAS AS RAAGE LL 
EPRIRRLQSVEITPLHSSLGNRVRLHL 


9791 


23692 


A 


9864 


619 


295 


FFFEMEFSLLLPRLECNGAISAHRNLRL 
PGS S DS PASAGG LL * S QVAG I TRLRHHD 
*IiILY/FL)VEMRFHRVG*AGLELLTSGD 
PPSSASQGAGITGMSHSAGHYGKI F 


9792 


23693 


A 


9865 


12 


432 


IADRRL>FFTNHKDIGTLYLLFGA*AGVL 
S TALSLL I RAELGQPGNLLGNDHI YNG I 
GTAHAFG IILVIVIPII IGGFGN* LVTL 
I I GAPDMAFPRINNI S FGLLPTS LLLLL 
TSAILEARAGTC* TVYPCLA\GGYSHLG 
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Predicted 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Prdline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














A 


9793 


23694 


A 


9866 


30 


380 


LFSTNRRGIGPLYLLFGA* SGVLGPALT 
LLI * ASLGQPG YLLGDDHTYND I DTAHA 
FVI ILFI VI PI I IGGFGD*LVPLI IGAP 
DMAFPRIN/NISF*LLPASLLLLLASGI 
REGRSK 


9794 


23695 


A 


9867 


256 


395 


E VL.RKS EKS FETVCI INNKC / WPGMVAH 
ACNPMTLGGRGGRIT*GQEF 


9795 


23696 


A 


9868 


458 




PKKRFFSQKPPRGFYSAPliKGKNYI FPP 














RFFFFFLEKG * KF*NFFFT* NMVFFYIS 
/CSKKFPFQFVL.TPIPF+KVPKPVWFFW 
DFPIFFKKIFFFFFFCDRVSLCHPGWSA 
VARSRL.TASPRV 


9796 


23697 


A 


9869 


144 


425 


IPL.CSRIYSFGPC*FSLISSSSCLSPFL 
SIPLCVLS**T* * YLFFYISMREIYKTR 
TKGI YPGAQRWSLYTH\ FS I EVQST I KA 
ERFWPGAVARACNPSTLGGRGGQITWGQ 
EFE 


9797 


23698 


A 


9870 


138 


401 


DLRLKLiNliSKNI I RLQGTVE S SVLT I KE 
MQ I KAKVR YNLI PVKI KR /SGNDRWW * G 
CGERGTLIRC* *ECKLVQPLWKTVWSFIi 

KKKKK 


9798 


23699 


A 


9871 


179 


441 


PSGKGGIKGPPPPPKKIFGNFFFFLKKK 
GVPPGPPGGPKPRPLETPPPNPPKGGEY 
GGGP\WPPPPIP*FFKGPFKKKGPPWTN 
RGNP 


9799 


23700 


A 


9872 


57 


435 


FTQMRKNLKNNSGNMKK'GSITPLKDHS 
NF PAVDPNQNEM FE I PGKE FKK/ S DY * V 
TQGDTRERCKPT* VLKTI LDMDEKCS KE 
MDILMKNQSELLEKKDTFRELQNAMESF 
NNRLDQVEERITELE 


9800 


23701 


A 


9873 


228 


443 


FSSKFILISVFLFLFLETGSCSVAQAGV 
QL*DYSSLQPQSPG\SGNPPISASQIAT 
TAGACLHALL I FVFFV 


9801 


23702 


A 


9874 


349 


2 


KKPKKKNFLPPKNFGFFSPFSP*KFFFF 
LKGFNFFRGFFPNFPPPKKKFFSKNSQL 
VFFLPPLKKKI F/ CFPTPVKFGP/ SQRF 
FLKGPPLFFFFFFFFLDRVWLCYPGWSA 
VARSRIS 


9802 


23703 


A 


9875 


405 


2 


KKASRMENVRKKKKLEMIKLSEEGMLKA 
NIGQKLSLLAKQQWNAKENLLKEIRSA 
IPLNTQMIRMQNSLIADMENIL/WWI/ 
EDPTNYNIPLSQSLIQSNALT\ FNSMKT 
ERGE EAA* EKFDAIS G * FMTFKERS 


9803 


23704 


A 


9876 


355 


394 


THPY YSHQEYQS P * P / LTG ALS ALLKTS 
GLAM * GHFHS I TL.L I LGLLTNTLTI YQ * 
WRDVTR*SAYQGHHTPPVQKGLiRYGI IL 
FITSEVFFFAGLF*AFYHSSLAPTPQLG 
GHWPPTGITPLNPLEDPLLNTSVLLASG 
VSIT 


9804 


23705 


A 


9877 


47 


85 


TIYIHICVRIYACECVCVCAYIHICGVP 
AHLTEQNlAKPARTQSPPAPQKNPELPV 
DLVS ISLDREAQDVELiNHYRLiGNTCRTD 
VLPKVNTLCIiRL/NFFPCLPILEELQEL 
P * LEWI PLFPQ P I DYI YT YMRAYI CV 


9805 


23706 


A 


9878 


2 


389 


GRGQFPPEFFYFFWGGSLVPSTPNKKKP 
LGEVLGGGWSNRAPKGWGGGFPPPILAF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G^GIycine, H = Histidine, I = Isoleucine 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 














KPKGPFGPKFVLKFSNVPPLLFFPKMRG 
PLPKGGGFLPFFFFFETKSRSVTQPGVQ 
WCNFGSL*PPP\TGSSNS 


9806 


23707 


A 








FLGPSKNFNPFPPQKWPFAFFPKSHRAR 
VNEIFS/RLSVINF*GLTKPQRLFFSLL 
NFSS\PIPPFQFPSRLIKEIRFSTFLTP 
ENSKIKKPLFFF*FFFF*DRVTLCCPGW 
SAWQSR 


9807 


23708 


A 


9880 


123 


387 


SFLWKLRPSGAPTCMRCLLA\LLGGVSQ 
LG Y TG VR D PL E E A VC P FS ELQRHAG KTT 
ALFRAVRQGCFFFAYF* IELFPCV/CSG 
RVGG 


9808 


23709 


A 


9881 


341 


659 


SFLGRVQWLHACKSQUFGRLRQVDHLRS 
GV* DIiPGQHDETLS LLKMPK / QKLARHG 
GMCL*SQLLRRLRQENHLNLGGRGCNEL 
RSQHCTPAWATEQDSISKTKTKK 


9809 


23710 


A 


9882 


216 


1 


PKFFWVLRDSQIFKPGFGPQGFF/GGFG 
NPQGKKKPGFWGLG*KKKGFFFFFFFFF 
LRDG I S FCCPGWS IQW 


9810 


23711 


A 


9883 


2 


244 


GRVGAVGRREGENFIKCDELPPPPPEGK 
QQNLCKFLTGFSLPCQDKKP \ CFFV*KK 
KKKKKKKKKKKKKKKKKKEKGGGGA 




23712 


' a 








**KQ\TNFILSSILLYYI*PSGSYISNL 
*Y*ELFSQNLSYQHFLWI*KKWGLGAVA 
HTYNPS TLGGQGRW I AWS PG 


9812 


23713 


A 


9885 


242 


358 


KTVWLYMNI *MANNHME*CSLPLVIREM 
* IKTPVTVH\TVGMAAI*NKK*NITR/C 
W*ACRETAVI,VYCWGEFYTQPL*KTVWL 
YMNIE/IPCD PA I PLLGMYL KE * R PGTV 
AHTCNPR 


9813 


23714 


A 


9886 


360 


2 


LLKLGFPFWAKKGKKKRLWVEGPLGKPP 
GGFCQKLFFSPGGGPKGFSPSIIIIiPKG 
* PFVWGPFREVF * KKRFKLKKKFRPTFS 
/Y/LFFFFETESHSVA\RMECSGAILAH 
CNLCRLGSSD 


9814 


23715 


A 


9887 


236 


347 


IKGFFFF*TESHSVT\RLECTST/SAHC 
NLCLPGSSNSP 


9815 


23716 


A 


9888 


310 


3 


NFFFFFETGSCSVTQARVQWSDQ/GSLQ 
PRS PGL \ SDPPTS AS * VAGNLKFLSGLC 
SSL*SLEIS*DTKKSQVQITDDLCYTW 
LCLFFEME S HSWQAGVQWH 


9816 


23717 


A 


9889 


119 


391 


APAYWNPHARDSPFFFF*TNFPFFPQVG 
GQWPNSGSPQAPPPG/SPPP/YSASASK 
ISGAPGAPPPPGPNFFFSFFSSPTFFSR 
DGVSPYEP 


9817 


23718 


A 


9890 


195 


3 


DGVSLL.LPRLECNGTILTHCCLSPV/SW 
DYRRLP*RL>/V* FFLFLVEMGFHHVAQA 
GLELLTSGDHP 


9818 


23719 


A 


9891 


3 


378 


RDGERNMIjALGTASAKALKYSLALSLPR 
LECSGTIISHSSLNPP/GLKWSSCLSST 
SSWDFRPAPPHLAKKTKQQ/HATFF*EG 
ESPYVAQAGFKLIASSSPPAFSLPKCWD 
YKL \ DRHAW 


9819 


23720 


A 


9892 


231 


1 


PKPFFPKFFSP*N*KPFFQISPAKIQKI 
RVYQS \HKKFFS FFFFFHGVSLLLPRLE 
CNGTISAQCNLCLSGSSDSP ■ 


9820 


23721 


A 


9893 - 


2. 


347 


APARQENWCVYVCVCMCVCIWLHRHC 
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Predict- 
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ding to 
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Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lystne, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EVLCFYKMLGKKRS I YAY* LVY* Y I YIH 
MCVYIYI/YIYV/YMYIYTYVCIYTH 


9S21 


23722 


A 


9894 


28 


382 


TAHLEAGATGYRTFLLPAIGLLCLPWVL 
DDGAGQTAQ*SMLLDHAML>LAHRAHELF 
I / DTYYE FEET Y IP KDQKYS FLHDSQTS 
FCFSDSIATPSNMEETQQKSNLELLRIS 
liLLIESW 


9822 


23723 


A 


9895 


93 


386 


FWKIISHYYFKYFSCSILISPSDLPITC 
ICMFHLLKLCQSSEFLHLLLFVWLVFET 
QSHSIKQPECS**LSAA/CHL.PGSSDPP 


9823 


23724 


A 


9896 


174 


1 


TLDWGPRKKNPFFFFFETRSRSVTEAGV 
Q*HNHGPLQPQPAG\SSDSPTSASQADA 
W 


9824 


23725 


A 


9897 


251 


386 


LQRLECSGT I / SAH CNLCLLGS SNPLAS 
AS * IAGTTGTLTGDVDST 


9825 


23726 


A 


9898 


127 


393 


GHMGTLTGS PATLS I LiALFFWGGRISTQ 
LDGINKS\FSL,FLYF3 J RHCSLSL,RLE*S 
GAIITYCRLELLGSGDTPASASQETGTA 
GTCHT 


9826 


23727 


A 


9899 


3 


334 


KL.RLGQLNTS AMAAKC * SERKS/R/ THL 
TLNQKLEMIKLREEGMLKAVIGQKL.GLL 
YQ/TSQFVNAKEELLKDIKSATPVTA*M 
IGKQNSLNCDLEKSCWLIQDQTG/HNI 
FLiS 


9827 


23728 


A 


9900 


356 


1 


GRKPFSHLPKATLLPMGPVLGGRRALMG 
PDSRPGPVPSCSLVXiLTPLAPLPLTARE 
SLCPCPPS * TPQPSVNP/H/GKLARRSP 
CWSGRQSIiP*AEIVPLHAPPALGDRDE 
TPSQKKKK 


9828 


23729 


A 


9901 


118 


343 


IPYAKEKKKQENVLRFIHVNLCISNWFW 
FETGSHFVTQG*VHWCNLSSLQP/?KTLQ 
GSGDPPTSASRAAGATGVQH 


9829 


23730 


A 


9902 


321 


50 


SLRTRV*RPPSAPG*NPFCLKPQKIFQG 
GG \ ES PLSQ I LKR VKQENS YNLGG KGFN 
* PKLPPCPLTWATKKTSPPKKKKKKNYR 
TWETSVS 


9830 


23731 


A 


9903 


317 


8 


NCYDPNSGIVIPLLGILPKSTK\T*VRT 
KTCT*MLIVALCIITKKWK*SKCLSTDK 
QVNKI *YIHIMEYYSPIKGRKY*HTLQH 
/WMNLENIS *KRLDIKKPHII 


9831 


23732 


A 


9904 


287 


3 


RGALNGRGGPRGVKSPGFAPFKIiQLKPP 
GTFLNPK\QFPFFPNPPF*KKGAFFHFF 
LPLF*SFRWTEFYSVT\KLECGGAISAH 
CWLCLSCSSDSRA 


9832 


23733 


A 


9905 


2 


406 


PRVRTSSRSRATALFFFFFFFVFFWGKG 
EIWAPPLKNC/ IPPEKFCYFGGGAPGAG 
LP PPPGVI FFSLCRWKKKKKKGAPPGG 
VLGKKKKTPFTPKGGTFLTLGGF* KKSL 
FGKKTHjWVGGPLLLiKNFF*EKFR 


9833 


23734 


A 


9906 


1 


296 


IWVGATECIFKNNPTIWGHQETHFTCKD 
TYRLKV/KGWKKIFHTFHTNGNQK*AGI 
AIVISDKTDFKSKTIKR*KGHY/IMTNG 
SIQQGDILDWYKSNCSF 


9834 


23735 


A 


9907 


474 


41 


FMEYX.TIjSFIHVLIjQ*FIHCFVULFIHS 
FFYALMKSLIQ*FIHLYSCF/ICIDLLI 
S SCI YS F I YALVKS LANSLTHS F I HT FL 
Q*FVHLFTPLLIC£FTHSSIH*PSHSLT 
FIQLHLFFHALVSGFIWSFIYLFCKINV 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FLIDK 


9835 


23736 


A 


9908 


286 


84 


LKPPPPFFFFFQANFLCFIRD/GFHHVG 
QDGLPIjLTS * SASHGLPKCWDS RCEALR 
PASPLISGPHQTI 


9836 


23737 


A 


9909 


316 


63 


DQLLVRSIGFEALMSPIFFFQMEF/SL.S 
LPKLECNGAISAHRNFCLLGSSDPPASA 
AV* /PMANLLKLCASVSYCIELVLiNELH 
WIK 




23738 




9910 


370 


461 


F*F*FLFSETESHSVA\QLECSAMISGR 
CNLRLLGSGDSPATREAGAQESLEPRRQ 
SEPGLCHCTAAW 


9838 


23739 


A 


9911 


294 


28 


PGKNSTVQKEPGKWFCEMKFTPGENAVN 
IV*PTTKDLEY/WINI/DKTVAGCERTD 
SNFERSSTMGKML\SNSIAWYTEIFRGR 
KSQLMGI 


9839 


23740 


A 


9912 


307 


463 


CKL * TLSNND VI VGSLI VT / TCTTLVGV 
LIMGD/ RLCMCGEKEYLGNLGTFLS I L 


9840 


23741 




9913 


377 


28 


REMLTVKPLiAPSFFRWSRAYRAIFFITP 
TWVS KSGKVTGPLFFLHIFFPGP/CPGL 
PIFFFIKTKGTRFFFGWFFFFERVSLCC 
PG * ME CNGAI SAHHKLRL PGS RYS P AS A 
SEFHR 








9914 


384 


725 


HLTAIRMTVIKDTENSKCERGC\KTL»MH 
S CWEYERVQPLWKTRW* F / LQQVE/LPS 
P CDPS IP/ LiLRM YP K* LNTD / CKRMC / C 
I PTS IAALiFPVTK/SWKQ 


9842 


23743 


A 


9915 


187 


3 


QPHKHLGbDNI YI FFEIGFCS VAQTGVQ 
*RDHGSLQPRPP\GSRDPPTSGSRVAGR 
PRQENG 


9843 


23744 


A 
— 


9916 


15 


167 


DGVSLSLPRIjECNGAILAHCNLR/LPGL 
k*fsclrlpsswdyrlppphpahv 


"9844 


2374S 




~9917" 


~345 




YCS R TS L P Li / TL KKGS A * P G AVAHAYN P 
SILGGRGRWIT*GQEFK 


9845 


23746 


A 


9918 


460 


42 


HQR P KVDKTTKMG E KRSR KTGNS KNQS A 
S/APPKERSSSPAMEQSWTENDFDELRQ 
EGFR*SNYSELKEEVRTHGKEVKNLEKK 
LDEWLTR I / SNAEKS LRDLMELTPKA* Q 
LPPERTNLSS * FNQL.EERVS VMEDEINK 
MK 


9846 


23747 


A 


9919 


282 


21 


AAGYLQPSEFWLHSLGSRRGLPISVFPV 
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ed end 
nucle- 
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correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=l_ysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RGRQAWLSCDGYHPVRPSRLLCLPTQA 


9851 


23752 


A 


9924 


254 


3 


PGGFYFPGFLPPPLGFPKGFL*RLWPPC 
PATFFFLYQISKFPFFFFGKYQKNFFFF 
FFFFPESKSRSVT\RLECNGAISAHYNL 


9852 


23753 


A 


9925 


309 


1 


YTPTIMLLGIYSNELKTNVHMKT* I*MF 
TATLFLIAETWKQPRCSSLSEWI/QQLW 
YIQTI/EYYSWRRNIK/LMKTW/MKIjK 
YILQSEKTQSEKTTYI *FQLYDI 


9853 


23754 


A 


9926 


237 


2 


RWNFNFTQLILKCIFLRTKFFS*VSK*N 
FKYFLGFWFFFFLRQGLTPVT\RLECSG 
TISAQCNLRLPGSTDFSASASQE 


9854 


23755 


A 


9927 


244 


1 


KPQCIjWRK*D*WHNTQKLDVEKGKPIiIH 
YWWERKLV* PLWKTVWRLMR/NLKI ELP 
YDPVIPLWGNYSKEMK*LCQREDSRA 


9855 


23756 


A 


9928 


2 


644 


IVQVDQKIRLNIHCLQETHFKCKYTYRL 
KVNR*R*VYHVNINQEKALVAIDGAYFR 
ARKITRDKERHY/ IMIMGSMLQKD 


9856 


23757 


A 


9929 


1 


364 


GTSGTRPINNTTVFSKCSSERKT/R/TH 
LTLNQKLEMIKLSGGGISKTDMG*NLGL 
LHQTVSQWNTKEM FLKE I KS ATPLNT* 
MIRHQNSLIAAIGKVLWWIDQ/TQHNT 
LLSQRLIWNKALT 


9857 


23758 


A 


9930 


88 


374 


ILKACGPCPGNPHPWGPL*GQGGLGLNP 
SKSPPHPHPRGDPTWGGTPIGDYGPKPP 
KLPHGAGLPKKPCSSGGQNN/PKISGP 


9858 


23759 


A 


9931 


384 


2 


LCSPRFFYRKIWGGGQGSRLLPPPFWKP 
GGKDCFS PGVLDPPGEQRGAPCFP* KKI 
RWG*GAAPWPPPQEG* /VGRAPLTPKI 
PAPVSFGCSPALPGEGQSETPFFLKKKK 
KEKSEAMIPPPPGPRA 


9859 


23760 


A 


9932 


32 


359 


IFFFLFFF/ C*KGGPPPVPQAGGQGPKT 
PP WP P I WAKKTKPLS KKKKKKKKGAG P P 
PPPKGGPPPLKKGEGPLLKRKFLKKNGG 
ENFKKKKIWAGGGGNPREPPPFGGAR 


9860 


23761 


A 


9933 


138 


1 


TGFFCCC*SFALVAKGGVQWCDLGSLQP 
PPPG\SSDSPTYASCSC 


9861 


23762 


A 


9934 


103 


355 


LAGQACCS * STS ANRCVNHA\ PGYANGR . 
CDTITCILVFIRNSWLITPIALITMFCY 
HVG * VRPQEQAPGNRVS LS DLLL\P P VT 
C 


9862 


23763 


A 


9935 


29 


363 


AWSHNAFLLFFFKRELAFVPQRGERENN 
FVSLKPPPPG*SPFSAPNPPKGGNKGPP 
PPPQLI FFYF* /GEKGFS PYKPEGPKPP 
TLGKPGLF*NTKKICPQPRGLKKGGQHI 
W 


9863 


23764 


A 


9936 


173 


363 


PKKRPI ISLKRAFNSNPGD*QKFLKRGL 
PPMGSQFKRGNYPYDKNLEKLCPGGGNK 
GSG/YHYGL 


9864 


23765 


A 


9937 


360 


3 


NQKMRKNQSSKTGICKNQSACPPPKERS 
SSLAMEQSWMENVFDELREEGSR* *NYS 
ELKEEVRTHGKEVKNLEK/NliDEWLSRI 
S/N*EKSLKDLMELKPKAQELHGECPSL 
SSQFNQLARA 


9865 


23766 


A 


9938 


194 


370 


VSNFFFFFETEFCSFAQVGAQGQNFR*L 
KAPPP\NCCLFPT*ASRWGTTGTCHHS 
WL.I 


9866 


23767 A 


9939 


426 


1 


SQEFKVAVS YDLTMHSNL.SNSETLSQKE 
KTKQTKTKQKKTY* EGRKHI KRCPTPLI 
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amino acid 
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ding to 
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sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=Phenyialanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine V=Vnline 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \ = possible 
nucleotide insertion 














IREIQIQTTVRYHFTHIRI/ VLL>PQKQN 
KCWCRVEKLKPLVRC*WECETVQPLWIT 
VWW/FLKILNIKLPFDLATPWCIYQME 
FSC 


9867 


23768 


A 


9940 


255 


16 


RVIYNIYIMYILCIYTPSRLLVEIGFHH 
PPCVNIYIYLHIYLIYMSDGVKY 


9868 


23769 


A 


9941 


206 


355 


FLQGHTAIKNGRG/WPGAVAHTCNPSTL 
GG*GGRI I *GQEFKTSLTNMVEP 


9869 


23770 


A 


9942 


3 


402 


HEELNPRSATSSPALVLHVTIAHVTIAF 
AEGCFQHDFCFSQPENRHIPLVFTTASK 
VRN I FYLLCRGRG I LRFFLVCLKIDPAI 
PLMG I YPK\ KSLYHKDTYIHMF IAAEFT 
VAKIRYQSKCPSTDD* IKKICV 


9870 


23771 


A 


9943 


79 


328 


CIKILLKFLCLFCILPPCLNFFLRQSCS 
VARAGVQW \ SLQP * PPRXjKHTPTSAS * S 
AETRGTHHHAHL I F S YNFL * R * VSHVA 


9871 


23772 


A 


9944 


321 


414 


VPS WAG \ P VAHACK PS TLGGRGGR I T * G 


9872 


23773 


A 


9945 


142 


480 


QEAKSFDFSDSTTMEVMLDKKQIRVIFL 
FEFK\MGRKAAETTCNIENAFAPGAAND 
*QMRWWLEKFCKGEESLEDEECSGRPSE 
VDDDQLRAI IEADPLTTI QE VAKELS I D 


9873 


23774 


A 


9946 


191 


3 


FFLPRGGGDSVFKKPPKKKNGFPKIFPF " 

CPGFPLAT/INFPPQIFFFFFFF*DGVS 

LCHPGWNAV 


9874 


23775 


A 


9947 


451 


461 


K* INK* KVKGW/RKEHHANINQKKVDVT 
I FISDNMNFRKKNKITGDREGRYI I IKQ 
PIHQENIAILNVYVPNNRVAKYVK* KLI 
EliKKEIHKFTILVGDFNTILSTIDRTTR 


9875 


23776 


A 


9948 


186 


405 


NHTLAISLF*PYKP+AILKLGIYRKERK 
LVC*RDI CALMFITEIjVTAAKIjWNQPKC 
PSTDK/WKMWYICTMEYYS 


9876 


23777 


A 


9949 


18 


412 


PEFRITTILPEKKNLL*LTN*VSKFSGY 
NINI*TSVAFF\YSNNKISETGIKIVPF 
T I LSNRMKYLGM/TL.TKNMKDLHTENYK 
MLjRKENEEYQMGNMGSWIIjKINIVKISI 
LHKV I YRVK FLSNFNAI LLKK 


9877 


23778 


A 


9950 


1 


396 


LAFDGRAERLKTGFHRVTQDGLELLTS* 
S VRLGLLKC WDYWRE P / P VPGLWD 


9878 


23779 


A 


9951 


307 


405 


G I RVRR * PQGLRPPNPDALVFKKMVWPP 
PTN\PRGWGGKGWEPPQGPQDPPPQAAP 
GFLPETWRGPWNSLPTRGAWPPNSNLPP 
W P P APSRHPAQRAPGPS TAAP P PGGGGE 
GWDPPGRV 












402 


YGRPKEGGSIjRFWLTTPGPKGETPFFL 
KTQKITGGGGGGP* FPLPRRVRQENSFY 
SRGGG FN *PKRAPCALTWGKKQKF FLICK 
KKNTPQKPKKLiPGCRIQY*TQK/SSSLF 
MGGTPGKRQ 


9880 


23781 


A 


9953 


249 


1070 


SS FISRFYFI CIYLFETRSHSVTKARVQ 
CCD YN* LQPQPPGAQEILPLQPP * SAG\ 
ITGVAPNTPQLIFYFFVE/SGGSTHVAP 
RLALNSWAQNIAL/RLPKVLYFTF 


9881 


23782 


A 


9954 


75 


396 


GFKGRKRGLPLPIiNQKLKTLNLSEKGLS 
KANTGQKLGLL/RPVSQWNPNGKFLRE. . 
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ing to first 

amino acid 
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peptide 

sequence 


Predict- 
ed end 
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correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














IKSAF/PIEK*VI*KQNSLIPDMEEVLG 
V/WKTKKYSHNI PLGLSLTQKRVLTLF 


9882 


23783 


A 


9955 


181 


458 


RLWNDGLYNI FDLKLQ/S I HV* SLLLLF 
ILEPESHSVS\RLECSGAVTTCCSLNIP 
GWSDSPPQAFQVAGTTGVCPNAWVPSAF 

L 


9883 


23784 


A 


9956 


67 


430 


LAPRLIYSPIKFQWSFMSVRQIQVQSLA 
IGNFKNPRLPILICHCYAFHRILLFQIVIV 
RNFILYS ISISANKQESKCILKYILLYI 
NLRNWLGTVAHAYN / PSTLGV * GGR IS* 
AHEFETSQGN 


9884 


23785 


A 


9957 


77 


422 


LPLALWASCRPQLCLDTDFIHGPPTSGA 
ASQFLCFQTMQPPTCSSSRLYNL*PALV 
LDHTTSNWCFRSP/SFFFLLPLPLPPA 
PLP\FFFFFFFDRVLLCHPGWSAVARFQ 
LTAAS 


9885 


23786 


A 


9958 


3 


422 


LHSS LVTEQDS I S KKKKKKKI FLGGTKK 
GGPS LGG FFFG PPG FPK I VKKKRGAL I G 
FPFFCPKKKPKKPLFFPKPLGKPPRKKG 
F*IGEPWKT/EGPGFPKKLNFVFFSPWG 
GNFLFPPQKPGGKVGKILLQPKKKPPGK 
G 


9886 


23787 


A 


9959 


370 


2 


PFSSWVANFREGDGEKCSVLQGXRFCMF 
CRYINVTPLVHFPFPISKGTSFFPLYFF 
SPEVM/HFP*LPY*FGWALLSRF*/MSL 
L*LFFSKFPFFPSFFFF*RQFHSSPRLK 
CNGAI SAHCNLCLP 


9887 


23788 


A 


9960 


256 




AWEEETAPLHSHCLGNQARPWNHRHAP \ 
P * LI VF VFS VETGFHHVAQAVLELLASS 
DPPTRFNLLSSWDYRGLSPQPMSRKSAQ 
EG 


9888 


23789 


A 


9961 


269 


3 


HRQACGQVRSPIQCWWACKIAQPSLQIV 
RSSLKG*I*SPYDPVIPLLVIYPRELKT 
\SLHTKTCT*RFTAALFIIARRWKQLKC 
PSIDE 


9889 


23790 


A 


9962 


268 


324 


SGTLSH/WrFS*VGIEI TMWLKVPDQTK 
KISLRSAEAAIKYFLTQATASI I IRRAI 
LFNNRLS EQSS I TNTTNQYSSLI I IMAI 
AI KVGMAPFHF * VP EVTQGS PVTSGLL V 
L 


9890 


23791 


A 


9963 


341 


2 


ITFLPTKI IRQSNFKTGKRKTDSNDLNR 
HFKREDTQMANKYM * F I * KMQ I \ KTTMR 
YYFISKRTTISRKTSNIQC*DTWSPGTF 
INCLLECKMVRLLWKMTWQFLTEVHIHL 
VY 


9891 


23792 


A 


9964 


250 


2 


YHPVNLYFKYISPIDV*FGCFIYIFDYP 
LCSCFSLHPLVYLQ*L/LFFFFEAESHC 
VARLECSGWSAHCNLRLPGSSDSPAS 


9892 


23793 


A 


9965 


3 


334 


RTLRHHIWLIDPSFE*HLSCFHLLAIVN 
SAAINIPC/HSYV*THF*YS*MYRGRTA 
GS YAT / SMLNLLRNCS TLLHRNCTF * P F 
YH/WE*VPISPQSCQKLDFFIFLIIAIS 
EG 


9893 


23794 


A 


9966 


179 


352 


NQE * TE I LNRPMSNQ IESVI KYLPTRKS 
HRPHRFTAELYQTY\ I LLKLFQKVEGEG 
LL 


9894 


23795 


A 


9967 


2 


350 


THPS INS FIHVFIHASMNSL IHLPTHSF 
SIYLFTKSFIHLFIHFFSFLFFLRPSFT 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny)alanine, 
G=Glycine, H=Histidine, I=Jsoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PVAIYSFIHPFTYSSIH*FIYPCVYSCI 
DEPTHS FTHAF I QHLFIH* I I/IFIYSF 
ISFLFF 


9895 


23796 


A 


9968 


228 


558 


CHQLRQELAI FTSFVILQLFSGHLDVYM 
QAWAQRPDKYEYDNK*FIEIKKIIQFTIj 
ISKRMK/YVGINLTR* VKDLHNENYKTL 
MKEIEEDTSEWKDISCSWNGRINSVKT 


9896 


23797 




9969 


196 


2 


DRKVFFRPLFI I FGLKKKIFPLFWFFPS 
/IRGRG*IFFFFFFFFETESPSPRLECS 
ETISCHCTP 


9897 


23798 


A 


9970 


245 


1 


TQCDMCVFIYTHI YISHICYMVYIYIHT 
RTHTGILFTPKREKNSNTCTCATKDNP/ 
CMTLC * / HKRRQTHTHTHTHTHTHTHT 


9898 


23799 


A 


9971 


163 


380 


NHRRQKKNGR/HNGNKYKTVTKMINSNP 
P ITRNNLNVNGLKTP I KRQKLSDS I KKQ 
YPSIYCL*KT/YLKYKD 


9899 


23800 


A 


9972 


328 


1 


AKCG FL, KKLN I ELFNS I PGHS CKT * TQM 
FAAALiY 1 1 S KK YKQPS CPSNDE * MNE I W 
HILTIGYSAVKKISWG\RWMNLENIARL 
S GRQRGH I LQDS VDM KS PE * A I PQ 


9900 


23801 


A 


9973 


200 


3 


MASKCSSERKSHR/SITHNQKLEMIKLS 
EEGMSKAKIG*KLGFLCQ\VIHWTAKG 
KFLKEIKSATP 


9901 


23802 


A 


9974 


111 


517 


NLEQPIjIKRTIANKPIKKINTTILIITI 
NVNGLNTLTK/ RQRWSD * I FKNQDPTM I 
/C*KKN/HCKYKDTNTLKIKG*KNIYDA 
NMLTQ I KKKAG VAI L I / DKI D FRA/ TD I 
TRAKEGHFIM I KGS VHQEYVTI LNVSAH 
AS 


9902 


23803 


A 


9975 


342 


3 


FLATKKIFFFFPVFPPQGPPPVLVAPFF 
WAFGGGPPKWAPKKKFFGPRGPPFENFF 
F*GFFFFKTGGFFFLRFSKKVFFFFFFF 
SETES CS VA\RLECSGT I SAYCNLCLPG 
SR 


9903 


23804 


A 


9976 


13 


326 


ILDHSNRPSSSTL.KTPNAGKDVEQKQFL 
/LCCW * ECK I VNYLGR * L/ WQFLTKLNLi 
Li*PYDPEIIjLLGIYQKELKTCIHTVACT 
* M F I EALFV I AKT * KQ PR YS S VG 


9904 


23805 


A 


9977 


74 


396 


LAPRTQPLPCSPAPLRQCHTQSLLPPFL 
LRLIFYFFCFLKTRAHSVAQAGAQ*RDH 
GSLQT*TPTLACTPASA/VARTTGTHHH 
AWLIVFFL/IEKVSHSLVCDKT 


9905 


23806 




9978 


287 


2 


QPHNSSYTQIFTPKIIIKTPPPNLKQNN 
TTPIKKKTQ*FP/PSTNP/SYPSTHPPT 
HLFIYPLIHISPPPSPTHPRIHPPTYLS 
IHPSTHPSIHPPSR 


9906 


23807 


A 


9979 


97 


332 


GNNDLFLYFFFFLETGSHSVAQARVQWC 
DHSSL*PQPPGSSGPPISTC/RLGLQAC 
MCHHAWLiIFNFFCKGRLSLFGLLR 








9980 


168 


2 


STWLRWDYWKTFFKTSKS/WPGTVALTY 
NPYTLGGQGRHIT*GQEFKTSLANTVKS 


9908 


23809 


A 


9981 


267 


1 


LKRERR * RRAN/ S KTSRRKE I I KI RAE I 
TATENRKPIEK/SNKTNG* FFEKTDK/ I 
DNPLERLRKKERD* I TKVRNGRED I L * N 
NDLIKVKKD 


9909 


23810 


A 


9982 


230 


412 


CHIYLC/D*PIDGIV*CHIYIYFGHKHT 
HTYTHTHTHTNTHPTC*VSKILLKKKRR 
TCSVCA ■ 
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residue of 
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Predict- 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenylalanine, 
G = Glycine, H = Histidine, I— Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib)e 
nucleotide deletion, \=possible 
nucleotide insertion 


9910 


23811 


A 


9983 


229 


415 


SYVFCHIRFCPLDSLNKEIVA/SQVQWC 
VSWSTT+VAEAGGSLEPRSLRPDWASQ 
RDPISKN 


9911 


23812 


A 


9984 


2 


288 


RG YDS T S S AYT VGH FT * ED KAT I TRL S G 
KENVKHAGGETLRRLLDVYP * TQRI IDS 
VGNLNSA\SAMM\GPPRLPKVLGLQAWT 
TMPGP*JCYIHIFF 


9912 


23813 


A 


9985 


246 


3 


FLLDYQYCALCCHCFLFNFANCGHFN I L 
LGG VKHTKFP KYS + KFS VL / WLDAVAHS 
CNSSTLSGLGWITRGQEFETSLANMVK 


9913 


23814 


A 


9986 


342 


1 


FLNFGLTGFFPKGPPKPLGLRDGALPLD 
PPQFLGVFKRGAFFLGTQ* FHF\LEGFK 
FWFWEPKAFFFFF/CCETISAHCNLCLP 
GSSDSAASASWIAGITGAHHYARLSRRL 
RQQN 






A 








NRPLNNLVMAS KCS SERKGYTS FVLSRK 
/LRKMIKLSEESMLTVTTG* KLGLLCQT 
/VQVNTKEKFLME I KSATPGNKGTVTKS 
NTLTADREKV* VIWIKNKTNHKIPLSQ 


9915 


23816 


A 


9988 


209 


2 


IKGSHFPVXil * YRKI I II FGGVKGKGGF 
YSY\LFFFFEKGYHSVTQLKCSGMILAP 
CNLCLPGS*DSPTS 


9916 


23817 


A 


9989 


256 


357 


QHRWMNTEMLNYLC* IIYVHF/HTHTHT 
HTHTHTHTHTHTHT 


9917 


23818 


A 


9990 


263 


1 


IQIRVFENSHLWPSYVHFPHGHSYLFCS 
L**AFFRH\G*FVLSVHFLKIFLNLKKI 
FFETG \ SHS VAQACS GM I S AHCNLCLLG 
SSDR 


9918 


23819 


A 


9991 


3 


368 


SLDPRRSRLHLAKIVPPRSSLGDRARPC 
LLKKLELFFETGFCCVAQPGVRWCSHGP 
LQPRI PG\SSNPLTSAT* VAG\ I AGMCY 
HTQLQLAF 


9919 


23820 


A 


9992 


294 


391 


SGTVAHACNSSTLGGRGGWIT*GQ/ESQ 
TSLTNM 


9920 


23821 


A 


9993 


317 


3 


TDKEL.LLRDEQRKCFVEMEFS PGKHAMS 
NVDMTKDLEYS INLVDKAAVELERADSN 
FERS S TLGKMLSNS I S CYKE I FLERKS Q 
LM*QIPLLLM/FKKLPQPP*PSA 


9921 


23822 


A 


9994 


193 


395 


I FW ITCCSSLS VLAAS PCTFMLWRQLLS 
LNLMNQ * DHS LS QS L IQS KS ATLFNSVT . 
AERGDESAEEKLEASRSWFMRFKERSCL 
HNIKVQGEAARTDR/EAAASYPENLR/V 
KDEGGYTK*RIFWDEIAFCWKTMPSKT 
F I ARE KS I PGF KVS KDRMT ARCGR S CL 


9922 


23823 


A 


9995 


82 


420 


SFLWKLCPRGAAAYVRCVSA\LLEDVSQ 
LGYTGVRDPIEE AVS PFSELKRCRERTT 
AVFTAVRQGHLSLQK\ FLLPFVQL.CPAH 
RGGV*RQ*AXiLCCGGLSPVQAPLPLCXiP 
TQ 


9923 




A 


9996 


376 


1 


CGRTDAPASTSPYAMTVRFLRPPQPCVL 
DGLHNCPARAPQEQNSLGEVDKRGPREQ 
TR\PATAAPPRPLGSPSWICPGDW**CR 
NSL,QGFSIAASVARGRDWALAGLPATVS 
ARFQEQQKMNTV 


9924 


23825 


A 


9997 


495 


982 


VQKFLRPNLAPKKHQRKIiAPNSLQGRIoR 
S LPS PTVWCTMAP PTGVbSS LLLLVT I A 
VCLWRMHS WQKNH W * AS FKKTHD * TGLA 
EP\SCARKQCSEGRTYSNAVISPNIjETT 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, K=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RIMRVSHTFPWDCTAACCDLSSCDLAW 
/WFEGRCYLVSCPHKENCEPKKMG 


9925 


23826 


A 


9998 


401 


341 


IASKCSTERV/SRSHTLKQKLERFKL*E 
KGM / S KNE I GQK * ALLY Q/ VS QWNAKE 
KFL KE I KVD \ LHAQM IRK* NS LTAD VKK 
VLWKIDQTL/PLGHSVIQSKALTLFSS 
MKAQ*GEEAA*EKS 


9926 


23827 


A 


9999 


30 


393 


DTTILKLRQLIIGAVATKHWSERKSLTS 
LNLNQKLGTIKLTEEGMLKAKKGQKRGL 
S CQTVGQVMN/SKEKFLKE I KHAI PGNT 
*MISK*NSL/IIADMEKVLVG/WSKVIF 
*DQTSHNIP*SQN 


9927 


23828 


A 


10000 


133 


1420 


EQRRLPVTPGVASLQTLCCFLPRRGCSH 
G * ESAATHPPGPELVLLLLQGH * AGYLG 
/ DFG / PRTATGS PG \ PAGVQ PGP CLVPQ 
LSRAAAAG*PDPT* *GAIFSYHCKALRA 
TAARHCPPGC*WPG\PAGSPPGTSRPLS 
HPSSPS PPWTGVSSFLLNIATLCP * ASDA 
QGHACYPGPVPALAGQGHDSAGGRLGDP 
GHHWQQAAL.HG / DQGP PGHQAHAGTQPS 
AQGLGC*ATHPGEGG*VGYQGL.PAG*GR 
SHPRFPIVTRPMGRSCPVTATVRPGPGSQ 
GQTPPGL V\ PRG EAGQNPGGAAPG LSCI 
QEVTGRWWW*GKKPLGHYPPCGAVRCPS 
S*PHPDTMFKGTGREGCLPPTSPVGVTG 
QMS \ EGSRP CEWTLTMS P WGE * S PRASC 
AMLHFCPGSRVGGYHGCPPLHHMGPQST 
AGQAGQPHTLDTKAS 


9928 


23829 


A 


10001 


1818 


6682 


I KF P E AE E E E I QE V LVLLFR VAE E KENP 
SRVGNSQREYENQWLYSICNQLRYRNN 
L/RHVKKDERGYYEELLNYSRDHLMLYP 
YRLLD I MVKGFS ITPFS YYTGI MENI MN 
SGHNFTAADCLRLLGIGRNQYIDLMIQC 
RS S KKF FRRKTAGDLLP I KP VE I A I EAW 
WWQAGYITEDDI K I CTWPEKCATDKTV 
DSGPQLSGSLDYNWHSLYKKGFIYLDV 
PMSDDS CI AVAPLEGFVMNRVIi/NFDYF 
ETLLYKIFVSVDEHTNVAEIANVLEIDL 
SLVKNAVSMYCRLGFAHKKGQVINLDQL 
HSSWKNVPSINRLKSTLDP*KMLIiSWGG 
GESRRPVQEASSATDTDTNSQEDPADTA 
SVRSLS LS AGHTKH IAFLFDSTLTAFLM 
MGNL S P VQ S TGEGE AQR Y FDHALTLRNT 
ILFLRHNKDLVAQTAQPDQPNYGFPLDL 
LRCESLLGLDPATCSRVLNKNYTLLVSM 
APLTNE IRPVSSCTPQHIGPAI PEVSSV 
WFKLYIYHVTGQGPPSLLLSKGTRLRKL 
PDIFQSY\DRLLITSSG\HDPGWPTSH 
VLTMLNDALTHSAVLIQGHGLHGIGETV 
HVPFPFDETELQEDSC\NMGVHKALQIL 
RNRVDLQHLCGYVTMLNASSQLANRKLS 
DASDERGKPDLASGSDV\NGSTESFEMV 
IEEATIDSATKQTSGATTEADWVPLELC 
FGIPLFSSELNRKVYRKIATHGL\*EKR 
AFKNLLHSSRKLSLQVliNFVHSFQEGAS 
TLD I HTE PS FSSLLSQS S FADMGVPLPA 
KKFN/SLKIVSY 


9929 


23830 | A | 10002 


439 | 1135 


MAVDFYNFVTKLWTTGYLRISFLAYKF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, II=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GJutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Srop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FSFPFLDSLS/LLSPRLECSGVIPASLQ 
PLLPGFKRFL/CI>SLPSS*GYRRAPPPC 
PALLYF*VETGFHHVGQAGLELLTSGKL 
TRLGLPKCWDYR 


9930 


23831 


A 


10003 


216 


572 


LTLVS I FTAKQS VTLP 1 1 CMLEMRKWLL, 
NDTSAF / P 1 1 * KVLFPKI FFFFFF * KGV 
LFCPPGGREGAQFGLTKPSPSGL.KEIFL 
PNPPGGWKQRRVPPFAGYFCFFIKRRVS 
LLWRGLPN 


9931 


23832 


A 


10004 


3 


354 


YSNN P KDRKGE TGI NE KTNN KM VDLS S Y 
TSVITLNF*ML*/D/KIN*KFVKSP+KK 
PTIiCCLLETYFKYGIGGLKVKRWKKI * H 
ANTIGKKTLVAILITKQTSELS*YPNKE 
EHYILIKE 


9932 


23833 


A 


10005 


187 


455 


ASIHYSLGVRICIYIPGCIHPGEKSQKC 
NTCGKNSTQKPELVN/HR* LKGHRCMK* 
NEGGKTFTRVQSIj/ ACHKRVYTERKSYK 
C 


9933 


23834 


A 


10006 


134 


456 


IWNLPLGKPCAWKQVTSDLISPDAIFIN 
AAIHSLSFFFFFFETKFLFVPQVGGPWH 
NLN*LiKLiLPPKL\SDFLVA* VSKKVGIT 
GPPHHPGLVIWGFKKKGGSPMLPR 


9934 


23835 


A 


10007 


210 


451 


C YYTIQTTEQPANDSMLG I KPHVS ILTIj 
NVNNLNTPPKRLGVADWIK/TQDPAFCC 
LQETSLTCNRTHRFKVKG * /WKKIYH 


9935 


23836 


A 


10008 


169 


462 


YPCWVDMAKNTSYNKCWQGCGEIRILPH 
CRWEYKMASPLWK/ SQFLNK*NMELP*D 
LPIPLLDIDPKEWKTGVQTKTCR/RMFI 
AALFPTSRS 


9936 


23837 


A 


10009 


490 


724 


EMP P * FMEKNFLAPGG\ VTP PGVNTTRV 
NPKTAFDPKRSSSPFLEKSQKPGLIEGL 
TKLKTL I LNPLKKDE5GE I PVLF 


9937 


23838 


A 


10010 


306 


588 


KQF I RAF I EMS W WLT P VI P VLWDTKADG 
SGIRDQPSQHGETPSLLKIEJCLAGHGGA 
GL* /SQLLERLRQENHLNPGGGGCSEPR 
SCYCIAAWVTE 


9938 


23839 


A 


10011 


473 


53 


ISLGQDSTDISTINIPRMAFKLSSKRKS 
HMS F S LNQK/ li/EMIRLS/E E AC * KPRS 
QKLGLLS QS * P S CE S KFLKETKS ATP VN 
TQMI*K*NSLITNK\LKEVLMVWIKDQI 
KHTISLRQSVI * CKALPLFNSMKAERGE 
ESAKC 


9939 


23840 


A 


10012 


184 


455 


FWLSLIKKPLSSVTQKKINRETN ILLTT 
KKKKKDCNFLEGGLIilGGSVCRLVYRHY 
ATLYS VFC VDSS KS ERGSVDPTQVFVET 
*DKCFDY 


9940 


23841 


A 


10013 


203 


453 


KEELYISRERKSSTSLLLNQSQK*LIKL 
SEQAMPNANIGDRLKARPLSPNSHLilKT 
KQKFLKEV / KCATP 


9941 


23842 


A 


10014 


237 


483 


TLIRYIILQGLFSYIPPFFFETESRSVA 
QAGVQW * DLGSLVPGSRHS PSSASQVAG 
TTGFHY/HAWLI FCLFCATGS 


9942 


23843 


A 


10015 


157 


875 


DDPVRGRGEESAMPSGGRRNRSVSSSWV 
GSMAGITTIEAVKRKIQVLQHQADDAEE 
* AEHLQ * EAEGKRWAWEQAEAEVAS VNG 
RIQLVEEELDCAQECLATALQKLEEAGK 
AADESERDTKVIE I WALKD/ E/E KMELQ 
EIQLKEAKHIADEADGKYEEVACKLVI I 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














EGDMGCTEERAELAESRC*EMDEQIRLM 
DQNLKCLSAAEEKYSQKEA* CEKERKIL 
TDNLNKSPMR 


9943 


23844 


A 


10016 


579 


30 


CG I STKKNKWM I KQNS SELSTCVHNQ I T 
LHK I AMATQWG \ KNS FFNK * C / W / EN * V 
ST/CKKIKLSPSVTCTI*KQYLKWIKYL 
N I K I * QTS * I S \NGGKYDI GLGTNFLLQ 
KTKNLMS FS * L* NQMHKEQI I EKWDY I K 
LQNFCTSKEKFSGVKMSPKKW/DEIFEN 
HIFDRS*FSQYINNV*NSTKSE 


9944 


23845 


A 


10017 


44 


228 


EF VCR VRGLWW \ CWLflS QLL WEAEAGG 
SLEPKS1>RLQ+AMIM\DCTPPW 


9945 


23846 


A 


10018 


471 


40 


PGEGRFQQTRFPPFLPPWGPKGGPFFKK " 
PKKKKRKEM IKLS * KPRQANQ * LGLLCQ 
/TSQWNAKEKFLKEIKNATPVNTQRVR 
KLNSL, I AD I ETVLWGTEDQTSHN I PFG 
QS P I Q INI LAVFHS MKAEQS KE ATE EKF 
EARS 


9946 


23847 


A 


10019 


217 


486 


KKIS* PVYLSLYLS I YLSVCLS VYYLS I 
YLSIYLSIYLCI YLSTIIYYYLSIYLPV 
YLS INLLS I/CLSPI YLLT YHIiSS YLLS 
YLPTYLP 


9947 


23848 


A 


10020 


327 


1245 


TENQQGVLALAALS KIRMMPTRLKWFLN 
ID YTKFCEAFHV FKKRKKKK I NKN I KQL 
KMPLTKRI YLAFTEYSTQ*QQNTH\FFIi 
SADETHGKIE/HVLGPKTRHNKFKR 


9948 


23849 


A 


10021 


329 


490 


GESPTDNHCRSPKSQVEKVIYTFFETES 
RSVTPAGV*WCNLGSLQPPPP\GTSD 


9949 


23850 


A 


10022 




394 


YRVLEVXGYMLNIRFVVLGCSSCGQAAQ 
FREG SRCTTACRH PAHTLHAAKMSRRKV 
ssakcr*retpkkrsarwsakpap\qse 
TKPKKAAGKDKSSTSS\FSSSSSIjKAEK 
GGKE KQAEVADQETKDLS AE 


9950 


23851 


A 


10023 


448 


477 


SQVFETSLAS*MSKNIL/WAGTGAHTCN 
PSTLRGQGGQIT*SQVFETSLAS 


9951 


23852 


A 


10024 


198 


455 


SLKTNRREIPEDFHGQPCGKLVHSEKPR 
VRMGQTESKYASYLSFIKVILRRGGVRA 
S /TRNL IMLFQTIEQ FCLWF P E *GTLDL 
KDW 


9952 


23853 


A 


10025 


469 


26 


PPRAVQPPFLPSSSLHTCVPLCRTVFSA 
TSR PDSLLiPAIjQVS AKMS P VL* S SLLLP 
SLGQGFPMASIAPRL/PHRSLRSCWAA 
AHSRAELDRHEAFLQCSLGQVJPRNDFLF 
S FFRDRVLLCHPG W S TAA* S * LTAASNS 
GAQVMEFQ 


9953 


23854 


A 


10026 


222 


489 


KKKKKKKKFCFIARVKSNGAISAPCNLC 
LSGSNYFPASTSIiVSGNTGACPLARVNF 
FY*NL\FLVKMGFHHIGQESI 


9954 


23855 


A 


10027 


313 


2 


QDWEIS KLYSYCHS * VRRL /iFLKIRLS 
LYYILQHSLNCNI IG*LQFESCYIKLAG 
DKIAFRFLTLKTNHQLGLVAHAYNSSTL 
GGRDGW I T* GQEFKPS LANM 


9955 


23856 


A 


10028 


49 


388 


TPKIRGVFPFKGWFLGVFGAFPFPQKDI 
FYFPRTPMGTRLVS PGKKKKKE V* RVLY 
GLKKAKKI FVLKVKFSHKRGFFGEI QFK 
KLFKW/HLKRNPPPKKKKR 


9956 


23857 


A 


10029 


394 


231 


GRQ I AS AQE FKTS WGNMAKliCLF I *RDR 
VSPCCPSWS*IPELKQSACLSLPKY*DY 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=l_,ysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RC/RNTTPGL*HLLFRCLLI ILPSYLFF 
CKS FKLLYMD I 


9957 


23858 


A 


10030 


265 


2 


HDNMRGSDYFPASNGLAVPFKRQVFTIA 
SN FL E KS P PRLAFT * LQVD * E RK WAG \ P 
VAHT CN PS TLGG QGG W I TRG QE F KTS IiA 
NMPKP 


9958 


23859 


A 


10031 


235 


413 


KAGKLFFFTEITFKKGKKAGGGGSFLKP 
HLFGRPGGGNHRVR IL* KRGFYKMGRW 
LKSLPCDSPPLASQKGGVSRKSPPPRPF 
CPF * RLFP * KKKAFRLFI PKKV+ * ALV/ 
SLPFIGEGKTLFFFF*DRALLCRPGWNA 
VAPSGLKLSSCLSLLRS*DYKHVPPCLA 
MF*KFF/CRN 


9959 


23860 


A 


10032 


167 


424 


GREFDLRMGHMVKPPLSFFFFPFSETMS 
LSVTQAG VQW * D I TALS AS WTP VI SDS P 
VS AS RVAG \ IGTTG VRHHTQL I FL 


9960 


23861 


A 


10033 


252 


3 


VEGCLSSGVQDQPEQHSETPSLQKKIFF 
\*LAGHGSMCL*SQIjLiGRLRWEDHSSPG 
G*SCSEP*SCHCSPAWATE*DPISKKKK 


9961 


23862 


A 


10034 


175 


492 


CPTCPFVWIPLCSEQPVQLYMWHTHTH 
THTH THTHTH THA / HLL F S LS F S FLR QG 
LSLSPTLWRSGMI/ CGSTQSQSPRLKQF 
CSPRYSRG*GWRMA* /DPGGRGCSE 


9962 


23863 


A 


10035 


200 


448 


LYLHLVQLSPLDFPTPLFRVECPVSLSL 
CLS FS * FP * VCFF FLVS LGLF * GRVLLC 
RPGWSAWQSQLTAD/SDFPVLKHSSCL 


9963 


23864 


A 


10036 


229 


463 


MILGISKISFSLAITLANFPPTLNLFIF 
LDTGS CS VAQADVQWHNQGS L * /SELLG 
SSDPPTSAS 


9964 


23865 


A 


10037 


164 


422 


GLSSQLTDGHLLALLRVGGSEVGSREES 
EREEAGREREGERRDSRGG*REREREES 
GAREEREKKEKRER/ERDREEK 


9965 


23866 


A 


10038 


1 


490 


PNQIQNAVLIMISFSICFVSVIFFKSEI 
YNGQPKFKSFFFF * DESCSVAQAGVQ/W 
CG/LQSQQPRPPG\SSNP\PTSASGVAG 
ST 


9966 


23867 


A 


10039 


255 


446 


FFLRQSCYVAQARVQ/ YAIHRCSHSTLQ 
LiPALASSNPSALIiAPRVAGTAGMQHHNQ 
L*YFFRIHS 


9967 


23868 


A 


10040 


157 


390 


DHTCPP P S TAD YRLS * FL YFLLLLFRRS 
LALPPMLECSGAISAHCTLCVQETRETD 
AAHFKAARTIRAPHPAQLTIAFQVQTHV 
YQDGH/DGSWWCDLRTHPPRS 








10041 




30 


TLKKRKE FSSVLCPRGYIVLLLLVHQFL 
FFQTGSCSVTQAGMQ/W/CDQSSLQP*T 
PG\SNNPPVSASQVAGPTGMHHGFLHTE 
YESRGPP 














NHRYISYLTRLVKIKNFANKSWLESLSP 
TLKHS */W*CK*V*S LWKTVWHYLVNL \ 
KDDLT FDPATPFLMIYLI EMCAQ VHPE I 
YARMF I AVLFE I FKHQ \NNPNKLWYI HT 
RESYKTIKIKE 


9970 


23871 


A 


10043 


457 


724 


H FGR P RQ VD CLS S G VQD QPGQHGE VQSV 
LKL\KKLAGHGGAHL*SQLLWRLRHENH 
LNLGGRGCSEPRWCHFTPAWTTEQDSV* 
MNNNNN 


9971 


23872 


A 


10044 


111 


429 


KPDEDTTGRENYRPVFFTNTDANLSKIK 
CMYPCISVH/HVNMYGKIPEMMFT*Y** 
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K=Lysine, L=JLeucinc, M=Methionine, 
N=Asparagine, P=Proline, 
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*LFLSFFFYJiMt;SGSVPRIjECSGVISAH 
CNLSLPASSNPPPGFPR 


9972 


23873 


A 


10045 


30 


220 


lgvlykknsvfnkcywdn*isiwkrm\g " 
ldpyltphtkinfkwikdlni itktli c 

CSWMKTKA 


9973 


23S74 


A 


10046 


203 


361 


tykllvifividhf/lfsflsfffrdrv 
sfllprlecsgaisthrnl*lpgssd 


9974 


23875 


A 


10047 


198 


358 


IQRKCFTFFFFFEV*SCFVAQAKVQYCG 
S PG \ SSNLPFS AS E VSKT I GWI LPS 


9975 


23876 


A 


10048 


270 


272 


RKNQRFFKIARKRLNKMTRISPLISIII 
LNVSGLNFPLKRYRLAEWTKKKK\DPII 
C CL / QKTHFAG / RD I YRLK I KG WKK I FH 
TNGSQ*QRSEFF 


9976 


23877 


A 


10050 


256 


1 


GVTTNLYFWKNFIjNP PFLVKKCENCP I N 
KVFP * KPPLLKKTLFFSKKKFCGV/HPT 
EKFFFFFFESLALSPRLQCNGTILAHCN 
LR 


9977 - 


23878 


A 


10051 


112 


359 


SKVSEEPTENEVEHNLKV*SFILPLKQY '" 
SM**QKKNIHIH\ISFYKK*ELTWPGMV 
AHACNPGRL.GTE AGRS PEGQE FETSLA 


9978 


23879 


A 


10052 


346 


2 


RVSSYTLITLNLNRLNVALKRYRRLNGF 
FFLKKP*PN\ICCIQKTNLIYKSTYSLK 
VKGWKNIH\HANGLKKQAEVAI \LFISD 
KKDFKSKIVKRDKEGHYVMTKGSIQQ*D 


9979 


23880 


A 


10053 


376 


6 


TRPSQHPDFILLNKCLLFEATP FLVI CY 
SGHRKLRPVAPPIRGSKSQI IPMPPAVL 
PRWHLS CLSFR I PS S FF* LRLPRLCKSR 
/ SAWGPVSPHLPQFLFFQAISFFFFL*D 
GILLCHPGWSAVA 














FLSFFVDGKIiCLEAKIWVLAPAVFIVTR 
VLLLLSTL/RQVELRNIYLCSKVHIHFH 
L YFC I YLSAYI LQTMS L Y * KL * YHSNV 


9981 


23882 


A 


10055 


251 


51 


ISLVNWLFILENMKLDPYLSPYTKINIR 
/ WQ*DLNVKNGTTKVIiEEN * GT I YVMVG 
WKRPILACQQPKS 


9982 


23883 


A 


10056 


169 


342 


SQKQYSTCQNVFCFVIiFLETRSHLV\TT 
LECSGAIMAYCRLDLPG*RHPPTSAS*V 
DG 


9983 


23884 


A 


10057 


1 


586 


AAARPAQGKAR PGL.LLRRGR WE LRAGF 

ltakgafr wwls qkhvkms yp pqsrcg c 
ggmgaagppsltvhqeLcgays pdgtte 
alplaswprarps pakapaydtaklpal 
igcgsrrppgvnpgasslkpgacvsega 
gptgtlesagsrpptplpppv/ccpgpp 
aradh*fchvtpqcr*/ppprpsp 


9984 


23885 


A 


10058 


393 


43 


HARPGAECERPAEGEAARADAFSGRHAC 
PHSLYCRL * RGF FVHKLHLNKA / RL I KR 
RKADNI KCWQRCGAPGTLMHC * WDYETM 
GLiLiEQTVFS FE I RS HCAAQAG VQRNNHG 
SLQEF 


9985 


23886 


A 


10059 


228 


466 


GLLHGPPIYDLPEPVSTTPSLYHPPIVP 
*ASPLGPSAILTLPFHSPPPA\PLFSPP 
HSVLGSSGPRSLPT/SPTHLPTLTPH 


9986 


23887 


A 


10060 


293 


9 


DILiQFYTFSPIQ\CPVYILjHL* YISLQR 
L*FHQKYLIGIYFIETWSCSVARLECSG 
RITPRCNLNFPGPTYPPTSASGVAGTTG 
.VCDFLGLPWSR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

locution 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=Phenylalanine, 
G~Glycine, H=Histidine, I=Isoleucine, 
J<=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R— Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possiblc 
nucleotide deletion, \=possible 
nucleotide insertion 


9987 


23888 


A 


10061 


199 


428 


taycrtlplkksnptisgsffff.letgp 
rsvpg * ragg\dlmahcs fdlpgsrds p 
asprqwgatgpchhpwlife 


9988 


23889 


A 


10062 


191 


2281 


LQTRGKLFKYKEKGICTKLHRSEIIIGK 
Q WKE AVCP PTGQAQSQGGLD PKS REALR 
GGHGF VPGGG VCGHQEL* LHS DGLKAVH 
PAGSEPSDPGGVQAAACQDAATGGTPLQ 
PMPETARGC/TATASSREAGPGMDP/RE 
A/GPEAWQLGKARSSWGGSSTSGGPAWV 
FIRDTPSWWGIPGTVSPPAGPRCPPPTR 
PQWEQLGSGRPG * MQTTAEETNAAPRAG 
QERTPSRGPSADPGVSQHVLSHRRSCPK 
CERRSVPPNG\AGPTLWTELHPSDASVS 
ALTPL *GDVEVGPYG\CEGYI KPGRRTH 
QHNWLDliDLRPAAS RTCRKMLELPRV * F 
LLRLPELAETYSSTPPACAQRPRGTGGG 
/ PPQRCVLMSGSHQGGAS * KGRDAPSLG 
SGLERPREGGPGWLS/RQPSPHAVQPCK 
HMLCLGPAF+RGRGARGVPVPVRASHFP 
FHGAAWWPGDHMSQP*WQMLSRPARGHR 

AFLPG*GVS/EENGQIPNVSTQLYLQNL 
PLPREQN*RRFPRDLSSGGAGPGTG*CG 
GGGGHALSPTPSRRLRAHSKDNSWAQPL 
CCWM PAASGC \ PG I GCNGVPP VAAS + QA 
AA*TPPWSEGSLPAG\*QPSAHRCVTRV 
PGGR I PHLLAQAVP PDDRAFPNCQASG Q 
LLGVHSRPSLSAAGCRQPLAVSGIGCNG 
VPPVAAS 


9989 


23890 


A 


10063 


65 


453 


ISREFLiFIjETKSIjYIMIjKGSIQQSDIIF 
VNIYTLNTGAPRY/IK*ILLELKRQIES 
DTVIPEDDNTPLSALNRPSRQK/VDLIC 
T I EQMDL I D I YRTFH P TAS EYL FFSAHG 
S FSKKYYML.GHKI CLETFLN 


9990 


23891 


A 


10064 


100 


466 


LGILPLQSRPSGPGGQCTPTLPFWVSLP 
LPVRGCFLHLCWCLPYSGALGSPCHTGL 
VCWHIPCF*CCGVSP*R\SCI,EVHI J YTH 
THTHTHTHTHTRFVLM*MRTPLRKGKRQ 
VMRLCGGGGRA 


9991 


23892 


A 


10065 


226 


377 


GLAGSPCISGSVSFCSGCIY\RGHGIMF 
I CNDCKVFRFCKS K*NCFKEHNP 


9992 


23893 


A 


10066 


292 


458 


KVSS I CLKMSFLFLFFFRTGSHCH * KVS 
S ICLKMS FLFLFFFRTGSH/ SS PRLECS 
GMI IAHCSLKLLDS SDLTASAS * VAG 














TGFNPXjHFIYYYYYYYY\FESKSCSHAQ 
AGVQCCDHGSLKPQP\PSSSHPPTSAP* 
VAGS RNAHQANSTAWPMVS 














G I LKNCHNLMI KLERMRSCVLMGEQRK / 
WIY*DESTPGEDTVNTVEITINDLEYFI 
NLVEKTVA/GVETVDSNFERGPTVGKM\ 
I VCYRE I FHEKKS PSMQQlA YF * KFPQP 
LQPSAATTLVS *QPS \ SKQDPLSARRLQ 
FTAGSD 


9995 


23896 


A 


10069 


263 


435 


HFSFLSFFF*KTESHSVT/RVEGRGGIS 
VDCKLCRPGSRDSFASASRVPGTKGISQ 
GRG 


9996 


23897 


A 


10070 


116 


474 


RGAGAWGVSGRSSPPLSALPIQPPPHLS 
TPPPLETP/PPVLH*PLKDLGGWGALLA 
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JSDOCtD: <WO C 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ED 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 
peptide 


Predict- 
ed end 

location 
correspon 

last°nniino 
acid 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Lcucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q = Glutamine, R=Arginine, S = Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














Q*RLPG*HSPKKKKKKKKAPRPPYK*GP 
PKTPVTJPPDGPFLDPPLFGVJPPPP\VFIj 
PGERPPPL 


9997 


23898 


A 


10071 


159 


377 


KSHMPLTLNQKLEMIKLSEE/GLMSTAE 
IGQKL/GLLLPNSQVVNAKEKFLKEMKS 
TSPGNT * TI I KWPGAVADA 














AIEIQiiLiSRPLCLWPSGRFPDAQVANIR 
VKVAV*GNPLR*VGPAEPGEQRPSPWGF 
PSIS/WDISPTWSSSASPPGLSADCKFT 
CHYRCRALVCLDCCG 


9999 


23900 


A 


10073 


98 


440 


GQLNKLSGPYPENVGYTLPVLINPliAQP 
VTYSTIFAGTLITALSSH* FFT*VGLEI 
NMXAF I PVLTKK\ INPRSTEAAI KYF3JT 
QATAS I I LLIAI LFNNILSGQKKKKKGR 
PF 














PCLYTYHLHNSNST*SLRLVSIMrEIVT 
SNLHRTVPHSYTPPTQLS INLVMAIHLS 
AGAWI ISYPSMIKNAL.VHFLPQGSPTPL 
I PILVI IETI S LL I QXPRPLiAVRLTGN I 
TAGHLLMHLIGSATLAISTINLASTLI I 
FTILILLT 


10001 


23902 


A 


10076 


3 


419 


KWRKKMWCLQKMEYYSS*KGKKLLSKVT 
IWMDPEEVMLS/EIS/SVTEGQILFNST 
YV 


10002 


23903 


A 


10077 


335 


414 


DKSPF/D*RFFLPPNSNGFLVKMPPSVK 
RLPFFFIFPFSFFFFETESRSVT\RLEC 
SGTGIiAHCNLRLPGSRDS 














KijTQLPFIQFFFETGSCSVPEAGVQWCD 
HSSLQPPPP\GSSGSSPLSL*VAGTIGM 
IilFKIFSRNPLNFFFFLRWSLALSPDWS 
AVARS WFT 














KSSSLFQNHLWANSSTYSHLIPTLGYIL 
NHGNQFDPDTL/RDKM*NFFCTIVCPHY 
ELPSLEQW/ VSLGS LNYDT I LPLDLFCK 
RQGKWSEIPYAQDFMTLYQNLTICQT /P 
QNPPPPPKESSKVELDIIDDPPH 


10005 


23906 


A 


10080 


281 




IFLGEWGPFFPPQKKSFFPKIPQWVFFT 
PPYRKKIFFFLPR*NWA/HPKIFFKRPP 
PFFFFFFFFFFFGFKENCFILLPPKEHI 


10006 


23907 


A 


10081 


207 


35 


QENRRC*RGRGES/GTLIHGWRECQTVQ 
PLWKTVWQLLKRLNTEFPYDLAI LLLGE 


10007 


23908 


A 


10082 


2 


408 


IAPLHSSLGDRARLRLKKKKKKGKIRPL 
IALFFKNQNPS FKKNSHSFYGALMEP \ P 
PPHIPPFFFWKVQPPLiFPFRGTLiEFPGP 
IGTCKLQTLGSPG*WDPPCPLPHQSPGR 
GGLIPGRKGEDPPSFRGLPPPHF 


10008 


23909 


A 


10083 


195 


2 


ECKQRPQIiEMVSFRRRSHEPFFFETGPR 
SVAQAGMQWH/DLSSLKPPAPG\SGDLP 
TSAS*VAGTI 


10009 


23910 


A 


10084 


219 


1 


RLKI KLLKI I ITTTI DSGICS FKS CKL* 
H/LKIQNMWU3KIAHAYNPSTSGGRGRQ 
IT*GQEFKTSLGNMVKP 


10010 


23911 


A 


10085 


364 


413 


KKKGGEKGPLLKKER* KTNGQFLVHTNF 
/RLPGLKYFF/CPPPPSKWGLRAPPPKP 
GDFFFFFFFFLVFLVETGFHHVGQAGLE 
LLTS * P I PLGFPKCWDYRRE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


I Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phcnylalaninc, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, JL=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosinc, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1001 1 


23912 


A 


10086 


200 


3 


INFJLLIjFIFVIjLI fetgpcs iaqatvq 
WRDHGTLQPQPLGLKQAILP/ASAP*VP. 
RITDVHHHARLN 


10012 


23913 


A 


10087 


154 


329 


FSL,SDFKTYYKAAGHRGSCL*YQHFGKP 
\RRTDPLRPAVQDQPDQRGETPSLLKKK 
KKK 


10013 


23914 


A 


10088 


157 


390 


NSSSFFPPSCGAQ/KSEVKKSSGPHSSS 
S SG * RGE / CLPLLVSGGSWHFLAYGSLT 
P VSACVFT * PFSMS L.CVY IQ I S VS 


10014 


23915 


A 


10089 


305 


1 


KKKRAQSLPGQKGGRGFSQPVTAGDRSF 
RPKIR KEALGLGQLGLTDLCRTFP PTAT 
G*TFFSSAPFPG\YPPV*PTKKKNTNL)N 
*FTRI KI ISRIFSGHEK 


10015 


23916 


A 


10090 


329 


2 


RGFFQPLGFWDPGPISNLGGPPKNLLLK 
AEIMGYNFCVPPFPPGGLPTTHFFFPPR 
NPPF/CFPFTPSLGFFSPNPPGFFFFFF 
FFLSRVS LCHLGWS AVA* S WLS AAS 


10016 


23917 


A 


10091 


84 


408 


EKKGENKLFWGFLLIjGTLFIGGYRFKI I 
FCLVGYFIFIYLFGEME/FSLLLPGL*C 
NGTCSVHHNLPLPGASGSPASTSGEGGI 
TGMRRHPRLY* I YLGRWSFALVARAVMQ 
WHVLGSPQPSPAWCKRFSCLNLRRGWYY 
RHAPAPPSLLNIQKI I LGFWLAPVY 


10017 


23918 


A 


10092 


184 


359 


FQINS S VTRRSFVLAAQAGVQWRDLGS P 
LLPPP \GSSDSPASAS *VAGITGLiHVGIj 
SHK 


10018 


23919 


A 


10093 


298 


386 


KLiCEK* * ITPIRMAI I * KKREGENS \ KC 
W*GCGEIGTLVYCL*EYKMVQPLWKIVW 
QFLKILGIEL 














ICMNEMILiYKQDFGRTIIIRKIVIYFWI 
FN*FGH*IRIY*NLLRYKTCML*RG**I 
NLPQRS FKNLRLKI * * W/WPGTVAHAYN 
PNTSGGRGGRIT*DREFETS 


10020 


23921 


A 


10095 


40 


407 


EHTHTTACEPGGI* IRPVD*MVXiISWL* 
HGTITCKMVTLGETGQKPQGISLIIIFF 
FFG \TQSL FVPRNGVHCRGP I S AR * KLC 
LPGPPLSPT*AS*VAETTGACYCTWFEN 
NFFLRLPLHCY 


10021 


23922 


A 


10096 


189 


2 


PPFFFFSNL.FKKSFYEQGLALSPRLKCR 
GIIIA/HCS/L*TPG/YK*SSCLSLP\S 
SWDDRCAPPRV 


10022 


23923 


A 


10097 


260 


401 


DHLRSGVQD*LGQHGETSSLLKVQKliDG 
RGGLQLIP\ RRLMHENHLN 














WYMTRCGSLGPSSAAHESHHPPPQQLMN 
P\PSPSSAAHESHHPPPQQLMNPTIPLL 
SS*IPPSPSSAHESPGRVGRPEFTGGAH 
GYLHLSPP 














FFGGAPCPPPRRKGKTPPPPRKLFPPFF 
PKKPPLPKKNC*NKNPL,LGPKKKNKRPP 
PGRFFKKGFF*NPPKAPLTNFWEKKKKK 
\SPFFKGVFFPPGEPKKPPPFFFFFFFF 
FFFFPTRPPRVRPRVRPRV 


10025 


23926 


A 


10100 


253 


3 


NAINRNKMQ* TKTSEWKVYADHNATI KQ 
KQKRKCQVli / WRGCGETETLVAC W WE CQ 
MEQPAPVEW/W*HFLENIKIQLPYDPAI 
HE 


10026 


23927 


A 


10101 


121 


404 


VLASWSIFFFFFFWKGGLIFSPRLEGG 
G/LILG*LKPPL,SGLGPFSCLTLPEGWE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

peptide 
sequence 


Predict- 
ed end 
nucle- 

location 

ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 

Q=Glutamlne, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














L,KPPPPDPGFFFFF*GKTGFPRGGQEGL 
NFLTWEPPPLAL 


10027 


23928 


A 


10102 


468 


1829 


I RDT YTQNPDKAKGYFSCI FFE I EETTM 
SRKQNQKDSSGFI FDLQSNTVIAQGGAF 
ENMKEKINAVRAIVPNKSNNEI ILVLQH 
FDNCVDKTVQAFMEGSASEVLKEWTVTG 
KKKNKKKKNKPKPAAEPSNGI PDSSKS V 
SIQEEQSAPSSEKGGMNGYHVNGAINDT 
ESVDSLSEGLETLSIDARELEDPESAMIj 
DMLDRTGSMLQNGVSDFETKSLTMHSIH 
NSQQPRNAAKSLSRPTTETQFSNMGMED 
V/RPRHQ* KS * VP I FEKSVKDLQRCTVS 
LARYRVWKEEMDAS IKKMKQAFAELET 
LINGSRKWALLAEMDKVKAEAMEFLLSR 
QKKAELLKKMTHVAVQMSEQQLVEIjRAD 
I KH FVS ERKYDEDLGRVAR FTCDVETLK 
KSIDSFGQVSHPKNSYSTRSRCSSVTSV 
SLSRPSDASAASSSTCASPPSHTSANKK 
NFAPGVY 


10028 


23929 


A 


10103 


254 


2 


KKKDFPLTPLGFGGLKIQLFFFLQKKKN 
FYFPP* IFPKIFFFFFFWFGTQSCPVT\ 
RLECSDRIPAHYNLHLQSSSHSRASVSH 


10029 


23930 


A 


10104 


256 


419 


KVIELHTGR/ CAVGQHVGRPRRVDCLSS 


10030 


23931 


A 


10105 


153 


439 


TTVTSLCIYYTIYYYFRLLSTHAYTHIY 
THIYV*KLTIKQPQAGPSEGF*KKEF* * 

QEFETSLANMVKL 


10031 


23932 


A 


10106 


424 


157 


SSHLGRPRWVDHLSPGVPDQPWQHGETP 
SL/LKLAGGACL*SQLIGRM/RIjSSGGR 

FRMFKN 


10032 


23933 


A 


10107 


213 


3 


NYKCPPTPNLNTLKALYFKKLiILiSPKA* 
AIKEKID* /IIKIKHFYASNIINRMKRQ 
LTEWKKI FANHVSDKG 


10033 


23934 


A 


10108 


300 


1 


TPKQRGKKGSPREKPFPVNLPGKTLF/Y " 
EV* FMNPLK*QKKKNFQGFFFFFLICGT 

RGGWIT*DQEFETSLG 


10034 


23935 


A 


10109 


120 


421 


KKIKKKNKNHMI I \S IDAE* AFDKIQHR 
FMIKTLNKLGPEGKHI/KIMKAIYGRGG 
QDGQLHTAKK/EPLPLRENKISGKP 


10035 








102 




QAGVQWCDHSSLQPRPSG\SGDSPTIAS 
WGGITGVHCHIWPTFYFLFLIRLKSLM 
L * * TS S EMALTKMGVHSHGL FN 


10036 


23937 


A 


10111 


382 


2 


VNNATMPSKCSTEKSHMSL.TSNQKLEMI ~ 
KIjSEKGMIjKAKIGIjKIjGLIjC/QVSQWN 
AKKKF * RE I * NPTPVNI R/DMQKVLV/ V 
KTEDQTS HNI PLNQ I h I QNKS LTL FNS I 
KAERSEEAA*EKCEDSRG 


10037 


23938 


A 


10112 


61 


427 


ENTIYQVQCSIKEEYLQRRTLKYASLFQ 
KHICGSLAFLHIiL* PKLYHSRSNAESDW 
IIQLFSVQPNVKEISKHKIGW*RGHACN 
/RQHFERPRHKDHLRLGVRDHPCQHGET 
PSLLSLLKIHI 


10038 


23939 


A 


10113 


71 


399 | HVLGKEARSVGWEAVRGQISRVSIQMKW | 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 

last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=A!anine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylnianine, 

K=Lysine, L=Leucinc, M=iVIethionine, 
N=Asparagine, P=Proline, 

T=Threonine, V=Valfne, 
W=Tryptophan, Y= Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Impossible 














KGLiIKGEKLKMEKSQACICI YFFFFETE 
TNFIAQAGGQGHDLGFMEPPSPG\ + SDS 
PA * APHEQNGL / TGMHHNPRVI FV 














SKKKSWGSHSGSKCHTSFLSYPTKEQKS 
YLSLFYKCNLQSLTFKELLDHFRPVCLP 
TIY*NKNWP/RDLPLKPSFKTLML*NQT 
L/ WPGWAHACNPS ILGGQGGWITRGQE 
LKTIKANTVK 














* KPKMAT/RNTDHDNFW * GHGETE ILNH 
YWWDYKIM*PLWKTI*QFPKMFNIKLPY 
VSYIQLLGIYPRNLKINIHTKT 


10041 


23942 


A 


10116 


44 


394 


PGFQRVSQDGI J DI J I J TS*SAHLGLPKCWD 
YRHEPLRPAG I YSYPAVLFVliSTYRASN 
IT*MLMTPKPTSG\YSFLLS FRFSSFSY 
LRD I S AYMSLSCLKLNVFKSKP 1 1 FLS P 
LWLLV 














MESCCFMGIEFVLKEEVIjEIGCRTM* iy 
LHY*SLHLRMVKMVTSLLCVFYHN*TIF 
KKSSVFLYAGNKQL*I* ILKNYI YVMI P 
FTITSRNMKRNNLTKDGKDMSTETYKTQ 
LKE I TD \ * NKWRDTTFMHQN 














KVPVWATCGVilMHRLtAGHGGWRLiWTRLP 
WRLRRDNRLSSGSRGA\AAEIIjPLCHCT 
PTWVTEHDS ITYKY* KKGAPCWRASS * C 
TLSGHAli FFFA 


10044 


23945 


A 


10119 


258 


401 


YNYPFLTCVMKEKIGAGMAAHACNPSTP 
GGRGG * \ TRGQHLETS PTNMV 


10045 


23946 


A 


10120 


310 


384 


GVFPFLTFLYGGIKGGPPPPRFFLF* FA 
FLFFFF/ETGSHSVAQANCSGSVSAHRS 
FHLPG PGD PPTS AS * VTGTTG I CHHS 


10046 


23947 


A 


10121 


245 


479 


TSSLYRKIQKRFLYMLALEVLKICNTYF 
MNTERFPLKLFGQFS/FRIISI* FFPFL 
FPKI YHYFLRDRV* LCHPGWSAV 


10047 


23948 


A 


10122 


268 


469 


NSVFWLGWAHA/ W * VNPSTLGGRGGW I 
TRGQELiETTPANILPP 


10048 


23949 


A 


10123 


221 


423 


GQAGAVAHACWPNALGRPRREDSLRPGV 
GDQPGQHSKTPSL/LKSDNF*KMGWLGP 
WAWWLTPIIPALW 


10049 


23950 


A 


10124 


73 


183 


AGGGWAliTNTVINPT\PPSTPFTPCLSY 
SCTISTASAFRFPG\STSTAVIjTHVRIIj 
KSTPDAPHPPLKI PRARRNI QRDHTLS S 
NLFYYNHQLNVTEELTYYDLT* inptat 
ihaihtmpllflhnlncisi 






— 








PNXjGPCPPPRGTKGGPFFPSFHPSI\IG 
PPNFFSF*RP*FF*MLKSLGVP/SPTGP 
IPKFPFFPGFKNYFSPPL\PKGPL.LKPP 
RALFFFFFFNRHRVIiL\CPGWSPTAGLK 


10Q5I 


23952 




10 p 6 


"85 


~442 


KKVTNTKC W* R VWRTGSL IHCS WG * KMM 
PHWRAVWQSFR/SLSMYLSYEPALPLPG 
SYPRALRTTCVCTHSRCFSLLELPQA/W 
NH 


10052 


23953 


A 


10127 


180 


484 


KS FFFFSQS AQP I KVI PAP *NFHFFGVK 
RSSCLS FLRKWGPRWVPPPPFFFFFFFL 
I FVEMRSHYAAQAGLKLLASS \VPPPSG 
IPRTLD*SYPD 


10053 


23954 


A 


10128 


376 


1 


WSPGAQLLFISCLYYNLWSLHTGRTPAK 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Y=possible 
nucleotide insertion 














PP*GWTiiQKTTPLSCPAP*LPPPLPSPL 
QQ I DLKEKL VFTAPACSL / PHS PNS S P W 
PQRRWPSVS/P/PVAHPAAPRLPPPPVC 
ARPLWLPEAPLLGIPPH 


10054 


23955 


A 


10129 


140 


454 


CLIRSS INSVHNNI YIHSTPDRHLGCFQ 
FLWTKNAAVSRIiAYVF'* * I * I PTYDHI 
HTHTP AG YI P RHG r AGS QLTNI FN FS S * 
LSSNSF\QVGCTSYFLFPSHI 


10055 


23956 


A 


10130 


340 


473 


AGARHSVLRPRPPIiAGSSCPSS/CGGHC 
VALG VS C PGGG CRAAC VPG CC C LLCG L P 
SDVCVC/SVCWGGSMGSRLGGWAAPGF* 
GNSREGPGNCSSGRKAGLAGGGLGPAWA 
EQAPDILCCGPGPHWLGAPAQVPLAPPL 
S P PPCLYGCLYSDRGDL 


10056 


23957 


A 


10131 


280 


2 


DS LALL P RLE YNGVNVAHCKLKS S W I RR 
SSHLSLPKHLDYRSMSH\L,PGLEIF*YS 
YCK*CLLTSPYFEFLiMNSLEDISSCFIP 
IRVMPKCRMK 


10057 


23958 


A 


10132 


244 


3 


IISTDLISHSWTLYblSSLVYVFLKQHS 
LAVLPRLDCSGMIIAHC/KLQTPGLKR/ 
FL/ CLS LjPST*DYRHTPPNRANFYYYYF 


10058 


23959 


A 


10133 


272 


3 


VGKPRVFP YR I AF'NFGRVT S I LGPH I KK 
YHFQCRGGKHCFF*KAPPFFFF\FNETG 
FCSVPRLECSGTI IALCSLNLQGSCNSP 
TSVSQV 


10059 


23960 


A 


10134 


217 


3 


FPRFRPLVS PALTELGQF/ HFLiTRGL * K . 
KKPFPFIjFFFFSETESCSIA\RLWGSGM 
ISAHCNIjCIjPGSSRTRG 


10060 


23961 


A 


10135 


229 


3 


kndylkcwqgcgrtetqmlgcwdik/li 
qp/lwki iwqfllklnmhllcnptlpll 

GIYP*EMNVYVHTQTCICTDAW 


10061 


23962 


A 


10136 


84 


352 


RREFKTGLATWQNLVSTKKYKTTVRYHL 
TSTRMDI I KGKR * QVCW/RKC /GDKGTL 
LDC/WMRMLVQPIWKIV*VWKFCRK*KL 
NLFFPPPPQ 














KCVWPEERILREKSQHLGKIKQEGRLN 
AWLIQPGQHGETLSLQKI / QLAGHGGA 
PLRSQLHRRMRREDHLSPGVRSRSEP* * 


10063 


23964 


A 


10138 


2 


386 


HIQEKQLPNKDSTLNPYFLLlSIIiKRKD 
EKIQMPSKHMKRCSTSLVIREMQIKIRS 
HFPH*VGKILKSANIKRW*MCRKRG\TL 
ITCW*TYK*TYKLVKTF*RIILQHPLLK 
THQSYYPAIPHLKETQA 


10064 


23965 


A 


10139 


229 


1 


TRSYHESLTILHTNHHLNCSPFPQAITP 
*SRFPYSNRA\WLGSVAHTYKPSTLGGR 
GRRNA*GQEFKTSLGNLPK 


10065 


23966 


A 


10140 


261 


429 


LSHL FHVAL I * LGIiVW/ S F VF VFFETGS 
LPRLECSGVITAHCSLDLLGLSGLPTSA 


10066 


23967 


A 


10141 


377 


34 


WVFGSSGPPPPGFKKFFSPPPPKNKNPR 
GPPPPPIFFFFFCKKGVSPLFLPPGGGP 
/HPFLSPPPPGVKPENFFDPGGGGFP*T 
KIFSRPPP PGGKQNFFFLKKKKKKKEHQ 
IKR 


10067 


23968 


A 


10142 


208 


424 


FFSLFMQTFHIHIRFYFFLAGGGGAGVD 
RVS / LLS PRLECRDTI S VHCY I H3JPGS K 
QFSCLSIiLSN*NYRHAP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


I'redict- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Protine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10068 


23969 


A 


10143 


1 


297 


PTRS PQVSNLNHKS FISFYLFIDLLSVC 
LSVS VYFETS LTLL PRLE CSG AI TTHL S 
LLSS*GHRHAPPCQV\FFVETRFHNVSQ 
SGIELLSSSHLPTPA 


10069 


23970 


A 


10144 


185 


488 


SGF PRGTEL IGAVCVCVCVCACVCVC I K 
LGYI *NHKRIYI*LtiYIYSTYYIFNLSL 
*LSVY\SYVNRNSM*DRSQSI*IFILPK 
LKTCLENTHTLKNTHEGG 


10070 


23971 


A 


10145 


283 




LGW PGRKADGSGKLTI DYGRLD KX.VS P \ 
I*SAISDMASTTKAVLiQAQRDQYSVXiDL 
AHAFFSIR\NQSQFSFIRDSPQYAFTLP 
SEEHLNSLDGQ 


10071 


23972 


A 


10146 


146 


380 


CFLWDNFFFFI FTEKTGSCS I AQAGAGT 
Q/WVGIISYCSLKLMGSRDPPTSLSCVA 
GTTGMPHHAQVIYFFIFL*RQGL 


10072 


23973 


A 


10147 


275 


3 


I KGML KAKTG * KLGIiLCQ/ TSQVENAKE 
EFLKEI KSATPVNT / RMI R KRNCLI AEM 
EKV* VTLIDQTGHN I P LSQS LPQS KCLT 
LFNSKKAER 


10073 


23974 


A 


10148 


281 




KIFLPQPPQEMVFQGFAPPGG*IFLVF* 
*KQPFPPLGRFVFK/LPGGIPFFFFFFF 
FFFFFETES HA I T \ RL ECSG TI S AHCHL 
RLSGSSDPTRP 


10074 


23975 


A 


10149 


413 


139 


SLRVWPIFGLLQANKKKFFWILFLS/IR 
PL.FFFSRERGGTPLFPPFFLRNPGKKSF 
PPG*PPPPLKIQFPGKYFFTSRPAPFFP 
LFCQPPGKNLNYWGLGPGFPKI FPPPPL 
F FFF F * D R VS LCRS G W S KVARS * LTWG K 
AFFTWVS 


10075 


23976 


A 


10150 


100 


431 


GPRL TDHLRSG VPKHPGQHGE TS FLLI / 
HNYSGG*GRKIANRIiNPKGGGCSEPKSC 
HCTPSWGKKRNFVS KKKKKKRE ' 


10076 


23977 


A 


10151 


134 


417 


GLAAPLVG/ WGKRNPPP*PFG*GEKTGW 
GPHPGKIFLiKKKLLFFFFF*KGGLSFPP 
GGRERAHFWLKKTP PSRKKE ILPPPPPR 
RGGGGPPPPPFL 


10077 


23978 


A 


10152 


403 


2 


LAFPILQEVPQCFCNLLHSLRMYSSTGS 
KLFPLASPQPHSLLYHLTLARAPMILEE 
LQS PNTALLLKYL * E AALKKKSR * H I TS 
SL* KVDLS FFLETKPRSI A\RLKCSGTI 
SAHCSLRLLGPSNSCATASQVAG 


10078 


23979 


A 


10153 


382 


297 


GLiN+ESWSGEKGTL/LHCWWECKFVQPL 
WKMV*RL 


10079 


23980 


A 


10154 


209 


15 


SSPKF*FFFLSPKKIPLPFF*PIFFFTK 
TP/HFFF/SFFFFVCVSGSHSVPRLSCS 
GAITAHCSSTS 


10080 


23981 


A 


10155 


153 


422 


RTS GKNGNRGQ 1 1 L VTMVTNWLFS F F F F 
PLKTGPHFVIQAGGQGRAHGSLQA* I FG 
\ SGDLLALVS LQAGTAGFHHHAQLS FFF 
KKTGLF 


10081 


23982 


A 


10156 


257 


3 


RRLQNS KDCCLFFPLEAPRGATARCQPE 
LSCMRCLSA\KLGGVSQSGYMGVRD?HK 
EAV* PLAEL.KR* AGRSS APFKAVRQGCL 
S 


100S2 


23983 


A 


10157 


37 


386 


DATRFRINSVGTAEGGLDRIEPNDLQ* Y 
INIiVGTAVRRPDRI ESNFESSTVGKKLS 
NNIVCYRSFFCERESQLTQQTSMLYYFK 
KLPQLPQPSAITTLH/SRQDPPSARRLQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


1 SEQID 
NO: in 
USSN 
09/515,1 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 

acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidinc, I=Isoleucine, 
K=L/ysine, L#=L»eucine, IW— Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 

W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LGQ 


10083 


23984 


A 


10158 


407 


272 


HKGGKRERKRKKKEKKEGGREKGQERRK 
ERE/RERKKGRKREMKKVNRYSKCTQ* * 
LLSN 


10084 


23985 


A 


10159 


248 


478 


VKWHRLLLDPSVEDPWKKSLKGPLQP* 
/RD*IFFF*NSGIjIjCHPSWRAVAQSRFT 
GVSTSQAQAIL.PSFSLjPSSWDR 


10085 


23986 


A 


10160 


458 


129 


GFS VLNRLEDEDLRQENLLM PAV* DQPG 
QHSKTPI STKNNMNPAGHGGVYP* SQLL 
RTVRPEDHLY/SNKPCSYLCAPAWATET 
L«S LKNAFS CCSNHYGLS FQTS FHRI LMF 


10086 


23987 


A 


10161 


82 


291 


YSKTKSLFFAGHGGTCIYSQLFRRQVQV 
RQEDHANPGV*GCS /DTMIMPMNSHCTP 
AWAA+ * D PVS KKKKK 


10087 


23988 


A 


10162 


6 


409 


LRINILPEPISLSPLTAPAVFMPWTNYS 
SCDTFSLASMTLLPLHFLSTSLALPWSL 
FIAPSLFQISKHQLYLSPVLIjTLSSCYF 
LNP C FRNH I LMNLGRPRQAN YLR S GV * D 
QPGQRGETPSL/IiKNTKITWPWW 


100S8 


23989 


A 


10163 


249 


415 


APPECSLPLLVLLLLLLLLLFLRWSDHL 
GPGG*GCSEPKSYHCTSAWVTR\GDPVS 


10089 


23990 


A 


10164 


1 


405 


RSEWRIiRQPERQSETVKKKKKGFFFFSR 
GGGGGKIFGLMEPSPSGLGQFFLFTPLE 
NGGYRPPPLGGPFFFFFKKGGFPLCGPG 
GFEFPALGTPPLLFPKGLNFRGGPTPPA 
RFFFFWGPVFSVFSPKN*NPPVFRGKNG 
KKTRPPKKKKPGRG\WGPPRKFNPFGNKR 
GGVPRAGNSKPPGPQRGNPPFLKKKKKG 
PPKGGGR*PPFSRGVNKKNCPKPEGEGS 
INPKIFPPPPPREKKKKPFFFFLTVSLC 
RSGWRNLHSL 


10090 


23991 


A 


10165 


175 


404 


AI KMVW I QAVPSQCLHGVGFYLPRSVCV 
KTP*NKRHRQGWAHGCNPSTLGGRGGW 
I \ RGQE FETS LTNM VKP 


10091 


23992 


A 


10166 


317 


2 


PFKNS PLLLNEMTN IHFLEPTCNYVNVS 
PFLK*HKFLFF*EMESCSVT\RLECSVT 
I I AHCCLKLLGTSNPPTSAS WAPGMI IG 
MRH CPAP KVCSHAS AHAS AHAS 














KMGEKQGHFIKEEIRKANKHIERCSRLL 
AI EEM * I KATVTI / RKHQTKC* QG* RES 
GSLVHG* * KHKIVHPLMKILSVSYKTKH 
S I I I IGYSS VLLGTVLVHSGSDAW 














GEPFLFCFPIKKPKVFFCPYFNQAVFFL 
ETHRGFFEFFFLF * KKKKKS / LGMVAHT 
YNPSTLGSRGGQIA*AQEFKTILGNMS 


10094 


23995 


A 


10169 


164 


1 


MNKMSQYTFPYKN*QWGG\SVAHPYNPS 
TLGGQAMQ I T * AKE FETS LSNMAKP C 


10095 


23996 


A 


10170 


214 


419 


LS * PPP P FPG FKKFNRVLRFYGKS KPGP 
/FMFVALFQKGGPFYQNIPPPVKLEPFP 
PTIIjRVSFFFL>*DGVWLiCHPGWNAVA* s 
QLTASSTSRVH 


10096 


23997 


A 


10171 


168 


1 


WTWWLFPGFSITNNYCSHFSSLHLAIW/ 
LGTVAHACNPSTLGG * GARGQE FETSLT 


10097 


23998 


A 


10172 


139 


3 


PAH/CES*PPGLKRSSYVSLPSG*DYR/ 
HTPPH\QLIFVFWEMGFHQPA/SA/FP 
KC*DYRS\DHHKWPT*HLK/SFHFIKK* 
N*LKRYLACSRLWYLFCLIRCMV* *NLD 
DLS ILYLHLNKFQFLFI YLFFEMES 
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_0164S35A2_L> 



WO 0.1/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Pred icted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


1 Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D^Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=JLcucine, M=Methionine, 
. N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10098 


23999 


A 


10173 


253 


2 


FPLKICNTLPPQIFSTQRGL.LSLPFTIF 
LLSLILLRPFFFFF*TESHSVPECSGMI 
SAHCSLRLPGSSDSCASAS \QVAGKTCA 
R 


10099 


24000 


A 


10174 


300 


3 


GRAFFFKGLVKWVFLTPNP I PFLLKN FL " 
KRVFFLAPSEKFVFFKNWPP+LSFLIFG 
EKNFFFFFFFFSEMESCSVA\RLECNGM 
ISAHCNICLPGSSDSP 


10100 


24001 


A 


10175 


117 


381 


EILRYIVSIRIRRTNLKFQQEELRVNIK 
NKLSSLGNMVRPRL/ECSGVITAPTAVS 
APGFK*SSHPSLPSSWDHRRAPLHLGDF 
FLIETRSHHVAQAGQVIFNVYSKFLLLK 
FKISPS DP YRNN I SQDFLAI GPTNMVPL 
QPYI VLCACRTKNNKI IDLDSEQSSRT 


10101 


24002 


A 


10176 


194 


412 


TLKNLLLSLHCLS FVPTDLYQPERLS K? 
PRPAKKYKFLI L / WLGWAHACNPSTLG 
GRGGWIT*AHEFDINLTK 


10102 


24003 


A 


10177 


100 


339 


VP VKS LLLCVC VC VLLL *RISDHLPTLY 
ANLG*CFFSYLVISYS\NSTYCILQRHL 
LAIQKQP/CSFLRQ'/HLVGWLHKKKHTH 
THTHTHTHTHKSNTHTHTHKS KDLTGTQ 
NTTEFLLK 


10103 


24004 


A 


10178 


229 


2 


KEKYI KSKLHFEARHNGSHL * SQHFGR P 
RLMDCL\VQDQPGKCGENPYLQKNTKIS 
QAWWLVPWPVYSHRAS KTLS 


10104 


24005 


A 


10179 


368 


2 


SGSSPSTSPQGRPDPLGLFSAKIVLGQL. 
ATIPLPLCFLLKIFHPLTLPPLLLLGGE 
FNLSPQQQDS I S LPW FLC YRYGPTYKAF 
FYV*ASVPEKI\FFFFFF*DTVSLCHPG 
WTTVAQSRLTA 


10105 


24006 


A 


10180 


398 


1 


QNI MLYTINTYNFICQLKKKTI QWGENF 
FFNKWC*ENWIFTCKRKNLHPYLIPYTK 
TNS KW I S ELNL \ KS LKVNRG GNLHD FQF 
GNGFLGDTKSKTK/ AKISKLEFIKIKNF 
CVLNDII KKVKRQLIE WKKYLQ 


10106 


24007 


A 


10181 


252 


419 


GARFKGSNFTSAGGQGI IFFMGPPKLIS 
RPVF*HRGEGKTPGVTQLNRLEEHPLFA 


10107 


2400S 


A 


10182 


101 


380 


FC W I YNS WL I G YL FFP FRTLN I LAM CGG 
SCL*SQYFGRQRQADHLSPGVRDQPGQY 
GETPS /LTKI S WAWWHVP WPATQEAEV 
GGSLEPREVK 


10108 


24009 


A 


10183 


2 


377 


PGQVDCFSSGVQDQPDQYGEIIiFLLKIjQ 
KLGGHGG / IR * EDHLSLGGQGCKEPRLY 
HCTPAWVTEQYLVSQKKKKKKKK 


10109 


24010 


A 


10184 


140 


3 


DGVXJjSLiPRLE CNGAISAHRN \ SPS CDS 
PASASQAGCCGTCP*SQL 


10110 


24011 


A 


10185 


364 


3 


SVCPPRXiGGVSQSGDMGVRDPFEEAV* P 
LAEIiERCAGRSTAFFRAIR*AHLSLLK\ 
FTRNHPFPHVPCPRVMSR+ FGLLPFFQR 
CPAQRGE I WRRSLAAVALRS CGGRLQVR 
TCWQLCLHR 


10111 


24012 


A 


10186 


2 


249 


S VRLNKNKENTQ I RNKKGDT INLTE I KR 
I IKEYSEQLFANKLH/SI.DEMNKFLER* 
NPLRPGAEAQRLVS *RFWRATAI DHMRP 


101 12 


24013 


A 


10187 


270 


2 


PWAQRGCVFKTNQKIFFGFFEIPHLD* P 
CVFQRPPS\F*VRLIFIGGGFKPFFFFF 
FEMKSCSVARLECSTISAHCMLRLPGSS 
DSPASA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
. beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutnmic Acid, F=Phenylalanine, 
G=Glycine, H=Histidinc, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophnn, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


101 13 


24014 


A 


10188 


428 


28 


PIFDKG/E*GN* WGKKSIFNKWCWEKCI 
C/KKLRDSSLTSY/TRINLKWIKDLNVR 
TK/STKLLEENIDIV/IYDLGLGIGFLD 
MTLKARATK/AKI EKLDKI KIGNFYAS K 
DPKKMKR*FIEFIEQGKIFTNHVSDDTS 
I 


10114 


24015 


A 


10189 . 


85 


243 


DRVS/liLSPRLECSGMISAHCSFDLPG* 
SDPFTSVPKSSWNYRCPPARKLSFT 


10115 


24016 


A 


10190 


96 


440 


LQSNHINTLNVNRLSSSMKRFRLSDEI F 
\KKHGPALCCL*EMYYKDKGTCTLKIKG 
WKNIYHANANNNKT/GMVLLSGKIRLES 
N/TVIRVKEG* LVI /INGRLI /QEDLIIi 
LFILNVYAP 


10116 


24017 


A 


10191 


137 


423 


AKDLDRQFS KED IHMTNR YMKTC/SGRQ 
LIRETQIKTTMRYPLTPLDLRKSKDKKS 
* * * YEQKGTPAHC* *ECKLVQPLQKI I * 
TFPKLLKINL 


10117 


24018 


A 


10192 


221 


72 


AKVK*VWFFCFL,FFF*DGVS /HWVAQ 
ECSGAI S AHCNLCLPGSSDS VRVG 


10118 


24019 


A 


10193 


460 


121 


SDKTKYW*RCRAIKTLTHF*LKSKLVPL 
L* KTI + CYXAKLKIHIVFDSAI PL*DTY 
FR/E* FCICAPKTYSNMVIEAL.FVPANQ 
TS KKKNWK* LS CPQ I VDNFLNIHLMKYH 
LE 


10119 


24020 


A 


10194 


393 


2 


ATMPS YQRLLLTAVHELES PE * MDQFLD 
TYNLPRFNQEEIENLNSPIMSNEIQSVI 
KCLPFGPNPGPNGFIIEFYHTYKEE/LP 
ILL * LFQKI EG \ I LPNS FHEVS ITLIRK 
TDKDAT * ETHKP I S LMNTDA 


10120 


24021 


A 


10195 


3 


353 


EAAPMPVKFFQLLEPSGPPGLSMSCLGD 
CNGYACFS VSLHQSANQS I DQ/ SIYQSA 
IHLLIY/VSIIY*SI\YLLL*SLI*PMY 
VP I Y PLS CLS I YLS FMPLS C L.S FNNLS I 
TRHGSMCL 


10121 


24022 


A 


10196 


118 


395 


DFNAGLLPSYQVLSLYFCF*NGV\CHCT 
P /RLGDRPRLR/ P P KKKKKKTP P PPLGG 
SPPGGEPYTPRPGVGPPPPPFEHPPREN 
PGCFHPPPKAWP 


10122 


24023 


A 


10197 


437 


160 


FGYPRVFPLPALLKRGPGILFWGPNKKN 
YSSPARGSKICFFKRGPPFFFFFYFFF* 
QTESCFVAPVGVQWCDHGSLQPQPPG\S 
SGPPPSTS 


10123 


24024 


A 


10198 


259 


3 


RQGNH VTHVYS S RLQ L I L S P P AS LAVF I 
QKIYVNLCrYIHIVNI J YA*MCICVCICY 
TCIL\ I CI YGLVAVAHTYNPTTLGAKLG 
GS 


10124 


24025 


A 


10199 


1 


416 


IISAHCNLHFPSSWDYSRTPPYPANFCI 
FSRDRVSPCWSGV*HHLEQHSKMLSKK* 
NKIDMHGGMCL* S * LLK/RAE * DVLRQE 
NCLNPGGRGCGEPGSRHCTPAWATRAKT 
LSKASYASSSSSPKKKKKKSPS 


10125 


24026 


A 


10200 


369 


3 


TAPPFFFFCRDGLRTSLIIFPSRATPYS 
LVTSGSFLPAGAGGRGLSLE I RVLDLVQ 
DGCPCSQAWAAALGGPGWWAP FLCFFET 
ASCSVAQEGMQ*CNLGSLQPPSPG\SSD 
SPASWLWTSK 


10126 


24027 


A 


10201 


31.4 


2 


liHHMRPEV* FTPSKLGVTGYMVSNTSYP 
WRCRAHK*SQVLVSLRHKHLFPLEMESG 
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ing to first 
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peptide 
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Predict- 
ed end 
nucle- 
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location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H— Histidine, 1— Isoleucine 
K=Lysine, L=Leucine, M=IMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 

nucleotide insertion 














S EVQL \ DTNRC I SANCNFCL PGS SNS PT 
SAS * EAGTTDSRQPDAWADAW 


10127 


24028 


A 


10202 


68 


413 


KIGI LLKTH* PS I C/ VS I TNF * YNDTGE 
LiKVKG*KNIYYANINQK/KAGMVMFVLiD 
ETDFRAKK/DYQGH* IMIEVSVHKE/E/ 
LVILNLYASNKR VTXYMK * KDTDIKGEI 
DKSTTIVGDL. 


10128 












PKFPPLNPAWGTTI KI FFKKKNAHNWFS 
PFIKEKALQKIHKKG/RLIAKEISFLGGF 
* TDYKTNPDSQS I RTLGEKKKKI RPGTV 
AYAYNPSTFGG* GRW I T*GQELKTSLAN 
MVGRV 


10129 


24030 


A 


10204 


2 


392 


FLEGVLLLLPRLECNGAIPAHHNLHLPS 
PRFKRLSCLSLPSSWDHRHAPTRPANFV 
FLVETGFHHVGQDGLDLLTL*SPCLGLP 
KCWD YRH \ DHHAQ P S F 


10130 


24031 


A 


10205 


264 


2 


PLSLSNLFPIPLNSILKIFFIL.K*L*IH 
GKLHKMYRKV /LV* TS PS PSQC * HLR * V 
* HN IQNKKLMGLGAVAHASNPSTLGGRG 
GWIT 


10131 


24032 


A 


10206 


281 


2 


ISPPLFFLFSPPKKIFFSQKHIFLGPPL 
FYPPPLFFSPPP\PHFLWGFRPVPNPMP 
KIDPQTWAKNKDKGPNFPPPFFFILPPK 
KNIFFTKTYFFGPPPFLPPPPFFFSPPK 
KK/SYPF PLKKNSPNPPPS* K* AA ISLiP 
PPFFYFFFFFFFS FFLENTWKMADAW 


10132 


24033 


A 


10207 


340 


1 


S TKKLVQKLGEHTQC WQGCE ATGTL IHC 
RWECKLVQP LWK/SAWQILRQL* NFMHL 
PYNPTIQLQGSYSRE\QHTCPYKDLiYVA 
ALFTIAKNC*QAKFLSTGE*3NKLWCIH 
ITE 


10133 


24034 


A 


10208 


136 


3 


KNRGPPPRSGKFFPFLPLFFFFFF*DRV 
\YPSHPGWSAMVPSQLT 


10134 


24035 


A 


10209 


327 


2 


INKKYYEQLYA*KFDNLDEIYQFPGRYN 
LPKLTQEKANNLKRPVSLRETKSTINNL 
PKRKVPDPDGFTGELYQTFKEEITP\FS 
TEAKGILPNSFCEVTI IVRLFLLRQ 


10135 




A 








EIVPLLRFPLVIFIASPTTFYTYGFVSG 
EKSQYKKREKLCDKTLEQSRSLLGMHST 
CGIALHPKREGQMCTLCACFY* FFLKES 
HSAARAGVQWHNLSSLQLPP \ PSSSNS 


10136 


24037 


A 


10211 


361 


462 


I QQ IGSNTGNI PAMEIiALKRNS KN INRL 

*IYQLKITF*NYEGMVP/FVFRFLFETR 
SLLSPRLECSGT ITTHCS LELVGS INSS 
I SARAE FLQRGANL 


10137 


24038 


A 


10212 


62 


469 


SLLWKLRSSGASTSLRCLSA\LLGDVSQ 
SGYTGVRDPLEEAVCLiLSELERGAGRTT 
AXiFRAVRQGRLSLiEKIj\LIjPFVQICPAP 
GGGIYR/GQ*AIjLSCGGLCPV*AS* * /M 
CLPC/EA*TRLLKPPPGS 


10138 


24039 


A 


10213 


197 


1 


WTQPNEWCVTNKDAS FQQRCTKK I KT/ 
WPGMVAYTCNPSTLGGRDGWIT+GQQFK 
TSLANMVKPH 


10139 


24040 


A 


10214 


31 


213 


HASLDEGFHHVGQDGLNLLTS * STCL.GL 
PKCWDYRHEP/PAPNWWKKFLSSKAFKR 
♦L.GCY 


10140 


24041 | 


A 


10215 


442 


29 


PT I NPTMACKCLS KRKS HTS FTLNQ KLE 
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Predicted 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspoh 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TIKLSEEGMLKAEAG * KLDHVH/ QVS Q I 
VNAKEKFLKEIKSATSLNI *MIRKQKRF 
IVNME/KKVLMFWTEDQTSHNI PLSQSL 
I RKRP * LFNSLKAERSEEAPEKLEAS SG 


10141 


24042 


A 


10216 


195 


533 


LVWFWLRVTYGVAAKMSSGTTI IGRLSW 
G WRLHF \ KMAHL » GYW WEAS VPLHMKLS 
SCHGTG\SLRVKDPRERLRQKLQCFYDL 
VLE VT YWH FYC I L F I RS KS L.S P AH I Q S K 
RI 


10142 


24043 


A 


10217 


274 


494 


CVLKGLPFALPWFFFFF*DKVSPSPRLE 
CS VAI /SAHCSLNLLGSSDPPAS ASQLA 
GPPGLRR 


10143 


24044 


A 


10218 


142 


480 


VTIKI YLHPHDTTTPLRQGGSRLKE I LV 
SY*QQKVETRPSHVIFFTVISTQFKKTV 
TTRSGAVAHTCNPSTLGGREGQI /T/RG 
QEFETSMTNPVK 


10144 


24045 


A 


10219 


163 


464 


AVYFSFWRVCEHDIASLYFENSGKRDNK 
FVTLVLQAQGPAWRQGEVHKHLLPCVYH 
FAFWLTQQPQSFF*DG/QSHSLAR/LE 
CGGVISAHCNLYLPGSSNS 


10145 


24046 


A 


10220 


379 


469 


V*KVQM/WPGMVAHTCHPSTLGG*GGWI 
T*DQEFETSLADVAKPC 


10146 


24047 


A 


10221 


211 


473 


VTWLLLFFSIFRASSLSTDSVSYVIAFQ 
FYFYLFIYFYLFIFETESCSVA*AGAQW 
CDLCSPQPPPRGS\SEEPSTLQMTSGSS 
NL 


10147 


24048 


A 


10222 


142 


394 


GVCQSLLGSLCQLGGTGVRDQLEEAVCP 
LAELEPCAWRTLLLRI CCSLQSQQAGI F 
/RFC*SYIHICPFPQVLLSSEMGVLSIR 
P 


10148 


24049 


A 


10224 


77 


396 


KRKTKKSFWTPSGFGLRPPIPKKKKFFN 
SRGKQGGFGRGSGSRGGGGSTSGRGGGY 
MGKVFGMGQ*NFTGDQGLAKGGFVFVFL 
GRTTIGIKGALKRLFA/NNEHDLR 


10149 


24050 




10225 


155 


443 


GQAREGTLKSQFPFPSPPGWPIjILPGGV 
FS PKFF F F FFETRV/ CAVAQAGVQWCNI 
DS SNS CAS S S * I AR I TGAHHH \ TEL I FV 
F*ISPPSQ 


10150 


24051 


A 


10226 


290 


485 


KRGEGPVQKKKFFFFFFETESCTVAQAG 
VQWHNLSSL*PPPPG\SSDSPAEFPR 


10151 


24052 




10227 


228 


424 


KLYQNLHDTLFGFELSHYEEVH* KMLKG 
SSWEFVMHSS/VSNKCILICSAKKKKKK 
KKKKKKKKKKKKK 


10152 


24053 


A ' 


10228 


269 


454 


DLFAASQGHAQLLEELMATTLS LTHTHT 
HTHTHTHTPWSLF*SR/SALSLVC 


10153 


24054 


A 


10229 


255 


1 


FYIFSKFLYVWLNVSHLDSHICFCIQSV 
1 1 SCFGVFYYYYY\ FF* RQS CS WHTGV 
QWCDRS SMQPQLQGQND PPTSAS QVAG S 


10154 


24055 


A 


10230 


18 


472 


GPEEFGTRRERDRERERETRDSRERVER 
VRERER* /RQRERERERERERER 


10155 


24056 


A 


10231 


52 


361 


WRKNYYFQIKITVYLENTKDFIQKLHTK 
CW*G*KEIGRAIHCWWECKM/VIQPVWK 
R/VWQFLRKLNMLLPHNPAVELLDIYPN 
ELKTHVCTKALYEAGRGGSRL 


10156 


24057 


A 


10232 


3. 


402 


AP RL PLGGW I S FSS PMGAMGLNRATG IM 
GSFFFFFliKRKSPLGPQGEGRGQNLG*G | 
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amino acid 

residue of 
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Predict- 
ed end 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NPGPRG * TKFPG * TP * KAGNNGEGPPPR 
GNFGFLFCKKGVPQKGQGGSKP/RDPRGT 
PRLGPPKGLE*RGGPPPPAPQK 


10157 


24058 


A 


10233 


48 


470 


PGRRFTTAGLGKQEPAASGPRAPRPAQT 
PLKREAARPSLRHLPEGA+APRSSARSP 
LL I PAASLRS PRRRTCRRRLRHFG 1 1 TR 
PFQPGVRLPDSGPGLA/RWACAAVPAAS 
RALPAliRR PL * RRPGKGAPS S KSHGHS R 
SP 


10158 


24059 


A 


10234 


169 


446 


CPPGPAPSHLLEDRDDPQAGRQSSFYAG 
WWDRWREKTliFLFLiRQSLTLSPTTPFFF 
\FEAESHSVTHSGMPGVISAH*NLRLPG 
SNNPPTSAS 


10159 


24060 


A 


10235 


264 


2 


KLKPIFKMVFbKIMFMSYIMEVFFFIET 
GSRSVTQAGVRWHDLDSLQP*PP\GSSD 
PPTSVSQVGKWHKTWEAELAVSRDHATP 
RAKL 


10160 


24061 


A 


10236 


306 


3 


SQDMVKSPKKKFP*LV*GLTEI*KIRAL 
VFNKKFFFFFFFPETGSTGSCSVA\RIiE 
CSGTI lAHCNLQLLGSSNPPASASRVIN 
SIKKLFANFIVSANPRPT 


10161 


24062 


A 


10237 


20 


469 


QCGAIPRRGDHLRSGV*DQPGQHGETPS 
LLKIQKLSRHDGH/ CRRLSHKNGLNPGG 
RGCSELRSCHCTPAWATR\GNSVSKKKK 
S 


10162 


24063 


A 


10238 


313 


3 


TRPRPTQGPPHSGGRACLCIFSPHGPPQ 
TRLSTPVSPSVRPLGFQVLLSARHGFLL, 
CQHGMF\PSLFTMSNITLWPGAVAHTCN 
PNTLGGRGG W IT*GQEFQTS 


10163 


24064 


A 


10239 


274 


49 


KGNPFFFVETGSHS IAQDGVQWCNHSSL, 
QP * PP \G\SSDPPTSASRVAGTTGVCPC 
EDKLSHILRRCYAQVFMLAG 


10164 


24065 


A 


10240 


144 


454 


VCKIKIIYLIVLLEDYIN*FVKSSI,*IiK 
S*K/LIQQMWAIISLPECLNSWCLGKV 
G*KRCSQQGTVAHTCNPSTLGGRGGMIP 
R 


10165 


24066 


A . 


10241 


289 


178 


FNYSSIKNNTDGWVWWCLLPVI PALWEV 
EAIES F / CSRDE I PLS CPGWS PVPKLKR 
FYCLNLP * CWDYRQEAPPHPAI S 1 1 FDA 
TVIELFS 


10166 


24067 


A 


10242 


352 


294 


PPRTTKMW*DQPGQHVEAVTLQKMQKL 
VERGGKHPQSQLC/REDH*SMGGGGCSE 
PRP*PCTPAWVTERDHVSKIK*NKIK 


10167 


24068 


A 


10243 


293 


1 


VSLCRESSNSFIWQCPWSHILALCFQR 
EKKRLDbP/S*MVGK*EIPFI J VPLI J FLP 
HWPLLLMK/YM* I FFFGNRVSLCHPGWS 
AVAQSQLTAASVPRR 


10168 


24069 


A 


10244 


218 


3 


LERHIPVRPIIICDYKWVWRF/HFLRPL 
HN*FLFSFFSFFFFFFETGSYSVT\RLE 
CSGTISAHCNLCVHLLL 


10169 


24070 


A 


10245 


185 


1 


KKQPQQTKNPLNNVKKEGKGQHFFFFLE 
T*SRSVAQAGVQWCDLGSLQPPPPG\SS 
DPPPS 


10170 


24071 


A 


10246 


2 


265 


AHKKKPAKHIAPSTFLYGETEARRSMPK 
AAQLKCGRAGI ILFVFKTESCSVAQDAV 
LWCNHGSLQP*PPG\SSDPADSACRVAR 
TTSI 


10171 


24072 


A 


10247 


2 


430 


EAGGREALDGCGLTGRESGVQRRRDSMG 
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ed end 
nucle- 
otide 
location 
correspoh 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, ]=Isoleucine, 
K=L.ysine, L=Leucine, M-Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CQGTAGRGTRAGKEGPWRPEHLSRRPPA 
SSVLLSWAIHLAWVEGS*TGSLCDLEAP 
GPT-LSPWCSWSPELLQLHV/HHHTHTHT 
HTKS/HPNTHTDTQSLCHQPSFSVSPRA 
RTVL 


10172 


24073 


A 


10248 


201 


16 


TPPGGVFFLGFFFLFFFFETESCCVAQA 
GVQWCDLGSPHP PASRRPGNRSS I /AAA 
*GYHFK 


10173 


24074 


A 


10249 


167 


2 


GFDPFEGMLSYLEKEET/ WPGTRAHTCN 
PSS LEG * GGW I T * GRE FKTS LTNMEKP 


10174 


24075 


A 


10250 


92 


389 


FQPGQHGEILSLYEIKKI* /ELAGCIDM 
CLW S Y / L LRRL KQEDCSST* VQGCS E P * 
CLS CI PS WVTEQD PDSGRQ I * KKKKISL 
GRRGGIFLFLKIWHVFFL 


10175 


24076 


A 


10251 


221 


415 


KKKKRGGRFKGSNFTS PGWRGN I FFMGP 
PKLNSRAGV*QRRD/WEKPGGPQFNCFG 
SNPPFPPGGK 


10176 


24077 


A 


10252 


94 


383 


GKLNRCG I WLLGRCQRSPAVSNALWL.HR 
VAPEEEAGEAWAGTLSLQLSCKSKITPN 
KIVY* KKNKT/WLGTVAHVCNPSTLGGR 
GGWII*GREFETSL 


10177 


24078 


A 


10253 


249 


2 


YPTM IFKCSSKRKSPTS LTLNQKLEMMK 
LSEEGMSKAELS * KLGLLH / QVS QWNA 
KEKFLKEIKSATPANTGMIRKRNGLIA 


10178 


24079 


A 


10254 


132 


414 


NNTYFQGKIFILNSQVKGMLICCWWECK 
* VEPLWKAVWRFLK\ DLEQNSKE YKL * T 
WNQPRYPAWDWIKKMWYIYFMEYHA\A 
IKKNEIMFFAA 


10179 


24080 


A 


10255 


359 


3 


YSLRHNIEMRPVSNPTRASKCSHERRSV 
ISLTLNGKREIIQEASKEGTLKANKPKA 
KLFVP VRQ WNAKE KFLKEAKSAAPLN \ 
RMIRKQNSLTADTEQV*VIWIEY*TSHS 
IPLSQSL 


10180 


24081 


A 


10256 


283 


2 


SLQPWKLYVHVFLPKYATYPPFGTLPKT 
HFFHKILPSKAGSDLNSIYAMTYSHRHI 
TIFFETESHSVTRLSCSGAI\SAH*NLH 
LPGSSNS PATK 


10181 


24082 


A 


10257 


216 


386 


PSKNPPPLINPFFFFETDSCSFA*AGVQ 
WRNLSSLQTPPP\GSNNSCASAS/RIAG 

I 


10182 


24083 


A 


10258 


263 


I 


FSQWLI LPS PLS YS FLFLRQGFALLPRL 
ECSGEITA\PSSAS*VAEI TG ACHHAE F 
LNFFVEMEFHHWQAALKPLGSSEPPTP 
ATK 


10183 


24084 


A 


10259 


248 


47 


I PKSRVKRENRLNPGGKGCN * PRSPPCS 
PPWVAKQNSLSKK/NKVKIYQNLHTETQ 
TVHGD I RGQEK 


10184 


24085 


A 


10260 


159 


2 


KRDMSSLKLEY* KKE/RIWPGAVAHTYS 
LSTLGGREWIT*GQEFEISLTNMVK 


10185 


24086 


A 


10261 


190 


405 


ASLHMFMYFFFFFFPKQIFFFVPQAENQ 
WGNLGKRNPPPPGLRDF\PASPPKEAGF 
RGG PH * P GLFLVF * GE 


10186 


24087 


A 


10262 


39 


293 


PSSSTSRIYPEDTPLITQKYICKILFIV 
ALF I FPKHWKLCKCPHIG/E * I KLQCI Y 
TWRYYVAVQKNEENLYQLMYNDYQEILV 
SD 


10187 j 24088 


A 


10263 


282 


3 


AFYNRFFLKIFFKRGPRGKEKKGGKKEK " 
KKWGKKKKKKFFFFS PGQI FFLGGVFFF 



1 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FLFFFEAESRSVT*AGVQWCDQ\SSSQP 
RPPRLKPFFQ 


10188 


24089 


A 


10264 


196 


376 


KKGGGHI INFPPHKGYKPGPEKKKFFFF 
FFEMES CS VA\RL* CSGM I / SQAHCNLC 
LPGSSN 


10189 


24090 


A 


10265 


514 


157 


GLTLS FRL.EYSGMMSL+ P* LPG / FR * FS 
RLSLPSSWVFPLMYVYHYAQQSFCIFFV 
ETGFHRWQAGLKVTiGLRQSAHLGLPEC 
W\DYRCEPL*LTK1iICFTYGIjR*RSKPP 
FFVDGIQLF 


10190 


24091 


A 


10266 


57 


378 


RSRIW*DSVILCS*LVSV/VDQSYFY*V 
TEILEQIKLLGSKIMCFCILQLHFLCFL 
CGS FVGS T \ I FI FS RNI WLGTVAHACNP 
STLGGRGRWIT*DQEFETCLASMVK 


10191 


24092 


A 


10267 


1 


212 


LIQKDTNI YYIKEAL* IPNKHI /KKISS 
I VSHYKMQ I K IMR / YHTRMAI I KRTDKC 
LYECRETRTLMHCW+GC 


10192 


24093 


A 


10268 


3 


399 


MANKHIKI FSASLIRMEI* IRATMRE/H 
* FI P I KWLVL.KS KRESNKCWQGCGRTG I 
VIHFQRECKIVLLL *KTV*QFLKKIiNIR 
LGAVAHS CNPSTLG CLRPRVQDQ PGQHG 
KTLSLLE I QKLAWHGLHLS PS 


10193 


24094 


A 


10269 


190 


386 


FSPFLPLVTILECDKAICRRGKSFDFFL 
LGPSFYFFKLLL/FIFYF*ERVLLCGPG 
WSTVAQSWLTA 


10194 


24095 


A 


10270 


166 


327 


KTNSKTAE VCPLS VI TLN I NG LNS P I KR 
QR*TDF*NPI\ICCLQETHFRFKGTN 


10195 


24096 


A 


10271 


180 


1 


IFFLFEVETCSVA*AGVQWNDLGSQQPP 
PP \GS S DFQLI FAFLVETGY * IMVPSSS 
CRT 


10196 


24097. 


A 


10272 


273 


3 


FLGLFSGNPLKANKPVCHLTLP I LQRIF 
FFFESEFHS \ VAQAG VQ WCDLiGS IiKP P P 
PG\SSDFPGRRR*Q*AMITPLHSSRGDR 
VRLiRLKKK 


10197 


24098 


A 


10273 


399 


27 


YHFVPTRMAI IRRKNN* CW*TCGE IGNL 
VYCWWECK/LV+LLRKIVWQFLK/DVKQ 
NYHPVILLLSIYRI *FDCLFLCKSHVEM 
*L.ISGVRVGAHWEVIGSREWFLMSGFTI 
SSCHQDCEFSQDLIV 


10198 


24099 


A 


10274 


275 


462 


KINLVLQLDFFFVEMESHSVTQAGVQWC 
HLSSPQPPPP/GSSGDSRASDS*VAGIT 
SVHQHAGL 


10199 


24100 


A 


10275 


2 


379 


RGPGGRGLGSGTCFHPGRGGEPIVYQHL 
F*FFGP\PENYIIjILPGFGIISH/IVTY 
YSGKKEPFGYIGMA*AMISIGFLGFIV* 
AHHIFTVGIDVDTRAYFTPPYHTFEEPG 
YIISRRRRKTKGFLF 


10200 


24101 


A 


10276 


2 


407 


FPSPSPSPFASVSLSLSFHGLPLPEHTW 
FIDGSPTKPNRHSQAKAGYAIVPSISII 
EATA/ PAP FTTC * QAEL I ALTRALT I AK 
GLCINI YTD/SHILHHHAVMG* K/GFLT 
TQGSS I IDVSLIETLLKAFLLPKEAGA 


10201 


24102 


A 


10277 


203 


545 


FLWRGQSLALLLRLECSGSILL/AHCNL 
GPHRFKRFSLPQPPE*LSNW\DYRRLPP 
CPANFCTFSRDGFTTLGQAGLQLLTSSD 
LPTLAS P KCW \ DYRHEP LVHGLTFNRS F 
NFFF 


10202 | 24103 


A. . 


10278 


250 


3 


AKWFLEMKSTPGEDAANIIEM/TKDLDY 
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beginning 

nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 

location 

ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H— Histidine, I— Iso leucine, 
K=Lysine, LHLeucine, M=Methionine, 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vaiine, 
W=Tryptopban, Y=Tyrosine, 
X— Unknown, * c= Stop codon, /—possible 
nucleotide deletion, V=possible 
nucleotide insertion 














DINIiVDEAVAGFGRIDSNFERSCTVVKM 
VSNS ITCYRELFHERKSRLMCQM* LSYF 


10203 


24104 


A 


10279 


333 


3 


SLILKLFQSMNHTHVRVTF I DQFCVCSN 
YSTD WL FPHLS PISIiRTPYS LR YN I K I R 
P INHPTMAMASKCS SERESR I SLTIjNQK 
\LETIKL/ SEEGMSKAQIG* KLGLLCH 


10204 


24105 




J 0280 


251 


3 


1TSRV J.DiiNJb' XI'S J. X '1'IjN.LiIXU.LiN'JL'U i 1 w K 

QRLL/DWASSPK*DS I ICYLQEMNFKYK 
I FKKIGRRI YYANI SQKKVGVAVXiTPDK 
V 


10205 


24106 


A 


10281 


22 


226 


TKINSKGNKELNVRAKTIKLLEENIDRN 
IiCDLGLGNCFLD*YCIIi*TKINSKGNKE 
LNVRAKT I KT.iT >E EN I DRNLCD LGLGNC F 
UDMTPNAQTTKGR \ IDKLDFIKI *NFWL 
DT 


10206 


24107 


A 


10282 


238 


383 


DAKRKVSSTEGDSEGVAHERRSAHLSAK 
P C P AEVE / AKP Q KAAE KG * S S E 


10207 


24108 


A 


10283 


127 


377 


R I PGGS ALGKE I RQ C FGVKGQAE I GKI E 
MSQPTQEREWEGIGSQKAAQVDHLRSG 
VQDQPGQYGETPTLLKIH*LAGHG/GRC 
L 


10208 


24109 


A 


10284 


328 


370 


VGPGPGF* SSGRVS SCCPGWS \ PSELK* 
SAHISLPKCWDYRCQPLRPATFFFKTAS 
H YVAQAKVQ WCNY S S LQPGTLRIiKCC I F 
KKIQWQW 


10209 


24110 


A 


10285 


341 


1 


KWCVLTAPLSGRFSVCLPFLGPPYFLQ 
HSDIEIKPINYPTVASKCSNERRSHTSL 
TLN*KLETIKLIEEGGMSKAETGRKLSL 
L/QSVSQWNAKEKFIjKEIRS ITPVNT* 


10210 


24111 


A 


10286 


199 


329 


LiFFS FLNCLSPHRI E ES Y* KE I KSAPP I 
TPGIPRK*TSLIAGKKNVFMDWKDQTKH 
NIP/LSQSKTITXjFNSMRAETVEEAEEK 
L / ERGC FMRFKERTHLHN I KMKGPG WA 


10211 


24112 


A 


10287 


.358 


387 


YTNAIiLIiYV*DQPGHLMGKDSLiFNKWC* 
KNEYS HAKKKQNH\ P * FSLYTKTNSKWS 


10212 


24113 


A 


10288 


111 


373 


SAQCQPDQHGKTPTLLRIQKLAGPDGAC 
KTPSQKKKKK 


10213 


24114 


A 


10289 


197 


375 


I RIGLCKF II* FLE TKLHTVAKVE CSS A 
I SAYGTLSLVDSGDSPASAS /RWGTTG 


10214 


24115 




10290 


118 


371 


GGCGEPRSCNCTPAWATK\GNSASKNKQ 
T \ KNKQQQQQN 


10215 


24116 


A 


10291 


214 


368 


CGYHQKNKKTNPKNS KCWQGCGE I GTLM 
HYWKDHKQV/Q/PL* TIVQQFLKTRI 


10216 


24117 


A . 


10292 


2 


376 


PFPLMAQRPTHLVPL.SASSGTSLLMPST 
LLWLPWLYKAHAVPTVPLSKLFDHTKPY 
AVSKNNIiGALSI*SY*VIPVPPRSRF/S 
FIRDKPTS 


10217 


24118 


A 


10293 


I 


379 


NILHNLDFFKNTTLEPQSSRLQRAVIAP 
LHFSLGNIVRRCFKKKSGWAQWLTPVIS 
AliWEADVAGSIiEFKS ETTXiGNMMRP CLN 
FKKKKKKAFIiFGAPGFFFFMGGKILFGI 
AAPF*SSGRVSSCCPGWS\PFELK*SGH 
I S LP KC * DYRCQPLRP ATFF FKTASHYV 
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beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
KHLysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














AQAKVQWCNYS S IiQ PGTLRIiKC CI F KK I 
QWQDI 


10218 


24119 


A 


10294 


201 


384 


GEERLSVQCSAFK*LR*HINISLQTFFF 
LRRESHSVA\RLECSGVISAHCNLCLPG 
SSDSPA 


10219 


24120 


A 


10295 


161 


400 


VCVIESVKANHLVDEHALKSLSFHLSLL 
FGV* G VC VC VC VCVYMCF / C VC I YVC 


10220 


24121 


A 


10296 


1 


433 


GEGGGGCSGWSGVETAGSRPAPGAEQD1, 
LPAPALGRWQGIiNSVCYPAWPQPNRRQP 
LALLCAAGLGPGQRLECSGMIMAHHSLD 
LPGSGDPPMSAG\SIAGTTGTCHHAW\E 
FHHVAQGGLELLG PSDWPPTVLGLQM * A 
TTPGH 


10221 


24122 


A 


10297 


209 


1 


LTFELLYHHYTLYFF * E SGS CSVAQARM 
QWCDIX3tiLiQPKALG\SGDPPASAS*DA* 
STGVD YYSWHTC I 


10222 


24123 


A 


10298 


225 


1 


RPGMPLSALPCTGHAPARQPHSAACSGP 
KCPWCQSPQPAVGWTV*VCPLPLSPCLC 
/FCVPQPWHGGLPLPCPPV 


10223 


24124 


A 


10299 


212 


367 


VTAGSGHMCYTWWE CRLEQSL * KTVWRF 
LK* LKVELRFD P AVPLL / S I YP ED 


10224 


24125 


A ■ 


10300 


97 


406 


LHRCTPAWVTERGCLKTTNQNNGQHNVE 
GEACSSRPFTTIC/EEKIHLVCALTEED 
* RIAAETTANTTD I P IGS AYR ILI KKLK 
PSKliSTC/YVAKQLSPGQLQTK 


10225 


24126 


A 


10301 


3 


422 


YRSSSKTb'hl IRRGFIPSSTFIilAHLST 
TtiSTLSHFQKGWVKLFFFFFFKKGFPFW 
PPGGGGRAKP*F/HGNPPPPG* KNFPAL 
PRQGGGKKG\GPPPPGQFWFFKKKGVSP 
WGGGAKTPELGNPPPGPPKGGGITGGPP 
PP 


10226 


24127 


A 


10302 


330 


1421 


RTKGR PLLXALP VLLLL FLHLAFFSAP P 
GPSSIiSSTEDLKWTLLSPAQSRKLiQPQS 
LHRAGVRDLQWQGRAGAELTAAC/WPAP 
PHPETAVLLCXJLPC/QEGRLLLESriNGF 
ALWSAEGTI FYASATIVDYLGFHQTDV 
MHQN I YD YIHVDDRQDFCRQLHWAMD P P 
QWFGQPPPLBTGDDAILGRLIiRAQEWG 
TGT PTE YS AFLTRC F I CRVRCLLDSTSG 
FLTMQFQGKLKFLFGQKKKAPSGAMLPP 
RLSLFCIAAPVLIjPSAAEMKMRSALLRA 
KPRADTAATAD/ SKMRS *LTLSESRLPN 
* FE I WDSAAPGTLPSLPFFGAFDGQHI F 
CIHTRALRGSTRTRQCIAAALEGASFRN 
HVEI> 


10227 


24128 


A 


10303 


172 


383 


KIENKVWQGYEEIETLVYCWWECKMVQL 
LWKTTWVVLKII.KI.GPP* \DS I IPLLGI 
NQGTE IRFCKRYSNT 


10228 


24129 


A 


10304 


205 


2 


NFYPA*KTDFFLVGSLFIYFE * ASHP I T 
\RLECSGTIVTHCSLDLLGSCNPPASAS 
HVSGTAGAHHHC I 


10229 


24130 


A 


10305 


423 


30 


IjLEGKLTNRKDIRTKTPSVRRLHQSPKV 

drtakmgkkqsrksgnsknqsaypppke 
hss s pameqs wmendelreegfrqss /m 
sklkeevrvngk*vktrekrlaksltri 
tnaekylkdlmelktsvlrpl 


10230 


24131 


A 


10306 


211 


387 


IFFFF* IKACSVAQAGGQGRHLGSIjQPP 
PPGLKQDSC/SPSRGVGTTGMTYHAQFI 
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Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylalanine, 
G^GIycine, H=Histidine, I = Isoleucine, 
K = Lysine, Lr=Leucine, 1M— Methionine, 
N=Asparagine, P=Proline, 
Q— Glutamine, R—Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














FFGF 


10231 


24132 


A 


10307 


229 


382 


ERAWQPG*HGET/ PSIiKKI>* KLSRCGG 
MCLYS KLKRRLR * EDHLNPRSWGC 














GS Y C F FVRRNLiIj I VK YMRS VHL L F IMLIj 
YRGD * WYVTTQYNFCILQKQEKLI YRI A 
/HVRNPSSLRGGGG* IT*GREFKTSIiAN 










253 


402 


KGGGGG VKTPARGNP PPGPPKGGEKRGG 
PPPPTQKGFF/CFFFKKGTRPPGGKTGE 
RAKPNE I PAPGKKEFFLPKPPEKGEIKG 
/ PRPPGQINF*FLKKTGFKKGGGGGSKP 

FGGKKKKKNPPR 


10234 


24135 


A 


10310 


182 


400 


LHVTMI IFWLYI.IKNMLLNFTYF/CYIF 
SVATSTFKIVCV/WLGAVAHICCPHTLG 
G * DGW I T * GQE FKTR1ANM 


10235 


24136 


A 


10311 


371 


400 


FI WGNI * ASNKH I KKT \ STSLAFRKIQV 
K\TTMRYLiSPPTPIRMT/ 1 / 1 * KSYSSK 
CW*GCEEIRSLIHCKMIQPI+KT/WQFP 
KI INIQLLYNPAIAI1I1GNRTRG 


10236 


24137 


A 


10312 


113 


404 


QRCGKIGTFTHCWWEYKMVQPLWKTDWN 
LLQRLNI E LPCOLAX PLLNL YPREMKTS 
IYICT*MFIAALFI I /AKWRQP/KCSSA 
DE/IGKMWHICIMK 


10237 


24138 


A 


10313 


146 


1 


FLTQKFFFFFETESRS*AWQWLDLGSL 
QPPSP\GSSDSPASAF*VAG 


10238 


24139 


A 


10314 


157 


434 


FFFSE I FFFGPGVFS PPPFFIGGPGI FF 
FCPLKKKILPPPRGKKFFFFKKGP/HPF 
FFFFFF*EGVSLIiLPRLECNGAILAYCK 
IHLPGSSDLP 


10239 


24140 


A 


10315 


277 


2 


FFFTAFFLGVPGVFPPPRFFKPGPGFFF 
LGP*KKNFFPPPRGK/IKFFFLKGPPSF 
FFFFFF * EGVSLLLPRLECNGAIIiAYCN 
LHLPGSSF 


10240 


24141 


A 


10316 


1 


388 


KTITHEVI TEC I WLKVLLLYFPNCMCI Y 
VRMHI CVCVCVHIS I * MDHI VSYKCIHI 
\LCIY 


10241 


24142 


A 


10317 


134 


399 


LGLPGGDTRGGGTIENPPFERALNSQNG 
GK\NP LLG PKTR PLGWGLCS TP KFWEKP 
RVRWDLN*TTPCLEKRAPWLKAKKTPA 
FEIRI 


10242 


24143 


A 


10318 


233 


3 


EFLFSLICAICL*CALKQFSSCFFCLAL 
IEIiLGSMGLLFH/SSLETFQPFFLQIFF 
/SFFPI>FFGANYTYNKPFEYSXiI 


10243 


24144 


A 


10319 


221 


8 


CDQPKS P PliS GVQMGD I L WL VFFL FLF 
AF*R*GSHCVAQAGLELPGSSDP/PQAT 
SQVAGTTGMHCTWLVF* F 










1 


407 


RS TPDPVYLGFTSGGCKTEKRAGCFFLW 
KLHTKGAPARCQPEIiSGMRC\R* PLLGG 
V/SQSRYMAVKDSLEEAGCPLSELKRCA 

GALSQEDGS FI YKPLTGAAAFLS YM 


10245 


24146 


A 


10321 


115 


398 


ADEELLLMNKQIKYLLEMESTPGKDDVN 
TVEMTKESEYSI/NIiKAVAGFERIDSNF 
ERLSAVGKML*NS I TCYKE I FH/RKSQL 
MSQTSLLSYFKLP 


10246 


24147 


A 


10322 


137 


454 


FFSLYFQGFFFFFFLGEKFFFSPGGKGI 
CPKPPFGPPPPPGKGFSPPPPPGKAKKK 
GPPPPPP\GFLFF* KKPGGGFLGGFFFF 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycinc, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \-possible 
nucleotide insertion 














LGGKIFFSPAGGKGGPQKPPFP 


10247 


24148 


A 


10323 


246 


372 


I S I KS YQLW \ LGGVAYACNS S TLGGRGK 
WIT*GQKFKTSMANM 


10248 


24149 


A 


10324 


415 


1 


IWGAPRVFPPPPP*NPPPEKNFWGPKKK 
KKFPPPPG+KIFFFKGPPPLFFFFFFFF 
SETESCSVT\RLGCSGGISAHCNLRLPA 
SSNS PP PSSIiRMPS TPVLRPHGDLLS PQ 
LSAWVLSWPFLPSLLLPPDLDFIVQ 


10249 


24150 










gdg itgl.spt i s kflffvfes i sls prl 
GSSGAIMAHCSIjTSRREPPCRSINFFVS 
VI TUIVNNLTNGKI GRMDFF* MM* YT IQ 
I CNXiQNTHCR P / KDTNCFKMKRRKE I IH 
ANSNQKRAGVAILIPDKIDFKPKNVTER 
HRMLVRES / I SQEDVT I INTY 


10250 


24151 


A 


10326 


24 


387 


NQGLGARNQKSQRVFTiAGFFFFESSSCS 
VAQAGVHW* DHS S V / PGS SAP PASAT W 
VGTTGVHNHPWLIF/VFFVETGSH 


10251 


24152 


A 


10327 


145 


396 


CDYFGEMFSYLIjIjNI ylcddpai pllpy 
VP YENENTDS LKDMTDAHS S F I HNS PKL 
ETKCPGTDEWI *MTKKSDYCKHW*GYRV 
TGTLLYYEGECKI V* LLWGNV* LF\LLN 
I YLCDDPAI PLLP YVPYE \MKTRIHS KT 
♦PMPIAALFIIAPNWK/PKCPGTDEWIN 
TL * HTHAVQ YKVAVER 


10252 


24153 


A 


10328 


94 


639 


AS I S S S I KWDNI S FLVGLLGE LNE IRNI 
CKAVNMKC2ALWKWKLIXiLFVI ENNNVSA 
SSNNFKIFYAYYIAKAVLLIITEHITLS 
I VL I LKKQNNTRAVTKCW\RGGREKETL 
I WL VG * E CKLVQ PLWRRVWRFL I KVK I E 
LPYD PT/I LLG I YSKERKS VYRK\NI CT 
PMF\VAAIiFTTAKTWKH 


10253 


24154 


A 


10329 


143 


3 


SAWFFFQDRVLL* /SPRLECTVSISNHC 
NLRVPGS SKNYCLSLI S IWDR 














RQGLTLFPGLVSNFWLPVLSSHLGLPKC 
WGYQCYHAWPS*CF*NRSI/WSGTVTHA 
YDPSTLGGQGKPIT*GQEFETSLA 


10255 


24156 


A 


10331 


382 


422 


RYKLDL * KliNTEIiP YDPTVPLLGNYPKY 
*T*AF\SKTCT*MFIAAYSQPPKGRSKC 
SSPNE*INKMWPIYIMR/YYSAIK 


10256 


24157 


A 


10332 


194 


1 


CKRQCNKEDRNl/CSK* *KPTILF*TTK 
RKG/ WLVAVAHACNIiS TLGG * GGRAS *A 
QEFKTSRANM 


10257 


24158 


A 


10333 


348 


1 


NSGQSWIMYPLLKKGVGRVSPPDPDIVN 
S IS MVWCGGRLFSNERFAS LLT I KKKLA 
GHGGTCLQSLPLGRLRREDRLSQGGQG/ 
C*GYS E * LCCRCI PAWAAEGDLWS WSPN 
SVAY 


10258 


24159 


A 


10334 


388 


2 


LPRFIFFPPKNPPLKGGGGFFFPPPPKIj 
KTPRLRKKKKKENFCPWKKFFKKKQTPQ 
RGRFFFFFFIiRQSFAVAQAGVQWCDLGS 
LQPSPP\G*SYHPARTRG 


10259 


24160 


A 


10335 


3 


396 


LLRP PCSLRHND 1 E I MPHDNPM I ASKCS 
SKKKSCISLTLSQKIiDVIKLSGKGMLKA 
KTGQNL/ GIjGHLYRTVRHAVNVKEKCIjK 
E I KNAALIiNTC3MTRK*NSLI ADTEKM* V 
V* IQE *AS /HSMPLRQNLIQNK • 


10260 


24161 


A 


10336 


165 


442 


GQMELLIIARNFGILLHFNLSIYYRRLF 
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peptide 
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Predict- 
ed end 
nucle- 
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last amino 
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residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

KHLysine, LHLcucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutnniine, R— Arginine, S == Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /—possible 

nucleotide insertion 














FSIPRYLFIFTYLFIYLFIYFESVAHSD 
AQVECSGTIMALT* KKNAS LP / ASAPRA 


10261 


24162 


A 


10337 


33 


430 


TGSTHASANFCVSLVWTQLISRSSGQGL 
ARLQLRCLPCC ILFFS FFSLVI YRLHS Y 
LRLPWRRICLQVPSGC*QNSFPCNFRIQ 
GSI>LL*/MQQW 














SRETP\SLLKlt3KLSGHGSACL*SQLFG 
RLRQENCLNPRGGGCSEWRQHQCTPV* V 
TEQDSVPKKK 


10263 


24164 


A 


10339 


414 


2 


ERAGPFSRKIFFSPPLAKKVKYPFPPTP 
RARAHKRPIFSRYVKKERGNSPDLFWPP 
FFPSLW*KQ*GPS1»GRLFYPV/SWGIFP 
SKP*GFPGPRPKVFCFLG/LPPGPFGGP 
FFFFFFGDRVSLCHSRWSAVAQSQLTAR 


10264 


24165 


A 


10340 . 


211 


422 


ASLGNGGRPS PSLAAALQFDL * LRAG 1 1 
FALFKAARQGDL S Lj * RLLLS F \ VCLCPA 
PRGGAYRGRQAFIiSWGGLHPDRASRIiFC 
LPKQAWAMAGAPPPALLLPCSLISYCRA 
SNQQDSGGIGPSEPSAGYKUjVPGFIiLP 
AEK\RTIYGGVTRFFRGP 








10341 






SRQRMPSTYHVQGSGDTKTGEKPCANGN 
KGCGSS I LRS CLHRKRFSF * S LKALVWC 
GGACLEAQLLRRLRQEAHIiSQGVPGCNE 
L* PGYCTPAWATEP \DS VS KKK 


10266 


24167 


A 


10342 


103 


458 


SLNVCVLTTLPSSCSPISLPLRGPPYSL 
RHRNIE IRPINNPTMAS KCSSKRKS * MS 
LTLNQKLE \MINLHEECMLTCAKTG *NLG 
LLHQ I VD Q WNAKKKFL KK I KS AI P VNT 
QMVRKQN 


10267 


24168 


A 


10343 


53 


408 


SFQFFCSVFS PSLWFYLLLVFDDGDVQM 
GFW CGYP FCLLVFLLTD KTLS CRS VGVP 
CRVR CQ CA\ LLGG AS QLG \ S RG S GVRD P 
LEEAVCLLSDLQLRAGRTTTLF*AVRQG 
HLSLQRLL 














NNFPPSPPRGGGNPGPPPPPRFFFFFLK 
NPGIiPPF*VIYLL/yFTLTPSFCIjWCyT 
ISVYGCGFVFFFFSFFFFFFEKKIFFFP 
RVEPKNRI PVYPKI LFPG*NNFPPSPPR 
GGGNPGPPPPPRFFFFFLKNPGL/HPF* 
KKGAKFPDLINPPPWP 


10269 


24170 


A 


10345 


207 


420 


WQKGCPVTFMELGWGPLYPCFL.HLWIQ 
PTSHRKYIHL/ WLGMWHTCNPNTliGGQ 
GGWIK*GQEIETSLTD 


10270 


24171 


A 


10346 


115 


3 


APRGAVKKR/RPWMGAATHVCKPSTLGG 
RGG* IT*GQEY 


10271 


24172 


A 


10347 


41 


421 . 


FTFVSVCVHFKGIFLSFLSFFFETEVPG 
SPLGPKRGGGGGNSGPWNPPPPGLRGSP 
PPPPQRTGKGGPYPPPPGNFFFKFFFRK 
RGV/PPL*HPGGPQSVT*GTPPPGPPKF 
WGAQSGGPVWGKNPLF 


10272 


24173 


A 


10348 


228 


1 


SKQVKEKYDSFLHKIQKQAT*NRTFFRV 
TKKHVEITKK/WAGHWAHAYNASTLGGQ 
GGWITLGQEFETSLANMVKP 


10273 


24174 


A 


10349 


405 


1 


GGFL KRG P P FFFFLRQGLS LLPRLE CS G 
MTVAHCSPNILDSSNIiSLRRSWDYSHQA 
QLIFIjVSF*VI J GSHSVAQflGVEIiLAS/P 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyl.ilnnine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, PHProline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KCWDYRHEPL/WPSQSQNLNSFFLDGFS 
LLLSRLECNAAISAHCNLRLLGSS 


10274 


24175 


A 


10350 


301 


546 


RHIFMNLPFHNLVKIIFHCRGKTLFLDI 
SNPYLKKQVSS SKPNCK/HLKE I * LFLE 
AKDYWLGWPHAYNPSTLGGQGRS ITG 


10275 


24176 


A 


10351 


873 


1258 


FCTQCYFHLLHKLHHHRPSPRLEPPPPG 
AALPP VPEGR 1 1 P1.PCPS SG PRQTTGPG 
KGRRHPAPHRT PAASAAS IiPARRTLPAA 
PW*PI>SFQNTTPGSA\PGGGNQPASGAG 
GNSRT 


10276 


24177 


A 


10352 


184 


576 


GLSPLSHL.KMAASSSEISEMKGVEESPK 
VPGEGPGHSEAETGPPQVIiAGVPDQPEA 
PQPGPNTTAAPVDSGPKAGLAPETTETP 
AG\PQKQPRPQTSA*AQEGNQRPTAAPK 
THAQETVS KPEVS KEATAD 


10277 


24178 


A 


10353 


110 


384 


VPGSPRKTRSPQQMERVLAPKTAKKGCP 
RPGDHRSRLQNCRDTRAR I KWADS VERP 
PWAG \ PVAHACNPNTLGGRGG WI T * GQK 
FKTSLVNM 


10278 


24179 


A 


10354 


123 


422 


RKQIANDKSKLFI INKYLNVNELNSPIK 
/RERLAGWIRKEKHDPSICHL*KTHLT\ 
KDTNSLKMEG*KKIYHANNY/QKKAGVA 
ILISDKIDFKFKKKWREK 


10279 


24180 


A 


10355 


36 


426 


PPERPQYSARVCVCVFYTECRGIiAPKPE 
CRGAI IAHCLLNVIiGS Q*S SNLRVWS S * 
DYTQLIFKLFVQTGSHCIAQAGFELL/S 
S KDP PTIiA/ FPKHWD YRH\ DHHSW 


10280 


24181 


A 


10356 


193 


3 


FQTRIFKISFFFKEFliKFKKFFFFILI* 
ESRSIAQAGVQWRr>LGSLQPPPP\GSSS 
PPASASLY 


10281 


24182 


A 


10357 


205 


377 


S P WFQP YYRLTI PYQ KC* LATVCI * VY 
* FFKLILYPAILLNS I I IWINFTSDPL* 
FSSH ITG * LSL I KNVWVGPGMLAHA\ *N 
PSTLGG/RGGWIT*GQEFETSLANMVKP 


10282 


24183 


A 


10358 


51 


376 


LCRPFQLCHGDRTDMWCPFPEELQDRCS 
CRSTSMRVAKGTTPCFLKVLQNIYTV*M 
FCRTFRKHGVVPLATRMLV\ I YKNSGPV 
TLQGRGTTYLSCHHGKAGRVCRATQHAA 
GVWTKQRACI 


10283 


24184 


A 


10359 


295 


376 


ASELTSQS PGSIiFENQKKGRC IAFLPC* 
KCNSE ILVHCWWGCKM\ WKI VW* FLKTL 
\RLELVYDPAS PLLAI YPRCI 


10284 


24185 


A 


10360 


451 


1 


CQDI KT I liRNEQE VL VI PS VTKYKQVGK 
SFVSRRYLWSGCVLCYAFVTEASKTWHH 
FPYRSPGFWGRSGGRQFDS * PKSQADFR 
APQGPPTPEPEGAERREPEIAPPPTADP 
SPKPT/HSPPRSGAPPSAPFPKLGRAER 
GVG\PGQGRPRP 


10285 


24186 


A 


10361 


2 


776 


FFFLRWSIALSPRLECSGAI SAHCKLRL 
AGSRHSPASASRVAGTTGARYHARLI/Ij 
/ YLFLVETGFHHVSQDGLNLLTS * SACL 
GLPK\C\WIYRREPPRPAYFIFFLRRSL 
ALSPRIiECVQWCDLRSL,QAVPPG\LHHS 
PAS AS LVAGTTGAHHHTRLVL I LL \ FY Y 
FLRWS VTLSCRLE \AGVQWHNLGS LQAP 
PPGFTPFSCLSLPSSWDYRCPPPCIiANF 
L YF* * RRGFTMLARMVS IS* PRDPPPSA 
SQSAHDLFIVGQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine y H=Histidine, I = Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R.— Arginine, S— Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10286 


24187 


A 


10362 


1S8 


365 


CPQLHTQSNLINMDASQRYQFELKIQAL 
/WAGTVAHACNPSTLGSRGGWIT*GLEY 
ATSL 






-jf — 




146 


1036 


GWAVYQTLiQAAWLGPRTESGVS FHGRFS 
LiPHHPPQPTLHAWPAPPLQPEPCVPLRE 
DPFMPESW*LPPSIPAEGVSF*ECDFSR 

CAPMHTPTHTQ\ PPHCS 1 FLPKI SFKSK 
HFY*LLFCKCLSSSVFSLPESILLLFIV 
ESVCQPGERCLALNMGWDGKGSGRRWAT 
KSQGVMDI AATQTPA/ VFS PV/ PATS CP 
AGVSGGK 


10288 


24189 


A 


10364 


229 


1 


PLENAKS I YWGKGRLLKNNARLFTVA* S 
WNQPQFSLMVDWIKKK\MWRMYTTES CA 
AI *R/DE IVSCAGTWMELEAMY 














RFFFFFPKFFWVFGLKSPFFFFG /RLGI 
* EKDGGFFFFFFFFPEMASGSVA\RLEC 
SGAVSAHCNLQFPGSGSLPASAPRVA 


10290 


24191 


A 


10366 


445 


3 


RVDNGVRGRYRGKRGERGR * TRWGWGHP 
QRWDQDRS RERPRETGRERAAGRRRETE 
AAETGRRRRERGRSDRNRN/RGRPRERM 
IEAGRDAGRRER*ARQESWTEKKTARQE 
EGDRD/RQGHQETRAHKGWGNN*D*GRQ 
AERSENRRMY 


10291 


24192 


A 


10367 


1 


491 


NTAPFPTS LESAF VWAVQDRVS / LLS P 
GPECRGAI \LTHCC/LRTPG/FKPISHL 
S / LPS / S WDYRHVP P PU3NL I LCRDNVS 
QSGLKLLAS SNPLTSAFP ST/ YD YRRE T 
PGPRASP*TQMQPCFMP/RS*SQP*TQI 

RPKAFF 


10292 


24193 


A 


10368 


572 


40 


IiHSFRQHFSNTYPQCTPGTP * VTQSLAA 
RPAGLPEPTSASGGAGAVTPPPRTD\PP 
SVRWSGKKGUDPVLRGGAPRPS PANPAP 
APQGPAPRKGTECVPTASKTPRPGGRWE 

PRALTGDLPSSGENRNGLSAFQGQRRPA 
QKSGENCEN 


10293 


24194 


A 


10370 


36 


517 


PFLFSDIDFLFSFYFLSFFFF*KRGLIL 
PP/ GPEGQGGNKG* WPLHPRG*REPPNL 
TPQTRGNKGNRQTGP V I FG I LRKNG FP P 
FGQGGFKPPGLREPPGLALQRGGNNGQN 
PYP/P/PGKFF*RTKRGEKSPQPPYNKP 
NKKFNPAPMEEFSSFKEGRGPSLN 


10294 


24195 










DVEQPKLRSYPTLKCKPRSAILTICPWA 
LQYRSHGLI FGFSWGFLSFETESCSVSQ 
AGMQWCDHGSLQT/ PTDSTSQAQAIDPP 
ASAP \ QAAGTTGACHHTCL I FVFL * RWG 


' 10295, 


24196 


A 


10372 


358 


~33 


DSIjGVPGAPLLSPGVTPHKQWFFRPYTQ 
I KKLGFFPQ I FFGFKV* KAPFFFWGFGN 
LKKKGGFFFFFFFSEMASGSVA\RLECS 
GAISAHCNLQFPGSSSLPTSAPRVA 


10296 


24197 


A 


10373 


264 


3 


RQTSVSGSNDGKSWPPADIDTI \KYVIG 
NRLITTQQ* LIKLTSKQMITIHNTKGRT 
♦SLILVSLI IFIATTNLLGLLPHSFTPT 
THCI 


10297 


24198 


A 


10374 


298 


384 


TL VI * KDGVHTTPHHQQHAYPGPKEYML 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucinc, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HKAPLLHS VSRDPS PFAQSSNCYNRS I K 
QE PVDPLTQAEPVPRDAGKMGKTPLSE V 
SQNGGPS /HK*GQYSGDV 


10298 


24199 


A 


10375 


84 


518 


IMKXiRQKSRSSLKIMRSKLQHTRAS/GT 
QLR/PMLGGNFIALN/AHIKKLGRTQIN 
NI>TSQLKEIjEKQD/RNPKARGRQEIT\K 
I*AELKETET*KKEIQKIYEFRV/WFIiE 
NINKP\PASRLIQRMIERNTIRNDKGNV 
TSAPPDV 


10299 


24200 


A 


10376 


70 


363 


FHQKNLVPGLWVQliLGRRTLQSQHPPAP 
PPSSSPEVNVPSEVQAAD*LCCIFYDVK 
VRKCSTPQKKTRKRKKGCLF / CAPGQTR 
/ CLC* KRGEKNI.VGKT 


10300 


24201 


A 


10377 


299 


2 


YPVAGRGNR*GCGAHALGVGGAAMPLGA 
T ISSVSLTI SSLCAFPRRGGLEPS LPAP 
HTPPPPPE/P*ASSPSRGEWHLVPPCMQ 
PGLPPLPPPRLSPLV 


10301 


24202 


A 


10378 


302 


3 


WPTVATTCS CERKSHTSLT*NQKLEGTQ 
LSEEGMSKAEIGQKPGLCHQT/VQVVNA 
KKKLLKE I KNATPVNTQITRK\QNSLIA 
DT\EKV*VIQIEDQTSHV 


10302 


24203 


A 


10379 


125 


375 


NEYVFLEKSLKIAGRVGSCLLGRPRQVD 
HLSSGAQD* PGQRGKT PS LQN I QKLSRH 
GGMCP/RRLRWEDHLS /SRGG*GCSETR 
ST 


10303 


24204 


A 


10380 


458 


3 


ITISTISCKATSFERAPRASRPPPDEWV 
SQAGPAPLRGWEN*GTRRRDDWPEASLK 
DPRRSGEPEAPL/PLRLML* SRLPPPLR 
WRRRPPPPDSHTPSPGNGKSPRVPRTGS 
KDKRPE ALPP* FAAGEPLEANVPPVQHG 
GGSVRSRGSYLTAT 


10304 


24205 


A 


10381 


327 


2 


liS APKKFF FQKS LFLGANFRS P P FPGRP 
FTNRFSDLNVNPLTKKIYFSHKSSNPPT 
FFFFFFETGSNSVT* SGVEWRNHNSLRP 
ESPG\SGDRP*V*HTTGAPYHTWLV 


10305 


24206 


A 


10382 


754 


1 


NTALETDTTFSGGGGGHQLCVRSRVSGG 
G* DIMPDLHNHPLQQTSGRNQRL.QGEVT 
DPSPGMSGGAQTGPKAPPRPPSALRHSG 
S IiQGQP * FAHLYCGT I P YPCYQTAPLPR 
APPPTPTGH*CPGEA\PQDEIjPVCGCSP 
WPRPPGQAQAGRNKH* FSGQRSSRDTTA 
CS PCSQSKQS PSRP PCVNDS / PGQGPGP 
KAPAGPPPALPEASLGSVHCS P* VGVMT 
PTRPGHFLVPGKVTLEQAPGGGASGGPV 
P 


10306 


24207 


A 


10383 


39 


433 


SHDQTYTEEELLFMDEQRKWFLLMESTP 
GEDAVNWEITTKNLEYCINLVDSTVA/ 
GVSNFERRSTVGQMLSNS IVCYGKI F/R 
*KR/SQSMWQTSLLF\NKLSQPP*PSTP 
TILISQQPSMWRQDPPSEERL 


10307 


24208 


A 


10384 


283 


1 


NVFS CKPTS S CX, I S PDS PTPDRRFS KAD 
TYRNFMFPLRRHTHMHTHTYMHTHVHVC 
TQLCXiHLRPH/ ITHGLSL/TCAPPVL*R 
PSFKPLHPTPV 


10308 


24209 


A 


10385 


12 


471 


S PP WWTHGFRYWE PT P PRFSVPHDALGQ 
FHTALFRLiTRRFQIjIiSSCINISPHTCIjP 
PCIjGIiCVGGPIjEPAVCWAR/CPP*RGMV 
SGWKASPLG/CPPTLLPPC 
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NSDOCID: <WO. 
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PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
nucle- 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla)anine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, Jt=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10309 


24210 


A 


10386 


287 


1 


SFCVRSRPAPTSGRTPFPAAPPFPAAPP 
FPAATTPSTAEDGVPG*G/PAPAANCSP 
HSSQRGSAAMGIPLPQAS/PPAPTPAPP 
SP*SLPFPPAPMY 


10310 


24211 


A 


10387 


336 • 


3 


LIEGKLTNRKDIRSKTASVRHHHQSPKV " 
DKPTKMGRNQSRKAENS \ KI KS ASS P PK 
ECSSSPATEESWMENDELREEGFR*LVI 
TNFSELKEKTGJOiNPAAIiVCGRSEDQTT 


10311 


24212 


A 


10388 


15 


470 


APSPDAMGRN FTEEDKATITSLRCKVNV 
EDAGGETLVRLQWYPWTQRYFDS FGNL 
SSASAI MGNPKVKVHGKKELTSLVDAI K 
HLDDLKGTFAQLSELHCDKLHVDPENFK 
LLVNVLVTVLAIHFGKEFTPEVQAYLHK 
MVTWANSLRYK 


10312 


24213 


A 


10389 


3 


438 


QTQREPTMVIiSPADKTNVKAAWGKVGAH 
AGEYGAEALERMFLAFPTTKTYFPHFDIj 
SHGSAQVKGHGKKVADALNNAVAHVDDM 
PNTLSDLNDLHAHKLLVDPVNFKLLNHC 
LLETLDGHLS AE FTP \ AVNAS LEQFTAF 
EITWI/T 


10313 


24214 


A 


10390 


2 


444 


TMS FNTLAICLDCLCSTLQPTRS I PGYP 
SSPLPGNPTPPMTPSSSVPYMSPNQEVK 
SP FLPDLKPNLNS IjHS S PSGSGPCDELR 
LTFPVRDGW/SGALPPAAQPGC 


10314 


24215 


A 


10391 


405 


3 


KMILTCLALPAQHCPPSACPQTMAQPPL 
PLS I KGAASMSDKLPYKVAD IGLATWGH 
KALDIVEN/EMPGLMGMQEIjYSASRPLK 
gac i adclq i tvetail i etlfslgvqe 
QWSSCS IFSTQEHAVAVFAEAGMP 








10392 


248 


5 


RLNKVGGHGGNYEHLYVNKIiETLQEMYK 
FLDIYNLPNLSLEEIE\NLNRPITSNKF 
ESVIKHFPTKKISGUDGFTAEFLTNN 




24217 




10393 


262 


3 


LPRDRQAGRSQGP WPQAVGRARKMP P F 
I S YQ VYS KNF FE I ES CS VTQAG VQWRN I 
SSMHPPPP\GSSDSPASASQAPGIPGAH 
HHA . 




24218 




10394 


287 


410 


ALKG S S R I L I MLGAWLT P V I P A/ LWEAK 
AGRSSEVRS 


10318 


24219 


A 


10395 


175 


1 


GGREVDDEIPLVRESTIFHI FFFFFETE 
SHSVAQAGGQWRDLGSLPSS\PPG\SCH 
SPA 


10319 


24220 


A 


10396 


661 


963 


STL I AF I VI S TLFPLLDMTE I YFSLLDE 
IVDTLGEGAFGKWECIDHKAGGRHVAV 
KIVKNVDRYCEAARSEIQVLEHLNTTDP 
NSTFRCVQ\MLEWFEHH 


10320 


24221 


A 


10397 


1 


268 


PQTHREATMGL S TAD WTNVKAG WGKGGA 
HAGEDGAEALERMFLSFPTTKTYFPHFD 
]jSHGSAQDKGHGKKVADALTN\AVAHVD 
DMRNAL 














S RGATL I YVJJN ENGE PGTRWAKD GLKL 
ESGPS IKALDGRSQVSTPRFGQTFDAPP 
ALPKATRKASGTVNRATEES V\ KTKGHL 
VQEHP 


10322 


24223 


A 


10399 


16 


233 


SQPCRAVYM\VGEKNDIPFELRIVDLIT 
GKS ILGFGEPKSQEGDR 


10323 


24224 


A 


10400 


177 


1 


KIjLDI.GQPQSPALAACL.EYSGAIAAM\C 
NLCLLGSSNPPTSASRIARITGVHHAEP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

1 oca t ion 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possibie 
nucleotide insertion 














YA 


10324 


24225 


A 


10401 


185 


419 


HVNKDTLESHI.VWKLKSALLGRKSESVT 
GPSCHPEPSDKTVEL/WQGAVAHACHLS 
TLGGQGGWITLAQEFETSLDNIV 


10325 


24226 


A 


10402 


104 


3 


GIILIPKPGRDTTKKENF/RPLIPIDAK 
ILNKIIiAN 


10326 


24227 


A 1 


10403 


3 


344 


Sqvmavagpapgagarprldlqflqrfl 

QILKVLFPSWSSQNALMFLTLLC/LTLL 
EQFGNYQVGLIPNQYYGVLGNKDLEGFK 
TLTFLAVMLIVLNSTVRSFPLSVSLVLS 
SPV 


10327 


24228 


A 


10404 


3 


404 


ARAQR RGDLS ATGRNWS P LP PAGLP AT V 
VLRHS GS LMAATCE I SN I FSNYFS AM YR 
S \ ED YTLAS VP PAATFGADDLVLTXiSN P 
QTS LEGTEKANWMG \ EQS Q FWPKTQGLD 
WISFQAEKNKYEASAIGFPRWDK 


10328 


24229 


A 








NQLSSIMVMFKKIRSFEEDFNDPEKVYG 
SGDKVAG\RIiIVEVCEDTRVKAVRILAC 
GVAKELRM 


10329 


24230 


A 


10406 


26 


461 


GEVARRSCCGAMAGTALKRLMAEYKQLT 
LNPPEGIVAGPMNEENFFEWEALIMGPE 
DTCFEFGVFPAIIiSFPLDYPLSPPKMRF 
TCEMFHPNI Y PDGRVCI S I LHAPGD /DP 
HGLREQA PERWSPVQSVEKIIiLSWSML 
AEPNDES 


10330 


24231 


A 


10407 


1 


463 


QQAAKMAENSGRAGKSSG / T / P ARGRGR " 
C / PAEQ VI AG FNRIiRQEQRGLAS KAAEL 
EMELNEHSLVIDTLKEVDETRKCYRMVG 
GVLVERTVKEVLPALENNKEQ IQKIIET 
LTQQLQAKGKELNEFREKHNIRLMGEDE 
KPAAKENSEGAGAKASSP 


10331 


24232 


A 


10408 


180 


446 


KKAAASVSAASGSHLSNSFAEPSRSNGS 
MVRHSSSPYVRYPSDKPFLNSDL.RRSPS 
KPTLGYPESNSRAI FSAL/KNLQD/KI R 
RLNL 


10332 


24233 


A 


10409 


1 


417 


EAGLVTMEEIGILGEKAQDE I PALSVSR 
PQTGLS FIiGPEPEDLEDLYSRFX KKLQQ 
ELEFLEVQEEYIKDEQKNLKKEFLHAQE 
EVKRIQSIPLVIGQFLEAVDQNTAIVGS 
TTGSNYYVRILSTIDRELLKPNASVALY 


10333 


24234 


A 


10410 


305 


53 


LEAQPHAPPLLGHHLPCWRIPSHLGTCR " 
RPRAMRVQGSEEEGVA/ GQAPPLPWVHR 
GSLP LVRRRPAGKP TGGNAREE KGKVEG 


10334 


24235 


A 


10411 






NNSRKFGNGTLKljVEVEIYPLISESIPD 
KSELLLTPQDKKHAPLPSAVPNPSA/CP 
DLRPHTTPASPVS \ PAKTVETR P S APQG 
PLPGPVRRPRGRFSVRAASPQTSGRPGA 
PRPPGPALREATDAPRAATPPIAALAGH 
S 


10335 


24236 


A 


10412 


74 


322 


MDEMATTQ I SKDELDELIE AFAKVDLSS 
NGFI CDYELHELFMEANMPLPGYKARET 
IQKLMLDGDRNTDGKI S \YDE FAYVSM 


10336 


24237 


A 


10415 


1 


415 


PQTHREPTMWASADKTNVMAAWGKVGA 
HAGEYGAEALERMFLTFPTTKTYFPHLD 
LSHGSAQVKGHGKMVADALTNAVAHVDD 
MPNALYAIjSDLNAHKIiWLDP/VNFMLt.S 
HCLLVTLVAHLPDEFTPAVHACLDKLL 


10337 


24238 


A 


10416 


19 


454 1 


APSPNAMGHCTEEDKATITSLWGMVNVE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidinc, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glut»mine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DAGRETLGRLLWY PWTQRFFDS FGN\ L " 

SSDSAIMGNPKVKAHGKKVLTSLGYAIM 

HLiDDIiKGTFAQJjSELLCDKLHVDPENFK 

LLC3NVLVTVLAIHFGIEFTPEVQASWPK 

KMNAVA 


10338 


24239 




10417 


2 


395 


ALSSSQPSAPMHPFYTRAATMIGEIAAA 
VSFISKFLRTKGLTSERQLQTFSQSLQE 
LUVEHYKHHWFPEKPCKGSGYSMIGVNP 
\KMISLILQACIEGGI 


10339 


24240 


A 


10418 


2 


374 


LSTPHAFGTGKWGTFISLLFLFSSAYSR 
GVFRRDAHKSEAAHR I KDLGEEDFTALV 
VI AFAQHIjQQWP FEDH\VQIiANDVTEFA 
TTCAADESSENCDQSLHTLFGDLLCSRA 
TLDESHCVQDGRP 


10340 


24241 


A 


10419 


48 


347 


DPKAQLPEPLRVLWQLTYAMAAGSR / TS 
IiL/LAFALLCLPCFKAGPPNRPLSRLFD 
HAMLQAHRRTTAIDTYQEFENLYPKD/Q 
Q YS / FLMTPTSSALDS I PTPS 


10341 


24242 


A 


10420 


166 


1 


NFLKKPGP FQKF FQRGLPL I LTFFFFF \ 
FETKPHSVSL\LECGGTVSAHCNLCLPV 


10342 


24243 


A 


10421 


2 


251 


LGCTQHRSQELVAAATSHQTCIQASEDV 
KEI/ FARARNGKYRPLKIS I ENGQLMIG 
SY/SSQPSDSWDNDYDSFVLPLLEDKQL 
CY 


10343 


24244 


A 


10422 


251 


1271 


KEDLSPRAPMSGTQSTITDRFPLKKPIR 
HGSILNRESPTDKKQKVERIASHDFDPT 
DSSSKKTKSSSEESRSEIYGLV\QRCVI 
IQKDDNGFGLTVSGDNP\VFVQSVKEDG 
AAMRA\GVQTGDRI I KVNGTLVTHSNHL 
EWKLIKSGSYVALTVQGRPPGSPQIPIi 
ADSEVEPSVIGHMSPIMTSPHSPGASGN 
MERITS PVLMGEEHNWHNQKVE I LRKM 
IiQKEQERLQLLQEDYNRTPAQRLLKE IQ 
EAKKHIPQLQEQLSKATGSAQDGAWTP 
SRPLGDTLTVSEAETDPGDVLGRTDCSS 
GOASRPSSDNADSPKSGPKERIYLEENP 
EEQKKG 


10344 


24245 


A 


10423 


198 


569 


QRNMVGQRLVEPRRLKPGFINVKSYNGD 
WEWHQGFFIiVCFFFLRWSLALSPRLECS 
GAISTHCNLCLPGFKQFSCLSLPSSWDY 
RHAHHCTQLIFVFLVEMG\FHHVGQAGQ 
GFFSLEKSLTII 


10345 


24246 


A 


10424 


2 


343 


PQTQREPTMVLSPADGTNVYAAWGKVGA 
HAGEYGAEALERMFL.SFPTTKTYFPHFD 
LSHGSAQVKGHGNKVS \ DALTNAVAHVD 
DMPNALSALSDLHAHKLLVDPVNFKIiLS 
HC 


10346 


24247 


A 


10425 


31 


342 


RAAVMPREDRATWKSNYFLKI IQLLDDY 
PKCFIVGADNEGSKQMQQIRMSLRGKAV 
VLMGKNTMMRKAI \ RGHLENNPALEKLL 
PHIRGNVGFEITKEDLTEIR 


10347 


24248 


A 


10426 


196 


2 


KNLIISQTKKVICPNTLFLVFPKRVFFF 
FFFFETESRS / VS PRLECSA\ S I SAHYN 
LCLPGLSDSS 


10348 


24249 


A 


10427 


1 


405 


RLECWLEPPHGAGLQGLGWVWSCSVSTG 
PTMQALV\ LIiLC I GALLGHMS CQNPASP 
PEEGYPDPDSTRALEDDEDPIFIDPENK 
IiPEAGSNFGYDIjYRLRAIMSPTSNVLIiP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H= Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W«=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PLMVATDLWDLSLEAQQLTES I I 


10349 


24250 


A 


10428 


223 


361 


EGPTEENMAAKVFES/IGKFGLALAVAG 
/GPVNSALYNVDVGHRAVI FE 














LTQRE P TMALS PGDMTNVKAAWGKVCAH 
AGDYGAEALERMFI.YFPTT \ KT YFPHFD 
LSHGFAQVKGHGKKWDAI.TNAVAHVDD 
MPNALAALSDLHAHKLRVDPVNFKL.LSH 
CLLVTLA 


10351 


24252 


A 


10431 


273 


1 


NHKDGKKQSGKTEKSKNQSASPPPKERS 
SSPATEQSWTENDFDEFREEGFI \RQSN 
YSELKEE IRTHGKEVKNLEKKLDDRLTR 
ITNAQK 


10352 


24253 


A 


10432 


211 


3 


S FLWKFCLRG I PSHVSCQ S A / LL LGGAS 
QLGYSGVRDPLEEAVCLFSDLKLHAGRT 
TTLFNAVRQGHLSLQ 


10353 


24254 


A 








TNQEKKKGDPNKIRSGKGDTTTDNTVTI 
I RDYYG / HTYTNKLiKN/ LEEVDKFLDTY 
YLPRFNQREIENMNQPITSNE IESVI KS 
LQTKKS PGPMAS 


10354 


24255 


A 


10434 


219 


1 


SFLWKliRLRGAPGCMRCQSAXLLGGVSH 
LGYSGVRDPLEEAVCPYSDLNLCAERTT 
TLFKAVREGCLSLQKF 


10355 


24256 


A 


10435 


133 


2 


FPKPPGNFFFFFETESHPVTXRLECSGA 
I S AHCNLHLPGS S DS PA 


10356 


24257 


A 


10436 


248 


376 


KGGVFFFFFFFFETESCSITQTGAQWHT 
\YGSLQPRPP 


10357 


24258 


A 


10437 


250 


1 


KGTRKELTFIEQLIjSAKYWAAGCMHYFR 
GSGKYSNGYPVFVFVFIFSETESCS\VA 
QAGGQWCDLVSLQPPPP\GSSDSPGASF 


10358 


24259 


A 


10438 


354 


2 


KRGLKKNPFFNPRNMGEFLKLWGPNPKK 
IFKNCKIFKMGLKKKVLGFYKNSKNLKR 
GKliNPFVIFFFFFFFFFFFIiETGSYS IA 
\RLKCSNSGTNTAHCSPELPGSRDLPDS 
TTQVAG 


10359 


24260 


A 


10439 


126 


1 


GMYDCVCLFCFAIKEGREREKEREKERK 
RERQRE KER / ERERDKER KRE E R K 














IQKPTADTKKFFKKKLKYTTRENHLHNK 
EDRKEGKKEEKTTKQHGSS KSLL INKTL 
NINGLKS PI KTHRVAE / WNKNK0PMLYC 
IQETPFTYKDTHRL/RIQGWKKI FH 


10361 


24262 










IFFPHPFF FKKS FRLS PQVE I QG I FLG P 
WKLiFPPGFNNFS /CPNLPNNLGFKGLPP 
RPSYFFRIFCKKKGF/H/RVYPEGFPFL 
ALGFFPLFPFKSWGFQGDPPSWGPFFFF 
FFFVLVAEGTLSEGCRLIRGRWKNIjVP 


10362 


24263 


A 


10443 


122 


372 


LYNDP I S I YQVTLYRRVFFWVL YSANWS 
LRWKANFLFFSFFKMASDPVAQAGVQWC 
DLDSLQPQ PP\GL\S DTPTLASQVSETT 


10363 


24264 


A 


10444 


171 


3 


RHPTMAGSS EMFLGSGFGCRLTVFVLFE 
TESCSVT/RECSGAISAHCYLSLPGSSN 
LP 


10364 


24265 


A 


10445 


173 


3 


CMNWCFSIFIFLKINFILETESHSVSQ 
AGVQW\LITVHCSLKLMGSSDPPSSASQ 
V 


10365 


24266 


A 


10446 


187 


329 


SFLWKFCLRGVPGHVRCQSA\LLGVASQ 
LGYSRVRNPLEEAVCPFSDL 


10366 


24267 


A 


10447 


27 


226 


YSYPVYIiliLHIFLCWKCStiYFIYFVFEM 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid' 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 

ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W = Tryptophan, Y—Xyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














ESCSFAQAGMKWCDLrQPLPPG\sSDSIjA 
PAS QVAGVDAA 


10367 


24268 


A 


10448 


119 


296 


SP IHYILVI 1 1 CFIHVILIil 1 1 FSEVGS 
HSVAPAGV\ SGAX I AHCNHKLHiGS S S P P 

vsvs 


10368 


24269 


A 


10449 


111 


363 


SIiKKLVFFFS I YTHNFF P FFDPTLKNST 
RHS LYIiSLSIjSIjFIjSHS V\ SRLECGGT X 
IAHCSLNIjPGSSDPHTSMSQV\AGPTGS 


10369 


24270 


A 


10450 


218 


1 


QRFWGPGPNKVFKNLGSAFFIiGRGKIjFF 

leiwggffffflvfflrqshs iaqagvq 
wcd\lnslqplpprfk 


10370 


24271 


A 


10451 


179 


3 


GPFK1>TIjFFMGKFALGRBKCHFFFFETE 

sctvapaglqwrdlgslqpppp\gssds 


10371 


24272 


A 


10452 


71 


232 


SSPLSDLLFNIVLEVIAKDIMQEVE/ IK 
LILIGKDEVKWSLIVDKI ILYVKNSKH 


10372 


24273 


A 


10453 


153 


3 


ARGELY FLE IGGGFFFFFLVFFLiRQSHS 
IAQAGVQ WCD \ LNS LQPLPPRFK 


10373 


24274 


A 


10454 


338 


221 


MGFLHVGQASLKXiLTS I CP \ LGLPKCWD 
YRHEPPRPVIPS 


10374 


24275 


A 


10455 


214 


366 


YNYKYLIHYYSYLFFIFIFF/NIIHYIF 
FILFFYLTSFIFLSYFFFFFITLF 


10375 


24276 


A 


10456 


155 


2 


DRVSLLLPRLECNGAILAHCNLCLLGSR 
HS/PRVAGITVAHHHARLIFCIF 














eg p r w adhlr pgvqdhlg \ qhg et p s l \ 
QKIQKLARRGGTC/IjCQSQLLGRLSQEN 

PKKKKE 


10377 


24278 


A 


10458 


222 


404 


KQRPGRAQWLTPVI pklweaelgr\mle 
artlrpawat 


10378 


24279 


A 


10459 


122 


362 


LSEDQLNCYPQRX.HHFIFLPVITRVQMS 
HYLCQQSCLFHFF/SFFFLYFEMESSSV 
T \RLE CTGM I S AH SNL YL.PGS SQ 55G 


10379 


24280 


A 


10460 


3 


102 


AASTLiALSPRIiECNSAIIAHCKLH/IiPA 
FTPFS CL 


10380 


24281 


A 


10461 


133 


339 


KLSKFQGSKKPEIKANSI^LALFYHNYI 
TRQCFTS YSNFFGDRVSLLS PRLECNGV 
ILAHCNL/RTPGFK 


10381 


24282 


A " 


10462 


3 


193 


ASNMMTE LKSQ I P ILTLNVNGLNAPLKR 
HRVAGWINDPTICYLQETYFT /CKNNHR 
LTAK 














SS YVRKGKRNKACFTERVP / PVNDI I 


10383 


24284 


A 


10464 


1 


332 


KKNLFLSKDCLSSXiFRFSKSPPAIIiGPP 
RGWTPKTRFIGPKAFFFKKGSLSLGRKY 
HFGLKPVP/ CPPFLSKTRFIiPGVPFFFF 
FFFETESHSVA\RIiECSG/AISAHCKLR 
Ii 


10384 












PP \ LRKDKRNKA YFTK/RFS PVKTD I IS T 


10385 


24286 


A 


10466 


335 


3 


PFKKFYFKISTRALLEPVFFGKKIRVPP 
QRLSP I WAPLFKRGPPNSFQVNGVPLFF 
FFFFVRHGLALS SGVECSGTRLECSGAI 
/ STAHCHIiCHLDASHPPTSASWVARPTG 
T 


10386 


24287 


A 


10467 


818 


1008 


VHTDVIAD I LKS INNAE KRQT / RRCFLS 
XWLQGHCPFLTVMMQHGLCGVELGGLPG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alantne 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G^Glycine, H— Uistidine, I=Isoleucine, 
K— Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














PGDFCKRPR 


10387 


24288 


A 


10468 


111 


1 


MVS FGRPRQADHKV\GVRDQPGQHGETP 
SLLKNIRI 


10388 


24289 


A 


10469 


204 


1 


LWG PILKIFCQI L.KTICEGNFRGPT F F I 
FFFEMESQSVTQARMQWCDLSSIiQPIiPP 
GS\SNSRASAS 


10389 


24290 


A 


10470 


190 


330 


ERI KKQDLS I CCLQVTHFTFKDSQRIjKV 
KGWKK\ I FHTNKNQKRI WT 


10390 


24291 


A 


10471 


162 


2 


IKGPKRAPPANKGRAFFFFFFFFETESC 
SCLQAAVQWHDLSSPQPPTP\GSSDS 


10391 


24292 


A 


10472 


120 


3 


SWGFFFFFF\EVESYSVAQVGVQWCDLG 
SLQAPHVMILVK 


10392 




A ' 








RRALPCPANFLYFYFLSLLLFFETEFHS~ 
\VAQAGVRWHELGSLQTPPPG\SSDSPA 
SGEYSANYTGEYSANYTGEYSANYTGEY 
SVNYTGEYSVNYTGEY 


10393 


24294 


A 








AVTGRAGSMVAPR\ PLRRLVLFYQGKLR 
SMAGNFWQS SH YLQRILDKHDLIiNERQK 
DLKVLSEEEYWKLQRFFTKVIQALCEHL 
KLRQPGIATATAYFKRFYARHSLKS IDP 
VLMAPTCVCLAS TAN 


10394 


24295 


A 


10475 


289 


2 


TQGFPPINNSFYALYPKQKIPFFSDKLF 
RVKGFKGPVFHYGVKDFKNKDGVFFI FF 
FEMASGS VAQAGVQWRNLGS LQPP I \ PG 
SSSLTTSAPRVAS 


10395 


24296 


A 


10476 


138 


402 


ITGRAPPKPFPLKRGYLSGGLTNIGFSF 
SFSFFETESRSVTQAGVQWRKLGSRSWL 
EST / S PDSSNS / PAPAS RVAAI T 


10396 


24297 


A 


10477 


2 


336 


ARETNDTTQTRLLPANNTMTRSDPYI S I 
LTLNVNGLNTPIKR\KRVASWINKQGLM 
GCCLQETHLS SHETHNDTHKLKI KTWRK 
IYQANGKQKKARW/ ILISDKTDFKP 


10397 


24298 


A 


10478 


5] 


250 


RKVSLFKKWCWNTliFICKK\l>SLDTYRF 
AKINFEWITDLNVREKT/IKLLKGNIKE 
NICDLGLGKDFF 


10398 


24299 


A 


10479 


216 


2 


GFTRKALSGFFKTGWFWVFPNFSKLFFK 
KNFSERFFFFFFFEMESHSVT\RLECSG 
PISAHCNLHLPGSSLV 


10399 


24300 


A 


10480 


191 


3 


FSTDGNVNCLVKYSIYIYIYFFFFLIFF 
FEMKSCS/VSPRLECNGAIiSAHCKLCLP 
GS CHS PARA 


10400 


24301 


A 


10481 


135 


768 


LGGQVLSCASQPVSLFPLFFFL I SFLGC 
LLAFLSACFASTFS CVLGS PETS FSPCL 
HPPPKVLSSHP\PPCSRP/CPQPPKG/P 
PLPKHACPP 


10401 


24302 


A 


10482 


126 


333 


MWGQRYCVMCFVI \VLCLFLL.V/WWW 
FLW/ CS VYI VYWYGYAWLF I FFFFFFS 
LliCFAVCLFVWLLDRF 


10402 


24303 


A 


10483 


240 


2 


ATEKMMGLGTSVRGLSGSNPSYGRVRER 
I GGG VCL WSQ I LLRGC\ LRWEDHLGLGG 
GGCSE PRS SHCTPTWVTEQDP VLV 


10403 


24304 


A 


10484 


136 


2 


NRPSYIFSFFFETESHSVAQAGVLWHDL 
SSLQPLPP\GSSDS PARA 


10404 


24305 


A 


10485 


161 


328 


EKNFKPYGSCRHFYIiFIYFETESRSVA\ 
RLECSGTISAHCNLRLPGSSN/SPAS 


10405 


24306 


A 


10486 


193 


347 


ILFS FLLF I YFFETE S CSVAQAG VQWHD 
HGSIiRSRAPG\SYNAPALDSQGTE 


10406 


24307 


A 


10487 


114 


2 


GAWQSEVQWRDSGSLQAPPPG \ SRHS P 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=C)utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=ProlSne, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vnline, 
W=Tryptophan, Y=Tyrosine, 
X«=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AS AS Q VAG L V 


10407 


24308 


A 


10488 . 


314 


49 


IRRSTERS E PRY SMVCRRQPRQGGPCL \ 
RS QLLG RLRQ E DHL S P I AGDC S E PRLH\ 
LACTPAWATEQE PVLKKMS PTLFCLFLW 
PLPCQ 




24309 




10489 


291 


1 


KSPPPPNRRGLTLRSRGQIGNGVSFYYE 
APVTDFIFFFQSLVLSTQAGLQWCNLSS 
LQLPPPG\SSDSPASASRVAPVSQGHAT 
APQPGRQTLS S C 


10409 


24310 


A 


10490 


226 


3 


GSRCTMPRRRLAHAFPAARMPKRKVSST 
EWATEEEPKRRSARLSAEPAPAKVETK/ 
PEKAAGKDKSSDKKVQTKLV 


10410 


24311 


A 


10491 


2 


153 


ARGLVFFVYLVEMTLCHVAQAAVKPLTS 
SEPP\ LGLPKCWDYRREPPRPT 


10411 


24312 


A 


10492 


123 


342 


TH YLCLLRFFFLDRGFLLL PRLDC /NAP 
ISPNUtfLRLPASSDSPASASRVTGITVV 
NHYARLTLLF I FQNE TLF 


10412 


24313 


A 


10493 


194 


2 


IPGWPFKNFSLFPLFFFFFEMESCSVAH 
AGV/GSSDFPASASQVAGTTSMRINAWL 
IFVFFSRN 


10413 


24314 


A 


10494 


373 


1156 


KVQLKVYLFFFLRLSLAI1L1PRI1ECSGTI 
LAHCNIiPPSRFEQFSCLSLPSSWNYRRP 
PPHHARLI FVL/ SVETGFHHVGQAGLEL 
LTSGDPPASA/F PKCWDYRHE 


10414 


24315 


A 


10495 


181 


3 


S FLWKFS LRG VPGRVRCQS A \ LLGGASQ 
LGYS GLRDPLQE AVC P FSDLKliCAGRTT 


10415 


24316 


A 


10496 


156 


2 


AASTTRLGDRDQPGQHSETPSLLKIQKL 
G\SGGGHLRIjRQENCLiNP\GGRGC 


10416 


24317 


A 


10497 


157 


300 


IQSWFNILKSIHVIHHSNRLK/EVNHIV 
STDAEKAADKIQHPFLIKKQ 


10417 


24318 


A 


10498 


3 


289 


LVKKMLTGSTLGKSYRHSPFS inqghna 
LRKAAG\PLPRKAGY\L.QGFSPIjRYGL\ 
WDGKDLTIHQPDTREGSVLSR I S KRGRP 
L/CSCLPLGTECLS I 


10418 


24319 


A 


10499 


138 


1 


WYGLIYLLLEMESCYVAQAGVQWHDLPS 
LQPPPP\NSSNSPASSC 


10419 


24320 


A 


10500 


138 


1 


WEGLIYIiLIiEMESCSVAQAGVQWHDLGS 
LQPPPP\NSSNSPASSC 


10420 


24321 


A 


10501 


200 


343 


LEAHSFLSFFIFI I I FFDTKSHSVT\RL 
ECS AAI S AHCSLLIi P GS SNS 


10421 


24322 


A 


10502 


84 


339 


PRDGKRRDKQKQETGGEHYKSQLRGRLR 
QEDHVSPGGRGCSEPRSRHCTPAWAT\E 
SNSVSKKS 


10422 


24323 


A 


10503 


141 


362 


EWNCCVCC/CCCCCCRCF 


10423 


24324 


A 


10504 


198 


320 


TIKKQKNTGGLIFNSYILPPLFLEPGDL 
\RFLDVDNRVKLPV 


10424 


24325 


A 


10505 


30 


356 


GYPCNPILYYMSTVGFTTSLLDFDIPLE 
HPILKVCPTHTRALRDLSSDRSNPGRIL 
S TSNS S LY /EKDPRNKAYFTK/RPS PVN 
DUST 


10425 


24326 


A 


10506 


16 


375 


KKFFKKKKKKIFFPPFPLKNFFFPKRVN 
FFGRGWPKKSPPKKKVFFKKFPGGFKKP 
PLKKKKI FFS PPVKMGP PKGFFKKGPPP 
FFFFFFFFFFFFFFFFFFFFL/LCLIPR 


10426 


24327 


A 


10507 


156 


2 


AASTTRLGDRDQPGQHSETPSLLKI QKL 
G \ S GGGPLRLRQENRLNP \GGRG C 


10427 


24328 


A 


10508 


313 


2 


QSKWLLEEESTLGEDAVNIVEMTTKDLE . 
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nucleotide 
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amino acid 

residue of 

peptide 

sequence 
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ed end 
nucle- 
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correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutnmic Acid, F=Phenylalanine, 
G=Glycine, lI=Histidine, Hso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Clutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YYRNLVDKAAAGFERIDSNFESSSTVAK 
ILSNCIVC/Y/REILHEGKSRPMWQTSL 
SYFKELPQTTQHSAI ITLTSPRA 


10428 


24329 


A 


10509 


325 


1 


ECAKEMNAETKRNIAI DATETQRI IRDC 
YEHYNKMENLEAMD KFLDTHNL PRLNHE 
E I ENLNEP I MSNETETV/ I /NSLPSKKN 
SGSEGFTAEFYQRYKTPLILKLFKK 


10429 


24330 


A 


10510 


126 


14 


DSSGQVQWLKPI I PV/LWEPEAGRSPEV 
RSSRPACPTW 


10430 


24331 


A 


10511 


435 


769 


PFKFPPLSHACDGGSLFCRLRLECSGVI 
TXAHCXPQLPG/SQSDFPASTSQELGP/ 
TGACHYTRLICCREEVL\HVAQAGLELL 
CPKDPPTSASQHARDYRH\GHPAQL 


10431 


24332 


A 


10512 


147 


412 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNE 


10432 


24333 


A 


10513 


251 


466 


QTCPTGS PASLLELFNS IATQ WELVRS L 
I AGNALKDEND \ S AVKMLAS L I MS YKAA 
AAEDYKADCPPRNPGP 


10433 


24334 


A 


10514 


286 


1 


SFLRKFCLRGVPSRVKCQSA\LLGGASQ 
LGYSGVRYPLEEAVCPFSHLHLHAGRTT 
TLFKAVRQEHLSLQRILLPLVGRVGRPN 
DSRCDRVIGH 


10434 


24335 


A 


10515 


28 


428 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10435 


24336 


A 


10516 


263 


2 


I KKPADS E S AGAL I LDFPAS QT VRNS FL 
LLTSHS VC YFCYS SPNGLRQQLKI 1 1 FF 
FFLRQSCSV\SRLECSGMILAHCNLCLP 
GSNN 


10436 


24337 


A 


10517 


62 


292 


PATLFSGVILPPTNS INAFSGSDSKSAH 
K/CEMLGTENFNFFTCHTHI IAEHSNST 
HTHTHTHTQTHTHTHTPLLFYE 


10437 


24338 


A 


10518 


172 




RGKKI FFPLBKGKNLVWWITALFFFFFE 
TDSHS V\ TRLKCSGTIL YHCSLCLPGS S 
DF 


10438 


24339 


A 


10519 


42 


194 


I LVETF \ CLRFVSLENLVKMRFLKLKKK 
KKKKK3CKKKKKKKKKKKRKKKKI 


10439 


24340 


A 


10520 


120 


1 


I FAKRW P I KFFP F FFFFFET VFHS \ VAQ 
AGVQ WCDLGS P 


10440 


24341 


A 


10521 


202 


375 


S FLWKLHLSGVPGHVRCQSALTGG \ ASQ 
LSYSGVTDTLEEAVCPFSDLKLRAGRTT 
TL 


10441 


24342 


A 


10522 


272 


402 


PKIFYVTKXAWNYYPYTITEYTCSF/IR 
KFFIHIETKYEDNKGSN 


10442 


24343 


A 


10523 


147 


439 


LILYVNYI SRKSTVYDSRHPCTYYLI PE 
KNCYKYHCNFLCT/SSSIPFPTPSTPSR 
AS PKKKKKKKKKKKKKKKKKKKKARGG 


10443 


24344 


A 


10524 


7 


415 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10444 


24345 


A 


10525 


321 


442 


TS FSFLFFF / LETE FHS VA\RLECRGT I 
LVHCNLCLPGSSNS 


10445 


24346 


A 


10526 


477 


3 


I EGKLPNRKDIHTKNPS VCRHHQRPKVD 
KTTKMGKKQS RKTGNS KNQS AS P PP KER 
SSSPAMEQSWTEKDFDKLREEGFRGS IY 
SKL/REEIRTNGKEVKNFEKELDEWITR 
I TNAKKSLKDLKELKAEAQELRDECTSR 
AASANCVAVPQAFDRSHGVQ 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan. Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=po5sible 
nucleotide insertion 


10446 


24347 


A 


10527 


265 


411 


RFVCSTIKVLRDLSSDRSNPGRCLSTSN 
S/SPVRKDKRNKAYFTK/RPSPRN 


10447 


24348 


A 


10528 


81 


390 


RFVCSTIKVLRDL.SSDRSNPGRVLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNDI I 


10448 


24349 


A 


10529 


253 


23 


RIFLFYFHSFLRQGLARAEGQPHNHSSL 
QP P PPG \ SGD P P AS ASRAARTTS AYHH I 
PLFFFRS YCVAQAKI INMLSLL 


10449 


24350 


A 


10530 


10 


398 


RFVCSTIKVLLDLSSNRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSP 


10450 


24351 


A 


10531 


289 


423 


RFVCSTSKVLRDLSSDRSSPGRFLSTSN - 
SSL/QRKDKRNKAYFTER 


10451 


24352 


A 


10532 


152 


401 


PHCSIHSQVRIQANVHKGHRQRTYGSVI 
PHIIiPLHVLKKTFSLRDFHFSVSLKKNL 
VLTCLHLFL/GVRTPRNDPFVSMMLIiFT 


10452 


24353 


A 


10533 


2 


515 


LDRISLLLPRLKCSGPVSAHRNLYLP/G 
FKRFCSLGLPSSWDYRRAS PRVAKFVFL 
VDTGFHHVGQAGLKLLTSSDPPSLTY/P 
QCWDYRREPHLAAF 


10453 


24354 


A 


10534 


469 


566 


PTKNTKI SREWWCVPV/ IWEAEARESLE 
PGKQRL 


10454 


24355 


A 


10535 


354 


47 


KPKNLPGGGGPPFYSSFSKGKKIPLPWK 
AKVPINQILPLPPHPGGGKKTPFPKKKK 
KKERKKKKMATVQJCGM / PHKYYHGKTGS 
YNVIQHAVGNAVTNRTRGS 


10455 


24356 


A 


10536 


1 


409 


TPKKRFFPKKPKGFFFPPPKKKKKIFSP 
PPKFWPPQKNFKKAPPPFFFFFFFFFFF 
FFFFLKSNPFFFFF/SKNPILKPPIiPPLi 
FRVFPPPKKKKKKDP 


10456 


24357 


A 


10537 


204 


435 


TSGSLYKLLGMGDLGLGKTTIFKQGGHQ 
NSS \GWHPEATTG/VNWAFKGLHWDPKA 
LAHIiQLWNIAIi/QER/FGKMTKACY 


10457 


24358 


A 


10538 


208 


382 


RFVCSTIKVLRDLSSDRSNPGRFLFTSN 
SSLFER/DQRNKAFFPK/RPSPVNDIIS 
T 


10458 


24359 


A 


10539 


63 


409 


GWRLTNFP VGKTPS TSGTGLLFI FLLLN 
LMGRLKPSS I S MKREFRKQNRGQKC / E F 














LKIFRGGPNFPGGEKIFFFFLGGGIKSR 
GGGFG 


10459 


24360 


A 


10540 


3 


1322 


RGYAWPNGALPASTVPCGFAACPGEFLC 
SVNGRQAPAGS SGFLPPVPSLCPHTVCR 
ATFQCKEDSTCISLRRFPWGSETHLLCP 

ASPLPLLLCPPHHLPPSPDCGLQGPSSR 

ALIADRWVITAAHCFQEDSMASTVLWTV 
FLGKVWQNSRWPGEVSFKVSRLLLHPYH 
EEDSHDYDVALLQLDHPWRSAAVRPVC 
LPAR/SENREQGLQ/CSWITGWGALREG 
GP I SNALQ KVD VQL I PQDLCS EAYR YQV 
TPRMLCAGYRKGKKDAC/QGESPG 


10460 


24361 


A 


10541 


154 


3 


GVGNFFFFFLRWSLTLAQAGVQWRSLGS 
LQP/LTSGSLFRQFSCLSLPSTRP 


10461 


24362 


A 


10542 


3 


406 


LMVLWTAHLPALAPGSRTSLLLAF\ALL 
CL.PWLQE \ AGAVQTVPLSKLFDHAMLQA 
HHAHQliAIDTYQEFEETYIPKDQKYSFL 
HDSHTS FCFSDS I PTPSNMEETQQKCNL 
EUL.RISLIjt.IKSRIiEPVQVI.TSMF 
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NO: in 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


10462 


24363 


A 


10543 


3 


414 


HSSAHAYGTYESSGKRKYLQRPSRTAEL 
KKALKEKKNRLLLQQRSKSVTSS S SSSR 
DSSARDSSSEREETSTSSSSEDSDTDES 
SSSSSSSAYSTNSSSSFDSDSDSSS\SR 
GRSTSTDSSSADD/STNEEESELE 


10463 


24364 


A 


10544 


240 


509 


TFTSFI IGTPNLGEFCKCNNFG I /GKFR 
MKVACQ I ETLG ILS FFFETE S RS VTRLK 
CGGAIIiAHCKL 


10464 


24365 


A 


10545 


275 


2 


GCSRPFDTAIPLMGPYPEEKKLLFEKNK 
/DTCTCMFIAAHFPVAKMWIQPKCPS IN 
EWIKNPVTHIYIHTHTYMCIYIYTHICV 
CVIHIYI 


10465 


24366 


A 


10546 


233 


2 


WS WMAI QS VTjPQTAFTEIjKNKYS CIQET 
PIXjFYFLKQSLPLSPSLECSGVI iahcn 
/ LQ / RPGSTTFPAS AS KVERTTGA 


10466 


24367 


A 


10547 


2 


420 


KKKI PNRGGCLKRDTKAFLKTPPMLQRR 
TPPAFPKGPGGFLPDFNPLPPSKGNPPI 
YSWERFFPRFFKNIWAPVPKANLL>SFFF 
FSIFWRTGSRYIAQAGLELLGSSYPPAS 
CIiPESWDSRR\DHRARPTI 


10467 


24368 


A 


10548 


190 


1 


AASTFFACVCFRLFFVFVFETESPSVAQ 
AGLQWHSHSSLQPPPNLG\SGNPPALAS 
PVAGNC 


10468 


24369 


A 


10549 


155 


1 


ARGEKYFLEIGGGFFFF1jLVFFL.RQSHS 
IAQAGVQWCD\LNSMQPLPPRVK 


10469 


24370 


A 


10550 


223 


392 


RFVCSTIKVIiRDLSSDRRNPCRFLSTIiN 
/ SliAVRKDERYKAYFTK/RPD PVYDI I S 
T 


10470 


24371 


A 


10551 


183 


3 


WQKLLFL FGTE S CS VARVG VQ WRHIjS S P 
KPPPP \GSSNS / PAS ASRVA\TGAHLHA 
RLIFVFL 


10471 


24372 


A 


10552 


297 


10 


TDEEI1I1CMEEKRKWFF \DMAS I PGEDTV 
NTDEMTKT/DIjEYYINLVDKVSAGFERT 
DTNF\ERSSTVSKMLSNSITCYRKI/F/ 
RERKSQQMWQTSFLSF 


10472 


24373 


A 


10553 


509 


3 


LLTDRTLS CRS VGVP CSVRCQCA\IjIjEG 
ASQLG\SRGSGVRDPLEEAVCPFPDI/QL 
RTGRTTALFKAVRQGHLSLQRLLLSF\V 
CLCPAPRGGAYRGTQAS LS CGGLHPVRA 
SRLLCLPKQAWAMVGTPPP\TSLPPCSL 
ISDCCASSQRDS / VGVGPSKPGAGYNLV 
VRRF 


10473 


24374 


A 


10554 


262 


453 


GWG I LTANIiLVFGGGS LS X S ER X FFSP L 
ATGFCSVAQAGVQW\LIIAHCSLEFLGS 
SD 


10474 


24375 


A 


10555 


112 


372 


KKKGGGPP FFFF F I FFLIRQRKAKLWL 
AFNRRGLRKPD/IFYYAGVGKIVKKKYS 
GF FIELVTTCGKNYIiMCALAVI DPRDSN 
I IRS 


10475 


24376 


A 


10556 


50 


358 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSIiY/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10476 


24377 


A 


10557 


203 


350 


QLHADKVSRFATHAECRGVIMAYFNLNL 
PH/FKRSSCLSLWSSWDYRHAP 


10477 


24378 


A 


10558 


98 


387 


IQDTDLIRGVYRYRDKKRAREKDS PPER 
EGEREREIEKQRKKER/ERKKERKKEST 
HQKE 


10478 


24379 


A 


10559 


237 


3 


EVFFKNI FE IESCSVTQARAQGQDHGSL 
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amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phen> (alanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QPPTPG \ SSSPPTLASQSAVI TGVNHHT 
QSGLTFQSPKQKALGKMCISSGG 


10479 


24380 


A 


10560 


37 


359 


RXKCGGPVIAYCKLEILGSSNTPTSTSW 
VAGTT/TCHHAQ 


10480 


24381 


A 


10561 


170 


3 


RPFFLPWGGKKFQGVSFFFFFFLVFFEI 
GSHS VP \ RLKCSGT I TAHCSLDLLGS SN 


10481 


24382 


A 


10562 




388 


LPEPLMVLRTAiW>VAMAPGSRTTLL\LA 
FALLCLPWLQEADAVQTEPLSMLYDHAM 
LQAKRAHQLD IDT YHELEETY I PEDQKH 
SLLHDS QTSFCLSDS IATPSNMEETHQK 
SNLEMVRISLLLIETWLE 


10482 


24383 


A 


10563 


224 


392 


CVDPIIjKWFPLRTATRQGCPLS \lfniv 
LEVLATAIRQEKEI KDP/QIGKKEVK 


10483 


24384 


A 


10564 


185 


2 


RYFGGPDSPQNGKTVFNPLFFFFETRV\ 
HSCCPNESNGTISTHCNLCLPGSSNSPA 
SASQVA 


10484 


24385 


A 


10565 


175 


324 


APG YGTKYRVI LCK I LLLRRLRHKNHLN 
PGGRGCS EPRSHHC / I TPAWVTE 


10485 


24386 


A 


10566 


3 


367 


HELLWSTIKVLPYLRSDRSNPGGVISTS 
NSSLC/EKDKKNKAYFTK/RPS PVNDM I 
ST 


10486 


24387 


A 


10567 


128 


391 


GETCEGVYGPQQGGKQTWKVFHSKKDEG 
PCSKTWDVGLGSSPQFIFLKKVRFFFFF 
FETESHSVTRLECSGTPGT/HCNLCYPG 
TS 


10487 


24388 


A 


10568 


251 


485 


S FLWRFPLKGVPGHVRCQS A \LLGGAS Q 
LGYSGVRDPLEDAVCLLSDLKLCAGRTT 
ALFKAVRQGHLSI.QRFLLPr.FSY 


10488 


24389 


A 


10569 


176 


3 


FVPPILKKMGPFLKKKEGKPFFFFFSEM 
ESRSVT\RLECSGA/TISAHCNLCLPGS 
SMS 


10489 


24390 


A 


10570 


212 


386 


RFVCS TI KVLRDLS SDRSNPGRFLFTSN 
SSLY/EKDKRNKAFFTK/RPSPVNDI IS 
T 


10490 


24391 


A 


10571 


134 


489 


SLFFLPSPRDRWSLTQSTEMPPSWKFPG 
REIRSPLLAMLHSLLLQPTAFLCPSSTL 
LLSLKPDSLRTGSLFLSFFS FLYFEACS 
VA\RLKPSGTISAHCNLCLPSSSDSPAS 
AYIVSG 


10491 


24392 


A 


10572 


1 


470 


GQSRGIPLLLTLDLEKPVSLLLSVTNLY 
SKNSAQFSTILQTLSFPATFTPSPSIPL 
SSAYFFFFSDRVS /LLSPRQECSGLNLG 
AHCKPQPPRFKRFS CLSPSS \S WGLTRR 
HTTTPGL I FCI F / S VEAGFHHVAQTGLE 
LLSLSNPPASA/FPKCWDHRR 


10492 


24393 


A 


10573 


242 


410 


VMLRMQIYFSLHNPIFFFFETGSHFIA\ 
RLECSGM I PAHCNLCFPGSSKAPASASQ 
V 


10493 


24394 


A 


10574 


236 


1 


ARTF F I H I LKAS CGDNAI TG VLWHRATW 
CPSVLLDRDSLSERVCEILCIHAC/VMR 
IHEHTHTYIYAHTHTHTHTHTHT 


10494 


24395 


A 


10575 


241 


35 


RS S YLFI FNFFVETRS VAQPGLKLCCSA 
WAGTPG\SSDPPTSSSQSAGI IGMSLNL 
FLQWLI IDQAGNTN 


10495 


24396 


A 


10576 


294 


72 


FKNFKSPLFSLFFFLILGTRFCYVAQAG 
LK\SWPQGDPSALASRVAGTRGVHRHTQ 
LQVSFNYYKVLAMHSGSQL 


10496 


24397 


A 


10577 


1200 


1625 


PDGWSYSSGHSWLPEEASNGRKAWSSCW 
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nucle- 

locfltion 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H= Histidine, I— Isoleucine, 
K=»L.ysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














VPPQLiSPPQAPPSPATHLGPTPPIA/SP 
PQAPPPPATHLGPAPPPPLSSAP\PPPD 
TLLGPAAPPP PSSGPDPFRYS P /EVPPQ 
LSPPQAPP 


10497 


24398 


A 


10578 


226 


325 


ERHTHTPHSHTRSHTRLTHTHA/YTHTH 
AHTHA/YTHTHAHTHFPS 


10498 


24399 


A 


10579 


108 


320 


TINVFLKVNVMKLSYLRFNTRNCSFFFL 
AGVQGCDHGSLQLQPPG/SSCDPPSSAS 
RVAGTTGAYQHTQL I F 


10499 


24400 


A 


10580 


1 


288 


ARGBRERERERERERERERERERERERE 
REREREREREREGIjSFFFSRSRGGGKLW 
CVC/RPSARPRERERHAVFSAPPSIiQKK 
KKVFGGREGTHS LFYPPCGNSLiRR 


10500 


24401 


A 


10581 


1 


476 


REWGLALSPRLECSWIIAHCSUSSLGl. 
KRFSCLRLPSWKDYRCAPPCPSLNLFFV 
ET\GPHYVAPGWSLLLGSSD/PPPIASP 
KCRDYRHE 


10501 


24402 


A 


10582 


210 


346 


PHCS IHSQVRIQANVHKGHRQRTYGS VI 
PHIIjPIiHVLKKT / FSLRDF 


10502 


24403 


A 


10583 


316 


461 


LPN I EGWEFFVSETGFCSVTHAGVQWC 
DEHGSLQSQTFG\SSDLPTSD 


10503 


24404 


A 


10584 


172 


383 


NLDIiYLTS YI KINS KWI IDLNVIAKT I K 
IiLE \ KNVAGENI PDLGEGKEFLD\ KTQK 
ALS \TNKKIDKIiNFTR 


10504 


24405 


A 


10585 


273 


374 


GPGTEAHTRNPSTLGGRGGAI T/ R/GQE 
FKTSIAKM 


10505 




A 








KIJCCKIIlkL.GIKISFAVIVVVEIGFHF 
CYLGWVQW\PIRTHPGSSDPPASASQSA 
GITSMSHHT 


10506 


24407 


A 


10587 


199 


3 


S FS YIRVffljGHI KHTRHKAS CEVYANCK 
AKG I I TG I KKK/ RFWPGT VAHACNPNTL 
GGRGRWITRLV 


10507 


24408 


A 


10S88 


216 


1 


PRGAPTCMRCQSAL\LGGVLQLGYTG\V 
RDPLEEAVCPISELKRCAGRTTALFRAV 
RQGRLSLQKFLLPFLV 


10508 


24409 


A 


10589 


190 


3 


I S AQPWAAFEKR F I F I YRAPA/ LKFM I F 
FSPFKGVPFFFTFFFFFEMESRSVARLE 
CSGVISAH 


10509 


24410 


A 


10590 


369 


40 


KTERNS ININKNDVHTKTPS KGRQHQRP 
KADKSREMRKNQHKKAENSKNQNPSSPP 
RDHNSS PARKQNWMENEFDELTEVG/ FR 
KVGNSSELKEHILTPCKETKNRTRGW 


10510 


24411 


A 


10591 


114 


398 


RFVCSTIKVIiRDLSSNRSNPGRFLSTSN 
SSLC/EKDKRNKAYFTK/RPS PVNDI I S 
T 


10511 


24412 


A 


10592 


132 


1 


ILYLWCVCVCVCVCVCVYLLILiV/CKVL 
SCLGEPALSVGAFQMS 


10512 


24413 


A. 


10593 


3 


287 


IGFITLNVGLYEYLLCYIYVYFFQVNYI 
VFILVLAVFIiLKNRLLRYNTAIjYNEEVT 
REKISHYFI/NSYIHTVGKINKND 


10513 


24414 


A 


10594 


2 


363 


RAEVGVMAPLYLLLLLSGALGLTDTWAG 
SHSLRYFSTAVSRPGRGEPRYIAVEYVD 
DTHFLRFDSDTAIPRMEPRKPWVEQEGP 
QNWE\WTTGYAKANAQTDRVAIiRNIiLRR 
YNQSEAGE 


10514 


24415 


A 


10595 


156 


3 


NGLELDSDDGCIALLFFYFETEFCS \VA 
QAGVQWCDLGSLKPQPLGFKPTRP 


10515 


24416 


A 


10596 


80 


306 


MDEELLFRDEQRKWFLEMESTGEVAVNI 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 

amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 

location 

ding to 

acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=Phenylalanine, 
G— Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 

Q=Glutamine,K=Arginine, S=Serine, 
i — j nreonine, v — valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 














V/EMPIKDLG/YYRNLVDKAVPGCERVD ' 
FNFE I SSTGWVWWLTPAI PVL 








10597 


232 


389 


ILCWVFCCCCCCCFti'i'iit'JjVIQAGVHWC 
DLDSLQ/SPPPGSSDSPASAS 


10517 


24418 


A 


10598 


463 


154 


MGQGKPRFPLFFHPFYIFFFLKQGFSLS 
PRVQGRAKKVPGTIaASWGQKILLiPQPPR 
/ RVRQENCLNPGGGGFS E P K I P PS FPAW 
GEKGGS LLKNLKKKKTKHI C 










•708 




IAjP ±*yCjr H AiAFPr r Jf rFETESRSLAQV 
GVQWWDLGSLQLPPPG\SSDSPGLSKKF 


10519 


24420 


A 


10600 


200 


3 


CQPELSCMRC\RQSLLGGLSPSGGMEVR 
DPVEEAVCPLAELiKRCAGRAIiLVRI CCS 
LQSQQAGTFK 


10520 


24421 


A 


10601 


73 


427 


RVtfCSTf KVLRbLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPGNDI IS 
T 


10521 


24422 


A 


10602 


193 


401 


GE VS IiLS PRLE CNGVI S AHCNFR/ L PGF 
KRPSFFFL 


10522 


24423 


A 


10603 


181 


395 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNDI IS 
T 








10604 


384 


10 


PRRPGGECVHCLGDSLVTDRRYSGLRSS 
DQTLSGDPMEAVPWSNSIjSCSHYRLKPA 
SGITWGCKQAGFFSFFLFFKMESHSV\T 
RLECSGAISVPCTCLCLPGPSNSPAPGGS 
LGPRSSRPPWTT 


10524 


24425 


A 


10605 


215 


379 


RFVCSTI KVLRDLSSDRSNPGRFLSTSN 
SSLC/EKDKRNKADFTK/RPS PVNDI I 














QTHREPAMVLS PADKTNVKAD \ WGNVDA 
HAGEYGAEALERMFLS F PTTKTYFPR 








10608 






IQKHGLI IRCPQETPFKNNDVGMQKVKG 
RGE KS IMQIL / YFFFFLNS /HLS PRLE/ 
CSGAI SAYCNLRLSHSSDS PA/VDGTIG 
ACHHTQLIFLFLVE 














CSWSWFCGCVRVSTLIKKKNFFFEMEFR 
S\VAQAGVQGRDFGILKPPPPG\SSDSP 
ASASRV 


10528 


24429 


A 


10610 


137 


1 


HKCPSTDEWDERV/WCMYTKEYCSALTK 
EEIMSFGKTWMELENIMI 


10529 


24430 


A 


10611 


201 


334 


SFFWRFCLRGVPSRVKCQCA\LLGGASQ 
LGYSGVRDPLEEAVCPF 


10530 


24431 


A 


10612 


146 


2 


KTPLKKGLKRVPFFFFFETESCSVTQAG 
VQWHNIGSLQVPPP\GSRHSP 












259 


TTMPGLSFCTRTKSNKNLSFLiTLDGSAR 
VDLFLYLIjLFRTI FQNF / CGRDKVLLCC 
PGWSQTPELKQLI CLDLPKYWDHRYEPL 
CLAYPFALGQNLTKI 


10532 


24433 


A 


10614 


153 


2 


GVLMVSDGVLSMLANLFFS FLFF /LFET 
QSHS I \SRMECSGVI SAHCNLCLP 


10533 


24434 


A 


10615 


213 


8 


KPP FFFPFFFLFFFLRWGSHCWSG Y / C 
KGT 1 I VHCGLELLG S SCS P PAFQVSG 1 T 
GLCYCAS 1 FI FEE 


10534 


24435 


A 


10616 


146 


1 


ILTIRDYWVKITLFFFFEPKFHS/VLPR 
LECSGAI S AHCNLCLPASSDS 


10535 


24436 


A 


10617 


208 


2 


NNFWFSSSGKYGQYFSSETESRSVTWGH 
DLYSLQPPPPNGSSDSPASASQVAGITG 
SCHHAQLILVFIVE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

Jocation 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=JLysine, L=Leucine, M=Methionine, 
N*=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10536 


24437 


A 


10618 


120 


2 


S FFFFFETEFRS VAQAGVQWHDLSSLQP 
PPPG\SSDSPSL 


10537 


24438 


A 


10619 


162 


1 


SFHWKLRLRGAPGCMRCHSA\LLGGVSQ 
LGYSGFWHPLEEAVCPFSDLTLCAG 


10538 


24439 


A 


10620 


193 


419 


TFFFFETESRSVAQAGVQWRDLGSL.H/S 
QPGQQSETP 


10539 


24440 


A 


10621 


313 


460 


KPGLWRGTRSQKQVIjFFEPESHSVA\KL 
ECGGTISAHCNFCLPRSSDFSA 














PTGCPKQEQVQAMLRPWFFFVFETQFN 
S /VPRVECSGTI SAHCNLRFPGSRDSPA 
IuASRVAG I CRR/ CATTAQL I FVFL.VETG 
FCHLVG 


10541 


24442 


A 


10623 


67 


430 


LARRTWKNQHSSTHGCDLFFFFFFFFFF 
LIKKGVFFFPPEGGHGHRFFFINIKFLN 
YKRFFFCLFEDVLMISVGPPIAFFFFFF 
WRGGFSYIiliLRCV/LTFLALGAPLFAPM 
FFLFLFGEEL 


10542 


24443 


A 


10624 


3 


207 


PGGQLGSECSGVRMDEDVL\TLKSLIIG 
ESGVGKSSLLLRFTHDTFDPELAATIDP 
SSWNPARNAATP 


10543 


24444 


A 


10626 


151 


380 


KMLFGQVQWLTP I I PALWEAEAGGSLE \ 
LRSSRPALKRSETSS PPSLPCQKKKKKK 
KFPRAGGQMLEVPFPGRVGAG 


10544 


24445 


A 


10627 


210 


3 


NFCQVKGFWKQKS SQ I KGFGKNKKKKGG 
S FFFFFFET ESHS VT \ RLECS GT 1 1 VHC 
SLDFPGSSDPPTSA 


10545 


24446 


A 


10628 


237 




LCFLSGAS CS PY I D VYTEVNGVNPTFI I 
LTNKTNTFGPLFLI I FFFGETESLSIA\ 
RIiDCSGAISTHCKIiCLPGSRKSP 


10546 


24447 


A 


10629 


1 


352 


RGPLS IQDYISKALFFLLLFGFLLLLLF 
FEVRS CS LTQAGVQWCNHSHIS S LQSPT 
PG\RSPSDPPASAS 


10547 


24448 


A 


10630 


48 


200 


AIKPKNFliFFSETKSSSVTQAGVQWRDL. 
SSPQPPPP\SSSDSPAPAN 


10548 


24449 


A 


10631 


209 


376 


S FLWKFCLKGVPGCVRCQSA\LLWGASR 
LGYLGVRDPLYEAVCPFSDLKLCAGRTT 


10549 




A 








AGFFFFFLFFFIjETESRFVA\RIAQCSG 
AITAHCKIjCLWGSSNYHAS 


10550 


24451 


A 


10633 


18 


522 i 


PLYSLTKRHTPREA/ KDNLKSTQLLSVI 
DAI S EGP I EG PVDGLKS VLLNST PVLDT 
EGNTNISGVTWFRAGEQEQTPPEGFES 
SGSETVLGTE VKYDTP ITRT I TSANIDR 
LRFTFGVQALVETTSKGDRNPSEVRLLV 
QIQRNGGWVTEKDITIKGKTTSQYLASV 
V 


10551 


24452 


A 


10634 


127 


2 


NLSPLFFFFETGSRSVTQAGVQWSDLGS 
LQTPPP\GSRHSPTK 


10552 


24453 


A 


10635 


336 


452 


RI FFFLFETEALS VAPAVAQWFDLGSLQ 
SPPP\GSSESPA 


10553 


24454 


A 


10636 


259 


3 


GTRIFFPFWSSPLSFPPVQKKSVSPFFF 
SF/LFFFFETESRSVT\RLECSGAISPH 
CKIjHLPGSRHSIiASATRNVLRPLWRIHA 
SCA 


10554 


24455 


A 


10637 


96 


413 


GDNTSQHSAILINYFI FFFTFLFF/ETG 
SHSVAQAVRLEYSSVITAHCNFRLRGSS 
NPPKKKKKGGAVLKDPWGGQSLPGLATY 
YFFPYRGANKNLLGDFWEGPLFV 


10555 


24456 


A ~ 


10638 


180 


473 


CYMGRKAWLVLSNLLTLKKKKKKKKKKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKKKKKKK/ EMPVKQKNRGRALYKK 
KKGGRPFPIFWPFFFFFFLGGGKKHPGG 
FFRKKFFFGGGKKKK 


10556 


24457 


A 


10639 


312 


480 


ETVDSTSGEDDVNIAEMATKDSEDYINL 
V\GKAVAEFERTDSNFE\RGKMMSKSI 


10557 


24458 


A 


10640 


2 


516 


MKPLENLSQTASLARGATLIiRPVIiRRIiC 
GLPGLQRPAAEMPLRARSDGAGPLYSHH 
LPTS PLQKA1CLAAGSAAMALYNPYRH/E 
ERPRISTSTLDLGKLQSLPEGSLGREYL 
RFLDVNRVSPDTRAPTRFVDDEELAYVI 
QRYREVHDMLHTLLGMPTNILGEIVVKW 
FEAVQ 


10558 


24459 


A 


10641 


56 


455 


PLCSYIQFIYHSKFNSNQISLVHTIPIjS 
KLFGYFSQLYSIYSYLCQHFIHLLSIFI 
TVFCIVFYWVFVHLISVPMYICVCLCTN 
VCVCV/CCLW 


10559 


24460 


A 


10642 


272 


33 


GRWFLFLGPAKYFLTGGRFPSDFFGPLK 
INPPFFFFFFFDTESHSVAQAGVHWHNI, 
GYLQ / S PPPG\SSDS PAPAE FHHTV 


10560 


24461 


A 


10643 


285 


1 

- 


MGNFLKRFFKNEKKFFFPILRAHPFNFF 
PGKSPFKTPRRALPLGVPPKHPFFFFFF 
FSETE SRS VARLECSGMISAHCNL \NLH 
LPGSGYSPAI, 


10561 


24462 


A 


10644 


100 


466 


FbLKFRIiREVPSRVRCQSAMjLGPTSQL 
GYSGVRDTLEEAVCPFSDLKLCARRTNT 
LFKTVRQGHLSLQRFLLPFVQLCPVPRG 
GVYRGRQASLSCSGVHPVRASPPLCLPK 
VPPPS 


10562 


24463 


A 


10645 


260 


460 


LKPHAERETIDKGRLPYYSFFFFETESC 
FVAQAGVQWHDRYSLPPPP\ PGSEGSRA 


10563 


24464 


A 


10646 


360 


504 


QIGHICAYVEKTELRLIFFFLFFFFRQS 
CSVT\RLECSGTILAHCNLCL 


10564 


24465 


A 


10647 


241 


45 


WEVEVPRVAPCHIL.KRDYYFIILFLFLF 
FEME SYS FT \ ELE CS S AISTHCNL CPPG 
SRDSPGIPPH 


10565 


24466 


A 


10648 


76 


462 


FLWEYQEKAHILWSLFSKLILSRSRFTV 
KEKVRQKPG ILF I YLF I FEMES CS VAGA 
GVQWRDLGSLSPQKKKDS/DQSKAITFV 
EGINSKRGGWTGPHF/HCSLKMIFLI 


10566 


24467 


A 


10649 


3 


359 


QTQREPTMELTPADKTNVKAAWGKVGAH 
AGEYGAEALERMFLSFPTTKTYFPHFDL 
SHGSAQVKGHGKKVADSLTNAEAHEDYM 
LNALTALNDLHAHKIiWVDQVNFKIiXLSH 
CLLVTLA 


10567 


24468 


A 


10650 


266 


3 


TPEkKKKiGGLGAPFPGYPKKIV*PPNFF 
FFKKTKFFFFQRVGWAPPPKS FFFFFFF 
EVESCSVAQAGVQWRDLDSLQPSPP\GS 
RGSP 


10568 


24469 


A 


10651 


278 


461 


KKI ICGFFFDTGSCFVAQAGQQWRDLRS 
LQSPPPG\SSDSLTSASRWGMSRHMRA 
VF 


10569 


24470 


A 


10652 


161 


456 


VFFFPGENIAFVENKTTLSYHFMFTRMT 
AAKKMENYKGCWGWETIGSFMPCWLESK 
RVQSLW\KVWQFFKWIiNVKLLYGIiAIPL 
TGICPKKLKKYVQGY 


10570 


24471 


A 


10653 


91 


486 


PCFNHGHTTWYMYFFSLSLILFFVFSLK 
PVIIAYKTFS FP ITI RVLLQRRHRQKFV 
LDGFNGVDQNNILGS I CVSMKI FFLTE 



1192 



ISDOCID: <WO 01 64835A2_I_> 



WO 01/64835 



PCT/US01/O4927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
C=Glycine, H=Histidine, I=Isoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SCFVAEAGMQ WCNIGSLQAPRP \GS CHS 
PGISPPVLG 


10571 


24472 


A 


10654 


267 


470 


NLHMRCEKEQNFNKLSFKDLFGLGWLWF 
FIFCFWRRILTLSPRLEC/RG/MIFAHC 
NI1CI.PGSGHSPASA 


10572 


24473 


A 


10655 


775 


1401 


TFPSQIRYGLRKRPNSLTLS PRLECSGA 
ISAHMQRPPPGFTPFSCLSL.PSSWHYRR 
PP PRPAI CVCVC VCVCVCLVETGFHRVN 
QDGLDLLTS / S / I PPAS A/ FPKCWD YRR 
E 


10573 


24474 


A 


10656 


287 


454 


LPLIYFLYFYETECHPIiAQAGVQWRDLS 
SLQS PPP \GSNDSSASAS PTVPTSAM 


10574 


24475 


A 


10657 


212 


64 


LFLSKFYFFEMESCSVAQAALQWCDLGS 
VQPLP \PGSEDS PASAWG YLD 


10575 


24476 


A 


10658 


368 


473 


GFIDHTRRERPECP/LTDEWIKKMWHIH 
TTEYYSAL 


10576 


24477 


A 


10659 


358 


1 


IIiAVFCFCGSAPGFEGGKLTPFFNRGRE 
TFLAIjVKTPPPGKARGPVFISKNKKI PE 
FKQPPNPNPNS FFFFFFETESHTIARAG 
VQWHNLCSLQS PPPG\SSDS PGRWSLQR 
TEIAPL 


10577 


24478 


A 


10660 


2 


235 


KRDIiIRHYPKEDIYMANKYIIjK\CSTLT 
MHTETLIRTTMRYHLILIKVTIFKKTDN 
MTAVGVHICNPNTLTGHGRGIA 


10578 


24479 


A 


10661 


9 


538 


cvtvr i psrptrplssdrsnpgrflsts 

NS SX. Y / E KD KRNKAY FTK/R PS PVND 1 1 
ST 


10579 


24480 


A 


10662 


374 


38 


sflwklhlrgapgcircqsa\llggvsq 

LG YS GVRDPLE E AVCP FS DLKPRAGRTT 
TLFKAVRQGRLSLQKFLIiP FVQLCPAAR 
GGVYRGRQASLSCSGLHPVRASRPLCSR 
R 


10580 


24481 


A 


10663 


268 


47 


ALIPLSSLT1SAFHLLLLETVSTSSPKM 
ECSGAITSHCSFNLPDSSPSPTSASR/V 
IGTIGARHQAQIiMFIYFC 


10581 


24482 


A 


10664 


64 


451 


FSSERKSHMSLTLNQKLEMIRLSEEGLS 
KAKVGQ KLVRLHQTVS QWDAKEKLL KL 
IKSATPVNIGMIKQHNLIADIEKFGMIW 
TDCQTSHRTVLCQRLIQSK/ALTLFNSM 
KAERGKEAADEKLEVRRG 


10582 


24483 


A 


10665 


355 


474 


ILFYFIFFETESCSVAQAGVQWRDLGSL 
QAPPPG\SRDSP 


10583 


24484 


A 


10666 


317 


481 


GHTCPWQTFFFFLFRDRVLLHHPGWSAV 
TQSWLTAALT\FGPKRSSCLNLLNDWD 


10584 


24485 


A 


10667 


57 


329 


VKNTQ WGKDSL FNKRV \ FKNWAS I YRR I 
KLD\LTSYAKINSKWIKDLNVRLEIVKV 
LQVEYPSFKILGNGSVLDFVFFYSGI FA 
LHLMGEHP 


10585 


24486 


A 


10668 


86 


468 


ENYKI PLiMGGKNFIiLiFPS IPPYFFFFSR 
LGLTLLPRLKCSGDHCSLQPRPPGLKRS 
SCL\GFPKCWDYRNEP/CVPR 


10586 


24487 


A 


10669 


254 


25 


GSH I CKVPAAI YSNTCTS S EHGDGGWG 
GCSSGSTTHPSPVADSFFVWETVSL/S 
VTQAGVRWCDLSSLQPPP PG I 


10587 


24488 


A 


10670 


206 


3 


YGRPRKLCNIGLKSGFLDPTHWGQHRVI 
SFFFIMETDSRSVAQAGVQWHNLGSLHP 
PPPG\SSDSPAS 


10588 


24489 


A 


10671 


20 


355 


GFT8QSELLYSIDPITRPTDQVTIGISA 
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SEQ ID 
NO: of 
nucleotide 
sequence ' 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, LHLeucine, M-Methionine, 
N=Asparagine, P=Proline, 
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X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QS YCRVHVHNRVYDLD VESGH PNGAP A I 
KGSVCSTIKVPGDVSSGRSNPGRFVSTS 
NSSLY/EKDKKNKACFTK/RPSPVNDI I 
ST 


10589 


24490 


A 


10673 


112 


359 


NKAQQCVHENHFKLKDANTLNI KVWRNI 
CHSSPNQKKYGLAILNLDKSGFRSRKDT 
GDEE/HFIKIKKSVIQEDFSI INIYA 


10590 


24491 


A 


10674 


317 


2 


TGPGFFPQI WKVFPPFPLKI FFTQKFLV 

DGVQAEKFFFFPFFFFFFFFCERGSHSV 
A\RLECSGAISAHCNLCPPGFK 














DHR P E KTKS S T CQ AGE VP LLG Li FVP FR S 
STDWVRPAFI /MEG/HI CPTQSTNSNIN 
PVWKHPHRHTQDNVWPNAWSPHGPVKLM 


10592 


24493 


A 


10676 


271 


484 


KTPAGQTCRI KS FFFSFFE TES CSVTQTG 
VQWHDHGSMQPQSLGP\SDPPTSANSSA 


10593 


24494 


A 


10677 


364 


121 


KQE WRSLSHNALCNDQAS PLPGSGHWK 
S KQKLS KAAP CAGS S KHKHL / HKECWKH 
THTH'i'H TH I HTHTHTH I HAHTKDR FA 


10594 


24495 


A 


10678 


354 


539 


FFFFFFIVFEESHAIAQAGVQWCHLSSP 
HLR \ S P P PGFN \ DS P AS AS S AG ITTLS S 
SVRL 


10595 


24496 


A 


10679 


123 


853 


RWSLCHPRLE \CSGTISAHCK/L/RAPG 

LVETGFHHVSQDGLDLLT/SG/ IPPALA 
/FPKGWDYRRE 


10596 


24497 


A 


10680 


251 


30 


ASLGVS APLC I HVSPCP PNGDLGKTW WV 
GGGFFFFEMEFHS /VLPRLECNGTISAQ 
CSLCLLSSSNSPASASRIPP 


10597 


24498 


A 


10681 


203 


1 


NFPAPGKLGPPRDSLKTAPPFFFFFETG 
SRSVA/EAAVP/W/CDLDSLQPPTLPDS 
S DS PNS ASGVAG I TG 














NSPASAS 


10599 


24500 


A 


10683 






IjYMCWFRPGFJjAHNSHDHGYSIjTIjSCWG 
ASGLKKQP/ CRLSEKKKKKKKKKKKKKK 


10600 


24501 


A 


10684 


201 


463 


I YWED I VQGIVADRLTSRKHNS KFLGEC 

SLRPLPPG/SQVILLPSSDSPALASRVD 
EIAGV 














PEKNRKTQKGAPPPPFPFETGS PS VAQA 
GLQWRSHSSIQNEPP\GSSDP 


10602 


24503 


A 


10686 


165 


3 


CVFVEDIISNFFFFFMETESRSVAQAGV 
QWREHGS\ASRVAETTGVHHHAQLIFR 














N'X't: 1 GWVWWliTT VI PALWEAE AG / E I TR 
SG 


10604 


24505 


A 


10688 


161 


365 


RPPCWI RTSRWCWRYYRFDGST I KVLRD 
AYFTK/RPSPVNDI I 


10605 


24506 


A 


10689 


286 


3 


QPSSVWSFRSVICAPAPFSGMGLSPELQ 
QCIVGNFASRYTMTKSSAVLFIIjIFSLI 
FKLEEL/REAPASLVSFLPPQMIjRIKWE 
SEQWLVLLCDRG 


10606 


24507 


A 


10690 


169 


2 


SDTPAWWPRKHVFPPGFFFFFETESRFV 
T\RLECSSTISAHCNLCLPGS SDS PASG 

s 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GI> cine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G)utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10607 


24508 


A 


10691 


240 


406 


DGIRRLSFFFFFFFETKFGFVA\RLEGR 
GT I WVUVNLH L PGS RDS P AS AS QVAG I T 


10608 


24509 


A 


10692 


343 


3 


QVPLNLSLHLKLGMLFPGCINGSLSQFL 
QFFAQIS PYQTGFPS FYKTS I YHNSLHQ 
YSLS FLPYFIFLFFFSFFFFLKQSHSVS 
\RLECSGMVSAHCNLRLTATSTTQVQVI 
LV 


10609 


24510 


A 


10693 


196 


3 


YDAGHTKKTLHFPTVYPFVI.FCWWECKM 
VQPRWKI VRR/FLKKLNTKLPYDPAI PF 
LSVHQKELRT 


10610 


24511 


A 


10694 


245 


391 


KTDYQPGAVAHFGRPRREDHPRLGVQDK 
PDQLEKPVSTKNTKLVW/SWWIj 


10611 


24512 


A 


10695 


252 


389 


KTGGKVIjFFFFETESRSVAQVGVQWRAIi 
RSLQPP\QPGTSDCPASAS 


10612 


24513 


~K 


10696 


385 


1 


PPNKAKMISSKDNKNLHWGKDl'LLNlCWC"" 
WESWIVTCITMKLDSHLSPYTKINPKWI 
KD/LKTI KI LGENI KKTVLD I GLHK\NM 
SKTSKAMTKILDLIKLKSFCPAKEIISR 
VNRKSTEWKKVFASYLSN 


10613 


24514 


A 


10697 


296 


430 


KHIQARRGGS /CGN/ RQHSGRPRRADHL 
RSGVREQPEQPGEKPHL 


10614 


24515 


"A 


10698 


280 


468 


DYLCL/ SLiS X YXiS I YLS IYLS I IHQFI Y 
HLF IYHliSNLS S I S I FFTKWLS 


10615 


24516 


A 


10699 


15 


393 


RS VGVLGPVRCQCA\ LIiGGDSQLiG\ SQG 
SGVRDPLEEAVCRFPYLQLCTGRTTALF 
KAVRQGHLSLQRLLLSF\VWLCPAPIGG 
AYRGRQASWSCGGLHPVRA/SMLIiCLPK 
EAWAMAGAPPPASLPPLS 


10616 


24517 


A 


10700 


176 


1 


DWTTNFTrFLYNFKPSSLMPYLSHIiFKT 
IiR/MWPGAVAHARNPSTLGGRGGWIMRS 
GDE 


10617 


24518 


A 


10701 


494 


80 


FKTKKD IHS DTPS EGHQJjQRPNVETLiKKM 
GRNQCKKGENPKNQNASSPKDHNSSTPR 
EQNWMKNESDELIEVGFRRWVITNSSEL 
YKG \DVLTQCKEAKNLENRLGKVLrRI T 
SLEKNGHGLMEVKNIAQELCEASAGWR 


10618 


24519 


A 


10702 


264 


410 


KKGPLFTPPGGGGG/ PQKKPPGPLNPGG 
QRDSS FFPP PGGGNTGETPPGG 


10619 


24520 


A 


10703 


125 


3 


NRGNKGQV/ QWLMPVIPALWEAEAGRSP 
E IRSRDQPRQHGETL 


10620 


24521 


A 


.10704 


184 


2 


RLRVLAPCRHLPRAPRT WPKRP FFFSFF 
IiFFPQTQSHSVAXRLECSGAISAHSNLC 
FPPTRP 


10621 


24522 


A 


10705 


187 


406 


IiFLiWKFCIiRGVPGHVRCQSAIj \ LGGASQ 
LGSSGVRDPLEEAVCLFSDLQLRAGRTT 
TLFKAVRQGHLRLQRILL 


10622 


24523 


A 


10706 


82 


410 


I LRG KFGKHYFNRI DWREALRQS LS LFN 
FI I FSNFIiASLHKPEMETELKGS FI ELR 
KAIjFQLNARDASLLSTVDSDFS FCRKFS 
R/CSKCGQ 


10623 


24524 


A 


10707 


390 


2 


S KDCRTAKI AACS FLWKLRSRGAPARCK 
PELSCMRCLSA\LLGGVSQSGGTGIRDP 
XiEEAVFPLAEIiERCVGRSAALFRASRQE 
H1.SLLKMHPQLPLPSGALSQADGSFIYK 
PLTGASAFLiSEMPCQERR 


10624 


24525 


A 


10708 


229 


392 


YSWWRQMHSVAHAGVQWCD/LVSLQPQP 
PG\SSDPPASASLAARTTGAGHHNQLIF 


10625 


24526 


A 


10709 


193 


2 I 


RYLCYQDHVI LLLLLLLLFLRWSLTLSL 
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nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
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peptide 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Lcucine, M=Methionine, 
N=Aspar»gine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RIiECSGTIMAHCSIjDS\SSNPPISASQA 


10626 


24527 


A 


10710 


180 


11 


IiL F PFETRS /HVS QT WQCGM I SGHCNL 
RLSGSSDP PPLASRVAGTTGKHQNIWLS 


10627 


24528 


A 


10711 


234 


408 


MKLIMLKVILFPQIHliQIQPNFYQNIFF 
FETELCSVAQSGVQWHNLSSLQHPPP\G 
SSD 








10712 


312 


470 


TGPHCVT \RLECSGAITAHCSliDFAGLS 
TSPTSVSQVSGTTGTWMKLETI ILS 


10629 


24530 


A 


10713 


64 


392 


PKMVIRISSETSLYASLPLQMKGQRQK/ 
CEPQPKKKKKKKKKKKKKKKKKKKKKAR 
G 














NNQKTNNKMVG VS F YLS 1 1 1 LNVNELNA 
PIQRHRVAEWIKKEKKKKK/DPGICGLQ 
QTLFFYEDP/HDPLRLKIGGW/RKYYPS 


10631 


24532 


A 


10715 


176 


410 


ARSS WGLHTAVP VAFFSLRFIiLE I FLDR 
DTRCSPPAAFFFETGSCCVA\RLEGRGA 
ITAQCSLNLLGSSNPPTSASRVA 


10632 


24533 


A 


10716 


234 


389 


NSGNMDRYKDVQNI IQNPICWPGTVAHA 
CNPSTLGGRGGQIL/RGQEFETSLA 










190 


2 


GPFPPHPGGFLRGFFVPNTIPPPTFFFF 
FFLRRSLSVAQAGVQWCDLSSLQPLPPG 
P\SNSPCQ 










201 


1 


WPLiFLPKMFFFFKRFPHMGPSPWVGAQA 
KKKKIFFFLTGSHSVT\RIECSGTISAH 
CSLNLPGPSH 














EYHTSLVTCGNPCVYRSNNKLNQTSRRK 
VITKIRAELNEIETEK/LQGSGETKIWF 
FEKINKTGLELLGSSDPPVW 


10636 


24537 


A 


10720 


218 


407 


GKKNIiAFKKKKRKEKKRKKS PSQSNMNS 
AKIEARTNIKLWKHGWKNCEI IDA\LQ 
KAFGDNA 


10637 


24538 


A 


10721 


.231 


409 


GTQLHLGGFFFSEKELSFCFWF/MFALF 
EMEACSVT\RMECSGTVLAHCN1.RLPGT 


10638 


24539 


A 


10722 


40 


401 


PLCPSESSGNTTiMASSDPSTPAVPPPNT 
THPPLCLSKSHLPLRPKQGLPSGNLLQL 
PLTLLI PLLGAPVACWQLPQQCTLSTFF 
FETKSHPVAQAGVQWCCLGSLQPLiSPG\ 
SRDSPDSAC 


10639 


24540 


A 


10723 


210 


14 


HVMGLLLLFLNKLTVNNFGWLAGFWCLG 
FFWFFSF FETRSGS VT \ RLECSGMI SAH 
YKLCIPGSSH 


10640 


24541 


A 


10724 


347 


462 


TFFFFFFETKSRCDIQAGVQWCDLCSLP 
PSS\PDSSDCP 


10641 


24542 


A 


10725 


169 


386 

_ 


DKKQAKT I KWGKKS FSNK WC WNNC I ATG 
KRMKLDP\YLTPYKK/ INSKWIKDLSI 


"'"10642 


"24541 




1Q726 


"257 




PLFFFFKRQGLTLS PRIjECNGTVTAHYN 
LKLLGSRDLPTSASP\SAGITGISNPAR 
LFTYLNPTHLQT 


10643 


24544 


A 


10727 




401 


NNYDRAETQIYQYMCIjNPTFYCLQETHIi 
TCNDI YRLKVKGRRE IMQIENKRVGVAI 
LVSDKTDFKPTTVKKKIiHYI I KGS IQPE 
DLIILCTYSPNIRASRFIKRIPDLRKEI 
A / HTVKVGD FSIS L.NRLS RQNT 


10644 


24545 


A 


10728 


154 


1 


PMCSLLGLSKGGIIFFFFFEAEFCSVAQ 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X= Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AGVQWRDLGSLQPPPPG \ SSDS 


10645 


24546 


A 


10729 


34 


151 


PRPPVPSLLDRGRLQLWRQRGLRHRAHS 
NGFIGGKQQIMKLLKNYVRRPVG/VAVA 
IMFDPDPRYPRSWIEDDFNYGGSVASAT 
VHIRMGSLVEBNKS 


10646 


24547 


A 


10730 


2 


408 


VFLI/TVRTLI crsvgvwwrstpdlvclg 
ISSGGCRTANIGLYQMLLPDHSSGS FCL 
RGVPGHVRCQSA\LLGGPSQIjGYSRVRD 
PLEEAVCPFSDLQLHAGRTTTLFKAVRQ 
VHLILQRFLLRFVWLCPAPRG/GVYR 


10647 


24548 


A 


10731 


274 


89 


ENIPIVFNPPVCSPLLWQPQE/CEYPKL 
CKLCLKLHKHRVFY \ FLFFFFETEPHSV 
ARLECSG 


10648 


24549 


A 


10732 


251 


1 


TFQMMQKCFSHRKIFHNLLDKASYKIVY 
KEDPFSPSSLSSSVSLKNNFFFLETESC 
SVTQAGVLWCHLSSLQPPPPG\SRDSP 


10649 


24550 


A 


10733 


375 


1 


APFPPIiWVRGSPPFSPLVFPHPKLKPPL 
AE I LGFFKERKWGS IRKPCLFKVKKLVS 
VWPGLELQIFKFIGEFPFSFPSLVGRKP 
NFFLGPVFFFFFEKESRSVA\RMECKGT 
ISAHCNLHLPGP 


10650 


24551 


A 


10734 


82 


398 


SFLWKLRLRGVPSHVRCQSA\IoLGGASQ 
LGYIiGVRDPLEEAVCPFSDliQLPAGRTT 
TLFKAVRQGHLNLQRFLLPFVQLCPASR 
GGVYRGRQASLSCGGLHPVRAS 


10651 


24552 


A 


10735 


2 


341 


TFCAISWLESGVEDGPRSRISYRISTFF 
FFFKGAPEPKEVRGRPKPGLAPTSLPGA 
HGTTGLSGWTTPS IGGWQPPPPPRENPK 
GEHPPAPVAGDTFQ\PKKPPPILKVFFP 
K 


10652 


24553 


A 


10736 


171 


1 


RVF YLIiAFALFVDTGS PFVS QAGVQW CD 
HS S I QS QTSG\SNDP PASAYRVAGTTGV 


10653 


24554 


A 


10737 


190 


1 


EHKTTVRKI PKMEEMADSGSNMLiRI 1 1 F 
LFIiFFETKS CS VT \ RLE CN / GS I SAHCN 
LHLPGSSNS 


10654 


24555 


A . 


10738 


252 


413 


GLLGLQNFSYKVHLTKAHLKKCSWLDAV 
AHAYNP I TLGGRGEW/ IQEFQTSLTNV 


10655 


24556 


A 


10739 


101 


249 


AGSTSRCI /QELSDLLEHIjEQENCLNPG 
GRGCSEPQSCHCTPAWVTETQKK 


10656 


24557 


A 


10740 


185 


403 


IjGDPVHTMKSNYHYKLCHHHHHIjQHHHH 
HHHCHCHYHS/HHHNSQHPPPPPPPPPP 
HHHHHHHHHLP 


10657 


24558 


A 


10741 


119 


425 


IKFHLHSMERTHLFL.WNCWAHKQRNCFK 
ADFVITDDIKQLCPPQSWWTRAGKLPLG 
AGRGGSHMLSHHFWRPRHADIIiYLGVLD 
QPGQQGETPS1,/ LKNTKS S W 


10658 


24559 


A 


10742 


112 


2 


GETFFFETCSSSVA\RLECSGAISAHHN 
LHLPGSSKSP 


10659 


24560 


A 


10743 


188 


3 


PPPLFFFFFFFFFKTGSGSATXRIiECTA 
HCNCCLPGSSHHPTSAYQVARTTDVCNH 
AWLIFV 


10660 


24561 


"A 


10744 


176 


3 


SLYSKKKNPSHI,1.FLPIP I KFFFFFFET 
EFRSVAQAGVQWHDLSSLQPPPPG\SSD 
SP 


10661 


24562 


A 


10745 


250 


1 


GPRRI FFLKEFYPRFVFGKNPAPGGFFS 
GGKKPGPFPFNPRPIKFFFFFFETEFRS 
VAQAGVQWHDLSSLQPPPPG\SSDSP 


10662 


24563 


A 


1074(5 


155 


3 


' PPHPFPFNPLPIKFFFFFFETEFRSVAQ 
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E=GIutamic Acid, F=Pbenylalanine, 
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T=Threonine, V=Valine, 
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X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AGVQWHDLiSSQQPPPPGNSSDSP 


10663 


24564 


A 


10747 


157 


1 


TPRPFPFNPPPLKFFFFFFETEFRSVAQ 
AGVQWHDLS SLQP PPPG \ SSDSP 


10664 


24565 


A 


10748 


157 


1 


TPRPFPFNPHPIKFFFFFFETEFRSVAQ 
AGAQWHDLSS LQP P PPG\ SSDSP 


10665 


24566 . 


A 


10749 . 


153 


2 


PPPLSWTPAPLRFFFFFFETEFRSVAQ 
AGVQWHDIjSS liQP P PPG \ S SDS P 


10666 - 


24567 


A . 


10750 


157 


I 


PPPPFX.FFPRPLKFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10667 


24568 


A 


10751 


155 


3 


NTFPFFFTPLTPKFFFFFFETEFRSVAQ 
AGVQWHDFSSLQPPPPG\SSDSP 


10668 


24569 


A 


10752 


86 


406 


SFLWKLHLRRAPGCMRCRLA\LL,GGVSQ 
LDYSGVRDPLEEAVCPFSGPKLHAGRTT 
TLFKAVRQGHLSLQKFLLPFVQLCPAPS 
TGVQEGRQASLS FGGLHPVQS SR 


10669 


24570 


A 


10753 


364 


469 


GQFIiHStiD/SHWKKSEDFCFLWFPHSEN 
VSAIHQDH 


10670 


24571 


A 


10754 


250 


61 


GEKKKKKPRVFFFFFFFETEPGSVT\RL 
ECSGVLSAHCNLRLPNPNDSPASASRVA 
ASAKLG 


10671 . 


24572 


A 


10755 


66 


446 


SFIjWKFCIjKGVPGCVRCQSA\IiLiGGAS<2 
LGYSGVRYPLEETVCPFSDLKLRAGRTT 
TLFKAVRRGHLSLQRLLPPSVCLCPAPR 
GEAYRGRQAS IiS CGGLH P VRASRPLCLP 
TQALAMVGAPPPGSL 


10672 


24573 


A 


10756 


252 


482 


RIj prqp vrkwvag vrgc cvwgws ks lq 
rsttldwqgppqrggpillffffetlch 

/TRLEGNGE I S AHCDIjCLPGSN 


10673 


24574 


A 


10757 


205 


488 


PLESIjRSPIjGPI.SWRTASSVSPVICIYY 
IYIYIYIYTHTHY/HIHTHTHTHTHTHT 
QYI FVCWFLKDRVLLCHPRQSAGARS W 
PTRTSASQRQEM 


10674 


24575 


A 


10758 


272 


471 


YS YVLFFIFLG I ES CS V/ AFSAGVQWHN 
HSSLQLQTPGLKQS SHLSLP /AS ASPVA 
GTTGMRYHARLIF 


10675 


24576 


A 


10759 


17 


342 


GTLSSGAQVIjIGRIES I VWGLKPWALG 
GCPSPRAVHWLIiASSDWRPSLQDGAEGW 
KKGEANGNHKR/ GIAVISDQIDFKTKTI 
KGD\KKSHYVMIKGPIQQEAITriNI 


10676 


24577 


A 


10760 


42 


498 


EFRERGREKEREKERKTEERMEDRERKA 
EREREKERQEGREREREKGRQRERERKR 
ERERKRERE/RHEPGSL 


10677 


24578 


A 


10761 


1 


305 


ASWDDPAHNNNFHIPGGWAHFFFCRLL 
DCPPWAPRPPAPR/VLL/TPAAAAAAAA 
AASRPEKKRAEENVGAIPPRKRPPRWEE 
RRGPRKGSAP PGRGAGRRAR 


10678 


24579 


A 


10762 


364 


2 


FPQKQNGQYLPLFPKPTGLCGKGKTRLG 
FFLNYFGKKKKKRGRELGFPPFCQIWGT 
S I YRGSMGYFFFFFFWRQVSLALLPMLE 
C\HGMIRVHCSLDLVGPSNPSTSASQIA 
RTTGTCMCHH 


10679 


24580 


A 


10763 


152 


484 


RFVCSTIKVLRDLSSDRSNPGRFIiSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNDI IS 
T • 


10680 


24581 


A 


10764 


208 


420 


SFLWRFPLRGVPGHVRCQSA\LLGGASQ 
LGYSGVRDPLEDAVCLLSDLKLCAGRTT 
ALFKAVRQGHLS LQR 


10681 


24582 


A 


10765 


319 


85 


GKLLNNNRCFGSKFQGIMFFFGETESPY 
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SEQ ID 
ISO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginioe, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ITQSGVRWHDFGSLQPP\PPGSRDSPAS 
AS \RG I TGARKHTQ P I FLL VSNS 


10682 


24583 


A 


10766 


122 


3 


KFFFFFETEFRS VAQAGVQ WHDLS SLQP 
PPPG\SSDSPSI» 


10683 


24584 


A 


10767 


240 


86 


RAPPFFFFEMESCS IAQAGVQWLSLGLL 
QAPPP\GSCHSPASASCELMFSKI 


10684 


24585 


A 


10768 


244 


85 


RAPPFFFFEMESCS IAQAGVQWLSLGLL 
QAPPP\GSCHSPASASCELMFPKI 


10685 


24586 


A 


10769 


35 


296 


EVKS PSARQPPRLGSEEHLRPAAAPSGR 
EVGGQPPPGQPPCPGGEG/PPPPGSPDW 
EVRS PSARQPPRLGGEPNS SLRTGHDDD 
GGFV 


10686 


24587 


A 


10770 


260 


484 


MDEELLLMNEQRTWFLEVESSPGEDAVS 
I I QLATRD LEYDLNL VEKGAAG / LERKH 
YS FERS STVDKI LSHNTAC Y 


10687 


24588 


A 


10771 


248 


393 


TQEGKKLINWPGTVAHACNPSTLGARGG 
RTTRGQELETK\ LANKIKPCL 


10688 


24589 


A 


10772 


239 


2 


S FLWKFCIiRGVPGRVRCQSA\ LLGGAS Q 
LG\SQGSGVRDPLEEAVCPSSDLQLRAG 
RTSALFKAVRQGHLSLQRLLLSF 


10689 


24590 


A 


10773 


158 


455 


L FFTLCPS LLQH I AVMLELG LKG P KS I Q 
P I FW VFLQGTEP /HFLVTP VR/ CCLPLL 
KLFCIiLVFGMESPSVPHAGECSGVISAH 
CNLCLLGSTDSSASPSRV 


10690 


24591 


A 


10774 


166 


403 


KKTFLGEPLFWGGAKKKKPGKKNPGFFP 
RG/IKPRVFFSRFFFFGPPPKKGFPQKS 
FFLKSLPGFFLFGGCPPPFFFFFFFFEM 
ESCSNTRLECSGVIIAHCNLCLPGSSDS 
P 


10691 


24592 


A 


10775 


208 


1 


RLCFFYFRKALU3KAQ I KNI AFFPRKGS 
FFFFSETESRSVAQAGL/LDCSGAISAH 
CKLRFPGSRQSPAS 


10692 


24593 


A 


10776 


192 


29 


I FRKEFPCLNFFLLFFETESRSLAQAGV 
QWRDLGSLKAPPPG\ SRRSPASRDTGV 


10693 


24594 


A 


10777 


275 


490 


KPGFFLLQMAKVYMIFFFFEMEFCSWA 
QAGVQWHDLGSMQHRPPG\SGDSPSCLP 
SI/AGIPG 


10694 


24595 


A 


10778 


340 


3 


LKVPVGKPRSTLLGVKKVPFFGNKRKKF 
IALFFFPRPPPGEGFSTAFLAQKPTPRV 
VPALGFPKNQSPLS PFFFFFFETKSCSV 
TQAGVQWCGLRS PQPMPPG \ S SDS PAS C 
L 


10695 


24596 


A 


10779 


313 


1 


ANPFGGPSGGDPFSSRVFPPPGPKNETP 
FFKNKTTATKKTGNKGGWAPSPHRGGPK 
KGPAL WDKKGKNLWP FFFFE TESHS VT \ 
RLQCSNTI IAHCNQCLPGS 


10696 


24597 


A 


10780 


116 


422 


I LEDTNIQTI ETLLAIREVQ I QTTLRNH 
FTIiTGMAI I / RKTDNNKCWRE CGKX E TL 
I CCWRECKMLCNE VGTALE I VWQFL.QSL 
NIELS YDLAMPGFIIFPRE 


10697 


24598 


A 


10781 


120 


418 


TQTTGAPQIiHIiASRWLSRSGGLTSS PQE 
IPKLFWSIESPLGSSKHLSLQVFVCLFV 
CS FVFEMES CSVARLECS I VI S AH\ CTL 
HLWGSSHFHASASRVA 


10698 


24599 


A 


10782 


134 


1 


SSFFFJjCQTESCFGVQWHDIjSSLQPSTS 
RAQA\LSLPSSWDHRR 


10699 


24600 


A 


10783 


2 


435 


CSHRGDSSSYSQLSGIRAGDLGGGGKDI 
FRLLPTTIiNI FAGKES YDWCVTHERMC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G—C5lycine, H=Histidine, l = Isoleucine 
K=Lysine, L=Leucine, M=Methionine, 

Q— Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, * = Stop codon, / = possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














MCLCL 


10700 


24601 


A 








FSRAGFH / RVSQDG FDLLPS \ CLPPLGL 
PKCWDYKREPPRPAWKLMCRQVHRKCTW 
LFIQIGSTLFKTNGGLSAVAHAYNPNTW 
GGRGERIA 


10701 


24602 


A 


10785 


270 


442 


NVRLRLGIiALS PTLECRGTI MAHCRLD F 
PGLMQSSHLSHRVAGTTG/TCHHAWIjI F 
KFL 














PPTSASQ/VLKTTGVCHHAQL 


10703 


24604 


A 


10787 


350 


3 


DSSSVQTNKGIPGQVLSDILSCAVKASV 
LVTI PLHNALSSWLVPALFLWKS WQVGK 
SQQAHSS VSGLC/MHYRSHTHAHTHS PH 
RHRHTYTYARAHTHTHTQMLS AYLPS KQ 
PSGSLS 


10704 


24605 


A 


10788 


155 


3 


HAFFAIiCIRNRLECNGVILAHCNLC/RX, 
LGSS DS PVS AS QVNQ I A^* ACHHAQ L 


10705 


24606 


A 


10789 


188 


2 


RRDLSSLQPPRGGQ/MRGCIYTHTHTHT 
HTHTHTHTVHWGWGKRHHVPKGKMSANG 
ESGRAK 










19 




ACSQGLVWQQTWGPLGPFPSLLGMPHR 
PTFRDLNSEPAPGVANVSGTLSTPLPGA 
SHGLLVFFFETESHFVAQAGVSWGDLRS 


10707 


24608 


A 


10791 


157 


1 


KPGPFPFYPGPLKI FFFFFETEFRSVA\ 
RLECSGTISAHCNLHLPGSSDSP 














QPRPFPFNPRPPKFFFFFFETEFRSVAQ 
AGVQWHDL.SSLQPPPPG\SSDSP 


10709 


24610 


A 


10793 


83 


387 


S FIiWKl»HPGGAPACMRCQLA\ liLGGVSQ 
L»G YTG FRDLLiE E AVC P FS E L KHHAGRTT 
AVFSAVRQGCLSLQKFLLPFVQLCPAPR 
GGVYRG/RQALLSCHRLHPV 














IiWKEFVHSPWCKCTLPQPLWKTVWRYLK 
NFKME \ I P YVPE 1 PLLG 


10711 


24612 


A 


10795 


207 


2 


RRGFTMLVGQNS LDPS TSRSAHLSLPKC 
TPAPGSHHLiWAALIC 


10712 


24613 


A 


10796 


2 


237 


FFFLRERILLALSPRLECSDANMSHCSL 

NLPG /fsqsfclshp\srwdhrhmppyp 

VKFFGIFVGDRVLALFPKLISYI I 














DRVFFCSPRLiECSGAI I ARCS/LRTPGL 
KQSSHLSLPKCWNHRCKPPRPARCSLNE 
SHSAKKWPGQSTDSSACEEVQPTSPFPT 
VPYPTLLS PSAGHGRGR 














RLVCPKSFFFETKFCSVAQAGVQWHDLG 
SLQPPPPG\SSNSP 


10715 


24616 


A 


10799 


142 


2 


I FFFFFEMESRSVARPGVQWSDLGSLPP 
PPP\GSSDSPASATPSPMQS 


10716 


24617 


A 


10800 


276 


2 


I LPI I IRKTCCCFFLKESRSCS VTQARV 
QWHII/GSLQPQTPG\SSNPPASAFQVA 
TGAHHHTSLIRNTLI I INWEKNTHKLSG 
CTSMHFRFIj 


10717 


24618 


A 


10801 


12 


364 


LHHYKTVSGIYKCLCVLNGPPTRFFPDF 
LPPLGLPYSLRHNNIEISPINNPPIASK . 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 


Predicted 
beginning 
nucleotide 


Predict- 
ed end 
nude- 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalnnine, 
G=Glycine, H=Histidine, I=Isoieucine, 
K=Lysine, L= Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


sequence 


09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 














YSRERMSCISLTLNQTLVMIK1.NEES1.Ij 
KANS DHKLCLLR / S VYQ WNAYEKFL 


10718 


24619 


A 


10802 


235 


3 


HRPSEDLDFPPHWAVMRAPQYPSWDTRE 
GGPRS FLFLFLFFEMESYS VAQL/EGT I 
SAHCNLCLPSSSDSPTSRALCSL 


10719 


24620 


A 


10803 


349 


1 


NQTPFFFFFFGGTETTSTTLCS \YGLLI 
LLKYPEVA/ESASQRDPEWEAAVWRWLE 
GPGSAQPPSAPAKGQELDPWGQRPVPS 
PDDHVQWPYTNAVLLEIQRFI SWKRTL 
TLDTLY 






A 




365 




IDVCNVCVRKQYRGFCDQKYRAWISPVY 
PHKCGWHRVYNTPTPHCETEWLWVILHA 
QEH/TFSLTGRHTHTHTHTHTHTHTSKL 
APPASRALFGVAHVEAQKALAS PSSGRY 
LAITMFVQPCI 


10721 


24622 


A 


10805 


406 


96 


CPPEFSEESPRLLKFRVGGYLTPQVSKC 
GLGWRI FKVFFWS PPKVQTSLFFFSKT 
GSHSVT\RLECGDTILAHCDLCAPGSGD 
PPASATRVTVTVGLPPCPAR 


10722 


24623 


A 


10806 


58 


369 


FFFFKGDRAQNNS / WGERCLLNKGYWD I 
WIS TCKKMKS TP YLTLHTKI S S KGLKDL 
IIRAKRIHLLKKYIGINLHDLGIjK\DFL 
NKTPKTLATKEKIDTLDFIKIK 


10723 


24624 


A 


10807 


126 


1 


KEPFFFFFFETESCSVAQAGVQRCNLSS 
LQHQPTG\SSDFP 


10724 


24625 


A 


10808 


204 


415 


HLGFDSLTTCISLSDGLKYKATVFLVFF 
FERESNC\AVWAECNGPISVNCNLRLPG 
SGSS PAS PSRGVE I T 


10725 


24626 


A 


10809 


168 


2 


KISKRPFFFFFFETGSHYVA\KLECSGV 
ITAHCSLDLPGSSNPPTSASWVAGTTGT 


10726 


24627 


A 


10810 


97 


389 


LAVSPSLSLSLSLSLSLSYLGLPYSLRR 
SNIEITPINTPAEGSVCSSERKGHMSLS 
FIJEKIEVITLSAQDMSNTKIG/RKLDLL 
CH/TSQWNAEEKFLK 


10727 


24628 


A 


10811 


24 


416 


LEY IARRYLGWWLFFFFFLNRQGEKSR 
GPFKFFFPRGFFSTRNGAPPGGPWGPLP 
WGGGPPVGFQKQGKGGAP\ PPKKNRFPK 
GGPLTQPNLPKTPIENPKGPPTRGFFPS 
GPPPKKGAGPPP I FRVGPG 


10728 


24629 


A 


10812 


60 


435 


KKRKNFPQKKISPYFYPLKWFKTPPLWV 
KNQTPPV/ CCFFEAPFSLKKPPRGLKKG 
/WGKNFPPPLVPQKKKKKDTAGVAILVS 
DKKDFNPTKI KKDKGH YIMVKGSMQQEG 
LGCPDAWVPS 


10729 


24630 


A 


10813 


618 


1519 


FWFGVNCKSVCVPFLPVLCFWCVWCER 
GVL SQRSMGQAQ S KPTS LGTMLKHFKKG 
FKGDYSVTMTPGKLRTLCEIDWPALEVG 
WPSEGSMDRSLVSKVWHKVTCKPGCPDQ 
FPYIDTWLQLV/YRPPPY 


10730 


24631 


A 


10814 


179 


15 


KVKRLKTPFFFFFFETRTCSVAQARVQW 
HNHSSLQPQPPGPKRS\PTSVSREAGT 


10731 


24632 


A 


10815 


61 


422 


NCFFLKGPPLFFFFFFFFFFLLQIIVFI 
YKFFAFLFQMEFRSLPRLDKCNGAI SAH 
CNLSLPSSWDYRNLPPRLANFFLFLVET 
RQPASA/ FLTCWDYR 


10732 


24633 


A 


10816 


170 


440 


RQGLSLLPRLECSSMTTAHCDLKLLSSS 
NPPISAPHIALG/LTGLCHHTQL 


10733 


24634 


A 


10817 


322 


443 


FIFIFYFFETRSHAVAQAGVQWRDLALL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I— Isoleucine, 
K==Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QPPPP\GSSDSPA 


10734 


24635 


A 


10818 


134 


3 


KKAHLPFFFFLEGTESHSVA\RLECSGT 
ISAHCNLCLPGSSVSV 


10735 


24636 


A 


10819 


76 


245 


FLLRKGTRQGGPI>S S / LLFN I VRDVXiAR 
AIRIi\EKKIKDIQIGKEEAELSIiFADMI 
IF 


10736 


24637 


A 


10820 


233 


440 


FFEFGALKKWKSMtMklWEkaKRtTLNG 
GGIKAAKTASLKKHLQGWGKVMLMANKL 
LRGERAW/FPPRIRG 


10737 


24638 


A 


10821 


293 


402 


RNP F FFLE TE PHSVAQGE VQ W CNLS S LQ 
PPP\PGS 


10738 


24639 


A 


10822 


168 


1 


PLRGLSDSSLGIMKRMATDLSSIiQPLPP 
G\SSDSPASASRVAGITDSHHHAWVIF 




24640 




10823 


97 


405 


LCVNICFNFPWLGVEWLNHI IGVGLTFF 
FEMESCSVARLECSGAISAHCNLL/LPP 
SFGSSDSPASVT 


10740 


24641 


A 


10824 


1S6 


2 


EPGTISLVALKLQKWPRR/SDHL.RLAVR 
DQ PGQHDETPS LLKNTKIS WAQ WQAP VI 
PALEPV 


10741 


24642 


A 


10825 


199 


1 


KKTQIGGAPGGALFFFFFFETGFPFVA\ 
RLQCSGTNQAQCSIiNLLSS SNPS FPALQ 
VTGTPSACP 


10742 


24643 


A 


10826 


252 


381 


GQQEFFFPCSVPQAGGQWCKLSSLQAPP 
PG\ SHHS PASASRVAG 


10743 


24644 


A 


10827 


77 


427 


IPQVHCPMSPPVPMACIPRVSSFTSWVF 
HNLLPPSECPLGPLVPASSHPRPCVCCR 
pcts ws \ cplwprppcsns pv/ tcvp cl 
PCLCISEIPSCVPWP/WTYSSIjCPMSHV 
PDSPCPLP 


10744 


24645 


A 


10829 


20 


518 


SFAFSLLQHLTETSFAINSCSEATLLFIi 
SVFL/RAQTLTAPCQTRGPRRGKDRGSG 
SSS PSGPKATKESS VERRKS FDS WGHRF 
AA\QRLMDNQAERESEAGVGLQRDEDDA 
PLCEDVELQDGDLS PEEKI FLRE FPRLK 
EDLKGNIDKLRALADDIDKTHKKFTKAV 


10745 


24646 


A 


10830 


351 


3 


LHFSPFLQHRQNIKVWLMRASMQRHSRT 
HGAFPLHQDEIQTRPRLRSAASSGPSLL 
SDHILSTLPAFTCANLSSRFCSSSPCSC 
SCLRA/CCTCHFDLSTLIHAHTRTHTHT 
HTHTMY 














GMLPAHLADVIjRHNSVGRPKHMRVMAGA 
LEGDLF I GPKAE \ EHRG 




24648 




10832 


336 


38 


GVATEG VGEAAQGGE PRQPEQ / PPPQ PY 
PPPPQQQHEEEMAAEARQA/AGAPMDDG 
FLSLDSPSYVLYSDRAEWADIDLVLQNV 
GPNP WQ 1 1 YSDKYTLWK 


10748 


24649 


A 


10833 


206 


1 


TYFFPFPPGLFFIAGIFFFFFLETGSH/ 
SSLRLECSGI ITAS CSLNLPDPNDPPAS 
ASQVAETTCATMY 


10749 










~376 


RISQAI XSFYFYFLFETESRSVAQAGAQ 
WSDLSSLQPPSPGV\SDSPALPS 


10750 


24651 


A 


10835 


213 


1 


DRVLLLSPRLGCSGMITAHCOTjHLPWFK 
RFSCLCPPE/SSWDYRCPLP/PPRLTSV 
FIiVETGFHHVGQAGLR 


10751 


24652 


A 


10836 


351 


447 


RENLWIiTPVI PALRE AKAG I \LEPRSSR 
PAWAT 


10752 


24653 


A 


10837 


71 


472 


SASTAPMAPVKKLWKGG/K/ KKKQVLK 
FTLDCIHPIEDGIMDAA/NSTNYEQFLQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso)eucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ERI KVNRKAVNLGGWTIERSKS KLTVT 
PEVPFSRRYLKYLTK3CYLKNIJ1JLCDWLC 
IVANSKESYELRYFQITQDKEEEK 


10753 


24654 


A 


10838 


37 


382 


SRCIMAFYLYGIRSFPELWKSPYLGVGP 
GHSYVSLFVAGRCGVRNQQRLCSVKTMS 
PQNTKATNVIAKARYLRKDEGSNKQVYS 
VPHFIiIAGAGKERSQMNSQSEDHK/LA/ 
PVRNAV 


10754 


24655 


A 


10839 


313 


617 


AFFLi I V FQ YVARERERVSRS / VNCGL YQ 
HCYFRTL 


10755 


24656 


A 


10840 


309 


511 


WEQGSWEHVLQAPCSLIAPRITHAYTHT 
HTHTHRYTHTHIiYPH/ APTS IL 


10756 


24657 


A 


10841 


269 


450 


TPWPLKKEFFFFLKWSLALVAQAGVQWR 
DLLGS SDS P\LQPSRFKQFS CIS LPS S W 
DYR/PC 


10757 


24658 


A 


10842 


420 


1 


LERGENGTRRDRRKGLSHCHQPMDSVLP 
PLCHP PPLLVTMEEE ITMLF ID IGSSMY 
KAGFAG/DDASRAMFPSIVRCPWHHGVM 
VGMSQKDSYVGDEAQTKHSILTLKYPIK 
HDI ITNE\WDNMEKIWHHTFYNKLHVAP 
CI 


10758 


24659 


A 


10843 


431 


1 


GEHSWASDLAEDVTKVTRGPLGPLFWGL 
CSGNLSFSGCVPGLPAGAVPPWVPVPFQ 
GGA/SWVWKGPSPWTLHLLLRTWGLAGG 
VGGRSLGRWRAWPGNPGSQGQGAPPAHQ 
ATGTPRSRTGSTGGIETVTILEGSHVSG 
MGIR 


10759 


24660 


A 


10844 


55 


297 


QRWPGTLLGVQPG/APPDSTSASGSGGA 
RGGPVPTLEGGNTGSRKWEDPCWGSQNS 
PSSDGSKPPPGPLTSKVCDDSRLSE 


10760 


24661 


A 


10845 


109 


427 


QTGPSAAGLLEFARGPLQTLFAWVPAAV 
AAEQQIFVNRECCCLIVPLEFCLRGVPC 
CVRCQSA\LLGGASQLG\SRGSGVRDPL 
EGGSCPFSDLQLHAARTTALFTA 


10761 


24662 


A 


10846 


17 


416 


S FL WKFCLREVPGRARCLSA \ LLG GAS Q 
LGYSGVRDPLEEAVCPFSDLQLRAGRTT 
TLFKAVRQGHLSLQRI LLP FVWQCPAP I 
GGVYRDRQASLSCSGLHPIRA/SRAAVP 
TQAS AMAGAP PPDSLPPCSLSSN 


10762 


24663 


A 


10847 


265 


429 


LSGASCCLFSLP I VLFALLLLS VLSFS F 
CAWCVPSWSLSPTLFLTFVLHFL*RFIS 
CF*VNSNPKNLTFMVNFQGKGMKIGLAL 
ERIC I CGCS LVF PEAI LETS CQNLFCTY 
ACAGVLSSVYDYLALLAVSFLFL* FFSL 
FFFFLYYLFLFALGASLPGLYLLLYFLL 
LCCISYSLF 


10763 


24664 


A 


10848 


12 


462 


QTLGTKMNEGLFATF I APT I LGLPDALL 
IILFPPLLIPTS*YLINNRLIITQH*LV 
KLTSKQMITIHNTKGRT*SLILISLIII 
I VTTNLLGLLPYSFTPTTQLS INLAMAI 
PL*AGAEVIGFRSKIKNALAHFLPQGTP 
TPLIPILVII 


10764 


24665 


A 


10849 


2 


462 


TTLHAFGTMKREAF I TLLCLFTSANSRG 
VYARDAHKSEAAHRI KDLAEKDFLALVL 
IAYAQYLQQCPFEDHVKL*NEVTEFAKT 
CVADE S AETCDKS LHTLFGDS IiCTVATL 
RETYGEMVDCCAKHEPVRYECFLQHWDD 
CPNLP*WRPEVD 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nu de- 
tention 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylnlanine, 
G— Glycine, H=Histidine, I = Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 


10765 


24666 


A 


10850 


164 


1 


RGFQ VRGE * NSGRE TER * RERE * KLARE 


10766 


24667 


A 


10851 


417 


100 


MMLWGNLETRLWSQAQTIPCTKQRSSL 
FFPVLNAAIFRSKSKVLNCLEYFLGNTL 
FSWDLIPEILFPLSHYHHYHHHHHNHQY 
* KS * QWSASNTRFQGILDKNS Y 














ESRRGREAAP VCPRHVLL *GAQS KQAAV 
AGKRSGTRHASRWPKSLFTPRRRRISLK 
RALHFWQQSADPSPSVSRI PPHGVGSS 


10768 


24669 


A 


10853 


1 


373 


LQQ KGMRKRAGQS EMAP AGVS LRAT I LC 
LLAWAGLAARDRMYLHPFHLVIHNEST* 
EQLTKANAGKPKDPTFI PAPIQAMTS PV 
DEEALQDQLVLVAAKLDTVDKLMAAMVT 
MLAI FLiGFRIYG . 


10769 


24670 


A 


10854 


1 


423 


VSCS FLKLKTMKHG1.LLLIjCGFLL.KSQG ' " 
VNYTEEGFFRARGHRPLDKKREEAPSLR 

APDAGGCLHADPDLWV*SPTGCQLQEAL 
LQLERPITNTVDELNNNVEADSQTSSSF 
L 


10770 


24671 


A 


10855 


343 


3 


RGCEAHPLPRSEGPAGSALAQPVMYCTI 

PVLTKKINPRSTEAAIKYFLTQATASII 
LVI A I LFNNI LSGQ * TLTNTTNQ * S S S I 


10771 


24672 


A 


10856 


147 


1 


TRTPTGQCVS PKSMFLGAVAHS CNP I TL 
GG*GRRIT*GPEFDPSLANMV 


10772 


24673 


H 


10859 


189 


422 


NHTMDD FERRREL I RHKREEMRLEAER I 
AYQRNDDDEEEAAR*RRRRARQERLRQK 
QE EE SLGQVTDQVEVNAHNSVP 


10773 


24674 


A 


10860 


90 


273 


SHQE I E QNS AMAPR KRQGRG I S FMFYCL 
RNNDQRYMT*RL* SG IGWMLLS SGRKG Y 
ALPG 






A 






41 1 


APCAKPCGDWRSRGSLVWAMSGCNARKti 
DCCSRRCGSHL*N*IPTDWPLN*FFLTS 
AKVKE CFPKKES I YSQTVY* S PG * KM* T 
*DKEHPRYLIP*LCIQFYHLCWVTGTGG 
G I ILKHGD E I YI AP S G V 


10775 


24676 


A 


10862 


33 


302 


SRRATLIYVDMENGEPGTRWAKDGLKL 
GSGPSMLALNGRSQVSAPRFGRTFNAPP 
S L P IAT IRALGTVNRATEKFVKTNGPLR 
Q*QPRS 


10776 












DSQTPWPWPLCAAAGTSAGTSHSGSSSG 
AFSSWPCWTAAEPAARKRGRPAGSWSS P 
ATGAPGR CRHR I LSRGAGGS AG FVCSGL 
AESGL*ESSSPGRSQG*PQGQRHPQPNG 
LPAPPSTSV 


10777 


24678 


A 


10864 


245 


418 


TSKLAFP I S I P VI YANKVCP * FS KKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKVF 
FFEGPGFFF 


10778 


24679 


A 


10865 


132 


398 


LNMGKGDPKKPRGKMASYAFFVQT*REE 
HKKKHPDASVNFSEFSKKCS * RWKTMSA 
KEKGKFEDMAKADKARYEREMKTYI PPK 
GETF 


10779 


24680 


A 


10866 


115 


455 


LLTRNMDRLLILAGGMPGLGQGP PTDAP 
AVDTAEHVYISYLALLKMLKHGRAGVPM 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D- Aspartic Acid, 
E=Clutamic Acid, F*=Phenylfllanine, 
G=Grycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutam ine, R=Arginine, S=Serine, 
T=Threonine, V=Vnline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EVMGLMLGEFDDDYTVRVIDVIAMPQS * 
TGVSAEAVDPVI QAKMSDMLKQTGKPEM 
D 


10780 


24681 


A 


10867 


64 


483 


QGERPAAAMKI * SLTLLSFLLLAAHVLL 
VEGKKKVKKGLYI KVDSE * KDTLGNTHI 
KQKSRPGNKGKFVTKDHTNCTWAVTEQE 
EGISLKVECTQLDHEFSWAYAGNPTLCL 
KLTDEIVYWKPVARNLRLQNDIMIYSPQ 


10781 


24682 


A 


10869 


229 


1 


DTTILKLGRLI IliQWLLSEGKSHMSFTL 
NQK*EIIKLSKECMSKAELSQKLGCFCQ 
VGSQVMHAKEMLLKE I KSAT 


10782 


24683 


A 


10870 


934 


515 


KKVLLCLPGLGCGG VILAHGS LALPGS S 
SLHLS LQS S WGLTGM * YDARLI FVYFFR 
EMGS CH I CQAGLNS YNS SIP PTLVSQNV 
GTTGVSHRAQLAPVFCFCFVTLVDVKAL 
HFQKQENRIFFFFI LFPGGSRDEDRVA 


10783 


24684 


A 


10871 


373 


424 


EFKTNIiANMG* CLKSQHLGRPRCANHMR 
LGVQNQLGQHGE I P FLLK 


10784 


24685 


A 


10872 


1 


469 


RSRSGDSLPACDRPSGASALATCETIFS 
AISCFWDLPAPSLRLTPSCQPTMSSQIR 
QNYSTDTEAAVNSLDNWYLQASYTYLCL 
GFYFDRDDVALEGVSQFFREIiAEENREG 
YDRFLKMQNQRGGRALFQDIMT PAEDD * 
GKTHNAMTAAMGLETK 


10785 


24686 


A 


10873 


244 


2 


NIYSCKETFSVPLLAIHVYHFLVGRGGS 
QGTEMLWHRVDLRYREQAGHSGSHL*SQ 
HFGS PRQVDQLRYG VS DQRGQHGE 


10786 


24687 


A" 


10874 


83 


427 


ISLNMI R IAALNASS TIDDDHE * SFTRH 
NTQTKEAQEAKAFAL YH * ALDLQ KHDR I 
EESAKAYHELLEASLLREAVSSGDENEG 
LKHPGLILKYSTYKNIAQIASQREDLET 
AME 


10787 


24688 


A 


10875 


94 


1 


KSQKACNPSTLGG*GGWIT*AQEFTTSL 
ANT 


10788 


24689 


A 


10876 


2 


413 


GVTRGFKMRIEKCYFCSGP I YPGHGMMF 
VRNDCKVFRFCKS KCHKNFKMKRNPCKV 
RWTKAFRKAAGKEliTVDNS FEFEKRRNE 
PIKYQLELWNKT I DAMNRVEEI KQKRQA 
KFIMNRL * KT* ELPKVQDI QE VQLN 


10789 


24690 


A 


10877 


65 


417 


RFAGAGAI PEARA WPTDVHAAEEEKEMD 
LPDLASRVFCGRILSMVNTDDVNAI ILV 
QKNMLDRFEKTLEMbljNFNNIASARLEQ 
MSERIiLRHTRTL*DMKPDIiGSLFRPIRT 
LEWKL 


10790 


24691 


A ■ 


10878 


4 


442 


APTPDAMGHFTEEDKVTITGLWGKVNVE 
NAGRETLGRLLVDYP WTHRFFDS FGNLT 
SGSVIMGNPKVKAHGKNVLTSLGDAIKH 
LDDLKGTFA*LTEL,HCDKLDVDPENFKI, 
LGNEIiETDMAIHFGQDFTPEVYAYLQNM 
VTWAN 


10791 


24692 


A 


10880 


1 


419 


GKHIRQYHEEKETGQRIN IHE Y YLGNML 
AKNIiLFEKEREAEEKEKS YE I PTKNIQG 
QMTPYYPVGMGNGTPCS*KQNRPRSSTV 
MYICHPESKHEILSVAEVTTCEYEAVH. 
TPLIiCSHPK YRFRASAVND I FCQS LPG 


10792 


24693 


A 


10881 


54 


335 


REIFTMSGALDVLQMNEEDVLKNLAVIT 
HLCCTKTDS PMEQ 1 1 YGS * TE Y I YS I KL 
NRT*QNLMIiAVP*NYDI *NHDEVTVMTF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 

sequence 


Predict- 
ed end 
micle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K-Lysme^'lJ^L^uri^^ 
N— Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














NNTLQMFEP 


10793 


24694 


A 


10882 


216 


3 


MSIKSMSICCLRKKGKHGFQIRGK*NSG 
RETERERERERKRARERARGQERERQKT 
SEGGRERERERKKE 


10794 


24695 


A 


10883 


3 


390 


GELKCS KEKCQSMS AKSRT I W I IGAPFL 
KGQPRQGVEEGPTVliI KAGLLEKIiKEHE 
CDVNDYGDLPFADIPNDSAFRIVKNPRS 
VGKASEQIiAGKVAEVKRSG* ISLiVXiCVN 
LGEGLECLCGHAKVYSR 










3 


375 


STMRAWIFFLLCRAGRALAVPQQESLPD 
ETEVVEETVAEVTDVSVGANPVQGEVGE 
FDDGA*ETE*DWAENPCQNHHCKHGKV 
CELDENNTLMCVCQDPTSCPALIGEVEK 
VCSYDTLTFYSS 










95 




KCTQGPSA VAHAClM P N .1'KUlJ.KliliW i T * (J 
QEF 


10797 


24698 


A 


10886 


1 


114 


MGFHHVSQDGLDLVTL*GACFSLPKCWD 
YRREPTHLAH 














E PTMELTTAEKTNGKAAWRKVGAHAGE Y 
GAEALERMFLTFPTT*TYFPHFDLSHGS 
AHVKGHCKKVADALTNAVAHVDDMPNAL 
SALNYLHAHKLRLDPVNFKLLSH* LLWT 
LSAHLHVEFT 














TPFFQKILKLTRGGGKLLDFQTPGRLRQ 
KREDPLFPGV*NCN* P*SHPWWGTKQNP 
VSKKKKKKLKTSS I FVHRVSMSPI 














fPT t Tier UMnPTPTPMumTT m ^TMJnfe — 
Car ijLiAPljC.MyaljCjl PMNKIXjQIjTIAEQE 

TFLTPALLLPIPHQTYSTASAVPLAKPD 

T* PKDVGIIiALQVHFPAQYEDQTDLENY 

NNVKALKYTEDLGQTLMGFCSVQEDINS 

LCLTVEQP 














TLPGNIG I S FVERVMEVLRPQL I RIDGR 
NYRKNPVQEQTYQHEED**DFYQGSMEC 
ADEPCDAY*VEHTPQGFRTTLRAPS *LY 
TPIVG 


10802 


24703 


A 


10891 


405 


49 


I PWMGTERTRASAQRDVAGPARNPACPL 
PEQCFPGPAEWTGPLTPPGSC* IiRPGER 
ERHGPGCWGPRERLSSGQTHPPSPQGCA 
NTCPCHKHQQELLAGGVAVFYKLFYRYG 
NSPKWDL 


10803 


24704 


A 


10892 


2 


197 


PQPLRVLWTAHLAAMAPSSRTSLLLAFA 
LI CLPWLQKAGAVQTVPLCTLFYHAMLiQ 
THRAHQLVIDTYQELEETYI PKDHKYS F 
LHDSQTCL * FSDS I PTPSNMEETQQTSN 
L*LPDIPAPGFCPDLPALASKGWCRPNR 
SAMHAFLPRYAPNASRAPTGH 




24705 




10893 


3 


334 


DQLPEPLKVLWTAHLLAMAPGS RTS LLP 
PYALL*LPWLQEAGAVQTLPVSRLFDHA 
MLQAHRAHQLAIDTYHE FDETYI P * DHK 
* S FLHDSQTS FCLPDS I PTPSNMEET 


10805 


24706 


A 


10894 


3 


341 


LLTPGVS DAI CY ILDSGYI IMSDTFTAY 
VIG*RFEVNG*HATVRFAAWPPVALPW 
LGV*WDNPERGTYDGTREWTVYFKCRHS 
TGGFFIRSNKVNLGTDFVTADKNLYVLD 
Y 


10806 


24707 


A 


10895 


25 


351 


AMIQTRDLQGGRAFGLLKAQQD * RLDE I 
CTQLLDDLKYSNDEDLPSRLEGFKEKYM 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














EFDLIGNGHIDIMYLKLMLYKLAVPQTD 
LQL.PRS I GEVYSGSGETISYPD FLM 


10807 


24708 


A 


10896 


364 


38 


F S QFKI AQ FLKK I P F P GFSP FFQNGGNF 
LKFSKFFPNKRGFFPHPFF*KRGPFFCP 
KWGPQGHIGAHGPPAPQGQKIPPFQVPK 
KMGIKGPTPPPGFFFFFFFFFFSW 


10808 


24709 


A 


10897 


136 


3 


STLiKHR* RPGMVAHACNPSTLGGRSGRI 
T * GQQ FKTS LTNMVKP 


10809 


24710 


A 


10898 


193 


269 


GNKYCLT ILLLLLLLS FKRQSLTLLPTL 
ECSG*TRTPGPKGSSLLNPS * VAGHGGV 
CL * SQL I RRLRWEDHLS PGV* GHSDPRL 
HHCTIiTWATE*DSVS * KTTTTTTTK 


10810 


24711 


A 


10899 


316 


374 


CCCCCCCCCC* CCCCCCCCCCYKETNFE 
QQ 


10811 


24712 


A 


10900 


38 


206 


WCVLVFTICTLLCNTSLGLFHPEFFFF 
ETESCCVAQAGLQWHHC* SLRLLPHRI I 


10812 


24713 


A 


10901 


364 


2 


DYQHISPEKHCRPEGSGMVYLMCRKKKK 
RKKKNLSTKILNPQPS FILi*KPRNSARK 
S*QKEITKIGADSLIIENRKRIEKIYIN 
ETMS W F I E I LNKI DQPLARLTM I ERKDS 
TKFRNERG 


10813 


24714 


A 


10902 


132 


3 


ATSSPWGYSAYRVAI LLFLYFSNKLAFT 
VLYGFV*NYFLQEI 


10814 


24715 


A 


10903 


234 


352 


NFCFFETGSCSVT*AGVIMAHCSLiDL>PG 
SSNSLTSVSEE 


10815 


24716 


A 


10904 


250 


356 


TMEMMLD KKQ I * A I FLF * FKTGDKAAET 
TLNINNAL 


10816 


24717 


A 


10905 


280 


322 


QT*SLILVSLIICIATTNLLGLLPYSFT 
PTTQLS INLAMAI PL* AGAWIGFRSKI 
KNALVLCTSTPLPTSTMLPVHMMDTSSD 
IHPKI * S I R * QTC I QHRLLL WLKC 


10817 


24718 


A 


10906 


271 


365 


KWRPGWAHTYNPS ILGSRGGWI T *GQE 
FKT 


10818 


24719 


A 


10907 


135 


358 


LRYKLTN P KLTLLTCGL I FLKGN IVNI G 
QCNGVHM * YQHFGRPGQDCLS PG I QGQH 
GQHKESLSL*KSLKISWS 


10819 


24720 


A 


10908 


107 


212 


I KNEGMGQVRWLMPVI PALWEA* VGGTP 
EVSHSWL 


10820 


24721 


A 


10909 


165 


383 


P KNRP 1 1 PLKRSS I SNPGDFQKSLRPGL 
TPMGPHFKRGNYS YDENLENLCPVGGNK 
GPGSH * GLLTCETCRGF 


10821 


24722 


' a" 


10910 


119 


291 


GGYRFYVKDTF*K*GFWPLTLFIILKPL 
AGHGGACLWS WRLMRLK* EDHLSLGGRG 
CS ELWLCTLA* ATE + D P VS * KVRLLALN 
FVYNTKTFSRAWWCMPWLATHEAEVGG 
SLESGRSRLQ 


10822 


24723 


A 


10911 


283 


2 


SDNTTDTFLPFIYKYIFPITWEQGETWK 
KNPRNSRLKKTLRSETIAQIPLKCNLWP 
GMVAHACNPYTLGD * GG WVT *GQE FETS 
LANMARPSC 


10823 


24724 


A 


10912 


1 


360 


PHAFGTMKWVTFMSLLFFLSWANSRGGF 
RQNAPKS E VAHRLKDWG KKNFKAWG * MA 
LAQNLKQGPFENHGKLGKEVPEFAKPGV 
ADDAAENGDKWLNTLLGNYLAPVAAVRE 
TYGERAEC 


10824 


24725 


A 


10913 


270 


361 


S KTWPGTVAQTCNPRTLGS QGGW I T * GQ 
EF 


10825 


24726 


A ■ 


10914 


157 


I 


VFFQLSRLAWKGFSRFFFVFETESHSVA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H— Histidine, I=Isoleucine 
IC=l_,ysine, L=Leucine, M=Methionine 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














*AAVQWRDLGSLQAPP PGFTPFSS 


10826 


24727 


A 


10915 


1 


357 


LEEWGPEREFAEEEKEKKNTKKKKIKNS 
TPPKKPARKGEEKPGPFKRAITTI FVAR 
IPLFCLKGFLWPSC*IIGRSSPTPGSKG 

AVKNPIL 


10827 


24728 


A 


10916 


102 


471 


PSTPI I LTSSY PHYVKSSVTSTFI I SIiF 
PTTIFMCLYQEFI ISN*HLPTTQTTHLS 
LSFKLDYFLIIFILLTFLVTWSIIEFSL 
♦YINSNPNINQFFKYLLIFLITILILLT 
ANNLFQLFIG 


10828 


24729 


A 


10917 






REPTMVLS PAD K TNVKA VW G KVG AHAG * 
YGAWLFTWLTLNVISD 


10829 


24730 


A 


10918 


178 


393 


LVLCTVLLMWRFHFHDSLSILKAQDVTE 
♦GKWITQGQEFETSLA 


10830 


24731 


A 


10919 


278 


124 


TAWAD WGSNETP FLLKLPKKLT * GGGVC 
LE FQVLGRVR PKNP FNLGNQG FN* P KFR 
PWTSTWGAKQNFV*KKKKKNKNKKP*NS 
RHTPPP 














IIIGSPLIDIKISKCFLKIVAFGQAQWP 
VI PALWEAEAGRS PEVGS PRPA* PGSLK 
VHVDNNW ES IN 


10832 


24733 


A 


10921 


33 


292 


GTMWLEIRHAVEVQCVLVSELVIPTSGD 
KPEQC*DHYLIT+YLILGKWGI**ISGA 
LE KKKKKKKKKKKKKKKKKKKNKI KTGG 
FI 


10833 


24734 


A 


10922 


108 


3 


KLAVYGSVCL* SQLLGRLR* EDHXjNLGH 
GGCSEP 


10834 


24735 


A 


10923 


106 


2 


KLAVYGSVCL* SQLLGRLR* EDHLNLGH 
GGCSEP 


10835 


24736 


A 


10924 


195 


2 


ASLFKSQLNRAFIYLFIYVFIFETGSL* 
PMLVC S G V I TAHCNLHVLG SGDPPTSAS 
QVAETMVL 


10836 


24737 


A 


10925 


127 


2 


YVC I LKNLKGRPGMVAYACN P S TLGGQG 
GWIT*GQEFKTSL 


10837 


24738 


A 


10926 


639 


282 


FFLS IKGWVQQFMPI IPTPWGLKQEDHL 
fctr ijiAftuyf vyrioM rroJUJjK-L *KJjAKRGG 
ACL*SQLFRKPR*ENCLSTGELKPKGGI 
F I RLVQNKI pttrge kr KQSHTGS YQ CP 
KIKKKG 


10838 


24739 










ALL TQ AL TCRQ AG AEAPHAG AT P S LM P P 
SLPQGFRDCSPSDAAYTMEMTIDKKQNQ 
VIILFKFBCMGHKAAQTTRNINNAFGPEI 
ANKG TVQW R FKN F CKRDE S REDDE * YAQ 
PSKVATDQL 














TEAELLTLYLLPNALLNHFTSPPLMFAD 
RRLFCTNHIDIGTLYLLFGA*AWLGTA 
LSLL I RAELCQPGNLLCNDHI YNVIVTA 
HAF V IIFFIGLPIII GGFGN * L FPL 


10840 


24741 


A 


10929 


180 


350 


EPMAKGKTESPGPKRCGP*I*WVISQRG 
TLRFRGAGLFFMGEFLRLGENLLEIPRG 
A 


10841 


24742 


A 


10930 


361 


379 


RRYWWLG* VQWLMPVI PALWEAKAGRS P 
KARNL 


10842 


24743 


A 


10931 


121 


543 


HRNTGSTHASAHAYHI VHTNP * PLTGAL 
SALLMTS S LAMG IHFHS ITLL I LGLLTN 
TLT IYH* WRDVTRESTYQGHHTPPVQKG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/^15,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
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X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LQYGI I LY I T * * VFFFAGFF * AFYHS I L 
SPTPQLGGHWSPTGITPINPLKDPLLNT 
S 


10843 


24744 


A 


10932 


227 


339 


VGGVKSVQLIiLNCHFSR*MKKKKKKKKK 
KKKKKKKKKKKKK 


10844 


24745 


A 


10933 


64 


409 


DQRNKSAHLRAHLKKKKKKKKKKKKKKK 














WAFFLPIoLKRKFVGENFLKKNFFWGGNI 
GGQISSKKYRGGGK 


10845 


24746 


A 


10934 


2 


362 


NKLSGPYPEKVGYTLPVLMNPLAQLVI Y 
STIFAGTLITALSSH*FFT*VGLEIDML> 
AFIPVLTKKINPRSTEAAIKYFLTQATA 
S 1 1 LLIAILFNNI LSGQ * T I TSTTSQYS 
SLIILRAM 


10846 


24747 


A 


10935 


154 


3 


GCPLS PLLFN I WKVLVRAVMQEKE I KS 
IQIGLEVK*SLLTDNMILYLGN 


10847 


24748 


A 


10936 


250 


363 


TAWPG YLYSFS FliY*ETTKI W3JGAVAHA 
CDPSTLGGRG 


10848 


24749 


A 


10937 


199 


339 


NVLGILLVFRI I VEI YKCGRLWLGTVAH 
TCNPSTLGCQRGQTT*GRE 


30849 


24750 


A"" 


10938 


130 


240 


KNEQDPRDL* DNDKWPNIHVIGVPEEDK 
DNGTERVFD 


10850 


24751 


A 


10939 


35 


235 


FILVENTKKMC I CPLLNMI YAI L.YLF VP 
SVFI>+EENKQ*GITEKKKKKKKKKKKKK 
KKKKKKKKI KPGGL 


10851 


24752 


A 


10940 


179 


12 


DQPGQHNKTLSLQENKNKSSWVQCCV* P 
QLLGRLRWEDCLSPGGRSCEEPCSHSG 


10852 


24753 


A 


10941 


297 


278 


INQDNELTLINQSSKQIKHKNINQTIiRT 
KMNENLFA* I IAATILGLSATVQI ILFP 
PIIjIPTSKYLINNRLITTQQ*L.IKLTSK 
QMITIHNTKGRT*SIjLGGD 


10853 


24754 


A 


10942 


3 


318 


fgggrgvrnypdafvlsvllpsrllfph 
lfphlfplppflglapycfltl* yf* k+ 
*l yysqs i ifffyremknkllvfwainl 
fflyyynymemwtlicywkn 


10854 


24755 


A 


10943 


190 


3 


KLS HKQS VYFKTPF I * KNHVLFK* KNLW 
VYPNDKFFFFFFESLALSPRLQCNGTIL 
AHCNLR 


10855 


24756 


A 


10944 


281 


1 


KKWLFSS YSSLYGNGLFLI PPFLRVSGF 
GKGFWKKFFFLRARGALFWGS PLKGFFL 
GFFWVFFFF*DGVSLIJLiPRLECNGIISA 
HCNLRFPGS 


10856 


24757 


A 


10945 


169 


319 


NGVEDAFKNMVLGWAQWVTS VI PALWEV 
KVGGS PE VRS FRPTRP I * KYGFRLGTVG 
YICNPSTLGGQGRRIT*GQEFQTNPANM 
VKPC 


10857 


24758 


A 


10946 


312 


1 


FARPGLLKSWDFQP*PLDPVWGFFFKRF 
PENWLEFEIFPFLI IMAKRKKLSKSLFP 
FPFLG*FKD*GQKTPFFFFETVSLL1>PK 
LECNGAISAHCNLCLPGSSD 


10858 


24759 


A 


10947 


208 


329 


IFTGDRSRNNRIG*ARWLTPVI PSLWEA 
EAGGSLEPKSLR 


10859 


24760 


A 


10948 


116 


374 


FYFGYLLFFFCFFEMAPCSRG*SWSARG 
QSFCNIjIFPGSSDLYVSAFRVARITGAR 
DHAC 1 1 1 VFWDGGCVDT I TSGLAGWLR 
RG 


10860 


24761 


A 


10949 


264 


434 


QLAFCTDTLTSVREQCEQL*KCVKARKR 
IELCDEQ* SCRSHTD* DCTDELFDFLHA 
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10861 


24762 


A 


10950 


226 


429 


NFGFRLDKTVFFSKIRINAWAQWFMPVM 
PELWEAKAGGIT*AQELETSLSNIVRPW 
LYKKFKN* VWWC 


10862 


24763 


A 


10951 


188 


433 


YSAEWKIDLGIEVFWVGKMTYKQ*FFWF 
VFKFLFF*RQGRTLSSKLEWMSVITGHC 
NLDLLG SRETFAS AFQVT WT I GS CHS 


10863 


24764 


A 


10952 


298 


445 


LFSKCSSKSIKEEGPGVVAHACKPGTLG 
GRGGWIT*GSEFESSLANMVK 


10864 


24765 


A 


10953 


244 


1 


KTPQGLWFCLFKKS PLKGI KGVSMGVL 
ICFFLKSRINLFNPRGLGQRKNFFFFFF 
' ETDSYSVAQTGVQ*HNLGSLQPPSC 


10865 


24766 


A 


10954 


199 


3 


QETKKEQNKENKQIK*RSTRKKHRQGTN 
KTKERGERQTPPVGNRQTPTLGIHARPR 
' RRATTSPRA 


10866 


24767 


A 


10955 


157 


3 


YSYTFSFIITTSILIIQTLSGHGGACL* 
SQLLGRLRQDNHIiNIiEHGGCTRA 


10867 


24768 


A 


10956 


263 


357 


GLSLKVLTQWLGAVAHACNPSTLGGRGG 
*IT 


10868 


24769 


A 


10957 


333 


2 


KWKADMLIPLVIKSFFLGIFFKMFTNSK " 
RYIFLLQLLAQHSSFFSFIRCMEL*WPY 
PIPLCYGQSNQPS V* LAYCRDL FNEL I F 
FFFFETEFHSCCPGGISAYCNIiCLSC 


10869 


24770 


A 


10958 


130 


228 


GSSFLGGSPSVAQAGVHLPDHGSLQ*DK 
SPSVQKMSKWIGCSGACL*SQLLRRLRC 
QGRLSP*GQACSEP* SGRCTPAWATEGD 
PPKKLLPYTTDS 


10870 


24771 


A 


10959 


250 


3 


VQACS ITGALYQRRNAEDEPQTARP ISG 
FTTSIAMRLMLLTCSGHIWPVADLAFSG 
IMP * GLFFPSRQNliALS PRLECS ARA 


10871 


24772 


A 


10960 


199 


326 


VLGRMWSNQKNYTLLARM* KI K* * NNTR 
CWGGCGATRRI IHC*QECRI * KCQPQSL 
WRTV* Q FLNTLN I R 


10872 


24773 


A 


10961 


323 


3 


LLFFHLPDNWQH * YVFCHYNLS FLQF YI " 
N* I IQLWFCVWLIiSMS 1 1 LKSHPCHFS 
K*LSTVTFFFRQSRSVSRLEWSGAISAH 
CNLCLPDSSDSCASAS *VAGSC 


10873 


24774 


A 


10963 


120 


3 


PFFFFFF* EMRSCFVAQARVQ* CDQSSL 
*PSTPGLKQSSC 








10964 


168 


441 


LTPVI PALWEAEAGRSPEVRSSR* S PP 


10875 


24776 


A 


10965 


19 


207 


APLKLNVAMELSLGQWDVSRSNLWE I PL 
KKGDTGRVQWLTPVT PALWEAEVDRTPE 
VTSVTRC* PLKLNVAMELSLGQWDVSRS 
NLWE I PLKKGDTGRVQWLTPVI PAL WEA 
EVDRTPEVTSVTRC 


10876 


24777 


A 


10966 


209 


1 


TKSALSNMWTTHIGLCKNFCIVL*DRV 
SVTQIGVQWHDLSSLQSLSPRLKDPPTS 
AS R VAG I TGMHL V 


10877 


24778 


A 


10967 


109 


1 


DYLRSGV*DQPGQHGKTSSLLKIQKLAR 
LVGPIISC 


10878 


24779 


A 


10968 


141 


2 


GLAMLPRLENI I FRPGTVAHACNPNTLG 
GRGGQIT*GQEFKTSLSC 


10879 


24780 


A 


10969 


175 


348 


LVYECKHILCVFNTSLFFPH*TCYK*RG 
VLWLGEVAHACSPGTLGGRGGW IMASGD 
RD 


10880 


24781 


A 


10970 


108 


2 


KTKKLHMRSGWAHTCNPSTLGG*GGWI 
T*GQDSC 


10881 


24782 


A . 


10971 " 


184 


2 


SFLWKLHLRGVPGRVRCQSAPTRGASQL 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GYLGVRDPLEEAV* LFSDLKLCAGKTTT 
LFLV 


10882 


24783 


A 


10972 


245 


339 


SQ I SGPGTVAHACNPS ILGGRGEWIT *G 
QEF 


10883 


24784 


A 


10973 


295 


340 


KRVFF*ARKKNPFLSPLWGLTLENPLGF 
RLFLNPFLGLKISQFFFPRQI * PLAFFF 
WGTLPRPLNLKFGPFFFFFFFFFFF*DG 
VSFCCPGWSQARRYIEINISLSTCPRA 


10884 


24785 


A 


10974 


186 


340 


RPYLTI RFKGHPPNEPIQ* S FFVLFCYC 
FDTESRSVTRLECSGSNSAHCNL 


10885 


24786 


A 


10975 


253 


491 


FLECTSLLKRQRVHPGQHSKAPPTKELQ 
TTGRGGALLQSHLLRRLRQEHCLGPGVG 
S YSEP * LHHCTPAWVIEGDS SKK 


10S86 


24787 


A 


10976 


141 


1 


SFS FFKFS PTGDM IGFF* HFFFFIiRRSH 
SVAQAG VQWCHLS SLKPRA 


10887 


24788 


A 


10977 


3 


138 


HEE TGFHLVS QDGLNLLTS * STHLGLPE 
CWDYRREPPPGREGDF 


10888 


24789 


A 


10978 


186 


323 


YESRS ATQAGVQ W CELGS L * PSTSRFQ* 
FFCLNLS STWDYSGliAPS 


10889 


24790 


A 


10979 


183 


3 


IKILFPFFFLRLSLALLPRLECSGTFSA 
HCNLCLLGSSDSAS*VAGITGTTTPSPM 
QLV 


10890 


24791 


A 


10980 


309 


55 


KNFGPNWVKFLGGKGGEMAFLGXFFFPF 
F FFFE KE S WFVFQG * MQWGDFRS LQGP P 
PGVTQISRLGKIGGPLFFKKKKKRERGN 
M 


10891 


24792 


A 


10981 


637 


838 


SQHLGRPMRVDQLRPGV*DQPGQHGEMP 
S LLR I QR* AGHGGTHL* SQVLRRVRQDN 
CLNSGGGGCSE 


10892 


24793 


A 


10982 


275 


2 


RNRTLKMEFLSWFWGFWNWLLNMIRPKN 
VKDSTS KSMENTDS PWHELFKELGK XNA 
FDTPDSLLVRGKFSDSIHNTFDHM*RTK 
EYNEARA 


10893 


24794 


A 


10983 


98 


346 


GHGHATLRGLCVLTFSFHITALSVSGTN 
DAEDCCLCETQKPICGYIERNLLYLLIK 
DVCRVPAW*WERVYSLISRYSLWRD 


10894 


24795 


A 


10984 


30 


410 


LPEFTGRPKRTRTRGFSTNHTDIGTLYL 
LFGA* AG WGTAVS LL I RAELGQ PGNLL 
GNDHMYNAIVTAHAFE I I FF I VL P I I I G 
GFGN* LVPL I IGAPDMAFPR INNI CF * L 
LPPSLLLLLASAIAE 


10895 


24796 


A 


10985 


141 


360 


QTLRTKMNENLFASFIAPTILGLPAAVL 
IILFPPLLIPTSKYLINNRLITTQQ*LI 
KLTSKQM I TIHNTKGRT 


10896 


24797 


A 


10986 


3 


347 


HELRTKVNEHLIASFMGPTSLGLPAALL 
I I L* P PLLI PTS KYLI SNRLI TTQQ* L I 
KLTSNQM I T IHNTKGRT* S L I LES LIU 
IATTNLLGLLPYSFTPTTQLS INLAMAI 
PL 


10897 


24798 


A 


10987 


46 


317 


KSMTP IRKINPLI KLINHSLIDLPTPSN 
ISA* *NFGSLLGACLILQITTGLFLAMH 
YSPDA*TAFSSIAHITRDVNYG* IIRYL 
HANGAS 


10898 


24799 


A 


10988 


245 


1 


VS CLLEGKLTNRKD IHTKNPS VHHHHQR 
PKVDKTTKMGKKQNRKTGNSKKQTASPP 
PKK*SSSPATEQSWMENDFDELREE 


10899 


24800 


A 


10989 


255 


1 


SCLPWANWISVQNQHKKTFLVIRTFFPI 
GS FHVT * LLGPPL I LIS PPRI FFFFETG 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 














SFTVARAGVHSGAISARCNLRLLGSSNS 
A 


10900 


24801 


A 


10990 


166 


307 


GE KP WG PLI WGGL VGR S S WAS CQVLE LL 
*E^^VFRHVAQAGL.KLLSSSNPPASASQS 
AGITGVSH i 


10901 


24802 


A 


10991 


109 


330 


TNQFKTKKEREAGKKKVKELERERERGK 
KRKEQRKNEKERR* P*NMD*RRIRERNH 
SFDVYEFLICGLLNLLHV 


10902 


24803 


A 


10992 


244 


326 


GRYLKGWIiDGPA*AWIiGTALSLLIRPE 
LGHPAI LLG IDLIS S VI VTAHAWS I LLV 
RVI P I I I AGFGN* LVPL I IGAPDMAFPR 
INSISF*LLPPSLLLLLPPPI*QMRAGV 
GEREVRVR 


10903 


24804 


A 


10993 


101 




KTFWARFVGTCli*SQLLRRPRQEDHLRlj 
GGRGC 


10904 


24805 


A 


10994 


177 


1 


TPSLLFFVNI ICFCLNPGGGGYSEQKLC 
LCT PAWVTE *NS I S RPSQNNS RQSRQ VN 
HLF 


10905 












FLFVFRFWHGYPFPGAFFSTQKWLVF*T 
MVLFFFFFS*VSLLPRLECSGVISAHCN 
TFLPGSSDC 


10906 


24807 


A 


10996 


184 


379 


LCGMVYHFPS ILLIiYI>CKLFFSPKLEF 
SSCCPLECNGTVLAHRSLRLPASGDSPT 
SASRVAGHGGTCL 


10907 


24808 


A 


10997 


136 


3 


GRVDGQHIMTHQSHAYHIVKPSP*PLTG 
ALSALLMTSGIiAM*FH 


10908 


24809 


A 








PWAYAEPIiTLHDATYSCT*S1jIRASFTL 
FIA* TTRLRLLP * S FTPTTQLCMNLAMA 
IPL*AVAPDIGYSSMITNALSHLLPQCT 
PTPLISILGIIETVSLLIQPITLGVRRT 
ANIT*CHLLMHIjIGSATI,SISTI 


10909 


24810 


A 


10999 






Q*IKYLILIPSNLFHNKPNIiKTAYSVPP 
SPNLHLPS SSNS PASASQVAGNAGARHY 
AWLIFVFLVETGYSLLV 


10910 












I WG I XjQ S RR CG S RL * S QHFGR P RQ AD C 
FTPGVPDQEYFTLLPEWFFLV 


10911 


24812 


A 








uv^iji \jk. JLo X J JLi\.ltIlsJtSJjbwoijr zrJjLiJj 
LLLF * EGGSHSVTKLECSGGVSAHCNLC 
LLASSHPPTS S SQVAGTTG 


10912 


24813 


A 


11002 


60 


385 


GD3TLLSHTLRAHLIENLHAEFIDRTILG 
IjPAA* LI I LFP PLLi I PTF * YL INIRLMA 
NRH*I>IKLTSERMITIRNSIGRT*SIIL 
VSLI I T I ATANIiLGLLP YS I TRTT 


10913 


24814 


A 


11003 


214 


83 


S KKSASPLTFNI VFQVLDNS VREEE IRY 
TEMEKEEIKLS*FVD 


10914 


24815 


A 


11004 


241 


368 


S YLL * PSP * PLTGALSALLMTSGLPMRF 
HFHS I TLL I LGLLTNTLT I YQ * WRDVTR 
ESTYQGHHTPPVQKGLRYGIILFITLEV 
FFFARFF * AFYHYS LAPTPQL *GHWPLT 
GINPLNA 


10915 . 


24816 


A 


11005 


3 


396 


HEPHALGMPLTADLPSMASCSQTSLLLL 
LHLLHL P W I * EARAYQAAG CS KL FDHAM 
LQAHRAHQLT I DT YQE VEE TY I PEDHKY 
SFL.HDFQTSFCFSDS I PTPSNTEETYQK 
SNLELLRISLLLIESWLEP 


10916 


24817 


A 


11006 


156 


335 1 FGCCCCFLFF*LVYCCYVMMLYYIYVLI 
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K=Lysine, D=Leucine, M=Methionine, 
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SFGFYF*YWIM+YFYRLVYNIYI,VLFYL 
LYMS 


10917 


24818 


A 


11007 


107 


2 


KCSSSKHFTKEDSQITNKHIEKCSS*LL 
VREMQII 


10918 


24819 


A 


11008 


129 


412 


KTRSGFKKTPGQKGKTPLLKKTPKITGP 
GGGALKPPFSGGFRPENPLNLRDKGSSG 
PKTPTFFSPWGKK*NFFQKKKKNLLYPG 
KIFFFWMKNI 


10919 


24820 


A 


11009 


63 


268 


CTFKFKNHCSTIQS XT * LDRS EKKKKKK 
KKKKKKKKKKKKKGGPL* KNFSIiKGG 


10920 


24821 


A 


11010 


154 


422 


ttEFS FFFFFWiCPKP PFFAQVGGQYRNWG 
*LKPRPPWLKNFSGFASGEAGITGGVPH 
PG* I IiVF * VKAGFH I ENLVWNS ETGNQ P 
ACASQ 


10921 


24822 


A 


11011 


23 


375 


TRYLTKI KKKKKKKKKKKKKKRGAP FLK 
TPWGAPFFPGSAI FIFFFFGGFF*TPLG 
FFWETLF FWGG KI LGHLS P K I LPFWGKK 
NFFWVKGGKNLLMS PFLKI FFLGVFFKK 
FFPPG 


10922 


24823 


A 


11012 


151 


361 


ACTTTPSSFFSNIDTVSLCCPG*VPGHN 
NSSLINFPSFLRSWYYRHAPPHPAHFFP 
ILI RSRYVAQAGLE FLALRDTP I SDCPC 
TVHYPPSIiAN*SVFFFFLTQSFVSLPRL 
KRK 


10923 


24824 


A 


11013 


146 


378 


NFWFSIEFLHICY*IYSFICLIKVFL* 
* KKKKKKiCKKKKKKKKKKKKGENIiKGAG 
GGKKFLGGGKNNS FFRNGGCF 


10924 


24825 


A 


11014 


82 


264 


TEIiFLYTICNWSAILYIIiC*RNKVFIjSF 
INTEKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKIERKI 


10925 


24826 


A 


11015 


381 


2 


GRTALFTLLKGPIHRPQMFFTPLRVFPG 
S YTGIS IRQSGFFFIKRVI FFRL* * KAL 
FLLPFLLPFFSFFLKIGTYSVAQECNGM 
I IDN*NFELLGRRNPSTS ASQ I AGNTSA 
Y*HTQLVFIiCEDG I 


10926 


24827 


A 


11016 


1 


251 


PNCIVHSLRTGTLNFIILLLLCFTFE*F 

Lnknlifesqkkkkkkkkkkkkkkkkkr 
gaplkktpggpkfypaskkkispqkgg 


10927 


24828 


A 


11017 


234 


1 


DINIKGSWVKGIPKPSVLSLQLFCNSKI 
FPN* KL I KKKNPNEPGVMVHVCNPSTSG 
GRNGQMA*GQEFKASLANMAKP 


10928 


24829 


A 


11018 


107 


342 


FQLC IGAl VHC FLFMKQHSE YKS TDHRA 
SSKCHTVQGSSQLSTSAFKQK*THKKKK 
KKKKKKKKKKKKKK * KKKGGAS F 


10929 


24830 


A 


11019 


264 


1 


CFKPUjFKYPGI IKAVNLPVSTASATFH 
KF* YVTFS FS * KKFLCVGGKTGSHS ITQ 
AECSGTI KAHCSLELPGPGDTPTPTSRV 
AARA 


10930 


24831 


A 


11020 


1 


288 


RTRGTSSRSRAATLFFFFP FKKGGFFNG 
NLPTTFGKLGNKKFPLNLFPGGP PLPHG 
GLKKGPGAPQVPPLFLKKPQTSPP*GLR 
EGPPPLKKFYLF 


10931 


24832 


A 


11021 


115 


287 


STCEI S FGCSRGCCWDYLMVWEFSHFYF 
* FLETECCSVA* AGVQWSNLS SLQSLHP 
E 


10932 


24833 


A 


11022 


186 


1 


RPKVDKTAKMGRDQSRKA*NFKNQKASS 
PPKEYNSSIiAREQNWMENEFELTSVGFR 
KSVITN 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
etb 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyiatanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10933 


24834 




11023 


180 


353 


IFCFCI FSRDGVS VC * PGWSQS PELM I H 
♦NTKIS *AWQRVPKIPATQEAEAGESLE 
PG 


10934 


24835 


A 


11024 


155 


1 


PQARRGGTCLQS * LLRRLRLRQEDRLS P 
GV*GYSEL*SCPCTPAWETERLV 


10935 


24836 


A 


11025 


22 


325 


KLILIiYDPQLTFIFIjNTRAVVAHCAFCC 
WPIHFVFGNYGFVFSCLFTFIFVYSGKA 
DIiFFKPKKKKKKKKKKNKKKKKK* KKGG 
RGLKNSWGGPIFWGFPK 


10936 


24837 


A 


11027 


127 


259 


GQVQWLTPVIPTLWEAMVGGSTKLRSSR 
PFFSTP* PCI VMILYG 


10937 


24838 


A 


11028 


37 


408 


IASGALFFFFKAAGKRGDFKTEGAYQRQ 
RTI FKNKKRARRE KTGRENLRGNYKNMG 
RGIjKTPRGALGGPYLDKKGPLRGKGPFQ 
GGTLLGGGPKIKIQGTTj*IRRNYWPYIR 

rynrfkkr qknm 


10938 


24839 


A 


11029 


146 


3 


LLVLKDLPRIMVRSAPTYIYIYTYLYIY 
I YIYTHTHTHTF I T * VLFC 


10939 


24840 


A 


11030 


268 


435 


IjMCLRNETN* F* KVE * WLGAGAHTCNPK 
TLGG*GGRIA*GQEFETRLRNIVRP , 


10940 


24841 


A 


11031 


322 


2 


KLPHLQQQCVKVNCYKPFLKNGKIGHGG 
VCIi*SQPLRRLRQENHSNPGQCTSAWVT 
QRDSVSKNKKKRKMEKYFSSVRQNKPVK 
HTKRMSPTKADTKSTHCGGGRV 


10941 


24842 


A 


11032 


251 


380 


HKRLHTI YYFVPLKKKMGPGL VGHT YNP 
S TT/3GQGGQ I T * GRE 


10942 


24843 


A 


11033 


64 


324 


FWFFFFFFFFFLNPPGGGKKKKKPPPPL 
GKKKKRGKKGGEKFFS PPPRGGGGGPKK 
I SLKRVLKFF*NPPGGTKKGGG P FLIPP 
GSP 


10943 


24844 


A 


11034 


420 


534 


CPGWAHAYNPSTLGS *GRQITIDREIE 
TIIiANMAKPH 


10944 


24845 


A 


11035 


196 


1 


KNGHSLPPGPQKGTPFPKKKKTPKINLS 
ELGKSKAYSWPGTWHACNPSTLGG*GG 
WIN*GQEFE 


10945 


24846 


A 


11036 


10 


248 


PSDR*LFSTNHKD IGTLYLLFGA* AGVL 
GTALSLL IRAELGQPGNLLGKDH I YNVI 
VTAHAFVIIF 


10946 


24847 


A 


11037 


32 


405 


DYVSKRKEKREKKRNVILETS I SSHLVE 
WMLCSRYHPLIKKRSSVRILIALLCPQK 
VRDMS*GIGSKKQCWDSWLSIWIKMESD 
PFLIPYPKINA*RLKDLCERINLKI UK 
REYLHDYRVKKV 


10947 


24848 


A 


11038 


337 


448 


KNPRGFFGKNPFFWGGPFGGPPPPKKMG 
FGEKKKF* FKRSKKKKKKKKKKKKKKKK 
KKKTG 


10948 


24849 


A 


11039 


307 


415 


RFVCSTI KVLRDLSSDRSNPGRVLSTSN 
SSLK*KKK 


10949 


24850 


A 


11040 


213 


403 


VHRGIKYYFIjNKLPDFPRQTFKK*G*RL 
GAVAHACDPS TXiGG * GGWITLGQTFETS 
LTRMGKS 


10950 


24851 


A 


11041 


104 


2 


SAFFF*ETESCSVAQAGVQWCDLSSLQL 
PPPGFK 


10951 


24852 


A 


11042 


177 


3 


AYXAYICVCIHIYI,YK*MYICINVHVCK 
I TYLY I YKYLKYKAI YLHI GFFS FIiHT I 
S 


10952 


24853 A 


11043 


314 


395 


TVYILNLS INSVQS VLLSVFHRLRHQPR 
GRVQWLTPVI PALWEAQAGGS P + VRI SQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

l^Lysfn^^Le^ 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, V=possible 
nucleotide insertion 
















10953 


24854 


A 


11044 


310 


403 


RGKGGKKPRNPGGGKS * ALTKKKKKKKK 
KKKKKKKKPGGALKKKPRGGQKKTGGGK 
KKFLPKRGAKKKPPGDFGKKNQIGGGEK 
RGKTPPKKKKP*GKKKNLKGKGGKKTPK 
SGGGKKFPPRW* KKKLPPGG 


10954 


24855 


A 


11046 


181 


3 


EGPLSLSLIiFFFF*RQISSLLLRIiKYSG 
TI IAHCSFKFPSSRNPPTSASQAASTTG 
VHH 














KESRSLSQGGREKGPFKFLAPPAPKFKR 
IFRPHPLKKIiGPKGPPPSPS* ILSLKKK 
GGPPILiARWFWNS CPQGFPPPGPPKRLG 
FKGGPPPPPLF*KKKPPFLiGGENQKVKN 
FFFFF 


10956 


24857 


A 


11048 


147 


307 


TVI FVFIiLRKGLTLLPRtiESGMIMAHLK 
S*PPGLKQSSCHSLLSSWDYRRRRG 














HGKIPSPLQ*HK* IAGYGSACLQSQLIjR 
RIiRWESRSISI 














RKIRRGGLHLWSNLIjGRLRWEDIjLGRIjR ' 
WEDRLS PRRGGCSE PRLCHCP PAWATE * 
DPVSKKKKKSLFVEKPQGGG 














FCPPVGRAGTHFGLIiEPFPSRV* KQYYW 
IjGTVAHVCNPSTLGGHVGES*GQEFKIS 
LiANIVK 


10960 


24861 


A 


11052 


140 


337 


Wl IMFFFFFFETVSFLLL.GLEGKGTI wv 
N*NLCI.PG*GDSPLGCS 


10961 


24862 


A 


11053 


197 


428 


IRSINNPTVASQYSSEWKSHIPLILNQN 
XiDMIKFSEEGMXjKAKIGLIjRQTVGQWN 
AKEKFLKKNQSATPLS I * IRK 


10962 


24863 




1 1055 


180 


1 


AI IALLYSRLGDYARRFHLRRKKEQWQG 
MVTHVYNPSTLGGQGGRTA* TQEFKS SL 


10963 


24864 


A 


11056 


158 


426 


FFPLPLPLPTVSLFPRSPSDAEPKLDCT 
AAISAHCN1jPA*FSCIjSIjPSACNCRRAP 
PRLTAS AS RGAG I ADGVS FTQCS MVPRL 
ECSGV 


10964 


24865 


A 


11057 


220 


421 


YALHLNMKNNSYF * MRKKKKKKKKKKKK 
KKKKKKKKQNKKDPGGAVYKK3 P 


10965 


24866 


A 


11058 


162 


464 


SCSVGLKLFSMKTSLALSYYCLLLLLTL 
HYYYYYYYYY* SLVFM 


10966 


24867 


a" " 


11059 




134 


APENRVDPRVRKTLVPL I LP I ITTLANP 
CKKD* YPYYVKIS IAC 


10967 












ecPT »DnT T PPT pngnvr ★r-r-KTC 1 T — 
* * CHCS PAWVTEQDPVS KNIHT YIHSQS 
INYGSMIGYIHEQKGI IADHKPI IAEPE 
DSTYPRDHT 


10968 


24869 


A 


11061 


298 


378 


S CS ASP CCSS RHGWS PCG * QT CLLLAI A 
IiRCIjPWLREAGALHTGPGCRLFDHAMLQ 
AHKAHQXiVIDTYQEIGENYIPQDQKNSF 
LLESHTSFCFSDSITTPSNMEETQQKSN 
LKLLRI SLLLI ETWLEPVRFLTRI VANN 


10969 


24870 


A 


11062 


39 


486 


RPTRPDAYHIVKPSP* PLTGALSALLMT 
SGLGMRFHFHSITLIjILGIiLTKTLTIYQ 
♦WRDVTRESTYQGHHTPPGQKGLQYRIM 
IiFITSEVFFFAGFF*AFYHSSLAPTPQL 
GGHWPPTGITPLNPLEVPLLNTFGILAI 
GGS I T*AHH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutnmic Acid, F=Phenylalanine, 
G=GJycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10970 


24871 


A 


11063 


269 


429 


ATWQKLPLQIQKNYPGLGLVAHACNPST 
LGGQGGRIT*GQEFIiNQPANMVNTH 


10971 


24872 


A 


11064 


248 


65 


I IEGQAQ WLMPVI PALWKAEAGGSPEVG 
SSKPA* PCDPPAS ASQSAG I TGVSHRTQ 


10972 


24873 


A 


11065 


176 


1 


LFPPPKKVPPWGNTQFKSICIiYTTNFFP 
FETESRSVARLECRGTISPHCKF*LPGS 
RE 


10973 


24874 


A 


11066 


2 


397 


MVLWTAHLRAMAPGSRTWGLLTFALLCL 
P WLQEAGAVQTVP I S RLCDHAMLQAHRA 
HQLAIDTYQEFKETYIPKDHKYSLLNDS 
QTS FCDSDS IPTPYNMEETQQKTNLEVL 
RMSLLEIDS*LRPAQSNKR 


10974 


24875 


A 


11067 


172 


399 


QILYL * KYAYTVTMQHL.SHP I L * KKKKK 
KKKKKKKKKKKKKKKKKKKKTGG 


10975 


24876 


A 


11068 


192 


3 


PKLTPMLKLIVEHIILLPI,T*LSKKHII 
* I NTTTHS I I I S I I P LLFFNQ I TSNL FS 
CSPTFC 




24877 




11069 


204 


345 


DKSGEHSKTSS IQKNIiRLGAVTHTCNPR 
TLGGQGRWTT* SQEFQTSP 














HIiJjPQPIjRVIjWTAHXjGATAAGSRTSIjIjL 
DFALIjCL>PWVQEDGAGQTVP*SRLFDHA 
mlqahr s hrlg i dt yqewe t y i pedrk 
LSFPDDCHTYFCF*HSIPTPSHLGETLL 
TSNLELIjRICLVIjIDSWLEPARIIjTS 


10978 


24879 


A 


11071 


108 


2 


PSPFFFETESCSVT*AGVLWRDIjCSIiQA 
PLPGSSC 






A 








KVK* LRQENQWNS rgrdcs e prsrhgtl 
PWTTEQDS I PKNKTKP YKI S 


10980 


24881 


A 


11073 


139 


3 


KLAGCGRAHLS FQLLRRLRRE DHLS PGG 
* GYS E P WSRYCTPAS RA 


10981 


24882 


A 


11074 


17 


421 


DHVIGQPRWLiMPVILALWEAEPGGIjLEP 
kvqdqpgqnckslslltikkkrrenfpg 
RGGTQLCSQLLRRLRVGCCIDPKLHNCL 
PAWMTERDPTFKRE* KKNDVLEAKNFFT 
ILTFMNTTKEKMVAQRIGSLFHG " 


10982 


24883 


A 


11075 


412 


1 


QFFFLGVCCPTQI FGVGKKGEKKKKKRG 
FPPRVFFFFNPQIFSPFFFPGPFFFLGK 
FPPFFGERFIiFFSKSKFFFPGVFKRGFP 
FPPKNFFFF*RI F* KFWFSPPKPFFFFF 
FFF*DRASLCHPGWSTVAQS*LTSC 


10983 


24884 


A 


11076 


279 


410 


VFCCVAGRDGAPGVKAHFPAAQKPWDGG 
FPGP*SAGVGPPPIPR 


10984 


24885 


A 


11077 


12 


386 


I AHLLLFS FYHKDTGTXiYLLFGAGAGVL 
STALSLLIRAELGQPGILVGNDHIYNVI 
VTAHAFGI I FFI I I P I I IGCYGN*LATL 
I I GAPDMAFPR INNI SFGLLPTSLLLLL 
AYAIVDAGARTGW 


10985 


24886 


A 


11078 


412 


3 


IiAILSLSLSLGFPYSLRYKNIKIRPINN 
PTMAS KCSS KRKTRTSFTLNQNIjEMIjKIj 
IEKSTAKAKRLiKVRPIjVPVSQWNAEEK 
FLKEIKSAIPLNIQMIR*QTSLIADMEK 
VSVL * I KDPTSYNI PLSQS * IHS K 


10986 


24887 


A 


11079 


78 


443 


RHRLiPSDTCZiTAAVP*PCKGSKKKKKKK 
KKKKKKKKKKKKKKKKKKKGGGGFKKNS 
GGGQNPAGGKKI FFFFFGGVFKNPLW I F 
WKKTFFWGGKISGQLSQKKLSLWGEKKI 
FWGCGGETS F 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=JLysine, L=Leucine, M=]Vletbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possiole 
nucleotide deletion, \=possibie 
nucleotide insertion 


10987 


24888 


A 


11080 


395 


2 


KR VP KKKPKKNS PPGGRKGGS PP * S P P F 
GGPDQPIF*VGGFDPPWIiTPENPFFFKK 
KNN* PGKGGMPLILKSLGGWGKKKI FTP 
GGGLPKN*KETPPPPPGGKKKI PFPKKK 
KKKKKRAAARDLELVPTRP 


10988 


24889 


A 


11081 


24 


182 


TVS TTKCFSLL I P YYGRKDME * KT * I IQ 














PP 


10989 


24890 


A 


11082 


173 


3 


NNVI CY I IjY IPS VP KYKTTLNMRGS AFF 
FFFFFI F*DGVSLCHLGWSAVARSWLTA 


10990 


24891 


"A 


11083 


284 


391 


SVAHACNPGTLGG*GEQITRGWEFETSli 
TNMEKARL 


10991 


24892 


A 


11084 


257 


3 


IKKIiREVNMLGWLYYMREEARPVDYVHW 
RGPE YS SLTKT IRNVIVKGHQHY * KAS A 
GRGGSRL*SQHFGRPRRADYLKSGVRDQ 


10992 


24893 


A 


11085 


42 


171 


ARRMRAGIHRVSQEGLDLLTS * S AHLGL 
PKCWEDRREPLRLA 


10993 


24894 


A 


11086 


176 


325 


FKRS KKKKKKKKKKKKKKKKKKKRGGAL 
IiKKF* FKRSKKKKKKKKKKKKKKKKKKK 
RGGALLKKFLGGPQISGGGKRFFFFF 


10994 


24895 


A 


11087 


67 


379 


KKKKKRKKPKTKKRTGPVRQNVRWGRSL. 
SPGRAPAEGPQGGSTESPPLLVGRCLIL 
GLREPGVPPAAYGRFLRDYMWSIQGKLE 
KQRP Q * GLPS LTGAAEL I KH 


10995 


24896 


A 


11088 


128 


326 


KKKSGGGVFFFIIGGGGHFFGGGRFFFF 
FFLGGFFFKLFFFF*EKFFFLGGNLVKH 
SSKKSSWVGW 


10996 


24897 


A 


11089 


109 


2 


KTHSRPGTVAHTCNLSTLGGQGE W I T * G 
QEFETSL 


10997 


24898 


A 


11090 


52 


347 


DHSLLELQMKLYASAQAAITKCHRMGGL 
NNTHFLFIiFLSIiFFFFLKQSLALPPRLK 
GRGP I1>VN* KLPLRGKRDS PAS PPQCVG 
ITGLAHPPRLILFF 


10998 


24899 


A 


11091 


386 


48 


IHLQPPLQICTLKFFTNKNPHLFLTGCP 
KYALIFLCFGALVKTGFHHVGQAGLQLV 
PLCPDRLGLP KCSD * RRE P PHPAKKQFF 
IDRVMLCYPGWCPAIiGINPSS PVGLPKR 


10999 


24900 


A 


11092 


164 


3 


VYSLLL,RCPFS*AIjTAKFINRCTYIHVC 
MYMCEPYAYTCLYTSTHIHVCMYIY 


11000 


24901 


A 


11093 


148 


325 


CLKLQPFGLGWAHACNPSTYFISYQ*L 
SPVIPAIiWEAKTGGSPEVRSSRPARP 


11001 


24902 


A 


11094 


162 


3 


LLGLKRP I FFFWGGKFFFFFFFFFE I ES 
PSVAQAGVQW* CNL.GSLQPASGRV 


11002 


24903 


A 


11095 


259 


358 


LRPGTVADTCN*GQARWIjIPVIPAIiWEA 
KAGGSLEVGS LRLAWP 


11003 


24904 


A 


11096 


254 


2 


SWGFI PMEERERENRYWDRQQGHAI VSN 
IVINAFLYRYTEREKPGVSLSPRVECSG 
MIVAHCNLSLVGSSHSPAAAS * VAGTTG 


11004 


24905 


A 


11097 


132 


3 


KKKKRQPRWAHACNPSTIiGGCGGWIT * 
GQKFRTSLVNMVKP 


1 1005 


24906 


A 


11098 


2 


455 


LNLEQLIEPLEVLWTAHLEALDPSSRTF 

vllafall* lpwlqeagavqtvpiisriif 
dhamlqahrahqlaidtyqefeetyip* 
dqkys flhdsqtyf cfsds i ttpysmee 

TQQKSNliELLRIYIiLLIESWLEPVQFLM 
SMFGNNLEYDT 


11006 


24907 


A 


11099 ■ 


2 


375 


QLPEPLMDLWTGDLVAMAPGSRTSIXIA 
FVLLWLP*LQEAGAGQTVPVSRIiFDHAM 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q— Glutamine, R = Arginine, S— Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 














I/3AHRAHHLAIDTYQEFEETYIPKDQKY 
SLLHDSQTSF* FSD YI PTPCSMEETQQK 
SNVELLRICLLV 


11007 


24908 


A 


11100 


1 


359 


LQLPKPLTVLGTAHLPAMCLAYATALIL 
GCALIiCLiA*VE * ARAAQTVPQSNF*DHA 
MLQAHRAHQLAIDTYHE FEETY I PQDQK 
YSFLHDYQTS FCLS YSIATPSNMEETQQ 
KSNLELL. 


11008 


24909 


A 


11101 


1 


370 


LPEPLTDLWTAHLVPMAPGS RTSLLLAF 
ALLCLPWLQQAGADQTVP * SWLFDRAML 
QAHRAHQLAIRT YQEFEET YI P KDQKDS 
FLHDSQTSFCF*DSIATPSNMGETPQKS 

WT.KT.T.PTRT.T.T. 


11009 


24910 


A 


11102 


2 


361 


AHLLAMAPGYRTYIiLLAFALLCL.PWLQE 
AGAVQTVP*SRIiFDHAMLQAHRAHQLAI 
DTYQEFEETYI PKDQKYSFLHDSQTSFC 
FSDSITTPCNMEETQQKSNL*LIRICLL 
LNESWMEP 


11010 


24911 


A 


11103 


8 


366 


PLRALWTAHLAARAPGSRTS LLLAIALL 
CLP WLQEAGA* QTVP I SRIiLDHAMLQAH 
RAHQL P I DTYHE FE KTY I P KDHKHS FLH 

LLRIFLL 


11011 


24912 


A 


11104 


3 


341 


EPLRDLRTAHLVAMAPGSRTYMLIAFAL 
IiCLPWL * EAGAAQTAPLFRL FDHAMLQA 

HDSQTSFCFSDS I S TP CNMEETHQRSNL 


11012 


24913 


A 


11105 


240 


329 


RNWPGAEAHTYYPS TLGGRGRQ IT* GRG 
FE 


11013 


24914 


A 


11106 


2 


316 


HEERERERERERESAIVRQIHTQGEGLT 
ELVFPYRRPAPSGLKFTFVYVQEHILGV 
GGNGIj S PLQ I G VI LRDGRGLAH I RCGTR 
NKILR I LiKSQGRAPDLS *DLY 


11014 


24915 


A 


11107 


169 


1 


EDGHHSGLHSKQQKQPAWPSMVTHTCNP 
NTLGGRGGWIT*GQEFETSLTNMEKPCL 


1 1015 


24916 


A 








GMSLTGHSNYIIIiNIjNFLiWPCWAHTCN 
PMTLGGQGGRFT*GQEFENNIiAYRGK 


11016 


24917 


A 


11109 


145 


383 


EVNKDKGKYHVSQG IQSFS YVAYVYFRK 
L I YTMMT I VNS ALL YTELV I CQGRRVLV 
FDFGF* FWFFETESCS VIQAGEQ* CDLG 


11017 


24918 


A 


11110 


199 


400 


TRAGFFP P FKKFLKGPGPGAPP FNLNFL 
GGRGGGFF* IPKKKIYPtiLLPFIjKVQKT 
PSKPPFFFFFFFLRQVLLCCPGWSAMAR 
SRLTRA 


11018 


24919 


A 








QFETSLGTMSGPS FYLKN* KSARCGHTC 
MRFQLLQRQSREDHLSSGVHRCSNP*LH 
RCTTTWVTK 


11019 


24920 


A 


11112 


204 


399 


NHFWDWYHFSSPFFFFFFEKHLCSVPR 
AEDQGMNXiS *LNPLPPGLKQFFA* PSQE 
AGIIGPGHH 


11020 


24921 


A 


11113 


192 


16 


KKTENNKR* RGYGETETPAHCGRACKLG 
* PLWKP V* Q FLKKLKMKLPYDPTI PLPA 
IFS 


11021 


24922 


A 


11114 


113 


369 


AGGQGPNLFLQKPGPPGLKGPPPLPliPG 
GGNYGGGPPSRGIF*FFEKKGFGGVPKV 
GPNLRG* RDLPAPPS KGAHPLGNGFFSR 
N 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
Od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

Amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11022 


24923 


A 


11115 


250 


387 


ILCDVEHSLWPGVEVHAYNPSTLGGQDG 
RlA*AQEFETSLNNIARL 


11023 


24924 


A 


11116 


192 


1 


VOTLFWKPSPHALTRRRTSVFHL.KR* FS 
WLGAVAHTCNPS T3jGG*GFWR I T*AQEF 
ETSLGNMV 


11024 


24925 


A 


11117 


289 


202 


VFYCQ* KFIRKKKKKKKKKKKKKKKKKK 
KKLSKKI KKMDQQKLRSPTI K 


11025 


24926 


A 


11118 


412 


421 


RA*PTEQVTTiGITEQSYSRGHINNRVHD 
LDVGS * HPDGAAAI KGS FVQRLKG Y 


11026 


24927 


A 


11119 


16 


472 


DRLKPSATHAFADAYHIGKANP*PLTGA 
LLTLLMTSGLTM*VHLHSITIjIiIIjGLI.T 
NTLTIYQRWREVTRESTYQGQHTPPVQK 
GLRYGIUjFITSEALFFAGFF+AFYHSS 
LAPTPQLGGHWPPTGITPLNPLEDPIiLN 
TSVLLASRVSIT 


11027 


24928 


A 


11120 


304 


346 


GIiWHATG ITP*PLTGSF*LLLMTSS LAM 
* FHYHS I TLL I LSLLSSTLT I YH* WRE V 
TRES TYQGHHT P P VQKGLRYG 1 1 LF I T S 
EAVFLARFC^AFYHSSIAPTPQL+GLWH 
ATGITPLNPL 


11028 


24929 


A 


11121 


174 


420 


FISALLKSAIQRNNAMFSTYYALTCTVG 
TAVSWSSVS* PQRNMMMS * VHAPAMAGR 
LVCHYIRVYRRFEHVCKRAVHGQYP 


11029 


24930 


A 


11122 


299 


466 


I FI LHE I C ITFFFETRHFCH * AEVH WCS 
LGLLKP P P PGLRCD P STS PS RVAGTRP S 


11030 


24931 


A 


11123 


138 


1 


NPFNEKNPPPKGPGPKNAFFFFFFEMES 
CSVAQARVQ*HDLGSLQP 


11031 


24932 


A 


11124 


241 


499 


NQRERS * HLYLL YNTVCLTHGL.PIL 1 1 Y 
SAYQIAIYDLLLRRDAIiARLQADLADRR 
RGLQPKKRFAFKTRRKDAASSTKVDAAP 
GI 


11032 


24933 


A 


11125 


252 


391 


PLSLDLQSNASLSHFTSPPLMFADR*LF 
STNHKNI * TLYLLFGAWA 


11033 


24934 


A 


11126 


45 


459 


AFLKNHPLET * SNHS FIDLPTLSNIS A* 
GNFG*LLGACLILQITTGLFIiAMHYSPD 
ASTAFSS I AHITRDVNYG* I IRYLHANG 
ASIFFICLFLHSGRGLSYGAFL.YSQT*N 
I GI ILLIATI ATAF IGYVLP * GQ IS F 


11034 


24935 


A 


11127 


109 


301 


QDSLMRKKSKNRSPSHER*RSKCLERKR 
SGDKKKKKGGPPKKKIGGFPFGGQMELK 
PFSILGGF 


11035 


24936 


A 


11128 


322 


2 


SCHHVSSLGDRARPCiiQKEKKKERKKVG 
DFNTPLS I MDG YQH WYQARKI I IMDGTT 
K* KITRK* KIRNQ * DLTDVYRTLHPRVK 
YTFLSCAYGTFSRTHRIRGQAI 


11036 


24937 


A 


11129 


182 


2 


FIjPRGFQPPPVWPGPLGFIjFP PPSSRD 
PGFPSMPFFFFFFFETESCSVS *AGEQW 
RNLG 


11037 


24938 


A 


11130 


138 


1 


GWVFFLWFSFSFIjFFFLR*GLSLSPTL 
ECSGAITAHCSLKLPGLS 


11038 


24939 


A 


11131 


304 


83 


WKKTDFPRKGRNSASRLPSDSNCNSSLS 
LQGLSLLYRFWTYHAVTGKGSQSRPQEQ 
VLGYHARKNSR*VHRVK 


11039 


24940 


A 


11132 


247 


225 


KNFWGGPPLRGGKKNFFFFFGGGYKK*!, 
LLCLVNMYI * CV+ YI*HKRIISDP KKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 














K 


11040 


24941 


A 


11133 


242 


3 


KKKKI FFPPRYFWAPPYFFFLGPP PFFF 
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JSDOCID: <WO 016483SA2_L> 
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PCT/USO 1/0492 7 



SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 

ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
£=Glutamic Acid, F=Phenylalnnine, 

K=Lysine, L=Leucine, M=Metbtonine, 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 

nucleotide insertion 














FFFFFFFFFFFFFFFFFFFVF*TQFHLS 
PRL ECSG TI SAHYNLHLPGPSDC 


11041 


24942 


A 


11134 


361 


2 


ASQSAGITGVSHRTRPTHGFKDT*RKPA ' 
EPEVTIH* I * ITLMSCNIESPEKVCADL 
IRGAKEKNLKVKRPAGTPIKASRITTRK 
TPCEGSNTRDRFQMRIHKWLTDLHSPFD 
GRVGGRV 


11042 


24943 


"A 


11135 


1 


370 


I RQT FFQKRE IVGII LCY WNVG Y YSQLH 
*SSRMLWIFHFYRVYISVKVILVTLVPH 
EKKKKKKKKKKKKKKKKKKD 


11043 


24944 


A 


11136 


273 


2 


GGRAGQQILFCRGIWLGPPRSCQCTFRS 
PPGLAL* PWECFSLPPALKIRFPSFLPF 
FFEMEFSRLSPRLECNGVTVAHCNLRLP 
GSSNFP 


11044 


24945 


A 


11137 


130 


354 


GQLNKIiSGPYPENVGYTLPVLINPLAQP 
VIYSTIFAGTLITALSSH*FFT*-VGLEI 


1 1045 


24946 


A 


11138 


366 


2 


LiVEMGFLRVRQNGLiYLLTS * SARLiGLSK 
CWDYRREPPCPASDWVFILTSPLIHALD 
GKEHTHTHTHTHTHTGLGICQSSLGKQS 
GGWGWLSANRGQ FS PFAVCL WS FLPEV 
PWTSALFT 














CRAGRGL PAAVL 1 I LFPPLL I PTS K YL X 
NNRLITTQQ* LI KLTSKQM I AI PNTKGR 
T* SLILVSLI I FKATTNLLG 


1 1047 


24948 


A 








HI PTPPHTTCPERPS IRHNS I YYLKRYC 
LLR 1 1 LSLLFLQTS P* PLTGALS LLLMT 
SGLSM*SHLHSITLLALRLLTYTLTIYQ 
* WRDVSRHSTYQRHHTPPVQKGLRYGI I 
VFTTSKDI VFSGLF *AFYHSRLAPTTQL 
* CHWPPSGITPLN 


J 1048 


24949 


A 


11141 


188 


3 


IPTPGGPPPLPPQKVLFKPGNPRAPPGV 
FFFFFETDSRSWRLECSGVI *AH*NLiH 
FRGSR 


11049 


24950 


A 








S VS S P CI SNTHKVDVRS SALLQES TS SR 
E VMTGHFHYL I IYET*LRSLAQSARLAC 
SGT I SAHCNLRLPGSSN 


11050 


24951 


A 


11143 


262 


371 


SYCGPGWAHAYNPSTLGGRGRRIT*GQ 


11051 


24952 


A 


11144 


244 


379 


TIGKQRCHAGIEEVSDSSQAPWLKPVIP 
AL* EAEAGGSLEPRSLR 














FEAESHS VAQTGLQ* RHLGSLQLP P PQV 


11053 












GQGAGEAFVEVWEKELRGFKVEGEGDMD 
KERWD+GL*GGRERERERERERERERE 
RERERERESLV 


1 1054 




A 








LRQNDHLTPGV* KCLEL* LHH * TPTWTT 
KPTAKAIFFNLKKKDSWD 


11055 


24956 


A 


11148 


230 


358 


VSVSLIIFIATTNLLGLLPYSFTPTTQL 
S INLSMAI PL* AGAV 


11056 


24957 


A 


11149 


81 


367 


QTLRTKMNKNWASLIDPTILGLPAAEL 
IIIiFPPLLIPSCKDVINNRLMTTQQ*LI 
ELTWEQMI TIHNTKGRT* CLI LECLIMV 
IATTKLLGLLP 


"11057 


24958 


A 


11150 


257 


2 


SVCAVI TKYLRLGCFADAAPTQRPLPPA 
TVKLTM FFN I ATHGE PLGCIF * LFVDKV 



1220 



^SDOCID. <WO 016483SA2_I_> 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methiontne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














PKTAENFHALSTGEK* FSYKGSCFHRIL 
P 


11058 


24959 


A 


11151 


257 


1 


WRCGEKETLIRCWWECKLVQSXiGRTVWR 
FVRELKLEI»PFNPAIPI.LGIHPKEKK*L 
DPCTCMFIVALVT IAKMWNQPKCPSMDL. 
V 


11059 


24960 


A 


11152 


17 


367 


NHKDGKKTEEKKQS TS P PQKDC S S S PAM 
EQSWTENDFDELREEGFRRSNYELQEEI 
QTKGKEVKNFEKN*DECITRITNTENCIi 
KELMELKAKARELCEECRSLSSRCDQLE 
ERVSA 


11060 


24961 


A 


11153 


213 


2 


TFCHFFFFFETESGSVAQPGVRWCDLGS 
LQAPLPGFTPFSWRQRLQ*AEIVPLHSR 
IiKLHFKKKAGVFS C 


11061 


24962 


A 


11154 


1 


347 


GTSNHFPFIiYTYQVHNSTDYPRNRCCLN 
PSLRFHTSSKPLPARPHIMTHQSHAYHI 
VKPSP* PLTGAUALLMTSGLAM* FHFH 
SITIiLILCLLTNTLSIYQ*WRDVTREST 
YQG 


11062 


24963 


A 


11155 


223 


3 


YSTNLKILRNHFII>FF*DRVLLCLPGWS 
AVASSRLTANSASCLSLPSSWDYRRAPP 
HPANFCTDRFHHVGQVG 


11063 


24964 


A 


11156 


143 


1 


RGFLNQCPSQNFPFFFFETESCSVTQAG 
MKWYDLS * MQPPP PRFKLV 


11064 


24965 


A 


11157 


149 


355 


LQPLPPGFKRFSCLRLLTSWDYRKDVDS 
ALL£NYVT*KYISKTLCRHQSPEIVREF 
LTAMKS HKLTKVG 


11065 


24966 


A 


11158 


156 


370 


RSDGCQKIPRNMIjGGWSVLHHTVDEQ*G 
PNRYIKKCSASLI IREMHI INTVRRCLT 
PATMATVRRKTTSVG 


11066 


24967 


A 


11159 


298 


4 


1 HFQWIMWRQLVKGRLQAQKNAAFTDSH 
THLCYLKCVYFCVCVCMCI CVFVCV* SR 
AL**SLYLFLCIKAVKAGHSVSGLESQH 
YGRLRQ ADHLRS G V 


11067 


24968 


A 


11160 


364 


285 


NSWGKIKILKGK*PPPPKKIFFPKYPQS 
FFLS PP* KKKTNFPHPRKYLS P PGI FLK 
APPPLIFFFFFFFFFFFFFFFFFWG 


11068 


24969 


A 


11161 


198 


406 


KS VYMTLKKKRGS FHERGDVPAI CNFFF 
* IHIYMCVYIHICVYI YIYTHI YI YTHT 
QIYIYIYIYIYIY 


11069 


24970 


A 


11162 


95 


376 


KI ITKHNIARTNPYTFCIMN * LE I TLQG 
EPKLRPPKPDKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKRGGGGFKKKPGGGP 
KIRGGEKI FFFFLGGD 


11070 


24971 


A 


11163 


343 


391 


WR*LSSTNHNDILTLYLLFRARAGVLRS 
AVSLL I LAEQGQPGNLLGDDHI YSAI DP 
AHAFAIMFFI VIPMI IGGFDN* LIAVRN 
GAPEMAI PHINNISC* RLPPSLLLLLAS 
AIVEAGAGTG*TAYPPIjE 


11071 


24972 


A 


11164 


1 


158 


QPMTPNRGPL.SP* PLTGALSALLKTSGL 
AM*FHFHSITLLILG1jLTNTLTIYQ*WR 

dvtres 


11072 


24973 


A 


11165 


265 


466 


KIoLTSIiS FFNLLRETNQVGLLEIiSQAKY 
KKTPRLGTVAHVCNPSTLGGQGGWIT*A 
QEFETS1ANMA 


11073 


24974 


A 


11166 


204 


458 


KRCIiLPIiP PDSNTQS S QCRREQ I GRAIR 
KILTFETQNIKRLKTEKQLLKLLE*SGK 
ITAHCNLDLPGSSS PPTSASQS AGITAG 
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NSDOCID. <WO 0164835A2J_» 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine T S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














s. 


11074 


24975 


A 


11167 


256 


3 


KILFFKPGVLSPPWGGVNFFQLMPFQGG 
PPKKKRGPLSPPKKGPFFFFFFFF*ESR 
SLLPRLECSGAI STHCNLCRLGSSRTRG 


11075 


24976 


A 


11168 


1 


457 


HRTPS VRTPNCRGDPRVRS TNHKD IGT1> 
YIiFGA*AGVLSTAI>TLLIRAELGQPGN 
LLGNDHI YNVIGTAHAFVI I FFr VIP I I 
IGGFGW* LVPLI IGAPDMAFPRINKI SC 
*LLPPSLLLLRASAIVEAGAGTG*TVyP 
PLAGNYSHPGAS 


11076 


24977 


A 


11169 


343 


428 


GR* LFSTNHKD IGLLYLLFGA* AGGLS T 
AIiSLL IRAELGQPGNLLiGNDH I YNVIGT 
GHAFVIIFFIEIPIIIGGFGN*IiVPLII 
GAPDMAFPRINNISF*LLPTSL1>LLLAS 
GIGEAGAGTG*TVYPPLAGNYSHPGAYV 
DLT 


11077 


24978 


A 


11170 


91 


2 


PRQDPSFFFFFF*TESGSVAQAGVQWCD 
Xi 


11078 


24979 


A 


11171 


219 


1 


IFKSKH.CLPPPEPNFFFFFFFFFFFFF 
FFMRHGGACCSPSYI,GG*GERMT*ASRV 
AGTTGVCHYAWLI FVFL 


11079 


24980 


A 


11172 


292 


365 


RI *SLGQAQWLTPVI PTLWEAKVG 


11080 


24981 


A 


11173 


251 


1 


ICDLQILTHGITTCVKMGLTTLFVSGIT 
FLIHHSLAI KFGV* KNPS*LGT WHTCK 
LSS*GGRGGRIA*AREFETCLGNIVRP 


11081 


24982 


A 


11174 


200 


407 


KIFETDVYVKCSNILCLWASISFSNYFG " 
GPFYIJUi* * FLSYFSFLETESHIVAPLE 
CSGMILAHCNLCIi 


11082 


24983 


A 


11175 


11 


155 


STNFFIiFSETGSGSVLQAGVQWPGLDLL. 
ASSHS PASAS * VSGTTVDAA 


11083 


24984 


A 


11176 


140 


1 


PSTFPDVVAMRRVSCLSKGDKGQV*WLT " 
PVI PAI WEAKAGGS PES I 


11084 


24985 


A 


11177 


210 


1 


TDEELLLMDEQRK*FIEMESTPDEDAVN " 
rVEMPMKDLEYYINWD*WAGLKRTDC 
S FERS STVGKMLSN 


11085 


24986 


A 


11178 


133 


148 


STSTR*IiFSTSRKDIGAIiHLLFGA*AGV ' 
IiGTAVSLLIRAWIiAQLGSNKEA 


11086 


24987 


A 


11179 


236 


1 


KFWKKDPLTKGGCPLKKKGGGGI FSKKK 
KNVFFPKKKFKGGKKKNFFFFKKKKKKK 
KKKKKKNKE * LF I S FAH I AVHL 


11087 


24988 


A 


11180 


210 


1 


QSUGQENLQVFFPLFGP* PNYLDKGEKW 
XRVGKIWGHLFFFFETESRSVAQSPRXiE 
CSGAIAAHCNLCLP 


11088 


24989 


A 


11181 


292 


348 


KKGALFFS PGG+ KKNFP PGGKKKGPL.FS 
KKKKNPP * KKPGEKEKTRFP PFLGGLGG 
GFPLAPGGKPPRNRVIAPPPNPGGKKKK 
KTPFSQKKKKKKKKP 


11089 


24990 


A 


11182 


168 


3 


KPEASMLRMDFFFF* EAGSHS VAHAGVQ 
WYYCSSLQPPTPELSDPLAPSSKVGGIT 


11090 


24991 


A 


11183 


87 


351 


KGIKWSLNIQRGVQPHLRSYNYSFFSFF 
S *DWVLDRIiECSGTTTAHRSLNliPGSSD 
P PTS AS QAAG TTGMHCHTRL IF* ETGFH 
HAVQ 


11091 


24992 


A 


11184 


55 


344 


TSLNLCFFFFFWGEKSPFLPPGGDEGPL 
FGQUDTRPWGIEKI FFAPPP *KGKRGVP 
PGGEEFKNPPAPKGE P PI>S * KK* NQ PGQ 
GAPPCYPP FLGGGAKKI FS I PQGRVSN * 
PKRGPSSP PGGKKGDFS PQKKKKKQRFR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F— Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine. V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EV**PNSPSGAP 


11092 


24993 


A 


11185 


344 


190 


LVETRFHRVSQDGLNHLLTL + S TRLGLP 
KCWDYRCEPPRPANTANSLNFPM 


11093 


24994 


A 


11186 


135 


3 


IFPKVFGGPFFFFFFF*ETESRSVAQVG 
VQWCNLGSLQAPPPGFT 


11094 


24995 


A 


11187 


305 


14 


IMLITSI RRS RLS STAEKG I LSRLR I CG 
RVAK YL I NNRL I TTRE * L I KLTS KQM I T 
ILNTKGRT * S L I LVSLSWYGYHLF * G * 
FD*SLIiIiY 


11095 


24996 


A 


11188 


198 


1 


LEQTFNMMFPPNKS I I S I PS FGFLFKI S 
FPFLLFFF*DEVFLLiLSRLECNGTILAH 
CNLCLPGSSD 


11096 


24997 


A 


11189 


352 


3 


QLCNTPLPKTNHAMLTS S NANTLI YRRS 
AIAAAS SPPI LFPASKI LLNSRLITPHQ 
*LIKIASKQMIAI PNTKGRT* SLILVSL 
IICFEVSLISHCWWISRLLMRYLEVGI 
NRGG 


11097 


24998 


A • 


11190 


270 


137 


CIINRLYVCVYIYI YIYIYI * IYFYMYV 
YNILCILGHRRESGVI 


11098 


24999 


A 


11191 


18 


342 


ACRKRKMSSKFALRATGNCYYSCTFCS I 
SFPHPK*MPCYQKKKKKKKKKKKKKKKK 
KITV* K* S KTKKNI KGLGAR * ELSLA* L 
IiGEPPIFAKGHPLMLKWVEICKRP 


11099 


25000 


A 


11192 


169 


3 


ERSGII RVYQD *NQCLAWS LA * KI IHPW 
LAAVAHT CNPS PS GGRHG W I T * GQE FK 


11100 


25001 


A 


11193 


407 


I 


KRACFRWLTLL I STL WEAKQENCLNAGV 
HDQPGQHGETPS PQKI KKSAWHGGPRL * 
EVRWED*LSQGG * GYSEPCSHHCTPAWA 
TERDP V* KQKRS I P WKELASERKDSGRC 
RWLVREMWPSCQASVIQVRRISF 


11101 


25002 


A 


11194 


110 


412 


VCGFRL.LHS PAI LMPLLMPS YFRRHSNT 
EIKP ISNPTVAS KRASERKS CTSLTLNQ 
KLEMIKIiSEEAT* KAEVGQKLGLLHQKV 
SQWNAKEEFIiKE I ESA 


11102 


25003 


A 


11195 


34 


155 


AWHEGMHHVSQDGIiNLLTL*DACLGLPK 
SWDYRRKPPCPA 


1 1 103 


25004 


A 


11196 


289 


453 


NQPGKS FMI S S T FHFNWE FTLVHAGV * W 
LDLGLLQPPPPRFKGFSCLHLPSGWDR 


11104 


25005 


A 


11197 


177 


432 


KWE KQAQTLGGGNFP KG P KTR VGKNFE R 
LKGNPPFLCL* KRGRKGDTGPWKKPAPS 
GERGIPGKKGGKTGPPGFKGPMEKIKRG 
K 


11105 


25006 


A 


11198 


139 


3 


AMVSLSPRLECSSMIMAHCSLGSLGSSD 
P PTSAP * VAR I TGMHHP 


11106 


25007 


A 


11199 


150 


492 


GDPLPGHHPV* EVRRPFARQLPRLRSEE 
PLRP VATPS GK * GAS LSGSHP I QEGGGG 
GQPLPGHLPHPGWRSGVKPPPGQSARPG 
GEGHLLPATPTGK*GGPQSGQPHLSGGR 
RE 


11107 


25008 


A 


11200 


239 


64 


YQKYYYYYFF*ETESRSVARVKSSGLIS 
AHCNIiHI»PGSRLQIjCLFPTSAVFGQESH 
IT 


11108 


25009 


A 


11201 


31 


396 


ILTMREIVHIQAGQCGNQIGAKFWEVIT 
DEHGMDPTGTYKGDSDLQLDRISVYYNE 
ATGGKYVPRAILVDL * PGTMDS VRSGP F 
GQI FRPDWF VFGQSGAGNNWAKCHYTKG 
AELGDSVLDV 


11109 


25010 


A 


11202 


85 


407 


VLLPHSEALEGADTMPHSYPALSAEQKK 



1223 



JSDOCID: <WO 0164S35A2_I_> 



WO 01/64835 



PCT/US01/O4927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E*=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ELSDIALQIVSPGKGILAADECVGSMAK " 
RLNQ I GVENTQENRRL YRQVLFNADDRA 
KKR I GGVI FFHETL* QKDDDGVT 


lino 


25011 


A 


11203 


302 


115 


THLDHVAI Y LS I YLS I YLS I YHLS I YHLi ' 

SIYLCRF*LID*DRVSLYFSGWLQMPGL 

KPSSCF 


inn 


25012 


A 


11204 


130 


348 


GNAYGGPCSVLVQRT* * IPALTFPSTCL 
DS KFQRDLVTLRTDFSNFTSNTVAE I QA 
LTSQGESGLDGALGWGQ 


11112 


25013 


A 


11205 


232 


62 


ISARNEGFNTLN*RTFFFFFFSSHCLSR 
LECGGMILAHCNLQLPGESIiEPGRQSLQ 


11113 


25014 


A 


11206 


161 


2 


KKKGFLLPL IEPFCIQEI QLRLGAVAHT 
YNP S TLGGRGG WIT * GQEFKTS LAN 


11114 


25015 


A 


11207 


251 


3 


PNHLPKAPSPNTTLRIRFQHLNFRGAQQ 
TFGLQHKLTSKQMITIHNTKGRT*SLIL 
VSLIIFIATTNLLGLLPHSFTPTTQL 


11115 


25016 


A 


11208 


145 


287 


GFRLKL* INQKYSFLKKKKKKKKKKKKK 
KKKKKKKKKKKKKICKKKKKKKKKKKKKK 
KKKKK 


11116 


25017 


A 


11209 


436 


110 


ISGRGEQKFKYFSPAPFFRGINPPPLGG 
E*NQNIjKPP*KKIFLGSPKYPKKKGGPQ 
KILKKGPGGGGPPLFPPPLGGQGGGFPF 
SQGFKTPPGKKKKPPPKKKKKKKK 


11117 


25018 


A 


11210 


217 


2 


DKWSVLHINYNILSSSLVHVLFKYLEN* 
K*KKWLWPGMVAHACNTNTLGGQGGRIV 
*AQEFETSMGNTVKC 


11118 


25019 


A 


11211 


146 


3 


QKVSHAQYRCIiSWGYFFF I LFliETRSRS 
IAQAGVQWRDLGSL*PPPA 


11119 


25020 


A 


11212 


187 


384 


GLIC FYLTCF Y FIF YFI LF Y FSL.L.C Y VM 
LCYVLCYVMLCYVMVCYVMLCYVI *DRV 
SFCHPDWSAA 


11120 


25021 


A 


11213 


373 


482 


MI KLRHRLGTVAHT CNPSTLGGRGQW I A 
*GQEYKTS 


11121 


25022 


A 


11214 


64 


362 


ITGVSHFTQPL.PFIGGLAI.S PKL.ECRGM 
I IAYCNFELPGSGDSCHI.SLPS S WNYRY 
MPCMPS * LKKNF* *RQGLTVLSRLEKDY 
L.L,*DFSALRSFPWGR 


11122 


25023 


A 


11215 


181 


368 


CDSAVPLLDIYPTEFKADLKKIRIPMFI 
IALFAYAKR*KQPTYPSIDKWIXKLWAG 
CGGSRL 


11123 


25024 


A 


11216 


268 


369 


TTRPDMWHTYNSSTSGGQGGTIT*AQQ 
FKTSLT 


11124 


25025 


A 


11217 


90 


3 


IGQAQ * LTPI I PALWEPKSGGSPEVRSL 


11125 


25026 


A 


11219 


277 


272 


SLTAPGL*SOLLRRLRWEDHLSIiGQSSF 
SEP * S CPCTPAWVTQ*DS VSKKKEKKKE 
KKRKEKEKVNQTPKATESPGQALIGSHS 
LQVQKRVYAHP I L 


11126 


25027 


A 


11220 


19 


343 


IiEIjEVE PEDVS ELLQS LDKNYLSCFLKR — 
FLEMKYTPGED WNT I KMTTEDLKYNTD 
L.VDKAAAGLERINFNFEGSSSVGKMLSN 
S IAWYS E I FYERQSQFMWQT * LIA 


11127 


25028 


A 


11221 


266 


356 


FFEMESHTVA*AGVQWCDL<3LLQSLPPG 
FK 


11128 


25029 


A 


11222 


105 


2 


RFVCSTIKVLRDLSSARSNPGRFI *TSN 
3RPRKS 


11129 


25030 


A 


11223 


145 


350 


RKGRQLIiD *DLGW V*WLTP 1 1 P I LWEAE 


11130 


25031 


A 


1J225 


155 


348 


PLFFFCFHRNTFY*KSVCRFPKKKKKKK 
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SEQ ID 
NO: of 
nucleotide 
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SEQ ID 
NO: of 
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NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C-Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=l,eucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FCKKKKKKKKKKKKKMEGGGGGANTKFLY 
GENRKP PLFFFF 


11131 


25032 


A 


11226 


271 


3 


NFLSVFNFTRICLGMCFLSIFLLIFGGS 
LKYQLFLAQGFFFFLHYFFTSSIHSSLL 
Y*MLDFLDLFFLYLFFCNFNFSVFLFCI 
LSKFF 


11132 


25033 


A 


11227 


3 


413 


S PAE PHRS YTMWNVKVGVNGFGR I ERLV 
TRAAFNSGKVDIVAINDPFIDLNYMVYM 
FQYDSTHGKFHGT VKAENAKL.VINGNP I 
TIFHERDSCKIKRGDAGAEYW* STGVF 
TTMDNAGAHLHG *ATRGI I SAPSAD 


11133 


25034 


A 


11228 


48 


413 


VFVLVSCGFFFFFFWEKKICFFWPGGKK 
RGKIGGTGTFGPKGKGNFRGP P PKNKEK 
GGPPPPPNNFGFLKKKKFYQGGQGG *KT 
RNFGKPPAPNPPKGGDLGGGPPTQGQNG 
FLKKKKKQKK 


11134 


25035 


A 


11229 


172 


404 


EFENSCSPGWQRRGVMIPGVTVEDMNQQ 
EFIRALVAbLKKSRKLKVPEWVDTTKLA 
KRKELTL * DEKWFYT * AAS TT 


11135 


25036 


A 


11230 


247 


2 


QPKKKSVSKKKKKELPCDPAIPLPGIYP 
KEKKSVYKKDNLHliRVYCRAVFTIAKIW 
NQPKC* SGDK* I KKI * YIYTVDLRR 


11136 


25037 


A 


11231 


1 


372 


TETTVE VAWCELQDRKIiTKSERQRFKEE _ 
AEMLKGLQHPNIVRFYDSWESTVKGKKc' 
IVLVTELMTSGTLKT*VHQYYKS*PRSM 
REFDFLKTFRSTS FTLWQPKKGTVLPTV 
TVLAYFLAHKVL 


11137 


25038 


A 


11232 


311 


464 


KKATPRNLiCKWPGTVAHACNPSTLiEGRD 
GWIT*GQEFETSLTNMVPP 


11138 


25039 


A 


11233 


64 


418 


VPVASRRRRRCGRVGGGKAMADLDKLNI 
DSI IQRLLEA*GAKPGKKVQLHENEITG 
LCLEPRE IFLRQP ILL* LVAPLKI CGDI 
HGPYYDLLRLFEYRGLPPKSNYLFLEDY 
VDRRQH 


11139 


25040 


A 


11234 


233 


421 


PTTSNMRS RLLRGAS S * KKTAGPQQRNL 
EPALPRRWGGRSADNPPSGSLRKSGKNM 
QKTPGTA 


11140 


25041 


A 


11235 


343 


1 


NNTHGLSHSSVGRNPGAVGSGSCMSGIK 
LSEGLRSLLEAVCISRAAVIKCHSWVAS 
NNRHFFSPSSGG*KSEAKVWAGPCSL*R 
RVPSLPLPASGGSRHCLVCGCVTPFSAF 
TW 


11141 


25042 


A 


11236 


122 


419 


RTPRGPKPNPGLQRKTFFLIGGR I KTYL 
GIFGKDLYFWGGKNWDTLLPKI *RSREN 
KNF*REKWVNLPPYPGCLKNGARQQHFA 
HLLILRDTKTYMLAF 


11142 


25043 


A 


11237 


1 


379 


AFNHLHAGHGLSGAAMKS LVLLLCLAQL 
WGCHSS PHGPWLIYRQPNCDDP* TEEAS 
LEAIDYINQNLPWGYKHTLNHIDEVKVW 
PQ*PSGELFDIEIDTLETTCHVLDPTPV 
A* CNVRQLNE HAVQ 


11143 


25044 


A 


11238 


1 


397 


QTMTPNRGPLCPSKDLRSSHVISLPLHS 
ATHTRPTNQHTNHI PMMARRSTRKHIRR 
APHTTCP KRPS IRDNP I Y YLRNFFLRRI 
FLSLLPLQPS PYPPIRRALAPNRHHPAK 
S PRRPTP * PLTGALSALLKTSGLAM* FH 
FHS I ALL I LGLLTNTLT I YQ* WRDVALE 
STYEGHHTPPVQKGLRYGI ILFITSE I F 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 
nucle- 

location 

ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyialanine, 
G=Glycine, H— Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *~Stop codon, / = pos9ible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFAGFF * AFYHS S LAPTPQLGGHWPPTG 
ITPIiNPLEDPLLNTSVLLASGVS IT 


11144 


25045 


A 








FVSVR.I TPIYASAATWLS YVWVWASSPN 
VCP PGDRVALSPRVECSGT I S AVRNLRL 
PGSSDYPASA* PADGWAPPQYRCPPPRL 
ANF 


11145 


25046 


A 


11240 


391 


31 


VFFFFFFFFFFFFFFFFFFFFFFFPPPF 
LKNPFEPPPNFFFFCPKF * ILFLN* I * F 
* I KF* I * KIPGFFLNPPPFFPPFFFFP P 
GALFFFFFF*EGVliPCSPGWSAMSHLLS 
EKRKLTPRMSYN 


11146 


25047 


A 


11241 


371 


491 


KI FLCWARHDAHACNPS I FRGQGGHIT * 
GQEFKTSLANMV" 


1 1 147 


25048 


A 








LDLLTS * S AHLGLPKCWDYRCKP PRPAL 
NCFFRTISQLLNLCIKD 


11148 


25049 


A 


11243 


382 


333 


T YK I KDFKTF * R W FHPN I S RVEAE KLFL 


11149 


25050 


A 


11244 


169 


477 


TQKDEKESTTTTKNAENSKGQSAYSSPS 
DCNTSliARAQN*EEAEMDELTEVGFRRR 
VIMNFVKLKGHILTQCREAENHDKTLQK 


11150 


25051 


A 


11245 


200 


462 


S IiIYNELCT IHI LYQYYVA* QRS * CFQQ 
LMSFY * RTLKNLKESLCKVALLYPSFLK 
WKRGWLGTVADT*KPSTIjGGQGGRIA*A 
QEF 


11151 


25052 


A 


11246 


484 




IFCEKKGL*FQLLRRVRWEGPFNPGNQG 
CS*LLSSPCPPGWITE*DPVSQKKKERK 
RNQVMELVSAQENKNKPRSVGLQACARS 
MLAFSTTPR 


11152 


25053 


A 


11247 


226 


458 


NTVAVDWGPRCFFSLLLSHAENCDMWFT 
KIEI>FIFTNHLLLKQDLEMLW*GQAHAC 
NPSTIiGGRGGW 1 T *GQEFETS 


11153 


25054 


A 


11248 


304 


436 


SPSRCRK*KIG*AQWLTPVISTLWEAKV 
GRSLDPRNSRSAWTMY 


11154 


25055 


A 


11249 


216 


451 


THKYLFYYLGYNPMliYIERETGSCSIA* 
AGVQWYSHGLLHP* PSGLKRSSQVAGVI 
CIaANF F FFFIiKKW S Xj wlprlg 


11155 


25056 


A 


11250 


260 


474 


DKYFLLYPLKALNWLGAHINI * KVSKI 
MC*K*DKNMYNFFFSF*DGVSLLLPRLE 
CNGTILAHCNLCLPC 


11156 


25057 


A 


11252 


193 


496 


MTSCYSFYL.* IKDASMVKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKASSSQKAQDYS 
KASDYK 


11157 


25058 


A 


11253 


348 


478 


DLGWLKI.GT VAHT YNS S TSGGQGRR IT* 
GQKFKTSLANWKPC 


11158 


25059 


A 


11254 






KSIQKEKKKKKKKKKKKKKKKKKKKST 


11159 


25060 


A 


11255 


292 


48 


LPLTCHRPAVSHTHLAFSNPTAGITIF* 
TRSSS WNAGTFI I FKIVTFFLFFFF* ER 
VS LKL TRLERSGAI PAHCNLHL PE F 


11160 


25061 


A 


11256 


347 


16 


TQSADWCI YNPPARHKSS PS PHPTQ*PS 
WmLVDPAPGPWVELPTSPAPSACTPQP 
LGSRWDQPPQSKGQCPSGRLGPQGSPPQ 
VGLGKAGWRSQALPVGIiTRSIESHHW 


11161 


25062 


A 


11257 


238 


590 


APGAYIFPQQHNPTDSQVI FSLTASHLL 
I CS PLCIS VLHFNGMNIDDENRNAHVHA 
RACAHAHTHTHTHTHTHTLYLMNYRETL 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylnlanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MLLKTRGRATGSHARCIFS*SI J LQACP 


11162 


25063 


A 


11258 


310 


461 


LSFGDVLQLESDRPGAVARTCGPSTIiGG 
*GGWIT*GREFETSIiANMLiKPR 


11163 


25064 


A 


11259 


461 


174 


MGFHR VS QDGLDLLT * S S AHLGLPKC WD 
Y* CEPLRLAPVGKLFKCVGYKLVLCHGS 
GYMSMVIKIRMPGLIWSLEGGSAAWHR 
YWSIWEISFGC 


11164 


25065 


A 


11260 


343 


472 


IiLRRLRQKDCLSSGGQGCSEPRLDYCTP 
AWVTK*DPVSHKQE 


11165 


25066 


A 


11261 


229 


3 


VKMKLQ*VPVKVKI*KYTPLLGIYLKEL> 
KAGFGRNTCTLMF IAALFT IANRWMQPR 
LiP* IGECVNNM* YIHTMES 


11166 


25067 


A 


11263 


163 


399 


KLLPGTRNLHAMEHPIFGWLRNPHATAQ 
G*HPLSQSSI J A1.HGRADHICYPE1,CTSS 
SSCI IAGYPN* EGMFASQHHRVH 


11167 


25068 


A 


11264 


316 


427 


TNKKHI I * IINTTTHSLI IS I IPLLFFNQ 
INNNLFSCS 


11168 


25069 


A 


11265 


184 


418 


KTPPGFFEKNPFFWGGKKGPPPPPKNLP 
LGGKKKFLRGKGGKTFFFFFF*KIGSSS 
SPPPPPGGEQAPPPPPKSPPRR 


11169 


25070 


A 


11266 


219 


13 


KFFFSLEHQNFGGGGVPLFPPPKEGFLP 
KKPKQGFITPPLKQKIKPPPPGGYSGPP 
RVL * KGP PH I FY K 


11170 


25071 


A 


11267 


3 


173 


SHHARPETGFHRVSQDGLDLLTS*SACIj 
GLPKCWDYRREPPRQARKLYCLGMRTNE 


11171 


25072 


A 


11268 


157 


2 


PVDITHVKTTVGLGMVAHACNPSTLGGR 
GGR I T * ARGSRFLLPPHWTSELR 


11172 


25073 


A 


11269 


291 


468 


I SEGKRS LFVI L.HFILFFETGS CSVAQA 
GVQWCDIjDSL* PHI PGPK* SSHLGI.HTG 
ACH 


11173 


25074 


A 


11270 


29 


478 


GWNPYNQQDIGTLYD*SGARAGVLGTAIi 
CLLIRAELGQPGDLL.GNDHIYNVIVTAH 
AFAI I FFIVTP 1 1 IGGSGN* LVPLI IGA 
PDMAVPRINNI S F * LLPTS LLLLLASAI 
AEAGA+TG*TVYPPLAGNYSHPGACVDIi 
TIFSLHLAGV 


11174 


25075 


A 


11271 


61 


474 


P YNQGGNDAHAAWPHLTKWRSQGTTE I C 
REPEGAWQRPSQPRLHKGRSPGQIiRALS 
QNLPSYSGGSHLLSAYYVPDTISKHFIN 
TIGWLGTVAHACNPSTL*GEDRWIY*AQ 
EFKTSLANVL 


11175 


25076 


A 


11272 


277 


478 


KIYQPVKIHGPRPGAVAHTCNPSTLGDR 
DGW I T * GQEFETS IANMNS PSL 


11176 


25077 


A 


11273 


350 


11 


KEKTAGQARWLTPVI PALWEAEAGAS PE 
VRSSRPA*PPP 


11177 


25078 


A 


11274 


138 


3 


NYINLYAWLGAVAHACNPSTLGGRGGWI 
A*AQEFKTSLGNMAKP 


11178 


25079 


A 


11275 


359 


219 


FVIKMGFPQVGRDGFNFLAS*SAPLGLP 
KWWDYRH * PPRLALFFFF 


11179 


25080 


A 


11276 


262 


3 


FCMS LVHLFSHS QDNYFTS FSL * I FN I P 
SGP FLFAFQDTLQ * P * DF I YFYFETES P 
SVTRLECSGAISAHCNLRLLVSSNSPAS 
AS 


11180 


25081 


A 


11277 


285 


1 


VKKGDS VS KI FRKKKKRKDKI RLiHFMIT 
TVNNIX3 1 EGNLLNM I KEI YKNPTANI IL 
NGERLKAFSL*SGTRMSFLTPTFF*IIiV 
DTQQVYI FMSH 


11181 


25082 


A 


11278 


160 


3 


TDKE VPL I NAQRKW FL.EME S I PGEDAMN 
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Predict- 
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nucle- 

lo cation 
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ding to 
last amino 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H— Histidine, l=lso!eucine, 
K==Lysine, Lt=Eeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IV*MTTKELQYYINLVEKVLiARPE 


11182 


25083 


A 


11279 


334 


488 


PLNTCGY * NLH * *NFQPGSVAPACNPST 
LGGQDRWIT*GQEFETSIANMVK 


11183 


25084 


A 


11280 


527 


356 


PFVFLVETAFHHVAQDSLDLIjTS * S SRXi 
NLPKCWDYRHEPQRQAKNVGEFSETRS I 
Q 


11184 


25085 


A 


112S1 


486 


248 


VRQL FSLiLLPRliE CNG VI S AHCNLRLPG 
S CDS S ASAS * VAR I TGASGS QAWLQVQ 
CLQPVQPGEl.l,RVDI,FQIiVVLQR 


11185 


25086 


A 


11282 


38 


157 


STGLHHVSQDGVDLLTS * STLPKCWDYR 
HDPLRPASQRL 


11186 


25087 


A 


11283 


474 


339 


FVFLVQMGFHHVGQAGLELLTS*SACLG 
LPKCWD*RHEPTDPAK 


11187 


25088 


A 








IDKCVCSDYSKDWPFPIFLPLLRPPYCL. 
RHNNMETRSINNPPWPSKCSSEKTNEEG 
MLKVEIG*KLCLLCQTFSKDMNAEKKFIj 
KEIKSATPMNTSMIRKQNRLIHDMEKVIi 
V I R I E YQASHN I L 


11188 


25089 


A 


11285 


65 


280 


SRAVEFNLLYTT* SACLGLPKCWDYRCE P " 
PHPASP*FSKLLSSNLKTYLSSL*DSHS 
GFYSFVCSLGLI ILI 


1 1189 


25090 


A 


11286 


244 




MRLGRVPSVLFCYVRMQREGIILEAENK 
PS PD I ESVGAL IL.DFTASRTIRNKI LLF 
IIFPG*GILDSPNVP 


11190 


25091 


A 


11287 


345 


445 


KRPGTVAHACNPHTLGG*GRWIT*GQEF ' 
WTILA 


11191 


25092 


A" 


11288 


223 


500 


S P GT R P KVLiE CQ S T LG RQ A I L LLG E G W Li 
WGQSHRLGVQSQLCLCVLWDLTCSLS CS 
FQI IKLR*RPGVMVHTCNPSTLGGCGGR 
IAQGQEFK 


11192 


25093 


A 


11289 


157 


22 


LSCOEVKGEIGEAEICWAOWIjPPIIPAL. 
WGAKVG * SPE VRSS KPA 


11193 


25094 


A 


11290 


305 


497 


KLWCIYFKGVSFMVRELYCNKIV*WI.G ~ 
TVAHI YNPS ALGGACGW IT* GQE FETS L 
TNMAKPRL 


11194 


25095 


A 


11291 


478 


1145 


QHVQACP ERPQMMGTLERS RAVAS KI GH 
SYSLDSQPARAVGKPWPQQACTRVTEIjT 
EATGKliIRTSHIGKPHPQSFQPAAATQK 
LRPASQQGVQMKTQGGASNPALQIGSHP 
MCKSSQFKSDQSNPSTVKHSQPKPFHSV 
PSQPKSSQTKSCQSQPSQTKPSPCKSTQ 
PKPSQPWPPQSKPSQPRPPQPKSSSTNP 
SGAKAHHSKAGQKRGGKH*RANSRDL 


11195 


25096 


A 


11292 


352 


468 


TFPGQHGKIPSLI.KIQKLAGHGDACL* S 
RLSPR 


11196 


25097 


A 


11293 


295 


24 


ENIQGSKKNERQHMHNGVGTVJNSNPGFV 
STWVYRNCIjACIjD I S FS F I KWVKSHKDT 
SMGMFTAALFTIAKT*NQSTCPSMVDWN 
STTRTS 


11197 


25098 


A 


11294 


274 


429 


ECEPVQTLWKIRKFLKKLKIELPSDPAI " 
PLKGINLKEVKSVP*SDTYTPMCr 


11198 


25099 


A 


11295 


168 


469 


PSLGNKSETPSPKKKKNLFLGFPYFREK 
LVGILLKTVLQKGAGGVFFGPKNPPEKN 
PDPPIjDSPPKTFRRLGAFVKTFPKGLKA 
AWFPAIiV* PQGQMGG 


11199 


25100 


A ' 


11296 


37 


348 


IKIFNKTLKYFKKKKTIKKKKKKGGGPL 
KRPRGGPRIYPDGARKNLPHIGTAKKHW | 
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Predict- 
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ding to 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














GGYWEKNPHGGGGNTGPTPPQKTRPLGG 
KNNLKRVQGEKPRPRLGGGKTRSQLFSC 
HA* P * VQGEKPRPRLGGGK 


11200 


25101 


A 


11297 


93 


388 


DEYASDNCCYFPGSSHDSQVKQIGSQFT 
TQIQAHLNLAKPRRAL*NHS* PMEMRSK 
CLLLAYKKKKKKKKKKKKKKKKKKKKKK 
PKYLSSRGKDHF 


11201 


25102 


A 


11298 


61 


243 


YYIHRTTVFMCMN*GLKDNVDK*TIDLC 














KKKKKKKKPQGGGA 


11202 


25103 


A 


11299 


250 


341 


IYMSLTLGLQNIi*EIHLTHKDSHKIiQVK 
GWKNI FHANVNQKQTGVAIL I SDNTD I K 
ATTVKKKKKRENPT I LNI YVPNTGAPKF 
MKHIiLLNIjKNERDGTTVLG 


11203 


25104 


A 


11300 


3 


285 


DAWERKKRSQMNQKTTNKVAGVS P YLS I 
MAWNVNELNFLI KRHRVA+CI KKTKSMI 
CYLQETNFIKDTHRLKIRGWKKI FHPNR 
N*KLAGWY 


11204 


25105 


A 


11301 


175 


407 


AENNPDVLQLKA*TSRGTFMP* ILLRVK 
RGSNSRNNRCWQGWRGTGMIjIjHC*QECK 
LVQPLWKTVWRII.PDLESEIL 


11205 


25106 


A 


11302 


75 


280 


YHLNLVLYPHPPKNRVC*KKKKKKKKKK 
KKKKKKKKKKKKKKKRGGPLKKNL.LGGP 
FFPGGKKKNFFF 


11206 


25107 


A 


11303 


299 


3 


IHASFLFRGCITLAFTVRCSHRSITSSE 
TSDLTHYYYYYYYYF * LASQS VALAGVQ 
WDQLGSNHPPTSAS* VAETAGASHHARL 
I FNF FVEMEVS LYC 


11207 


25108 


A 


11304 


137 


323 


FFTVNGRHRKEKKSNKDQLNLGLHDFLD 
NPLLLF * KYLGREKKKKKKKKKKKKKKK 
KQKKKK 


11208 


25109 


A 


11305 


64 


481 


RPTRPHVYHI VKP IP* PLTGALSALLMT 
CGLSM*FHFHSITIiLiIIX3LIiTNTLTIYQ 
*WRDETRESTYQGHHTPPVQKGLRYGII 
LFITSEDLFFAGFF*AFYHSSLTPTPQL 
GGHWPPTGITPLNPLEVPLLNTSVLLA 


11209 


25110 


A 


11306 


142 


330 


FFTVNGRHRKEKKSNKDQLNLGLHDFLD 
NPLLLF * KYLGRE KKKKKKKKKNKKKKK 
KKKKKKN 


11210 


25111 


A 


11307 


259 


440 


RP WIRDNP I YYLKS CFLRT I FLSLLPLQ 
PRPYPPTRRALSPNRHHPAKSPRSP*PL 
TGALS ALLMTSDLTMGLHFHY I TLLI LG 
LLTNTLT I YQ* WRDATRESTYQGHHTPP 
VHKGLGYGIILFITSKWFFARFF*AFY 
HSSLDPTPQLGGHWPPTGITPLNPLEVP 
LLNTVRL 


11211 


25112 


A 


11308 


433 


3 


FPPPDIRHSRSLGFPPSFGRAGFLNFAF 
SKGQEPALPGGFYYPSPKKKNLAFCSPP 
LFYWGKFGSKFSLGVGPLFFP+ *SPKSF 
CQFLBNKGFFPEKSFFKFFPPAGGCVFF 
FSEGGFPFCLFLFFFDRVWLCHPGWISV 
ARS 


11212 


25113 


A 


11309 


2 


423 


GRVGDR*LFYTNHMDIGTLYLLFGA*SG 
VLGTALSLLIRAELGQPGNLLGNDHIYN 
GI VTDHAFD I IFF I AI P 1 1 IGGFGN * LV 
PLI IGAPDMAVARINNISV*LLPPSLLL 
LLAYAIVEAGAGTG*TVYPALAGNYSHP 


11213 


25114 


A 


11310 


234 


1 


TFFGKFP FCLKFKPAQRGGENTPGP FPF 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G = Glycine, H^Histidine, l=Isoleucine 
K=0Lysine, L= Leucine, M=Methionine, 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LVFFFFEMESRSVAQTGVQWRDLLGSSN 
S PTS AS * VAGTTGACHHARli I F 


11214 


25115 


A 


11311 


406 


211 


LSLFLHS LIS THGARPTDP VGLVPTQDG 
LDLLTS * SAHIGLPKCWDYRRAPPCPAC 
GFDLHL.PND 


11215 


25116 


A 


11312 


379 


398 


KT*NFTVRGSIPLLNNIPMASMTFSKKK 
KKKKKKKRGGPLKKNPGGAKFNGGRKRK 
IFSLKGGEKKNPGGILEKKPFFGGGKKG 
ENPPKKNKGLREKKKF*RGKGGKKPQKG 
GGKKMGSSSAKTPTWG 


11216 


25117 


A 


11313 


390 


489 


RYRLG WAHACNPS TLGSRGRW 1 1 *GQE 
FETSV 


11217 


25118 


A 


11314 


286 


434 


ILWYMNYI S IKLLKI II YKLGAVAHTCN 
PSTLGGRGGW IP* GWEFETS L 


11218 


25119 


A 






3 


NIiLKIYI<GLiY*FYYVPLLRTKSLKKTGS 
FESNIiSFK*VFLWSGTVAHTCNPSTLGG 
RRGWIA*GQEIiETSIiTNMERPR 


11219 


25120 


A 


11316 


251 


389 


rFFFFEREFHFVPQVGVQGGSLGSLQAP 
PPGSHSLA*AS PRCWDGG 


11220 


25121 


A 


11317 


313 


80 


EYICPRCLSPGFRD* PGQHGETPSLOKL 
QKSAGCGGAHLRSQLLRRLGDRVRHCLK 
KRI YIiSI/FNSLVLSNNRFQI I 


11221 


25122 


A 


11318 


145 


410 


APKTIjKVALiTSAAAARESICQLGVIjDEIj 
vi fxie lnrs td qr p g avahtcnp rtlg g 
HGGWIT*AQEFKTNLANMVKTC 


11222 


25123 


A 


11319 


393 


3 


NFFKIIFTLSPPLPLIFTPPFFFFFKKI 
PPKPLFFGVFHSQTCPQISFFKNFYPFL 

VGNPLFFLFFFFFFFFFG*EGVSLLLPR 
LECNGT I S AHYYLR PRV 


11223 


25124 


A 


11320 


135 


I 


RkDVJQjEYAIRKMEVRPGVAAHACNPST 
LGSRGGWIT*GQEFETS 


11224 


25125 


A 


11321 


282 


2 


NSESVLIKEKGDEEEKVLSFLLFSKRKV 
LES KTFP TLHSG CKKRS YFLKI TTSS FL 
* PRVBCSGVI IDHCKLKLLGSGDPPTSA 
FPVARTTGM 


11225 


25126 


A 


11322 


149 


30 


WGSGGQVRWLTP I I PALWKAE AGGS PEV 
RSS*PVWPCLY 


11226 


25127 


A 


11323 


316 


399 


STLGGRGAQIT*GREFSTSLANMPNPIS 


11227 


25128 


A 


11324 


223 


397 


IIKAYQ*DVNREQIKTIIIYFQNKIjKCI 
KKMIWPGAWHVYNPSTLGG *GGW IT*G 
QE 


11228 


25129 


A 


11325 


3 


396 


NYSTAALFLRGH I WGWR WGTL YYGS S P I " 
YCRGLNSTLGLYPLNASSNPHPRKLCKP 
KIYTHIAKCPIGHKIIYPYANTPIH*HH 
THTHTHTHTL*DSLLWPGWAHTCNHNT 
LG S WGRQ IT* VQE FKTS1A 


11229 


25130 








442 


KRYXUXRAHSEVIiCGHEIjWGTIjHNPVQT 
IQTWEPFPYLPLCSRVTSGKSHHRFVPL 
LFHL *NRNNSHHHHHHHHHHHHHRNPC* 
ECCRAHSDSYAVWSDKVSQEPVAAM 


11230 


25131 


A 


11327 


145 


3 


ELDIKPPNPFFCHTGVQ*CDLGSPQLLP 
PGFK*FCLSFPSSWDYRCA 


11231 


25132 


A 


11328 


189 


I 


KTNLLIil HS KLRLHLQS S VFQC * S KSEK 
SNVCT * CAFSCLPKLFWLGKVAHACNPS 
TLGGRGG 


11232 i 


25133 


A 


11329 


193 


405 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P>=Proline, 
Q=-Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stopcodon,/=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














AYTSKKKT 


11233 


25134 


A 


11330 


209 


11 


GGKRGTPPTINNIFFNFFF*EKGSCCSS 
RAC*CNGEI*AHCPLRLPGSSNSPAAAS 
RVQETATTPS 


11234 


25135 


A 


11331 


38 


390 


VDQSTVCKMGQSAGCGWGSDKGIKAGCL 
K* QRQKKKKKKKKKKKKKKKKKKKGGGG 
LKKKLFL.GPGGEKFFFFFWGPFFFFGGR 
FLKRGGGKKPGYKKNKISGHPPPFLGGG 


11235 


25136 


A 


11332 


38 


326 


VDQSTVCKMGQSAGCGWGSDKGIKAGCL 














FIKKIFFGPGGEKI FFFFWGGFFFFGGG 
FKKRGGEEKNLF 


11236 


25137 


A 


11333 


1 


234 


RTRGKNWMAF I KEG WVPTGRMGGS FIKH 
SFLESKNTWLGKKKKKKKKKKKKKKKKK 
PHDKI KDLFDP V+NKKQNI S I P 


1 1237 


25138 


A 


11334 


244 


398 


INHLYSNKNDLI FLLVFYLFFFP+RRPG 
EPPLIKGWLPYLGGVLNLRKDPL 


11238 


25139 


A 


11335 


174 


1 


HQVYPYVKQEHYKQDSCQCYI KYMNVW P 
DAVAHGCNPGTLGAQGGWI I *GQKFERT 
RG 


11239 


25140 


A 


11336 


148 


411 


LFFFPHCYLLNAPS ILYLNEPKWYNLH 
LCVS AVCLLVPVHS PHRTCLY * KKKKKK 

KLLF 














11240 


25141 


A 


11337 


237 


453 


GQRPLLSFREKGSFFFLLETEFCFAPQA 
EGQGHNLG*LQLPIiWG*SNSLVSSSQEV 
GITGAGHHPRLFFVF 


11241 


25142 


A 


11338 


65 


347 


VCRVDDFVPEARTRFFKS I * EAWNNKNI 
KPLLSTFSQVPGSENEKKCTIjDQAFIGI 
IiEEEIINHSSCENVIiAIISliAIGGVTEG 
KYGSVLFCLK 


1 1242 


25143 


A 


11339 


158 


393 


VPCTHSEGIRKGRKCLPKTRENAREERD 
KKIPPGLC*AGKLFPQEKSSTWPGKVVH 
AWNPSTLEGRGGWIT*GLEFKTT 


11243 


25144 


A 


11340 


117 


3 


GIRPGAVAHVCNPS PIjGG * GRWI SRDWE 
L*TSliANIR 


1 1244 


25145 


A 


11341 


292 


60 


SGQGLRLRGAFRGKVQAKEWMP I PKLGC 
LVKDMKIKSLEE I YFFSPP I K* FBI IDF 
FLGASFKDEVFFLFVCLFVCF 


11245 


25146 


A 


11342 


149 


30 


TQIFPPPPPPGKKKKFFFPKKKKKKKKK 
ERKKDRKNKQK* KGPGVGVS PLFPPPLG 
GQGGGFPQGIjGFKTPIAPKGKPWFP*KT 
QKFPRRGGGPPKI PLFGGGGEKKPFSPG 
GQGSNKPKFFPPPPPRGKKKNFFSPKKK 
KKKKRKKERKTEKTNKK 


11246 


25147 


A 


11343 


120 


504 


GVHDRCLWDDMTRRPPAAAATACPAGRP 
PPRREEEHSQLIiLiISFQGFRWD*DQDVN 
TPNUDHIiAREGVKAKYIiMPPLVTMTSPS 
HFTAITGKRHSAHFTRCPSNPQRPSFPV 
IRSKSSVSSREVEVA 


11247 


25148 


A 


11344 


213 


399 


AL VI HCLTTT IQ VS GLSGS KYLiMRQGMV 
AHGCHPNTLGGRGGWIT*SQESETSLAN 
TVKPCL 


11248 


25149 


A 


11345 


308 


3 


NPTPRSGDLWVEEGSPGAGNWGCLSEQT 
LRAI I KATTS YS F YCYFFETGS HS VAWA 
RVQWYNPGCS I CGS SHLPTSAS * GAGT I 
GMHHHAWLIFLFFVENE 
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sequence 


SEQ ID 
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peptide 
sequence 


M 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparogine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *— Stop codon, /—possible 
nucleotide deletion, V=possible 
nucleotide insertion 


11249 


25150 


A 


11346 


189 


433 


GHCRAGTSLTLTHLGGSAMELIIHLGLC 
LGRMKFIHTTCI ILFLFFLSATQAGGQG 
GDLS*LQPPPLGFKQFSCFSLPRSWD 


11250 


25151 


A 


11347 


257 


3 


KWGELCFLTIKEERMGFPLAKKRKIKGV 
F*FGLFFFFETESHPVAQAHCNLSLLDS 
SYSPASAS*VAGITGACHHNQLIFIFYE 


11251 


25152 


A 


11348 


159 


434 


LSPS CRDVGTYSARAHPPHTHTRVRAHT 
HACTHVRAHTHTHTTAHGSHLTKEQPGL 
CTLPGSSW* PGLGGRAIATAQAWVHCAG 
IITAAHPP 


11252 


25153 


A 


11349 


240 


406 


AFKKLAKNNSMGVNRSLSRPGVVAHAWN 
PSTLGGRGRQI T * GQEFETS LANMVKP 


11253 


25154 


A 


11350 


163 


360 


PQLLQLMKFLQN * LLERLRPQNGLNLEG 
LACSESRSRPCAPVWRTERDSVSSPQPR 
PGAGGRPSFF 


11254 


25155 


A 


11351 


145 


406 


PLNSGAKPTLPPWSPRRPELNAPPPPPG 
EKEFFFFLGRGFLFAPQEEAGGENIV*G 
KPPPPG*REFFAPPPGGDGNKGAPPPPP 
LFL 


11255 


25156 


A 


11352 


122 


337 


ILWVLLIiVRESLKFFFFLLH.FFt.FFET 
DVA* AAVQ* SDHSSRQP * PPGLRGS YTS 
AFRIAGAIGICHHLY 


11256 


25157 


A 


11353 


325 


416 


KCVY I MG WA* WLTP VI PALWEAGAGRSL 
EV 


11257 


25158 


A 


11354 


260 


415 


LEELQNTACQKLEPFLSRTETKQGCLLS 
PRLFNIVLEVLA*AVRQEKEIKGI 


11258 


25159 


A 


11355 


160 


2 


ISE I STKRNNYFLKNWLG WAHACNPS t 
* RG *GGQTTRGQDFQTSLANMVKP 


11259 


25160 


A 


11356 


160 


3 


ENLKLYNGSPFLKI IWLGMWHTCNPST 
LGGQGGRI P*AQEFKTSLGNVRRP 


11260 


25161 


A 


11357 


235 


408 


GQPLPAALFFYQWHLHFKELLAHPWGKT 
VWRFL* PLKKEGPHDPAIPLLDIYPKNK 
KW 


11261 


25162 


A 


11358 


11 


393 


QTERTPTPKPHLYVSFPPNSQDPQLQGG 
KLTNRKNTHTKTPSVCHHHQRPKVDKTT 
KIGKKQSRKTGNSKNQSASPPPKECSSS 
PATEQSWTENDFDELREEGFR*SNYTLL 
KEEVRTHGKEVKNLE 


11262 


25163 


A 


11359 


283 


475 


FMVSVS I KVYFFIFI FLFFEMESCSVTQ 
AGVQWHDCLSFLOVLASSPGNSSPVR 


11263 


25164 


A 


11360 


266 


407 


LE IHHS KTL* FWLG I VARACNPSTLGG* " 
GRW IT* GQEAE I I LANI VK 


11264 


25165 


A 


11361 


93 


1 


DVLS I LLPRLECNGAI S AH *NFRLPGS S 
NSS 


11265 


25166 


A 


11362 


131 


377 


MEEMI KGRGLSKPPALSSYLTLKSS *LK~ 
RVRARRRL FLKS RFS LDLQVAG FFVCL F 
VFWDRVSLCHPGWSVWQSRLTAALT 


11266 


25167 


A 


11363 


212 


392 


QAEECAPLFIGVKEP P KNLTHWLGTVAH 
TSNPSTLGG*GGQITWGQEFETSLANMA 
KPCV 


11267 


25168 


A 


11364 


141 


3 


NELWLNHQNVLEWLGTVAYTCNPSTVGR 
WI T* V*EFETSLGNMVK 


11268 


25169 


A 


11365 


215 


384 


ANFN IRPE I SLAS KCPSERKSRTLLPLN' 
QKLEMI *LSEEATSKAKTG*KLGLLHOI 


11269 


25170 


A 


11366 


48 


373 


EPWTRKEVLKISSSLKIHCYLSILCLIL 
MPTLECTAAIMAYSNLELPGTS CPPTCA 
S * VAGTTGACHRAPTVLKI FLRDRI FFF 
FFTNVTYPWCHTPADPAIFKANAR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 
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NO: in 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutnmic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valtne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11270 


25171 


A 


11367 


310 


2 


GLICLNRENAYFFVIRQISMSCLEKEKP 
ARYS SPYAKS VTRGEKRRQRFFLSRAP P 
IPFLFFI* IV* AESRSVAQAGGQWLHFS 
SLQTPP*RRTSDSPASAS 


1 1271 


25172 


A 


11368 


303 


50 


MFALTNRG VQGS P VI FLGVFL I S KMGFT 
CFKGFSRFKVLVFFIiGVFPSFFFFLRAR 
VLLCCPGWSPTLGIK*SSCLGLPQCVLT 


11272 


25173 


A 


11369 


109 


419 


QSNASLSHFTSPPLSVADR*LFSTDHKD 
IGALYLLFGA*AAVLCTAISLFILDWLG 
QPCNLLRNDHIYYVIVTAHAFARIFFIG 
IPI I IGGFGN* LVPLI IGA 


11273 


25174 


A 


11370 


112 


258 


GGRFKGSNFTSAGMQRNIFFMGPPK*NS 
RAGV* QRGEGKNPGVTKLNRL 


11274 


25175 


A 


11371 


823 


1078 


SQHFGMPRWADHLRSGVQDQPGQHGETL, 
SLLKIQKLAGRGGRCL*SQLLRRLRQEN 
CLSLGGRGCSEPRSHHCTPAWATQ*DSV 
S 


11275 


25176 


A 


11372 


184 


420 


I KQQKNLFLALVKEIiVQLIjWKTVWQFIjK 
KliNI ELLCDPAI LLIiGIYPRELKMYVHT 
KTCT*MFIVELFIKSKK*KQPTC 


11276 


25177 


A 


11373 


234 


44 


LATITDRLLALSDSAIPPLGIYAKEKKS 
VYQKDNCT YI F VAAL FTVAQ I *NQPKCP 
STRTRGS 


11277 


25178 


A 


11374 


73 


456 


PPIGSPIjVPYIPSPALSSPQAPRMGSHP 
CLIPSASCHPPLETDFIiSLLPTHFCIjAV 

ftkerfsppppspyprf*kflrs*kfsf 
flp pfl ffggtvllciisgws avaq s wlt 
aastslgssdpptsts 


11278 


25179 


A 


11375 


43 


456 


EFFHHVGQDGIiDLIjTS * SAHLGLLKCWD 
YRREPPRPASDGHY* tdatgslps sgtt 
* IRTKPSQAPASWGLWNLAHHPPRSHPS 
CPMANLICSTLSSFDGGSPGTGPGGWCP 
LGLSGSPARAVFKDSSCSLHPLATGI 


11279 


25180 


A 


11376 


288 


484 


EWVLSAVGGSQYGVCIiPFLHCFI FFETE 
SRSVAQDGVQ*CDLGSGSSLPPG 


1 1280 


25181 


A 


11377 


129 


1 


NFNALNL I RAGAVAHAYNP SHLGGEDGR 
IA*GQEFETSLSNTA 


1 1281 


25182 


A 


11378 


307 


1 


DSEIPSLGLAKFWNHRREPPCIiAliFLTL 
NQKLEMI KLSEEG I S KAKTR* KAGFLCQ 
RVSQDVNAKEKFLKE I KRATLVKTQMMR 
HQNSFIADAEKVWMAWMK 


11282 


25183 


A 


11379 


200 


397 


CISLFSHYYKDLPETGSFIKKRGLIGSH 
WLGAVAHACNPS ILGGQGRRIT*GQEFE 
TSLASMVKPC 


11283 


25184 


A 


11380 


167 


1 


SNGLYS REARMVQHTH I RPG WAHTQJP 
STLGGHGGRIT*AQELKTSLGNIVRPC 


11284 


25185 


A 


11382 


12 


395 


AHSSFLSLDLSVFATCPHPRAETQTGHR 
FS TLLPLSALYPK* D YFKKKKKKKKKKK 
KKKKKKKKKKKKSGGGA 


1 1285 


25186 


A 


11383 


160 


2 


TMEMMLDI K* I *EI FLFEFKMGLKAAET 
TQNINNTFGPGTANEQTAQRWFNK 


11286 


25187 


A 


11384 


274 


389 


GQAQWLTPVI PALWETKAGGS LE * GQAQ 
WL.TPVI PALWETKAGGSLELKNSRPSLG 
NMVKPP 


11287 


25188 


A 


11385 


389 


37 


WS YESSWS F I FKS FLFS FMISAYDM* YA 
SQILFTPKLYRTHPYFLLVLFIilLSILV 
ILETGSRFVSRLECGGGIS AYGS FK* LG 
SGNPPAAASLSSRI PYVHQSCWSDLAFN 



1233 



MSOOCID: <WO 0164835A2_L> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GJutamine, R = Arginine, S— Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, impossible 

nucleotide insertion ■ 
















11288 


25189 


A 


11386 


214 


374 


KGGGR FKGS KFTS PGGC3E KN F F I GA P K * 
NS RAG V * KRREGKNPGVTQLKKGVG 


11289 


25190 


A 


11387 


142 


410 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
* KKK I KKKKRGAP FKKKNF FYRGGGGKY 
FFFGAPKINFLGRVFQPGGGENPGRS PF 
KGAWG 










144 


1 


CLSNEDS IKTPKGLGTVARTCNPSTWGG 
RNGWIT*GQEFKTSliANTVK 


11291 


25192 


A 
— — 


11389 


167 


2 


RNISSHLLNMAITKQTNNNNCW*GYREI 
ITHIHC*WEFKMMQLL>QKLVPQKVKHT 


Yy2§2 


"25193 










SDQVQDAGRL IS SEAS KLiGLPTATFLLC 
LPTGFALiCTRVPGVFLLiLRTSVILiREGP 
TLVTS F YLNYLLKDLVS IQSHWGLGLQP 
MNFGETQFSL*QKXiV* KEKESMPGMVAH 
ACNPRNLGGQGGL I T * GQEFKT I LA 


11293 


25194 


A 


11391 


1 


206 


TGRLAGHRVSSLP IRGGRAEAPLTSRTG 
RLNQDGLNLiIiTS * SARLGLPKCWE YRRQ 
PIiRLAVISGFLT 


11294 


25195 


A 


11392 


262 


388 


VSK* KKKKKiCKKKKKKKKKKKQ 


11295 


25196 


A 


11393 


78 


414 


ICFFLRAPPFFFFFFFF*TKIFFFFPAL 
NSMANFFFFWTPPFLGGIFLPPPFFLEG 
GGPGLAS P PGGAF F F 


11296 


25197 


A 


11394 


238 


11 


I KWETAGRS SACL * S * LFERLRRENPLN 
PGVHG * DE I * * HHCTPAWVT VR PRLSKY 
INKNKK*ARLAARCGGSRL 


11297 


25198 


A 


11395 


340 


1 


KHI LVP PF INVS E IQRYSLV* HLLFVG V 
* NLFFL FLS FKKP VLCSM * 1 1 CHE 1 1 S P 
PSKPKCKASHLSFV*KKNSNIiWLGAVAH 
ACNPGTLGARGGRIT*GWEIETSLTNME 
K 












245 


QMDLNRHLSKDDK+MANQCMNRCLTSLA 
IRQMQI KTIMI 


11299 


25200 


A 


11397 


292 


2 


DPKKEAGRGGAQL* FQLLRRFKHETHLN 
PGG*GCSESRSHSSPPAGATK*NGVSKK 
K* Y*APKEERHPRPVX>VEPRVPQVPSPG 
SQTYRQDHSPRV 












321 


KKLiYTHKSKSDVMI FLKIETVSWLYCPG 
WRVMAHCDLKLLGSSKPPTSAS * VAETS 


11301 


25202 


A 


11399 


118 


394 


QALPQPIPVLILLQFFFFFF*GTGFCFV 
AQ I GGQGGH YG * LE P P I RGL KPS S RRTL 
RGGWD* GGHHNAQL I LGFFEKTGS CF W 
QAGLKALA 


11302 


25203 


A 


11400 


273 


379 


AE PGWAHS YNPGTLGCRGGG I T * AQEF 
YTSLGNM 


11303 


25204 


A 


11401 


360 


38 


KI FGQARWIiTPVI PAFWETEVDGSPEVG 
RSRPA*PS 


11304 


25205 


A 




254 


404 


G P AGHD FHM I TPS KS TE TGLGAVAHVCN 
PSTliGGRGGWII*RQDLETSLA 


11305 


25206 


A 


11403 


161 


3 


I LTPTLQYI KINLRGQAW WFTHVI PALW 
KAK*SGGLLELRSIjRPAWVIEGDL 


11306 


25207 


A 


11404 


1 


170 


MGVSLLS PRI,ECNGVISAHCNCHI,L*FT 
FK* FSCLSLPSSWYRCVPPCIiATTFLIY 


11307 


25208 


A 


11405 


249 


3 


NAKSRI QVI S I WVFTVKFFHLY YLCNFS 
Q*NEMAGGQSIFRKKHLDFSLEHSGMMT 
THCSLDLPGSSDPPNSASSVGGTRH 


11308 


25209 . 


A 


11406 | 70 


401 


CGKHGVNLI IHSFLNHL.KYLLI.LQMANL 
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peptide 

sequence 


Amino acid sequence (A=Alanme 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FISQFLLTCVSEPLLF*YCHQIIKVGRIi 
HEVKQMRGTKKQEKMFPYWNNDKNVCPG 
VPAHACNPTTVGG* GGWI T * GQEFNT 


11309 


25210 


A 


11407 


257 


389 


GGSLLQKYLRPLVAFLFSFFFFSFETES 
* SWRLE CNGM I SAHC 


11310 


25211 


A 


11408 


364 


420 


SQPFGRPRQADCLSSG*AQWLTPWPAL 
WEAKAGRLLELRSLR 


11311 


25212 


A 


11409 


312 


413 


TSGMGPGWAHACNPITLGGQGGWIT*G 
REFETS 


31312 


25213 


A 


11410 


283 


425 


HGHKDYVHWLGAVARACGPS I LGGRGMW 
IS *GQEFETSIiANMVKPCL 


11313 


25214 


A 


11411 


389 


3 


CR I DGIiF FNliVFliRLVVW KF * VI FF LS Y 
FSFLVIIIiQFLTYMLNCTFF*QILNIjFP 
ILVLLTKILNTLLSTENYCHFSLLYCLE 
FWFHPI I KHKMYNLFLI * FILI FILRQS 
LARSPRLKYSGTISIH 


11314 


25215 


A 


11412 


267 


3 


QEFQQEDTSELDLE IRSLCKRKGGLKQG 
DTKDNLS KRVWETDKGKAAWWPGAVTH 
ACNPSTLGGQGGW IT*GQE FETTLTNRC 
PRV 


11315 


25216 


A 


11413 


3 


281 


GAWSCRIPGEGSLSASQAIQPFSLYPVN 
PQ* I SLSARKKKKKKKKKKKKKKKKKKK 
KKKGGGGVLKKIYFPPRGEKKFFFFWGP 
PKKTGGGGF 


11316 


25217 


A 


11414 


275 


2 


KTAHFQLFNSDFKFPGSRNPGQSWSPRA 
KQSFCTQALLKYFLK*MNEMKIiNKSRSG 
AVARTCNPSTLGGQGGWIT * GQE FETSL 
ANMAKPR 


11317 


25218 


A 


11415 


319 


1 


RNLGVQVYEYWLVYKYCSVSSNNKMKN 
KRDLLVI * VPG I VIHLIYRL I KENPELY 
ISLFIYSMRSLKKLTYRPGAVAHACNPN 
TLGGRGGWIVCSQEFETSLGNM 


11318 


25219 


A 


11416 


121 


428 


LSCDKWGNRDMTKMLSKELKKKKKKKKK 

GGGGALIKNILFPSRAASGFFL.SLLKYK 
KLGAAVYIILGEKTILWL 














11319 


25220 


A 


11417 


79 


253 


KIFNFQFSNMLENNSHTTRINNYFPGRE 
RWLTPVI PPLWEAKAGRSLEVRS*RPSC 
P 


11320 


25221 


A 


11418 


180 


4 


EVGFMKPGLCGVFLEAGSHS VALT I VOW 
CDLGSPQHPPLGSRDPPP*ALQIVGNTR 
RVP 


11321 


25222 


A 


11419 


154 


3 


STLKDKENQKLS IQPDTMAHVCNPSTLG 
GRGGWIT*VQEFEISLNNMVKP 


11322 


25223 


A 


11420 


436 


478 


AIPLPH*RKKILHAALFFFFLKGSFVLS 
PRLECSTKIWAHCSLPPPNISNPPASSS 
QEGGTT 


11323 


25224 


A 


11421 


143 


398 


AAFGFWFVIjIRFLRQRLALSPRLDCFFA 
KSAS CQPRPP * FKRVCWLSLPKSWDYRF 
DLLCAAGLVWVYLSVSRLHINEKNQPPP 

p 


11324 


25225 


A 


11422 


1 


138 


MVFHRVGQDGLDLLTS *SAHLGLPKCWD 
YRHE P PRPAAHS YFLLHK 


11325 


25226 


A 


11423 


2 


390 


LELPILLLNTLII.FLFI.ILYIYLVFSPN 
AFRILF I T I VLES FS FI LLVP S KP WKYS 
LILSNAFVLCMMISSPPFPLFSLSTTRI 
NKKLD FWLGAVAKAYNPS TLGS QGGWI I 
♦GQEFETSLTNMAKPHL 



1235 



0164835A2 I > 



WO 01/64835 



PCT7US0 1/0492 7 



SEQ ID 
NO: of 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny lalanine, 
G=Glycine, H=Histidine, I=Isolcucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


J 1326 


25227 


A 


11424 


267 


414 


VFQ I TFCIKNFWPGWAHACS PSTLGG * 
GGR I T * AQE FETS LGNMVKP C 


11327 


25228 


A 


11425 


189 


73 


HQDGLDLLTS * STRLGLPKCWDYRHKPP 
RSARPNAWVAP 


11328 


25229 


A 


11426 


318 


33 


KKT I SSHARGRKLGSFKTAAPFFFFF * T 
GSCSLAQAGVKWCSHSSMQPRPPGLK* F 
SASGS AGX IGMSHHTRPIAGFLLAKLKK 
KSNSTGYGGAHV 


11329 


25230 


A 


11427 


291 


393 


DRVSLCLPGWSAVLPS*LTAAQISWAQV 


11330 


25231 


A 


11428 


51 


399 


KRVARQNPPNGQGFGWCSCS P ILYGQSR 
AQGEN I YKLLGSGE W PGQL I IDVEREQL 
EDWRQGDLGRGMWMTKARVHKGPGLVAG 
TCNPSTLGGRGR W I T*GQE FENSLANMV 
KPCF 


11331 


25232 


A 


11429 


242 


420 


VKKKKRGGRFKG S KFTSACLQG I S FFKG ' 
APKLNSRAGF*QRWEGKNLGVPQFNRFG 


11332 


25233 


A 


11430 


77 


405 


TKRGDSSFCLYRIWIIDSVMKWNQSKAA 
AAAAFFFFLRGTLLFtiFRLKGKVGASVI 
*NFCLRGWGGSPASPSKEVGMEGACHNS 
GLFFVFLEKRRFPLVGQEFKTTRAN 


11333 


25234 


A 


11431 


121 


439 


LPLPTSGPEDPLALS FKPTP FPSTLSGM 
I YTGHPS PEHTSHLMPAPPVNNNKACRF 
FKT * KKKKKKKKKKKKKKKKKKKKGGPL 
KKKKIFAPPGGEIFFFFWGPKK 






A 


j J432 






L>TP WNLDSRCE I KGNGKPQENHLNLGGG 
GCSEPKSHHCTPAREKRQNSVSKK* INK 
IK 


11335 


25236 


A 


11433 


,296 


1 


FLKKPSKGFSKLPHIiPQNLNLFFPPK* K 
CPRAKKDIPPPSF*KISSLFGEG*GFTG 
SRPPSLNIiENPPKRIFFFFFFLDRVSHC 
RPGWSAMASSCPRV 


11336 


25237 


A 


11434 


385 


418 


QSETLSQKQKTK* E * SWLGAVAHACNPS 


11337 


25238 


A 


11435 


3 


405 


LLHQGRLMLGQ WGNGRTKTLLMRENVG K 
F PAE PS CS AliGVWLQALAC YPGLVRRQ * 

G 














11338 


25239 


A 


11436 


221 


3 


KHEFQTIQMESMK*NTSIKICIYAS*IQ' " 
D CLR YQ WQR PAAVAQ AI TEH FG R PRQ AD 
HLRS GVSDQPGQHGET 


11339 


25240 


A 


11437 


288 


366 


KMFFS WA* WLMPI I PELWEAKAGGSL 










160 


370 


LWYHYFPVRDGLTI1,VTYQLKAIRRIjIiT 
LKSRSGAVAHTCNLNTLGGRGG W I T * GR 
E FETS LANMEKPHL 


11341 


25242 


A 


11439 


230 


404 


MQHGQVG I YSQGGGWRPVDKKNSQLGTV 
AHTCNL I TLGGRGGWI T * GQE FKTS LTN 
MV 


11342 


25243 


A 








LSGVWFLTPPPKGGGFPTP*NGGAPGFF 
PPPPFKNPPPEGKLGGPKKKKNFPPPGG 
EKLVFLKGAPPFFFFFFFFFFFFFFL 


11343 


25244 


A 


11441 


271 


416 


KRTLS TELQYMCCLL I KS KS I F K* KKKK " 
KKKKKKKKKKKKKKKKKNPPSLP 


11344 


25245 


A 


11442 


254 


392 


SVTGMT IMTSSCCYKI QKCTKI QKLARH 
GGTCL* SQLLRRLRQENS 


11345 


25246 


A 


11443 


390 


150 


LLKNQKTLGGPARNPPPLGGPNGGVPKA 
GNWGPPGPPGGTPFFFKNQKITPGGGGG 
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OCysteine, D-Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
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P * FPPLRGGGPGNPI.YPGGQGCH 


11346 


25247 


A 


11444 


122 


1 


PAPLFFFFFFETQS CSVA*AGVQWRNLG 
SLQPPPDDGTRL 


11347 


25248 


A 


11445 


272 


403 


RPKVDKTTKMGRNQSRKAENS KNQSAS S 
PPKDHNSSPAME*SWM 


11348 


25249 


A 


11446 


70 


397 


LCHCAPAWATERD PVLGEKKRTMMS YCH 
PPMRMVEKKI VGDI KC* *GYGETGFL.TH 
C*WECKMVQIL*NIV*QCLKNLNVHSPY 
EliAIVLLGIYPREMKTYIFTKKTSM 


1 1349 


25250 


S~ 


11447 


3 


401 


HASAHASAHASAYYRLCAYPKSS FLYPS 
S FLLWSNFPLN ILDTTIGPKATETLF I 
KHVMHDVISMQLKSKTFYNYFFLLSLNV 
FYFVXiS CCKRRKRS QLGMVSHI CNPSTL 
GGRGRWIT*GQEFETSLANMA 


11350 


25251 


A 


11448 


333 


1 


KSGCIPPKGQWLFFLQKTAKIGSPKPV 
S PPRGKGQFFPQGLRGFY* FLPPPPGVP 
E IFFFFF* DSSHS VTLSPRLEYSGMNTA 
HCSLHLPGSSNPASDSHIAGSTGTCHH 


11351 


25252 


A 


11449 


104 


402 


WCRPLLGGVSQLGYMGVMDPFKEAVCL 
FS ELKHCAGRTTAL FPJWRQGCL s LKNF 
LIjPFFQIiCPAHRGGVL*RQ*ALLSCGGL 
HPVRASRPIiCLPTQA 


11352 


25253 


A 


11450 


198 


2 


KYKFMAGLGGSCL* SHHFARPRQQDGLS 
PGV*NQPGQHGETPSliQKKVSWAWWHAS 
VEAEAGEQP 


1 1353 


25254 


A 


11451 


171 


3 


TPPPRATS LFFFF * TESHSVS QECNGT I 
SAHCNLCLPGSSDS PASASRVAG I TGA 


11354 


25255 


A 


11452 


177 


461 


PMTGGKY I SNSTCS KPNSLS PHLPLKTQ 
TKNMEHFGRLTRGDCLSPGV*DQPGQRG 
KTLLLQKI SQALWDVTAHAWWPGLPTR 
ERRRSLS FPTG 


11355 


25256 


A 


11453 


247 


3 


EGECCGIESR* ILHLKGRICWHINYtiKI 
TKIKN* VI LKKMYKQKLRPGAVAHAYNP 
SALGG*GGRIARAQEI J ETRLGNIVR 


11356 


25257 


A 


11454 


328 


57 


EVHSNICATLSTHQMKILTKGTRHQHIS 
*GFF*EGEVLTLSTKLECSGVIMAHCN1j 
NIjGGSSDPIiALVSQVAETRGASYLIjLAI 
CPKVNK 


11357 


25258 


A 


11455 


70 


435 


ATRAKIiHIjKKKKEGRKKEFFPHWQGQPT 
PAtiVFPQPRTHFIFKVSIFSLKKNREQL 
P YTVQTQS L LPLNS HWGQVA * AGVQ WRN 
LRFSCLSLPSSWDY 


11358 


25259 


A 


11456 


112 


401 


KCWDYRRERPCPAPBI CI F*LTFFFLGK 
KLNQSFFFFFFLKGSLVLPPRLKGGGAF 
LFN*NLTLQGKNNSPASPS*RAETLGLG 
PHAKLWFVFLKK 


11359 


25260 


A 


11457 


198 


3 


E KTS VKQ S GDSNKKAWLEMKTNKKS IiKR 
*GWAHACNPSTLGGRGGWIS*GREFET 
SWTNMEKP 


11360 


25261 


A 


11458 


179 


1 


FFHMYNHIVCEYTITLSVNINSQGPGPR 
AHAYNPSTSEDRWI P *AQEFKTSLANMV 
KPH 


11361 


25262 


A 


11459 


210 


5 


RLiIHYFFKNNtiPSGGGSTA*GQEFEIRL 
DNTVRPHFLKK* KI SQVW WCMVVVS ATQ 
EVEVGNGLS PGV 


11362 


25263 


"A 


11460 


139 


2 


RIIiALSKLFFFFFKQESHFFLPMLiECSG 
VISAHCNI^IiLGSSDLS 


11363 


25264 


A 


11461 


388 


272 


S * KKKKRGGRFKGSMFLSPAGQGNP FFM 
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amino acid 

residue of 
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Predict- 
ed end 
nude- 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GPLKSISLAW*QRREGKNLGAPHITRIj 
GAHPLPAGWHKTQGAE F 


11364 


25265 


A 


11462 


2 


387 


CVWKSNNKKAP P CVAALR FRWKDDMDEM 
APGKHWGGLTSKVQSGGLMGRYTGPLTK 
TENS ILLKNPYFEAGHSGSRL*HQHLGR 
PRWEDYLRPGGQGCSEL * S SHCTFT* VT 
EGDPVS KKIiKRT I FGG 


11365 


25266 


A 


11463 


325 


398 


SLPTWAVCAPRPSG*GSGIPELKTMLAG 
VILEDYIjDIKWFGAKVVGIiSCTLATGST 
LFLGKVGPFVHLSVMIAAYLGRVRTTTI 
GEPENKSKQNEM 


11366 


25267 


A 


11464 


174 


2 


YLEILFFFETGSRSITQAGVQWSHPGSL 
QT PP P C PSNS P ASA* VAAHAS ADAWADA 
W 


11367 


25268 


A 


11465 


105 


411 


TDVELLLMDE * RKWFLELETTPGEDDVN 
IVKMTTKDFGYS INLVDKAVSEFKRIHS 
YFERGS AVGKVLSHS I VC* GE I FHERKS 
PLMQQISLLTFILFYFSE 


11368 


25269 


A 


11466 


291 


404 


SQEGVKIKRQGTVAHTCDPSTLGG * GGR 
IT*GQEFKTS 


11369 ; 


25270 


A 


11467 


64 


236 


THASGLDLiLTS * SSCLGLPKCWDYRHEP 
PRPARFSS F YSGS LLN YLAKI IKRDAFC 
I 


11370 


25271 


A 


11468 


278 


416 


NPQ I S CLKQHVWGAVAH ICNPSTSGG * G 
GRIT*GQELKTILVNMVK 


11371 


25272 


A 


11469 


178 


325 


RPEVKDQGASCRDQPHRVGGIjLPVCSDE 
RVPQKKKKKKKKKKKKKKKKKSKKKKKK 
KKASSGGARF* KKKKKKKKKKKKKKKKK 


11372 


25273 


A 


11470 


264 


60 


DINHQSSWLSGRWTPLSWVSISQREDRR 
GFSASIYIYIYIYIYIYIYPPIHIYIHT 
YT * S YHFNQNT 


11373 


25274 


A 


11471 


95 


399 


DFPHIiLLFLPLSQFCTACCPVPENRCFM 
YFFLIFSALCWEMNSRISHSNTARVEQC 
CKVFKIIiQMKSVLGQVQWLTPVI PALWE 
AEAS * LPE IRSRNQPGQ 


11374 


25275 


A 


11472 


139 


2 


NHVSTEKLSLHTMAHTCNPSTAGGQGGQ 
IT *GQEFKTSLATMAKP 


11375 


25276 


A 


11473 


179 


3 


RARWNPQVFC*GFLPVFPKILPIWPLRK 
GVPFPPNFFFFFLRVSLCHPHWSAVARS 
RL 


11376 


25277 


A 


11474 


282 


12 


KYIRNE*IVLFVIYFCRNRISWCCPGWS 
* TPEL KRSS CLCLP S CWD YRHE PLCPAE 
MNKLDRLNKMDTTKWIQQKKKNHRTQRQ 
DNKII 


11377 


25278 


A 


11475 


145 


3 


VCMQLRTDFQPGAVAHACNPSTLRC* GG 
WIT*GREFKTGLNNMEQPC 


11378 


25279 


A 


11476 


61 


397 


PQTPRLKQSYHVSLPSSWDYSWIYHERL 
VNPGRETDWATCYSGGNIQLQENKLNTP 
TDSTLWVSLCCPRLECSSAIPADCSLNL 
PG*SDPPTSTSHVAETTGACHYAQLIFG 


11379 


25280 


A 


11477 


156 


3 


FIFLKSSQNSYI * KCWPGAVAHACNLDT 
LGG *GGWIT* GHRFKPTRRPRV 


11380 


25281 


A 


11478 


224 


383 


CWVCLIHRAFLAPRPVLGCQ* QKKKKKK 
KNKKKKKKKKKKDSRGGG 


11381 


25282 


A 


11479 


339 


58 


NFKNKLCFCASGYLDSKDSQHYI IFFIR 
WSLALLPRLRWBDCIiNPGGGGCSEPRSY 
HRTPAWVXE*DPVS KK* IK* NKI K* KQQ 
DWLRLHDKIF 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


11382 


25283 


A 


11480 


244 


407 


SFGKKIGFLFGPKKFNGGSYGNYYKINR 
RFFFP I LGEGFS PHVYF *NLETRTTGGG 
LLETTKTTFNLLGRKLGS FLGPRSLTAA 
PTGTTIKLTGGFFFQFWGKVFPRMYIFW 
KDEWKKGF 


11383 


25284 


A 


11482 


308 


30 


SPQKNPPTYSGFKKNLSASPGQKAKPRF 
FLKNQFLPGIMGGGRQFPLLRRVRPPNC 
FNFGGKRCN * QKLGPCP PGG P KETVS KK 
KKKRERQQ 


11384 


25285 


A 


11483 


226 


407 


NHWNKK* FW PKVAHT YNLS ALGGQGGR I 
T*GQEFETSPSNMRPHAS 


11385 


25286 


A 


11484 


10 


411 


QPPIYFLiSLRISLFWVFHINGIIQYVAI 
CVWLVS CHNFSRF I CVAACDQDF IVFLW 
LNNI PLCGYNI LC IHS I CPNWRTTGLFP 
LIiAI 1 1 KL*TFVKKKKKKKKKKKKKKKK 
KR 


11386 


25287 


A 


11485 


184 


1 


TDEELLPMDEQRKWFLEKKPTSGEDAVN 
I FEMTRKDI E C YVH * VDKAAAMYER I DS 
NFLSN 


11387 


25288 


A 


11486 


435 


222 


AKNLNRHFSKKVI*MASRHIKRCSISLT 
IRET* I KS I I R * HVIPGQMTF I QKTGDM 
PGSVSPLKSHHLKli 


11388 


25289 


A 


11487 


317 


408 


FRPGTVAHACNPSTLGGQGGWIT*GKEF 
KT 


11389 


25290 


A 


11488 


340 


469 


GKGDTKCHLWLGTWHACDPNTLGGQGR 
RIA*GQEFKTSLSNM 


11390 


25291 


A 


11489 


38 


389 


KRPTSLKKSNFCSLQGFFFFGIiSQKPGK 
SNRS PGKKKNP FFSGGKAP * KGF * KI FL 
SPGLGGSPQKPHLLGGLGEKNYLTLEKG 
GCRDPK* CPCFPPWAGEGNPFSKKKKQK 
FPFFG 


11391 


25292 


A 


11490 


235 


2 


FVLM WIjMPKTjH I P VTEFRRL FAFPE Fit * 
FLFCFVLSYFVIiLETGSHYVTYAECSGA 
ISVHCSLNIjPGSSDPPTSVLV 


11392 


25293 


A 


11491 


25 


417 


GTLCLRIGFINISHHCMILKENSLFFFF 
WETKFYFAPQADGRGPNLG*WNPPSPG* 
RGSPGPTSRKRGNGGGGPPGPVI FGFLR 
KNGVPPGGLKGPKSLAPGIGPPGPPKDR 
GLRGGPPAPGtiLKKNPKVW 


11393 


25294 


A 


11492 


234 


438 


MWI I AQQCDCTECHY*LKMAKMINFML 
FIFYHTHTHTHTHTHTHTHTRERGPSYI 
YGKVS FKLTHL 


11394 


25295 


A 


11493 


344 


477 


CFTMFIS* IKT*KFNISQLVAHTCNPSI 
LGGQGGRIT*GQEFKT 


11395 


25296 


A 


11494 


35 


482 


GIPGFHHVGQNGIiDLI J TS*S IHL.GLPKC 
WHYRPG P PRPAI S I TPST * AYLLGNQKI 
GVTHFIAI LiAL FFSGTKAM ISP KYVC 1 1 
FYXiLVEEHXiGCFYFLiAINTATCVCVYKL 
FDTYMYQVESHATHEFISYYFFKILCSL 
SIRLYAVLL 


11396 


25297 


A 


11495 


307 


484 


LSPSERHELKCVLWPGAVAHAYNQSTVG 
GRGRQIT*GPEF 


11397 


25298 


A 


11496 


487 


349 


FLIKTEFHPIGQEGFDFLTL* FARLGLP 
KGWD YKGE PLRLAFWHPF 


11398 


25299 


A 


11497 


224 


417 


ASGASALKVELCSFSRYKIYPRHKRRYA 
RTDGKVF* FLSAKCEG I TFFLS FFFFFL 
EVLFCSPG 


11399 


25300 


A ■ 


11498 


17 | 411 


KRLVFGGRGRTDRFISQYV*AEYVSINIi 
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amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

iocation 
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ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 

nucleotide insertion 














LI FIjTKKKKKKKKKKKKKKRGGAIjKKKP 
WGGQNKTGEKKKNFFFLRGGKKKPWGDF 
KKKNFFWGGENWGKPPPKKKTPWGKKKI 
FKGKGGKTPPYFCFLKKMG 


11400 


25301 


A 


11499 


117 


370 


VSESADLAWSLGICFFIEFPSDADVAGL 
ETAFFLGGWTGSCSWQPGV*WCNHSSL 
QPRPSG PRQS SCLSLPSS WD YKYMP P CL 


11401 


25302 


A 


11500 


244 


1 


EELFECYSCSPTFSSDPLTTPLLILTT* 
LLPLTIMASQRHLSSEPLSRKKLYLS I L 
ISLQISLIITFTATELIIFYIFFET 


11402 


25303 


A 


11501 


130 


55 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLKKKKKKGRP F * RDNSAI LF 


11403 


25304 


A 


11502 


186 


421 


SFRLKIAGI IGVHHCTWLWLI LNLVLL 
K*ITFALKIMMFSKYSDIGQAQGIiMPVI 
PALWEADAGGSLQPRSLSLGNK 














GELLFLYIFA* WLFLLPKLLKFVTVCLC 
ELQFNERFIMVLCI I I ILGVGGFFLFCF 
LL * FCFDFFFFFWRG 














AuvQWPRSWLRLP*IiFFTQVTjMIFFPPF 
YHQNFFFP*KGFFFLGGLSHFFPPPNKG 
FFFNYPQGFFFSPSIiKKKFFFFSSPIFL 
APPGIFL*GPPRFFFFFFFFFF*EGVSL 
CHLGWSAVAAI LAP P AG FTPS S CLSLPS 
S 


11406 


25307 


A 


11505 


245 


440 


WAPCIG*QLLKDPQVLFAGYKVPHPLEH 
Kill RVQTTPD YS PQEAFTNAI TDL I S E 
LSLLEERFR 










256 




GKLM*GHltVGQAGLkLLTSSDPPALASR ' 
SAGI TGVNRHAP PRLNVTLYTKGQT I I L 
YHNQYNQ 


11408 


25309 


A 


11507 


1 


149 


GCMWPRTVLFAFQHKFVNFLKTLGDFGQ 
VQWLMRVI P ALWEAEAGKLPE ARRWR P P 

LMRVI PALWEAEAGKLPE 


11409 


25310 


A 


11508 


316 


420 


YT Y I YE PG * VQ WLTPVI PAL WEAKAGG S 
LEVRSLR 














GQEFETSLANMMKPHL 


11411 


25312 


A 


1 1510 


167 


420 


HTHTHTHTHTRACI *KVRVRALFSPKKK 
PLFCLSVSQIYVGRCLLKRLLFIFLYRH 


1 1412 












x jc 10 vah xiiLir J-AXriv.LiAj.or r r r Kit 
EVLFLPPRREGGGAF*VN*NLCFWG*GN 
FPA* PSLKKGITGALYTPGLFFVFLKKT 
GFRHVGQAWLDFL 


11413 


25314 


A 


11512 


121 


1 


PPVKFWAPFFFFF* IGSPSVAQAGVQWN 
NLSSLQSPPSCL 














VLFILSGFFGPINYPTSPPPQLPFPASG 
NHPSTLYLHEFNCFYFYFLFIYFF*DRV 
SLCRPGWSAVARSOLLVPSDS 


11415 


25316 


A 


11514 


434 


2 


RPPFFPVWGSPPPFFFFFGEGIWGQEER 
KGAS S PP WSRG I LRGGGPFGGPGAPPT 
PLKKPPLFFFLAPPFLGVFPRFFSFPTV 
FFLPFFVCPKKREALLGVPKGPRPFFFR 
SFLFFFSGD*VSLCRPGWSAVARSQLTE 
SLAS 


11416 


25317 


A 


11515 


247 


390 


QGLRIYSQDFFFFLETKSRFVAQAGEQG 
GNLG*LEPPPPS*SDFMASP . 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A la nine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyialanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q-Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


11417 


25318 


A 


11516 


269 


368 


FFFERES CS VTRAGVQWHRLS S L* PLPP 
AFRRF 


11418 


25319 


A 


11517 


256 


404 


NILKI FYLRI * KKKNQLGWAHACNLRT 
LGGRGGWITRGQ* FKTSKEKMVK 


11419 


25320 


A 


11518 


211 


404 


YIiCFTRCSVHSHLK*VFLHKCVT*MYRP 
HT*TGL * CMQ VHTHTHTHTHAHTHTHT P 
LSQGGDCI 


11420 


25321 


A 


11519 


41 


232 


ESVE PRRRRVPRAQ I TPLHCS LGDRARP 
CL*SQLLKRLRHENL»IjNPGGAGCREPRS 
HHCTAAWATEQDPV* KKKKSLNFKTVRA 
INLKMLGALISFHLFLWGIWF 


11421 


25322 


A 


11520 


146 


2 


LIDQQGVNQIiPRMECSGAITAHCCLELP 
GSSDPLTSAS*VAGRPPTRP 


11422 


25323 


A 


11521 


297 


437 


TDHFWNR I LSSCGDTFRDTWLG WPHAY 
NPSTLGGRARRIT*AQEFE 


11423 


25324 


A 


11522 


244 


460 


NCATITTNSRIFLQLQKKHCTKWLSLS I 
SWQSLISF* * I CFVLFFETNLAVSLRLE 
CSGAIIoAHCNLCLLSS 


11424 


25325 


A 


11523 


222 


1 


ESKDIiDIFRCQLAPRNSSCFVLFCFVFP 
LEIGS CS VAQAGAQWCNHGS LQP PPP I A 
SAS * VAGTTNVRH YVQLF 


11425 


25326 


A 


11524 


308 


424 


ITFLTMFKI YLQFDPAI PLLGI YPKDY * 
SCCYKDTCTRM 


11426 


25327 


A 


11525 


98 


2 


NIGGRPGAVAHAYNPSTLGGRGGQIT*G 
QEFE 


11427 


25328 


A 


11526 


113 


399 


LDRFLTLLPRLRREDCLSPGGQGCSEP* 
SHRCVPAWATE*DI>IiSKKKKKGVLEKPF 
LPPPS VGKP PFS P PERLFFF I LRP PLGG 
VFPPPSKKIIS 


11428 


25329 


A 


11527 


280 


3 


PKYCQI *VLiQCIAQLFKIiSIYNTYTHTH 
THSLS LS LS LP P P KVNKWLMS S EFTLT F 
KMYTFGPGRVTHTCNPSTLGG*GGWIT* 
GQEFETNIi 


11429 


25330 


A 


11528 


284 


2 


1 FWGPQKKNFSFPPPGLKIGSFKRAPPFF 
FFFFETKSCSVTQAGVRWCDLGSIiQPLA 
WATERD* LNK* I KGMS KKLHT * VCMFRT 
RGRTRGRTRG 


11430 


25331 


A 


11529 


134 


2 


DTLLPRbECNGTITAHCSLKLPGSGDPP 
ASAS * IAGTTGMCSHA 


11431 


25332 


A 


11530 


104 


1 


GRGFFFFFFFFF* EVES CSVAQAGVQWL 
NLGSIQ 


11432 


25333 


A 


11531 


157 


373 


CVFYTYS TSQLRCY I FIGNTRS VFRCHE 
I YT * KNR FWASTVAHTCNFS TLGSRGKW 
II*G*EFMTSLANWR 


11433 


25334 


A 


11532 


237 


389 


ICKCSKTGFG*DFQEFETSLGNIVRPCF 
YRKIPCASQV*WHAPIVLATWEAE 


11434 


25335 


A 


11533 


259 


122 


HISPS VQRHFFLFYF* TKPHLVAQAGVQ 
WCNLSSLQSPPPGFKRF 


11435 


25336 


A 


11534 


164 


1 


SSWSQALILLFLKKTCFWPGAVAHACN 
PCTLGGRGERIT*GQKFETSXASINIi 


11436 


25337 


A 


11535 


106 


310 


GGDKGTVWDTLHFFDGQKRLDHNKDHWM 
TIHAELPHELSA*CYAFKKECISGIGSI 
* AEKEYKI QFDG 


11437 


25338 


A 


11536 


167 


378 


AS FFGTT I TYKTILVYVYMFEKF 1 1 YIH 
MYTF*RQGLVLSKLECSNVI IGHCNLKH 
LGSSNLPTSASRVA 


11438 


25339 


A 


11537 


101 


396 


VNNLKQRIGLiSKRGEKKLLVLI SGFGGG 
FPGAGW I KKGKP P PGFPQGIjGP * KRGPP 
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nucleotide 

location 

correspond 
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residue of 
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sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Aianine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GKKGNFNKMGFNPKTLVFKNQNFGPGGA 
KPGTPGVRGPLGDL 


11439 


25340 


A 


11538 


18 


361 


QIGPRLY* DQNGETPSLLKI QKKKI PGF 
GGGPLKSQLLGGLGWENH*NPGGGGCRD 
PK*PQGLPPWGTNQNFVKKKEGGREKGK 
EGRKEGGGEKKKTL PKG I G ALNLGGL L T 
SP 


11440 


25341 


A 


11539 


134 


365 


PGFQNKKTGGGGPFRPKTKTNFKVF* KY 
LKICFGFWPKRPPPPGFFILETRSLFFI 
FFLGFFF* ETKSRS VTRLECSGVILAHC 
NPRLVGSSHYP 


11441 


25342 


A 


11540 


2 


318 


TTERFIYRITFGPKETEVNDLPGAPSEP 
VAALRLEARSFILKL*SFSLVFDFFFLP 
F*KTGSCLSPRLECRGEIITVHCSRELP 
S*SHPPTSASQSSGSGGFHLG 


11442 


25343 


A 


11541 


294 


409 


GTISFFRTCRLGMVAHTWNPSTLGGQGG 
*II*GQEFQT 


11443 


25344 


A 


11542 


1 


123 


GKQRQE PALS YDHATVLLTS * SAHLGLP 
KCWDYRSDPPCLA 


11444 


25345 


A 


11543 


69 


350 


VGAKHWIHMDTKIETADPGNYYSGERGK 
RPRAEKLPVCYYALYLGNGI ICNPKPPC 
CWKGHDFIYIYVCGHIYIHIYICTHIYI 
YFF*DGGLLC 


11445 


25346 


A 


11544 


103 


340 


LFCAFFLS PSKLDFYHIjSFYV* TLFLKD 
LISLAFFFLFWEAGS CSVTQAGVQGPNS 
THCNLCLPGSNNPSTSASQAAGT 


11446 


25347 


A 


11545 


256 


354 


P YP FGKPKRGDHWGLGV *NQ PGQQGETP 
SEiLKI 


11447 


25348 


A 


11546 


196 


335 


EIVARLKRSPVGRAQWLMPVI PALWKAK 
ESGSPEVRSS+VKKITCRPGTVAHACNP 
STLEGQGKWIT*GQEFKTSIiANKVK 


11448 


25349 


A 


11547 


202 


377 


TIHTKGVMVGNFILWFYQNFFKTFF*D 
KILGWARWPMPL IPAFWES EMGGLLE PK 
SD 


11449 


25350 


A 


11548 


396 


26 


PPRIKKEKRSPNGAFYPFPCWNF*KKKN 
LSLAKRFPRDSVFFFVSW*GSYLPTTFL 
QLIiRWYVLTSHIAVS FKHLCE PHKNFTL 
QHS I PYLYI IE I EVYS YFINRNT I YKIN 
MDFS I VGFVFL 


11450 


25351 


A 


11549 


232 


371 


RE VP FEN I KI GQ VQWLTL 1 1 PALWE VKA 
GGSLEARSL* KYQNWPGAVAHAYNPSTL 
GGQGGR IA* GQKFE IS PANMAKP 


11451 


25352 


A 


11550 


174 


381 


NRDEGFRYADRAEVQRLLTGTILEHCGL 
QRPGIi* RSS CFTLPCGWDYRH * S PSHVA 
GTTG I SHHTRI, I KKAFFSQTGVS S C 


11452 


25353 


A 


11551 


262 


359 


DYSNLGLVWWLTTVI PALWE AKAGRSSE 
VRSS * PALWEAKAGRS SEVRSS 


11453 


25354 


A 


11552 


70 


380 


LHKCSFIYINHCNNPIYGLISLYPKQLF 
S YENFKDFCNKI * P * I PNFELKTSTP* L 
EKL*NWAMARHGGSRL* SQHFGRPRRAD 
HMRSGVPGQPGQRGESPST 


11454 


25355 


A 


11553 


319 


56 


TFFLGPPGFF*KIFFSVGPPLIFFLPKK 
FIGEQKKNAPGKNFFFGFLFSLFFFFLL 
TLSSALECSGAMSAHDNLCLPGSSNSPA 
SAF 


11455 


25356 


A 


11554 


230 


399 


MMGEI IKQANIISVCQIIRRLRHESHLI 
PGGRG YSELRSYHCTSTWVTE * DSCLKK 


11454 


25357 


A 


11555 


376 


396 


F*IFWRDGVSPCCPGWSQILRLQQSTCL " 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, {^Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GLPKCWD YRP PHPATAAS CP YFYHYYLL 
KIWRYSFDLSCLWNLWWGIAFG 


11457 


25358 


A 


11556 


176 


384 


KAWPWVRNNYR*NFFFQCCLNNFRKRLK 
CFRCGADKFGKTGFSCLQYALKPTNWGR 
EQWLTVNPTNFGK 


11458 


25359 


A 


11557 


295 


391 


PYKLL I THTHTHTHTHSRTHT I * HTTLS 
LYTF 


11459 


25360 


A 


11558 


1210 


1753 


NT WCMERIAVAFGS fnrns plrgwalwl 
TP 1 1 IiALWEAEAGG PLEPQE FETCLGNM 
VKHCL.YKNLKNVSGV* DLPGQHGETLSL 
QKFKKCVRRGGTCL*SQRLRRLSWEDCL 
SQGGQGCSEP*SHQCVPAWVTELDPDSK 
KTKTKKFFGLPSPFL.FVCFLTHTYVNKE 
YAFVLiAEEAS GKTTS KLTMVTS RNGLiGK 
TKKFFGL 


11460 


25361 


A 


11559 


172 


3 


KOKFFFPPPSHQ I KIFGGGAPEMPFFFF 
FFEKESRFVA*AGVQWRNLSLLHPPPHP 


11461 


25362 


A 


11560 


143 


1 


GQAQWFMPVIPVLWEAKAG*SLEARSSR 
PVCATATPPQLI PK* YEGL 


11462 


25363 


A 


11561 


84 


2 


KPLDTLGGQGGC IT* GQE FETSLANMV 


11463 


25364 


A 


11562 


12 


387 


QGILLPCFSMSSEERGRIYSNTFSFFFF 
FGKGGQINPQGGGQATEPRLREPSPSGL 
TLQGPGNGGP PPPPGQNFFFKKKGGYPG 
GAGGVLNPGP KGNTPPGPPKARKKRGG P 
PGPGQKFL*NPKG 


11464 


25365 


A 


11563 


230 


375 


RAKGKLXjSESIiIiS I QS FKKPSVHTCNPS 
TLGG * GRWI T * GLE FTTSLA 


11465 


25366 


A 


11564 


287 


55 


PPDGRVQWLTPWPGLWEAKEGGSL.KSR 
SS* SQINPVGSVPYSLRVLR 


11466 


25367 


A 


11565 


115 


411 


FFFFFFFLGENSFFSFRGGPRGGGHI fw 
NFPPPG*GVFPPPPPPGGGNKGGAPPPG 
FFFFFP*KTGFPPLGPGGPPPPLPGPPP 
FPPPKGYNFRRDPPP 


11467 


25368 


A 


11566 


139 


356 


CLL I GW YNTGCI TQGSTRKIGTTFFLRQ 
SSSVAQAGVKCSGMILCYRNLRLPGSSN 
LPDSAS * VAGTTATYP 


11468 


25369 


A 


11567 


101 


410 


FLFFFFFFFLKKI FFFPPRGGGGGGN* K 
KQTP PLGGKKNFPP P PPKRGGGGTPPPP 
PLLFLVF*GEGGFPGGAGGGKKTPKGNP 
PPPPPKGGEMGRGPPPRGG 


11469 


25370 


A 


11568 


417 


56 


EDCLRTGVRDLLPNPHPNIiQIAGRDGAR 
L*S QLQEHHLS PGFRGYSEL * SCHCS PA 
WATE * AYRKQNKTKQNYRS S ALLHL YT P 
MATAW I KI AFLRAGD I FLSVLS FLPNS Y 
PLFSHPSQ 


11470 


25371 


A 


11569 


301 


443 


EDTT F VCFEVES CS VAQATDS PAS AS * I 
AG I I G VCHHARL I FVFLVE 


11471 


25372 


A 


11570 


44 


408 


RGRGHG PHS P WRRRRRXiR WEGRKATE S 
KRRAYKYPIiPALLCSHQWKPNEWLDSQ I 
KIPTGL* *UC*PSFIFIFIYLFWKQGLS 
LAPMIiE CRGAVLAH* HUILPRS SLSHAS 
ASQVAVTTG 


11472 


25373 


A 


11571 


201 


1 


AERP VTGPPVYAEPGLQGRQG* KNMFKN 
MQHPRWADHLRSGVQDQPGQYGETPVST 
KNTKI TQAQC I 


11473 


25374 


A 


11572 


343 


1 


FGLLFAFTLEYKLYESINHFAHCCVCRF 
SSGTIiHVWCNRHPNKXiDEWNSGGGGRQ 
DLTIjTPRIjGCSGVIMAHCNIjVLLiGL>SNP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E-Glutamic Acid, ^Phenylalanine, 
G— Glycine, H— Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VSAS * VAGriGAHSHAWIDKLFFVKTQP " 
LY 


11474 


25375 


A 


11573 


209 


3 


VGPPPKSL.PGGFPFFFCPKCFPRLKKKK 
KKPGLVAYTCNLNTSGVRGKKIT*AQEF 
KKSPSNIVKPWY 


11475 


25376 


A 


11574 


312 


430 


QRYARPGPWHCYNSSTVGGRSRGIV*A 
QEFATSL.CNMT 


11476 


25377 


A 


11575 


125 


3 


NSLGPWHACNPRS1>GG*GGRITSAQEF 
ETSLGKIRRPLY 


11477 


25378 


A 


11576 


190 


362 


CVNNYID* EKKKKKKKREREKKKKKKKK 
KKKKKK* GGGAPKKKKKFTPGGGKNFFF 
L 


11478 


25379 


A 


11577 


231 


340 


KRITRVNQDGLDLLTS *SAPLGLPECWD 
YTMKPCLY 


11479 


25380 


A 


11578 


60 


3 


RMQVSV* I FTPMSLQAYLI YSSSVAAGA 
QSG I EECKYQFAWDRWNCPERALQ 


11480 


25381 


A 


11579 


285 


406 


KKGNWPGAMAHTCNPSTLRGRGGWIA*G 
QEFETNVGDMAK 


11481 


25382 


A 


11580 


121 


1 


NbNCPGPVAHASNPNTIGG* crpiv*gq 
EFYTRIANTVKP 


11482 


25383 


A 


11581 


199 


384 


KKTPPGGIFFCPKEAGPAPPL*TPFC*T 
PLFFS PAPFKPKRALKFFLKPKRGFFQI 
PFFFF*DGVSLCHPGWSTVARSCPRV 


11483 


25384 


A 


11582 


677 


934 


YGSRHCICFLQAISEILFLKNPARHGGS 
QVQDQPGQQSKTLTLRKKSK* ARHNGS C 

HC 


11484 


25385 


A 


11583 






ISAHCN 


11485 


25386 


A 


11584 


349 




KPCFPKKKKKKVKGLNKHYSKEDIQMVN 
RYMKKCSTSLPRERKIKTIMRYHFPSIR 
KAI IRKI KDPKDQ* GCIEKGGS WRPMY 
L 


11486 


25387 


A 


11585 


163 


1 


TAVRI KHNSYTLTPCLRHSRYLINVTCW 
PGPVAHACNPSTLGGRGRWIT*GQEF 


11487 


25388 


A 


11586 


192 




S KSKYHX YGDSALQKSTVYKW ITHFKKG 
* DDVKDKAHSTRLSMS I CEEKE 


11488 


25389 


A 


11587 


30 


374 


GWSPEDLFPGSLPPALKGFSGEGG*PFL 
P FQKKRGGGGGSS P PFPVLKRVRPENRL 
YPGGGGFR* PKQGPCPSAWGAEPDSIiSK 
KKKGFGQKKKKKKVLKEKT 


11489 


25390 


A 


11588 


66 


245 


S LCFFVFFVFF* TE S CS VTQVGV* W YNL 
GSLQPPPPKKKKNI KKQKKKINWGRGFK 
RRL 


11490 


25391 


A 


11589 


243 


402 


FLHLKKKKKKK KKKKKK KKKKN* KDAG 


11491 


25392 


A 


11590 


205 


3 


IRLLG I I KNPTPAHCQ * K* KTGEPLW* V 
VW * F FRRLHTEIiS YDPAI PLLC I NSTEV 
WLAPGIPPRFR 


11492 


25393 


A 


11591 


404 


152 


MGFHHVSQDGLDLLTS * STLLSLPKCWD 
* CWDYTRE P LCPGHFAI SFNKWLNTTT 
WLYFYLI ILHCIYCPYKLLFLFFFYIY 


11493 


25394 


A 


11592 


311 


386 


TVYPPLAGNLAHAGA*AGIVGTALSILI 
RAELGQPGALIjGDDQIYNVIVTAHAFVI 
I FFIAI PI I IGGFGN* LVPLI IGAPDIA 
FPRINNISF*LLPPSFLLLLASSIVEAG 
AGTG*TVYP PLiAGNLAHAGASVDLT I FF 
LH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide - 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E-Giutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


11494 


25395 


A 


11593 


39 


397 


EGCPLFFFVFFFFFFFGGKKIFFFPPRG 
GGGAPHKIPGPPPPGGNFFFPPPPPLRG 
GKRGPPPPPGFFFFF* KKGGVSLFPRGG 
VFFFS FWRGGRGGFPPPPPQKKI FLKGP 
PPFYKKK ' 


11495 


25396 


A 


11594 


243 


46 


GILSCCFHDSLSLNSL*CD*VWISLCLS 
YLVS I ELLGQCGLI FFIKRES FWLLLLK 
MYLQPLS FLS 


11496 


25397 


A 


11595 


327 


2 


P RA I LR I QFLLWPMRP YVMW S QP CSLCF 
CHTAKTVPTSRPLL1I1GFFSI1PGIAHISM 
WLVLS YGLGFS SKVTLWPG WAHVCNPS 
TLGG * GGWIT * GWE FKTSLANMVK 


11497 


25398 


•A 


11596 


292 


384 


liGTVAHTCNPSTLGGQDGWIT* IQEFET 
SLR 


11498 


25399 


A 


11598 


125 


2 


PFQGQRSGGTRGPGAVAHTYNPSTLiEG* 
GGRIT*GQEFETS 


11499 


25400 


"a 


11599 


302 


408 


TGGWLGWAHACNPSILGGRGRQIT*GQ 
EliETRLA 


11500 


25401 


A 


11600 


409 


131 


GRSGGVPKSGVLKPPGPPGGTPFFFKKP 
KITRGGGQPPLFPLIjQRVRPENCLSLGS 
GGCH*TKLPPCPSTWGAQ*KLVSKKKKC 
EGGGRIYS 


11501 


25402 


A 


11601 


196 


330 


HHCMLAWVTGRPYLNIYICIYADIYIHI 
YICI YIYTHTRVYI YLLAI * IYI YVYMR 
IYIYIYIYVYIYTHTHGYIYIY 


11502 


25403 


A 


11602 


41 


252 


TPGRARGQRLGHSSTGLDLLTS*SACLS 
LPKCWDYRREPPRPAEDIKFMF*AILSB 
TIL.VRGVRSRATPS 


11503 


25404 


A 


11603 


271 


440 


KKKKRGGRFKESKFTSAGLQGNIFFIGP 
P KLI PRAAV* QRGDWKNLGVTQLNRIiCH 


11504 


25405 


A 


11604 


429 


2 


FSSFFAEKGCHCVSQAGFKIiLSSRDIiPT 
SAPHLGIRLDGRLRQEDPPSPGV*GCSE 
LQSRLCTPAWI TKRDPVS * KF KKQ KTKT 
HTH*RKPTKGQDHQYHCLPPPHVMPIjKV 
FRGSNVHGAVRTRG S RGS I PE LSHAVPH 
AP 


11505 


25406 


A . 


11605 


113 


2 


TEATHLVCTYKN* KLGWAWW IMPVI PAL 
WEAEVGRLL 


11506 


25407 


A 


11606 


301 


3 


WVMVCSVKNIALGHACGKFHYEGRGKPA 
EHPSWVLLVLiESLQKKtiFLiSYFLFTFLL 
QPPSPRLT * PVFRFFFFRDR IMLCCPGW 
SAWPERVGRAGNSE 


11507 


25408 


A 


11607 


117 


431 


SCDFCLSFSLSLFFFFFRKGIFFLGPRG 
ENKGGNKI KWTLGPGGKGNSRPS PPKGG 
GKRGGP PPGANLDF * KKTGFPQGGRGGE 
KPPPPGGRAGGAPQRGGKPAG 


11508 


25409 


A 


11608 


277 


88 


RRTTSSWLARNCIQGPAGYHTGS*TPDL 
KGSACLSFPKCWDYRWRLDL 


11509 


25410 


A 


11609 


200 


27 


VRLKPGDSRDPDMQWYYI * ENRPARCGG 
SLL*S*HFGRKRKADHLRLGVQDTRSSA 
R 


11510 


25411 


A 


11610 


17 


414 


TVWLVMQDPENEWIVSGWGGKEEPGRYF 
MCEARSSSMDRVFFFF*KKTLFFPPGGG 
GGAKI WVNG I PPPQGS PLS VAYPLGVWE 
KKGPPIHPHKKRVPPV* PGGVLFSEPGN 
PPPGPPKGVGKRSSPPSPVL 


11511 


25412 


A 


11611 


296 


428 


R I YEN WRGTVAHACN PSTLGGQGGW I T* 
GQSL.KTSTGTVAKP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon,/=possible 
nucleotide deletion, \— possible 
nucleotide insertion 


11512 


25413 


A 


11612 


20 


362 


NFVRGVARGSRGGATRCEWVGLRPVGMA 
YGRSPSWLL*VKPSAASHRPPLPRAADT 
PGTAPAPTPTPAPAAAPALTPSSGPGSA 

NS 


11513 


25414 


A 


11613 


158 


2 


LMLSRTPLQQL*YTLLELELPRLLAPDL 

XjSNGS S Ij kdlkvjthsnyras ketk 


11514 


25415 


A 


11614 


3 


410 


RSEVYGLVLTTWYQVNRVPWTHKAARFT 
RVSRWARPRSKISESCYWLPQAHRTKS * 
PLVRTLSGLCSLLGFHPLLFSLLATDLV 
PLPSFALLLLLELFFLPGMPKLAP*SPT 
KRSFSFYFFRDRVLLCCPGWSQTP 














iJwJLr JjVfciKJNQF.r FIjr r BTKPCC VAPTG 
VQGPNFG * LQAPPPGLPLFSALAAPEVG 
ITNGTPPPPRFFF 


11516 


25417 


A 


11616 


158 


1 


KGKPTPKTFFCQKQIFFLVFFF*TESHS 
VARAGVQWRDLDSLQAPPPGFTPR 








11617 


319 


425 


KVWLYSQAV* KVWLYSQAVAHTYNPNTL 
GGQGWRIA*AYEFKTSL 


11518 


25419 


A 


11618 


94 


2 


CLT IS WLGTVAHVCNPSTLGG * GGH I TR 
SR 




25420 




11619 


381 


9 


FFPL P L P LPT VS L F PRS P SDAE PKLD CT 
AAI S AHCNLPA* FS CLSLPSACNCRRAP 
PRLTASASRGAGI ADGVS FTQ CSMVPRL 
ECSGVISAHYNLHLPATSLGLPKCRDCS 
LCPDE I VDPR I 


11520 


25421 


A 


11620 


144 


431 


IVNVIVGKGEKGRKIPE*LIRCNGWN*S 
WDRWVAE DHVLRE PS ENRRLARVRCLT P 
VSQHFGRRRRLDHEVRSLRPACATWRNS 
VSTYNTKSGWAC 














V b'S LPLLGGAS QLG YLEVRD PLE EAVC P 
FSDLKPHAGRTTTLFKAIRQGHLSLQRF 
LLPFVWLCPAPRGGVYRGRQASLS*GGL 
HPVRASRP 


11522 


25423 


A 


11622 


316 


412 


GGPS AVAHACNPS ALGGRGGR I S *GQEF 


11523 


25424 


A 


11623 


183 


2 


PKQVYLRNGLLSHSKCNKIHHINRLKRE 
KIQPGWAHAYNANTLGGQGRWIT*GQE 
FENE 


11524 


25425 


A 


11624 


355 


168 


DLL I VCYDPNAINL * S LLLRRL KWED CL 
S PGG * G YSEPRS CHCS PAWAAE PDPVS K 
KDRNVEFIYCC 














TADVL FLGQAGRGN FWMKKNCR PGTVA 
HACNPSTLGGRGG* ITRGLEFK 


11526 


25427 


A 


11626 


412 


3 


I FFPPPKKDKRTRGGQKKNI PPAPPRNF 
LGRGGNFPPPQKPFFPPLF*IFPRPP*C 
KGSIPPFLFPLCFQKKKTPSFFFLPNGS 
PPICEKI FLFPKKGFLKFKKPRFLKKKG 
PFFFFFGNAVLLCQPGWSAVAQSQ 




25428 


A 








GGQGGWIT*SQVLRPP 


11528 


25429 


A 


11628 


213 


3 


LSGIYSGNRRLVRYSKSVSVIHHINRLK 
KKNHI I IS IDGEKPLEQIQNS FMT*KRN 
KLKWEDHLRPADA 


11529 


25430 


A 


11629 


152 


2 


FQKNRDLE P S KAG W PGMVAHACNPSTLG 
ARGGQ I T *GQEFETSMVNMVKP 


11530 


25431 | A 


11630 


422 


189 


PLSGFCPSTWGAKQRALSKKKKRSEAGH 
SGSCL*SQYLGRLKQKDHPSPGGRGFSE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 
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NO: in 
USSN • 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
£=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X==Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LLNCTPAWVAKQDPVPKKRIjK 


31531 


25432 


A 


11631 


239 


424 


KEDNWRVFVIGPLGGLNEIiAHDKAFKTQ 
NHHWLGVGAHAC^PSTLGGQGRWTT*GQ 
EFTTSli 


11532 


25433 


A 


1 1632 


343 


401 


AREREHKSAACQ*GMERSVIKHCHVATE 
* EREQKRERQQERASEKERQQESERDS K 
RERARESERERESTRAQLVRKIVNQIS 


11533 


25434 


A 


11633 


125 


371 


FCJGCAKICIiTNTIjLPYAHGYlilWRHDGK 
GCHEKMTAKI IHSCKLSPDAL* SQLGVP 
L * THTHTHTHTHTHTHTFS E KKGLVP 


11534 


25435 


A 


11634 


235 


433 


AALTSFSGPFSFCSYLINTIiGKNFPNHE 
I S TT * FTLQRL * KI I FWPGMVAHACN PS 
TFGGQGRWIT 


11535 


25436 


A 


11635 


93 


385 


FTLKSERRKHSVPQGKKQNKIKLLSPDC 
KLENKRFGWARWLTPVI PAL * EAKAGES 
PEGNGEREEATEPFGS I ILGFFFCTQIA 
HLHRSDYEPNEDS 


11536 


25437 


A 


11636 


190 


398 


NTKKSQKAFFKTQGKPPPLDPKKKASKK 
TKFFLPKKISTRKFRTGPWRGTNFSIiKK 
AKKNLGKTRKKIFP*RPPFPSISSLFWP 
FFFFI»FF*KDKVPLCHRGWSAWQSQLT 
ATSISQMY 


11537 


25438 


A 


11637 


190 


380 


TPKKWPKAFFKTKGKPSPFHPRKKALRK 
TKFFLPQKFP * NPIiSPLI PALFWPFFFF 
FFF* KDKVPLCHRGWSAWQSQLTATS I 
SQMY 


11538 


25439 


A 


11638 


261 


91 


EKTIiWAGHGGTCL * SQLLGRLRWENHLN 
LGGRG* RSRHCTPAWVTERDSVSNK* IN 
K 


11539 


25440 


A 


11639 


1 


334 


FRANRTVKDAHS IHGTNPQYLVEKI IRT 
RIYESKYWKEECFGLTAELWDKAMELR 
FVGGVYGGNIKPTPFLCLTLKMLQIQPE 
KDIIVEFIKNEDFK'VQCSLANIRGMY 


11540 


25441 


A 


1 1640 


193 


457 


DGII.LCGPGWSAVAPSQLCSGTrSAYC3T 
LCLPGSSESPVSAP*YLGLQACAHAPLM 
FVFLVYTGTCMLPLSAHHSRAANSSHFS 
YPIiL 


11541 


25442 


A 


11641 


257 


373 


NTISCFRSLRPVAHTCNPSTLGG*DGQI 
T*AQEFETCLG 


11542 


25443 


A 


11642 


211 


367 


DTTSHPFEWL* FKKSRK*RVLDRI WGN* 
MLVHCWWEHKMVQIiL*KTVWWLFK 


II 543 


25444 


A 


11643 


112 


367 


LQ I KT I R YHH I LTRMAK IQNTDTNKC YQ 
ECGATGTTiI HCWWE FKI I P VI P ALWEAE 
VGGLLE VRSSRPAWPTRENP I FTKI IKN 
Y*KYKNTPYI IR* LQIKTIRYHHILTRM 
AKIQNTDTNKCYQECGATGTLIHCWWEF 
KI I PVI PALWEAEVGGLLEVRSSRPAWP 
TRENP I FTKI IKNY 


11544 


25445 


A 


11644 


310 


335 


YHLIAGSLGDIERLSML* IGKAVCHNKY 
* *GCGEPGFLOiHCWWEWKMVQSL*KIVC 
QILQT 


11545 


25446 


A 


11645 


143 


455 


KRKGPSLWLLGKSMGGIRFF* KKVFFSG 
FKSLWFFFWPLWKPLERLN*IKISFYPP 
AKKKGGQRKAQFFFFFFPRSFSLAGCSG 
S ISAHCNLCLLGLSDS PTS 


11546 


25447 


A 


11646 


182 


458 


SRGGIRFF * KRVFFSVI KS I WFFFWPLL 
KPLGRVN* IKISFNPPAKKRGWER-KAQF 
FFFFLPRSFSLAGCSGSISAHCNLCLLG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IiSDSPTS 


11547 


25448 


A 


11647 


319 


3 


RSTKYSPRGWEPSMWFLFLHVFSLPFRS 
ILGPPLLGTPLRGSPPPPPPFEGKGFIY 
GLTFPFP * EPPCEI FFFFFLDRVLLYCP 
GWSAWQSAHCSLELKDKVCI 


11548 


25449 


A 


11648 


165 


2 


DQSGRPRKILCAHTHTHAHMHTS LGQT * 
QDPV*THTHTHTHTHTPLYAAAQRIN 


11549 


25450 


A 


11649 


290 


3 


SKQNTTHWATLKSYIYPSQFWRLGSPSS 
RWQLDGFLVRAL FLVCRQP P S LR I LTWW 
EKTEERKREREK*DRERERNEIERNERE 
RERERETQRMY 


11550 


25451 


A 


11650 


220 


2 


TPTLPKGTGLTALDRIPGRDPGPRRAGR 
PF*AGVAAENCPDPAEGGSTRPliGAPGR 
GSAPP * RHPGPAGGRR 


11551 


25452 


A 


11651 


29 


387 


FLIFDVTIVIDIiACHEPHSYKMAHLINV 
VCVLTAP PTSYFPIS FLGP P YS L RHND I 
EIRPINNPTRTSQCSSERKSHTSLTLNQ 
KLEMTTLSKDGMSKAKTG*EPGL* C+TV 
SQAVNAK 


11552 


25453 


A 


1 1652 


267 


2 


KFFFPVFSTSKLNFYPGTPFPLFLFSFF 
FFFFFFFMREGFAVTWVPGAVAPGAVLA 
HCSLELVGSGKPP I SAS *GAGTTGMHYH 
TRMY 


11553 


25454 


A 


11653 


50 


411 


EAVAGGMEKSMMNLPKGPDTLCFDKDEF 
MNEDFDVDHFVSDCRNRGQLEELRDNPQ 
liYYKLLKTAMVE L INKD YADFVNLS TDL 
VGMEQALNQLSVPI^QLREEVLSL^SFV 


11554 


25455 


A 


11654 


303 


403 


VRNMVEWLGVAAHACNPSTLGGQGEQIT 
*GQEF 


11555 


25456 


A 


11655 


356 


653 


SVSPCPFPGPASTLSFLVADFRRRGVDV 
SQVAWQSKGDTPSSCCI INNSNGNRTIV 
LHDT*GPRASPCYNPPISWLNQFPHSPP 
WASQLPPSVASVYQK 


11556 


25457 


A 


11656 


442 


2 


GRPILMRPAVL*SEPRTPACVDIQQQIM 
TI IDE WXACAKVQTLSAP INSASRMQS 
IRHWYILKDSSARPAGKGAIIGFIKVG 
YKKLFVLiDDREAHNEVEPLCILDFYIHE 
SVQRHGHGRELFQYMLQKERVEPHQLAI 
DRPSLV 


11557 


25458 


A 


11657 


109 


474 


D INMCGR WLVAERM * KRKRDKYN YEET 
E*VL* KKVRHSQLQKENVAQNAPWQNA 
AYIDQPSPAHVGQQGLS KLPSRPGDQGV 
EPQNLRTLQGRSVIRSATNTTLPHMLMS 
QRVIAPNPAQLQGQRGPPKHG I CRTTTP 
NMNPANV 


11558 


25459 


A 


11658 


342 


3 


ENQLKNGEHSKSQSASSPPNDHNTSLAR 
A* KWAGAEAE IALLTEAGFGRWV I MNFA 
ELKEHWTQCKEAKVHDKTTQELIAGFE 
RNITYLM*LKITTREPHIAITSINSRMA 


11559 


25460 


A 


11659 


204 


17 


STCTPLPTA*SQSRPHPPPTPTPPAPWR 
SSNGPLSRGSSSRNSTSSWS I ALATGPW 
THRSEL 


11560 


25461 


A 


11660 


505 


514 


GS*GNHFNPDGASHGGPQDSDRTGPRGP 
CLPSPWGAQILRDWYPQEEMGTLGKQ 


11561 


25462 


A 


11661 


80 


374 


FLFDI YNRCYL I F I NI I * YLSSNFD * I F 
I II YLVAIYRIiLCIiGIiFRAPRQCKHPRP 
QFSFNXIGTKR*AWPGAMAHTCNPNTLG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Ptienylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














GLGAQIM*GQEFET 


11562 


25463 


A 


11662 


142 


437 


GQRVQI FFFFWKRFPFFAQGGTEGAQFR 
LI KPS PSGLKE I PPPNPPRKGE * RDGPP 
FRGDFVFLRKNGVPPCGQGGS KPPAPGE 
PP PFTPPRGENNRG 


11563 


25464 


A 


11663 


120 


353 


QALQVMLKHVNFFFFLFFLFWKQSFVPI. 
PRIiE CNGATS VHCNLCLPG * RNFPASAQ 
EFRIAGSYDHAICPPASGKGE 


11564 


25465 


A 


11664 


1 


404 


NTWSDLiI ISYFHPRPSAFFLRQGLALS P 
RLEYRGTITGHYSLDLWGSSDSPASISR 
VAGTTYRCM* FQLLGRLRRENHLSPRGR 
GCSDP * LCPYTPAWVTERDPVS KKKLRG 
GGENSIiLSSQTMY 


11565 


25466 


A 


11665 


493 


3 


NIWI PDVLPGHLARQ*LGS PGKLS AGF * 
PLQFPQTLSSARAPPFCLPRLQVRNTVG 
FLPSPHTSRPSHCLPACPLPSRTQPWVP 
VKPGPTACRGFLQHPPTPGSPS P* SFQG 
SSWWDTDLRALECLLRILRGRNCSVFVT 
TQGQIjSHEGMNKEFLtiTCAKKKK 


11566 


25467 


A 


11666 


3 


401 


CMERAVTVriLPGSATQSPPVYAPRAIiAR 
LWLTAAMMISGFI AD YEA* SSRCSSACP 
AGDSLSYYHSPADSFSIMGSPVGAQDFC 
ADLGVSRANFIPTDTDMSTSPDLQWLVQ 
PALVYSVGPSETIAPHPLGVP 


11567 


25468 


A 


11667 


1 


446 


LSCAKPPQRPLRIIGI IKIFNVFEETRAN 
KHIiIjGVSKIiKPRGFPAIjFLSVSSFGQKK 
HVPQWLQHSHSHILVISIiDFFFFFFFEK 
KFPFFPPGGRERDQTPFN*TPPRGG*RE 
SPPFPSRGGGTQGWAPHCP I I FRFFQKK 
GVFPLRAR 


11568 


25469 


A 


11668 


18 


403 


DPLQR PQ YRGGLRS VG CDLQS PTPRGPS 
GRWPNPAQARGPGEAQAWAWPGGGPREA 
FSGQGRPPPLGDHPHRRKGAGPPGPMGP 
* HEGQGS C * GAS CRGGLACTAFPAASS P 
P IFGGLTPPNLNFPEE 


11569 


25470 


A 


11669 


157 


1 


PRPQG FVFVFF FFEMQ FGRLE CSGAI S A 
H*NLHFLGSSNSPASASHVYAPLY 


11570 


25471 


A 


11670 


396 


7 


EKVGQARDKSACEGHSCADARSCLHLRV 
DQGGQHGSHGACPGRRLjRGPDTHRQSCY 
QCRCNRS PDRLQPGRAGTP * CAEAAAEP 
DAAVHPGQPAPARLPPALPGRGLLPLPH 
PGGSQRVYCVRCIGSKLR 


11571 


25472 




11671 


209 


3 


DDLGTHRPGKSQVRVTPREAGENPSAVL 
PCYLHLVPFV*EDEVKPEDSIPDMPGNE 
YARE FIAHAPLY 


11572 


25473 


A 


11672 


270 


101 


LSLNRWILGAYI IFFETESHS VAQAGVR 
•HNLGLLQSLPPGSRRPIiAFKVAFLiFIG 


11573 


25474 


A 


11673 


421 


592 


SSCTPGVIIKIPCVRSNHMKLPGPGMVA 
HTCNPSTLGGQGGRITWSE* FKTRLGNM 
A 


11574 


25475 


A 


11674 


176 


6 


LTFVYMPCFCALSSLIMTYGLAM* FHFH 
SMTI.FI IGVIiSVAIiAI YQ*WLDVSRVSV 


11575 


25476 


A 


11675 


1 


418 


NTVCVCVCVC VCVCVERKAGHRLTDS FP 
IQAGPGAQARTLCLRESTGGALSSGL.PA 
SRYQDPGRGALHPHCVWTPRGWNWRLNY 
PPARSPSREAVCYFRQTLKKWRSPRGQG 
GR I LRVI RTAST FRDTS * tes aclgarp 
CLDGK*VCESVPCFSIjHTHTHTHTHTHT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 


N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine. V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X = Unknown, * = Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HCI 














S FMGRS PRK I DQ F CNS RNMVHGSVTFRD 
VAIDFSQEEWECLQPDQRTLYRDVMLEN 
YSHLI SLGSS I S KPDVITLLEQEKEPWV 
WS KETSRWYPGK* E * SRQGEAI IVPDS 
P CAQRGVTPLRCGLENFLQ 


11577 


25478 


A 


11677 


107 


397 


GGDGRETLRAPADDGC I * QERAETPLAI 
S C P RS S STCRGRARRYDG CRTRDGHQL I 
QRGQQQESSRRTAQPGSSKbCRGPGGHR 
PSLGTVGVHGCSL 


11578 


25479 


A 


11678 


514 


554 


RAKPTPP*GGGEGQNFGLIiKPPVSGIiPP 
LACPTPPKRWKPKPPPPAPPHFLFF* *K 
RGLIF* PGE VFNLRNWKLP PS PPQRGGN 
KGRNQ P PPRGTFMF 








11679 


266 


3 


GPPLFLGLKN*TFFSPTKV* IFFSQVFS 
SFPQGLKGGVREI FFFFFF* DRLECNGT 
ISAHCNLRLPGSSDSQASAGTTGVCHYT 


11580 


25481 


A 


11680 


1 


408 


NTCALRHRDHFS F WSDG KTKEGEER I VQ 
ENWliISWNNFSVPETLLTLSCPLGKEGM 
PGEDGTAGAGKVPGEDKI PG*DGTAGED 
GTEDENGTTGEDETAGVGKTPGAGGTTG 
EDDTESEDGTTGEDETAGKGGTAG 


11581 


25482 


A 


11681 


244 


420 


KDSESTKAYIRDNISSTKKKKSWPGAGA 
VKL 


11582 


25483 


A 


11682 


294 


443 


QMNTCTHIQPAYVH*EKYFRPSAMAHAb 
NPSTLGGRGGWIT*SQEFQTRP 












1 


YNTNQFTLiHGTQAS VYTCLTAii IVLKLL 
NQPYTLSAVLLNAGNQPFHAFPSPPSLH 
PDAFHSKIMSHII HjLIjGFIPFIjPASQS 
IiTLSPSIiERSGTISAQCNLCLPGSSNSP 
ISA 






— 






423 


SSHQAS S PPTTASHSMKI S VAAI PFFLL 
IT I TLGTKTES SS RGPYHPSECCFTYTT 
YKIPAQRIMDYYETNSQCFKPGIVFITK 
RGHSRWTNPK 


)158 j 


25486 




]16g - 


~163 


— 


IKIPCNRLLKFGPPSCCGFFFPIjLPPFF 
FFETLSRSVAQAGLQWRDLGSLIAL 


1 1586 












P IVFLGMIiFVAFAFRGP FESVTKPPNFF — 
FF FEMES CSVAQAGVQWRDLGS LQARP P 
RST 


11587 


25488 


A 


11687 


192 


1 


EQGFDGHIiASWISLKDTAKWKNGDELCS 
YYQS CCTD YTAECKPQVTRGDVFTMPED 
EYTVYDDG 


11588 


25489 


A 


11688 


89 


3 


SGGYCCCLCCRCSRYSCCCCRLREGPTK 










168 


2 


QDRLQ PHE S KRKLG PAPQLRRSESDTPS 
WNFQSTESQLMSKGDEDTKDDSKETV 


11590 


25491 


A 


11690 


301 


397 


HFVFSLVTGMNPLSPYLNVDPRYIjVQVR 


11591 


25492 


A 


11691 


18 


421 


TKALQITCYLHSTMSEESDGKLIEDSLI 
QLRCHFTWKLL I EAPE I PDLENR I WEE I 
QFLDTKYNVGIHNLIAYVKHLKGQNEEA 
IjVSLKKAEDLIQKEHANQADIRSLETWG 
NFAWVYYHMGRLAEAQTYLDKV 


11592 


25493 


A 


11692 


188 


3 


PLQGKGFGAKERGQKKGFLRGGVRASRP 
RPPFGPSLFFFFFFETDSHSVAQDGVQW 
CDLGS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methioninc, 
N=Asparagine, P=Proline, 
Q-Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=L)nknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11593 


25494 


A 


11693 


24 


391 


APRADAMGHFTEEDMAT I TSLWGQGNVE 
DAGGETLGRLLDVYPWTQRFFDSFGNLS 
SASAIMGNPKVKAHGKKGLTSLGDAIQH 
LDDLKGTFSQLSELHCDKLHVDPENFKP 
LGNGLQTALQ 


11594 


25495 


A 


11694 


138 


425 


NSGVKAAFQLNPGNPNLEGKPGPGFKTH 
PGQLGETHFLLKIHYLSGLVGKPWESQL 
LGNIiNRENH WT P ERGG PRDLRH CHG I Q P 
WATNGNFVLKKK 


11595 


25496 


A 


11695 


158 


838 


CGTVHSCDAGQRQRATAPSHPCDHGNQQ 
PILYRVLCCQLFWILCFVFSHS IMSTKC 
P S VS PVSGEMKKRKAI TLEMKLKI IAQH 
EGGKPVMAIARELGLWQSTI STILRDKK 
QISDAAKSSASVKSTVITKKRAGPIDDM 
E KLL.VMWMEDQ I OKR I PLSLLM I QAKAR 
SLFNMLKDRASDPTYTQMFKASHGWFQR 
FKRRHNFHNVKITGEAARAGNEGAIAFK 
EQL 


11596 


25497 


A 


11696 


834 


1431 


SSACQGGSQGWPPRQPWWMWGWSRRAPS 
VTHRTGERCSTSGSAPLPLLLSSLCSSL 
PRSWEHSTLRQPPHPPLLPPGCLPPGRE 
AQRLSGAGHGPGPRPAAASYIPDTQADR 
KQQQHHPGPDQHVGHREGLALEDAATHL 
GLVAALSCWLLGRAEAGYQVPHGHDHPK 
DQHPQADGGQRIVRAIGLGLGHHVSGRR 
ARP 


11597 


25498 


A 


11697 


143 


1 


ISKERGASRFSGPWVFFFFFLESRSVAQ 
AGVQWCNLGS LQAPP PGFT 


11598 


25499 


A 


11698 


58 


459 


KGKEEKVKRKEAEQNFSPYAQDKQERIK 
GNENDEKTKQG KET I IDIELFKGLDETG 
ENMDSTLTRTPFEPLENNKQ1LVLGLDG 
AGKTSVLHSLASNRVQHSVAPTLGFHAV 
CINTEYSHMEFLEIGGSKPFRS 


11599 


25500 


A 


11699 


215 


1 


GTKKALWAGGGGFFPYLPPMGPLGPICG 
GVGKGSPGIiGCPIFFFFFFFETESCSVA 
RLDAQWPDLGSLQSP 


11600 


25501 


A 


11700 


300 


420 


KPKILFGNVFAAPHMENLKCRGETVAKE 
ISEAMKVKAMC 


11601 


25502 


A 


11701 


351 


466 


RI KNADLSQAQWFTPVI PALWEAKVGRS 
LEVRSSRPVW 


11602 


25503 


A 


11702 


172 


400 


SNRLRNKIQG VFLNDSS ISPFI LRKQS I 
GQAWWLTPVI PALWKAEASGS PEMDSTT 
LLPS SS QVPS L VKMEKLNYS 


11603 


25504 


A 


11703 


284 


408 


ASVFSSFFFVCLFVFEMESRSFAQAGVL 
WRDLGSLQAPPPG 


11604- 


25505 


A 


11704 


20 


447 


LPGADYGGGHLSLRLFHLLLASAAWVPD 
ESQVTLNSAICVLSTVLIMEFPDLGKHC 
SEKTCKQLDFLPVKCDACKQDFCKDHFP 
YAAHKCP FAFQKDVHVPVCPLCNTPI P V 
KKGQ I PDWVGDHIDRDCDSHPGKKKE K 
I FA 


11605 


25506 


A 


11705 


1 


455 


HSCSLQFTPTAWDCTGSVSSEQGERPAA 
AMKICSLTLLSFLLLAAQVLLVEGKKKV 
KNGLHS KVRSEQ KDTLGNTQ I KQKSRPG 
NKGKFVTKDQANCRWAATEQEEGISLKD 
ECTQLDHEFSCVFAGNPTSCPKAQDERV 
YWKQVARKLlRS 


11606 


25507 


A 


11706 


1 


428 


DAEADKMAAAAVRGGRSGGSGGCSGAGG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine t D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G— Glycine, H=Histidine, l=lsoleucine, 
K=Lysine f L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W = Tryptophan, Y = Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ASNCGTGSGRSGLLDKWKIDDKPVKIDK 
WDGSAVKNSLDDSAKKVLLEKYKYVENF 
GLIDGRLTI CTI S CFFAI VAL I WDYMHP 
FPESKPVLALCVISYFVMMGILTIYTSY 
RE 


11607 


25508 


A 


11707 


] 


422 


rsqarssaaaaarasvplrgspgpsaim" 
pmfivntnvprasvpdgflseltqqlaq 
atgkppqyiaehwpdqlmafggsseps 

AIjCSLHS IGKIGGAQNRSYSKLLSGIaLiA 
ERLR I S PDRVYINYYDMNAANVGWNNST 


11608 


25509 


A 


11709 


88 


423 


AADAMKQAEEAMGAFMRLAYQELQIDRL 
KEDKMLLNIiEGNKREHAERLGMGLVSRS 
SVSHSVLSEMLVIEHETPVSAKSSRSQL 
DliFDDVGTFCSGPSKYTDNPFSLWESIG 


11609 


25510 


A 


11710 


393 


130 


NPS ATAQS FHIiAARKLSTLDRPGSQAPQ 
KPMPSPKPGWLPASTNAVPTPLAEATP 
SKAHPAISLLSTEEGI FKAVPSPASSCS 
FLHV 


11610 


25511 


A 


11711 


331 


440 


KIFFIiNFLISRAWWCAPLVIiATWEAEVG 
ESLYPRRS 


11611 


25512 


A 


1 1712 


122 


478 


SGLCPQQPPRANSCPPSSMASCAEPSEP 
SAPLPAGVPPLEDFEVLDGVEDAQGEEE 
EDEEEEEEDDLSELPPLEDMGQPPAEEA 
EQPGALARE FLAAMEPEPGPS PVPKEWL 
DILGNGL 


11612 


25513 


A 


11713 


102 


2 


TNLGNPRRPPPFFFFETVSLLAQAGVQW 
CDLGS 


11613 


25514 


A 


11714 


126 


2 


FFAPFLKI FFFFFFSEMESCS VLQAGVQ 
WHDLGS LQNEGSQ 


11614 


25515 


A 


11715 


178 


449 


LSGGNLFG 1 1 ILLYCAI IGGKLLGLiI KL 
PTLPPLPSLLGMLLAGILITNIPVINDN 
VQIMHRWS YSLRS I AIiAI ILVRAGLVLE 
SKALEK 


11615 


25516 


A 


11716 


48 


417 


GSGGNHSVCCDTMEGGGGSGDKTTGVLA 
GFFGAGEAGYSHADLAGVPLTGMNPLS P 
YLNVD PR YLVQDTD E FI LPTGANKTWGR 
FELAFFTIGGCCMTGAAFGAMNGliRLGL 
KETQNMAWSKP 


11616 


25517 


A 


11717 


103 


2 


PKSPTQWLMAVIPALWEAKAGGSRESRS 
SRPAL 


11617 


25518 


A 


11718 


1 


413 


WPSdQVLVGCLSFSLYCWKRSLKRKMPG 
FEVLLKIFLKNHPRCHTNRMIQLTATPV 
SAIADEPAHIRATGLIPFQMVSFQASLE 
DENGDMFYSQAHYRANE FAEADLNHAAS 
LGGDYLGDLALRLLCTLE PVFPQSY 


11618 


25519 


A 


11720 


116 


462 


AGMLPAVGSVDEEEDPAEEDCPELVPIE 

TTQSEEEEKSGLGAKI pvti itgylgag 
KTTLLNY ILTEQHS KRVAVILNESGEGS 
ALEKSIiAVSQGGELYEEWLELRNGCLCC 
SVK 


11619 


25520 


A 


11721 


167 


407 


E I YSLTRF I E VKMS KKI SGGT WEMLGD 
EMTRIIRETLKEKLIFPYVESHLHSYDIi 
GIENRDATNDQATKDALEAFNKPY 


11620 


25521 


A 


11722 


254 


423 


NQLSSIMAMFKKIKSFEWFNDPEKVYG 
SGEKVAGRVIVEVCEVTRVKAVR ILACG 


11621 


25522 


A 


11723 


3 


424 


VSCDTMEGGGGSGDKTTGGLAGFFGAGG 
AGYSHADLAGVPLTGMNPLCPYLNVDPR 
YLEQDTDEFILPTGANKTRGKFELAIiFT 



1252 



01 64835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possiblc 
nucleotide insertion 














IGRCCMTGAAFGAMNGLRVGLKETQNMA 
WSKPRNVPILDMVTRQGALWANTLGALA 


11622 


25523 


A 


11724 


2 


343 


AFGTMKWVTLVS VLFLFS SAYSRGVFRR 
DAHKSEVAHRFKDLGEENFKALVMIAFA 
QDLQQCPFEDDAFS TSEVPE FATTCQDD 
DHFEDRCRRVHTLVWQLCTPATLIiETD 
Y 


11623 


25524 


A 


11725 


1 


359 


HAFGTMKWVTVISLVFLFNSAYSRGVFR 
RDAHKSEVAHRFKDLGEENFKAIjVLIAF 
AHYLHQCPFEDHVKLVNEVTEFAICrCDA 
DESAENCDKSLHTL FGDKLCTVATLQE T 
YGEMADC 


11624 


25525 


A 


11726 


1 


349 


GAMSDRKFSAPRHGSLGFIjPRKRS IRHR 
GKAKS FPKDDPS KPGHLTGFL.GYKAGMT 
HIVREVYRPGSKANTNEVAEAVTIVETP 
PMEVAD I AGYMETPRGLRTFNTAFAEHM 
SDEC 


11625 


25526 


A 


11727 


81 


349 


TKGSWSCCVCLCVCLCTWSHLCRLVTW 
LiPDMPDDVLWLQWVTSQVFTRVLMCLLP 
ASRSQMPVSSQQASPCTPEQDWPCWTPC 
SPEGC 


11626 


25527 


A 


11728 


264 


388 


QADPKD I MKFPGPliENQRLS FLLEKAI T 
REAQMWKVNVRKM 


11627 


25528 


A 


11729 


2 


471 


PGCSASWSKRGSGPDMLSSMAAAGSVKA 
ALQVAEVLEAIVSCCVGPEGRQVIjCTKP 
TGEVLLSRNGGRLLEALHLEHP I ARM I V 
DCVSSHLKKTGDGAKTFI I FLCHLtiRGL 
HAITDREKDPLMCENIQTHGRHWKNCSR 
WKFI SQALLTFQTQ I L 


11628 


25529 


A 


11730 


160 


377 


LQGFGRPSVYHAAIVIFLEFFAWGLLTT 
PMLTVS IAELGLCFVRERDKFLGTYHCV 
CLDTCLGVALDSDLKQ 


11629 


25530 


A 


11731 


45 


438 


KLGQRKEATVTKESCKSCSRKVGSPDRF 
RSPQKRSKGRQDCFTLFCFWKVIiDKNME 
LISPTVIIILGCIALFLLLQRKNLRRPP 
CIKGWIPWIGVGFEFGKAPLEFIEKARI 
KVOGRGRRGLQRRQCFLF 


11630 


25531 


A 


11732 


169 


292 


DSILLVNLACSAVISAHCNLRLPGSSDS 
PASASRAAGGAHL 


11631 


25532 


A 


11733 


473 


600 


KF WEKRTEGRGKS KS KNSETGE I VS I S A 
LSTTEVAMHTSTSS 


11632 


25533 


A 


11734 


19 


349 


APSPDAMGHFTEEDKATITSLWGTVNVE 
DAGGETLGRLLGDYPWTQRFFDSFGNLT 
SASAIMGNPKVKAHGTKVLTSLGDAIKH 
LDDLKGTFAQLSELHCDKLLVDPENF 


11633 


25534 


A 


11735 


234 


359 


FYHLKSFTVSSVQSRWLTPVI PALWEAE 
AGGSPEVKSSRPAS 


. 11634 


25535 


A 


11736 


15 


372 


KLPLKALTGEEKTHINIDIIGHVHSVKS 
TTTGHL I YKSRG I DERT I E I FEKEAAEM 
GKGSFKYACILDKLKAERERGITIDISM 
RKFETSKY YVTI IDAPGHRDF I KDMTTG 
TSHADCA 


11635 


25536 


A 


11737 




357 


I W KAAMAS P AGS WARP PRPMRE PQT LAM 
PTNAAEDQKLKLERLMKSPDLAVTIPEK 
MSEWSPGPPPEFDRDVMGSRAGAASGEF 
HVYRLLRRRE YQRQD YMDAMAE KRILDA 
EFQRRLE 


11636 


25537 


A 


11738 


24 


335 


APNANAMGHYTEEDKATITSPWGKGNVE 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alantne 
C=Cysteine, D=Aspartic Acid, 
E=Glutaroic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














DAGGKTLGRLIjDVYPWTHRFFDRFGNLS 
SDSAIMGNPTDKAHGQKVTjTSLGDATKH 
LDDLKGTFAQLSELHCDKLH 


11637 


25538 


A 


11739 


141 


335 


MQFLPCIPILKSLEKSVASHSQTVHSDI 
ISTVEFNHTGELLSTGDKGGRVVI FQRE 
QESKNQVHR 


11638 


25539 


A 


11740 


7 


337 


APSPDAMGHFAEEDKATITSIjWCKVNVE 
DAGGETLGRLIiAVYPWTQR F FDSFGNLT 
SDSAI MGNRKVNAHGTKVLTSLGDAI KH 
LDDLKGTFAQLTELHCDKLHVDPENF 


11639 


25540 


A 


11741 


182 


360 


SRHSISPPVNQIQLGASVTEELTWTKT 
ARVSRAQWLMPVI PALWEAEAGES PEVR 
NSR 


11640 


25541 


A 


11742 


174 


1 


HFSDHSIjFVCVRQDLiTLLPRIjECSSIjIT 
AHCSLHLLGSGDPPTAASWVSGNTGVHY 
HA 


11641 


25542 


A 


11743 


75 


218 


KI ILGRAQWLMTVIPALWEAKVGRSPVS 
ASQSAWDYRREPPCPSCTI 


11642 


25543 


A 


11744 


168 


2 


LAPLWSLGPVLGGVGQGSPGPGFPFFFF 
FFFFETESCSVARLECSGPILAHCSIiR 


11643 


25544 


A 


11745 


83 


2 


REQRFLLPFVQLCPAPRGGVYRGRQAS 


11644 


25545 


A 


11746 


156 


3 


FHSGSGRVENPSFFFSFFETGSCSVTQD 
GECTGATliAHCDLCFLGSSNSS 


11645 


25546 


A 


11747 


244 


330 


KDRAQWVTPVI PALWAAKAGRSLEVRSS 
R 


11646 


25547 


A 


11748 


201 


1 


TSQPLIRITLSTFFFFSRDGGLTMLPRL 
DPELPGSSNPPSCSASRVDGTTGMCHHT 
RLIFFSQTDKK 


11647 • 


25548 


A 


11749 


262 


3 


VYTSLTPPPISVNIjIjTQVSPRSVSYEQN 
IL,CI.l.FVTYFTCTFFVCLFVCIiFVCFFK 
MESCSVAQAGVRWRDLGLLQAPPPGFTY 
TT 


11648 


25549 


A 


11750 


225 


3 


PI INFSVPQFLHLYNGI I IESTPPSCCI 
LKCQSLGQAQWFTPVITTLWEAEAGRSL 
EAWTLKT I LANMAKPHL 


11649 


25550 


A 


11751 


185 


2 


VSTPFNSFPSPPWDJjVFLKGFFFFFFFF 
FFETEFCSSPRLERNGAPLAHCNLRLPR 
SSNSP 


11650 


25551 


A 


11752 


261 


336 


DRVS LWS PRLECNGAI SSH CNLRLP 


11651 


25552 


A 


11753 


282 


381 


TDLLYKKKCGLGAVAHAYNPSALGGQGG 
RITCS 


11652 


25553 


A 


11754 


202 


45 


NPRKVKLQWALILPIiPFNLGGQLKSRFQ 
KKKKKKKKKKKEEKRKKKTLWI C 


11653 


25554 


A 


11755 


293 


357 


LTPVI PALWEAEVGGSPEVRS 


11654 


25555 


A 


11756 


264 




TbPINWCCDKSTGPSLISIiPLIiSLPYS 
LRHNNIEIRLINNPTTACKYSCLKKSHK 
SLTLDQKLEMI KLGEEGLLKAQ IGQKLG 
LLHQ 


11655 


25556 


A 


11757 


126 


1 


ILPGFLKELKIELPFNPAILPLGIYPKE 
KKLLYQKDTCTHMF 


11656 


25557 


A 


11758 


399 


226 


IEKGTGRVWWLTPMIPALSEAKAGGSPE 
VRS PRPAC I GLLKF WYYRCE PQHPARLS 
EI 


11657 


25558 


A 


11759 


235 


380 


DTTVIiKt.tiQLI TLQWASKCS SERKSRTS 
IiTLTQKLEMVKLSEEGMSKGD 


11658 


25559 


A 


11760 


249 


357 


NRAS GQTWCPTP I MPALWEVEAGGSLEP 
RSLRLTWA 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leuctne, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11659 


25560 


A 


11761 


168 


372 


KLHCLLCFYLSEFFVFCFLETRSLSVAQ 
SKCSGTVITHCSIjKIjLCSSDPATSDSRV 
AETTGTGYHAGL 


11660 


25561 


A 


11762 


303 


377 


QFAGCRWLMPVIPALWEAKADGSPE • 


11661 


25562 


A 


11763 


135 


1 


ATTPGLFFIFYFFETESHS IAQAGVQCM 
ISVHCNLCLPGSSDSPV 


11662 


25563 


A 


11764 


219 


2 


KLRSSQDSEPEVRPDSTLSMLMHARAHT 
HTHTHPHLFLPHPCSRPHTSTPTCMPIV 
THSPIYLKFHSHTYIL 


11663 


25564 


A 


11765 


102 


1 


NRPINNPTMASMCSSERKSHKSLTLNQK 
LEMIKL 


11664 


25565 


A 


11766 


245 


3 


PGEPEEDQLVKFRPSLLWVFMFLFFLRR 
SLAIiSPRIiECSGVECSGVISAHCNLCLT 
GFKQFSCLSLLSGWDYRHVPPCPS 


11665 


25566 


A 


11767 


350 


97 


GSVPASPQVTRPTLMTSIADKWFSLCLT 
S LRVNFGVAL I LS GSHS VS QDTL.DLLTL 
FSTHLGLPKCWDYKRVPPCSAQFLVTFL 


11666 


25567 


A 


11768 


188 


391 


LGSVAGDUCjCFFGGVIFPCSFMFPMSLC 
YYL Y I WYNS C VFLFFKTEPHFVTHSGVH 
WCDGIiLQPPPPW 


11667 


25568 


A 


11769 


131 


2 


YTLYI PNKS P I ENIQLS WAQWLVPVI PA 
LCGAETGELLEPRSS 


11668 


25569 


A 


11770 


165 


1 


VLIPLIHL.CIYIYIYTY3CVYIYTRIYT 
H I C I HTHVY I HI YVYIHTY I YT YMYTY 


11669 


25570 


A 


11771 


72 


1 


ILLFFFFFEAESCSVAQAGMQWCN 


11670 


25571 


A 


11772 


121 


3 


TSFVLFCFETEFCSCCCPGLECNGAIIiA 
HCNI.RLPGSSD 


11671 


25572 


A 


11773 


152 


3 


HVCLNLTLPFLFEKNIFSLCVCVCVCVC 
VCWLCKKKKCEKDFYFERTF 


11672 


25573 


A 


11774 


190 


2 


GFS PRQRGAPRVPPPIAGPFPP IFFFFF 
FFEMWSHSVAPAGVQWCNHSSLiAALTSP 
DSGDPP 


11673 


25574 


A 


11775 


287 


1 


GAHRRKLFITPGESLRDKQIAGFEHRGG 
E KKKTLFYKKKKKRKKKRKEKKGKEKKM 
L.PKCNYTMKNHQVPYSIRPTRECADLRV 
FYFIKFQILKY 


11674 


25575 


A 


11776 


121 


3 


KCASRDIiSKFFFFFFLjETESRSVAQAGV 
QWCDLGSLQAL 


11675 


25576 


A ' 


11777 


142 


1 


EKTLHVRNTIHNSEGSLVIKIHGRLGW 
AHACNPSTLGGRGGQITRS 


11676 


25577 


A 


11778 


179 


3 


SHQPVPGTLDLPRGPQKLQSTSEAESEA 
SMSEASSEDLVPPIiEAGAAPYREEEEAA 
KK 


11677 


25578 


A 


11779 


147 


6 


KTPGLKKNSNFF FFFFFFETRSHS I ARA 
GVRGCDLSSLQPPPLGLK 


11678 


25579 


A 


11780 


184 


3 


GFGFYI IPNYRI.FFFSLLIGPFFFLIFF 
ETEPCSVARLECSGVISAHCNIRLLGSC 
DSPA 


11679 


25580 


A 


11781 


120 


319 


VKSIAKFI.VNYQLHRQTCSGQAGRGALR 
QRFGRPRQVDHLRPGVRDQPGQHGETPS 
LLEVRKXiSSG 


11680 


25581 


A 


11782 


91 


3 


YICLSIilYTHTHTHTHTHTHTHIYIYVC 
V 


11681 


25582 


A 


11783 


221 


332 


SRLGMVDHPRSGVRDHPGQHGETPSLLK 
IQKLARRSG 


11682 


25583 


A 


11784 


243 


379 


LKYSVPIKNCKPWPGTWDHTRNPSTLGG 
QGGWITLDQKFETSIiAN 
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peptide 


Predict- 
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ding to 
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sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 

W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 


11683 


25584 


A 


11785 


327 


5 


GRVESQPIYPKNPPGSFPKIYPGSFKTL 
FPYLQSRHSFSIPRSKIiARFHQAKPLFC 
SKPSQS LS PHSELKKKDLSQ AR WLTP VI 
RALWESKAGRSPEVRSSRPTWPT 


11684 


25585 


A 


11786 


245 


359 


FKGMDRLRSGVRDQ PGQHS KT P S VLK I Q 
KLAGHGGMRIi 


11685 


25586 


A 


11787 


356 


67 


RDI AS I TARLRYDRGVGITRLEVLSKS D 
EYTKGSVLDFSRETEPIGYIYIYIYIYI 
Y I YIYLHS LPLNHME ISLGHPPLSNNKF 
VQNFDS PKLSYH 


11686 


25587 


A 


11788 


103 


3 


KYIIPVCIFFFFETESCSVTQSGVQWCE 
LRRRG 


11687 


25588 


A 


11789 


136 




VHQFT I R I LNY YY YY F FETE S YS VTQAG 
VR WCNLG S LQPL P PGFK 


11688 


25589 


A 


11790 


291 


163 


SLLLLPRLECNSTISAHRNPRLLGSSDS 
PASGRSLEPRSSKLQ 














X FMYVFFETESRS VAQAGVQWRYLGSVQ 
APPSGFS 


11690 


25591 


A 


11792 


329 


3 


KNPRNIIGKSRGPYITNFPGDKGPKNSS 
PRGNDLS I NKNWAQAFQPRGQKKLFFPK 
KKKKKETQSRSIPRLECNGPTIAQCNLK 


13691 


25592 


A 


11793 


213 


3 


ISPWLFLLLOATLFRSQYPGPFPPFSLS 
PYLFPFKQKLGSGFFFFFLFFFKTESRF 
LARLECSGI ITAQ 


11692 


25593 


A 


11794 


1 


400 


KRAAPQPAPEQRDIiKKKKKKPPLAPSSL 
FRFWEKLLPSFQFRQPRGPFLGTRGVSF 
AFHRRRFWQYGNMGEKQTWGNPGSSSNP 
PPGTAGS LSGNRGPG WGGKIADRFWLYA 
RDS FGHS RGLKG S CCQAHLGK 


11693 


25594 


A " 


11795 


3 


769 


RKEQTRNARAEVLRQAKANFEKEERRKE 

KKKKKKDKHSKKAKKEKKKKSKKQKYEK 
NNESSDSSSSSEDEWVEAVPSQTPDKEK 
AWKVKDEKSGKDDTQI IKRDEWMTVDFM 
S VKTVSSS S LKAEKETMRKI EQEKNQAL 

WRKPTYSDKAQNCQESRESDLVKYGFCS 
RDRYATTDTAKNSNNEKFIGDEKDKRPG 
SLE 






A 




no' • — 


13 


XI I n x n J. ri J. tl AXl 1 ri 1 ±1 1 ri 1 MKUKisijKX K<- 

VPTV 














VLGQEQLLTPVIPALWEAKAGRSPEVRS 


1 1696 


25597 


A 


11798 


83 


387 


GERRRRRRRRLWAPLQKKKKKKKKKKKK 

RQKNI PPLKGGELKRAPAGDFKNPGRGK 
IARGGFFEKNIjSWGGEK 


11697 


25598 


A 


11799. 


153 


2 


I PWEDTTVYSGYPQFLFSHSFFYFYFFE 
TGSCSLTQTGVQWCNHGSLQPR 


11698 


25599 


A " 


11800 


93 


3 


VWGRAWWLTSVISALWEAKVGESPEVRS 
S 


11699 


25600 


A 


11801 


115 


1 


NPL.FFFFFFFEMESCSVAQAGVQWCNLG 
SLQPLPPRFK 


11700 


25601 


A 


11802 


295 


137 


RCFIFILFINKLYFVYLFIYFFEMESHT 
VAQAGVQWRDLGSIiQAP P PNKLYF 


11701 


25602 


A 


11803 


"287 


1 


MGPHAPFKGTCGGFQDFRFKNI.EISAGL 
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NO: of 
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M 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=L,ysine, L^Leucine, M=Methioninc, 
N— Asparagine, P=Proltne, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














THSPLliAQNFRVGAVPS PRYS PRFGS KG 














LGSLQALPPGF 


11702 


25603 


A 


11804 


202 


3 


WS VCCFKQGLGLS PRLECNGAPGFKQS F 
PLSLLSNWGYRGPPPRLNFFFFFFEMES 
CSLAQAGVQR 


11703 


25604 


A 


11805 


3 • 


315 


FMLL ILTIjFLLRNDRIiVQCDVRS SWCL 
GFLLGWSVILYPLMAAFMPTMWI FLFI L 
FSMLFFYVFFLFLLFFLLFI.YS FPLFCF 
FFCFYLFRFFFFFL I FFSS P 


11704 


25605 


A 


11806 


126 


3 


KRGFFFFFFFFFFETESRSFAQAGVQWC 
DLGSLQALPPGV 


11705 


25606 


A 


11807 


130 


2 


QS I KMTCS SLYFF I KKNVAWWLTP WPA 
L WEAGAGG SHE PKS 


11706 


25607 


A 


11808 


149 


1 


GGKKAILFFFFSLSPRLElCSGAiPTHCK 
LCLPGSRHSPASAFRWGTAG 


11707 


25608 


A 


11809 


258 


3 


KYYYQGNRQIKFVNAKSNKNLPSYSQKA 
SWFSSRNIRMMGQHQPMHLDNPTSFIKK 
LGQAWWLRPVTPALWEAEAGGSLESRSIi 


11708 


25609 


A 


11810 


153 


285 


CAS P IRSHQKIiNKNGWVQWLMPVI PVPW 
EAETGGS LQLRS S RIA 


11709 


25610 


A 


11811 


160 


3 


NHLFLLLNRYKSIFIGRVQWLMPVSSAF 
WEAKAGRS FEPRS SRPAGQHGKTL 


11710 


25611 


A 


11812 


89 


2 


AFFFFFETVSCSVAQGGVQLCHLHSLQL 
N 


11711 


25612 


A 


11813 


254 


336 


LLGQPQWLTPVI PALWEAEAGRPSEVR 


11712 


25613 


A 


11814 


108 


2 


CVGVLGFFVCLFFETESCSVAQAGMQWH 
DLLGSLH 


11713 


25614 


A 


11815 


166 


289 


SHSGWNAVWRDGSLQPLTPGFKRFPCLR 
VPSSWDHKCAPPC 


11714 


25615 


A 


11816 


192 


2 


DKDPGKVTRPFLLLCIFYLFiCLFLRRT 
LAVSPRLEFSGRGCSEPRSHRCTPSWVT 
EQDSCRK 


11715 


25616 


A 


11817 


228 


343 


LLQYS I QNEDAGWAW WLTPVI PTTiWEAK 
AGRSLEVRSL 


11716 


25617 


A 


11818 


322 


407 


VIiRMLLHCLRECKIiVQPLWKTVWQFLKD 


11717 


25618 


A 


11819 


119 


1 


WEKIVRGKKCQKENTHTHTHTHTHTHTH 
THTYRESKRERLV 


11718 


25619 


A 


11820 


269 


1 


FVQVFY I LWTQS FCLTCCQWFLLGYHLS 
SVSNRFMRSGKSTFFFMTSEFFFFEMES 
RCVTQARVLECSGSISAHCKLHLPGPRH 
SPALV 


11719 


25620 


A 


11821 


217 • 


389 


EHIETVYGKLGISEHWGKGGLFNKWLTI 
WKKDYSWQWWIiMPVIPALWEAEVGGSRV 
H 


11720 


25621 


A 


11822 


188 




GSHMPCRVI SSVHESMNEFPAVPTS YPA 
NPQPRERAWRNQREKEDKKERSQRSVGR 
VQAGLV 


11721 


25622 


A 


11824 


126 


3 


KLQGOVQWLTP VI PAPWEAEAGRS PEVR 
SSRPASTWRNLV 


11722 


25623 


A 


11825 


224 


3 


AL 1 1 KKE VS AMELEGSNRI QAYG VQS I T 
GARDYHAAASRVPAI KGTHHHARVI FVF 
LVETGFHHVGQAGLDSC 


11723 


25624 


A 


11826 


3 


364 


HELPEPLRVLWTAHRVAMAPGSRTSLLL 
AFALLCLPWLQEAGAVQTDPLSRIiFDHA 
M LQ AHRAHQ1AI DTYHEFEETY I P KDQK 
YS FLHDSQTYFCFSDS I PTPCNMEETHQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
"^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KSNLELIuR 




25625 




1 1827 


2 


376 


ARELPEPITVLRTAHLKAMAPGSRTYliL 
LAFALLCLPWLQEAGAAQTDTLYMLPDH 
AMLQAHRAHQ.LAIDTYHEYDETYI P KDH 
KYSILHDYQTSFCFSNSITTPYNKEETQ 
QKSNIjELLRISLL 


]1725 


2Sfi26 


— 


n82 g 





— 


IHTJGtfPSVHHHHQRPKVDKTTICMGKKQS 
RKTGNS KKWS AS PPPKEHSSS P ATEQ S W 
TENDFDELREEGFRRSNYSELQEEIQTK 
GKEVENFEKN 


11726 


25627 


A 


11829 


107 


1 


KRSGVRLQRGLVRWLTPVI PTLWEAE VG 


11727 


25628 


A 


11830 


220 


354 


QGNKS FKNI I APWLGTVAHACNPSTLGG 
QGGRITGHEIETILANM 


11728 


25629 


A 


11831 


273 


352 


ENLKTGQIQWLTPVIPALWEAKAGRS 




25630 




11832 


239 


488 


SQTHCSGKTENSHGTNSDRNQVPTALHK 
RPS S PARFT.TEAVPVPPVDYLQS VAAFA 
VSAVAS QWERTG KP FNPLLGET YSL I R 


11730 

. . . 


25631 


A 


11833 


189 


294 


DQTRWLTPVIPALWGAKASGSPEVRSSL 
HFPLLF 












332 


GTS PE PLTViiWTGDLtiAMAPGSRTSLLLr 
AFALLCLPWLQEAGAVQTDPLSRLFDHA 
MLQDHRAHHLAIDTYHEFEETYIPKDQK 
YSFLHDSQTFFCFSDSIPTPSNMEET 


11732 


25633 


A 


11835 


330 


175 


PRELAQLGSPNVRVSRVKPRGQPQMFFF 
FFWRQESCSVAQAGVQWHEPEQQE 


1 1733 












Vl? VETGSRSVAQDGGQWCDLiSSLQPQPP 
RPRA 


11734 


25635 


A 


11837 


267 


334 


S WVQWLTPVI LALWEAEAGG S P 


11735 


25636 


A 


11838 


180 


2 


SVLEKKEKNLYKNLFTYKLLKKVQGKTQ 
CEGRAQICSVCVCVCVSVCTCVHVCAYV 
SSC 


11736 


25637 


A 


11839 


183 


2 


AAPLTS S APQAPGWKPHLAPNS PFKNFT 
GRGQAQWIiMPVI PALWEAKVGRS PEVRS 
LSSC 


11737 


25638 


A 


11840 


54 


330 


DPNGQLPEPLKGLWTAHLVAMAPGSRTS 
T >T iT iRFGliLiCLP WLQKAGAVQTVPLSRIj F 
DHAMLQAHRAHQIAIDTYHEFEEPYIPQ 
DQKNSFLD 


11738 


25639 


A 


11841 


1 17 


344 


i Q ¥ V YJJ IN hIT fcJMRlsjjsJ tli> Y JbNHTS LTI T I 
HTLCLMGSYTiEHFQNQCKGRARRLiMPVI 
PALWEAKAGRSPDVRSSKPA 


11739 


25640 


A 


11842 


750 


968 


RAQGPRWILKIPFSSPFRLFPLIPLVFL 
YNSPPLFPPFPRTNATNNSPFPPPLPPP 


11740 


25641 


A 


11843 


757 


1000 


WELVWYCCLFSPFFPLPSPPLFPPPPP 
SPPHSFPPPSPFRPPLPLAPTRPPPPFS 
LFQAPPLLPS FCSP PPPPPPS LSFL 


"11741 


"25642 





"71844 


"271 






11742 


25643 


A 


11845 


260 


2 


GSCIiLEGKliTNRKD I HTKNP SVHHHHQR 
PKVDKTTKMGKKQSRKTGNSKKQSTSPP 
PKERSSSPATEQSWMENBFDELREEGFT 
RA 


11743 


25644 


A 


11846 


194 


2 


TKFAKEPSPPPCWQEVFKTSPLGFFFCP 
RSFPVNGFPRLKNLFFFFETESCSVAQA 
GVQWCARA 


11744 


25645 


A . 


11847 


219 


1 


SLGHTL.KQSESSLSFSPGGTPLLEFKPS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VYDPSVPPFQVSKGGCVGFFFFFQTESC 
SVAQAGVQWHDLGSPRA 


11745 


25646 


A 


11848 


126 


3 


RFFFKGLFIiHFFFFFFFFCFETESHSVA 
QAGVQWCDLGLV 


11746 


25647 


A 


11849 


216 


339 


KCFKVGKVYKLNRETFYLAQNFFDRNWA 
TQKMWKTLLELI 


11747 


25648 


A 


11850 


190 


334 


VSYHIEMSFFEVYNEKIHDIjLVCKDENG 
QRKQPVRLKQFI ICFEFLFL 


11748 


25649 


A 


11851 


198 


352 


SRQGTWGIHGLECFFVCLFVLFFETESH 
FVPRLECSGAI SAHCTLHLPGEL 


11749 


25650 


A 


11852 


371 


1 


PPKKLGIQVFTP FAPRAF WPVLVFKKKG 
FPQGNLVI.PGFFSKVS IQGVPPFCRKPF 
SGWGEAFQIGFFINRGKKFFFFPLGFFL 
KKVFLKFIiAKFFFFFFFFFETESHSVAQ 
AG VQWRS LiGLV 


11750 


25651 


A 


11853 


176 


3 


KKGFPLDWNSMVKNIIVKFGEDLQISKT 
LHTFFFLTESRSVAOAGMQWCDLGSLHS 
C 


11751 


25652 


A 


11854 


281 


375 


QLTFKKYFLGWARWLTPVIPALWEAKAG 
RSP 


11752 


25653 


A 


11855 


146 


356 


KCGALIAE IEVPLFSELPJDFLLAYSRMT 
ELCFKRCVPSLHHRALDAVDDDCLHSCS 
CTDYPINRSLMNAY 


11753 


25654 


A 


11856 


136 




LS I LCEFFNWL I EMEYRS VSQAGVQWCD 
LSSLKPPPPRLQQCHSC 


11754 


25655 


A 


11857 


277 


361 


I VLRGAMWLTPVI PTLWEAEEGGSPEVR 


11755 


25656 


A 


11858 


295 


152 


WFGFGFFETESHFVAQPGVQWCNLGSL 
CSLRLPGSSNSPASAFQVA 


11756 


25657 


A 


11859 


1 


342 


GTRLPE PLTVLWTAHLGAMAPGS RTS LL 
LAFALiLCLPWIiQEAAAVQTAPVSRLFDH 
AMLQAHRAHQLAIDTYQESEETYIPKDH 
KYSLLHDSQTCFRFSDSIPTPYNMEDTQ 
HK 


11757 


25658 


A 


11860 


244 


332 


TIEMMLDIKQIQVIFLFEFKMGRKIAET 
T 


11758 


25659 


A 


11861 


1 


339 


GTRWTI CQVQLHAYAHFLYFFEMEARS 
VAQAGVRWCDLGSLQPPPPGSSSSSSSS 
S 


11759 


25660 


A 


11862 


311 


394 


GMLGAVAHACNPS TLGGLGGW I TCSQEF 


11760 


25661 


A 


11863 


130 


1 


VPPSVRTFFFFFFETESRSVSQAGVQWH 
YLGIiLQAPPPGSTS C 


11761 


25662 


A 


11864 


3 


320 


IMMYALFLLS VGLVMGFVGFSS KPSP I Y 
GGLVLIVSGWGCVIILNFGGGYMGLIV 
FLI YLGGMMWFG YTTAMAI EE YPEAWG 
SGVEVLVSVLVGIiAMEVGLVLW 


11762 


25663 


A 


11866 


165 


309 


GL I LLPMLECS DR I S AHCSLYLLGS YD P 
SDSGSHVAGTTGTCHYAWLR 


11763 


25664 


A 


11867 


208 


314 


GSRNICLS GQAQ WLT P VI P AL WE AEVGG S 
PEIRSSR 


11764 


25665 


A 


11868 


102 


1 


KKFFFFFETESHSVAQARVQCNLGSLQA 
PPPRFT 


11765 


25666 


A 


11869 


126 


3 


NNALQLHSSYCKKPFFFFLKTESRSLTK 
AGVQWCDLGLLQ 


11766 


25667 


A 


11870 


1 


273 


KQLPVNFLNWVRLELTGLLCTLNLSKPC 
MIFI ILVIVKYWFLLFCNIFKNHI FSQA 
QWPTPVIPALWEADMGGSHGYWITTIVD 
FMCATLT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X— Unknown, *=Stop codon,/— possible 
nucleotide deletion, \-possible 
nucleotide insertion 


11767 


25668 


A 


11871 


126 


2 


FPKALLWFFFFFFFFETESRSVAQARV " 
QWHNPGSLQCPLR 


11768 


25669 


A 


11873 


269 


9 


QVSGKFWPDIFYKTGEKGLQCPNFPQGH 
PPRGLKKKS FS KKKKKKKKKS I AVGQAQ 
WLIPVIPALWEAKVGRSPEVXSSRPACP 
TW 


11769 


25670 


A 


11874 


114 


1 


LEAKPGVFQSGVLliVIFFFFETESHSVS 
QAGVRWCDLG 


11770 


25671 


A 


11875 


146 


1 


GNHLSSRVGVQDQPRQHGKTLSLQKIQK "' 
VAGPGGAHLQSQEVQVGGSL 


11771 


25672 


A 


11876 


2 


159 


SLQFFFFFFRWSFVLVAQAGVQWHDLGS ' 
PQP P P PG FKHS PAS ASQVAGRGGR 


11772 


25673 


A 


11877 


124 


240 


FRAFSSGQAWWLTPVI PALWEAETGGSL 
EARSSRPDWRT 


11773 


25674 


A 


11878 


135 


5 


QVLFYFYFSDSFTLSPRLECSGVTLPHC 
NLCLPGSSDSCASAS 


11774 


25675 


A 


11879 


170 


2 


GFFFPEKWGYKNSFIiCFFFFFFFEPEFC 
SVAQAGVQWLHLGSLQPPPPVFKVDAAP 


11775 


25676 


A 


11880 


97 


267 


GHGHATLRGLCVLSLiFFHIPAPSVSGTS 
DAEECCLCVIYRLICGYTSRNCLYFSDH 
R 


11776 


25677 


A 


11881 


1 


292 


L PEPLRVLW TAHLQAMAPRS RTTLLiLAF 
ALLCLPWLQEAGADQTVPLSRLFDHAML 
HAHRAHQLAIDTYQEFEQTYIPEDQKYS 
FLHDSQTYFCFSD 


1 1777 


25678 


A 


11882 


93 


2 


KYQMGWAWWLTPVILALWEAEAGRSPET 
SC 


11778 


25679 


A 


11883 


145 




PPbGLRLQVQAPTPGFFFFFFETESRSP 
RLKCKGAILiAHCNVCLLLV 


11779 


25680 • 


A 


11884 


202 


1 


TWWRWGVTVLVRLVMNCRPCDRHKSASQ 
Ij I GRVRQENGLNARVGGCSEPRSRHCTP 
VWVTSENPSSC 


11780 


25681 


A 


11885 


42 


155 


GERSGLS PGVQDE PGQHSGTSS LQKILK 
LAGHGGTCP 


11781 


25682 


A 


11886 


215 


1 


S TMARHCP L S PML FNS AME VLVRAISQE 
KEIQGIQIGKEEVQLSLFTDDMIFNIiEK 
RKDCSKNLLQLMNX.V 


11782 


25683 


A 


11887 


214 


1 


GYFFGIiNEVLGKLEKPSLKVPQNPSPKR 
PFFLGFFFFETESRIVARLECSGAISAH 
FNL.CLPGSSDSPVSC 


11783 


25684 


A 


11888 


215 


3 


WGPGFPFFSFFFFFLRQGLAVIRIiKCSG 
T ITAHCS LNFLGS SD PPAS VS L VATTTG 
HEPSITQFHSHGSC 


11784 


25685 


A 


11889 


118 


1 


RFFIMGENPTKFFFFFETESLLPRLECS 
GVISAHCNLSC 


11785 


25686 


A 


11890 


267 


83 


HCLRSGVQDQPGQHGKNPSIQKIQKIiAT 
SFKKSLKIVIPPLPFKNDKTDSKSSCSS 
LSPHS 


11786 


25687 


A 


11891 


134 


2 


DRLAVLPRLECSGMIFLLPLPSRFKRFS 
CLSLPSSWDYRGAPRA 


11787 


25688 


A 


11892 


116 


1 


SKGVGHFIiFFFFFFETESRSVAKTGVQW 
CDLGSLLCLV 


11788 


25689 


A 


11893 


397 


475 


RFVCSTIKVLRDLSSDRSNPGRFLST 


11789 


25690 


A 


11894 


115 


331 


KNVCLFVFVEMKNHLNPGDEGCSKPRSR 
HCTPSWAAQQDSISKYIClYRYLYliDIY 
LS KLWRLRINQLPNS 


11790 | 25691 


A 


11895 | 134 


251 


INPPVSRKKKKKKKKKKKKKKKKKKKKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Prolme, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKKKGGGL 


1 1791 


25692 




1 1896 


186 


402 


PGKNLTLENVPRENKVGEKAPGQNE P PV 
LGGGEYQDPGGNVKGVGPPPAPGFGKDG 
PKRLADTFDLIEGDGD 


1 1792 


25693 




1 1897 


100 


2 


KGPPFFFFFFFFFFFFFFFGQSGQVKIjK 
SPKCK 


11793 


25694 


A 


11898 


92 


3 


S VAR I MP YAIiFLIiS VGLVMGFVG FS S KP 
S 


11794 


25695 


A 


11899 


283 


362 


MTYTMTIVHALWASVCLLLNHAHDPLD 


11795 


25696 


A 


11900 


248 


1 


VMSAQPGLSVILRFMGKWSNWTGMQYTE 
SEVERYRDRGRARETERQRDRERERENP 
RNRKL FFYGRFKHS PQDFFMPQ KVH F 


11796 


25697 


A 


11901 


253 


2 


KLQENPFKIiLNFIINLCSVSUJVP IILN 
INIKPSFIAPKPRCPSKFQRNYIYIYPS 
IYLS1YLSJYLSIYLPITISKVLKDVE 


11797 


25698 


A 


11902 


179 


3 


IAVTKNEIMGFSTTWMEMGDINFFFFFF 
ETKSHPVMRLECSGVISAHCNLRILVSS 
SC 


11798 


25699 


A 


11903 


1 


403 


GTSS QESFGGCCVSGL I AMGTKAQGERK 
LLCLFILAILLCSLALGSVTVHSSEPEV 
1 1 PENNPVKL I CAFYDLSFFSSS FSYFF 
HLS FF I YYPYYI.FVSCFTFY I VI LLS FI 
YSHHL,SSVSYYSYVLLIYTFIL 


11799 


25700 


A 


11904 


37 


242 


KGPTRDQLQHPKARLPAPIiRVLWTAHIiA 
AMAAGSRTSLLUVFALLCLPWLIYDGAS 
HIVSLFMLFIIF 


11800 


25701 


A 


11905 


1 


298 


GTSQHRGRKDSRTGSHSSSDHPGAKLliS 
TEEKQAAETMRPPSAPPRRGCIPWQGLL 
LSSS 


11801 


25702 


A 


11906 


225 


30 


ACYLQKDGAFAILPDVLPDMFKGHSSLY 
PCQHAECDHI KNI YNCVCVCVCVCVCVC 
IVICKLNV 


11802 


25703 


A 


11907 


196 


320 


HSSSSTTPSQDTHTHTHTHTHTHTHTHT 
HKIPQRELLPSVPD 


11803 


25704 


A 


11908 


130 


3 


NGFVFSPFPFFFFFEMESHSVTQARVQW 
CDLGSLQSLPPGFK 


11804 


25705 


A 


11909 


381 


2 


ASI CGQKLIFFQHWKKI ILVQQVGLQKK 
NSTLKRAWEFLSFFQPAPPLGSPLKGIN 
IliNKGAKKLKGKTNFPGPIVFFFSVLPP 
FFGLKKFQKKVFHQGKSPFFFFFFFETE 
SRSVAQAGVQWHDL 


11805 


25706 


A 


11910 


138 


2 


SVPLKEFIISQARWLTPI IPALWEAEAG 
RSPEVRSSRPASTWRNP 


11806 


25707 


A 


11911 


231 


54 


!tRASF(jIQIRICLTIFFFFETEPHSVT6 
AGVQWRDLGSLQPPPRRQEQNSVSKQTK 
TN 


11807 


25708 


A 


11912 


3 


461 


DAWGRVEGPPLRPPATSRRWAGPTLWRM 
EVTGDAGVPESGE IRTLKPCLLRRNYSR 
EQHGVAVSCLEDLRSKACD ILAI DKS LT 
PVTIiVLAEDGTIVDDDDYFLCLPSNTKF 
VAMAINE KWAYNNSNGGTGW I SQ ESFDV 
FEAYSGATLLFFF 


11808 


25709 


A 


11913 


318 


407 


LCGRLWWLMPVI PALWEAEAGGLLKLRS 
MR 


11809 


25710 


A 


11914 


340 


5 


DRVAKFSEARLFRLFFPFFPLKI FCFPR 
GFKI FRGVCPLLFPPEFWGLFQKGPGMR 
LFFP PLGGFFFFFF FFFFFFFFSFFFFF 
FFPFFFFFFFFFCHKNKTLLKKVHNSI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=f hreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11810 


25711 


A 


11915 


274 


385 


I KPQRWGETRAEKLKSLKI RVPLLLQRN ~ 
AAPHQQWNK 


11811 


25712 


A 


11916 


259 


441 


DTKLPKVNIKLNFHALKKKKKKKKKKKK 
KKKKKKKKSSSLRG 


11812 


25713 


A 


11917. 


254 


402 


LI VS VI DFLRWRLPLLPKLE CSGMISAH 

CRIiRIj pgs yhs PAS gcqllgr 


11813 


25714 


A 


11918 


223 


1 


NTNSPRKKFFYEEVSSVPVFFGVFPPSP '" 
LKGSPRAFLKLAWRPPLPLFFFLEMESR 
S AAQPE VQWCDLGS LQPP 


11814 


25715 


A 


11919 


97 


3 


KP FFFP FFFFFFE I ESRS VTQAGVQWRD 
LGS 


11815 


25716 


A 


11920 


154 


384 


KEFFLMLFFFLPPPPPFFFFFFLGKGLF 
FFPPGGGGGGQFFS IGPPPPRGKKDPPP 
SPPKMGGKRGTPPPPGYFFFF 


11816 


25717 


A 


11921 


225 


369 


GTLNLLTYKIKSWSGAVAHYNPSTLGGW 
GKWITSGQEFETSLANWKP 


11817 


25718 


A 


11922 


116 


372 


MEYTWNKMAMSYNIjHIYQTYKNSVIEHP 
TGVLPHSRVIMVNNIVLCISKQLKELLR 
RLMWEEHLSPGGGGCSEPRSRHCTPAWA 
T 


11818 


25719 


A 


11923 


271 


409 


KKKKKKKKEKKKKKKKKKKKTGGGA 


11819 


25720 


A 


11924 


121 


228 


KKKKKKKKKKKKKKKKKKKKKKKKKKKR 
GGGFKKKPWGGQK 


11820 


25721 


A 


11925 


55 


423 


NKPKKKNFLKKKKKNFFFPFYPLKFFFF 
PKSLNFFRRWPK I S PPKKKF FFKNS P S 
VFFFPPLKKKNFFFLTPLKFGPPKNFFK 
RPPPLFFFFFFFFFFFLSGNCIEKL 


11821 


25722 


A 


11926 


253 


443 


YQHQRPKVDKSMKMGRNQRKKSENSKNQ " 
NTSSPPKDHNSSPARQQNWMENEFDGLT 
EVGFRRW 


•11822 


25723 


A 


11928 


175 


413 


KKKKKKKKKKKKKKKKKKKKKKKKAGGG 
A 


11823 


25724 


A 


11929 


490 


182 


RKQKIKGCKKPNPLAQGGVKKKGKKGPP 
FNFFLKQKGQDFLKNWNLGQKKKI PI PP 
QVFSLRQKGAFPGRIFFFFFFFETESCS 
VAQAGVQTIjSYKNKNKIK 


11824 


25725 


A 


11930 


110 


2 


KKIPRKPPFFFFFFFFYKAGSHLVAQAG 
VQWHDLGS 


11825 


25726 


A 


11931 


169 


987 


YLEKIMSEHSRNSDQEELLDEEINEDEI 
LANLSAEEIjKELQSEMEVMAPDPSLPVG 
MIQKDQTDKPPTGNFNHKSLVDYMYWEK 
ASRRMLEEERVPVTFVKSEEKTQEEHEE 
IEKRWKNMAQYI/KEKLNNEIVANKRESK 
GSSNIQETDEEDEEEEDDDDDDEGEDDG 
EESEETNREEEGKAKEQIRNCENNCQQV 
TDKAFKEQRDRPEAQEQSEKKI S KLDPK 
KLALDTSFLKVSTRPSGNQTDLDGSLRR 
VRKNDPDMKELNLNNIENIRK 


11826 


25727 


A 


11932 


161 


389 


SVQTHPNLRSCSVLKNAMHFYLLGYVIS 
GCTEPAKAI KP I DRKSVHQ I CSGP WLS 
LSTAVKKIVGNSLDAGATNI 


11827 


25728 


A 


11933 


134 


289 


ASTQNMGQMLRLMSSPTLLSLSLSHTHl' 
HKHTRMHTHTHTHTHTYTPTKKV 


11828 


25729 


A 


11934 


128 




IKKGEFFFFFFFFETESHSPRLQCSGTI 
LAHCNIiHIiLGSSNS 


11829 


25730 


A 


11935 


131 


19 


MYIGWVQWLTPVI PALWEPKVGASLEAR 
SLRPARTSE 


11830 


25731 


A 


11936 


104 


3 


LKGQLIRAQWLMPVIPSLWEAEMGRSPE 
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VQSSR 


11831 


25732 


A 


11937 


146 


2 


GGRLGI I YHAWNPEGFFFFFEMESCSVG 
QAGVQWCNI.SSLQPPHPGFK 


11832 


25733 


A 


11938 


149 


1 


DKCFLCVCCLFIFIYLLIYFYFFETESC 
S IAQAGVQCCNLGS PQPLPLA 


11833 


25734 


A 


11940 


176 


380 


QSVAQLPRLECNGMILAHCSLHLLGFKR 
FSCLSLPSSWDYSRIiRNYQGGRRWSKTH 
QAWCQSLTILWN 


11834 


25735 


A 


11941 


133 


415 


KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKSG 


11835 


25736 


A 


11942 


222 


420 


QGDKFLDNNNST I LYMEKKKKKKKKKKK 


11836 


25737 


A 


11943 


163 


425 


KNPPKNSQLSPSTIiKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKTG 


11837 


25738 


A" 


11944 


143 


3 


KMGKKQSRKTGNSKKQSASPPPKERSSS 
PAMEQSWTENDFDEL.REE 


11838 


25739 


A 


11945 


200 


3 


QNSQVGYKGAPHKEKTMALQARVNLGPP 
RGPLKRPALFFFFFETESPSVTQDGVQW 
HDLGSMQPP 


11839 


25740 


A 


11946 


221 


3 


RKIEGGPSTSRKSVRGVRCFQPLGLGFP 
GGLFFFFFFYETESCSVARLECSGWSA 
HC I LHLRSTNEGRRGR 


11840 


25741 


A 


11947 


68 


177 


ISSSFKNKCLCEKKKKKKKKKKKKKKKK 
KKKKKRKR 


11841 


25742 


A 


11948 


210 


329 


RHTHTHTHTHTH I LS L S LSHTHTHAHTL 
MYFLAFFDLRS 


11842 


25743 


A 


11949 


183 


20 


LGTVAHS CNFVFSLSRGFLHVGKNGLRL 
PTSDDPPTSASQSAGTTGMSPCTWPE 


11843 


25744 


A 


11950 


232 


329 


VGIKMS ISSDEVNFLVYRYLQESGFSHS 
AFTF 


11844 


25745 


A 


11951 


412 


239 


LFYEKGSRFVSQAGLELLELKQS PCFGL 
PKCWDYRHEP PRQADLPFCVPSSAWASW 
Li 


11845 


25746 


A 


11952 


310 


393 


PHTDISGTPEIMHYVHVHRVTTQPRNKP 


11846 


25747 


A 


11953 


193 


3 


LSSWPTLISGAFKAEENAVEGNDDSRIL 
RGFFVLFCFVLFFRQSLALSPRLECSGV 
ISAHCNL 


11847 


25748 


A 


11954 


185 


416 


SVQTHPN1>RS CSVLKNAMHFYLLGTE PA 
KAI KP I DRKS VHQ I CSGPWLSLS TAVK 
KIVGNSLDAGATNIDLKLKDY 


11848 


25749 


A 


11955 


100 


1 


AHL.SKVFPI FFFFFEMESHSVTRLECSG 
TISAH 


11849 


25750 


A 


11956 


35 


521 


KEKFFFHAGVYWGPPRNFLKRAPLFFFF 
FFFFFFFFFFSFFFS 


11850 


25751 


A 


11957 


214 


1 


KKKIFFFKEIFYPKLSRKKKNFFRGAGK 
ILNIiS I FPEKKKIFSPFSFFFETESCTV 
AQAGVQWHVLGSLQP 


11851 


25752 


A 


11958 


114 


1 


RYFSGQAQWLiTPVIPAliWEAEAGRSPEV 
SSSRPAWLTW 


11852 


25753 


A 


11959 


188 


12 


SKCQCTFFSNFKKTEMESHYLAQAGL.EL 
LGFSHL.PTSVSQTVGITGVSHCAQPNAH 
LY 


11853 


25754 


A 


11960 


241 


556 


SSIPFLPNKHLLLSFSTLSSLGREVS IQ 
DMCQGTYQKQPFPFPLPHVFHLS IRDGE 
IjCVGRDVHLTCQTDIiAQVFVCLFVLGVIi 
LSLPRAGVQWCHLSSLQPPPP 


11854 


25755 


A 


11962 


362 


462 


KIGQALTPI IPLLWEAEAGASPEARSSR 
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K=Lysine, L=Leucine, M=Methionine, 
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SAWPT 


11855 


25756 


A 


11963 


43 


2 


CCCYCCCCCCYCC 


11856 


25757 


A 


11964 


30 


415 


GLGFKQKIFFFFFLGQGPNFDLLRAKKR ' 
PFLPPPFFPKNPKNRGVFLLGPSGVPKP 
RPRQGDVPPLKPRS lgvflts kktplgp 
WANPPPPPGDGP 


11857 


25758 


A ~ 


11965 


149 


2 


SKTWLKLAKYLKNTYPGWVRV/IjIPIVPA " 
LWQAGAGEIjHEPRNLRPAWAT 


11858 


25759 


A 


11966 


179 


3 


kkniffpppvkfgppqgffkrppplfff 
fffffffetesrsvaqagvqwls igslq 

AP 


11859 


25760 


A 


11967 


245 


382 


dvtdskmhpdfrsshvkmwlgwahacn 
pstlggsrgqefktslan 


11860 


25761 


A 


11968 


264 


10 


lstleckgvspgvfspkgwrkglgyifs 

PFLEKPH I PPLG I SM YFFFFFDGVS VAQ 
AGVQGRDLGSLHLPSPGSSYSCASASLS 
S 


1 1861 


25762 


A 


11969 


326 


406 


RliLKGWALWLTPVI PTIjWEATVGRSPE 


11862 


25763 


A 


11970 


120 


1 


KGYGFPFPPKKPLGPPFFFFFATESCSV 
AQDGVQWRDIiGS 


11863 


25764 


A 


11971 


84 


370 


RNGAELKTPlAGQTQITINCRYSWVTLV 
LGDLESILCWHIFKKTNLFICGFFSSFL 
LGAGRMKSHCWRLECRGMI SAHRNVCL 
LGSNDS PCS AF 


11864 


25765 


A 


11972 


196 


3 


SRGKMGEYPFVLFKNPSWAKAGGNKGNP 
FKFFFEAESCSVAQAAVQWCNIiGSLQAP 
PPRFTPRA 


11865 


25766 


A 


11973 


559 


644 


KIGCSAYGVAILLFLYFFNKLAFTLWKK 


11866 


25767 


A 


11974 


3 


391 


HEAQLPEPLMVLGTGHLAEMAGGSRTYL 
LLAFGIiLCLPWLQEAGAAQTVPLSRDFD 
HAMLQAHRAHQLAIDTYQELEETYIPKD 
QKHSFLHDSQTSFCLSDSIPTPSNMEET 
QQKSNLELLRISLLLIE 


11867 


25768 


A 


11975 


138 


365 


LKFECHSTLC7USIHGKQTFTFVFYNKFLL 
KYTFYFS VLTYAINWY I KNLINVFKKKK 
KKKKKKKKKKKKNFKKKKN 


11868 


25769 


A 


11976 


275 


433 


MS ILDLSKARNFFLSFLETGS CS I TQAG 
VQWIJNHSSLQPQTPGLRDPPASASQ 


11869 


25770 


A 


11977 


108 


3 


RCGLFFFETRASPCRPGWHNLGSLQPLP 
PRFKRF 


11870 


25771 


A 


11978 


270 


408 


FENNVWIiGRAQWLTP VI PALWEAQAGRS 
PGPENQNHPDYKGNPWPY 


11871 


25772 


A 


11979 


228 


441 


QALKFVI EMLLSLKKCLDVS V I FNRHKK " 

IELIiQKKKKKFOCKKKKKKKKKPSQKKKD 

SSRGKDS 


11872 


25773 


A 


11980 


2 


447 


GAIiALNKTTADVWRLNFLVSGIiHWKRWL 
QQTSLSKWVKIKECSILKKKKKKKKKKK 
KKKKKKKKDKK 


11873 


25774 


A 


11981 


124 


445 


KYGEMSQNPARGGPKFSKNQKYSEHLRI 
HCCPPFTFIiISKKEIGDRKYS I CKSGCF 
YQKKEEDWFCCPCQKTKTSRRAKSLKRP 
KQKPVAPPGGVKAPAKPRSLPRF 


11874 


25775 


A 


11982 


48 


429 


KESNGS QDRLLPK I HS PD VSGGKSGGMS 
QNPARGGPKLSKNQKSSEHLRIHCCPPL 
PFLISKKEIGDRKYSFGKSGCFSQKKKE 
DWICCPCQKTKLKGKIRPPPKKNGPGGS 
LNGRT TWVS GL FVHN 
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K=Lysine, L=Leucine, M=Methionine, 
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11875 


25776 


A 


11983 


419 


500 


SliliVKVEKQWPGAVAHACNPSTLGGPG 


11876 


25777 


A 


11984 


362 


496 


LWS P IRIIjIiIjRWAKWHVPT VPAI WE AE V 
GRLLEPRSLCNMVRPIS 


11877 


25778 


A 


11985 


142 


433 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKGGGGV I KKFLGGP I FGGGG 
KEKFFFFGGGFIN 


11878 


25779 


A 


11986 


177 


2 


CQRRWVTLSHHTHSITRPLGQLKKGWSP 
GAVAHTYNPS FLGGQGG WVTYVHRTRGR 
TR 


11879 


25780 


A 


11987 


178 


443 


DKKKKKKKKKKKKKKKKKKKKKAGGGLF 
KKI 


11880 


25781 


A 


11988 


329 


410 


LIFFFIFFERESHSVAQAIVQWHDLGS 


11881 


25782 


A 


11989 


146 


1 


DRASALQPGRQGRNFVSKWKNNNVI YI 1 ' 
NTLTTTI FLGTTYDALS PE L 


11882 


25783 


A 


11990 


240 


420 


GVFAPLLGDVSQSGYTGFRDPLEEAVCL " 

FLEIiECHAERTTALFRAVRQGCLSLQNL. 

SVAF 


11883 


25784 


A 


11991 


239 


410 


GTiFIYLLGDLFIYFRNKSLTLLPRiEC 
GGVIVAHCSLDLLGSG YPS I S ALPS S WD 
C 


11884 


25785 


A 


11992 


126 


3 


YTGE YKS FCHKDTCTHMFTAALFT I AKT 
WNKPNGRVGGRV 


11885 


25786 


A 


11993 


300 


22 


SQLLGRLRQEKHFNWGGRGCSEPRSCHC 
I PAWATRANS I FCGFQASS VEVRRSARK 
KLFSDILKRHNTISWRVSGLLLVDSYFG 
RLATPVRAQ 


11886 


25787 


A . 


11994 


303 


2 


EGEEI FLPSPPFPPGQENPGSNCSPPLF 
P FGTFLD I R VAGSHKVQAKGVFKKR P PS 
LHLFLIKKRFFFFFFETESRSVPQAGVQ 
WRDLGSLQTPPPGFKR 


11887 


25788 


A 


11995 


171 


457 


SFSDHLILGWIGFCaSINLSKIALCSSPIK 
NKNDDLQKKKKKKKKKKKKKKKKAKASS 
SYQDSS 


11888 


25789 


A 


11996 


357 


204 


EKTGFPHVGQAGVDIjLTSRSTRIjSLPKC 
WDYRHEPMRLAGHIjYYFYPQMK 


11889 


25790 


A 


11997 


219 


I 


PRVFWAPPPRYPPGALFWAPRPVGVSLG 
ARAPTKAGPKQKGARGTFFFFFEMESHL 
SPRLECNGTIWAHCNLC 


11890 


25791 


A 


11998 


254 


1 


AHLRGNRQLPKHTFFQYMTTNLKCAFSV 
GRQSYS IPWRSFYTSLFFKYSCVFHTHT 
HTHTHTHTHTIjYFQIMVLLLPSLRKKGS 


11891 


25792 


A 


11999 


167 


2 


NFKS FFQGLSRGVLNPKVNLFFFLETES"" 
CSVS QAGMQWGDLDSLQP PPTRP PTR P 


11892 


25793 


A 


12000 


104 


3 


RPWTFFFFSETKSRSVAQAGVQWRDLCS 
LQPPP 


11893 


25794 


A 


12001 


178 


2 


KIFQRGAKNSPWGLiPFRGGENKKGAPPC 
QKGFGRFFFFFETESRFITQTGVQWCDL 
GP 


1 1894 


25795 




12002 


163 


282 


GIGGEWCLSKYVI KYVKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKK 


11895 


25796 


A 


12003 


195 


2 


KWGR I PLKKGGFCFTKKKSGQTLL I KS P 
PPAFFVFFIFFFIiETESRSVAQATVQWC 
DLGSLQAP 


11896 


25797 


A 


12004 


152 


1 


CFVDVSAEKLRPKDFMTKTPKAMATKAK 
I DKWAXiTKLKS FCTAKET I IRV 


11897 


2579S 


A 


12005 


359 


1 


EWPIWPPPKRRVLSKVSKQFFISAPIR 
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KKPFPCQVGLIIiAPFRVLLIGRPFFFFF 
GVL WE I QR WS F I S LVWWRAR IMM Y A 
LFLLSVGLVMGFVEFSSKHCPVYGALVL 
VSIGGRV 


11898 


25799 


A 


12006 


189 


12 


DGRLPGSVCYYLIFLRQSFTLIjAQAGAQ 
WCHLGSLQS PPPGFKRGRRVSRIRAVRG 
DQ 


11899 


25800 


A 


12007 


133 


3 


KI FFFFFFFEMESHS VAOAGVQWRDLGS - 
MQDP P PGFMPLLFY 


11900 


25801 


A 


12008 


150 


2 


I I LFLVP IEAVFFFFETVAEAG VQWHNL 
SSLQPPPPGFKQSPASASRVA 


11901 


25802 


A 


12009 


305 


3 


KFFFSLKAS IRGRWFGLFFTPPKKGFLP 
KI PHQVFKGGPLWEKLQLGKAGLNLGP Y 
KGFFKGAARFFFFFFETESRSVAQAGVQ 
WRDLAHASVHASAHAS 


11902 


25803 


A 


12010 


263 




YGPPFFFFFFKAESRLVAQAGVQWQDLG 
SLQPPPCRTRGSP 


11903 


25804 


A 


12011 


107 


374 


WMGTWGSLPVLLTTRSTCPPRSPRLHRR 
TGATIWIFFRLFETGTVLPMLECSTVIM 
AHCRLQLPGS S SS PTIASQVAGTKAHLS 
DCFVY 


1 3904 


25805 


A 


12012 


344 




FPKKINNFYFTFXiQDPTGI FSLDKTI GX» 
GTYGRIYLVS 


11905 


25806 


A 


12013 


371 


3 


IFHLRKIFTFLPGIFWPIRLLSEKKKPK 
RFWVKIKKFKNGYPLiKNPINPKYKGPRF 
FFKGEKNPNGKVP VLKKIKI PS FSKALG 
FLQKKASKPFFFFFFDTESCTVDQSGVQ 
WCDLGSLQAP 


11906 


25807 


A 


12014 


101 


3 


RGAFFFFFFETRSRSVIQAGVQWGDLGS 
LQAP 


11907 


25808 


A 


12015 


205 


2 


VFNPSGINVMYVDAIILNKRLiATRIQQH 
IKQRILHDQVGFIHGMQGWFS IRKSINV 
IQHINRPKDKN 


11908 


25809 


A 


12016 


168 


1 


GCVCQFQKKLGNGGLNGFFFPLFFFFFF 
EMKSRSWQAGVHWRYLSSIiQTPPPEFK 


11909 


25810 


A 


12017 


311 


2 


RGLRFGQWKTQMENPNTPFSALRPGQLS 
S IRSGLLHTYPGLGDRSEPiSCS IhSSS 
KYWWQAVTSALSSNKPGASQGHWKDDF 
FLFFETESCSVTQAGVQWH 


11910 


25811 


A 


12018 


383 


247 


LVEMGFRHVGQDGIiDUJTSQSAHLGLPK 
WWD YRRE PPRLAT I E VL 


11911 


25812 


A 


12019 


2 


379 


RVL WTAHLAAMAAGS RTSLLLAFALL CL 
PWLQEAGAVQTVPIiSRLFKEAMIjQAHRA 
HQIAI DTYQEFI SSWGMEAYITKEQKYS 

NLELLHI SLLLI ES 


11912 


25813 


A 


12020 


3 


389 


PEPLRVLWTAHRGAKGAGSRTSLLLGFA 
LLCLPWLQEAGAVQTVPLS KX.FDHAMLQ 
AHRAHQIiAIDTYQEFEETYI PEDQKYS F 
LHDSQTSFCFSDS IPTPSNMEETQQKTN 
LELLRIFL.VLIELWLDP 


11913 


25814 


A 


12023 


247 


386 


PMLGHVSQSGGNGVRDPLEEAVCPLAKL 
KHCSGRSTALFRAGRQKR 


11914 


25815 


A 


12022 


17 


371 


PLRDLWTAHLEAMAPS FRTDLLLAYALL 
CLPWLQEARAVQTVSLYRLSDHAMLQAH 
RAHQLAIDTYHEIEETYILKDQKYSFLH 
DSQTS FCFSDS I PT P S NKEETQQ KFYLE 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LLPISL 


11915 


25816 


A 


12023 


3 


359 


LRALWTADLGAKAPGSRTFLLLASALLC 
IjPWLEEAGAGQTVPLSKIiFDHAII.QAHR 
AHQLAI DTYQELE ETY IPKDQKHS FLLD 
SQTSFCFLDSIPTPSNMEETRQKSNLEL 
FRTSLLL 


11916 


25817 


A 


12024 


2 


363 


PEPLKGLWTDHLGALAPS SRTS LLLAYA 
LLCLPWLQEAGAGQTVPVYTLFDHAMLQ 
AHRAHQLAIDTYHELDETYMPEDHKYSF 
LHDSQTCFCFSDS ITTPYNMEETQHTSN 
VELLRISL 


11917 


25818 


A 


12025 


314 


393 


GQAWWLT PVI PAFWEAKGGRS LEAPS 


11918 


25819 


A 


12026 


80 


392 


PLIACSFFLFLFFFFGKKILFLPPRWKG 
GGGIWVNGNPGFRGQAFFLAS SSKKPGM 
GGPPLTPGKRFVFLKKKGFSPGGPGGSK 
FSAPGTPPTGPPKGWEIPAK 


11919 


25820 


A 


12027 


165 


2 


WQKLLFHFGTESCSVARVGVQWRHFSSP 
KPPPPEFKQLSAPASRVADRRPPDAW 


11920 


25821 


A 


12028 


1 


338 


VFVALKFLMCLLSVCIFSLENIYSNSLP 
QKNCLPLIFFFLFFEMEACSVTQAGVQW 
GDLGSLQPPVSHLNLGGGGCCEPRECHC 
TPCSRPGDRDFVFNKNTMIQDNHLMELT 


11921 


25822 


A 


12029 


2 


315 


HEERERERERERERERERERERVGRTVG 
GVGETYKAELPRVGGAAQKRAHFSARGR 
LFMEICGDMCGEKPGKPPHSLTEECLSR 
CGERFFDTSLAI TRG CAQS V 


11922 


25823 


A 


12030 


216 


3 


ERIIPRGVRQRNRLFSGRDTASPPLFYP 
PPRQIHKRGVEDREHERRGERERERERE 
RERERERERARAAR 


11923 


25824 


A 


12031 


114 


329 


QTERNS ININKKDIHTKTPSVGHQHQRP 
KVDKTTKMGRNQSRKAENSKNWKASSPP 
KEHNSSPAREQNWMES 


11924 


25825 


A 


12032 


177 


1 


ikkktffwqniplfskkktcrqkfffff 

FFETGSLLPRLVCSGAVLAHCSLCLPGS 
AFL 


11925 


25826 


A 


12033 


127 


1 


psfffffffnetesrsvaqagvqwrdld 
slqapppgfitpsc 


11926 


25827 


A 


12034 


47 


314 


GAPVASVSISCPSCSATDGWRNGKSTA 
GHQRYLCSHCRKTWQLQFTYTASQPGTH 
QKIIDMAMNGVGCRATARIMGVGLNTIL 
RHLKN 


11927 


25828 


A 


12035 


15 


408 


GAI PGAMGHFTEEDKATITSLWGMVNAE 
DAGGETLERLLVDYPWTQRFFDS FGNLS 
SASAIMGNPKVMAHGKKVLTSLGDATKH 
LDDLKGTFAQLSELHCDKLHVDPENFKL 
LGNVLVTGMAIHLGKEFTP 


11928 


25829 


A 


12036 


122 


2 


KLVDVLPPPGGARVFFFFFFPLETEHCS 
VAKAGVQWHDFG 


11929 


25830 


A 


12037 


154 


1 


GKPPPKLGFWGGPRQRVRGRGPPFFFFF 
ETESPSVAKAGVQWCDLGSLHPL 


11930 


25831 


A 


12038 


58 


254 


DPRVRQQRTTAHCSLNLPGLQRPPSIjSL 
PSGKDYRHAPLCATQKILAYRVAQRERS 
AHGSYYQAS 


11931 


25832 


A 


12039 


270 


124 


DGVLLLLPRLECNCAISAHGNLRLPGSS 
NSPASNKTKQNNNKKTLSNKF 


11932 


25833 


A 


12040 


63 


355 


LGRGT AHRRLiWRPLiS RP ASRVS YLS AAT 
NKRSFAPPSRAFPWDNKWERGGFYFFET 
GSHRRRPGRMECSGAITAHCSLDFPGSE 
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nucleotide 
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ing to first 
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peptide 

sequence 


Predict- 
ed end 

otide 
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correspon 
ding to 
last amino 
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residue of 
peptide 
'sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MGFTMFPSLVSNS 


11933 


25834 


A 


12041 


184 


3 


GVSRLLGGVSQLGYMGVRDPLiEEAVCLiF 
SKIiKHHPGRTTALFRAVRQGGLSLQKFL 
LLFA 


11934 


25835 


A 


12042 


118 


3 


DRVLLCYLGWECSGTI S AYCNLHLPGS K 
RFSCLSLPSS 


11935 


25836 


A 


12043 


40 


436 


LSEGKLTNRKDNHTKTPSVHHHHQKPKV 
DKTTRMGRIQSTKAENSKNESTSSPPKE 
HSTSPATEQSRMENDFDELREDFRRSVI 
TNFSELKEDVRTLCKEAKNLEKRLDEWR 
TRINSMEKTINDLMELKTMA 


11936 


25837 


A 


•12044 


101 


2 


HLGQVRWLTPVI P ILWEAE VGGS PEHRS 
SRPAW 


11937 


25838 


A 


12045 


123 


3 


LTV I KELPLGQARWLTP VI PALWEVKVG 
GSPEVRSSRPA 


11938 


25839 


A 


12046 


3 


432 


P I FGGGGERI FPF FFFFSQGGGVFFPKT 
KNKGFFPFLGFFKKIFLRNFFFLFFPFF 
FLNPFFYFGAPPFFFFFPGGGFFFPLFS 
RFFKFFFQNFPQGFFFGGGFFFFFLGFF 
FFFFFL 


11939 


25840 


A 


12047 


325 


164 


KNKRKNRGQARWLTPVIPALWEAEAGGS 
PEVRTTGVSHGTRPHLLLNTVSEYI 


11940 


25841 


A 


12048 


187 


381 


IiWBDKRVSLWGDENILKVESNSGCTTLYL 
LVCFEMESRSITQAGVQWRDLSSIiQPPP 
SRFKRFSCL 


11941 


25842 


A 


12049 


84 


464 


SYRVPSHPDTLVLSRISAQEAGEKSPFC 
FPERVWPCPRPLSDLGRRLKLECGPDLL 
DSTFLS FFFGEGLKTGSHSGALECSGVI 
RDHCGLCIiSGSGDPPISACLRVAGTTGV 
SHHCIFCRQDLPGS 


11942 


25843 


A 


12050 


232 


20 


LLQCSSRAKIHTSLTGNQKLEMIKLSEE 
GMS KAE I S QKLGLIiHHLGL VANAKQRFL 
KVIRSATPEFRHAE 


11943 


25844 


A 


12051 


401 


518 


FFETESCSVTQAGVQWRNLSPIiRA 


11944 


25845 


A 


12052 


191 


1 


I1FPI1KKE K I FFLGPGTNGAP PM F F LKAP 
PLFFFFFFFFFFFFFFFFFFFFVNIiFLL 
LSFLHIL.EP 


11945 


25846 


A 


12053 


12 


369 


PliPIYSACIEVGSNPQGPGIDAKSVSHN " 
NCYLKEKKKKKSNGSQNNLLPKIHTPNG 
SGKKTGEKIQNPPRGGPKLSKNQKYSKN 
LKKHCCPRLTFL I S KREKGNRKYTFGKS 
GCFYQKK 


11946 


25847 


A 


12054 


89 


513 


NFXVRGS I PIoLNNI PMANLLLLI VP IL I 
AMAFLMLTERK I LG Y I QLRKGPNWGP Y 
GL.LQPFADAIPLFPKEPLTPAPSAFALC 
FAAPALALPFALFLCPPRPLGRP 


11947 


25848 


A 


12055 


158 


3 


SQL.TWP IMAQTDSMCLWIVYFDDNIGWA 
WWLMS VI PTLWEAKVGGSLEHSN 


11948 


25849 


A 


12056 


113 


1 


KSHGGHGVEVLVSVLVGLAMEVGLVLWV 
KEYDGAALA 


11949 


25850 


A 


12057 


44 


263 


ARIARSAHEGKMPRKYTGARKNAENRPQ 
RELQLKASRSTIDLAEHACNALWYQLFL 
ISVGSWKNYILFYLTYT 


11950 


25851 


A 


12058 


142 


1 


KKIFFRYKKEGFSFPKFFRISLFFFFFF 
ETESRSVAQAGVQWRDIiGS 


11951 


25852 


A 


12059 


12 


347 


IVDKSTKMRKNQCKKAENSKNQSASSPA 
KDHNCLPAKEQNWTENEFHKLTEVGFRI 
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residue of 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=lso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=TJn known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ITNSVELKEHILTQCKEAKNLEQRLEEL 


1 1952 


25853 


A 


12060 


245 


3 


MPEGS FFLEI PE I TRVFVKKPGKFLGGV 
GPSLLFFFPFFFFFFMRQSIiAVSPRLEC 
NSAISAHCNLCLLGSSDSRSSGSR 


11953 


25854 


A 


12061 


94 


3 


KGGI FFFFFFFFLRQSPS VAQAGVQWCD 
LG 


11954 


25855 


A 


12062 


108 


5 


NNLSSGVRDQPGQHGETTSLLKIQKLAR 
HGGRHL 


11955 


25856 


A 


12063 


298 


1 


KKKKINTHAQKRGPPFFFKPPPEKGKAP 
PPQRVTKKGGEKDSFIPKRREKKNPPPL 
SFFFFLVRVHVRERERARARERERERER 
ERERERERERERERAR 


11956 


25857 


A 


12064 


122 


1 


VCEGVCVGGWVRDRERERERERERERER 
ERERERERERAR 


11957 


25858 


A 


12065 


126 


1 


ATCTKFRNDNRALLRVRVFERERERERE 
RERERERERERERESR 


11958 


25859 


A 


12066 


48 


351 


FFFFFFFFKKGPKIVPPGGGGGGDPIFL 
EPPPPPLKKFWGPPLRGRGNKGGGPPGW 
VNFGIFIjKKKGIjPWGPGGVKTPAFKGSP 
GPTPPKGGNNRKNPPPW 


11959 


25860 


A 


12067 


1 


365 


GTRLT VLWTAHLVAMAPGS RT S L.LLDFA 
LLCIiPWLQEAGAVQTVPLSRLFDHAMLQ 
AHRAHQLGIDT YQEDEET YI P KDHEDS F 
LHDCQTS FCFSDS I PTPSNMEETRQKYH 
LELLRISL.LF 


11960 


25861 


A 


12068 


174 


1 


PAWVMQHNPVSLFFCFFETESRSLTRLE 
CSGT I SAHCNVRLPGSSDS P VS PSRVAA 
RA 


11961 


25862 


A 


12069 


270 


375 


TRIKRCNGGRAQWLTPVIPALWEAKAGE 
SPEVRSS 


11962 


25863 


A 


12070 


197 


350 


KKIYIFLTINTIMDLIIPFLLDNILGFW 
LGAVAHACNPSTLGGRGGW I TRS 


11963 


25864 


A 


12071 


3 


378 


HEGQLPEPLKGLWTAHLLGMAPGSRTSL 
LLAXDLL CLP WLKEAVAVQT VP L S RL YD 
HAMLQAHRAHQIiAIDTYQE IEETYIPKD 
QKLS FbHE YQTS FCFSDSI PTPSNMEET 
LHKSHLEIiLiRIYIi 


11964 


25865 


A 


12072 


219 


1 


FSHPFPPGVFQTPLVFPRPWEVLGSPGG 
GGGPPKKWAGVQKWVFFFFFFFWEMESC 
SVAQAGVQWYDLGSPRA 


11965 


25866 


A 


12073 


134 


3 


VFGPPFFFFFFCETESHSVAQAGVQWRD 
LCSLQAPPPGFMPSC 


11966 


25867 


A 


12074 


175 


357 


MPINQPVKKMCVCVCVCVYIYIHTPLYI 
RVYMYMCMYICVYIHVYIWMCIYIHTHV 
STYVW 


11967 


25868 


A 


12075 


193 


350 


KIPHLTSLYHTQNYLKYCIGQAWWLTPV 
IPALWEAEAQGS PEVRSSRPAWPW 


11968 


25869 


A 


12076 


252 


1 


GVFGPFPKKGGFFWVKGFI.GTPQIFWVG 
VFFKKKGGALWVKGLLGKPPPPPLFFFF 
FETES CSVARAGVQWRDS VKKKITIARA 


11969 


25870 


A 


12077 


273 


366 


IFIYLFIYLFETESCSVSQAGVQWPNLG 
RLR 


11970 


25871 


A 


12078 


127 


2 


KKKKTLFFFFFFFFEMESRSVTQAGVQW 
CDLSSLQPPPPLV 


11971 


25872 


A 


12079 


206 


334 


HNRVTIVNNNIiIVHFKITKRCWTQWLTP 
VIPALWEAKAGGSSE 


11972 


25873 


A 


12080 


135 


2 


KMKRKFGEDSQIPKTLQTFFFLTESRSV 
AQAGMQWCDLGSLHSC 
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NO: of 
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NO: of 
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od 
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NO: in 
USSN 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


11973 


25874 


A 


12081 


147 


3 


ATTPGLPFIFYFFETESHSIAQAGVQCM 
ISVHCNLCLPGSSDSPVLV 


11974 


25875 


A 


12082 


106 


2 


YFIPINEGYCFIYLFFETESRSWQAGV 
QWCDLV 


11975 


25876 


A 


12083 


164 


346 


CPYKKRKKANQAKWLTPVIMLFGRPRRV 
DRPSSGVQDKPGQHGETPSLPKIQKKSQ 
AWWCA 


11976 


25877 


A 


12084 


186 


1 


KSLFSKGNPLFKGPPPPPFFFFFFFETR 
SRSVTQAGWGAVIMAHCCLNLPGPSDPP 
TPAPRA 


11977 


25878 


A 


12085 


98 


2 


CPFFRVNFFFFEMESRSVAQAGVQWRDL 
GSRA 


11978 


25879 


A 


12086 


154 


2 


SPTPEKGVWELPSPFFSWGPKIWGCI FF 
FLYETESGSVAQAGVQWRDLGS 


11979 


25880 


A 


12087 


326 


3 


KTSVITCGLPMGWSWGPFPKGPWWCKKF 
CVCKFWANQTLKKNLGAGRFQKPFSAAA 
FLEMGGFPFKFFRFFQGSQFFFFFFFET 
ESCSVTQAGGQWCNLGSLRPLPP 


11980 


25881 


A 


12088 


154 


3 


KEFSFFAPGGKQRGEIRSLRGPPPQVKP 
LFFFFFFETESRSVAQAGVQWR 


11981 


25882 


A 


12089 


174 


2 


SWKAILQYSLEIILYLPSFQIFLCFSHT 
HTHTHTHTHTHTLYSQIMVLLLPSLRKK 
GS 


11982 


25883 


A 


12090 


265 


30 


WF I VYS E I S QLWGLPS FNTIiGTWQS LS F 
I FVAEMRS CCVTQAGLELLAS SDPPVS A 
SQS ARIRAMS PSVAWVI PGRS R 


1 1983 


25884 


A 


12091 


257 




GSVMRMHTEEQYPENKVEQSSSGFIRPH 
LVYRVCFYLS VCLS VCLS IYLS I YLS I Y 
LS FFLS FFLLRKS LYLS I FLS FFLLS PR 
V 


11984 


25885 


A 


32092 


311 


404 


LYHQNPWLYSVFFFLETESCSVAQAGVQ 
WCD 


11985 


25886 


A 


12093 


331 


83 


GDDYKGARENSRDDKLFYVLIMMVVIQL 
HVFVKIHKTLQLCTGYILLCKLYLINLTK 
NNFQKPKTNSQKTLHIFHPDSTAANI 


11986 


25887 


A 


12094 


82 


2 


IiREPNLIiNPGGEGCNEARWCHCTPAL 


11987 


25888 


A 


12095 


255 


393 


HGILFSS FVCNLNQFLSQHWLMPVI PAL 
WEAEVGGSLEPRSSTLAW 


11988 


25889 


A 


12096 


80 


3 


PFFFFFFFFETESRSVARLECSGMI 


11989 


25890 


A 


12097 


367 


1 


PRVDKPTKMGKKQGRKTGNSKKQSPSAP 
PKERS S S PATEQSWMENDFAKLREEGFR 
RSNYSELQEEIQTKGKEVETFEKNLDEC 
ITRIPNTEKCLKELMELKPKAQELREEC 
RSLRSRCDQL 


11990 


25891 


A 


12098 


94 


236 


MWADYFCMSPS IDEGLKKKKKKKKKKKK 
KKKKKKKKKKKKKKKGGAP 


11991 


25892 


A 


12099 


410 


212 


NSLHPSTSLFRFSSHFHIKLISGQAQWL ' 
TPI J PVPWEAKAGGLLETRSSRLAWAAQ 
SDPISMPVCK 


1 1992 


25893 




12100 


140 


324 


NLGQAQWLMPVI PARPWEAEVGRSLAVR 
SSSNSPASASQSAGPTGPSHHNWLMFLQ 
LLTRL 


11993 


25894 


A 


12101 


247 


357 


KQIQGRAWWLTPVFSPLWEAKAGESPEV - 
RSLRPAWPT 


11994 


25895 


A 


12102 


245 


2 


DVLVGGLCLPAGGTLHLTS I FLLSGCWC 
GSKDEEAPCKQRISVQRESQSRTLRAGV 
S P KKAHP CEMCGL I LEDVLH FPDLQ 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L- Leucine, M=Methionine, 
N=Asparagine, P~Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Impossible 
nucleotide insertion 


11995 


25896 


A 


12103 


230 


379 


KEASFFETESHSVSQAGVQWRSLNPLGS 
GNPPTSTSRVAGTTGMSHHVWL 


11996 


25897 


A 


12104 


86 


2 


DGVS VLS PRIiECNGL I LAHCNLRL.PGSS 


11997 


25898 


A 


12105 


116 


2 


SQKTPFFFFETVSLLLPKLECNGAISAH 
CNLCLPGSSD 


11998 


25899 


A 


12106 


271 


416 


QFTFS KAFNIiVF I KPS FHIS FYRTHYS Q 
AWWLTPVI PELWEAE AGGS P 


11999 


25900 


A 


12107 


194 


3 


HVI/GTVLVAEDTASNKIRSLQGCL.LLPL 
LFSWLEVLARAIRQEKEIKVIQIGREE 
VNRRRGR 


12000 


25901 


A 


12108 


142 


3 


fcCLLYWDCKRHAWWRGQVRWLTPVIPAL 
HEAEAGGS PEVRS LRPAW 


12001 


25902 


A 


12109 


233 


346 


HQRS LI GWAQWLM PVI P ALWEAEAGVS P 
EVRSLKPAWP 


12002 


25903 


A 


12110 


234 


349 


GFSPFYDFCLFVFETESCCWQAGVRWH 
DL.GSLQPPPP 


12003 


25904 


A 


12111 


136 


333 


RLNFFYFYFFETASCSVAQAGVQWHDLA 
HCIPAWETEQDS I S KNTVQKRKKKVTRA 
GISKTKNENF 


12004 


25905 


A 


12112 


345 


110 


QDGLCLVMTLQETQPILAYSLWFPIVFP 
TTKACNVQGDSKFLLKKKYLGQVQWIJyiP 
VI PTLWGADVEGSPEIiRSLEPA 


12005 


25906 


A 


12113 


108 


285 


YNAMKNRFLKTILNNKNS IGWARWLMAV 
I PALWE AKAGRS PEVKS SRSRPGAVAQV 
DAA 


12006 


25907 


A 


12114 


1 


364 


RWAAEMGKFMKPGKAALDLAGRYSGRK 
AVIVKNIDDGTSDRPYSHALVAGIDRYP 
RKETAAMGKKKIAKRSKIKSFVKVHNYH 
QLMPTRYS VDI PLDKTWNKDVFRDPAL 
KRKARREAK 


12007 


25908 


A 


12115 


120 


3 


TQIWGAFLTLFFFFFFEAKSHSVAQAGV 
QWCSLGSLQA 


12008 


25909 


A 


12116 


120 


3 


TQ I WGP FLTFFFFFFFEAKSHS VAQAGV 
QWCSLGSLQA 


12009 


25910 


A 


12117 


231 


1 


FLSPFSCYSAITKLLSLSYFLLGHILLL 
LGSTPEAAAQWQWVSFADSDIVPPAST 
WVFPTLG I MHHNKQATENAKE 


12010 


25911 


A 


12118 


216 


1 


IjI FPPLLNFCYGEVLGFQGFLALFFACM 
FFWPRFS PKTFFFFFYE KEFHS AAQAGV 
QWCDLGSLQPLPRGFK 


12011 


25912 


A 


12120 


267 


1 


KKKKKAFPSPPLLGCFIRDQKHLLGILV 
YRSHLISSLLCLEGIILSLFIIATLITL 
NTHSLLANIVPIAILVFAACEAAVGLAL 
LVSIS 


12012 


25913 


A 


12121 


167 


372 


ISGQDLLKETKRVKRPFQQDDVPFINIF 
VPTPGAPRSLRQILELRGKTDPSPI IVG 
DFNTLFSALDRS 


12013 


25914 


A 


12122 


124 


2 


GLLKFNIFAPFFFFFSETESHSVTQAGV 
QWCDLGSLQPPP 


12014 


25915 


A 


12123 


121 


1 


RFTAAS ANSGAVSAAVGYMGDPGSE I IE 
SVPPAGPVSSW 


12015 


25916 


A 


12124 


176 


2 


RDIiRGFSRFSNYKFWNFGRVGGI FKFVT 
GKGVKRFFFFFFETESCSVAQAGGQWRD 
XiG 


12016 


25917 


A 


12125 


137 


2 


KLGFFGGGGGKIFAPQKNFFFFFFFFFE 
MES CSVAQAGVQWRD IG 


12017 


25918 


A 


12126 


103 


3 


LEENLGNTIQDIGRGKDFTSKTPKAMAT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, Impossible 
nucleotide insertion 














KAKID 


12018 


25919 


A 


12127 


212 


351 


LITFEI LN I CLGDNILDLGLARWLTPVI 
PALWEAKAGGS PEVRSLR 


12019 


25920 


A 


12129 


187 


3 


RGSTMQQTNTRTPGVRVLYYILMIATMA 
I P Y I PMANLL L>L I VP I L I AMAF IMLTER 
KILGY 


12020 


25921 


A 


12130 


13 


329 


ASRVTIRVMASNSTKSFIADAGYGEQEL 
DANSALMELDKGLRSGKLGEQCEAVVRF 
PRLFQKYPFPILINSAFLK1ADVFRVGN 
NFLRLCVLKVTQQSEKHLEKI 


12021 


25922 


A 


12131 


231 


330 


MDTDE I YLGRAPWLiTPVI PALWEAKASG " 
SPEVR 


12022 


25923 


A 


12132 


95 


2 


KGKCFPPFFFFETESRSVAQAGVRWRDL 
SSL 


12023 


25924 


A 


12133 


129 


1 


DLiAXiLPRLWCSAMIMAHCNSEIiMSSSDP 
PISASQIPGTIGMCH 


12024 


25925 


A 


12134 


196 


361 


TIMEYYAAMKKKSESPLLHFQGKKLNQD 
GGWAWWPTPVI PALWKAKTGGSLEPRS 


12025 


25926 


A 


12135 


120 


1 


PPPPGLjFFFFFFETESYSVAQAGVQWCN 
LGSLHPPPPEFK 


12026 


25927 


A 


12136 


235 


3 


KKGIR I ERDRLKQIiS VCQRFQLAKFPP F 
PSGVGEKQNFFFKKKKKKKGRAQWLTPV 
IPALWEAEAGGSFEVRSSRPA 


12027 


25928 


A 


12137 


185 


1 


IWCPFFCPGPQICSFRCHFFFPFFFFFF 
LEAETRS FAQAG VQWCDLGS LQ S S W VTE 
LAAAS 


12028 


25929 


A 


12138 


158 


1 


LRRGGVFS I FFCGGTMVLS PADKTNVKA 
AWGKVGAHAGEYGAEALERMFLSF 


12029 


25930 


A 


12139 


76 


3 


KVLARAIRQEKEIKDIQIGKEEVK 


12030 


25931 


A 


12140 


. 157 


2 


FFSSPRLLKRRPGNFLGARENFFFFFFS 
ETESPSIAQAGEQWRDLGSLQAP 


12031 


25932 


A 


12141 


234 


388 


LSQLMSANLFFVLLFEAEFHSVNRLECS 
GM I WAHCNLHLPGS SES PAS TS Q 


12032 


25933 


A 


12142 


112 


453 


LGRRQAASMREG I S I HVGQAGVHIGNAC 
WELYCLEHGIQPDGHMPSDKTIGGGDDS 
FNTFFNETGAGKHVPRAAFVDLEPTAID 
EVCTGT YRQL FHPEQLi I TGKEDAANNYA 
RG 


12033 


25934 


A 


12143 


44 


443 


AKIiGTRKLPLKAKMGKEITLINIWIGH 
VDWGKTTTTGHLI YKCGG I DKRTI EKLE 
KEAAEMGKGSFKYAWVLDKLKADRERGI 
TIDISLWKLDTSKNYVTI IDAPGHRDFI 
KNMITGTSHADCAVLIDAAGV 


12034 


25935 


A 


12144 


3 


386 


REAATMRECICIHVGHAGVQNGNACWEL 
YCLEHG I QPDGHMPSDKT I GGGDDSFNT 
FFSE TGAGKHVPRAVFVDLE PTG IDE VR 
TGT YRQLLHPVQL I TGKEDAANNYARGH 
YTIGKEI IDIiVLDRIR 


12035 


25936 


A 


12145 


3 


386 


AGATYIDRLRVALFEATTFTLWVRPELS 
SGEATTMRECMS IHVGQAGVQNGNACWE 
LYCLEHGVQPDGHMPSDKTIGGGDDALK 
TFFSETGAGKHVPRTEEVDLEPTE IGEE 
GTGTYRQLFHPEQHMM 


12036 


25937 


A 


12146 


3 


385 


GRATYIDRLRVALLEATTSTLRIiRRELG 
SREATTMREGI C IHVGQAGVQNGNACWE 
LYCLEHGIQPDGQMPRDKTIGGGDDFFN 
TFFIETGAGKHVPRAVFGDLEPTDIGE I 
RTGTYRQLLHPEQLI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \= possible 
nucleotide insertion 


12037 


25938 


A 


12147 


11 


381 


HTPEGRTR VP VL I AGWYCQE ALRD WGR 
VTAS S TGAMAFIiRSM WGVLTALGRS GAK 
LCTGCGSRLRSPFS F VYLPRWVS SVLAR 
CPKKPVSSYIiRFSKEQLPIFKAQNPDAK 
TTELIRRIAQR 


12038 


25939 


A 


12148 


321 


509 


YSMDQY FI PFYCQ I MHLCFFLKTRVSLL 
FPWLECSGATSAHCNFCLI.S S SNS PAS A 
SRVTGIT 


12039 


25940 


A 


12149 


51 


400 


AATMRECIS I HVGQAGVHIGNACWELYC 
IiEHGIQPDGQMPSDKTIGGGDDSFNTFF 
NETGAGKHVPRAVFVDLE PTVI DEVRTG 
TYRQLFHPDQLITGKEDAANNYARGHYT 
IGKE 


12040 


25941 


A 


12150 


25 


399 


EATTSTLGLRHQLGSRESTAMRECIS IH 
VGQAGVHIGNACWELYWLEHGLHPDGQM 
PGDQTIGGGDDSFDTFFSETGAGKHVPR 
AVLVDLE PTVI DE VRTGT YRRL FHPEQL 
ITGPEDAANNYAR 


12041 


25942 


A 


12151 


1 


402 


TKEEELCLACRQRRADGGWRSQMNAGSD 
PVGIVSAARTI IGSFNGALTAVPVQDLG 
STVI KKVLKRAT VAPEDVS E V I FGHVLA 
AGCGQNPVRQASGGAGIPYSVPAWSCHM 
ICGSGLKAVCLAVQS IGIGDST 


12042 


25943 


A 


12152 


3 


400 


TDRLRVALKKTTTS S FCFPHHLGSRETA 
TMRECIS IHVGQAGDHIGYACWEL.YCLE 
HSLQPEGQMPGDKT IGGGDDYFNTFFSE 
TGAVKHVPRAVFVDLEPTVI DEVSTGTY 
RQIiFRPERLI TS KEDAVNI Y 


12043 


25944 


A 


12153 


2 


398 


GR CAliDRNTF I FARFGC YIi I ASGHPGE K 
LMDMDMS PLRPQNYLFGCKLKAEND YHF 
IVANDENEHQISLTTASLRAGANDDMNI 
VEAEAMNYEGTPI KETLATLKMSVQATD 
SLGGSEITPPSVLRLKCGSR 


12044 


25945 


A 


12154 


203 


405 


FTCPSRI ICVISRRISPCCAPDIiNPMLS 
ANAMIiDFVFTVEDPGGWDSKNliQKKWSH 
YSFLTGIRPKI 


12045 


25946 


A 


12155 


"229 


397 


TKIAHHKAGFALISKKITKNLKMFLSKF 
IiPIHAIiWVTGSSGMQPYPLVWGHYDLGK 


12046 


25947 


A 


12156 


2 


399 


GGVPHCVWATAWGMRPGLPGPTGL.CAQT 
S SRGQKS VLKQKES CGIWQLYHFLSRKQ 
EPRWEPCVSGSSSGDGAVADIiADELRGY 
PALCCTI.PVHSYRSWAGIRPQIMNGPI.H 
PRPLVAIjIiDGRDCTVEMPIL 


12047 


25948 


A 


12157 


85 


414 


ALLPQSEAIjQGAVTMPHSYPALSAEQKK 

evsdialrieapgksilaadesvgsmak 
rlsqigventeenrllyrqvlfsaddre 
kkciggvi ffhetlyqkddhgvpfvr 


12048 


25949 


A 


12158 


85 


407 


GliPHSEPLQRAVTMPHSDPALYAEHKK 
DVSDIALRIGSPGKGILAAYESMGSMAN 
PLNQMGVENTKQNRRIiYRQDLFRAEDRE 
KKSIiAGVI FFHDTLYHKDDNGVP 


12049 


25950 


A 


12159 


85 


406 


VLLPHSEALEGAVTMPHSYPALSGEHNK 
ELSDI ALR I VAPGKGI LAADES VGSMAK 
RLiSQ IG VENTEENR PL YRQDLFS ADDRG 
KKSIGGDLFFHDTLYQKDDNGGP 


12050 


25951 


A 


12160 


275 


146 


EKTFI ISFFFFFETESHSVAQTGVQWHD 
LGSLQAPPPWFRRIA 


12051 | 25952 


A 


12161 


212 


2 


KRSRPFFFFFKEMGS CYI AQ VGMQWIjFV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Th reon i n e, V=Va line, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GTITACYDPELLGSKDPPTSASQVTGNT 
GTHTHTQLIFVFLV 


12052 


25953 


A 


12162 


253 


2 


GRVDSECCLANGNAEKS I FCCLLVYLFY 
LS S RQS HS VAQAGVQWHDLGS LQPPP PG 
FSLLSRVAGTTGARHHDWLIFVFLGET 


12053 


25954 


A 


12163 


228 


1 


LLGLWFEQLGRFTKIAKTKGERSQTQSG " 

HSMDMRVPAQLLGLIjLLWFPGSRCDIQM 

TQSPPAVSASVGDRVIITCR 


12054 


25955 


A 


12164 


122 


1 


RliLFFETESRSVARLECSGAISAHCSLR 
LPGSGSSPASAS 


12055 


25956 


A 


12165 


239 


350 


GQARWLTPVIPALWEAKASKSLEVRSLR 
LAWAGHGGS 


12056 


25957 


A 


12166 


235 


359 


TDKAIS KRDLS VLRE I IFFEMESRS VAR " 
MECSGVI SARCNXi 


12057 


25958 


A 


12167 


209 


396 


QFSGIPFDQYKCGFKNYIYIYIYIYIYI 
YI Y I YRKYTRAYTKR YMCVFS RATHN I V 
YLCAYI 


12058 


25959 


A 


12168 


327 


130 


GGVGFLGGPGEKPPRPSPFFFFFFEKES 
RSVPQARVQWPDFGSLPAPPFGFTPFLR 
ESFLFSSLMI 


12059 


25960 


A 


12169 


317 


422 


RGEKPLFFFLQKKKKKKKKKKKKNIKKG 
GRRYKPS 


12060 


25961 


A 


12170 


260 


375 


EKVTCSWTQWLTPVIPALWEAKAGRSLE 
FRSLRSAWAI 


12061 


25962 


A 


12171 


304 


3 


KFFFFFKGFFFLGGVGPIFPPPKKRFFS 
KIPPGVFFFPPLKKKIFFFFPPVILGPP 
RVFFKGAPLFFFFFFFFFFFSETEFRSC 
CPGRLECSVAVSAHCK 


12062 


25963 


A 


12172 


105 


3 


LIFLRQSSALLPRLECNGAISAHCNIjCL 
TGSSD 


12063 


25964 


A 


12173 


146 


17 


RIFFFFFLGQSIiAVAQAGVQQRDLGSLK 
APPPGVHAILLPQEY 


12064 


25965 


A 


12174 


234 


2 


LFPRKPKPSVSLSLSLSLSLFRLRKLSS 
FMRHNNIEVRPINNPAMTSKCSSERKSL 
VSFTIiNLKPGMIRLSEEGMIiK 


12065 


25966 


A 


12175 


292 


380 


VRGLGRVAHTCNPSTLGGQSGW I TGGQE 
F 


12066 


25967 


A 


12176 


109 


1 


GRI KKVI KG QAQ WLMPVI PALWEAKVGG 
SPKVRSSR 


12067 


25968 


A 


12177 


170 


291 


LINFYFYFLRQSHSVTQTGVQWCNIiGSL 
QPPPAARRRRRG 


12068 


25969 


A 


12178 


260 


380 


LT YIALFI FLRRSLTLLPRLE CSGMIS P 
HCNLCLLGSSDS 


12069 


25970 


A 


12179 


54 


166 


PKRGFHRVTQEGLNLL.TLGSPRLGLPKG 
WDHRQDPXiP 


12070 


25971 


A 


12180 


273 


359 


APAGHGGSCLiQSQHFGRLRQADHLGSGV 
R 


12071 


25972 


A 


12181 


27 


225 


IGQAQWLT PVILAFWEAKAGGS LEVRSS 
RPAWPIYFLVLLASYLRRFCLIQGHSDL 
LYTSSTRGS 


12072 


25973 


A 


12182 


1 


155 


VHKIFIAYKYSGFQTVFRGRAWWLTPVI 
P ALWEAKANR S PE VRS SRSGRRG 


12073 


25974 


A 


12183 


91 


2 


FNLFFFFFFETGSRSVARLECNGAISAC 
C 


12074 


2597S 


A 


12184 


122 


2 


GQAQWLTPVI PPPWEAKVGKSPEVRSSR 
RTRGRTRGRTRG 


12075 


25976 


A 


12185 


389 


20 


TDRGRRARRACFCGKVFDGELS FAIikiiA " 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














REMGRPDWRAMLAGMSSTEYADWHRFYS 
THYFHDVLLDMHFSGLTYTVLSLFFSDP 
DMHPLDFSLLNRREADEEPEDDVLMQKA 
AGLAGGVLFLI 


12076 


25977 


A 


12186 


180 


393 


LNPGSGGCGEPRSRHCTLAWVTERESVS 
KNKI KIKINKQ I EEGFLFQP FGQGFI WC 
YFPAKEFI IALSKT 


12077 


25978 


A 


12187 


37 


443 


PDFP I PFPPRKVQHLARPS PKTARPAWT 
IRRILPPPPKKRHPPGLARGKFGPRELA 
KAGLAKECPAGGPTPAVRFATPTRGGGG 
RVAGFQTPMG I PMGKARLGLLTFLGFAS 
GWIGAYRPRETLCGGELVDTLQF 


12078 


25979 


A 


12188 


3 


411 


AFPENAATGSTFQDPVPASKGRAEVGNM 
RLSVAAPISHGRVFRRMGLGPESRIHLL 
RNLLTGLVRHER I EAPWARVDEMRGYAE 
KL I YYGKLGDTNQRAMRMADFWLTEKDL 
I PKLVQ VLAPRYKD QTGG FTRMLQ 


12079 


25980 


A 


12189 


2 


414 


QEFGTRKRATFISLLFFFSSVYSRGVFR 
RDAHKSEVAHRFKDIjAEETLRALTLIAF 
AHYLHQ C PFADHVKLGNEATE FAKTCVA 
DESAENCDKSLHTLFGDKLCTDAPLRET 
YGEMADCCAKQEPERNECLLQHKDD 


12080 


25981 


A 


12191 


3 


514 


PRLLMEAGPHPRPGHCCKPGGRLDMNHG 
FVHH I RRNQI ARDD YDKKVKQAAKE KVR 
RRHTPAPTRPRKPDLQVYLPRHRDVSAH 
PRNPDYEESGESSSSGGSELEPSGHQLF 
CLEYEADSGEVTSVIVYQGDDPGKVSEK 
VSAHTPLDPPMREALKLRIQEE I AKRQS 
QH 


12081 


25982 


A 


12192 


182 


3 


RGLFFIPPPPIKTKTWGPFWKWFMGFG 
PPLFFFFEMESRS VAQAGVQWHDLGS IiQ 
ALP 


12082 


25983 


A 


12193 


53 


400 


PETPSWLGPVRRFYFI I IKSVWKWKKKK 
KKKKKKKKKKKKGGGPFKKTLGGDKFYW 
GVKKKI FFFLGGSKKHPWGFFEKKLFFG 
GGKCGAPPPKDISCLWGKKNFLGAIGEK 
TCCC 


12083 


25984 


A 


12194 


182 


3 


KGIFIQIAPPKKKKNWGPIjWKWFKGFG 
HP I F F F FEMESRS VAQAGVQWHDLGSLQ 
ALP 


12084 


25985 


A 


12195 


200 


378 


KSSKSQWLGYLFIGLLIYLFIKFFLETR 
SCFVTQAGILAHCNLKLLGSSDPPTSAS 
QAA 


12085 


25986 


A 


12196 


341 


54 


S LSFHGGLS VLCI FS TYKKLSQEQWLTP 
VISVLWEVKVSRSPQVRSLRPGDPPASA 
SQSAGITGVSHHARPHLSLNPRLEMIKL 
SEEGMLKRMTG 


12086 


25987 


A 


12197 


1 


462 


GGPPRPFRMKAAVLTLAVLFLTGSQARH 
FWHQDEPPQSPWDRVMDLATAYVDGLKD 
SGRDYASQFEGYALGKQLNLMLLDNWDS 
VTYTFSKLREQLGPVTQEFWDNLDKETE 
GLrHEMSKDLEDVNAKVQTYLYDFQKTW 
QKELT FTAE S EP VP 


12087 


25988 


A 


12198 


80 


1 


SNKVFFLETESCSVAQAGVQWCDLGS 


12088 
12089 


25989 
25990 


A 
A 


12199 
12200 


221 
239 


3 

336 


TMGIMLDKKE IQAI FS FKFKMGHAVAET 
TCNINNTSGPGTANKVTVQWWFKKFCKG 
DESLEDEEHHGRRLEV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 


M 
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od 


SEQ ID 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKKKKKKK 


12090 


25991 


A 


12201 


279 


366 


KKKKKKKKKKKKKKKNKKKKKKKKKGFK 


12091 


25992 


A 


12202 


153 


40 


GFLFLI FFFFETKSRS I I QAGVQWCHLG 
CTLAWATETD 


12092 


25993 


A 


12203 


317 


406 


LGKLKCNGAVSGHCNLRLPGSSDFSGSA 
S 


12093 


25994 


A 


12204 


2 


925 


RSGPDHELTLDSGKGCTRSMVPASAASE 
DRRKLPIIVEDEGGPTRSRACSSPARGS 
RPPPSAIGCSPVAQASDSAAGPARRTAL 
QSLSSWLGYQIDRHSVPVYVFKSPLFSV 
IMAPKHKSSDAGNLDRPKRSRKVLPLSE 
KVKVLDLIRKDKKSYAEVAKIYGKNESS 
IRE I VKKEKE IRAS FAVS P PTAKVTATV 
RDKCLVKMEQALHLWVEEMN RKRVP I DS 
NMLRQKALSLYQDFSKGCSETDTKPFTA 
SKGWLHRFRHRFSHHYKKKKKGIMAQVA 
VSTLPVEEESSSETRMWTFLVSALESM 


12094 


25995 


A 


12205 


272 


1 


KMARAGLLVIEGKVWRTVYYRFATREER 
EGKMSTNLMNKLDTIGFDNKKDLLISVG 
DLVDRGAENVECLELITFPWFRAVRGNH 
ERQPGQ 


12095 


25996 


A 


12206 


166 


478 


ISASGLLPTSPLTGTSKLQDPNEHLNLL 
MLNRVS LLL P RLE CNDTI LAHYNLRL PG 
SSNSPDS AFQVAGITES FTCE INALKDS 
S QVALWS FRCEMDHS S I CNR 


12096 


25997 


A 


12207 


102 


497 


PLLALLFGSQKTFLCLQLSLKPTRSSLL 
SPGSAGNPENEAPCPQLNPEATSLKKKK 
KKKKKKKKKKKKKKKKKKKKKARG 


12097 


25998 


A 


12208 


131 


2 


FGFLGGDKAKGKGPGPPFFFFFETESPS 
VAKAGVQWCDLGSLH 


12098 


25999 


A 


12209 


221 


499 


QTRHKLTNSRSGRTGLQILIKEQHSSGR ' 
WCGTGDILWQSLEDTICYSVPKSLLDFL 
LiFIYFETE SHFVTRLECSGAI SAHCNLR 
LLGSSDSPA 


12099 


26000 


A 


12210 


297 


398 


HNFFFFLKTES CS I SQAGGHWYDLSSPQ 
PPPPG 


12100 


26001 


A 


12211 


341 


452 


NWSPGLKQSSRLSLPKCWDYRREPPCIiA 
GLLAYLFRR 


12101 


26002 


A 


12213 


214 


468 


FFFFFFWGKGVFFFFPPLEGRGGNSVYW 
TPPPRGKKNFPPLGLGKMGNNPPPPPPR 
FFFFFFKKKGVFFIiGPGGVKTPSLRETP 
P 


12102 


26003 


A 


12214 


223 


477 


CPTETQLQLIFAREKHKPSPVLIjDLDIiY 
GFFFEMESHSVS PLECSGP I SAQSNLYL 
LGSHNSPA 


12103 


26004 


A 


12215 


263 


55 


TPKRYGKMQTATYLPKSKI YI I ISDLRK ' 
QDYIVYKQSKKPEGWAQWLTPVIPALWE 
AKAGRSPKVRSSR 


12104 


26005 


A 


12216 


328 


471 


GVIHAQVLHAHTHTHTHTCTHTQVHAHS 


12105 


26006 


A 


12217 


434 


1 


WDHSASPPTSVGTPLRADGPHPRbTAPG ' 

HWGSSAMLSAPAAGRRGLSLGRWCDRD 

KLSQRGPQPKWILWWGNLPGGGFLHRHI 

HKPLPTIPIPSTVRGEEVLQQLLHSASF 

ILiFIFIFIiDRVSLCHPGWSAVARSRFTT 

TSTS 


12106 


26007 


A 


12218 


304 


43 


EKPIGVAPSRVKKRAGGGEKTECCLLKG 
PLQGFQGVLKGPLKIiNGPLFFFFFLRRS 
LALFRLE CNGT I LA YCNLCLLGS SDS PA | 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SEI 


12107 


26008 


A 


12219 


171 


365 


ALVQPRAGREAVTVPACCS CSDVFLYET 
NKVARIQSINYGTIKFFHVIVFSYVSKE 
HLKRKLEK 


12108 


26009 


A 


12220 


171 


452 


ARYKFLHPKFAFGLYPS SSQ I QGARHQL 
W I L I KCS FYLLFLiRQCLTLL PQLiE CSNM 
ITAPLQPQLPGIjKQSSHLSIjPSSWDYRH 
EFP 


12109 


26010 


A 


12221 


323 


458 


FRVQRSLFIiFIYFFETESHPWQAGVQW 
HDFGSLHRISP 


12110 


26011 


A 


12222 


189 


56 


DRVSLVQPTLECSGTISVYCSLNLLGSS 
DPPTSASPVASTTGGI 


12111 


26012 


A 


12223 


246 


494 


FLCSSYAYDDHFLRPFFPIHLIIjDITAF 
SQDTI ISNLLFQDACSVPKKAANLGQAQ 
WLAPVI PVLWEAEVGRLLEVRSSRPAW 


12112 


26013 


A 


12224 


55 


353 


RIQGCSCWVKLCPKVEKWVQRTDAEQES 
QTKAE IQDMKQELSAVNMMDKFARSARL 
ERKINKMMNKLKTHVKVQTAQSGMLKWV 
ISVAFYKLPGTVIRLi 


12113 


26014 


A 


12225 


199 


23 


LKIiSSIYYVPGTILKYFTGWARWLLPVI 
P ALWEAE VGGS PE VRSSRPANS TS W I ER 
HQ 


12114 


26015 


A 


12226 


369 


14 


DCVRVGLSYPQTVCPSCESVRKAPFSSR 
PHRGVLESGQPRCKSRRVDIFFSMERER 
DRERERERERERERGQKRTRRVWGGERE 
KLGYSHAWGWRGGRGLGWGGMCGI PPPC 
TSDVHQ 


12115 


26016 


A 


12227 


206 


2 


QNGHMNGGPCRVSLSLFLETRSFLFFLA 
SLSMLECSGAIIiAHCNLRLPGSSNSPAS 
ASQVAG I TGACH 


12116 


26017 


A 


12228 


231 


501 


RSWEGQALHGS0PLASCTRIQSNYMALQ 
RINQELEDKLYRMGQHYEEE KRALSHE I 
VALNSHLLEAKVTIDKLSEDETVSAAHTW 
PMEDCR 


12117 


26018 


A" 


12229 


86 


455 


DLPTFPVPPHPGFSGPLLGQSELPSNCQ 
TGGRSHLSVSVLCPTLATTPLGIiDFPTC 
GTSQLPLGPLQLAHRSELAGLSLiPGLST 
NCTSQRLPDRPHRHSDAPHTHTHTQTHT 
HTHTHTPATVT 


12118 


26019 


A 


12231 


211 


96 


KSHFKAGGGGPPRYS PPFGGGGGGVFRV 
GGFPPPGLTQ 


12119 


26020 


A 


12232 


228 


3 


KRWFIKGFtiSKGLGPPPPKLKFLGFFFF 
FFFEQSFVLVAQAGVQLECSGTILVHFN 
LYLPGSSNYPASASHPRV 


12120 


26021 


A 


12234 


137 


3 


VPLPGTLKNHSMVPCFLFVCLFVCYETE 
SHSVAQAECSGAISAH 


12121 


26022 


A 


12235 


139 


1 


FKLCFGQAWWFKPI IPALWETDASRSLE 
VRS SRPADAWADAWADAW 


12122 


26023 


A 


12236 


80 


1 


VLLIFI FLEEMGFHHVGQDGLDLLTL 


12123 


26024 


A 


12237 


187 


494 


TERNSFNINKKDIHTKTPFVGYQHQRPK 
VDKTTKMGRNQSRKAENSKNQSTSSPPK 
DRSSLPATEQSWTENDFDELTEVGFRRS 
AITNFSELKEHVLTRRTE 


12124 


26025 


A 


12238 


252 


503 


CQGVGADDIjQSQAE I QCELCAES IDRFD 
I LFFFFLGLiGP PHKNPPPPT PQKWKPGK 
APPIALWDIPRGAKGPGRQPGRPAIHTC 


12125 


26026 j 


A 


12239 


2 


471 


RIALCPAVRIRHEERERERERERERETP 
GHTQLSPGARRTPPLERERAFNDCFSLN 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YPGNLCPGDLIEVFRSVYQHCALYLGDG 
YFINITPVYGI PAFFTSAKSVFFSRALE 
KTHLLKDWGSDTYRINYICYETLPSLS 
VEEI ITRSEFVIGQEA 


12126 


26027 


A 


12240 


14 


149 


FWTACRIRHEMGRGLLRGLWPLHIVLWT 
RIASTIPPHVQKSALPH 


12127 


26028 


A 


12241 


588 


671 


RSRPSFQEQAFESSQKYKEGKYIIELNH 


12128 


26029 


A 


12242 


384 


464 


RGGQVWWIiKPWPALWEAKAGRSLEAG 


12129 


26030 


A 


12243 


325 


459 


TYLFI YLFIYFETES CS VAQAGVQWRDL 
VSLQPLPNS PALPTN 


12130 


26031 


A 


12244 


323 


457 


SDQRECGPADTFFEMDSCSVAQAGVRWC 
DLGSLQNSP 


12131 


26032 


A 


12245 


241 


466 


GALFFFFFFFFFFFFFFFFFFLSNKAVY 
F 


12132 


26033 


A 


12246 


385 


37 


QARRRCLQTSRMEALGRWSYKVTIEFPL 
QILFFPPILKYKVIFGIKITPNFHFLEE 
YEEGQRLGKSTVS WVFNEDTRL INI WNR 
HITVNPRTNSQNRML.KMAHES KYSQVPF 
PLQA 


12133 


26034 


A 


12247 


318 


456 


LILKYFMYVCMYVHMYVCMYIKSCSVAQ 
DGVQWYDLGSLQPP P PGL 


12134 


26035 


A 


12248 


362 


473 


S PCQNNMNS AKTE ARTNI KFMAKLMWKN 
GAIIDALQK 


12135 


26036 


A 


12249 


310 


463 


FS IKTRKNALGKLQS IiLRCHQF YLYVCV 
CVCVCVCVCVCTRTYAILTCIAV 


12136 


26037 


A 


12250 


169 


54 


EAKSGQARWLTLVI PALWKAKAGGC PE V 
RSSRLAWPIW 


12137 


26038 


A 


12251 


49 


449 


GIPATSTSCVQVILLPQPPLLLGIiQDSF 
LPQEI I I KVEGEDTGSLTI PSQEGVNFK 
IVTVDFTREEQGTCNPAQRTIjDRDVIIjE 
NHRDLVSWDLATAVGKKDSTS KQRI FDE 
EPANGVKIERFTRDDPWLSSC 


12138 


26039 


A 


12252 


130 


428 


RWGIAMLPRLVLNSWSQVILPPQPQVGA 
TFGG FPNP PGHWKLLGG I SSG EEAG VEE 
AEEGQALGFLGQFPPSSPQLLLQTSWCL 
RTTRTCRWSLLWKAH 


12139 


26040 


A 


12253 


204 


54 


PGMHKTQKSDLGQVQWLTPVI LGLWETE 
AGDS PESRS SRPAWAKHKI VR 


12140 


26041 


A 


12254 


375 


470 


S LCLFFS FF FQTEPRC VAQ AGVQ WRHLS 
SLQP 


12141 


26042 


A 


12255 


432 


512 


GQVRWLTPVIPALWEAEGGRSPEARSS 


12142 


26043 


A 


12256 


232 


1 


KGALSKKTQS FWGGQ KFFPRPGVFKGVF 
FWGC3GFCFFVFGGFFFFFFFFETESCSV 
AQAGVQWCSLGSLQPLPPRFK 


12143 


26044 


A 


12257 


22 


123 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSSTRP 


12144 


26045 


A 


12258 


71 


362 


QTERNSISINKKDVHTETPSKGHQHQRP 
KVDKSTKMR KKQR KKAENSKNQNAS S P P 
KDHNSLPAREQNWTENEFDELTEIDFRR 
WVITNSSELKEHV 


12145 


26046 


A 


12259 


375 


3 


SPNIEAPPKAFFFFNQRHFVFCFFSCRA 
ENTHEIVWVKVYPFSNQSKAKIiFFSFFV 
QRDRS S PDS YLMPIiQLQMWDTAGQERFR 
S I AQS YYRSANALILAYGITCEES FRCL 
PEWLRQIEQYA 


12146 


26047 


A 


12260 


98 


1 


KYQIDLGGRGSSNSPASASRVAGITGTH ' 
HHTQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A<=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamtc Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibie 
nucleotide deletion, \=possible 
nucleotide insertion 


12147 


26048 


A 


12261 


125 


3 


L FL I Q KY I TGQERWLTPV I P AP WE AEAG 
RSPKVRSSSPAW 


12148 


26049 


A 


12262 


228 


385 


SCLYLQMIPSSIEKSQGMYKKATRGRAR 
WIiTPVI PALWEAGAGRSPEVRSSR 


12149 


26050 


A 


12263 


227 


369 


MLKHKSNKNliQDLGQMQGLTPVNPVLWE 
AKVGGSPEVKNSRPAWPTW 


12150 


26051 


A 


12264 


102 


487 


QQNRFFG'i'SKGRGTGRGVS S FPQLS PGS 
LELASS PS PSVGRLGSAASQRSRGGQER 
PKAQLRRPPRPSSSPRLRGRENCAHHAR 
TPGGILDRPPPTLGTNLLKGGRPAWMAG 
NPPEAHKTANGPGEQA 


12151 


26052 


A 


12265 


391 


476 


PRSYRVECSGSISAHCNLGLLGSSNPSS 


12152 


26053 




12266 


378 


476 


MS SRLGSIjE VGI LGRAWWLTP V I PAL WE 
AKEGG 


12153 


26054 


A 


12267 


166 


58 


NFFFFFFETDSCSVAQAGVQWHDLSHCN 
LCLPASSN 


12154 


26055 


A 


12268 


209 


49 


TPPNFYI KISRSGQAQWLTPVI PALWEA 
EAGRS PEVRSSRPAS PTRRNS ISNS 


12155 


26056 


A 


12269 


127 


1 


SSSFFPIiKTFGFItFFFLCFFESESRSVA 
QAGVQW CHLGSLQA 


12156 


26057 


A 


12270 


154 


27 


IQTPGKKEFFFFFFETKSCLVAQAGARL 
CPKKKEKENSNFVL 


12157 


26058 


A 


12271 


180 


22 


VLKIHNI GRAQWVTS VI PELWE VKAGGS 
QAIiRSSRAARTRWDACNFVEKTQVY 


12158 


26059 


A 


12272 


207 


422 


GDSSTRGADEKPKEGVKTDNNNLINLKV 
MGQDCSMEQFKIKSHT PAGYGGSHRESQ 
HFVMPRWADHLRSGV 


12159 


26060 


A 


12273 


59 


479 


NSLGGGGVYGSRFRFTFPGCRALS PWRV 
RVQRRRCEMSTMFADTLLI VF I SVCTAli 
LAEG I TWVLVYRTDKYKRLKAE VEKQSK 
KliEKKKETITESAGRQQKKKIERQEEKL 
KNNNRDLSMVRMKSMFAIGFCFTALMGM 


12160 


26061 


A 


12274 


354 


439 


VKTGECS I AKTWNQP KCPTTI DW I KKMW 


12161 


26062 


A 


12275 


366 


499 


TRAQYFWWGIGSQCVTLiECSGMIMAH 
CSLDLLGSSDPPTSAS 


12162 


26063 


A 


12276 


136 


37 


KF PGQVHWLKPVI PALWEVKAGGS PEVR 
SSRPA 


12163 


26064 


A 


12277 


419 


48 


DGKLNVSFLTVSSSGCFLIFLLIVRPPY 
FLRYNNSE IRP INNPTMTCKCSSKRKSR 
I CLTFNQKLE I XKIiS EEGMSKI KTGQNL 
G I IiCHVSQWDEQHKFVKATKS ATAKIN 
LSISKYIMSGA 


12164 


26065 


A 


12278 


190 


1 


RKGVFS PQMGPGLGKGKSPFLAKLNFPF 
KKTFFFFFETGSCFVAQAGMQWLDPSSP 
DAWADAW 


12165 


26066 


A 


12279 


276 


389 


WFNLSFFCFCFLFFETESCSVARLECSG 
AISAHCNLT 


12166 


26067 


A 


12280 


425 


1 


PPHPPPGKPPPPPRPELFFPKTKNKKER 
VLFFSPPPKKNFFSLTTPRFFFSPPQKK 
KKKLNPPPQKGPPPPVPIKPPPPLFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFPFFFFSFGI/rVRVIMTCWIilVDIGIiL 
I 


12167 


26068 


A 


12281 


253 


90 


I WPPQGS FKTAAPFFFFFVFLVEMGFCH 
WQAGVKLLTSSSPPASASQTAEKLF 


12168 


26069 


A 


12282 


61 


387 


PKSSSSAPFQISSWFSLPASTSASTMSI 
RVTQQPYKVFNSGPWAFSSSSYMSRPSA 
HI SSDTVSRVGS SSTS FQGGLGAGMGL.P 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

N=Asparagine, P=Proline, 

T=Threonine,V=Valfne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \-possible 
nucleotide insertion 














IiHS S LANRARLS Q KKKKRGGRS KE P 


12169 


26070 


A 


12283 


186 


305 


LHHSTVQVREEEHKSIoHTQVHTHTHTHT 
HPHLVIMSSHS 


12170 


26071 


A 


12284 


2 


190 


PRVRFDRVNNRSLNPL I S RTMRIEP I PE 
NPKFSVKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKGGGA 














LSKHCSYLPKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKK 


12172 


26073 


A 


12286 


158 


405 


GVASYAEQRKYFLEMEYFPGGEAISIVE 
MTTEDLHYS INLDDKAPTGFERVYSHFE 
RTCTDDKMLS YC I TCYR E I FCERNKL 


12173 


26074 


A 


12287 


364 


142 


GCAFFLGGPLKERRKRRAVPPLFFFNFF 
FETE S CS VTQAG VQWCNLGSLQPLL I DW 
WGQPGTVAHACHPGTLGG 


12174 


26075 


A 


12288 


171 


380 


VFEAGHAWLQENLTSPDFWGFSFFPTDT 
GMLEKKKKKKKKKKKRKKKKKKDR 


12175 


26076 


A 


12289 


194 


2 


GPRGFWQFWVTKGFFPKKGFVKFYPPAG 
GWFFFFPKRGFPFPSFLFFFDRVWLCHP 
GWISVARS 


12176 


26077 • 


A 


12290 


179 


416 


LIFLFTFACAVCFQLARFHFKIHIETAL 
RCLI I LS CP FTHPL YKM YREGQARWLT P 


12177 


26078 


A 


12291 


3 


408 


LAYGFHDRTTYDTPSALMMVLLVDELIT 

EKQEDLTTLVTLPVTLGEHQRQRETQLE 
AKLLKKRLELGSLLDLVEDLELI IQLKK 
KKKKKKS PLWGAP FKGPLGAP I 


12178 


26079 


A 


12292 


119 


278 


PKS I EAGVKS VLDHPLPNKAKTHLS CKK 
LKKKKKKKKKKKKKKKKKKKKKKKKKKK 


12179 


26080 


A 


12293 


119 


3 


MLFTGFHTKVMSIiNS ITGWAWWFTPVI P 
ALWEAEVGKS 


12180 


26081 










RKGGKLKKIFKKAQGQARWLTPVIPAFW 
EAKVGRSPEVGS 


12181 


26082 


A 


12295 


215 


1 


AFKI KLTMHI FS S PL CLPQDE FHPFI EA 
LLPHVRAIAYTWFNLQARKRKYFKKHEK 
RMSKEFHGARLEGNQ 


12182 


26083 


A 


12296 


185 


411 


DVPLMFHWLHLHSDFRI ISTE IGVETVG 
KRVLHIRILLLLLLLFFEMESCSVTHAG 
VQWCDLGSLQAPPPGFTPF 


12183 


26084 


A 


12297 


281 


388 


RILFFFFFETKFCSVPQAGVQGRDLNSL 
QPPPPGFM 


12184 


26085 


A 


12298 


178 


3 


TLLSVTQVPILPGSASSPLLFSRKDIKI 
PTSVFFS FQTEPRS VTQAG VSWCDLGS L 


12185 


26086 


A 


12299 


199 


450 


TLIRHMI CKTYFLLSHRSPFSSVDYFLCY 
REIYSLIKTLNKLGIGGFFLNIIKVMYD 
KPAAS I IRSSEKQKAFPLRSGTR IRC PL 


12186 


26087 


A 


12300 


3 


407 


E FKDHSTAMDTEPNPGTSSVSTTTSSTT 
TTTITTSSSRMQQPQISVYSGSDRHAVQ 
VI QQALHRP PS S AAQ YLQQM YAAQQQHL 
MLHTAALQQQHLSSSQLQSLAAVQASLS 
SGRPSTS PTGSVTQQSSMVQTS I 


12187 


26088 


A 


12301 


227 


3 


KRGTEGQTLFFPAPRARKSFPPPKMPKK 
VEKEFPKGFFFFFFFETESAPS PRLECS 
GAISAHCSLNPHASAHAS 



1280 



_016483SA2_L> 



WO 01/64835 



PCT/USOl/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucinc, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12188 


26089 


A 


12302 


292 


380 


KCYRWARWLMPVI PALWEAKAGGSLEPR 
N 


12189 


26090 


A 


12303 


13 


186 


VDERHIRLSEMAWVALYLLSLLWATAGT 
STQTQSSCCEYDVFPRRSGKQLPWGRGC 
DL 


12190 


26091 


A 


12304 


253 


409 


ITAVIAAAAAWGVTI IMTNFRSLPRLE 
CNGAI SAHRNIiLLLGSSDSPASAS 


12191 


26092 




12305 


153 


64 


GRAQWLMPIIPAIiWEAKAGGPLDVRSSR 
PA 


12192 


26093 


A 


12306 


265 


420 


GTLMSRNTKLQLCRMNKSRTIiGQPWWLT 
PVI PTliWEAKAGGS LEARCSRPAW 


12193 


26094 


A 


12307 


295 


1 


RPCLKNKKTSPHWKKKFANPLFGKAPVF 
PLDKKPFKPHYRES PRPLKKKNGPKGFF 
FFFFETESCSVTQAEVQWRDLGSIjQAPP 
PGSYHSPSKLIIKN 


12194' 


26095 


A 


12308 


247 


421 






PGGGALKKKKIFS 


12195 


26096 


A 


12309 


168 


3 


GQAIiLLLiCLT VAFS KTTWCHAP I LGWA 
WWLTPVI PALWEAEVGRSPEVRRSRP 


12196 


26097 


A 


12310 


117 


3 


VAYNIAVFRGGVSLLLPRLECSGVISAH 
CNLRLPCSS 


12197 


26098 


A 


12311 


256 


101 


EPPTPFFFFETGS CS ITQAGAQRRDHS P 
PQS PTPGLSNTPAPASRACESGR 


12198 


26099 


A 


12312 


146 


3 


FVALCIQLFPLVUVICFHKKHKTGRARW 
LMPVI PALWEAKAGGS PEV 


12199 


26100 


A 


12313 


207 


3 


VRVRTTGVIMALRGIiCFSVEESMTKDEL 
IARLRSLGEQLNRDVSLTGTKEELALRV 
AELKQELDDSR 


12200 


26101 


A 


12314 


282 


2 


NCLAE KVKFMVS AI S U2PQG I S I LFIiS F 
FFEKESCSVPQARVQWPNFSSLQSPPPG 
FKLFFCL.SPGGGGCSEPKSCRCI PGWQQ 
NETPSQKKK 


12201 


26102 


A 


12315 


298 


84 


KTPQPPVFALS KNFGPPKTAP I FFFFFF 
FETKS YSVAQAGVQ WRDLGSLHS SLGHR 
ARLHL KKRKRKS INL 


12202 


26103 


A 


12316 


124 


1 


S KGH FFFFFFSETES CS VAQAGVQ WHNL 
GSLQVPPPRFTPF 


12203 


26104 


A 


12317 


132 


398 


WTNAVFSIVTFSPSVCHSEVA1AAYKW 
LVCYLLRETYQKLNQEIKPPTLSVPKKK 
KKKKKKKKKKKKKKKDKKK 


12204 


26105 


A 


12318 


262 


392 


PHYCQNQPQARRSGSHQHFGKQRREDHL 
SLGVRDQPGQYSETP 


12205 


26106 


A 


12319 


168 


3 


EAPKAI SNCTTALQPRLQGKNLSQNKEG 
QVQCLTP VI PALWEAKVGGS PE VRSS 


12206 


26107 


A 


12320 


48 


385 


DLGLSGHSTIoASSFISIiLIiSYKKCLSP 
STM IMRLPLPCGS P I KPPFLPKKKKKKK 
KKKKKKKKKKKKKKTGG 


12207 


26108 


A 


12321 


218 


83 


NLSP I FPEGNFFFFFFEMESRSVAQATV 
QWRDLGSLQAPPPRRKF 


12208 


26109 


A 


12322 


197 


1 


KETWPFPPKKEQGLFSFFKKRIFFFFSK 
KIPSPPFFFFFFFETESCSVAQAGVQWH 
DU3SLQAPP 


12209 


26110 


A 


12323 


128 


36 


KGLFFFFEMGVSLLPRLECSGTTSAHCN 
LP 


12210 


26111 


A 


12324 


140 


3 


I LNSNLKP IKKNTFFFETESRS IAQAGV 
QWHDPSS KQPPPLGFKR 


12211 


26112 


A 


12325 


273 


369 


RKEWFLKKKKKKKKKKKKKKKKKKTKKK 



1281 



_016483SA2J_> 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartk Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Ljsine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














KKK 


12212 


26113 


A 


12326 


140 


415 


RTGLLGPGRLQGPPYGSREYKGSGUjDI, 
NASLLDPEVWAPSLSGPETKPPAHSPSD 
PGVHSSCSASQGSQGPGPLTPSSRIHEP 
KPPGPFFK 


12213 


26114 


A 


12327 


2 


506 


FVPLSETEAPGCSGSRPPELEPERSLGR 
FRGRFEDKDEQLEEEEELEEEEEEEEED 
MSHFSLRLEGGRQDSEDEEERLINLSEL 
TP YILCS I CKGYLIDATTITECLHTFCK 
SCIVRHFYYSNRCPKCNIWHQTQPLYN 
IRLDRQLQD I VYKLVINLEEREKKQMHD 


12214 


26115 


A 


12328 


1 


419 


FRVPSCARCQYLPLLRGASQLGYSGVRD 
PVEEAVCPFSDIiKLCAGRTTTLFKAVRQ 
GHLSIjQRFIiI.PFVW1jCPAPRSGVYRGRQ 
ASLSCGGLHPVGASWPRCLPTQASAMAG 
TPPPASLPPCSLISDCCASNERGSMGV 


12215 


26116 


A 


12329 


259 


414 


NMENS LRCVWVPKIAFVLFGAS LLSAHL 
QVTGFQIKAFTALRFLSEPSDAVT 


12216 


26117 


A 


12330 


302 


404 


SCYRVSVISQARWIiTPVIPALWEAEVGR 
IPEVGS 


12217 


26118 


A 


12331 


94 


375 


SRDMS PGLLTTRKEALMAFRDVAVAFTQ 
KEWKLLSSAQRTLYREVMLENYSHLVSL 
GIAFSKPKLIEQLEQGDEPWREENEHLL 
DLCPGWSAMA 


12218 


26119 


A 


12332 


7 


246 


NPIVDPSPCGGIRVRTPAGRGGPAIjRVR 
PETWEEAGEKMPS E SLCLAAQARLDS KW 
LKTDIQNCFITRKISLLPLFCHHL 


12219 


26120 


A 


12333 


150 


1 


YIYFFHLVRGLPGSGYDEYFWWVNTE 
S PSVTQAGVQWCHWSVQHKPP 


12220 


26121 


A 


12334 


34 


426 


EPGFLFVFFFLIRGKKTPLFFWGTRKNP 
KSGKPLFTPPGKSPSEPPKSRGGPLFFQ 
TRSRIiTPQS WFGGAPKPPLGGE P PKFWE 
AGLGGDPPTPRAPHRADQTLEQP I QAPT 
RI IQNEQWAIKPFQPMPGG 


12221 


26122 


A 


12335 


185 


3 


TVI TPLHFSLiGHRARLWSRVPAP PAENR 
SMEGGLGRAVCLLTG ASRG FGRTLAPLL 
ASHE 


12222 


26123 


A 


12336 


60 


316 


GWGPIPSTDVTVLSPQKPILLQGHERAI 
TQIKYNREGDLLFTVAKDPVSVGWRGSG 
RGGILLPGRWVDTPVLPLAGNQRASHCE 
K 


12223 


26124 


A 


12337 


3 


167 


SFRIQVQGNHTSKHYPKIQYSDQAQWLT 
SVIPALWEAKTGRSLEVRSLRAFRAP 


12224 


26125 


A 


12338 


238 


401 


RPLSVMCGRLTSIRVEDSGARCWFYLFI 
YETEFRSVAQAGVRRRDLGSLQTPPSG 


12225 


26126 


A 


12339 


354 


433 


GWAQWFMP VI PALWEAKAGVS PEVRS 


12226 


26127 


A 


12340 


238 


382 


LtilVYNMMLLPDAVAFTCNSSTLGGQGR 
RITRGQEFETSLANMVKPHL 


12227 


26128 


A 


12341 


101 


1 


KTKQTNKKTRSVGQTWWFTPI I PAYWEA 
QAGEL 


12228 


26129 


A 


12342 


276 


23 


GS CLLEGKLTNRKD IHTKT PS VRHHCQR 
PKIDKTTKMGRNQSRKAENSKNQSSSSP 
SKECSSliAATEQSWMPHDFDELREEGFR 


12229 


26130 


A 


12343 


122 


1 


YMGVNERGCGQS IFKSSLSSQLWWWAP I 
I PATQEAEAGE W 


12230 


26131 


A 


12344 


180 


360 


LHWYFFSVGDFLPPPPPPLDDSSALPS 
ISGNFPPPPPLDEEAFKVQVRAEVKVML 
GKS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12231 


26132 


A 


12345 


35 


411 


TMT I VDKAS ESS DPS A YQNQPG S SEAVS 
PGDMDAGSASWGAVSSLNDVSNHTLSLG 
PVPGAWYSSSSVPDKSKPSPQKDQALG 
DGIAPPQKVLFPSEKICLKWQQTHRVGA 
GLQNLGNTCFANA 


12232 


26133 


A 


12346 


278 


386 


IIYCYKQTI IGQARWLTTLIPSLWEAKV 
GGSPEVRS 


12233 


26134 


A 


12347 


111 


2 


KSQTGQVQWLTPVI PALWEAEVGGLLEL 
RSSRLAWA 


12234 


26135 


A 


12348 


215 


316 


LNKRLMI SFI KKJKKKKKKKKKKKKKKKK 
KKKKKKKGGPli 


12235 


26136 


A 


12349 


265 


404 


SNISYFLNNNFSKALSLNIKYMISQARW 
LTPVI PAL WEAKVGGL P E 


12236 


26137 


A 


12350 


372 


1 


P VFPLPP KKLGEQLPP PALRFLAVS PLP 
KAAHEQE I KEKVLAVHKNPI D P VYGFP K 
KGPTNFFLFPLFQRVI FLGAQKGVSWFG 
FYVKGLSLAPKLGGPPFFFFFETESRSV 
AQDGVQWCDLGS 


12237 


26138 


A 


12351 


318 


416 


QGRAQ WLTP VI LTL WEAKAGG S SEVRS S 
RPAPP 


12238 


26139 


A 


12352 


144 


1 


NGYAIVECWIiFCFVLFIiRQSIiSVAQAG 
VQWCNLGSMQPPSTSLVQAI 


12239 


26140 


A 


12353 


116 


1 


SGRYSFKKMKCGQERWLASVI PALWEAE 
VGRSPEVRSL 


12240 


26141 


A 


12354 


102 


2 


ETRYKKQPGGWERWLTPI I P VLWEAEVG 
GSPEV 


12241 


26142 


A 


12355 


112 


294 


LISPS CPMWPSPHGPTWLSRLISQHSPS 
NLKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKRGGGAF 


12242 


26143 


A 


12356 


138 


2 


SFFLKVFFFFFFEMESRSVAQAGVQWCN 
LGS PQAPPPGSRHCPVD 


12243 


26144 


A 


12357 


148 


394 


PGEPPGEKGEGGKGKKPGGRGWGEPPCR 
GGFKEKFPRGKEKAPPLKENSRENLGWG 
TKIPPLGKKKPPPPQEQEMILGPPQF 


12244 


26145 


A 


12358 


97 


3 


EKFSPCFVRARTHTHTHTHTHTLSLSLF 
GFH 


12245 


26146 


A 


12359 


204 


2 


KHYTPAACFAPCLPDEAPI IAAAKPATT 
TSEQKMAVPPKYANLGKSARNVFNKGYG 
YGLINLLKTKS 


12246 


26147 


A 


12360 


156 


391 


NRGLFKVEESFS IPCSVRCSIHPSALVS 
PTPPTTDTTNWALFFBTES CS IHQTGMR 
WRDLNLLQPLPPGFKRFSCLNL 


12247 


26148 


A 


12361 


179 


1 


ALKKLLIIDDNSHKTNIVKHFSFRNFTF 
LFLLETESRS IARLECSDANPTHCNLRL 
SGP 


12248 


26149 


A 


12362 


218 


391 


T I LKD CTFKM I CLSQAQWH I PWPATWE 
AESRGLLEPTS 


12249 


26150 


A 


12363 


2 


356 


TNS HVDNS I TQKPEFC I PR CCG YI VCQL 
QLVERAFI FNFFFFFLKRS FVLLPRLEG 
RGAIFRVTQEGSNLLTLGPAPLGLPKGW 
NYRGDHPGPAYFYFX»MGKRLtiQIQGGRG 
LKRNLL 


12250 


26151 


A 


12364 


94 


1 


KNPPLFFFFFFFESESCYVAQTAVQWHD 
LGL 


12251 


26152 


A 


12365 


237 


2 


YSSLLFKPAESAKGILRHHVTPKSTSAI 
SKILFCFLFFVETEFLSPGLKCSGAIKV 
HCSLNLSGSSNSPTSASQVPGS 


12252 


26153 


A 


12366 


340 


442 


GLiFVCFFVWFLETES CS VTQAGVHWCDL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine 
K=Lysine, L=Leucine, M=Methionine 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SSLQPS 


12253 


26154 


A 


12367 


208 


45 


EGGEFFFFFFSVFFFVRAGVL.PCCPGWS 
ETPGSSDPPALASQSAGITGATVNST 


12254 


26155 


A 


12368 


60 


443 


CSSCVSSKPQS PHLKKQGDRMGAHL I S G 
GCTEATAGKGMRKCLVAWSTCAGARPPS 
RNRGSQGHSARFQGPRLPTWRLPPPPP 
QAKPFCKKWMLGGPISIEPKPQHPPGPG 
KKKWAGKAAPRIiVPT 


12255 


26156 


A 


12370 


34] 


421 


SGWVRWLTPVTPAIiWEAEAGGSPEVGR 


12256 


26157 


A 


12371 


280 


393 


NTIDHYVTQWLTPVIRTLWEAEAGRSPE 
VRSSRSAWPT 


12257 


26158 


A 


12372 


268 


399 


TLCSLSSAES I IHSYPEFSCKDNYLMTH 
NEAYKACMRRAFHT PL 


12258 


26159 


A 


12373 


209 


468 


EAVPDETSHPSPADSFSTSSNIACGTHH 
LRRLLKLYVEEELEYYPRKIQCFHFSFF 
FFESRSLLLLPRLECSGS ISAHCNLCLli 
GS 


12259 


26160 


A 


12374 


381 


488 


T I TCL FQKCQRGRAR WLTP V I PALWEAE 
AGGSQGQE 


12260 


26161 


A " 


12375 


276 


395 


GHWKT KKLKL. I FVFFEMES CSVAQAGV 
QWHDLGS LQPPP 


12261 


26162 


A 


12376 


295 


472 


VYS S VAIxNTFTLLCNHHHQPS PEVLTFP 
NKKCRGAILAHCNIjHLLGSSDSPTSASR 
VAG 


12262 


26163 


A 


12377 


390 


511 


pgavahacnpstlggrggq itripsl 


12263 


26164 


A 


12378 


417 


511 


AHRSFFEMESCSVTQAGVQWRRLGSLQP 
SPP 


12264 


26165 


A 


12379 


133 


406 


KLSLNKRDYKKQVCKETKKKKKKKKKKK 
KKKKKKPGAQKKKS GGS PR 


12265 


26166 


A 


12380 


247 


345 


HMWSSQUJKGLINKKKKKKKKKKKKKKK 
KKKKKKKKG 


12266 


26167 


A 


12381 


156 


15 


NFFFFETESCSSVTQTRVQWFDLGSIiQP 
GKQEQNYVSKNKQKELIH 


12267 


26168 


A 


12382 


275 


1 


KIVFLKGPPLFFFFKKKFVFFLKVGFKG 
GLNF IKTFPPGGKP I SKKRQ I FFFFFFF 
ETEPHSVARLECSGTISPHCNLHLPSSN 
DSLASTS 


12268 


26169 


A 


12383 


43 


379 


LLGYDESRSLSWICLCLSCLGQLLSFLN 
LL VYVFCQTCEVC FHY F FQ YFS TLFLLT 
FWDSNVLNIRHFG I VPQVLDDLVFFSPN 
KEIFLGQLQWLMPVIPAFWEPEAGRSPE 


12269 


26170 


A 


12384 


303 




KGPFFFFGLSPFFFLKKVFFFPPFFLRT — 
PPFFFFPPFFKNFFPPPPKKKNPFFFFP 
PPPFFFFFFFFFETRSRSVAQARMQWGD 
HGSPQPQPPGSSDPTRP 


12270 


26171 


A 


12385 


141 


1 


KKEPLRAPPPFQLFCFFYFETTPCSVTQ 
ARVQWCNLSSIiQPPPLRYK 


12271 


26172 


A 


12386 


31 




CRLADSPSPNDTGQDSRGRAGXKHTPPL 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKSGGGA 


12272 


26173 


A 


12387 


291 


412 


LSSEMLLFCFVYFLRWSFTFVAQAGVQW 
CDLGIiLQPPLPG 


12273 


26174 


A 


12388 


310 


I 


MTPCP FLFFCPKKRKAGGGF I KKALFFK 
PQKEVFLGPPPKGPFFLGFQPTPPWGKK 
IPRLNPLFWRSS IFFFFLLDGVSRLLPR 
LECNGVI SAHCNliRLLGSS 


12274 


26175 


A 


12389 


117 


2 


KRNPQRGGPLFFFFFFFETESRSWQAG 



1284 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, J=Isoleucine, 
K==Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S- Serine, 
T=Threonine, V=Valine, 
YY=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AQWHDLGSLQ- 


12275 


26176 


A 


12390 


117 


319 


SFLKKPKDSTKOTjIjKXiINEFSKGSGPICI 
TIQKSVAFIYPNGKHSEKKIREVITFTI 
AAKTI KCPPQA 


12276 


26177 


A 


12391 


95 


2 


SKSWTSLKLRTLGWARWLTPVIPALWEA 
KAG 


12277 


26178 


A 


12393 


127 


395 


IPGSQGFCLI KQS ALHLLNKS S FS FLS F 
IETQSLSVTQAGVQWCYLSSLQPPPPRF 
KLRQKNRLNS GGRG CNKLRAHHYTPGWV 
TERDS 


12278 


26179 


A 


12394 


252 


391 


FFCFCFVLLCFFEMESCSVAQAGLQWPG 
LGSLRQGFAMLARLVSSS 


12279 


26180 


A 


12395 


275 


3 


TFRLVILKKS KETVEINCLETKSHSVAR 
AGVQ WCS QDWWEEKAGFNWRPS KARRCP 
TSVLKCHLGWAQWLMPVI PALWEAKAGG 
SPEVRS 


12280 


26181 


A 


12396 


111 


3 


GNNVklRGLRIRPLKGWACWFTPVIPAF 
WEAEAGG 


12281 


26182 


A 


12397 


124 


1 


VSRGRTVHQEKPTERAASPRLECNGTIS 
AHHNLCLPGSSNS 


12282 


26183 


A 


12398 


259 


387 


YQQESFSCLFIFIFLFFIFYDTESRSVA 
C3AGMQWHDLGSLKPP 


12283 


26184 


A 


12399 


103 


1 


KGRFFFFFF YETE S RS VAQAG VQWRDLD 
SLQAPP 


12284 


26185 


A 


12400 


308 


416 


CGNNFEKAGRVRWLTPVI PALWEAKAGG 
SLEVRSLR 


12285 


26186 


A 


12401 


92 


1 


HTWGLTQWLMPVIPALQEAKVGKLFEPR 
SS 


12286 


26187 


A 


12402 


265 


388 


GLSWLFRRPGHLFYFI FFETMS VS ITQA 
GVGCCDLGSLQPP 


12287 


26188 


A 


12403 


279 


1 


SLSSKMESGSLTFSTIDIWGWI ITLSCC 
PGHCRMFSSIRGLYPLElASSTTSSSCNN 
QNSPDMGTCSLVJSQLLRRLRWEDCLNTG 
GRGCSEPGS 


12288 


26189 


A 


12404 


387 


152 


NPPPGFNFGGPLKKNFFSPPRGEKFVFL 
KRPPPFFFFFFFFFETESHSVAQAGVHW 
RDLGSLQALKSFCDIiVSAFKGF 


12289 


26190 


A 


12405 


262 


464 


IIFHWIiACILVIHSSFDGHGLFLPLAVA 
DSAAVKHSCTSICLNIEYIPGNGIAGSC 
GISVLNFLRNC 


12290 


26191 


A 


12406 


204 


67 


KCFGQAWWIiMPVIPALWEAEAGRSQELR 
SSRPAWAKELLNNRTRAS 


12291 


26192 


A 


12407 


134 


1 


IQRPCIQLFSTFFLILFYFIFTEIDSCS 
VAQAEVQWHDFGSLQP 


12292 


26193 


A 


12408 


145 


403 


TYMYSLINYYKANSHVTKVNKHNTARFL 
ES LL CAHPNP P I KL 1 1 SLLLRE KKKKKK 
KKKKKKKKKKKKKQK 


12293 


26194 


A 


12409 


65 


415 


RNKRI ISQSWNGPSRKPQSSALLCGLGH 
LTSGVRARVSPVQGCLVRIKGGSGPSKP 
KKKKKKKKKKKKKKKKKSSKKKKAQKGG 
ALKKK 


12294 


26195 


A 


12410 


124 


404 


PWWEFLHHRNCQMWPGTVAHTCNPGTIiG 
KKKKKKKKKKKKKKKKKKKSS S SSS 


12295 


26196 


A 


12411 


164 




TGAGLTIMDMVGVGPDLLQIiRFCFVLFC 
FLVFFFETESDSVAQAGVQWCNPSSL 


12296 


26197 


A 


12412 


114 


3 


PGMVAHACNPSTLGGQGGW I TRGQVYKT 
GLAKTVKP 



1285 



SDOCID: <WO 0164835A2_L> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidme, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


12297 


26198 


A 


12413 


583 


391 


LADTGFHHVGQADLKPLTSCSTRL.SLPK 
CWDYRCEPPHLAVPDTVLSTRGIiISEKS 
FLiPLCTFR 


12298 


26199 


A 


12414 


280 


2 


KVS S MNAQTSVL FSTMLGISETL KYRGR 
KKRRERGREGGREGRERKDECRKVRSER 
I EGGKKRGRKEYREEGRRR I E VSANEET 
PRDHPANK 


12299 


26200 


A 


12415 


264 


352 


RFLVHGCDI YIC INI YTHTHTHTHSHTH 
T 


12300 


26201 


A 


12416 


359 


485 


TDARMDWCYQEGTGSQAWWLTPVI PAFW 
EAEAGGS FE VRS LR 


12301 


26202 


A 


12417 


171 


1 


PLFRWWGFELFPPFGYQKKKGYPPFFFF 
ETEPCCVTQAGMQWYDLRSLQPPPPEFK 
R 


12302 


26203 


A 


12418 


265 


1 


WHKSNEELAAEASAPVKARASNT ILGPH 
THGKERCPVSLRNVLDEGVKSTNFFSFF 
ETDSHSVTQAECSGAISAHCNLHLMGSR 
DSGA 


12303 


26204 


A 


12419 


278 


388 


SSLTGFFIRRRNEEPGQHGETPCIiLiKIQ 
KliAGCGGVL 


12304 


26205 


A 


12420 


205 


413 


WNLFVYLYQTHKYARVYSYNI ILFTNKK 
RAIGQAEWFTP I I PAVWEAKVSRLPEVR 
SVRLSLPKWRITS 


12305 


26206 


A 


12421 


103 


2 


RSL.SKHRTETILGLPAAVLI I LFP PLL I 
PTSKY 


12306 


26207 


A 


12422 


372 


478 


NI FVFLKE TPCQ S Q WliMS V I PTLWEAEA 
GGSLEPR 


12307 


26208 


A 


12423 


174 


416 


KMHYCVLS AFL I LHLDTVAL I LS TCSTL 
DMDOFMRKRIEAIRGQILSKIiKLTSPPK 
NYSEPEEVPPEVIS I YNSTRDLLQE 


12308 


26209 


A 


12424 


572 


733 


RFSCLSIiPGSWDYKNCLNSGSRGYSEPR 
SHHC I PAWVTEQDS VS KQNKTKQS A 


12309 


26210 


A 


12425 


123 


2 


GYIFIFIYFFETGSCSVAQAGVQWRRLG 
SLQLPPPSSSDS 


12310 


26211 


A 


12426 


224 


417 


TADF YG VNY I S I KLLP KKKKKRGG P FKE 
S KFTAAGLQKNI FFLSAPNFI ALAWLK 
RRDWETPG 


12311 


26212 


A 


12427 


115 


3 


FFETGSHPVTQTGMQWCDCNSLQPLTNR 
LKQSSHLSL 


12312 


26213 


A 


12428 


250 


398 


PWAGHLTSLGLSLPI FNIGIAGQACWLM 
P V I PALWEAEAGRS LEVRSS K 


12313 


26214 


A 


12429 


215 


3 


KKGLWWGANKLWVFTPQKFFLNQFSGW 
AL.TPRGKSWFSKRRAPAPFFFFFFWETE 
SHSVAEAGMQWHDL 


12314 


26215 


A 


12430 


345 


2 


FSHHPAKPQGGLFYRQPLGLEGKETRPR' 
LNDRFMAGPSFPAQQGLRMPFLGKEWER 
EGCSFVQCPVEGFAMWPYPLGSPPGPRL 
FFFFFFEAGSPSVTQAECSGSITANCSL 
EL 


12315 


26216 


A 


12431 


348 


3 


L YS SLGDRVKLRLE KTKTNKWCLHS YI S 
TCKIINLDHYLIPCTKTNSEWIKYLNVR 
TNS IKLLEED I SVNLHELGSDNS FIAMT 
PKAEIIKEKIYKLHFIKIKIFHVSEDII 
EK 


12316 


26217 


A 


12432 


160 


52 


NTAIGH PHQA I YLS YL S 1 1 YLS I YLS I Y 
LSIYLSIYL 


12317 


26218 


A 


12433 


180 | 2 


GRSKLHI CREHSICAIEHVCGCDTENNT 
NLCQVQWLMPVILAIWEAEAWRSPEVRS 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla)anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
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LRA 


12318 


26219 


A 


12434 


193 


3 


PGAENPPPKGGPPFFFPFPRGFPPGVFF 
LKKKGFPFFFFFFFDTESRSVAQAGVQW 
HDLGSLQ 


12319 


26220 


A 


12435 


321 


422 


NVEGDFLQRIKNIRYWPGAVAHTCGPST 
LGGRGG 


12320 


26221 


A 


12436 


116 


2 


HDRFCLFVYDRVSTLSPRLECSSAISAH 
CNIiRLPGSSN 


12321 


26222 


A 


12437 


302 


400 


THTHTHTHTHTHTHSGLIiYYLPKGKQSH 
EFCW 


12322 


26223 


A 


12438 


329 


399 


LAIiQARLECNGI ISAHCSLCLPY 


12323 


26224 


A 


12439 


310 


2 


KPILSRGLFCARPCFRRSNMRS FNPSQS 
YQVGKDTFGFTVKVTKLKVLGPSRGPKG 
ARQGVHKGKCIiTLPKMLRFLFFFETES 
CSVTRLECSGAILAHCNL 


12324 


26225 


A 


12440 


151 


401 


EGMPHLPGSHSSSNILKRGRESGREAGR 
KGRKERRKEGRRERRKEEEKRGRGKIRR 
RKDRGENEEEENNRIiSSKVSIiEISrCP 


12325 


26226 


A 


12441 


276 


1 


FHPFGRPSVSPRVQPKIGII IHTCVARI 
EKGGSCKAIjSLCPQYVGSNFFFFFETRS 
FSVAQAGVQRCDLGSMQPPHFPGS9DPP 
APASRVAG 


12326 


26227 


A 


12442 


140 


1 


NSHFPKNIiGSPSSFFFFFYIjETESRSVA 
QAGVQWLDL.RSLQPPPPG 


12327 


26228 


A 


12443 


310 


3 


IjPPCSWGLYGAFARCPIiMDDKQLiSQVPL 
WACPIiSTAGRTRLC VACI KAAG KAQG FF 
FFFETESHSVTQAGVQWHDLGSLQPPTH 
AKPKHTHTHTQTHTIPDT 


12328 


26229 


A 


12444 


337 


2 


GFFAIVPKKGQKKMFSPFPKEGIFSGPC 
FISSTPTLLGVKKILPPNTSEIRPVLSS 
I INTSVHIREQKKTTYVGIFTGKSRFFF 
FETDSRS IAQ VGVR WRHLiGS LQ AP P PG 


12329 


26230 


A 


12445 


112 


9 


GRVRWLTPAIPALWEAKAGRSPEVRNSR 
PAWPTR 


12330 


26231 


A 


12446 


172 


3 


I PGELIiIGRAKFKAKNWPWAPRGI PLGR 
QKFSFFFFFETESRSVAQAGMQWLNLGS 


12331 


26232 


A 


12447 


221 


418 


RKAERLEVFFRRLCQRALRPFPHCLAAA 
PMPLIVLKKPILGRAWWLTPVI PALWEA 
EAGGWLESRS 


12332 


26233 


A 


12448 


280 


392 


PLGRVRWLTTVI PAHWEALAGGS PEFRN 
LRPAWPDMA 


12333 


26234 


A 


12449 


410 


129 


PS QE I FP YPGPPGKTP F PLKKKKKNRGG 
GGGLPFPPPQKVKKKKFFYPGRGRFKQP 
KFNPLPPPRGEKKNPKPPPPKKKKKKEK 
KRKEKKLVT 


12334 


26235 


A 


12450 


159 


1 


KKKRGGFLGLTFIKKTGPDPFFFFFETE 
SRSVAQAGVQWYDLGS LQAPP PGLM 


12335 


26236 


A 


12451 


270 


1 


NPLKFFFFSLFQNPIiKKDPAIFNPFFFF 
FPRIFWPFFGGVSKAGIWFFFFPFFFFF 
FFFFFFETESRSVAQAGVQWRDLGSLRT 
RGRTRG 


12336 


26237 


A 


12452 


249 


33 


DKLVKLPL I LKKLN S FFFF FETE S RSVA 
QTGVQWRKIjGSLHP p p pgf kpnawayaw 
AHAWWPGLPAQNLR 


12337 


26238 


A 


12453 


190 


86 


TNDLGWTQWLTPVIPALWDAEAGRSLEL 
RSTRKA 


12338 


26239 


A 


12454 


318 


2 


PPTFPLKTLFFPVFLPKLFSTLFSPKKK 
FSNFLVSAPPFFFLTLGNSQGVLIjKGGP 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylaianine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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FFFFFFFFETESRCVSQVGVQWRDLEKK 
EKKKCNSNGGRHYISTHLKYK 


12339 


26240 


A 


12455 


204 


288 


IQLNTHTHTHTHTHTHTHTHTNETGKMW 
I 


12340 


26241 


A 


12456 


409 


13 


QGGAPPI P I MLSFLQEAGLLHVAQPG VK 
LLTSGDPPSPPSKSAWRQRQETRLNLGG 
RGRSEQRPPPRPPAWATQQDSVSKTNKQ 
TKNNKKIRKSIjCKKTLEKEMLAVMI ITK 
YLLCTDVPSTVXiCTHLYYLT 


12341 


26242 


A 


12457 


204 


3 


KKKSPTPQSAKFPLGGGVFFFFLVGMRS 
PFVAQAGFKPLGSSDPPILASQSTAITG 
MNHHTS SAFH 


12342 


26243 


A 


12458 


289 


3 


KARE KKKVGATKNIR I VDLS KGRTS LCK 
KTEMPPVGNRNKQKPQNPSNLRISTRPE 
TESRSVTRLECSGAISAQVILPGSSDSP 
ASNWRQSETLS 


12343 


26244 


A 


12459 


261 


388 


SWHFGRSKQVYCLSLGVRNQPGQNPVST 
KTTKI SQAWWHAPW 


12344 


26245 


A 


12460 


313 


391 


RVGRVHWLKP VI PALWEAVAGGSPE V 


12345 


26246 


A 


12461 


112 


7 


HICIYFFETESRSVAQAGVQWHDLSSIjQ 
PPPAGFK 


12346 


26247 


A 


12462 


389 


3 


ALHEGAARGS pp pggkkkkrgspfpmgn 
qapqvpgklgggfgfwffkkslplqpgg 

KPGVFYKPKKVPTPKDPPPPTIjWGGWGT 
TGGP PGPLFFFFFFFETESRS VTQAGVQ 
WRHLGPLQPQPPAHAS 


12347 


26248 


A 


12463 


26 


349 


CIMIDSTTGIFIHCWWECEMAHPFWKAV 
WQFLFFFFKRGVYFFFPGGIKGPGFGLR 
EPLPPGIKGVPRPNPLGGGDLRPPPPPP 
INLGFFKGKGVFMVWPGVGNFLI 


12348 


26249 


A 


12465 


187 


54 


GYKNKPGQAQWL.TPVIPALWEAKAGGSP 
EVRSSRPAWP I WRTRG 


12349 


26250 


A 


12466 


199 


55 


1 1 QLRQQNS FLFI CFD I ES CS VAQAGVQ 
WHNLSSLQLEVPGFKHAPSH 


12350 


26251 


A 


12467 


107 


3 


FAHCXil GNSFFS FFFFFFE VGS CSVAQA" " 
GVQWSD 


12351 


26252 


A 


12468 


120 


1 


GNWCRAQWLTLVI PALWEAEVGRSPEVG 
SSTRLGLPTHRP 


12352 


26253 


A 


12469 


105 


2 


YYLYFIFFLFFLETGSCSVTQAGVQWKD 
LRSVTS 


12353 


26254 


A 


12470 


238 


405 


FHI CLFILKNQVSKMYTHTVS CSLLE 1 1 
YCCLLQWLTPVIPALWETKVGGSLEVRS 


12354 


26255 


A 


12471 


425 


1 


LNPAPAIFFGGPKKKNFFSPPRGDKFFF 
FKRAPPFFFFFFFFFFFFQKKKKPFFPA 
QKKGRGFPGRALSQWPLAGFLKITQFPP 
WSRVPFSPLL.FFFFDMESPSVAQAGVLW 
HDLLHLPGSSDPPTPASRVAGITGVCHH 
A 


12355 


26256 


A 


12472 


122 


2 


QNKYSELRMNTFGRPQWFTPVI PALWEA 
EAGGS PEVRS S R 


12356 


26257 


A 


12473 


144 


420 


GAPRPISRPRGKTPGGGGKRGKPNKNGL 
GENPFFPKGVKGKNPPI FFPFLGQNGPT 
PLVLGSANMGFYLYLiKRKGPGKGREAIiR 
AEPGQPFY 


12357 


26258 


A 


12474 


80 


1 


KGGQVQWLTPVI PAFWEAEAGGSPEV 


12358 


26259 


A 


12475 


54 


294 


LIKSWQHRQEYVSLMSFJFFPCHIWYQS 
QRS PGTTQI I LE I SDTGKNKYFNCTGSH 
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Predict- 
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ding to 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12359 


26260 


A 


12476 


364 


234 


EEMGFHHVSQDDLNLLNSWSACLGLPKC 
WDYRSETSRLASLPF 




26261 




12477 


288 


450 


PGTVAHACNPSTLGGPGWSRTPGLiKRI I 
CLGLPKCWDFRRELLPSLPCLLI FL 


12361 


26262 


A 


12478 


271 


396 


TISFLSFFFLFLFFFFFFWRKSGRESRF 
SQKKKKYKKKNIYY 


12362 


26263 


A 


12479 


289 


2 


GSCLLERKLTYRKD IHTKNPSVPHHHQR 
PKVDKTPKTGKKQGRKTGNSKKQSASPP 
PKECS S S PATEQK WMENDFAKL REKGFR 
RSNYSELQEEI 


12363 


26264 


A 


12480 


323 


1 


YSMVYPYVFLTAKKDQLQVNNTQLTCKS 
YQLYHCINHSTLQTHNISTLMILGHIPG 
LCIPVNIjSEAWAATPAIiHFVKLLLTELT 
HHVCRALDII ILAIVSLVALITS 


12364 


26265 


A 


12481 


279 


429 


MNGQRKCVYVYTLYTYTTEYYSTIjKRRK 
WPGTMAHACKPNTLGGHGGWTA 


12365 


26266 


A ' 


12482 


308 


423 


LTFFKNITGFFFFFFETEFCFVPQAGAQ 
GGDLSSLKAP 


12366 


26267 


A 


12483 


272 


474 


MVSWGLSRAKIPWVLSFFFFFEKKSHFF 
FQGGGRGGDLSWEQPIjPPRSKGFSCFTL 
PSTRDYRPAGP 




2626S 




12484 


84 


229 


L FKS VCHLL PLS S CS S HVRQACLP FTFC 
HDCKCSEASPAMLPVQPAEL 


12368 


26269 


A 


12485 


392 


3 


TGKKKRFPFFPKKKI FNSKPQFSWRKKK 
LFKIFSPPGGAPFPPGVQKKKTFFFFYP 
FSPRTPFFSIFFPGKGPPKGGGLGPPFF 
PQKRGGGFPNSKWETWEKFPPLFFFFFF 
ETESHSVAQAGVQWHDL 


12369 


26270 


A 


12486 


76 


1 


KLI S WTWWHMP WPATWRAEAGDFS 


12370 


26271 


A 


12487 


204 


8 


FFI FFFFFFFFFFFCFKTGSHSVAQAEQ 
EWHDHGSLQPQPPQS FHLSLPCATMPGL 
FFLFLFLFT 


12371 


26272 


A 


12488 


96 


3 


KKPLGQAMFFFFFETESCSVAQAGVQWC 
DL 


12372 


26273 


A 


12489 


126 


1 


KPPHPFLPFFFFFETESHSVAQAGVQWC 
YLGSLQAPPPGFTP 


12373 


26274 


A 


12490 


107 


2 


IAGRAWWLMIiAI PTLWEAEAGGSLEPRS 
SQPTCAT 


12374 


26275 


A 


12491 


302 


2 


GGFGFFPPREKGGFFQTVLFGVPPGFFS 
PPVFKTGPGVFFLGAQKKKI FFPPPGGK 
IWFFLRGAPLFFFFFFFFFFFEMTSCSV 
AQAGLQWCDLGSLQHP 


12375 


26276 


A 


12492 


248 


398 


PTEQVTLG I TAQS YSRVHIWNR VYDLAV 
GSGHPDGAAAI FCGS FVQRLKSY 


12376 


26277 


A 


12493 


341 


1 


KTSHFREIYPLEHASSLSKKIETGALPC 
SQELLNSQENSVMNKFPCLNQIjPFQDVN 
SCI IRFQTKDTLKHTAIQPEAKTLSLPC 
YTHTHTHTHTHTNNHS IFEliLCMQCDSY 
N 


12377 


26278 


A 


12494 


95 


3 


PLFFFETKSCSVAQAGVQWQNLGSLQPP 
PP 


12378 


26279 


A 


12495 


62 


420 


CAAKLCTEAHTAASADTHTS PHGS VSGL 
FCFHFPPHRREQRSRPGRKPGAHRIAGR 
ALSQKPVGSGATPHNLHHQ IRTQTNGLI 
QLLGDRQAPWVTPVIjPVLWEAEAGGSLE 
ARSLRPA 


12379 


26280 




12496 


58 


492 


NSPPP PAPS QRTS PPAAAAS PTTGSSSA 
PCPASSSWPRSSIALITFYPDQPCGLSF 
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K=Lysine, L=Leucine, M=Methionine, 
N=Aspnragine, P=Proline, 
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W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NNLAWKQNVLCFKERLLEKMFRRIQVLG ' 
LLVWTLIAGTEYFRVPAFGWVMFVAVFY 
WVLTVFFLI I YITMTYTRI PQVPWTTVC 
LCFKR 


12380 


26281 


A 


12497 


2 


198 


RLQG I LS AG FLVPGDTSVS S PRPRAAGS 
QSSTSRVLWKLSGAMLRLYVLVMGVSAF 
TLQPAAHTG 


12381 


26282 


A 


12498 


71 


346 


GMFMTSKETFTHYQPQGNSDPAHTATAP 
GGLSAKAPAMTPLMLDTSSRKLVAWDGT 
TDGAAVGI LAVAADQTSTTLTFYKSGTF 
RYEDVLWP 


12382 


26283 


A 


12499 


122 


59 


HQIYIYIHTHTHTHIYIYIYI 


12383 


26284 


A 


12500 


220 


3 


PKKESVRCWAMKWILIjILCQTSLHWSL 
PQSSLAATPCCLNLGTVDVLQGIiGAVAH 
AWNPSTLGGRDGWITS 


12384 


26285 


A 


12501 


125 


1 


HVXiVLSHFLFVLFLFFETESRSVTRPKR 
SSAVSAHRNIRLS 


12385 


26286 


A 


12502 


373 


476 


ILTRDGVS LLPRLECSGT I SAHHNUCLL 
GSGDYP 


12386 


26287 


A 


12503 


440 


573 


STWEDHLSLGNRGCSEPRLHLCTPTWVT 
E 


12387 


26288 


A 


12504 


146 


5 


HPHLQTMNGAP I PQELTPLLEKERDGLR 
CRGNRSPVPGI PPRWSRDL 


12388 


26289 


A 


12505 


140 


44 


AQGLTPVIPALWEDEVGRSPEVRSSRSD 
WPTR 


12389 


26290 


A 


12506 


247 


420 


LEISVTFSCSFGCLPKKHIHTHTHTHTH 
THTHTHTHTHT YI RT I YTTLARPLSALH 
NF 


12390 


26291 


A 


12507 


346 


473 


KAY I IGLNCFYETKSHSVAQAGVQWRDL 
NSLYS 


12391 


26292 


A 


12508 


87 


414 


RTGVYRVGKDGRS RS PDLLI CP PLiGLPK 
CWDYRREPPRPACIjPLLSHPSPPPHFS f 

LS pfls fslpcs p flyi ffpppllpisl 

RWRLI PEGRFMAP1.CLQMPLGHDT 


12392 


26293 


A 


12509 


386 


467 


LLLFFEMESHS ITRLECSGAILAHCNL 


12393 


26294 


A 


12510 


344 


472 


KFLiP FDPAVSLtiG I YPKES KS FYQKDTC 
TRMFIAIIiFTIAWTW 


12394 


26295 


A 


12511 


424 


132 


RGEGPPRSGILSPVGPPGETPPFLKKQK 
I TRGGGGGP LFPPLKRVRGENS FP PGGK 
SFHGAKFPFCPPPWATKRNSVSKKKKKQ 
TKKGSR I I LYNNE 


12395 


26296 


A 


12512 


351 


510 


GGTD F CKQS KLQS LQ PAVE I F I PGRVQ W 
LTPVISALWEAEAGGSPEVRRSRPA 


12396 


26297 


A 


12513 


362 


463 


NTITWLGAVAHACNPHTLGGPGGRITWG 
QEFETS 


12397 


26298 


A 


12514 


137 


1 


EEISLLLPRIiECNGAITAHHNLHLPCSR 
NRRPGKAVHVRQAYS fq 


12398 


26299 


A 


12515 


251 


592 


GAFTGL1AFTMAGGRPHI1KRSFSI I PCFV 
FVESVLLG I VILLAYRLEFTDTFPVHTQ 
GFFCYDSTYAKPYPGPEAASRVPPAL.VY 
ALVTAGPTLTILLGELARAFFPAPPLAG 
PV 


12399 


26300 


A 


12516 


3 


413 


SWGGRKFLCPPRSLSGSRELHPAQGDRP 
GPLS S SGKRETGTHRETLGKKKVSSAPE 
AQGAGLRLS QALGGLCYDLS PLTEPRLP 
LAATAFPRPCPALPHPQPRVTMGSVSSL 
ISGHSFHSKHCRASQNKXiRKSSHLK 
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K=Lysine, L= Leucine, MNMethionine, 
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Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12400 


26301 


A 


12517 


115 




KSHGGHGVEVLVSVLVQIiAMEVGLVXiWV 
KEYDGAALA 


12401 


26302 


A 


12519 


444 


34 


YCIiRQKHTEYRRRSNLEMASSCLSGRKT 
CTFFTLYQNLKMI KLSEEGTWQAEWLKS 
RPIAPNSKWEAKEKFLKKI KSVTS LNT 
YILRKPNSLIADMENILVIHIEYQTSYN 
IPSSQRLIQSKALCIjFNYIjEMECRI 


12402 


26303 


A 


12520 


1088 


935 


ILI I F INYRCEFLRSKKS S EE I TQ YIQS 
YKGFVDRTVMYNSNFFLKFLDYL 


12403 


26304 


A 


12521 


2 


347 


AMAAGSRTSLLLAFALLCLPWLQEAGAV 
QTVPLSRLFDHAMLQAHRAHQIAIDTYQ 
EFEETYIPKDQKYSFLHDSQTSFCFSDS 
IPTPSNMEETQQKSNLELLRISLLLIES 
WLiE 


12404 


26305 


A 


12522 


50 


205 


WGESDRQSSGFWASSAHACNPTWEVET 
GGSLEARSWRLQCTMFASVKSTRP 


12405 


26306 


A 


12523 


105 


2 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLYER 


12406 


26307 


A 


12524 


279 


455 


KTKNWIiALGGPPCFPGQYGETPSLLKNE 
KIiAGHGGAPLYSRIjliRRliRRGNSLTPGD 
RGC 


12407 


26308 


A 


12526 


305 


454 


LYFLLFPFQI INRMVLFFVFCFFFETES 
HSVPQAGVQWRDLGSLQAPPPG 


12408 


26309 


A 


12527 


363 


459 


CT J. IjGQAWWLTP VI LALWE AEAGRS PE V 
RGSR 


12409 


26310 


A 


12528 


1S2 


36 


QHI WFHKEHGQVRWLTP VI SALWKAEA 
GRSPEVRSSNAAWHACRNSA 


12410 


26311 


A 


12529 


388 


530 


KS FUNF FPRDG I LNC VPQAGLELGS S D P 
PASAFRVAGTTSVWHHSQI 


12411 


26312 


A 


12530 


134 


1 


GHKKGVLFFFFKTESRSVAQAGVQWCTL 
GSLQPPLPAHATRPRV 


12412 


26313 


A 


12531 


230 




KKFGFFPQAGPQGGLHIiFTTIiAPWGQAI 
SQKREKPFFFFFFETEPHSVARLECSGT 
ISPHCNLHLPSSNDSQASTS 


12413 


26314 


A 


12532 


145 


3 


KKLEILSAI I PPKFRKEI FLFFFYETES 
CSVAQAGVQWRDLGSIiQAP 


12414 


26315 


A 


12533 


365 


2 


LFFFTSLIWQITLGFLILNCPCIPGVCI 
YIQFANTFRKFAFUFTNEIAYNILILPI. 
TKQDYTNLIKLIKKHSFFSNFLKQLLFV 
FCFVFVSLFAMESCSVAQARVQWRNLGT 
LQPPPPRFK 


12415 


26316 


A 


12534 


313 


475 


FFWSCSSPQQGFGGGITI III ILRWILA 
IiSPRLERKWRDIiGSLQPSPSGFKRFF 


12416 


26317 


A 


12535 


124 


3 


MGSLIMGAELSVYRQVCDCKPQGVCVCL. 
CVCVCVCASTCM 


12417 


26318 


A 


12536 


333 


58 


ACNPS KVRS CRPAWSNMVKS CLS KNAKI 
TKBWWWANFFFVFVFLVEMMFHHVGQLS 
LKLLTSSDLAASASQS YGI TGVNHYAQ P 
ARTRGSFR 


12418 


26319 


A 


12537 


214 


3 


SDPRRACRCPKEAPEQQRRLPGGSVPAL 
SIFFFFKTESCSVPLSPRLECSGVISAH 
CKLRIjPGSCVPPAS 


12419 


26320 


A 


12538 


146 


1 


MNFLAFTNPPRPPQQILKPLLFILRRES 
HS VAQAGVQWCDLS SLL P PA 


12420 


26321 


A 


12539 


195 




IF I PPQAQKRGDPFLS F FFFETRSPS PR 
LECSDAITAHCSLHLPGPGEPPTPPIPK 
SWDHSHVPP 


12421 


26322 


A 


12540 


151 


334 


LLGRIiRHKNKZiNPGGGGCSGT I MALCNI 
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jng to first 
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peptide 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M— Methionine, 
N— Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LLPRSSSSPTSASKVAGITGACHHACCL 
RYFLY 


12422 


26323 


A 


12541 


314 


402 


NLNSGPAWWLMPVI PALWEAEAGGS PEV 
G 


12423 


26324 


A 


12542 


339 


3 


GSCLLEGKLPNRKDIHTENPSVHHHHQR 
PKVDKPTKMGKKQNRKTGNSKTQSVSPP 
PKERSSSPATEQSWMENDFDELREGFRR 
SNYCELREDIQTKGKEVEIFEKNFERV 


12424 


26325 


A 


12543 


31 


365 


RSAVMPTEDRATWKFNYFLKI IQLLDDY 
PKCFIVGANDVGSKQMQQIRMSLRGKAV 
ALKGNNTMMRKAIRGRLENNPALEKLLP 
HI RGNEGS VFTNEDLTE IRDMLLGNKV 


12425 


26326 


A 


12544 


278 


410 


FS FKNVTAGRVWWLMPVI PALWEAEVGG 
S PE VKRSRTTLANMVK 


12426 


26327 


A 


12545 


88 


276 


EVPQAHRKLPSAPQLRPliQAISTKGPCP 
PQSPQPSPRPEGSSAHSRGEKRTRERGK 
EEKRKKK 


12427 


26328 


A 


12546 


251 


3 


PVEDNSKRRFQTGEEKVTQEEQEALTPW 
RTNFFCFSDRVSVAQARVQCCDSGSLQP 
QLPGSSDLPTSASQVAGTTGMRHHAC 


12428 


26329 


A 


12547 


106 


2 


DEHLPFLTS FETESCS VAQAGVQWRDLG 
SLQAPP 














GFFKKEPHPRPCKKKTPPKLKPFWGGLN 
KKIPLSFVFSKKGPLPKKFFFFFFFFET 
GSCSCHPAGVQWCDHGSLQ 


12430 


26331 


A 


12549 


218 


3 


VPPPGLVPGLPQIIPKGFPKTKGPPKGP 
PPGDKQKSGPPNKKNFFFFFLRRESHSV 
AQAGVPWCDLGSLQP 


12431 


26332 


A 


12550 


276 


24 


GPLKMVEAPPQNPLSPKNKIFFFFFFET 
ESHSVAQAGVQWRNLRLPGSSDFPASDS 
RGAGITGARPHVQLI FVFLVKTGRKRR 


12432 


26333 


A 


12551 


331 


440 


KSASSWSFNSKGWGWWLTPVIPALWEAE 
VGGSLEVR 


12433 


26334 


A 


12552 


136 


1 


RHTGS KTTATALVD VFLLL FFETE SHSL 
SRLECSGTTPGHCNLYE 


12434 


26335 


A 


12553 


84 


1 


I FFCLFS FEMESHS VAQTGVQWCDHGSL 








12554 






HAS GPE ELS KDPSIi VS QGQPHRKPGLKR 
CSSRPLGPPDKLGGEGKQGLLGFTLWLS 
GP I KPCDDEEKNKKKKKKKKKKKKKLGG 
GPFKKKLFFPPGGGRNFFFLGAPKFFWA 
GRF 


12436 


26337 


A 


12555 


257 


417 


KQLHLLQGRLFS PS FLPS I S KLFFFEME 
SRSVTQAGVQGCDIiASLQAPPPGFT 


12437 


26338 


A 


12556 


268 


403 


YMRLSFQDLVMFKDVAVDFSQEEWECLN 
SYQRNLYRDVILENYSN 


12438 


26339 


A 


12557 


286 


426 


DMLI KTCCHL VCEKKKKKKKKKKKKKKK 
KKKKKKRGGALKKKPLPPP 


12439 


26340 


A 


12558 






PLKEIANRHMKKCSSSLAIRQMQIKTTM 
RYYFTPVTMA 


12440 


26341 


A 


12559 


106 


1 


FFFFFFFFFFFYFFFFFFEMESRSVAQA 
GVQWCDL 


12441 


26342 


A 


12560 


118 


2 


KNYSNNLKTNAGRAQWLTPAIPALWEAE 
AGKSPEVRNW 


12442 


26343 


A 


12561 


196 


2 


MCCFSKNWQEGAQMTPGFEPPQDNSEKT 
KLSDTRLLFFFFERESRSVAQAGVQWRG 
LG SLQAPP 


12443 


26344 


A 


12562 


128 


3 


RAPPFFFFETEFRSLPRLECNGAISAHR 
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location 
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acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Iso)eucine, 
K=Lysine, L=Leucine, M=Methtonine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














NLRIPGSSDSPAS 


12444 


26345 


A 


12563 


277 


370 


WSFTLVPQAGAQWCDLGSLQPALSRFKR 
FSR 


12445 


26346 


A 


12564 


55 


251 


AMVIPTVPFNITINSKPLGHISFQLFAD 
KFPKTGENFHTLNNKDKGFGS CFHRI I P 
EFICQGDDF 


12446 


26347 


A 


12565 


129 


3 


KYSVLRPGTVAHTCNPNTLGGRGGQITW 
GWEFETSLANVAK 


12447 


26348 


A 


12566 


172 


1 


PPPKKKKPRPPKKKNFFFFFGAISAHCN 
LCVPGSSDSPASPSRVAE I TGSRCHAEP 
Y 


12448 


26349 


A 


12567 


257 


380 


GICRPLLGGVLQSGATGVRDPLEEAVCP 
LiAELKHCASRSTA 


12449 


26350 


A 


12568 


120 


r 


GVFLFCFVLFETESCSVARLECSGMISA 
HCNLRLLGSSNS 


12450 


26351 


A 


12569 


250 


2 


IGKPKTPQFWFLKTKRGGPNAAFSFKKF 
FFQLKPPPRVFLKI FSCQKKI FFFFFFE 
TESRSVPQARVQWHDLGSLQAPPPGF 


12451 


26352 


A 


12570 


338 


3 


VSTPEKNFVFHTRGFSKKSSKYKVCNFF 
FQKKLG S P PFS LKKVFPGKP P YS CCSGR 
VSTAPYFLNPGPSTKILPPFFGPLGPEE 
KFLS FFF F FEME CRS VI QAGVQ W CDLG 


12452 


26353 


A 


12571 


5 


556 


I CCCLCFKINI FQLHFNKINFFCVTRSG 
PVTQAGVQWCNLGSQQPQPLSSKQS 


12453 


26354 


A 


12572 


115 


2 


DRVSLLVIAHCNLCLRGSSDS PASASRV 
AGITGARHD 


12454 


26355 


A 


12573 


279 


377 


GQVQWLMPWPAFWEDEAEGLLQPSSSR 
PAWAT 


12455 


26356 


A 


12574 


117 


1 


KTPLFFFFFFFEMESRSVAQAGVQWYDIi 
GSLQAPPNGFT 


12456 


26357 


A 


12575 


253 


374 


SLPGMWHACNPS PLGGRGRR I TWGRKF 
GTSLANMAKPCL 


12457 


26358 


A 


12576 


130 


2 


KFPGPFLiGFFFFFFEMESRSFAQAEVQW 
RDLGPPQAPPPRFT 


124S8 


26359 


"A" 


12577 


132 


3 


RVPGPPLLFFFFFFEMESRSFAC2AEVQW 
RDLGPPQAPPPRFT 


12459 


26360 


A" 


12578 


3 


370 


LRKNCLNLGGRECS ELRSKHcTPAWVTE 
RDSVSKKKTLSVLMEKWPNFPLRPLALS 
RKKFPRGPFEQVTHLVKEWSLTKPCCA 
EGADLDCYDPRTSPLFAKSWERNFPFPV 
HPGPVEGCPK 


12460 


26361 


A 


12579 


131 


2 


FIYFLFYFLFliETESLSVAQAGVQWHDV 
GSLQTPP PRFTPTRA 


12461 


26362 


A 


12580 


202 


2 


RMRKNQHKRAENSKNQNTSFPPKDHNSS 
QTREQNWMENEFDELTEVGFRRWVITNS 
SNLKEHVFSC 


12462 


26363 


A 


12581 


109 


2 


KIFFFFFFETESRSVPQARVQWHDLGSL 
QAPPPGF 


12463 


26364 


A 


12582 


105 


375 


LIMILLPLQILVHSGTFCFLFFGFETEF 
HSTPRLECNGVMSAHCNLCLPVKTSPAS 
TSQIS 


12464 


26365 


A 


12583 


106 


3 


RPIFFFFFFQTESRTVARAGEQWCDLGS 
IQPPPP 


12465 


26366 


A 


12584 


484 


700 


S S S WNRAFS RKKDKT WMHTPEALSKHF I 
PYNAKFLGSTEVEQPKGTEWRDAVRKL 
KFARP I KKSEGQKKKK 


12466 


26367 


A 


12585 | 301 


1 


TQPKVRTQMKNNI FEHAFLiSSTFPWPLP 
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T>Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibJe 
nucleotide deletion, \=possible 
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NLPADFYSATSQTKPDILQSMQKAFKNS 
GIGQAWWLIPAIPAALWKAEVGGLLEPR 
NLRPAWVYGKTLSLQK 


12467 


26368 


A 


12586 


76 


379 


PLHPPASCPAPSLSCLIPGSPSDPLFSD 
SARPGWGRGWGRAPQLPPSPPFLYNLIK 
KWSRFCHQ KKKKKKKKKKKKKKKKKKKK 
KGFFFWRAPPFFFFGGGFFS 


12468 


26369 


A 


12587 


109 


376 


LEFFKMSGWPTAPEQPAGEMENQTKPP 
DPRPDAPPEYNSHFLPGPPGTAVPPPTG 
YPGGLPMGYYSPQQPSTFPLYQPVGGIH 
PVRYQ 


12469 


26370 


A 


12588 


323 


1 


KKFCPPRFFPPLFGFQKKGTNPISPFSG 
GFKKNIjGGFNPPPRPPWSPQIGFFLGRS 
KELILGLFPTRIGIGGFFSFFFFCETKS 
HPVMRLECSGVI SAHCNLRLLVS 


12470 


26371 


A 


12589 


262 


I 


GDPPTTSGPQTNQPKEHLMNFKSDSQLY 
EDTLAGRS VL I KNLT PQTLQPRWTGP YL 
VIYSTPTAVRLQDPPHWVHRSRIKLCPS 
EKK 


12471 


26372 


A 


12590 


284 


374 


TESRNWG WAQWLTP VI LPALWEAEAGGS 
PE 


12472 


26373 


A 


12591 


297 


387 


GQAWWLTSVIPAFWEAKEGGSPDVRSSG 
LA 


12473 


26374 


A 


12592 


206 


407 


VKPQNPRKAQKLKGL.VYIH I YVY I YVYV 
YIYIYIYMYICRYISIiTWYTVNSKEKG 
LDTAAHTCNSR 


12474 


26375 


A 


12593 


222 


379 


LTTGS I MGNFS FL I L, V Y TKK VMGS VQ WL 
TAVIPLLWEAEAGGSLGSRSSRPA 


12475 


26376 


A 


12595 


301 


189 


KKIQSQAQWLPPI IPAIiLEAEAGRSLEL 
KNSRSAWAS 


12476 


26377 


A 


12596 


209 


3 


S VKPLF F I N YQS Q WLFLFFRGRLKRSG 
TISAHCNLHLPGSRFSCLGLPKCWDYRC 
APPTLKIDLTKD 


12477 


26378 


A 


12597 


306 


3 


NGKKGFYMAPEPPFFFHFFFFSPLFPTL 
FFFFIFPLPFCFALSFLVGSSSLSPRLL 
CPMQHYF CDAWNTFDALi I WGS I VDIAI 
TEVNVSTWRLSLTVRV 


12478 


26379 


A 


12598 


1 


249 


NTGAQCPLEFSIQELQLLFLGGGEDGVS 
LLLLRI.ECNGAISAHCNLRFPGSGDPPA 
SASRVAGITGACRDHTCEPPRPVPPSS 


12479 


26380 


A 


12599 


228 


404 


RLSSLSPVTEFGCLSPPDLILKCDPWPG 
TVTHTCNPSTLGGRGGWITRGQEFKTSP 
VNM 


12480 


26381 


A 


12600 


101 


580 


LSLTKNCAIiLGEETMMEQEMTRLHRRVS 
EVEAVLSQKEVELKASETQRSPLEQDIA 
TYITECSSLKRSLEQARMEVSQEDDKAL 
QLLHDIREQSRKLQEI KEQEYQAQVEEM 
RLMMNQLEEDLVSARRRSDLYESELRES 
RIiAAEEFKRKATE CQHKLLK 


12481 


26382 


A 


12601 


464 


3 


CA I C3QTLYEHPMKS S R LG P TQLK I FTCE 
YCNKVFKF KHSLQAHLR IHTNEKP YKCP 
QCSYASAIKANLNVHLRKHTGEKFACDY 
CS BTCLSKGHLKVH I ERVHKKIKQHCRF 
CKKKYS DVKNLI KH I RDAHDPQDMYCGR 
SRGSARMSRALPS 


12482 


26383 


A 


12602 


140 


1 


WIGSFIiSSEEDGTWQKISGVQRSLSLY 
VYI HTHTHTHTHTHTHTV 


12483 


26384 


A 


12603 


112 


2 r ESLRPGAVAHTHNPSTIiGGRDGWINKDK 
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sequence 


Amino acid sequence (A=AIanine 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \- possible 
nucleotide insertion 














EFKMCERV 


12484 


26385 


A 


12604 


139 


1 


KKRVKRGPPGGFKGLGGKFFFSPKTFFF 
FFFETESCSVAQAGVQWR 


12485 


26386 


A 


12605 


204 


460 


SGQGNGEICHPRGFCTLPKAISSLLIAR 
VCLKHFNKGVASLSQTANGCFFEGFSET 
ESRSVIQAGGQGGDPGSLQPPPPGSCLS 
L 


12486 


26387 


A 


12606 


317 


1 


KRPLFKNI FQKPFFKNGAPRKKG PRC IF 
GPNQKNPLFFPAPLSPPFWETKKGAGFF 
FWGFPFPPTKIIGGQKVPFPPTFPPFFF 
FFETESLSVTSLECSGAISAH 


12487 


26388 


A 


12607 


248 


367 


TQILSSGSVQSSLHHPPICFIHTHTHTH 
THKHKHPHS 


12488 


26389 


A 


12608 


2 


387 


QKQQQRAGRETSTCSLRI ISAPTMATFV 
ELSTKAKMP I VGLGTWKS PLGKVKEAVK 
VAIDAGYRHIDCAYVYQNEHEVGEAIQE 
KIQEKAVKREDLFIVSKVQWCIWLAFNT 
IVPKSQWQSLQTPQRC 


12489 


26390 


A 


12609 


258 


23 


MWERG I IRPQTQTLTGE PEGLDHGRS I S 
PNSVMNGIVPHISILTLNINGLNAPLKR 
PGIAEWIRIYQTSMCCLQETGR 


12490 


26391 


A 


12610 


176 


433 


DALS S A WG TVXiGTQ IPSHFTEI LMLS P P 
AWGSSSLTQTLFYVPSGAKRTGSYVLAR 
VGQKYKTL WLGAVAHACNP S S LGGRGRW 
IT 


12491 


26392 


A 


12611 


312 


410 


AHMYRTCSRGWVSWLTPGI PALWEAEAG 
GSLEP 


12492 


26393 


A 


12612 


3 


289 


VFEFLSRKLSYILRMFWTFKEWFWLERF 
WLPPTI KWSDLEDHDGLVFVKPSHLYyT 
IPYAFLLLI IRRVFEKFVASPLAKSFGI 
KETVRKVTPNT 


12493 


26394 


A 


12613 


294 


432 


FMKLFFFFFFFERRSCFVAQSGMQGGYN 
ASLQPIiPFGSSWFSGLPL 


12494 


26395 


A 


12614 


138 


375 


SPNATCGHADPLPLCCLHI CQALKRFTR 
QTFFFEGKPYLG I LLYAVFFFLRLES CS 
VAQAG VQQPNLS S LETPP PSA 


12495 


26396 


A 


12615 


232 


1 


GVPQRAKIiRAGLCPPQAMDTFSTKSIiAL 
QAQKKLLSKMAS KAWAVLVDDTSS EVL 
DELYRATREFTRSRKEAQKML. 


12496 


26397 


A 


12616 


236 


419 


LEMRNGWVWFFP PVNPTLWETEGVRDQP 
GQHEETLSLKI I FKISRHGGMRLGSQL.L 
RRLRQ 


12497 


26398 


A 


12617 


197 


1 


LCHESEGFLSRVSALLFTLLLLVLMESP 
PSVSQPGVQWPHFSSIiQPLLPRFKQFCN 
TLPSGWDYK 


12498 


26399 


A 


12618 


132 


3 


NSFMFHLFTYFEMESCSVAQAGVQWRDP 
GPLQAP P ERVGGRV 


12499 


26400 


A 


12619 


255 


2 


MNSLNSFTDIAVIPLHYNRILPHFKI IL 
SYFLKSKFQVKGIGRVKWLTPVIPTLWE 
AEVGRSLiEPRNCPAHQPGQYGKILLYKK 


12500 


26401 


A 


12620 


141 


2 


FHPTYIIISCVCVCVTESRFVARLECSG 
AISAHCKNHRPGSSNSPA 


12501 


26402 


A 


12621 


274 


421 


TYVMEVCCTYYFLAQVLSIiAPFNYFI IR 
PDPRTDPRPDPRPDPRPE PRP 


12502 


26403 


A 


12622 


132 


2 


NQSIKKKIELQGFQRSGGRGKGQVQWLT 
HI I PARWEAKVGRS P 


12503 


26404 | A 


12623 


390 


86 


IFPFFYQNTNGMAPGNQIPQGFSSRFFF 
FLRQVLALWPSLEYTGEITDQCRLKCSS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
\V=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SLSLCSSWDNRCAHPPIiVNILFYVETGS 
HYWHIGLKZiIiHSRYPG 


12504 


26405 


A 


12624 


297 


396 


I IRVRGRAQWLTPVI PALWEAKAGGS PG 
VRKSR 


12505 


26406 


A 


12625 


303 


381 


KFHLVPS INTMSGSQELQWMVQPHFL 


12506 


26407 


A 


12626 


318 


152 


GDKRRLVSKKKKNIiWNFGRIQWVPPVIP 
ILLEGEAGGS PEVRS FRPTGQQSMTP F 


12507 


26408 


A 


12627 


48 


388 


QLGNGKVRL.LYQRHSSSFFFFWGEGFPP 
GPQGGGGGPNIiGPRESPPPGVKGTPPPS 
LLEGGEPPGPPPPPGNFWGFWKKGGLPL 
YPGGPPTSGPKGEAPPTPPKGGGG I KGli 
P 


12508 


26409 


A ' 


12628 


116 


3 


KVATNQAQWLMPVI PALWEAKAGGSSE V 
RSSRTA0AW 


12509 


26410 


A 


12629 


227 


400 


VLFVLFFCKEWHVFIFLILPVYSYKKKK 
KKKKKKKKKKKKKKNKKDSQKKD 


12510 


26411 


A 


12630 


182 


3 


TNWLFPPPKFGPVPLFFLPPFFFFFFET 
GSHTVAQAGVCSGAILAHCNLRLPGSSN 
TPA 














ASTIMDLLFGRRKTPKELLiRQNQRAIjNR 
AMRELDRERLKLETQEKKI IAD I KKMAK 
QGQMDAVRIMAKDiVRTRRYVRKFVLMR 
AN I QAVS LKI QTLKSNNS MAQAM KGVT K 
A 


12512 


26413 


A 


12632 


100 


462 


QliLLCCCCCRQWTAFNVPAS PAP P PAWA 
HMAPSLLEPPSLLVTQICKLSAFSGPSI 
NAFLLSKKKKKKKKKKKKKDRGGAPLKK 
KK 


12513 


26414 


A 


12633 


313 


441 


CWELLKWLVCFLCHI I KTNNCWAQWLTP 
VI PTLWEAE VGGS LE 


12514 


26415 


A 


12634 


129 


400 


VSQQCWPTPPALYS ISRQALAASPQGRP 
WDLQPTMPESPLLPPRAPAWPXPLRRAL 
P PAPRCLG PS TAQGIiRS AGAWCGTGRHL 
HLRPQC 


12515 


26416 


A 


12635 


308 


399 


E IRGWTQWHVPVI PALWEAEAGGLPE VG 
SS 


12516 


26417 


A 


12636 


285 


390 


IFGVLINSFIYFETESRSWRAGVQWRD 
LGSLQPP 


12517 


26418 


A 


12637 


260 




GQGPFFFGGRGSPKKKPPLKGFFLVGGS 
KTQFFFPRPKKGPFGVFFFFFSQTASRS 
VTRIiECSGMILAHCNLRLPGSSDSPASA 
SR 


12518 


26419 


A 


12638 


76 


3 


ALQVQWLMPVIPAXjWEAKAGGSLE 


12519 


26420 


A 


12639 


219 


484 


LGLQEPDLDPKPILELPLAELAQQLQTE 
ELSLES ILCS YLKQAIiKVHQEVNCLMI F 
LGECEEELLALKKLKKSERGLLYGVPMS 
LKDT 


12520 


26421 


A 


12640 


33 


462 


EGLSWGYREHNGPIHWKEFFPIADSDQQ 
SPIEI KTKEVKYDSSLRPLS IKYDPSSA 
KI ISNSGHSFNVDFDDTENKSVIjRGGPL 
TGSYRLRQVHLHWGSADDHGSQHIVNGV 
SYAAELHWHWNSDKYPS FVEAAHEPDG 
LAG 


12521 


26422 


A 


12641 


2 


414 


SGPAAPATPMSIFPELYFNVDNGYLEGL 
VRGI»KAGVLIQADYIiNLEQGETLEDLKL 
HLQSTDYGNFLANEASPLTVSVIDDRLK 
EKMAVEFRHMRNHAYQPLASFLDFITYS 
YMIDNVILLITGTLHQRSIAELVPK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A*=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12522 


26423 


A 


12642 


161 


463 


KEKARGRPKPI>LLLPITSATTAMGLTI S 
SLPSRLFGKKQMRI LMAGLDAAGKTTI L 
YKLKLGEIVTTIPTIVFNVETVEYNNIC 
FTVWDDGGQERIRPLWN 


12523 


26424 


A 


12643 


124 


2 


TVFGGQAQWLMP I I PALWEAEAGRSPEV 
TSLRPAWPRTRG 


12524 


26425 


A 


12644 


258 


454 


NRPELCDYQGLREDSKRKGE INQTACQF 
LWPRQKMSSQTKFKKDKEI IAEYEAQIK 
EIRTQLVEQ 


12525 


26426 


A 


12645 


636 


182 


APLBRPSMIPPRCSGGQPSDLRSSPAQG 
TPRPALGDRKGTPRIAEPGGAGRAEAPP 
PRGSRQRFGPQRRQTNAHTKRVRTRLSA 
APARHS PAF IACRQVGRPRLS ARRSGKK 
PHLTVWCPFSFQDPIQDTTLQI ILLSLL 
GGDTFS DFPCS 


12526 


26427 


A 


12646 


199 


414 


IYHTSRSLUDRKLRAQRYDELPHYGGMD 
GVGVPASMYGDPHAPGP I P PVHHUNHGP 
PFHAKQNYGAHAPHPN 


12527 


26428 


A 


12647 


130 


3 


DRGFbCVHTPLKKKKNTHTHTHTHTHTH 
TNTHTHSHTHTKTK 


12528 


26429 


A 


12648 


192 


443 


LLLCWAQESLGTLGENTASSHTAGLEWE 
SPLFACWFFWGFCLFVWLFVWFEPKSGS 
VLTLECSSVITAHCSLDLPGSRDPPASA 


12529 


26430 


A 


12649 


393 


179 


mSRVQRSGCFCSCVSNPVTGGMHFTRS 
SPQSNQEADGQARWLTPVIIiALWEAKAG 
GLPEIiKSSRPAWATP 


12530 


26431 


A 


12650 


263 


2 


KAPLLVYKWE PNPGGGNFS PGMGTPNTF 
GVRFGGETKPLFFFFFETGSHSVIQAGV 
QLCNLDS LQP P PPS VKP S Q YS WDHRKQS 
NTK 


12531 


26432 


A 


12651 


355 


111 


QDPCFIRLHLPLSFI.SVFLECTKISLI.D 
WRMLFPLLGDP PHLALSSLVS P INTWNY 
SFQQLMMFRDVAVDFSQEBWECIiDL 


12532 


26433 


A 


12652 


271 


3 


RQLKNIL, I FFS HKNMLS IKHII YNYTLF 
LKSAFIFNLETDHLNCFPGS ISAKSYFI 
LFYFFETEPHSVAQAGVQWRDLSTLQPP 
PPRFT 


12533 


26434 


A 


12653 


333 


2 


PPSPGVFGNPPSPGDPLGKTFFFPRGPP 
PRFWGGGFFFIIAPFPFGGLFPPFPLPP 
SQGPPFFPFFFFFFFFEMESRSVSQAGV 
QWRDLSSLQPPRARFSDSPLYFLEGR 


12534 


26435 


A 


12654 


310 


444 


TYLSNI FYFILFLVETES PS VAQDGVRW 
CGLGSLGPPPPGFGRFS 


12535 


26436 


A 


12655 


336 


411 


DQPG QHGE TPS LLKI QKL VE CGGAH 


12536 


26437 


A 


12656 


114 


2 


GIFFFFFGLESRSVAQAGVQLCDLNSLQ 
SPPPGFKQF 


12537 


26438 


A 


12657 


191 


1 


KNFI LT FS FRGAKNGDVFFPP PGYIQTG 

ENFFFFFFIjKWS ftlvaqagvqwcnls s 
LQPPPPR 


12538 


26439 


A 


12658 


1 


445 


lrtgsefsgrdskglaaaeptanfglll 
asiedqgaggggycgsrdqvrrclranl 
lvllteaagvagvalglgvseaggalal 
gperlsafffpgelllrllrmi ilplvg 
csliggansldpgalgrlgawallfflg 
ttllasal 


12539 


26440 


A 


12659 


2 


224 


ESTASRI AFEAWQPETIjPKGIiNYSGAS P 
VVLNAVLPLKKKKXKKKKKKKKKKKKKK 
KKKGGGVFKKKKI KGGGG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


12540 


26441 


A • 


12660 


96 


1 


GLKDGRGQVQWLTPVI SAPWEAKVGGS P 
EVMS 


12541 


26442 


A 


12661 


1 


457 


CWVHLQKGCHFTLGRLHSGQAEALL.TSQ 
TGRPGRGAPHIPDGAARQRRSSHPRRGG 
Q AE ALL TS QTGQ PGRGAPHLPDGAAGQR 
RS S P PRRRAAGQRG S S PPRRWAAGQRLS 
S h PRQGGRAEALLTFQTGRPGRGAPH I P 
DKAAGQRGGAG 


12542 


26443 


A 


12662 


175 


351 


KGIFFFPPSWKAGGAIWPKGTFAPRGKG 
NPPPNPSGEGGPKKKKPGPQI PRGPPLG 
PPV 


12543 


26444 


A 


12663 


247 


401 


SSNQEPGCKGPCVLTNFYFILGRAWWLM 
PVIPAI WEAGTGGS PEVRSWRPT 


12544 


26445 


A 


12664 


152 


251 


RENSCCFIKKKKKKKKKKKKKKKKKKKK 
KRGAP 


12545 


26446 


A 


12665 


2 


365 


APTRSRCLRHASCTACRTVSTDTSSLRR 
ADPKGRSALLADIQQGTRLRKVTQINDR 
SAPQ I ES SKGTNKEGGGSANTRGASTPP 
TLGDLFAGGFPVLRPAS QRDVAGKEE F I 
LAPLWTVSP 


12546 


26447 


A 


12666 


179 


1 


LELSKSGSHWMRHFLSRSFYSIjIS PQLN 
TTVWPTI ITPILLTLFLITQLKIUOTNY 
HLP 


12547 


26448 


A 


12667 


279 


35 


NQAPFKARFFTTFKNVFLKEIKKKRCWS 
GAP PI I P P P WGGGGGGPP WGGNLS PPGP 
PVSTPFFNGAQKKKKKKDRYGGACL 


12548 


26449 


A 


12668 


299 


399 


IS IKKQRKGPVQWLMPVIPALWEAKAVG 
SPEVR 


12549 


26450 


A 


12669 


1 


385 


KMQNRGFFFLTFLAIjIjGIjTSGAAKKKNK 
GKKAGPGSKSPNWPWGPCPPSSKDCGGV 
FRKGTWGAQTHGIRGRGPCNVJKKEFEAN 
CKSKFKNWGAGDGGPGTKVRQGTLKKAP 
S IAKGQETIRVTKPCT 


12550 


26451 


A 


12670 


357 


1 


AGKIAKI CPVSSMQAPTCGFPVGGNDNQ 
GQAPDGQFQPPLQQNQTSSPDFSNENSP 
ATPPNEQGQGDAPPQLEDEEPAFPHTDL 
AKL-DDM INRPRWWPVLPKGELEVIiLKK 
GIDFSKK 


12551 


26452 


A 


12671 


107 


373 


TGNYTPLEDCAQEQMRLIAQVHLLETRV 
KQQQVKI KQLLQENEVQFLDKGDENTW 
DLGSKRQYADCSETFNDGYKIaRGFYKIK 
PLLNP 


12552 


26453 


A 


12672 


196 


45 


YLYFHGSREKRLAKKYYDKLFKECCIAD 
hS K YKENKVC FPH I YGKRFLFK 


12553 


26454 


A 


12673 


33 


448 


KEGAERAGAAAPVWS FLLDRRDSARTRS 
GTS LGS ADMGDMKT PDFDDLLAAFD I PD 
IDANEAIHSGPEENDVPGGPGKPQPCVG 
SESEHTASASAGDGPGVPAHASDHGLPP 
PDI SVLS VI VKNTGLSRAGWRP WVDV 


12554 


26455 


A 


12674 


250 


29 


KKQTNMWQSPYEDYRIFYTISSSLYHIiS 
IYPPIYLSIYLSIYLSLYLPTHLSIITD 
YVSILEHELRPLWRIQP 


12555 


26456 


A 


12675 


377 


442 


SDRQWWCMPIVPATWEAEAGES 


12556 


26457 


A 


12676 




553 


RIFPGRRFRVKLPSCPDPAMGTRLIjFWV'" 
AFCLvLGADHTGAGVSQSPSNKVTEKGKD 
VELiRCDP I SGHTALYWYRQSLGQGIiEFL 
IYFQGNSAPDKSGLPSDRFSAERTGGSV 
STLTIQRTQQEDSAVYLCASSLATAWHS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glntamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=llnknown, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














RLLPAHKPHPSLSLQLLDTLNRGFSLLL 
PPHGKQVDLDLSLS FG 


12557 


26458 


A 


12677 


232 


3 


GWSSDCQQRARVIEEKHSRQKGHIAENP 
SVHRHHQRPKVDKTTKMGKKQNRKTGNS 
KTQSASPPPKERSSSPATCI 


12558 


26459 


A 


12678 


134 


3 


IFIVEFGGKPFSCTSISLSQWLWCLFIG 
IGEUjWGQVSTGTLY 


12559 


26460 


A 


12679 


212 


47 


NGTHPHGYTLHTHKHTHSCAHTHTHTHS 
LLLVRKSFLPQKNTTISFRSCCKLGKP 


12560 


26461 


A 


12680 


224 


2 


MKMASFUAFLLLNFRVCLLLLQLLMPHS 
AQFSVLGPSGPIIiAMVGEDADLPCHLFP 
PMSAETMELKWVS S SQC I 


12561 


26462 


A 


12681 


35 


397 


KPLISQNKIHSVSSSQTGFFFFFFFFGE 
KKPPFAPRGGGKGPPPPLPPGGGNRAPP 
RGEKKGKGKP P PKNPGAFP PQKEKKTPG 
GGGGKKKTPAPPTPKEVDQPLPRREQRG 
PGRLPPRGA 


12562 


26463 


A 


12682 


188 


514 


HTTHLVYVLSMAAFFFFFFEKNFLFAPR 
VEKRGKDIjGSI.KIjPPPGFRHFSGI.TLQG 
SGNNGAP PPS PVI FLVFFE KRGFPLVGR 
EGLIIjPPIjQPAPFCISFRGAINGPS 


12563 


26464 


A 


12683 


366 


3 


ISQTQTQWTDWVSLELQGCELAHRDTVA 
I PQTRS FPFPLMI S CMTLFS LLQMDRAF 
PPFPPQTPITIQGQISTSPATWPLTHIjH 
STPGLSVEYSSNTHKSSLSPTSQWIRVD 
PMIASPTV 


12564 


26465 


A 


12684 


99 


1 


SSCIRFFLEQAYGQQSYGTYGQPTDVSY 
TQACI 


12565 


26466 


A 


12685 


247 


31 


FL>KE KRNQF YL F I FQRWLD PNKP I RKQ L 
KRGSPYSIiNFRVKFFVSDPTKLiQEEYTR 
WVYGYI FLENI VKTLI 


12566 


26467 


A 


12686 


416 


253 


PSPMTI PVTGAPRDADLWSSHAKMLAQP 
LKDSDVEVRFLGSSQIFLCWRPSFNL 


12567 


26468 


A 


12687 


116 


3 


CLLGRMWWLMPVI PALWEAKAGRSLE VR 
SLRPARPMY 


12568 


26469 


A 


12688 


380 


1 


SPESSKLGLWSKHVSPLLCIGVCPPPVS 
MAEIiRQVPGGRETPQGELRPEWEDEVP 
RSPVAEEPGGGGSSSSEAKLSPREEEDli 
DPR I QEE LEHIiNQ AS EE I NQ VELQLDEA 
RTTYRRILQESASV 


12569 


26470 


A 


12689 


157 


488 


REFVSQGGSHIjKAQVRLEALUjTIGTPP 
WAHLS I LHMTAMGQRENKREAAS SRIQV 
LLNHSQAIKQESI ILFFEMESGVWPRLQ 
YSRMI SASCTLFLSGSNNS P VSAPR I 


12570 


26471 


A 


12690 


514 


3 


PSIRAGLLiCGSAENATPFLCGITMAAGP 
LYTYPENWRAFKALIAAQYSGAQVRVLS 
APPHFHFGQTNRTPEFLRKFPAGKVPAF 
EGDDGF CVFE S I AI AYYVSNEGLRGS T P 
EAAAQWQWVS YADSDI VPPAS TWVFS T 
LGIMRHNKQATENAKEECMRPIiRVSS LV 
GP 


12571 


26472 


A 


12691 


304 


4 


AESLRVAHERLDTRSTSSDIFNFPQTQS 
NLEMNSE ILESWANYQSSTS YS INTELS 
LFSKVNGKFSTEFQRMKTLQVKDQAITT 
RVQVRNliVYTVKINPI. 


12572 


26473 


A 


12692 


221 


2 


VGPATRDIjCFADEPWGRQKQKQWWWVL 
CIiSPNRGLAGRPDTtiHITCASAHMRTHT 
CMHTHRHAHAHVHTHCI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
£=Giutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


12573 


26474 


A 


12693 


3 


205 


YMLCSILSTQEAFSICLGEKFIYLFTYL 
FRDGVSLLLPRIjECNGVI IiAHCNLRLGD 
RARIiHLSKKQK 


12574 


26475 


A 


12694 


233 


3 


IMGPAHLFKGNHWNGGWGLMYKAFFFLF 
FFKTGSHSVAQAGTIPVHCNLRLPGSSD 
SPAPASRVAGNTNVPHHTLY 


12575 


26476 


A 


12695 


408 


3 


PAREMEKFRVCVRKRPLGMREVRRGE IN 
I ITVEDKETLLVHEKKEAVDLTQYILQH 
VFYFDEVFGEACTNQDVYMKTTHPLIQH 
I FNGGNATCFAYGQTGAGKTYTMIGTHE 
NPGLYALAAKD I FRQLEVS Q PV 


12576 


26477 


A 


12696 


123 


631 


REAVQGGKGSGGGFGSRDSRSSGAVSAA 
VGDMGDPGSE I IESVPPAGPEASESTTD 
ENEDD I QFVSEGPSRPVLEYI DLVCGDD 
ENPSAYYSDILFPKMPKRQGDFLHFIjNV 
KKVKTDTENNE VS KNHCRLtS KAKE PHFE 
YIEQPIIEEKPSL.SSKKEIDNLGASDCW 
D 


12577 


26478 


A 


12697 


429 


629 


LALLYPLiKWVGKPVNFMEEDHiGPLPPP 
LNEEEEEAEEEEEEEEEEENPVHKI PDS 
HEITLKHGTKT 


12578 


26479 


A 


12698 


158 




IjPLLSVERLVEIYAYTFIQDLFSASNIP 
YS SS VFR I EMDQ PRTHSG P TTASNPAP S 
STNSSSAPSATNSKQERSSSSLSKPS 


12579 


26480 


A 


12699 


185 


3 


HRRPISIPSSVIHPAMVRLGLQYSQGLV 
SGSNARCIALLRAIiQQVCP I LLSLM I QP 
HLVY 


12580 


26481 


A 


12701 


181 


3 


TVWASMFLSAALRARAAGLAAHWGTHVR 
HIiHKPDMQNGAGGALFVHRDPRENNRDT 


12581 


26482 


A 


12702 


373 


1 


ILVQERDSQRV I RM I D KL VS SRDR VGRG 
VEKQMTDHRVIYSVRKLQKRITPEPFIP 
SRTTPKUiFLLVLPSGKTPGHPVSSRTI 
PEPPLPTEPLERI PEHPVPSGTI PKPPE 
PPLPIEPHETMY 


12582 


26483 


A 


12703 


272 


508 


TKLGKKQNRKTGNS KQQS AS P PP KERS S 
SPATEQSWMENDFDELREEGFRRSNYSE 
LREDIQTKGKEVENCEDNIjEE 


12583 


26484 


A 


12704 


341 


1 


VPQHPHRPLAPPSLGPQSWPLMEGSRPR 
SSLSLASSASTISSLSSLS PKKPTRAVN 
KIHAFGKRGNALRRDPNLPVHIRGWLHK 
QDSSGLRLWKRRWFVLSGHCLFYYKDSH 


12584 


26485 


A 


12705 


187 


365 


WE PSCRGPS AFKPTR CQS YDWACMCGAE 
GRSAMEQPQEES PEVREEEEIEEMAHAE 
GTP 


12585 


26486 


A 


12707 


238 


596 


LWLVRVKYS I MSGAALGLE I VFVFFLAlT" 
FLLHRYGDFKKQHRLVI IGTLLAWYLCF 
LIVFILPLDVST1TYNRCKHAAANSSPP 
ENSNITGLYATANPVPSQHPCFKPWSYI 
PDGIMPI 


12586 


26487 


A 


12709 


131 


2 


DRVSLSPRIiECSGTILAQPSRLKQSSHL 
SLTGSWDYTHAAPCI 


12587 


26488 


A 


12710 


151 


2 


YRQGLILLPRLECRVMIMALCNFESPGL 
SDPLTSASRVASTTATCHHTC 


12588 


26489 


A 


12711 


151 


1 


NFPBFDHFTVGCLRAGSSLIHYWWECKIi 
VQSLWKAVWRFIKDLKIDLPLY 


12589 


26490 


A 


12712 


28 


411 


RWPARPAGEPREPHVSWVMKLNPQQAP 
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SEQ ID 
NO: of 
nucleotide 
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od 
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beginning 

nucleotide 

location 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalnnine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L= Leucine, M~Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginilie, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LYGDCWTVLLAEEDKAEDDVWYLVFL 
GSTLRHCTSTRKVSSDTLETIAPGHDCC 
ETMKVQLCAFKEGLPVFDVTEKYFLLPR 
MYRYYRCLYTYPSAAV 


12590 


26491 


A 


12713 


351 


675 


ENPRHTFIYSLPLGLQKHQVLTVDIGFG 
GTAIMTVGKSSKMLQHIDYRKRWILQDG 
R I F IGTFKAFDKHMNLI LCD CDE FRK I K 
PKNAKQPEREEKRVLGLVLLRGEN 


12591 


26492 


A 


12714 


326 


457 


ACSHHQRKSQAPPPAHLWPDCPHPGSLP 
LLNPSVCNPGREVSPT 


12592 


26493 


A 


12715 


200 


1 


TITKVCVFTRTCGLKGSLMAKALFDQRK 
DRESETECQVWWLTPVIPALSEAEVGGL 
LEPRSSRPGV 


12593 


26494 


A 


12716 


208 


405 


KKERENKKHTNKKGRKKMVHI CQWHSSL 
HRKFQGIYRKTTGTGWAQWLTPVIPALW 
EAEVGGPPEV 


12594 


26495 


A 


12717 


157 


3 


AKETPTHKGSCKKKFFFFFFETESRSVA 
QAGVQWRGLGSLQAPPPGFTPCI 


12595 


26496 


A 


12718 


1 


472 


SPAILPRLAILPYLLFDWSGTGRADAHS 
LWYNFTI IHLPRHGQQWCEVQSQVDQKN 
FLSYDCGSDKVLSMGHLEEQLYATDAWG 
KQLEMLREVGQRLRLELADTELEDFTPS 
GPLTLQVRMSCECEADGYIRGSWQFSFD 
GRKFLLFDSNNRKWTW 


12596 


26497 


A 


12719 


537 


1 


LHTMNGGNESSGADRAGGPVATSVPIGW 
QRCVREGAVL Y ISP SGTE LS S LEQTRS Y 
LLS DGTCKCGLECPLNVP KVFNFDP LAP 
VTPGGAGVGPASEEDMTKLCNHRRKAVA 
MATLYRSMETTCSHSSPGEGASPQMFHT 
VSPGPPSARPPCRVPPTTPLNGGPGSLP 
PEPPSVSQACI 


12597 


26498 


A 


12720 


369 


3 


AAKIIPATRKKASLELELPSSPDSTGGT 
PKATISDTNDALQKNSNPYI TPNNRYGH 
QNGASYAWHFEARKSQILKCMECGSSHD 
TLQELTAHMMVTGHFI KVTNSAMKKGKP 
IVETPVTPV 


12598 


26499 


A 


12721 


230 


3 


KFFWVLAGLTGICNSDASASLNQVHISPF 
LFSRHHPFSLGPLS S PVLLQGS KRRQLL 
ATLRALESASLSQHPHVPV 


12599 


26500 


A 


12722 


26 


110 


REQ YAEGNMRG P APG KKTS GLQQ KWVE V 


12600 


26501 


A 


12723 


224 


1 


WQNQDQASDPPKYSFTSQCCLSFARLAR 
RYGDVFQ I RLGS CP I WLNGERAI LOAM 
VQQGSAFADRFAFASFRV 


12601 


26502 


A 


12724 


120 


530 


KKVARGRSRSRERSRRRRSRPKAITXNR 
T*GTRCTPRRWRSTVLGMRSCTRAR*QR 
RSGLSRGHTRSAGLCDHGSVRAGSGDGA 
DGTGGGDRRLGLGRDS LLLS S SQSAAFS 
SS ASGSSS FFSATQPLRMLLEYFWL 


12602 


26503 


A 


12725 


434 


222 


KEEKS RGEVKEQD*EKDREEEVEKS RE/ 
RRRSRRRRRRSE V* YRKGKRRREREDI L 
VAERSHRS FRNS IPSTL 


12603 


26504 


A 


12726 


268 


370 


SNILIHMESMFFFFGTDIPSV*KNLPA* 
KTPGPDGFIDKLYITFRGELTTFPHXLL 
H*FTEGAVLFNSFSK\AASITLTPKPNY 
DIMRKENYSPISSYIWNQCSFFLEQIFP 
LS FKLNCPGLI I PH 


12604 


26505 


A 


12727 


103 


636 


VCFISMKQPHGSRHPPCS/PPRPGS*VC 
QQSRPLGPQ PSAFGQLA*HRRGLLGLQG 
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NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 
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od 
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NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PRSPMQPVPHGGRVSGQTLPKGPTPKRP 
PGPAPRHGGHSWLLK*LVCAQPPPRP* S 
*ASQAVFTIiQVPGKPQVWTPCPVPVRAP 
TP\ PLSNGGLGVS KRH*DGH * QAPTS P * 
PRCQGGAGEGPQ 


12605 


26506 


A 


12728 


1193 


1638 


GLGFAMLPRIALNSWPQGDPSVPASRAA 
GTRG VHRHTQLQVS FNYYKVLAMHSG S Q 
L*SQHFRKPRRSDHLRSGVRDQSGQHGE 
TPSL/LKNTRISWAWWHTPLVPATREAE 
AREL P E \ PGRQKI AS EPRVKP I CTAS LG 
NTCETPPQKK 


12606 


26507 


A 


12729 


304 


101 


RHLHWPCPSPLAPTPDISHCPEPPKSQT 
SP/CPGT*CLGPLKCPSSCHFA*K/PLH 
PSEPARKStAPDTCP 


12607 


26508 


A 


12730 


91 


264 


SQRS ISGLRVKENL IFVMIMFPP I YS SQ 
TFWSQTFLMLKIVFMGISISICYIL/NT 
EKNLGQG WWLAP 1 1 PALWEAEAGGLL * L 
RVKENL I FVMIMFP P I YSSQTFWSQTFL 
MLKIVFMGISISICYILKLRKI 


12608 


26509 


A 


12731 


1026 


406. 


IiAHFRSQ I FS FSHI LVHFERMVLNYR YY 
LACVRR WFRFLHVLAYFGR I VLKYRY F 
IiVHFRREVFRFRHMLACFRRMVLYCYFL 
VLRKI LIiIYVTFLTYFRKVFLRHNFLRG 
G Y *R KYFG YAPRWLS FFLYRCQ C FLHY F 
LFYLWLRLHL-QF/CCFWSFCLFDFLFL 
FSAACVFS VLQ I AI VMFPFHALQLFFF * 
WIiCFSNSRFQ 


12609 


26510 


A 


12732 


1508 


157 


QDVGGSGFKVDTHPRGRMAS I FS KliTG 
RNASLLFATMGTSVLTTGYLIiNRQKVCA 
EVREQ PRLFPPSADYPDLRKHNNCMAEC 
LTPAI YAKLRNKVTPNGYTLDQC I QTGV 
DNPGHPFI KTVGMVAGDEES YEVFADLF 
DP VI KLRHNGY* PQG* LKHTTDLDASKI 
T\QGQFDEHYVLSSRVRTGRSIRGLSLP 
PACTRAERREVENVAITALEGFKGDL\A 

gryNykXlsemteqdqqrliddhflfNd 

KP VS PLLT CAGMRRDWP \DARG I \WHNY 
DKT\FLIWINEED\HTRVISMEKGG\NM 
KRVILSR\FCRG\LKEVERIiI \QERGWE 
F\ MWK* AP * EY I LT\ CPSNLGT\GLRA\ 
GVHVRDPQSFSQDPTAFLKILEKPRTPR 
KRGHKWWDI AA \ VADVYD \ I SNI DR I G 
RSEGEIAVQIVIDGVNYLVDCEKKLERG 
QD I K\ VP PPLPQFGKK 


12610 


26511 


A 


12733 


261 


487 


TGSETDCAKPQAS FLPREVPTVAEMKME 
LLKNKQFW\RGWAHACNPSTLGGQGGW 
IT* GQE FETS LANMAKPHLY 


12611 


26512 


A 


12734 


382 


668 


YKRITDI FVDSETVHILINKRQSCRIPG 
FIQLVQL ISHQLAAPRDYTVSHS VAQAG 
VQW/RNIiGSLEPLPPGFKRFLCLS *HAL 
KNLSSCDTPPQY 


12612 


26513 


A 


12735 


401 


27 


GDRAEESAEPRAWSHSDNSHRYTTLFIC 
LTHTHVHNPVHS \HTHTHTHTHTHTHTH 
TVSHRHTETPPLLLKQTGLKFY*NSRDD 
TPRSRPGS SGLQRLSSS PPVP FQPGTVE 
ASADFCGHDIiLTT 


12613 


26514 


A 


12736 


202 


182 


KYLPI F INIilTMNI LFFFETHS CSVGQA 
EVH* S * LKPMPPG \ SSDSPASAFQVSG I 
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26 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Giycine, H=Histidine, I=Isoleucine, 
K==Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibie 
nucleotide insertion 














TGSPSSSWLIIi*FFLKIGFTKKG 


12614 


26515 


A 


12738 


319 


2 


QNTPPHIYEYIPRVDAFLLFL.EMSYSVY 
WAVLS PLY FVNKLALTLHCRLALNS FL 
HKIQEPS\LGSGSG\PLSCNTII*RLCL 
RSLHLVFRSLWILPICDSSSVF 


12615 


26516 


A 


12739 


1363 


2000 


DRVFVI PGWKCSGTIKVHCS LDLPGS SK 
PSTSAP*VAC\ATGTYPPCPGIjNFIiYF* 
MEERGFTQVA\QAGSQNSWGSKPNGTSE 
AS PKVLGMNKP * NLPNPGRS I LN *DGS I 
RYKDNYPA/WEPPKRSYKCLRQKKSI *L 
SAGPRDYEASQPRKNYP I S SLLTFCTS S 
LHFNPTELTCSSQLQKS IRRKLKSEES C 
PPLPACSLKHTQAIIKITF*RLHRTA/L 
FYIiLK*NCFRN* S * * KYFLLSVSNLKI / 
QWVRSSLHPGAPGYTPTV*RPKHKSNVL 
I*IQS/ITRCLFYILVYMRTIFPLQI,FL 
SKHPSTYKEDFA*VLPLSFFFFEMESRS 
AARRSLSSLQPLPPRLKRFSRLSLLSSW 
DYRCAPPGLASPSILSS*GSCCSIQFIL 
RMST I SHAINVLVLKNT YLVLVSASEHS 
LIKKPC 


12616 


26517 


A 


12740 


1696 


743 


GGQIMRSGVHDQPDQHGETLSIjIjKIQKS 
AGPGGMHL*SQIjLRRLRQENRL\NRGCS 
EPR/SRHCTPAWATEQDSVSKKIKK* KK 
*NHLESKQQGQPALEPPEQAGGQLRKTE 
QLQDGRRELAADMTL* PGHSKRLGALPR 
PLLSAYYFNKVHAARRQAYLETPGTFTS 
YQREEGEQALI LGVEAQAS S PTVFHRER 
RQS QALCSHRKS QEAPVR PAHP PRRVPG 
IiGKPSSQGLSAHLGQDRAPAPRRASWDQ 
RSQAP I SVTFPS VLDKEES VPCGPGFPG 
HAP APRG IHGAT WEGAHS RGYPGHFLAL 
PQHNSDEQRPN 


12617 


26518 


A 


12742 


2473 


445 


RGARRSRRRSRHRRRRHQSRPVRAAPRQ 
PEQGRRRGAPTHGPQLIMMDLELPPPGL 
PSQQDMDLIDILWRQDIDLGVSREVFDF 
SQRRKEYELEKQKKLEKERQEQLQKEQE 
KAFFAQLQLDEETGEFLP I QPAQHIQS E 
TSGSANYSQVAHI PKSDALYFDDCMQLL 
AQTFP F \ VDDNE VS S ATFQS LVP WYS PG 
HIESPVFIA\TNQA\QSPETSVAQVAPV 
DIJDGMQQDIEQVWEELiLS I pelqclni e 
\NDKLVETTMVPSPEAKLTEVDNYHF\Y 
SSI PSMEKEVGNCS \PHFLNAFEDSFS K 
HPLHKNDPNQIiTVNSLNSRM / PTVNTDF 
G* WNF I LVF \MAE P S I RQQAWP S P ATltS 
HSLS \ELLNGAHGCFLDLFTL/CKAFNQ 
NHPEGTA\EFHGF\DSGISLNTSP\SVA 
SP\EHS\VESSSYGDTIjLGLSDSEVEEL 
DSAPG\SVKQNGS *NTMYSSSGDM\VQP 
LSPSQGAEHFTCMDAQCEEHTRGKDLPV 
\SPG\HRKNPISQKDKHSSPLGGLISQR 
DEL\RAKAL\HI P I PCRXKS FNLPVG\D 
FNEMMSKEQFNEAQL\ALIR\DIRREGV 
RNKS /AAASGI CRKKENWENI VELEQDL 
DHLKDEK\EKLLK\EKGENDKSLH\LLK 
KQLSTLYlAEVFQHAYRDEDGKPYSPSE 
YSLQQTRDGNVFLVPKSKKPDVKKN 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12618 


26519 


A 


12743 


6 


457 


RPRNRPGIRVDPRVRGRVGHAPHEGLVP 
ERMYRGS PTACETQAAA* ERAFGPSS PT 
C\RLP I PRMSTSVPQGHTWTQRVKKDDE 
EEDPLDQLISRSGCAASHFAVQECMAQH 
QDWRQCQPQVQAFKDCMSEQQARRQEEL 
QRRQEQAGAHH 


12619 


26520 


A 


12745 


297 


12 


QERPVRMLVLHFGRLRLADHLKL.GVRDQ 
PGQYGETPSLLKIPKLSGM\MWAP* IC 


12620 


26521 


A 


12746 


98 


428 


YNTS FNETVLLLTT P I TIDCLYTRKDAI " 
APESACGVCPCSLLGVGA* IPKWIRLD 
LS KKHVTT A YGG FM CAKCVLDR I NRAFL 
IDEH\KIVPKV*KALSQSSKAFCEHET 


12621 


26522 


A 


12747 


3 


638 


LWLWS1.C WVWAVSLPLQF I LGS LHPCQG 
QASWRE VDIjLHEGSQEASS KPS SGSVP V 
. GLLI>DQEMVTPLI>VCICG I LLHQEMVS P 
LLVYTRGILLHQEMVTPHPGIjYPWDSPP 
LDGHSPPGLF\RGAAPPS*GG/PLPVLV 
YTRGAAPPSGDS \ PPPPGLYPWDT\PPS 
GDRHS PPGLYPWDTPPSGDGHS PPALYP 
WDSPPSGDSHSPPGLFPRG 


12622 


26523 


A 


12748 


2 


333 


DMVLIiCHPGWSAGS I * KKTKERERMMNL 
HR**IDRERERERERQIQ*KE*KRINEF 
EGSKERFTQSVKQRNK/ELEKIRTIiDTS 
ETI YLS I YRN* SLRRGQERRTERERMRH 


12623 


26524 


A 


12749 


30 


333 


KTSYLLPVQWWAQNDNERYSSSKNTIMA 
LPLPLPVFPRSPSDAERKLDCSAAISAH 
CNLPA\DS PAS ACRVPAIAGARRHA* L V 
FGFFWWRRRFAVMAGLVS 


12624 


26525 


A 


12750 


231 


39 


INDLL/CL.F*FKKLIWGRVW*1,MPIIPA 
RWEAKAGGSFE PRSLKLQ*AM ITPMYSS 
MGGREQDPVS 


12625 


26526 


A 


12751 


49 


273 


hlqvtevfwfwcvffrrwggshcv/aq 

AGV* WIiFTGTWP CCS PELLGSRD PPA3 
AS * VAGTTGACIiAANCRGF 


12626 


26527 


A 


12752 


505 


897 


SYLRVQFYSAFSLFFRHEVLPCHTGWNA 
W*SQLTTASN/CLGPSNLLKAPHLAN* 
IOCIF*RQGLSMLPRLVIiNSWPHMILLP* 
LPE+LGLQARATAPGSGFTFCLTQDSIL 
MCSPTVHKLSDSIALETKQ 


12627 


26528 


A 


12753 


335 


542 


CCNEFFIiSQVWWLMPWPATR/ SAEAGG 
L PGPGS QRLR * ARS KPVNSHC S AGGRCG 
IDPISIKQQKNNNR 


12628 


26529 


A 


12754 


356 


72 


WHEYYLLMEHTKKCHLS *GLYNGLNN*Q 
/WCTHPWPATWEAKAGESLEPRSSRL* 
CTMITPANSHCPPAWATARSCLNQSINQ 
SSNNWQGMITGGK 


12629 


26530 


A 


12755 


340 


127 


NYLFI YFRDSLTMLPRUECSN* FTGS 1 1 
VHITLKLLGSSLP /ASASQVAGTTGTRH 
HVQLSNYFKIKIKPKP 


12630 


26531 


A 


12756 


290 


511 


KKNQPGT/ CGSRL* SQRFGRLRQADHL\ 
DQLGQHGKAPSL/LKNTKISWALWRTPV 
FPSSQEAEMEEL I E PTS SRLQ 


12631 


26532 


A 


12757 


254 


549 


Y PGAKKQR PGGDS VRGTHLQS RC WCVLL " 
HNCQ* PKSNTQTFP * PKNLRE * WAT * KS 
TDL/WPGAAAHLCNPSTLEG*GEWIT*G 
QEFETSLGNIPRPQIY 


12632 


26533 


A 


12758 


617 


451 


NKKRRGLTIiLSRLKCNGVIIAHCSLKYP 
GSSEPPTSAF*VAGITY\GT*HHAWAY 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 


IVI 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valrne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12633 


26534 


A 


12759 


91 


393 


KWYTKECLYLLVS I FSRYLIiS I YYRE VL 
AQTLGKLS KTVM C P ALEE TVAVNGLQ I K 
SKK* /WLGMVTHACDPSTLGG*GGWIAW 
GREFGTSIASMAKPCDC 


12634 


26535 


A 


12760 


10 


221 


QSHNTNGCUDIiHFQKGPWQS VGETLNL * 
T I FLCVC VCVCVCVRHVYHM/ C Y I YS DP 
ICPSIECEYFSFAHR 


12635 


26536 


A 


12761 


309 


55 


I IRYKNYI S IKLEGWKRLYHATTHEKKV 
GVAVL I S EKVDFRAKN/MYQDKEGRFLL 
*R 


12636 


26537 


A 


12762 


640 


896 


TIVLKCNFVLFCFRDEVPLCCPGWSAVA 
/QSQTQLW *TTQPGLKQSS CLSLLSS WN 
* RHVPPHLAMFLKTTFTLLYTCGHLETN 
IC 


12637 


26538 


A 


12763 


1 


357 


I VP LHS S LGDRVKPHLLKKKKTCS TSLV 
I RETQ I KI TRYT YS NG* S * R KNKAGNNK 
CWHGYTATGTL IAD / CKSKQMLWKTF * Q 
FHIKLNIHY*YNSEIQFLGLYPRDKNIA 
RCGGLRL 


12638 


26539 


A 


12764 


129 


320 


RWKCVKLKNTGQAWWLIPVIPALWKNQ\ 
AEGSLEARSLRLQ*TLIAPLGYSLPG*W 
SGTLSLNK 


12639 


26540 


A 


12765 


617 


435 


GVKGLILPKLQKLLKSIE/ I/E* KLPKS 
FYDASLTMIPKLHKD\RPISHVKIEARI 
LHKISINII 


12640 


26541 


A 


12766 


243 


5 


QKIKKRIKKNKQNL*EI WDYVK* PNL* I 
IGIPERGEKVYNTENIFEGI IQ/ESVPN 
L / AYLVREVDNQ I QE I QRTPVR YYTK 


12641 


26542 


A 


12767 


101 


16 


RTQLFVSVFWFCFLFCFETGSCSITQVG 
VQWCHLRSLQLPPPP \GSRDS PPPSSQN 
RG*TNI 


12642 


26543 


A 


12768 


362 


92 


RVPHMDLGEECTS P YMYI * PMTCVFKHG 
*DQNFCVTYFATI I I YTYI / YLSVCVS I 
*I*CYLCVCVYIYIYIYIYIYIYFIYKY 
MKERRIi 


12643 


26544 


A 


12769 


1161 


824 


QAKDLSTHFTKENM*MG/NKQMKRLSES 
LVISEMQ I KTTMRSHLTHTRITI I KMTS 
DKICP*RCG*MRA/LLQC*WKYKIVSPL 
WKIVWPFr.KT\lNMNISGNIiMILFLGFY 
PIY 


12644 


26545 


A 


12770 


241 


237 


K*RIWA\GVMAHACKPSTLAGCDGWIA* 
SQEFETNLSNTGRPCLYER 


12645 


26546 


A 


12771 


326 


6 


DTVSRKNKSGKI FQLSSRV* IYERSQSG 
VKVYKCKTFGKAFTQ/HF*AHMRMYTGE 
KPYKY*ECGKFFILVLLLLLMIQKYFHL 
I KIVRLYL IRKKVS CKQPSNKI LQS 


12646 


26547 


A 


12772 


413 


1 


KKTFYFFFFETTQAGV* *CDHNSPWP* P 
P\GSDNPLTSAS*MA\GLQSCMHHHAQL 
S * FVFLEAGSHYVAQTDGDSPTSASQSA 
GITAVSHHTRPILYILIVLSYAISFFSI 
LYSAYFLQPHLPIHNNFLFSYISSSC 


12647 


26548 


A 


12773 


2 


1055 


FFFFLRLNLTIiLPRLE / CVILAHCNLHF 
PG*SNSPASASRVAEITGSRHQARI IFV 
FLVEMGFHHVSQAGLE/ SPDLK* SAHLG 
LPKC*DYRRDITPGHH*SF**RYFI.KTY 
ISRVL/ WCSLU3TFMFMIAFDHM* TW*H 
*QNHFHFLLQIWKIiISERLRSAQGQQAL 
KrVTSTFSSKYVCRAFIiAEPGWIAILYV 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny lalanine. 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TF*SLDSRRTCLWAVSNFFYSTLFLKI. " 
RQQVLLTG/ P YHLFSARPSS CQSDKKAR 
ELNLSLSL\CFFFFGKRVSGTQAGVQ*H 
DHSSLQLWPSRLKQCSHLSIiLSSWDYRS 
ITPHLANF/CNFLSGWSRTPGLK 


12648 


26549 


A 


12774 


315 


98 


FRCFXiNSINPIjESPHVRKTCAGRAQWIiA 
PVIPPFWEAEA\SQMPQVIHSR+LEARS 
SRPAWPTWQKPCLKYKN 


12649 


26550 


A 


12775 


275 


59 


NPSPYKIQKISWAWWYMPVVLATR \RLQ 
♦AEIAPLHSSVGDRARPCLPKKQKTNKQ 
TKTSGEPVLVQGPATVC 


12650 


26551 


A 


12776 


73 


381 


SIXNSELSSIPRPTWESPSTSDSAWPMQ 
NLASEPLPLP* TTPPPDFPALSA/PE /P 
LPFTPSPRDTLPPPLSPSYKQAPVLSVH 
PPVMPLKYVPSSPLLLSFLYL 


12651 


26552 


A 


12777 


274 


342 


LNC V I A* W1WP I I SALWEAEVGVS FA/ P 
/RSLRPAWATW 


12652 


26553 


A 


12778 


2 


478 


TIYYTKYTTFRVPNIiGLQLLTAFFCVCV 
CVSLiNGFKNAKDYGGSHLKYSVYITGFL 
LQFSIiKFDCFLCVCVCVFP*MA*KMPKI 
M/RGSHLiKYSVYMT*FLF* FSLKLKYFC 
DPILRKRI/WPGAMAYTCNPSTLGG*GG 
QIT*AQEFETSLAKWKPCLY 


12653 


26554 


A 


12779 


680 


453 


SETDWRKNKFFPSNFPSNLRTNSWTNSC 
S KKT FKKH /R VGLGVLAHTCNP STLGGR 
GGWS P* GQEFENSLTNMVNHFS 


12654 


26555 


A 


12780 


59 


487 


SLHKHPERGLQSFQGCWTQEMLGGSHVQ 
QRAWELCAPPNIjTCPGYIiSFFLiRQDQAli 
LSKLECSGTIPAHRNPSLPSPQIjKVSSH 
LSFPSSWNSRCTTPHQLLLLFF*DRIRL 
FCPSWSAVAQSQPTVT/SSLPSPQLKVS 
SHLS FPS SWNSRCTTPHQLLLLFFNFL * 
RQVFTMLPRLVSNS * VQAILPPWPPKVL 
RIiQA 


12655 


26556 


A 


12781 


109 


361 


LSPWHHSRNYTLIYPYTLNFWMSAFPTP 
KEEFFFSFPETRPHSVAQAGAHW\AI IA 
HCGLDLLGS\SDPPTSAS*AAGTTAI IP 
G 


12656 


26557 


A 


12782 


3 


204 


LIDGSIjALSC* LFCGGAI iahcslellg 
LGQ \ S P VS ATR VAETTGVCHHAQLAFVN 
LHLRTRS KHCGH 


12657 


26558 


A 


12783 


2357 


6366 


LTGS \NSHITILTIiNI/NMGLNAPI*RH 
RL\ANWIKSQDPSVCCIQETHLT\CRDT 
HRLKIKGWRKIYPSPMGKQKKKAGVAI\ 
LVSDKTDFNPTKIKRDKEGHYIMVKGS I 
QQE E LT I LiNI YAP \NTGAPRF I KQVLS D 
LQRDLDSHTLI \MGD\FNTPLSTLDRST 
RQKVNKDTQELNS ALHQADL \ I D I YRTL 
HP\KSTEYTFF\SAPHHTYSQNWTTIVG 
S KALLS KCKRTE I IT \NYLSD \HS A\ I K 
L.KLR I KNLTQNHSTT \ WKLNNLLLND Y\ 
WVHNEMKAE I KMFFETNE /NKKTPTYQN 
FWDAFKAVCRGKFI \ALNAHKRKQE\RS 
KIDTLTSQLKKTREAKSKQHSKASRRQE 
I TKIRA\ ELKE I ETQKKTLQP\liKKISE 
SRSWFF\ERINKIS\RPLARLIKKKREK 
NQIDT I KNDKGD I \ TT0LTE I QTTI RE Y 
YKHLYA\NKLENLEGMDKFLDTYVSLPR 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Afanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny lalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysinc, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
. X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LNQEEVESLNRP \ ITGXSAI VAI INS \h 
PTKKSP\GPDGF\TAEF\YQRYKEELVP 
\ FLLKLFQLI EKEG I LPNS FYEAS I I L I 
PKPGRDTI KKENFRP I SLMNI DAKILNK 
ILAKRIQQHI KKL IHHDQVGFI PG\MQG 
WFNIRKSINVIQHINRAKDKNHMIISID 
AEKGFDKIQQPFMLKTINKIGIDGT/YY 
FKI IRRAI YDKPTANI ILNGQKLEAFPL 
KTGTRTGMPSLTSPLIiFNIVFCPIVFWA 
R\AIRQEKEIKGIQLiGKEEVKI»SL>FADD 
MIVDLENP I VS AQNLLKL I SNFS KVSGY 
KINAQKSQAFLYTNNRQT/EEAKS*LMS 
ELMSELPFTIASKRIKYLGIQLTRDVKD 
LFKENYKL\PLIKEIKED\TNKWKNIFI 
PCLWVGRIS\IMKMAIL\PKVIYRFNAI 
P IKLPMTFFTELE K\TTIiKF I WNQKR S \ 
RIAKS I LSQKNRAGGITLSDFKIi YYKAT 
VTKTAWHW YQNSMVLVP KQRY I DQWNRT 
EPSEIIPHIYTILI\F\DKPLLEKNKQW 
GK\DSL\ FIKW\ CW\ENWLA\ I CRKLNL 
\DPF\LTPYTKINSRWIKKKDLNVRPKT 
IKTLEENLGITIQDIGVGKDFMSKTPKA 
\ MATKAKIDKWDIj I KLKS FC/TI CTAKE 
TTIRVNRQPTKWEKI FATYSSDKGLI SR 
IYNELKQIYKKKKKTTPSKKWTK\DMNR 
HFSKE\DIYAA\KKHMKK\CSS\SLAIR 
EMQ\ I KTT\MRYHLTPVRMAI I KK\SGN 
NRCWRGCGEIGTLLHCWWDCKLVQPLWK 
SLWRFIiRDLELE IPVDPVI PLLGI YPED 
YES CCYKDrCTRMFIAALFT I AKTWNQP 
KCPTMIDWIKKMWHIYTMEYYAAIKNDE 
FMSFVGTWMKLETI ILSKLSQEQKTKPR 
IFSLIGGN 


12658 


26559 


A 


12784 


787 


926 


PQAIRRPRPPKALG*HNSVDLGWAWWFT 
PAIPTLWDYTHEPLYIAKIS * CFKKVYK 
FVLNCIQNCPGPHAARHS/SGKTS * V* S 
NYLFFS FCFETKFCS VTQARLQWHDLNS 
LQPPPPGFKRFSCLSIiPGGWDYRRLHTR 
SANFCIFSRNRVSPSWPGWSPTPDL.RRS 
AVLGLPKRWDYRREPPCPAKIYTIMAPQ 
KVNSHSSQPVSFLS 


12659 


26560 


A 


12785 


248 


285 


C*VIF*CVMILYVLIYVEKITSIRIAVY 
W/CI ITRLFVK* ICSCIiLGC* FLFFNV 
C* LY I F 1 1 FVFFFFFFFFFFFFFFFELN 
ILY 


12660 


26561 


A 


12786 


243 


37 


RRSAGHGGSCL*SQYFGRPTRADHKVRS 
N/RGETLSLVKIQKVSQAWW*EPWPAT 
RQAEAGADAWVDR 


12661 


26562 


A 


12787 


75 


289 


P YRHS PRFFFL YTFFEMDSHS VARAGEH 
WCDLGSL * PST \ * GS SDSSASTSR\QRH 
HAWL I FGFL* RRGFAC 


12662 


26563 


A 


12788 


83 


256 


RKMYWGLFVTAQN* K* PKYPSTEE/YI 
K/RLW YLHTMEYYS AI KIEKLGTRAQNR 
LILY 


12663 


26564 


A 


12789 


197 


499 


QSLKSVEETVFRNWKQ/ SIPTFQVILMP 
WWLIPII*ATWEG*GRRSPLNPGV*DQP 
GQHSKTPSLQKKTKFS * AWWRVPWPA\ 
IGEAEVSGI PLSPERSRL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H= Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


12664 


26565 


A 


12790 


394 


91 


FYNSTKFKWKNIQFLITRCRGDQIQVIR 
AHAVQLGFHVLYF/CLSQNLS I YLYIS I 
YVYTYRYIHTHMCIHI I * YLYI YTYIVI 
KH* KEHWW I KHCFPTERK 


12665 


26566 


A 


12791 


735 


75 


FFFRQEGFLSPPKRGGVPPPHTKKYFSP 
RGSFLWGWGTSRPP PRKCFS PPPPPVFL 
PPPKKKKIISFSPTKIiAPPPEYFKSPPP 
PS \PPPPSSPSPSTFFFFKNFPFS PSS F 
SSPPFFFPSPAWEDPPVQSRRIY*FLPP 
PF*IPEIPR\RVLKKKPGEGRSWGLFPP 
PKGPKRRVPLGPEFKTRLGHKTKPRFFK 
KKKKKKRFLISCLWWHMPAI PSTliGG 


12666 


26567 


A 


12792 


482 


342 


MGVFVFFFETGSHSSVTWAGVQ* CNLSL 
LQP/ PASSDPPTSDSRVAGST 


12667 


26568 


A 


12793 


405 


169 


FYKKKKTGSGLALSPRLECSGTI IAHCN" 
LELXX3SRSLRL\SNTPTLAS *AARTVSV 
CHQTHL IF* FFVETG S CYVASA 


12668 


26569 


A 


12794 


270 


94 


YVCYSYKQIKICSHVKRRE*NTFKIYLL 
KYIALGLTGHFFPL*TFSKTVTLWPGAV 
AHRCNIiSPLEGQGRWI I /R/GQE I KTIL 
ANM* IPVFLHVNI FLSVCS YS IHTTLDY 
LFYYSCYILISL 


12669 


26570 


A 


12795 


284 


415 


NFIKI IKKKVMGIiFYGKTFLLNLKKKKR 
KKK*WPGTVPHA*NPST\LGGLGGWITR 
GQEFETS TLATN * NPTKS ISRGF 


12670 


26571 


A 


12796 


1387 


32 


APSSFAIRSFFSGPMNAFFSSMVWKRPW 
PNLEWSMNLRSIFSRARRFVCTSKDLR 
RVSTRFLVHLTAFLGYKAGMTHIVREVD 
RPGSKVNKKEWEAVTIVETPPMWVGI 
VGYVETPRGLRTFKTVFAEHISDECKRR 
FYKNWHKS KKKAFTKYCKKWQDEDGKKQ 
LEKDFSSMKKYCQVIRVIAHTQMRLLPL 
RQKKAHIjMGDQVERGAPVPEKAD\WAPR 
EALSSKVLVTQVFWAGI K* SNF IGGDPR 
AKGYKGGQPVCWAHPRKLPPQRPHPRAL 
RKGGPVLGAW\HP \ARVA\ FS VARRWGR 
KGLPFHRTEINKKIYKIGQGYLIKDGKI, 
I KNNASTDYDLSDKS INPLGGFVHYGEV 
TNDFVML KGCWGTKKR VLTLRKS LLVQ 
TKRRACiEKIDLKFIDTTSKFGHGRFQTM 
EEKKAFMGPLKKDRIAKEEGA 


12671 


26572 


A 


12797 


86 


364 


EQDNRI FFS FLSLFFFFGETEFGPFAQG 
GGQGAILGPLKPPPRGL\SSFPT*GSQE 
VGTTGAPHD I FCF FNKKGET PRX.YKKNK 
NNRGGGATP 


12672 


26573 


A 


12798 


101 


440 


HCSRYATGIFEWYSGILVLILLRFGIYE 
PMNLNI GDP YDS PPS RLYKMRLWERMAK 
SLIEVSLKNSHFWLGMLAHACNPRTLRG 
IGGRIA*TWGF\KTSLGNIARPHLYLKK 
I 


12673 


26574 


A 


12799 


364 


195 


NPPGDLGEETLLWGGE I MGTTP P * FPPP 
KEGFLPQDPPGGFNSPPI/QGRSFSFPP 
PGKFGPPQGFFKKPPPFFFFLNNNQ 


12674 


26575 


A 


12800 


86 


489 


PTAMAEEG IAAGGVMD INAALQELLKTA" " 
LIHDGLAHGIRKATKALDKRQAHLCVLA 
SNGDETVYVKLVEALCAEHQINLIKVDD 
/WKKLGEWVGLCQIDREGKPRKWGCS C 
IWKDYGKESQAKHVIRENF*CKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12675 


26576 


A 


12801 


64 


211 


GNHKKSNYSLAPWWYTTWPAT* EAEAG 
GSLE\L*CAMITPVNSHSSLA 


12676 


26577 


A 


12802 


300 


3 


NNVTTGSNIQFHSFPLIRKDICVSYIKN 
DAWNY I KNSNPNGE TQVYQ* YS PSN*NL 
KIWLGKVA\NPSTLGGHGGRIA*A*KFK 
TSMGNIVTPCLINKQK 


12677 


26578 


A 


12803 


208 


394 


SQHFERPRREDCLS \QEGQYSE I PSVQK 
NVKI S WAW * CMPAI PATREAE AGGPLE P 
RRSRLE 


12678 


26579 


A 


12804 


183 


348 


AFILGKGKYEPYNPCLGWA*WCMPWIiA 
SWEAEAGG\SLEPRSSEL*CAMLiIGCIjH 


12679 


26580 


A 


12805 


397 


238 


ETEFPFCCPGWKWGDLGSQQPLPPGFKR 
FSCFSLPSK\WGQ*SIiAPVGKGCK 


12680 


26581 


A 


12806 


1 


293 


PTRPRTSYEKQGYLLLPPVFS I VLEVLA 
RAISQETE I KSVQMGKEEVKLSIAFVCV 
YIYA* ENPVESTKTVDANLFDTRDWFHA 
IFPQTVGEVGMVLG 


12681 


26582 


A 


12807 


297 


281 


AKNPRRQPREIPGGVFPTPGFPPYFKAK 
MPEGPFPGGFFFKGGVLWGTPPPFFFFF 
FFSETESRFVTR\LQCSGAISAHCKLRL 
PGSCHSPETC**R*G 


12682 


26583 


A 


12808 


255 


59 


AGCSGSQLFGRPRGVDHLSPGV*DQSVQ 
HGEILSTKNTKISWVWWSVPWG\EVGR 
SLEPGR * RLE 


12683 


26584 


A 


12809 


247 


310 


PLHSILGTPTTPAPTPGKPLLIRTSILY 
* KPILSQAWWCAPIiVPATQ/EAEARGSL 
EPRDSRRS*AMIKPVNSHCTPSWALPRP 
LPQPLESHC 


12684 


26585 


A 


12810 


101 


477 


IGKEEI KLLIiFPDNLMEYTVNF* MYKPL 
ELIS * FSKVTEYQVNTEK/SNCI YT*QL 
QIENEIAICTI*FMIASKSIKYIiKISLTK 
CN\KWRDILCICIGRLSIIKVLVL,PKLM 
CGGEKIFNPNPTGFW 


12685 


26586 


A 


12811 


1022 


696 


CFFVLFFKEMGSHYVAQAGVRWCDHGSL 
YPPTP\GSSDPPTSAS*VAGTIGAHHHT 
WLIFKFFVETRSCFIiIQAGLKLliASSYP 
S QPRTS P KCLG FTDVKS LHLAWIiCP 


12686 


26587 


A 


12812 


166 


143 


RKEERFQINNLGFHVEKLEKEEQI \NPK 
TNRKKKI I KTRV* INEIEYRKTIGK\IN 
KNKSWFF*KINKIDKPLAMLDSGRRDSN 
Y*NITKS IEEGRRRKEEAGKRKEEGRKR 
RRRWRQRWR 


12687 


26588 


A 


12813 


163 


423 


KAFWEFINHKKTGPF*RGPIGKVSPPGG 
NLASSI*KG/DPFPSPLKGPQGSSGGPP 
PKKPPTGFNLAFGGLRGRETKRGPFFLA 
GLLP 


12688 


26589 


A 


12814 


2521 


2842 


INTMYFPP* EMLVGWAWWF/RASNPQHF 
GRLRQADRLRSGVRDQSGQHGETPRLLK 
NTKISWAWWRAPVIPATWEAEAGESLEP 
GREGCSEPRSCHRTPVWVTRMRIiY 


12689 


26590 


A 


12815 


173 


395 


LDQPGQQRETLSLLKKKQTNKQTKKKNG 
FKKFL*LDTWVHTY\NPTPLGGRGGGFF 
* GQDFETTIANMGKRGFY 


12690 


26591 


A 


12816 


43 


346 


CVYVS/GVCVCVCVCVCLASVCMYVCLH 
L*VLFGIFYV*DHNICKIiRSFCFIiFKNT 
FSHVYF*KI I WPGVEL * VTVFL FLS SLL 
HTSCVYEKSYAILIFSAIi 


12691 


26592 


A 


12817 


647 


931 


SQHFGRPRRADHLRSGVQDQPGHHGETP 
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Predict- 
ed end 
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location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G = Glycine, H = Histidine, l=Isoleucine 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PLLKI \ QKLAGRGGA* L * SQJQLGRLKQE 
NRLNPGIRSCSEPGSHHCTPAWATE *DS 
VLKKKIGADKY 


12692 


26593 


A 


12819 


392 


2 


PLFFFFFFFLETESHSVTQAGVHNPPGP 
SKP PTSAP+AAGTTGMHYSAHAQ/LI FF 
Ii*TELPSCRPGCSAMAVHRHDPFTDQHG 
SAS FI»I WASS PLLRQTGDSSLLGGHHVD 
AKFSVDTQLALCTKAQNS 


12693 


26594 


A 


12820 


1183 


870 


DRVSLLTPRLGVAVARNLAQWKPLPPGF 
KRFS CLSLPSS WDYKHTP PTPG * FH/RG 
*FFVFLVETGFRHASQ\AGLNSLPQSDP 
PTLA/ FPKCW\DYR\HDHLAWPRKM 


12694 


26595 


A 


12821 


172 


298 


iylsiylpi/ylcihlsihpsiypylyl 
sinpsywvsfsrel*lt*l*ylsiyi.pi 
iyvsiylsihpsihisiyplihpigflf 


12695 


26596 


A 


12822 


51 


254 


YDFLKKHLLSSYRAKDLESLFLLHLHIF " 
SGN*SVSYICVCVCVCVCICVCVYICIL 
IKSWCS*S/CGVIi*AICAGVGWC*LIFG 
VL 


12696 


26597 


A 


12823 


199 


668 


QPLPAPSPSAQKGRAHPTPSPARPQAE/ 
SLPSQSLGTRTRPPLGRPP*SPPWGQVD 
L WG WLG P S PTL P APE SLS PAS LS TEWL 
CGTRYLVWGRSGAGQA 


12697 


26598 


A 


12824 


33 


442 


GRGKLLKKSQKQRKNALLKYDSKTRVGT 
LFHWTS VP VTRPLLL * INIRL I KL.DVI K 
CS IHKI FHSEFLFFKQHSGL/CI * LSK* 
LCS * KIHQSGLGTVAHVCNPSALGGQDR 
RIT*GQEFETN*VNMVGPCLYPPLPP 


12698 


26599 


A 


12826 


363 


75 


HWEPRPAGPTLPSAPGSPSGRPCPPHTT " 
SG/PPPPNTHTPMSRSPWSWEGSDQRPQ 
PPHTDTGSPPN*GSRLRHERPPWGRERQ 
RPPPTTLVRPLHSH 


12699 


26600 


A 


12827 


76 


486 


CTVNLFIYFEMESCSWQTGVQWCDLSS 
LQPLI WFSCVPTQISS* IPMCCGRDLVE 
GP*MMGAINSFLCCSYDSK*VS /ARSDL 
FFFFFFFF 


12700 


26601 


A 


12828 


345 


672 


KMKCTADIjS IiLEKDT * I KL E ENR * QER I 
MLRAEVNEIENKIENINKTGGSSSEKYY- 
KIDNPLTRSIRRKVN\KITSIRNEKGSV 
TTNPTEIKRTVKEYIMKNFMPINLTT 


12701 


26602 


A 


12829 


208 


1 


PSNSTP I FKRNENI CPHKDPCTLMLMAA ~ 

LFVMSKK*KQLKCSSNDE\KLWYIHMME 

*YIAIKRNKLLI 


12702 


26603 


A 


12830 


649 


172 


FPLKYSLSTTSVMAPSFTSCGGSQQVIjL 
APLSRLLAWPNV/HPPPQPRHPAPHSFL 
SPPRSPPNYSP*SSQGPLLQPQSKHVRP 
LPQAKARLPSAKNPARGPPGAAQAPRDR 
PPA/VCPPLPPHS PLRHERAPRLFLPLP 
ATEPAALHAVHPGRRQERVT 


12703 


26604 


A 


12831 


1068 


667 


KLSWFFCSRLRLHFGS IFKVPSL / P VPK 
LLLP I PSLCPSLPLQLQPAQPVPFNTAT 
WLCFC*/P* RV* P APG YQRVGHFNF FPL 
QLNFIPIILQ*HLGF*KSASLSSSSSPK 
KVSLCHPGCSAWRSAHCNLKLLIS*SA 
CLGLPKC* DYRLQS PRPP/ PTI FKVTKM 
CLLKSLTGSVSF 


12704 


26605 


A 


12832 | 268 


1 


KCFFLLALNWPE FILDYFILNW *NS LRT 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=L.ysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, i 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VTEKFLFMTLF1>TY*NMNRPGMVAHACN 
PS / TIjGGSGGRIT* AQES * TSLGNI VI P 
CLYQKL 


12705 


26606 


A 


12833 


654 


321 


LVG IRFLKVKYFNI KFPHLLAI I SPS LE 
VFNLQHFYIYFLLFIYF*ETRSCSVA\R 
LECSGVIIVHCSLKliLGSSNPPALASQV 
AGTITAHHCAQOHFCILEKSNLCTYFPL 


12706 


26607 


A 


12834 


867 


201 


RQMSPTLRTKSRDVASRSSFSAWAWGKL 
FTFWDFSMSLVSLQGPGGPFHS PQP * TQ 
GR*VENNQEPLATPFPGPERSLPGQPRL 
EPAHSGKRRGIiPI.EIRPPVNVSDIHGDP 
SGAQWLVRGVEAGPWAVGGQAP*SEGPS 
PSPPLDVSPDPGSSDPRSSSP*MDLRA* 
LSLTF *TFQ KGSGN/DPAS S PA\GKPG * 
EWRKI KGPPQGLIiIjGPEMWGSQARVLS 


12707 


26608 


A 


12835 


328 


1365 


YAFCLMMLDKKQI \*AVFLFEFKMVHKA 
AKTTQNISNTFGPGTANKHTVQWWLKKF 
CKEES LEDEERDMGHGKVDNDP I * E P \ Y 
*TT * K I TEELS VDYS TVI R WQHS KQ IG 
KVKKIiDKWVPHE LSGNQNYRFE V* SS FM 
IiRNNNPFLNRIVTCMEK* ILYN/RPAQW 
LGPRGQLQSQFPKPNLAPKKKRSLVHW/ 
SPVGIiliAPILDPTYSFLNP/GGETITSE 
KLCSGKIiGKMHGKLQYLLPALVNRKGPI 
\LLHDNT/RDCVFAQPVLQKLKELG\YK 
VLPHPP YSPDLS PSDYHFC * HLiDNFLQG 
KH/SQHGAENAFQEFVKS*STDFYATGI 
NK/LFSHWQKCVDCNGS CFD 


12708 


26609 


A 


12836 


7 


328 


RRERERERERERERERERERERERERER 
ERERERERERERERERERERERERERER 
ERERERERERERERERERERARV*EKST 
RYKS PPPRI P PLRR/RRV* RKHPHARAL 
S FFLYKKCTGGERAPS P VCVGGKFTW 


12709 


26610 


A 


12837 


219 


350 


PFNHTYATWS III SNVQVCFMRAQDIYIi 
LIYLFEAGSCSVA\RLECSDMIMIiHCSIi 
DLPGSNNPPISAT* I 


12710 


26611 


A 


12838 


189 


359 


LGLDGVMRVGPCRALGPS*EEKSSRVQW 
LTP VI PA/ LLEAEAGGS PE VRS LRPAW P 
TW 


12711 


26612 


A 


12839 


368 


77 


KNPNFLKFGSKPMGPP I YS P P \ LEG * AG 
GFINPGF*TPPGYMGKPPFFLKYSNIjPG 
LAAPGGCSPFPGGLGRKISFTPEMEVS I 

NPGS p ls lppgeqn 


12712 


26613 


A 


12840 


16 


173 


KTDVHSKTCTKLFTAALFLIVKKWKHLK 
PP*VD/EINKMWYNHIVEYYIiAIKS 


12713 


26614 


A 


12841 


425 


45 


NSFVFFFFLFENRVLTPVAPRLEXCTGV 
IL/APHCTLPALPEFKRFLVPSAT.iLSSW 
D YRPVP VMPWLNFC I FVE TGFH \ HVAQS 
VLKLVSLKL/PSPTWDVPKC*DFRYSVR 
CGGLFF 


12714 


26615 


A 


12842 


237 


375 


LIJjWVY I LKKLW \ LiGWAHACNPSTLG 
GQGGWIL*GREFETSQVNM 


12715 


26616 


A 


12843 


21 


325 


TSFFFLESYSVAQAG\QWCDPGSLQPQP 
PG/ SQ/DNPPTSG* VAVTTG/MHHHARL 
IFVFFCRDGILLYCPGRSKI IFSSCIRE 
LFKKI S D FFLLTML I FCNSNKG 


12716 


26617 


A 


12844 


369 


58 


PISPIiQFMIjPFIjKNQSPYGPFFWKKKKF 
PPPFIiGGAVLKTPGF*NFLFKKIQRGFP 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib)e 
nucleotide deletion, Impossible 
nucleotide insertion 














PFPKKKKPRERPKPCPPKFG/SPPFFFY " 
PGFPPPKKGPPPPFFFFFFF 


12717 


26618 


A 


12845 


211 


409 


NALKPKIHFFFSGGGLKGIVWGNTLLDI " 
GLHKTFFF\*GDFYYAL*I *AENALFGG 
GGDCAPHLQVK 


12718 


26619 


A 


12846 


200 


33 


QKTNNTKCWQGWGATGTLIHF*GEFTRV 
QSHWK I VWKFL * /SLNILP/PSSSSSSS 
SSSSSSSVKNLCLHKACTWMFIAALLLI. 
AKTWKQP * CE 


12719 


26620 


A 


12847 


416 


125 


KIPTRPGKGGPPLYPRPFRGLNKQIGLT 
PEFGTPLGNKGKSFPFFQKNKN\NWPRG 
GGPIGPPSPGG*VGGLTLKGEVSTKLDS 
PPAPPPGGQKWPLP 


12720 


26621 


A 


12848 


85 


397 


KIHIFFIFLiVSIjIjKGIiSFLLTFSKKr*VF 
/SFIDFLILiCVCVCIEFYCFILILFHYF , 
CLFGFCLFLMFVS * CQSLDFYFNS F/YI 
F * YKIQCYKF/ CI L I F 


12721 


26622 


A 


12849 


1 


246 


RPRRHLVASLVLTQLNGSSPLP IKIISW 
AWWLMP VI PTF WEAE VGGLLE PRS SRQ * 
*AMI\SPCPPAWAT 


12722 


26623 


A 


12850 


161 


1 


NKTTSSQIW*PMPIVSTT*EAEVGGSLE 
PRS /I)/KI>KCAM I S PVNSHCTPTWTTK 


12723 


26624 


A 


12851 


65 


244 


STYYLHSLPFLD IS YKWNHTTC \ SRYFM 
GALFL I AI K* KQP KRPS ADEQI * T/ IMW 
YIYTLEYDLAIKRNEILIHAATWMNLKN 
IMLSKRSQTQKT\HW* fhl 


12724 


26625 


A 


12852 


235 


34 


TTIAVSVLILKELMEP'TLQPDFLG/WS 
LITESVDHVQWHAPVI PALWEGEVEGLL 
EPRSLRPAWATW 


12725 


26626 


A 


12853 


168 


453 


CQLRGASGTQGPGLSESQCQQP CP / PEC 
RPSKPRPCGCRI*SPARTSPQPSPAASM 
ALPTNERTP PPALKW * PK/CPKQS PQSA 
KSKSPVKSTERTAK 


12726 


26627 


A 


12854 


244 


3 


ELQVYMHI*IYI/'C*SIYTKANIYIKYM 
HMCVYIYTHTHTKVCIHLHIYLLRHGFG 
CWLMLGKVRGCEGMAGLMKAQQGWGW 


12727 


26628 


A 


12855 


376 


428 


KWEPGGHTS FFFFKGLVLNFGRGVLQKT 
RP*GGGQGGKFNPNFSGPREPPPP/PPP 
GGGKKGPP P PPGVFVFFLEKGGS P I LPR 
GVLNPGAPKG VYTQRGG I KSGNQGATPL 
SFFLKVWF 


12728 


26629 


A 


12856 


129 


395 


APPNTHPFLSEAPQSLSLEARPPSAPSN 
SSSR/PPVWRASQTHPPQDWSSHCSPPW 
GAPPQ I * PFSDSLAHP PQDWSSHCS P PW 
GAPPQI * PFSDSLAHPPQDWSSHCS PPW 
GAPPQI 


12729 


26630 


A 


12857 


363 


97 


GLATLRRLVSNSQAQVIRPPQPPEVLSL 
PKC*DYRPGRPPASLIHVNSRQT*KVAP 
/ CSKPIAAAAHQGSPGATETTRRCPPSR 
CGPIF 


12730 


26631 


A 


12858 


322 


361 


KSEKQS * VMIAVCTLDMMKMT\FIS WF 
LPKTHN/LMSNYKKNTRQIPMEGHSTIY 
LTRI PQNCQGHQKQ * KSEKQSQQES KKK 


12731 


26632 


A 


12859 


88 


417 


HFTFFFFFFFFKRGGGFFSPRLKRLGKN " 
FFFLDPPPPG*RDFSPSPFKEGGF*KP/ 
SPPPLVFFFFF*KKKGFPIiGAGGF*TSG 
PGETPPLYS PE VWFF YGGTPGPTPFFF 


| 26633 


A 


12860 


217 


2 | lilinklanmhcflfvfetsijT*agvqw ■ 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HDL/GLLLRLAPPRFKQFFCLILPSSWD 
YRCALLHPATREDEA 


12733 


26634 


A 


12861 


375 


145 


I»SFFFFF* kpgscsvaqagvqwhahssl 

QLQTP\GFSDSLNSAFSVARPVRVHCHT 
RPLSPFNI CWRARPKQKVFL 


12734 


26635 


A 


12862 


525 


852 


NLLS I YFFETESRS VAQAGVQWCHLGSL 
QPLPPRF* IKKPSNTALFM*NIjLSIYFF 
ETESRSVA\RLECSGAILAHCNBCLPGS 
SDSPASTSRVAGVHHHSHLSSWDYSCAP 
PCIANFFVFF*VETGFHYVSQDGL/DTS 
* PQVI CPS RPPKVLGLQA 


12735 


26636 


A 


12863 


3093 


204 


E PD KTGP VIiW KVGGG ARVPGMAETLS GL 

gdsgaagaaalssassetgtrrlsdlrv 
i dlraelrkrnvds sgnks vlmerlkka 
iedeggwpdeieitsegnkktskrsskg 
r kpeeegvedngle ensgdgqedvet s l 
enlqdidimdisvldeaeid\ngsvadc 
vedd\dadnlqeslsdsrelv\egemke 
\ lpeqlq \ ehai edketinnldtsssdf 
tilqeieepslep\enekildiiAget* 
rsepvn*esseleqpfaqdtssvgpdrk 
iaeeedlfdsahp\eegdl\dlasest\ 
ahaqs s kadsliiawkrepaeqpgdger 
tdcepvglepaveqs /svaa.selaeass 
eelaeapteaps peardskedgrkfdfd 
a\cne vppapkes sts egad \ qkmss pe 
ddsdtkrls keekgrs scg \ rkflglvg 

LSSTTRATDLK\NIjFRQIWGRVGGAPRI. 
WTNARS PGS FAVYG FVTMSTAEEATKC I 
NHLHKTELHGKM I S VE KAKNE P WKKTS 
\ PKRGSDGKKEKSSNSDRSTNLKR\DDK 
CDRTDDAKRCDDESVEKSK\DKNDQK\ P 
GPSERSRATKSRKSRGPKRTW\MDKSK 
\GVPVISVKTSGSKERA\SKSQ\DR\KS 
ASREKRS WSFDKVKEPRKSRDS \ESHS 
RVSERSEREQR\MQA\QWEREERERliE I 
ARERLAF\QRQRLERE\RMERERLERER 
MHVEHERRREQERI \HRE\REELRRQQE 
LRYEQERRPAVRRPYDLDRRDDAYWPEA 
\ KRAALDERYHSDFNRQERFHD FDHRDR 
GRYPDHSV\DRREGSRSMMG/SIREGQH 
YI.ERH\SDPEPH\GQDSIjR*IiGWGYEIi* 
Q*RLS\EGRGIjPSFPPGAGRDWGDTWLE 

deddpvmgkgtaergpmmdr\dhk\rwq 
\ggersmsghsgpghmmnrggmsgrgsf 
apggasrghp I phggmqggfggqsrgsr 

PSDARFTRRY 


12736 


26637 


A 


12864 


2 


456 


ihlgsgsegdgssgglgrgnsntsrfss 
s s w argdnvprp pp pavcs w i s egdvqn 
pgiigeagagsstpgdggelry/wpgllg 
a* grgiggddddlrtlglagvqqcgkg/ 

RRGPRGPGRGQEPRRPRDLGLRGPRRAQ 
GAA/SATAAPPP 


12737 


26638 


A 


12865 


388 


1 


L I KMTKVKKTAITKC* RGWGRTGAFIHC 
SWECKMIQPI.WQTVWQI.LKKLNIYlAWK 
QPKCPLM/VKWIKKLYISMDSYSAVKKK 
ETr J MYTTA*MTLEVALLSERSQTRKGER 
GHTLWDS I YINFQKMQTNU 
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nucle- 
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ding to 
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acid 

residue of 

peptide 

sequence 


Amino acid sequence {A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G— Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12738 


26639 


A 


12866 


198 


3 


KKQKQTKNPPKIS *ALWCVPWPATWGA 
KVEGS PE\ PRRSRIiQ* AM I TPLHSSL CN 
RVRTRGRTRG 


12739 


26640 


A 


12867 


415 


973 


NPVNCAQFSLLPTTEYMGHRVEGATGHG 
TCP SVP PNTHLHTGWWCLQHKRS CRAWG 
RGGSHTHRCPQRVPDGEHP * I YIAVHGE 
PP/ESPPQPSPLSCCPPQGNYIALREPP 
QGLPLPGTLPSHPPPFWHICKTHSSESR 
HPPFPGFCGIiEIiEKGVGDFI IHPPLTIiP 
KPLPNPLKPEPTPTPQTHI 


12740 


26641 


A 


12869 


715 


893 


PCVAPHSIREERFRLTSESTNQKVLWWG 
ID*T/VILILTGNWQKRHLKSFFLGQKP 
GGMAP 


12741 


26642 


A 


12870 


342 


58 


KKKKPKENEQS i wdmwdnfrtsdi *ttg 
vleeqfre*grkiifkeivrenssnltk 

/QINLLTQEV*TTT/HKNMNKSTPIHII • 
IKLMKTTTKEKI 


12742 


26643 


A 


12871 


110 


401 


SFYSVMILKASRIVICLFT*LYAYIHTJj 
YIYTLCI YYAYYVNI YICTHI \NIHI I I 
HVYT YY I VYAS I YYS MYSAWYN I KVS KD 
RQTYENIAIilTYY 


12743 


26644 


A 


12872 


439 


256 


SYFMRFGGDKISKLYQFILSRRWKQSTC 
PSMDEWINKMWH/ I TI MEYYSAK* RNEY 
SYMRHE 


12744 


26645 


A 


12873 


356 


43 


GTPERHSHSHS QGS P I IGAFMG I RAS F V 
YSLIjSCVTjLiMLSPQRVLFYFSAFKQQQS 
\ WSGKVAQAYNSNTYGKAGS IT* GQEFK 
TILGNIARPCFYKKKFLANS 


12745 


26646 


A 


12874 


341 


205 


REI KKMVLNKKFFS LQFLGKKKNFWE FF 
LKKKKKVFKAPPVFQF* VP I F/ CSF * KK 
KVFGFFFSPKIFFFFFFFFFFETESTSV 
TQE FNGT I IAHCNLGLQGS S S S PASAF * 
VAVSQDCAIELLGDRGRLCLEKKKKKKK 
KKNFGGKKKSKNI1FFI1KRKKMGT 


12746 


26647 


A 


12875 


474 


206 


PIRANGPVFSPPPPPPAPGTCPHPLVIM 
LSRHLRQSHPHTTVQPWTSGLLPSPPA* 
IjSXjETQHFKTAHF X FP * H * S S GLGPGWG 
SPVGEGPG/HPKLCLQPAAAAEAWPPRE 
GTKARRHEGLPPAP CAWGPAPRDAGSRL 
EFPTIiPTIPVSQSPGR*HGAGGRPSCliR 
AFVPSRGGQASAAAAGCRQSFGDLALLP 
LGSPILGQGQNFSARGR 


12747 


26648 


A 


12876 


257 


467 


YRVFTLLPRLEYIMAHCNLDLGSSDP/P 
QVAGTTLK* LSRASRVAGTTGVHHHSWL 
I FK/ YFL* RRWRRFSIj 


12748 


26649 


A 


12877 


867 


1409 


GPTVLVGGQDWNSVLQQPRAHSFTPRGG 
AGGLGGGYVLHKP I PFSQELLGAFPCQ K 
QYRVPPPLEAQLLRQARSPAFIY*AAVL 
CQAQGPSPSPCSPMVSPRPSGRAPHLGA 
APQSGQKALASTS S PDAEPV/ SVG * WCP 
SNPALQGGQtJGAEPRAVSITSGALSLYD 
CFLDGWWEGRSLGP 


12749 


26650 


A 


12878 


1416 


740 


IRNPRWVQLDVFAAAPMGRSSPLFLIiLF 
S FLKTVLRGSMI PSTVPS/FHKGPQACL 
YSLSLRS*SQAAPSPAPSSLPTPSPHPG 
LTQPGRGRRKADI S /CSHSWAF YVP /SL 
PHLPWACPLSLPOPVLFS ILFCFVLF*D 
GVIiCHPG\WSAMARSWAHCNLHLPGFK 
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Amino acid sequence (A<=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I— Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QFSCLSLPSSW\DYRCMPP\GI>G*F1,EF " 
* VEDGGFT I LVRLI WNS + PQVI HLPQP P 
KVLGLQA 


12750 


26651 


A 


12879 


961 


31 


PSPVRDSCVAWAGERRGRLGGSGVWRSH 
ARGPGLMRGLPASEPAPCVLGSRPCFRV 
AFETPWTSPASCMFERCPLWRPPALVAA 
GLD PFLLSS TLTIAVCPRDL I RSWPCKGC 
DVCDSRNCWSTHSRPACRDAILAPILAG 
RGECIGLWG/AAAACTGRV/TPVSGLTF 
STVFDFS SARG/ V* GGLPRS TGGPRVRG 
PGQPVPLPGLPAGVALAPLEGRPQEPAP 
ALSHSLTGSLFLGAWLCGCTRHGRAPVL 
LEVSIVACGQLCEAAPQIFIQPGLQQLP 
TPSATS PAYLWVDVLLTLPS VI KQKRVI 
SFLTKE 


12751 


26652 


A 


12880 


232 


28 


PAIiWVPWARFDGMEETGQNHWSKEKAW 
PGAVAHTCUPMTLGG*GGH/Mr*GQEFE 
TSLANMAKPRIPP 


12752 


26653 


A 


12881 


331 


1 


KNGPFFKNPPEFFPPPKWGVPPNPTPYF 
PWPEPIF*XiTPGPLKKQKIPFQ\KGEIL 
GWGGKMGNPLPFWKWGGPPRESKSSPRF 
YKTPPPI FKKKKKGGSSRSRTS PRV 


12753 


26654 


A 


12882 


370 


287 


VS ILAGCCGVAMI DNTVLVHKE VK I LR K 
* 1 1 KK* CI * * FRGHSK/HNFSG* AWWLM 
PVIQHFGRPR*GSLEARSLRPAWATWQD 
PHLYEK 


12754 


26655 


A 


12883 


656 


1168 


EPSRGVWPHEDARINGSKKKKSKDKKRK 
REDEETQIiD/ IVGI W*TVTNFGEISGTI 
AIEVDEGTYIHALNNGLFTLGAPHK/ER 
IALKPGYGKYLS INSDELW/GRSDAIG 
PREQWEPVFQNQEVRNGGPAEMGEEKRN 
GTKWREDTDHTS FPLFPS TGGQPKAHSN 
WRKVCH 


12755 


26656 


A 


12884 


205 


452 


ASGSAHRFRTFSFETGRVIiDKPSQTDHT 
NRKRGGPNVK*SFIj/WXGAAAHACNPNT 
LGGRGRRTA* AQE FRTSLGNTVRPCLY 


12756 


26657 


A 


12885 


112 


290 


KYNIIQYNIVFSCHDPLVSFNLFFFFFF 
GKGGFPLS PRLKARGRVWGYLNPPLPGK 
RNFWAPPPQGFEIKGLPPP*SPPFGGVG 
QAKNLGPGFQPPPAPMGKTPFF* KNQNN 
PNWGGQPIiNFKTLGGGGPKISF/PPGGG 
GLNNPKPSPGPSTWGKGGNPIjSQKKKKK 
KD 


12757 


26658 


A 


12886 


406 


190 


TRSIiARGGFGRPLQKPYWNEQSFIMAKK 
\YEQPKCPWTDKWVNKMWSILTVG*YAA 
MKRRDVLTCATITCR 


12758 


26659 


A 


12887 


440 


160 


RVCVCVCVCVCVGVFVCFALWLSSLIYN 
*VYLFLFVHNLGRVSFSYFLMIiI * FFEH 
VNHV/HNYVQMIQNTKLYLKSYLETFH/ 
CPC*FPPILLH 


12759 


26660 


A 


12888 


201 


411 


HIWDVILIRHALGNFLTSIS CYI VKIKL 
DAVAHTCNPNTLGGQGG/WIT* SQEFKI 
ILGNIGDPIiSTKKNK 


12760 


26661 


A 


12889 


138 


464 


SCTMNPSEMQRKAPPWRWRHRS *APSTH 
KMNRMVMSEQMKLPSTKKAEPPTWAQLK 
KLTE LA/ TKKKS LENTRGTQTS KNMLFA 
ALMI VSTGCAGVPS S SKETAT I EDKP 


12761 


26662 


A 


12890 


146 


366 | FS VSMKYFV* IH* FYHIj/KIjIjIjMVYIi* Y 
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correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutaraine, R = Arginine, S=Serine, 
"^Threonine, V=Valine, 
W=Tryptopnan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V— possible 
nucleotide insertion 














GFLAINTSQNISCSQKIMKLHL*GRGVV " 
AHACNPGTLGG * GE WMA* AQE FNTGLSN 
TAKPYLDF 


12762 


26663 


A 


12891 


368 


50 


KGRQSGVSSFPVGTVTNYTNLVAHNNRY 
LFPHS PGCQTSE VKVLA\GCSLLKASRG 
EGFFPLPGSGGS KCPLAYGC ITPISAS S 
CDLLLWVCVFSSYLAY*DTCHWI 


12763 


26664 


A 


12892 


227 


456 


KLTTLLKLFEK\ I EDRTLS KS FYEVS IT 
♦IPSSSSSSSSP/ISLINTDAKKINKIP 
RNRIQQCIKKIIFHKFSTGIYS 


12764 


26665 


A 


12893 


461 


89 


I YQNLSRKPDN I KCW \ * RCTE I QTL I HC 
W*QCKTMQLLWKI IWQFL* /ELNTEVPY 
DSIPLLDMY/PKKTEC/YTSTQTLIYCM 
NPIEYYSAIKRDKVLIHATTWMNLENI / 
ISKRRQSQKSTYCMIPLI 


12765 


26666 


A 


12894 


222 


185 


RIVMQYLHLNLGACVCVCVFVCD/ CVCL 
CV*MCVHIHI YI STHTY IH I YVCM* I CR 
LS**CEFYS 


12766 


26667 


A 


12895 


186 


166 


IFFFFFEMGSHSVTQARVQWHNHSSKQH 
PPP\GASDPPTSAY*VAKINIFFFF 


12767 


26668 


A 


12896 


55 


485 


TCWDCRDEPPRPALESVFLTSFLEDEFS 
IEIADLVPHPTPCSHPLQPP**VIIYFC 
S \KPLVTGSSS P PRKP FPSLHEPKSASL 
*PPSSTIQPVRSSPKG*PSPPLA*VQFP 
KGMTI PTLCLS P VLPVPSTI PLPNVS KP 
LPFY 


12768 


26669 


A 


12897 


923 


312 


VDVIRKVQIEITLRCHLIHVRIAIIKKT 
RANKCW*GCGEEGAIjVHC*WEC/NYILV 
QPLWKTVWRFLSKLKIELPYDPAIPLLG 
ISPKGNEII/CCTPKFIAAQFTIVKIQT 
QPRCS SMGKWIKKL/ W/H I ERS I DR*LE 
YYIjAFKKKAVLSFAKTWIDLEDIMLNEI 
SQTQKEKYCVISLICGIKKKKVKYIEIE 
NKTVITKVRIRWRK 


12769 


26670 


A 


12898 


320 


3 


ESWRWGRIVRPHSVPGETWLCPVPPGPP 
NAPDGSFPLAICPVPPSCRSL/CPP/DD 
CLASAGV+GGFPLPRTEASPHHSWACPQ 
EQPNPPQCKLKVGAPNSTALS EGS 


12770 


26671 


A 


12899 


138 


447 


IFVFHYKTQNIYNEDTLHVI I INLWCYP 
GSYIKSTGRLRPEVERGLGPTTMCC*FS 
SIKNIFYF*KLKSYRKL/WLGTVAHAYN 
PSTLGG*GRWIT*GQKFNRD 


12771 


26672 


A 


12900 


279 


1 


INQSIHNNIKSTYISQCKLIQP*WKIIW 
RFIR/NLKI ELL YVPATPPVGV* LKELK 
SAC*RDNCTT/M/FTAALFCIGYGSNLG 
NSTALIGSRVLFP 


12772 


26673 


A 


12901 


235 


183 


I INYGGGPPEKKKKGGKKI PLI FYKKI K 
KFLWPKKKKLFGGAKNPIV*AHHI FTV* 
I DVDTRAD FTS AT I \ I IAIPTGVKVFS * 
LATLHGSNMK* S AAVL * ALG F I 


12773 


26674 


A 


12902 


180 


19 


LFIYLFIYLFICEIRCHFVTQAGLQW\L 
IIASCSFALLGSSDPPTSAS * VKGI 


12774 


26675 


A 


12903 


406 


949 


APVPGHGFCVSFy CGENGVSLYCPGRS* 
TPGLKLCSCL\CLPESWDYR/R*APVPG 
\LGFVFHLTL*KPPFFKDYMKSLFQFLK 
YLIQG*CTLV*GVRYRSSFIFFFFFLRR 
SLA/SVTQAGVQWRDLGSLQSPPPGFTP 
FSRLSLLSSWDYRRPPPCPANFAFVFLV 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, - 
N=Asparagine, P=ProIine, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EMGF\TVLTRMVWIS*PCDPPASASQSA 
GITGVSHRARPKFVLYFKDNGEPIiGGVL 
HRSHIAVHWM 


12775 


26676 


A 


12904 


265 


385 


GNGS PS LRWIGVLRGRS PTLELRH* P I F 
LTAASRRSLDRVS VPMWGTFLSEPLS IE 
GLVSRYLTNNLMERIPILYRN/PFKNET 
M/PESLCYRVLI FLSKGYPRVKGRLDTC 
YSPVRRSPAKKAS FLPDAPRLACVKPVA 
SVHPEPGSNRTRG 


12776 


26677 


A 


12905 


363 


137 


APGMQA* WCM/HCI PGAQEAEAGEWTEP 
RSSRLWCTVIMPMSSHCTPAWATW 


12777 


26678 


A 


12906 


315 


565 


TPVDHGWLECSGAIMAHCSLKFLGSRDP 
PASASRVAGTTGVCHHSQLI FL.FFVGTR 
SNFVAQAGIELLGX *VILLPQSPKVLGIj 


12778 


26679 


A 


12907 


376 


412 


DAWPTWRNL* FGVYVMTDR* EPRQMLTY 
LTA\PLKYFKSGVMAHSCK\PSTLGGQG 
RWIT*GQEFET\LANMEKPCLY 


12779 


26680 


A 


12908 


215 


2 


NIH INGQKIiFL/ WLGWAHTYNI STLGG 
RGGKI S * TQE FETS LANTVRPHLHKKQK 
IALPLPLPLPLPTVR 


12780 


26681 


A 


12909 


307 


379 


KR* KQPQ YPLTEEWINKMRHV \ LQYYS I 
LKKKEIL 


12781 


26682 


A 


12910 


279 


123 


KMILS WARWLMS 1 1 PAAWEAEAGGSVEIi 
RS* \LRLE*AMTVPLRHFTPAWARE 


12782 


26683 


A 


12911 


1 


194 


AKQHI PLI^LECI /GLFTNTVTAHCSL 
ELIiASSHPPASAS*VARMTECVHRAWIiE 
WIFYLCVIL 


12783 


26684 


A 


12912 


206 


388 


IELSKQGPVYYKWYVRITPPYFPFFFFF 
ETGS CSVT\RLECS /GTITAHCSLNLPD 
LSYSPTAAS*VAW 


12784 


26685 


A 


12913 


238 


426 


TPLLPSQQWLKKESLWLGRWHACNPST 
LKG*GGRDS\RGQ* FETSLGNIARPCLI 
REKFLKI 


12785 


26686 


A 


12914 


115 


1436 


AKDRHFTEV* IANKQMKRCSTSLGMREM 
QS KIVRYHLSE * /R* LFFFNGGNCWQG/ 
CKK/ IQDLIHCWECKMVQPLWNI I W\RF 
L * NHTCI CHASG I AL.LSIYPREMKTCVH 
TKTCIRMFIAALFVMVETWRQPIYPSLG 
G 


12786 


26687 


A" 


12915 


282 


21 


IKFPPCPSTWGTKKKLFPKKKKEVMKML 
CFHFNLLVQN IF* YRLGNE I SDKAG \ P V 
AHAWNPSNLGS*GELIT*GQEFDTSLAT 
IVKL 


12787 


26688 


A 


12916 


254 


150 


TIiPGRSSDFSLPEV/YLKKNNH/WLGW 
AHS CNP S TLGGKGGW I T* GP PHMLKVS F 
LPTDPTSRVKQLYNWPHSTVLIALFPT 
KNPG 


12788 


26689 


A 


12917 


181 


24 


AQWCDG I PVSKGNL/ WLGMVAHACNLST 
LGGPGRWIT*GQEFETSLANEFHRTD 


12789 


26690 


A 


12918 


75 


298 


LFILFTS*VD*MRPTHIKENNL\IAQST 
NPNVHL I Q / KH PHRHS QNR WI PNVWAPH 
G\QSSCHIKMTIMARPSGSQL 


12790 


26691 


A 


12919 


251 


9 


KFMFLARCGGSRL* SQLSGRPRRADHI.R 
PGVQNQ PGQHSES QYLPKSTNFCRVWWC 
TPWPATREAEA\GESTEPGRQRLQ 


12791 


26692 


A 


12920 


198 


211 


RFCFS VGDEREGKRA/ RERNRETERKKQ 
TEDTETQREERDRDRKR\ERETDTERER 
PREERHTEREKK* KAAFPFQTLGCRAQD 
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SEQ1D 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQDD 
NO: in 
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09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

locntion 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H = Histidine, I— Iso leucine 
K=Lysine, L=Leucine, M=Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FPKQLFPSRGWGVGPMTVLL 


12792 


26693 


A 


12921 


438 


127 


RWMERIFVRPHIKDSMWWVRSTLLGRG ' 
DALNDLFLYCWEKEDPRYKERP\RERER 
ERERERERDRCWEK*APGWRERPVQSLR 
ACHPPWHRPALAGPVPSAQ 


12793 


26694 


A 


12922 


158 


460 


EARVELLLSSRCKPRAVITAETQPAVYR 
LQLEIENFPGLSPRLECSGAITAHCSLD 
L.UG* SDPPNSASR\AGTTGMCHAQIVNS 
CKADWLVLLSSRGLTHR 


12794 


26695 


A 








GIQAACIPGLRLSPPADAHCHRHIPS PP 
PSSTAPRDDVGPMQITPDNFPPQDPPL\ 
TPPSQSLLPHRAP*PQGSSVDLPGLLFK 
LPQMPQTPGPEEVDTCVQIHKTSKARED 
VQMRKTPMAREGRHRAAHRWIGALGKAP 
QSQRRLRHTVG 


12795 


26696 


A 


12924 


335 


332 


RI KVYVRGHAKE * LRPG P / VAHTCNCHT 
LGGQGMRITQAQEFETSLGDIGRPHLYK 
KKKKKGGPF 


12796 


26697 


A 


12925 


302 




AQWLMAIIPAPWEAEAGGLLEPRS* \AW 
VTW 


12797 


26698 


A 


12926 


766 


244 


RVP RL P P PAL * LE FS PSGLRIARSHS FQ 
LRSSPPPVSPPST/PFSPHRSPPLSLLS 
LSVPS /PPLHHSFFSQRAPHILPPPPLI 
FPSSPPPAPPPPPPSSSSPSSSSSPPPP 
/PPINWGTPRFFPPPPFKNPPPEINFGG 
PKKKKIFSPPRA*KFVFLKGPPPFFFFF 


12798 


26699 


A 


12927 


127 


720 


WCGLLESTELQLQSLPGLYVSVIFGKIF 
YISGLG I CCTKI FPS ALPS AQPAASLPL 
PARSALGIVFFLHFC* IE/CNYKKLPIF 
HHI*LIKTFSYGLT*LFFFFFKTKSLIF 
SPRLNCRGPFLVN*NLCLRG*GNSPA*P 
FGGAG I EGAPHKAGLI FGGGGGAFLKKT 
G\LHHVAPGGASNSGTGNISHPPLNPPK 
GAGE 


12799 


26700 


A 


12928 






KEHW*AVGHFNICVTGGP*GERKKGTER 
VFKEI ITPNFPNLIQTIKPKTQETQHTP 
SRRNTKI /TPRYI I IKFLKTSNKEKILT 
TAR 


12800 


26701 


A 


12929 


375 


2 


GCMLSLKTINILEVLANVLAQKKIKRRK 
LND \ BQADTTFTI * FCVRQKTKKS TDKL 
VICL I / ERF YKVAT YM INFLKKSNS FHTP 
ATINC*KQIPFIMATKTIKYLRINPRKK 
CVRFVGKKLTFPER 


12801 


26702 


A 


12930 


202 


379 


SSENFSGKTVNLCCLGRLHRMISWLHTV 
AHACNPIT\GWIT*GQEFETSLADMVKP 
CLY 


12802 


26703 


A 


12931 


209 


376 


TVGEKMYLKVKRI \ WLGWAHASNPNTL 
GGQGGWIT* PQEFETSLGNMAKPHLYKK 


12803 


26704 


A 


12932 


280 


31 


I S CKVTKKVTLFELKYI WNKFY I YERVS 
IVGTTEDAACEVTSNVITGPGAVAHACN 
PST/RGRWIT*GRELKTSPANIAKPRLY 


12804 


26705 


A 


12933 


1 


189 


QMYTESE P * CKLWTLGDYNV* M * F IS * T~ 
IQCTTQAGNVNIRE/RLCMCSGGEYMRN 
LCAFLS ILL 


12805 


26706 


A 


12934 


108 


409 


HHSHLLKLPNVWFSFCHFIVTVFLSL*R 
VFFVFY* I FLSSSHFLFSFCFCPIMFIM 
FPFKFILSCNIVFI*NIIFFLF/VHV*G 
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nucle- 
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location 
correspon 
ding to 
last amino 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \-=possible 
nucleotide insertion 














* STLVSLFVFFFFLLQF 


12806 


26707 


A 


12935 


225 


11 


ILLCMEPSFLFWFGFICFVKTGSHYIAQ 
VEVLWLFSGMIKVHCSL/KLLHSSNPPT 
SAS*VAGNTGSSYCTW 


12807 


26708 


A 


12936 


74 


12 


GTTHASAGVINI LVFI LSVFFLTHSLCI 
HISKYMFVYICVCI CTRTDFYTEFKSFC 
TFCFFGSSFFFLEKKFPLVAGLEEQGKD 
LG*LKP PPPRLKKFS CLTLPG/AWE* RT 
NG 


12808 


26709 


A 


12937 


166 


311 


PHS*WECQMVRAPALENWQFLTKFIRGL 
PL\DAALPLHM/CHRKSCTQLFI I ALL I 
VAKKCNHHKNL *ANE WINKM * YIHTTDY 
PLAIKQNE ILIHATTWMNWTL 


12809 


26710 


A 


12938 


51 


296 


LCMYTLHTLiYMHTCIYAYICYTCKYIHR 
Y/LHT* IPEDANSVICHKYTHTHIYVHTL 
VYPHTTYLMCSYIYSWVCVCVCVYIYT 


12810 


26711 


A 


12939 


426 


28 


KGFNLVNPRVFPS PRF * TPAPVFYFCGP 
I KKI FFFKAGGVKFDS LKRAPLFF FFLK 
TGSCFVP \ RVECNGI I TAHCNLCLPRSN 
DPPPSDLQVTGVKPPP\AKLFFFFLW*M 
GFHHILDYQGETCLSQLLGWR 


12811 


26712 


A 


12940 


11 


213 


ATAPGLFS FFETGFCS VDQAGVQ *HDHG 
SLQP*PP\GSSDPPALASQSARITGVSH 
CKGPEHTSLLT 


12812 


26713 


A 


12941 


416 


38 


WLNNHSRLGFPNCWDYICKLPRLGPDVA 
SLLIHNSS \GTWYGATKLESNWYFSI P I 
YKDHQK* FAFTWQGQQ YTFTVLPQDYVS 
SAL * HNTVHRDLDHLDNPQN I TLVHYS D 
DTMLLDLMSRKYQAL 


12813 


26714 


A 


12942 


361 


74 


I IHPS FSVSSRNIHKNAYPAALGGYSVY 
GVAILLFHYFINKLAFTLHCGLA/LNS F 
LR* IQEPPLG\SGSGPFSCNVFIAPTEG 
TIVQKPDPMATFG 


12814 


26715 


A 


12943 


125 


401 


SSFEPRGI PLNPEVAMQKESVNILCSPR 
SQEFLESRI KI KVLTDLTQDEL * GQAQ * 
LTPI I PS \ I WEAKAGGFLEPRSLRLRSC 


12815 


26716 


A 


12944 


92 


306 


krttnspwgcsay/gvaillfl* il/kl 
laftllhglpsnsflrkiqepslg\sgs 

GP I FGNKLVAFGGLM I WS 


12816 


26717 


A 


12945 


322 


24 


NFKKN I NLNPFLRGY FL FLKGGFGLTFK 
KVGEPLKKKKKKLNQQFSYFLSTYTVK* 
NN* / WLGAVART YNP S TLGGRGG W I T*G 
QEFGTSLANMVKKILAK 


12817 


26718 


A 


12946 


2 


448 


GGAPMKRHGSLDRNYRVDTTRPSQLFLF 
QLYHNYLKFAYEATKEYMHRKETLCLPC 
PAIQFS-FPKSNLCYRGYLCMRTCAHTQM 
* MYRY I YTYVYMHI Y I CM/YRAI CTYYT 
FE* KYILQKCILLCI FLFFLQYLKYVS Y 
LNRHHFLIAA 


12818 


26719 


A 


12947 


155 


415 


LLETGGNAGCSSWMHVLLGKIHRYGMGS 
RPISLMAG* \GPPWLRRVAGLRGRPATL 
GLRYGPDS YGRQQW \ E YCTMGASLMQRR 
RVRD 


12819 


26720 


A 


12948 


247 


250 


KYVYKYIHIVYKCVYI FTHVYGYTYLYT 
DVYKYTYLYTDV* I CICIYKCI YLYWYI 
YIHLCINMYIYIHLCINMYIHIHV* IYI 
HIYI\HVYIFIYIFLYTVYK*IHIYEVC 
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EYIYTLWKL 


12820 


26721 


A 


12949 


25 


407 


EALSLLLLLVLWGL.LLGIARKMYQQMNL 
VQYCLLVL* P*NL*EQKRFEREVGLDHR 
EP*FHVLGCIHSEKSFRFWSW*EL*IRK 
LTLVA\ GGVAHTYNPNTLGHRGRRIT *G 
QEFETSLGNWRPLFL 


12821 


26722 


A 


12950 


370 


106 


ILYLETLLKSFIS IRSFSAVDSHFEKSS 
TVGKMLPNS ITCTDP I \FNKKKSQLMWQ 
SSLLPFSQKLPQSPQPVSQQPSPSRQGS 
I PATR * QLA*GS VQVMLFGS I LPTVELF 
S KWESTAEKLLILINDFSKVSRYKI YV 


12822 


26723 


A 


12951 


280 


319 


DQPGQHGKTLiF*VSH*KKKKGWM/' PGVY 
AYNPRFLGGRGGWIT*GQGFETSIiANMG 
KPCFY 


12823 


26724 


A 


12952 


337 


113 


IQLAISLLGICPRDLNTVTCS \sdtctq 

mflsvllmvakr* kqpkylqidewinkm 

QCIHMESYLVSKGRKYLCKL 


12824 


26725 


A 


12953 


9 


186 


dptvcclqdthftykdtnklkvmgwtky 
/ changnqkragwiqieyp * sksiikse 

IVQN 


12825 


26726 


A 


12954 


46 


381 


nentysykkntqmfiaalfwtpnwkqp 
tcps/sgewinkll\yhlmvyysavktn 

/ I LVYVATS VNLRI I ILGKQPDKKRVLT 
V* FHL YRKYKLI S S DRKQ IHGRLE VENK 
GK 


12826 


26727 


A 


12955 


19 


250 


CSDMVLKAC I KKLMS YSKMGH I QAMDYY 
TAL/NKEL/L.MYAT I * ANL.ED I ML>S KIR 
QTQKDNCLMVDCIYIRYAEQSSL 


12827 


26728 


A 


12956 


173 


163 


GAKHSAS GTGWIRKD FTRPGAVAHACNP 
STLGG*GGWLSRSEFETSLTNMVKL/CL 
YKSICCTRCLLGCFPQALSKAID 


12828 


26729 


A 


12957 


301 


79 


EKDNQPLLTPRNCNVLHAIKESEKHRIT 
ELKVTLRPDAVAHTCNPSTIiGGRSG/WI 
A*GQE FE IS LAKKVKPCLY 


12829 


26730 


A 


12958 


314 


126 


KHHHFKKHNFRVLCVCVCVCVCVCYKHF " 
YLFSLS FSQ * P/C I CC* CVCCSFLLCCF 
*SATYLKII 


12830 


26731 


A 


12959 


529 


1697 


VPFMGH I SNFFFS P FLRQKYLAIiLPRLG . 

VQWQ*SRAHCNLRFPG\SSNSHASASRV 

AGITGACHHGWLLFVFLVETWFHHVG\Q 

AGLKTP* PQV\SACFSLPKCCDRRCEPL 

CPATSSILNIi*TMDRIj/PGEPKNQ*NIjL 

VSFMCAVFYREGQQHVYFQRSLWCKREF 

I VRSPWAINIjEKGGIiRMGAV* IRIMLNLi 

TLT*GFCTEDEKTVNSRVNQKLTPT*LG 

DE *DVKREKNINDE I KLLTCNSE YTKSY 

RLGR* IKLGAFCFFMWPMGQTGRWGQL 

KVQV*TGTVAHVCNPSTLGG*DRRITRV 

QEFETNIiGDHSESSSLQKIKLARHGAAR 

L* SLSL* VS WDYRCS PPWP \SNFC I FK\ 

RDG VS P YWP KA/ WS * TPDLQRS AHFS L P 

KVLG 


12831 


26732 


A 


12960 


1 


395 


GTREFRILSEKFKKEIDFFFFFKKQSSP 
PPGPKKQTL/ C*KQTTPTKPQKTNLGPK 
KFGGQPKKPFKI SHPQ IGPPKKKNYNP * 
GQPVWNPPGEKKEGPTPPKFKDPGANLK 
VMGLKGGVKRKRGGKTFPQRA 


12832 


26733 


A 


12961 


90 


436 


LWRLTSPKSAGWAGNLETQEESMLQFKS 
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SEQ ID 
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nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 
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SEQ ID 
NO: in 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylnlanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K= Lysine, LHLeucine, M-Methionine, 
N=Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














EGHR/ PAESPLA* S S VLFSLS TDWMRP I 
LI /MEGDLLYS KP IHFNVKIS SKNTY RN 
IWNNVWPHV*TL*PSQSD/CLK*TITPI 
VLLTCTH 


12833 


26734 


A 


12962 


415 


463 


LRAKDQ\TLPKILVC*TLLFFCAQIKKKK 
K/ WPGAVAHAYNPSTIjEGQGGW I t * gqe 
FKTSLGNMVKPSGIPPHWI 


12834 


26735 


A 


12963 


774 


316 


S I SPTCSGKIGGKLNRHFS KEDKTG \RY 
MNKCFISLVISECYIjKP*EDTH*TP\IiR 
MAKIKRADH/DKC*QRYKMTGTLIQC*R 
ERKMVQPLWKTV* QFLKRLNI HLPDDS I 
PLLG\ IY*RKMKACVHTNTI IWMPIASL 
LVKAK\AKKQPKCPS TR 


12835 


26736 


A 


12964 


44 


395 


MYFTLVWGS LG P KFXjGGHQPIiQGQG P FL 
HFLKAL* * FPFPQTPGNGGGTPPAKTFP 
PFPTPLSSSS PCPSPAEGGSREPSLSTP 
/SCLHLYIGGAASNPSPPSPLPPPQCSG 
LGYPVC 


12836 


26737 


A 


12965 


424 


50 


MAFPFLSCAPDAINNNNKKCCREINNEE 
PPNEPliKWRHIMIFFFEAESHSFAR\LE 
CSDAISAHCKLCliPGSRHS PAS AYRCP P 
RHIiANFSAFLVQTGPPRVSQDGKDLQTS 
*SALLM?PLDPAIP 


12837 


26738 


A 


12966 


245 


44 


QSKELGNYPLPSPSPSSPSPAFHGLPPP 
SSPSPAFHGLPLLP\SWTVQPQ*RLTAT 
SLPDPPASPRPG 


12838 


26739 


A 


12967 


205 


30 


QEGASLSRIKRGPGVGAWLGMVAHACNP 
STLGGQG /SW I T* AQE FETSLGNPHLYQ 
KYKN 


12839 


26740 


A 


12968 


114 


465 


DPVSNPETTKPEPPKKTTES * P/EPKSL 
LWLPS\P *NSS * CANREQQPPS PQPHLP 
HPLPTSLQVHRLP* * SLPPPNLFTPLYD 
/SPEPSLL.PVIQFPA*TSPP*PPVYSSP 
PSPTCPLH 


12840 


26741 


A 


12969 


1618 


1038 


VHMVQDKDINLNNQFLSGTMLLFFKETS 
HR*DRGPGDF*SCLYSAPIPTQPLLF*Y 
LPQNFIEG*PS*NSRIKGTSLKTYISRV 
PSFFFSFIi*DGVS\TVPQAGVQWHDLGS 
LQPPP PGFKRFS CLS LLSS WDYRHPPIiC 
PANFLDF * * RRGFTMLARLVSNS * PCDL 
PASASQIAGITRMSHHTLPHVFPL.NV 


12841 


26742 


A 


12970 


197 


411 


EQALRSTVCYCLNPLPPAPLRPRQDC\G 
PCRFRWMPGGRTRWLAPVI PTL * EAEAQ 
VWLEARSLRQAWATW 


12842 


26743 


A 


12971 


616 


777 


MGIHGVRHNAQLNFVFFVEDGGEWQLTC 
TGAGDS S WVLGEP * VFAC/PKNVIiFYLH 
P*LFG* I * I YQLK I TF *NYEGM VP / FVF 
RFLFETRSLLSPQLECSGTITTHCSLEL 
VGSINSSI SAS * LDGNT\GVRHNAQLNF 
VFFVEDGGEWQLTCTGAGDSSVJVLGEPIi 
RYYKERLIKWYL 


12843 


26744 


A 


12972 


222 


389 


VITEALGSDELEGRQLWW* IKIHSQVQW 
LIPVI PAb/LEAKAGGLLEAKTSRPAWE 
T 


12844 


26745 


A 


12973 


425 


145 


Q I IFFLFQKIKNQFNKASFFKKMTLEQM 
TSRQKNSAQISVFKKKNSKWITDLNVTH 
* TIQLTGKN\N IGKNLQDQGLGKEILNL 
TLKAQSTKG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino ncid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12845 


26746 


A 


12974 


322 


411 


NNKKIFMSTS*KFFYCFIVQLIFETILS 
VRVLES I I LINYL * YCI I CI FPFLFL* T 
*EIRKVTKFTSFPQIEEK*KCRR/WLGT 
VAHTYNPSTLGG*GGRIA*AHEFETSLG 
NTGRLPSVQKIL 


12846 


26747 


A 


12975 


428 


1 


SRKSSSQFRLSFFPSHFLDLGAKGAVSQ 
DRAIALQPPGPRSR*S*RPPSRPRCFRR 
PRPSRSRSNKERAGEKRRPRRSLQPRPP 
HIAGP /VPRRQGS PSAGPEAPIATDEPP 
PQRRSGIRGGSANIRSKQTSSLRPRVRG 
RVG 


12847 


26748 


A 


12976 


60 


294 


traCTETVLFQ+LTCVCVCVCVCVCVCVT 
ERI /CFFKFLIGKGTYGCIYIiWPKTFIIi 
IFIYILYSPRGRSILKKKFFFLT 


12848 


26749 


A 


12977 


6 


341 


DSLTLLPGLECSGAISAHCSLCLSG/FK 
QFS CFSLPSSWGFTG/VP PRMPQLI FPY 
FSVKTGFH\HVG*AGLKLLI PPASASPK 
CWDYRRE PPRPGDLWNF* PTGFKLRFPQ 
FPL 


12849 


26750 


A 


12978 


296 


42 


SQPASLGFKSVDLPLIiAVCPLGKITLTL 
WDFSFIIYIKQE*WPRAGIFMPY\NPST 
LGGRGGRIS*AQEFKTSLGNIVKPHYLV 
K 


12850 


26751 


A 


12979 


20 


373 


KLYGGIWGFFFFRRGLNSATQAEMQGHN 
LG* LKS P PPG\ * RNFPAS ASGE PGITGS 
HHKAGIVFYIKKKPGLKTFTLGKPTPPP 
SFRHWHQPAPMGVYSEILGEKSLTNGWD 
PQTKKG 


12851 


26752 


A 


12980 


380 


96 


PVFPPPWRAQKGGSIGREIRPPRGNQGK 
PGPP*KKKKNYPGGGGGPPSPPFPEG\* 
PKKWHNPGGGTPP*TEFPPWPSTRGGEK 
KPPSKKKKKEK 


12852 


26753 


A 


12981 


331 


71 


ELLI YSWVWWCT P WPATW * GRELVGRI 
T\KSRRLRIjQ * AKI VPLHSSLGDRVRPC 
PP P PTKKKLANIi CPWTEGSLS FS YLLEQ 
FHSC 


12853 


26754 


A 


12982 


383 


189 


RRALKSCAHSTPAGPKNAAIiSQAQWIjTP 
VI PTL * EAEAGGS LELRS PRLL * AMI EP 
\CTPAWVTE 


12854 


26755 


A 


12983 


144 


6 


ILAQ* KTWIjSWAHTL*S / TLGGQDGRI 
TGAQE FKTSLGNTRRPCLYK 


12855 


26756 


A 


12984 


341 


509 


ETLYTME * YTAI EKNEIMS FAGTWLjELE 
A\VIL\SNLV*EQKTKMLPCSPLMGWEL 
K 


12856 


26757 


A 


12985 


387 


125 


DLPPTPPKLEPRTCSVAQVEVWCL/GSL 
QPQPPGP \ SDPPA* AS * IAGTTGTRHHA 
*EMFPF*VYFVQTSEHILGINPPGRHTK 
VAWEV 


12857 


26758 


A 


12986 


151 


350 


GRGGF*PLP*IFPPGQERPGFTPPKAPP 
PLGFFFPG/RFFFFLRRSLALSPRKWRD 
LGPLQPPPPPPTPGY 


12858 


26759 


A 


12987 


251 


1 


RMS LL E KRRSNN * EAR V / CGR V I YVATE 
VIKIMIGISMVAHAYNPSTLGGQGRRIA 
SAQEFETSLGNMTNPCLYKKYTHKKSRA 


12859 


26760 


A 


12988 


320 


3 


TPWGVKDFSPPPPPGEGVFPGRAPPPR* 
KFFGNFWKKRGFPNWVRGVLRFWG\KNF 
FPPNLPKSLGLRFKPPGPP * KAPPFFPL 
FFFFFEMEFRLCCPGWSAVAQS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylaianine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L= Leucine, MHYlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12860 


26761 


A 


12989 


353 


183 


EYGTHILTAAIFSIAKRWAHR*GSLTDE 
WVRKNS S IjQ S ME Y Y \ S A I KKNE VL, L HAA. 
P 


12861 


26762 


A 


12990 


377 


2 


LSIILINFPRFSLQRGKYPGPPPPKKNP 
PQKTPPRAPKKKKPFFFFFFFSFFKKNF 
PFFFKGQSKGGFFGPP*PLIFRFK\NPP 
PPPPQKIGFKAWVFSTPPPPIWGFFFFF 
FLRWSLALLPRLEC 


12862 


26763 


A 


12991 


134 


298 


PPNSYCDGSITLIPKSDKNITRKEKYL/ 
P/IYLMTIDAKVFNKVLPNRIQQ* IKRI 
I 


12863 


26764 


A 


12992 


892 


171 


QT * SH / S LSRIiEYSDT I IAHCSLKLMGS 
SD P PAS AS Q VAKPTGHCTTMPQPHTEG F 


12864 


26765 


A 


12993 


269 


2 


GIjWCFVKQIiKIjTETCPHFAVPQFPLGIS " 
KIPIS ILFFKKLLL/ WPGTVAHTCKTPST 
LGGRGGWTT*GQEFGLMFHHWPSKFHHG 
QHGET 


12865 


26766 


A 


12994 


324 


4 


SLYPLLFQAEIYFLVTIHRSSMESCIYF 
FSVWPLDSLVQIHVKI /HIKGIGML,*V 
N*QNCFKINISE*L*YIWPSTVADTLGC 
QGEITRAQGFKTS LSNMTKPHLY 


12866 


26767 


A 


12995 


106 


531 


WLLNLFAFLNDKGFLVPMNRSRI FSQQK 
VKQRS FEGCSS FNDVGS C YVTQAGEQWL 
FTGVIGTL*PGTPG\SSDPPASASQAAG 
TTGS*HHIQLNIjILYSSG*QTI *LL*TT 
DSTHL I FAS L.YAKEQL ITFYFQATSLQP 
CT 


12867 


26768 


A 


12996 


1 


367 


NTEPFLNIFFKDKVS / CSIAQAGGQWRL 
YSRCSLKLLGSSNSPTSASLVAGSKGRH 
HYTRL * LFFYVSRPQD YLPWS P VFFFS F 
FLSFFFWKGSFFLPPSWKARAPIWVNGS 
LPPRVKPIFLA 


12868 


26769 


A 


12997 


3 


379 


YRPSPSETKAVLVFFSFFSFFFLGTPGG 
KGPPKGAPNPTLTWRGKENFFI *KKKKP 
/RI>I*KKGPPPKMGFKRERFWVLKPKGP 
QFGKGEKG/ SPNYPHPRGDPKAPS1.WKK 
PQFPGGGGGGAPKKAS 


12869 


26770 


A 


12998 


230 


409 


SCRPKKLIiAFIiKSSTEISQRWKQPIYI/ 
PSTDE * INKMWHIHTMEYYSALKRTKVL 
IHATT 


12870 


26771 


A 


12999 


310 


343 


DPVCTI I * I ICDI INL* IECSKVWSLKF 
YKILILFNVCVWPSVGNCHKWLCDCW* 
KQ W / LWPGMVAHTCNP ST VRGGGGR I V* 
GQEFETSLGNI VRPSLYNNKSC I 


12871 


26772 


A 


13000 


390 


289 


IKLCQPRGVICIiDSFKRAPPFFFFFETRS ' 
CFVS+AGVQWCN/LGSLQPQTSGPNNPA 
TSA* LSLIFMCSNSYVCIVK 


12872 


26773 


A 


13001 


1 


289 


G I LRQ I SVNYLP VGRS XiAQAGVQRCDLG 
SL * PLPPKVKQFSCVAGTTRGCHHAWLI 
F I FS REG \GHP VKGAS S DS PT FGFS KAG 
ITRLSFRALACT 


12873 


26774 


A 


13002 


359 


105 


KQKKNPSPPPGGGGGQGKPG/PPKPFPN 
SRDFPPKNQS P FFP KLKKKFFNFY * KAK 
NP*KPNPPLKKGKKPKGFFSPPPPLKKP 
PNV 


12874 


26775 


A 


13003 


243 


293 


MNSQFLKRIARIESPYGSVISLLGTYST 
EK * KACPCKTCTQMFTSALYI IAKKW \N 
KCPSTVRW/IKIiWHINTMEFFCQ*KKFH 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, JL= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CIDMPQFVHLTVEGHLSPLFCYYI 


12875 


26776 


A 


13004 


465 


307 


SEACFI LKLFMFCG* CCLFVC \ SILLVF 
YFVFCFVLFCFAYYMQFFNQCLFG 


12876 


26777 


A 


13005 


207 


3 


DGSLTMLPKL.EC/ RWLFTGVI MAHCHLE 
/RLGSNDPPASASRVAGTTGVCHHA* LI 
FI FFVKMRSHYPTRP 


12877 


26778 


A 


13006 


2 


56 


DLALLFRL* CNGM I TAHCNLK/RPG* LL 
GSRDPPTMVS * STG I TGMSHEAHRPS PS 
CDF * RSCS WQAWQWHD YS S L 


12878 


26779 


A 


13007 






FGASLLKRHFLFFLS KPLVYNKKI PRPL 
VFLKKT\ PPPPGRATPFYYCCFFHKEGP 
GFGVKKEG*PKG/ SPPPPQRGSTPPFFL 
GAFKKPKVTTPFFKKNPPKTLKGPPFF 


12879 


26780 


A 


13008 


356 


145 


RPPP FFFFFETVSHPVAHAGVQWHDHS S 
LQL* TP\ S SREP PASAS * VAGTTGVH YL 
PGSQLP I LEHS PRR 


12880 


26781 


A 


13009 


227 


353 


GWSELT+ /SN*AQWLTPVILAL*EAEAG 
GSSEPRSLRSAWATW 


12881 


26782 


A 


13010 


249 


32 


NPDSHSWAWWCASWPGTQKARVGGSLE 
SRSLRLR* AMIVP \ CTPAWVAQEPLSLK 
KPPTIETLFPSPQDPSQ 


12882 


26783 


A 


13011 


247 


403 


TPGGGGKYFFFWGPQKKNRGGGFKNGGG 
GKTRGPP*KQPASSSPFAPPPFFWGAPG 
FSPPPRF\KPPPPVFFLGPPKKKIFPPP 
PGGLFFFFLGGPPSPFFFFFFFLGGGPL 
FFPGLGAMGPMAGFRSLAPPGNSLSKKK 
KSEGLGEGGNS VLTRVLLI SS YQ I PGNP 
R 


12883 


26784 


A 


13012 


206 


386 


TREAQTFWFPLYS *KAL/WLGTVAHT*N 
PSTLGGQGRRITSSQESETSLANMVKPH 
LLLEI 


12884 


26785 


A 


13013 


354 


31 


YKMNI WD I IH I * HT \ Y I P WP KGKERKR 
KKLEEYL/QDIMTKNFLNLMKNMNIYFQ 
EVQQTPSRINSQTSTPRHI I IKLSKYKT 
GREQWILSVMLALWEAEWGPPEVRS 


1288S 


26786 


A 


13014 


1 


419 


EETSFGKARCLPGYPGFHPRRLRSRSPL 
LS*MPLSHPQGPSQPPAGSMSSSPPATT 
S TCSCS PPAAGCTRGRTTGRPFWTASGP 
AELDWASGSSV/EALTSGLPARGLWGQY 
AGLGVPPRLLGPMTS CCMSLP FPS PARP 


12886 


26787 


A 


13015 


109 


426 


TSLGKIARPH*QTHMATIRSVPPPSWKD 
ST*LRLHPPPPHMCF/PD*SLKSN*Y*H 
YLVNP \ I WPMLTLLFVCLFVCFRDRVSL 
CCPGWSAWPSRLTAAS TLWVQVILPP \ 
RLAN* IFFLRWGLAMLPRLV*NSWAGDI 
LPP*PPKVLGLQA 


12887 


26788 


A 


13016 


149 


422 


LKRIFFLKWFTHDTAPGDPDNMCPYSC 
F*NQI I KS GLFF FKKPHL/ WLGEGAYP C 
NPTLLGGQGGRIT* FQEFETTPANMAKP 
LLYKKYKN 


12888 


26789 


A 


13018 


162 


377 


QSPGYPERVSKLPTITQRGMWMHQPSFV 
CTSL*NV* TGWAQWLTPVI PAF * ED\ AL 
AGGLLEVKNLRPAWST 


12889 


26790 


A 


13019 


205 


427 


IQrGGPPARPFLWPWAPGKK* RVPPEGG 
P P KKKKKT P PPHPGGEREGS / P FS KKKK 
KKTPKKRAGVFFPGGPRPKEGPGWGPPR 
LDSKPVGAPPEPGRLGNGPGKIFFGPGP 
FFPLQPQKGRP* PGGEGGSFFFFLGVPP 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, ^possible 
nucleotide insertion 














LGEPS I FCLDLPENA 


12890 


26791 


A 


13020 


248 


358 


SRCGGEAQWLLPV/ I / PAF* EAEAGGSF 
EPKSLRPAWAT 


12891 


26792 


A 


13021 


2 


434 


ITEAGSDTQE IHHTLYTHLCVYWRFFTK 
KAQVKKKVWWHM* PQLLGRLEMGGSLEP 
RSLRLQGAMTLPVTSTTXQAWAT 


12892 


26793 


A 


13022 


162 


410 


LTQKTAASQI IL.SFSSTSLEKIPISSES 
GGGGSVSVTQAAEQWRYHSLLQP*TPGD 
KRS SHLGI PHP \ S S WD YR I SKDWVMPV 


12893 


26794 


A 


13023 


250 


40 


L I KLN I FSCS XiAI WTS S CMTS LF KYFAH 
FS FF * I DFYS S YYILDTNPFFRN/GITN 
ILSSSVTCSFILFW 


12894 


26795 


A 


13025 


264 


1 


VSSTKHGKLRLEPEY*CFIHH* EEKVFE 
SYFEY/ PCQTFJjRKETVRPGAVAYACNP 
STLGGQCRWI I +EQEFETSLTCIAAALR 
VTLR 


12895 


26796 


A 


13026 


110 


523 


CIDSSSWTQTDRCKEDRLCPGTTGTCAP 
GLLFLSAPPLGPF* PSQAHPSFHVSIiSG 
PSPGLRPQLDIPLHPSTSIAPRPLPGPA 
S\LPRLEPHPSI.PSPTGACPFIiPPCLGE 
QRHW PLPTSLiL WIjGLDGHCIiAT PS VS 


12896 


26797 


A 


13027 


52 


53 


CPTLLQLTLC/LPPQASAMVDAPP+ARP 
IjPPSS IIiACCTSGEQGSWVGLAEPVKG 
YNXiLVCRLLRPLEKCXSIWVAVSRFSLYS 
LSWLPLARKG\NPLTACASWVR 


12897 


26798 


A 


13028 


262 


381 


HFGRLRQVDRLSPGVRDSLCNKVKP * QI 
IYTF* FSVSLS VINTYLKGPS TVAHTYN 
PSTSGG*GKWIA*AQEFE/TSLCNKVKP 
CLY 


12898 


26799 


A 


13030 


1 


846 


FRPIQAGRAPWALTLPCMGFTEGSPALL 
SHSSRCDIQMGCNTHAVDDSTGEGPFHK 
GRMRESCWLRTPRCGLYTKGVGGPGMEG 
RERRSRSRRQHMPGRGHSTMASWSLRQC 
MRLKLQWASDLLSQLEMQIPSLPPGESG 
SLGLTGPRSLH/ CEQTIGLSPTPGESWS 
EELG\GGIVGGGRGTVAGRGGHREAGNK 
PSGTWGRQCCASGDKAMS I * LTSLY\AR 
RGEPGPGRPVRKWLRESSQNRIGGGEGR 
WE*GQRRSREGRECPVGWGGRKKPLRWG 
LMWS 


12899 


26800 


A 


13032 


308 


1 


KHYHDFWCWI IIiAl>SDAQSLDHTCV\CV 
CVCVCVCVCVXiRDQGSHSCL* CEG*LVP 
HFVPGPVLCLLS P VI YVC F I CS LTCKVE 
SLSNFFKKLiNKALLRHCI 


12900 


26801 


A 


13033 


763 


244 


FLiGPRI IGLRHE ISVETQDHKSAVRGNN 
THDNYENVEAGPPKAKGKTDKELYENTG 
QSNFEEHIYGNETSSDYYNFQKPRPSEV 
PQDEDIYILPDSY*L/CQNIDFCYWMIN 
IHCKFS TAKTRNQTKC * ST VDW I KKMWY 
TYTI EYYAAVKK/DTKLTWEQKIKYHI F 


12901 


26802 


A 


13034 


326 


58 


EHYKCNYCQKIFQKL*/HL*SKKHVYAE 
DKYYKYKDGWPLLVSQI ILYTFYTRGK 
P*SSCSNFVQNQIIYIGQKSCKCNKFGN 
TFFKKP 


12902 


26803 


A 


13035 


199 


3 


RFI YNHKVCITTFLPHFYF* KLYL/WLG 
TVAHAWNTSTLGG*GGRIT*GQELETGL 
DNTAR PHLHTV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12903 


26804 


A 


13036 


94 


448 


GSCATSMAFKDTRKTAVEPGVWHGIRI 
KPLE KVCADL I TGKKFKNL / KVKG P VCM 
PTKTLKITTRKTPCGESSKA\WDGFQMR 
IQK*LTNLHSPSKIVKQIISISGAPGID 
SEVTNADG 


12904 


26805 


A 


13038 


964 


1717 


HKLSCVKDLKDFMRQAGE\VTFA\DAHR 
PK\LNEGWEFASYGDLKNAIEKLSGK\ 
EINGKNINLIKGSK\RPSRSRSRSSDPG 
TQKSPLGPRSRSPSPWRKP*QPVQEAR 
SREPGARSQVPVPVK*GPPPCPEKEPQK 
\RGSLQSKILSPPSILWIRPEVSGPRSK 
VPRSVDSGN 


12905 


26806 


A 


13039 


2 


725 


SLLFSPVLPLFLFLPLAYILQNLKPLQF 
SPDLKSKHLIFFCNAA*PQYKLDRSSK* 
PENGTLNFS I LQDLNNS CRK I GKWS EVP 
EVQAFF\SHQSLPSLCAQCNS/CPNLPS 
FPP\PVPSVPTPSVTESF*SSFSTDPSD 
LSLPPQALLLARPS*VPILPQPPLLHPI 
ILLSPPLLTPRPAYSFIP*L/CPPPPAQ 
QFTLKKVAGAKGIVKVNAPFSLSQIR*H 
LGSFSSNIKIQPSSWLVWQQP 


12906 - 


26807 


A 


13040 


1131 


2 


DGVSLLS PRLECSGTISAHCNIjHLLCLN 

ds pas as * vag i tgpatt p pl i f vy f ff 
lvetgfhhvgqsglelptdNsdeppasd 

SQSVRITGVSH/AYLACSQF/CTRC*FP 
1 1 * EQCVNMNINSS K* IFILI SYLYRWM 
NKD P I /VI I ERLLVRMPEGQ S HAAW * NK 
TNKQT* IL.ELVLSVTSCGVSVRFNLS I P 
* VFPSTK*EGWMRCVNPSTLT* FIPQVA 
+TCQEHGLMKIQT*NPGLDPSIVSDHQT 
Q* E PS VXQVLDI SHDKLPMSLEELYHCR 
HTLASGKQISGVPRTI FLNHFHGFYFI Y 
/CI FEMESCS V/SQAG/VQWRDLSLLQA 
RS/ PGSR/RFS CLS/LPVQHPPPGPANF 
\CIFSGDGVSPC*PGWSQSPDLVIR 


12907 


26808 


A 


13041 


97 


626 


I CFTQ VKVLNE S Q VQWCMP WPAT WE AG 
AGGVLEPGS*EL*/CPVNSHCPPAWAT 


12908 


26809 


A 


13042 


2 


431 


HDPDTHSQIKVIKL,MQSEFCWGWAW*I,T " 
PVI PALWEAE /AGGSPEVRS S 


12909 


26810 


A 


13043 


405 


1319 


RYRKKLQNKPLWLH/CLLLLPNSATR* T 
LQK+SK*ISPLESSA**KEPPFI,SVLSV 
TSVKT/SAIPMPPHWENVNTQVPYQLIP 
LHNQTHE YNE VANLFGKTMDRNR IKRIQ 
RIQNLDLWEFFCRKKAQLKKKRGVPQ IN 
EQMLFHGTSSEFVEAICIHNFDWRINGI 
HGAVFGKGXYFARDAAYSSRFCKDDIKH 
GNTFQIHGVSLQQRHLFRTYKSMFLARV 
LIGDYINGDSKYMRPPSKDGSYVNIjYDS 
CVDDTWNPKI F WFDANQI YPEYLI DFH 
*FHFQISWKEALFFFAGRFALQSSSH 


12910 


26811 


A 


13044 


23 


450 


RSRAAAIQLGQRLRLPGPDSPRLTGRQL 
EPGTGRGSRAKRGPCS IFRFRRETPPQR 
BGGEALGATSRQAPPPCPPRQPWELDAS 
AKAAAVAPPKKVGFKAVSRDPRAEALGG 
KGSP\GGKGSLKGVAHR+GGLGPL*GAP 
EAI 


12911 


26812 | A 


13045 


172 


16 


AHVS PE VDKTKFW \ LGPVAH I CNS S TLG 
GRGGQIT/R/GQEFETNLTNMEKPHIiY 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


12912 


268 13 


A 


13046 


181 


9 


YKWGPAMLPRLKCRAI \ I GSNMAHCRIjD 
LPGSSDPPSSASQVAGTTGTCHLGARGG 
MV 


12913 


26814 


A 


13048 


2 


3507 


YVRVSLPPP PPAAGRPGAAVADDAREEE . 

EEAAPPPPPPPPPRLAAARPPGSQPRPP 

AAGEAQAAADMNHQQQQQQQKAGEQQL.S 

EPEDMEMEAGDTDDPPR I TQNPVINGNV 

ALSDGHNTAEEDMEDDTSWRSEATFQFT 

VERFSRIiSESVLSPPCFVRNLPWKIMVM 

PRFYPDRPHQK\SVGFFLQ\CNAESDST 

SWSCHAQAVLKIINYRDDEKSFSRRISH 

LFFHKENDWGFSNFMAWSEVTDPEKGFI 

DDDKVTFE VF VQADAPHGVAWDS KKHTG 

YVGLKNQGATCYMNSLLQTLFFTNQLRK 

AVYMMPTEGDDSSKSVPLALQRVFYELQ 

HSDKPVGTKKIiTKSFGWETLDSFMQHDV 

QELCRVLLDNVENKMKGTCVEGTI PKLF 

RGKMVSYIQCKEVDYRSDRREDYYDIQL 

S I KGKKNI FESFVDYVAVEQLDGDNKYD 

AGEHGLQEAEKGVKFLTLPPVLHLQLMR 

FMYDPQTDQNIKINDRFEFPEQLPLDEF 

LQ KTD P KDP ANY I LHAVL VHS GDNHGGH 

YWYLNPKGDGKWCKFDDDWSRCTKEE 

AIEHNYGGHDDDL.SVRHCTNAYMLVYIR 

ESKliSEVLQAVTDHDIPQQLVERLQEEK 

RIEAQKRKERQEAHLYMQVQIVAEDQFC 

GKQGNDMYDEEKVKYTVFKVLKNSSLAE 

FVQSLSQTMGFPQDQIRLWPMQARSNGT 

KRPAMLDNE ADGNKTM I ELS DNENP WT I 

FLETVDPELAASGATLPKFDKDHDVMLF 

LKMYDPKTRSLNYCGHIYTPISCKIRDL 

LPVMCDRAGFIQDTSLILYEEVKPNLTE 

RIQDYDVSLDKAliDELMDGDI IVFQKDD 

PENDNSELPTAKEYFRDLYHRVDVIFCD 

KTI PNDPGFWTLSNRMNYFQVAKTVAQ 

RLNTDPMLLiQ FFKS QGYRDGPGNPLRHN 

YEGTIiRDLLQFFKPRQPKKLYYQQLKMK 

ITDFENRRSFKCIWIjNSQFREEEITLYP 

DKHGCVRDIjLEECKKAVE JjGE KASG KLR 

LLEIVSYKIIGVHQEDELLECLSPATSR 

TFRIEEI PLDQVD I DKENEMLVTVAHFH 

KEVFGTFGIPFLLRIHQGEHFREVMKRI 

QSLLDIQEKEFEKFKFAIVMMGRHQYIN 

EDEYEVNLiKDFEPQPGNMSHPRPWLGI»D 

HFNKAPKRSRYTYLEKAIKIHN 


12914 


26815 


A 


13049 


30 


477 


vraehcavwernfeetvrwtsvkflmts 
peiaslswgqmkvkgsnttykdckvwpg 
gsrtwdwretgtehspgvqpaac\dvke 
\a^skgvqtlvigrgmsealkvpsstvey 
lkkhgidmrvlqteqavkeynalvaqgv 

RVGGVFHSTC 


12915 


26816 


A 


13050 


179 


389 


NIETIQSMFPI/DNEMKLEINIiKKAIWG 
IHKYAS QVLWFMP I FPTLiWEAKVGE FLE 
PRSSRSVWETWRDPI 


12916 


26817 


A 


13052 


466 


1489 


PGKGEMRTRSPSPIiAIVPRPQRASRPLL 
CAVSPMASASGATAKHEQILVLDPPIDL 
KFKGPFTDWTTNIiKLRNPSDRKVCFKV 
KTTVPHRYCVRPNSG I I DPGSTVTVS VM 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isolcucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LQPFDYDPNEKSKHKFMV/QIFAPLNTS 
DMEAVWNEAKPHELMDSKWR/ CPNENDK 
LNDME P S KAVPLNAS KQDG P TPQPHS AS 
LHDTETRKLTEECKRLQGEMMKLSEENQ 
HLRDEGLRLRKVAYSDKPGSTSTASFRD 
VTSLLPSLLVVIAAIFTGFLFFFWRRSL 
TliVAQAGVQRRDLGSLQPPSPGFKQFSH 
LSL.PSSWDYRRPPPHPANFCIFSREGVS 
PCWPGWS 






A 


13053 


892 


189 


SYDPGALGCRPAPRSGRGKGSGATRDMH 
PMSRERAKFVKS3L \ YCKTTRELMLHAR 
CC1iNQKGTILiGL>DIjQNCSL.EDPGPNFHQ 
AHTTVI IDLQANPLKGDLANTFRGFTQL 
QTLILPQHVNCPGGINAWNTITSYIDNQ 
ICQGQKNLCNNTGDPEMCPENGSCVPDG 
PGLLQCVCADGFH/ GIQVYAPGLVLTAY 
VLRDSGSHHSIRLHSALGDPAPKSQBFM 
NYIGLTIDLRSI 


12918 


26819 


A 


13054 


423 


12 


SFNQKNPLRQKCSLRPTRPASQSIiATWP 
GQSTRFWEQALPWALWDPQSKRAE \MQL 
E P PHAHTWTHAHTCGHTGA\ FCSAHTEV 
YVHLHSPVHAHTHAHTHSPPVKTCGCRIi 
HAHTH\PQTPSPPGLLSLLAVARPQS 


12919 


26820 


A 


13055 


16 


349 


RRSGKNDPWTDQS PRAAASHTRCPHPAV 
AAAAMPKRKTEGDAKGDKAKVKDE S QKT 
S /ARMS S KSAAS KAYAKP I KAPAMNGEN 
VPNG IQRKLEAF YDLQ ACRHLTRL I DFA 


12920 


26821 


C 


13056 


716 


594 


MMQTETGVMPPQARQCIiEPPGAGTGRKD 
SPLYPSEQSWPL* 






A 


13057 


353 


2 


IGNPGKTTFNKPPPPKKFIFSKFFMFLF 
FLVKTRS CYVAQAGLQTPG \ S SDPSASA 
NQSAGITGVSHHTQSEILKYTVFCLTFQ 
TPEYTI PSTWS I PCCSHFSCYLCFLFAC 
FLRQSF 


12922 


26823 


A 


13058 


864 


529 


EVSTSKTAGCRGCAL.VISMLCI I FPQGL 
FTRALLRTCSALLHADWPE I SFVWWF 
FKTESCSVAQDGVQWCDLRSLKPPPP\G 
S SDS PAS AS PVAK I TGMHHHARHGLKS 


12923 


26824 


A 


13059 


2714 


1376 


GRYDGLVEQLGGRATPAVGFAMGLERLV 
LLVQAVNPEFKADPWDIYLVASGADTQ 
SAAMALAERLRDELPGVKLMTWHGGGNF 
KKQFARADKWGARVAWLGESEVANGTA 
WKDLRS ALAVG VI LGVGAL I GWRYWNS 
HQVDSARS ASLAYQNAVTAVSEGKPDS I 
PAAE KFAAENKNT YGALAS LEIiAQQ FVD 
KNELEKAAAQLQQGIiADTSDENLKAVIN 
LRLARVQVQLKQADAALKTLD/ TP I KGE 
GWAAIVADLRGEALLiSKGDKQGIiLSVTL 
LSGCSLFNSEEDWKKSPLPTVENQFTP 
TTAWSTSDGSGIGNFYSNLHPALADNW 
YAADRAGLVKALNADDGKEIWSVSLTEK 
DGWFSKEPALLSCRVTSVVJPNNQTEKFN 
KQRTEGPAVAEPEQAGVRRHYLYPFTGG 
YRRRRAACQAERPAARCATEEHGSRN 


12924 


26825 


A 


13060 


99 


254 


RNHHLVAFKEL.SALKVKRVDWAPWLMPV 
IPAF/LEAKVGRSLEPRSSRPAWAT 


12925 


26826 


A 


13062 


127 


386 


FSYYYYFFFLFFX.CVFVIPGEPFLKMKL, 
GKPKTIMLKEKKGGESYSELFFETDQME 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WL/ TPVI PTLWEAEVGGSFEHRSSR 


12926 


26827 


A 


13063 


1 


209 


GKYI LDIiKVFCCFLRQGLCHLAHCS LRL 
PGS SD S RASAS Q VAG / 1 S I TGVHHHARL 
NFVFCFFFLVCFCF 


12927 


26828 


A 


13064 


132 


1 


SHAMCCNYLKRFGWARWLMPV/ IPALWE 
AEAGRS PEVRSSRSAL 


12928 


26829 


■a™ " 


13065 


269 


134 


TLWSVFASS/WAWWLTPVIIiAPWEAKAG 
GSPEVRSLRICKQKVLS 


12929 


26830 


A 


13066 


853 


461 


RLWRSVSQFbCACVHMCVPSVCVCGIxAH 
AGTYAVSTCVH/MCVCVCMCARITLAKI 
CSETSGNXiLFRKI ISLLDLFTLNFS I KY 
KVQLVLKSRSLAWWGRI KLLTLSRPFSD 
VNTFERINYWL INDI S I I R 


12930 


26831 


A 


13067 


57 


2066 


AQPTGRQTSTVAPTPATARSRGGRVP I F 
PCPHPAPTLLSTSRWTPAS PAASMKAIi 
RLSASALFCLLLINGLGAAPPGRPEAQP 
PPLSSEHKEPVAGDAVPGPKDGSAPEVR 
GARNSEPQDEGELFQGVDPRALAAVLLQ 
ALDRPASPPAPSGSQQGPEEEAAEALLT 
ETVRSQTHSLPAAGE P \ EPAAPPRPQTP 
ENGPEASDPSEELiEALASLI/QELRDFSP 
SSAKRQQETAAAETETRTHTLTRVNLES 
PGP ER VWRAS WGE FQARVPE RAPL P P PA 
PSQFQARMPDSGPLPETHKFGEGVSSPK 
THLGEALAPLS KAYQG VAAP FPKARR P E 
SALLGGSEAGERLLQQGLAQVEAGRRQA 
EATRQAAAQEERLADLASDLLLQYLLQG 
GARQRGLGGRGLQE AAE ERE S ARE EEE A 
EQERRGGEERVGEEDEEAAEAAEAEADE 
AERARQNALIiFAEEEDGEAGAEDKRSQE 
ETPGHRRKEAEGTEEGGEEEDDEEMDPQ 
TIDSLIELSTKLHLPADDWSIIEEVEE 
KRNRKKKAP PE PVPP PRAAPAPTHVRS P 
QPP P PAPAPARDE LPD WNEVL PPWDRE E 
DEVYPPGPYHPFPNYIRPRTIiQPPSALR 
RRHYHHALPPSRHYPGREAQARHAQQEE 
AEAEERRLQEQEELENYIEHVLLRRP 


12931 


26832 


A 


13068 


282 


53 


PGF PQGKFRFKKKI LPQDYP FQGAPNS K 
ARPGQGVPPG I PA\LWRVKKGGPLRSGG 
PGPPGAKGGTPFLPIPNTKKS 


12932 


26833 


A 


13069 


177 


2 


VSLCRPGWMECSGVISAHCNLRLPGSSN 
S / PAIASRVAG I TGARHHALLNSLFKVIi 
SRA 


12933 


26834 


A 


13070 


789 


302 


' PHPEFYYS FQ YF I YFF I S FHFTRHF I VH 
YFISSHFISCHLASSSSSSSSSSSSSSS 
SSPPAFHYSFHHFISFHLAIHY\HFI IS 
FHLFHLFSSFHSISFRDRDLLYCPGWSV 
PPHPAFHYSFHH/VHVMSRHFILRQTRS 
IAQAGIQWRDLCSIjQPEVGSSRSA 


12934 


26835 


A 


13071 


193 


2 


PPYLKSHSQVDLSPGVQDQPGQHRETPS 
/LTKN/TKVSWVWWCTPVI PARWEVEVR 
ESLEPRCSTRA 


12935 


26836 


A 


13072 


122 


1495 


LLSDFFFFETE/SRSIAQACMQWCYLSS 
IiQSLPPGFKRFSCLSLPSSWDDRCPPPC 
LANYCIFSRDRVIjPCWPGWSRTPDLR 


12936 


26837 


A 


13073 


178 


347 


CCLCNDNSVXiLCMYYLLVCMHACMHALY 
IKTWK/MQLGAVSHACNPSTIiGGQGEWI 
T 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12937 


26838 


A 


13074 


165 


34 


FILSFPFETVSYFVAQAGVQW\LIIAHC 
SLDLPRSSDSPTSAP 


12938 


26839 


A 


13075 


205 


105 


GGWVQWFMSI I P / AI WEAKAGGLLEPRN 
SRPVWAT 


12939 


26840 


A 


13076 


3 


357 


HERE YTQAEKRKF I AFFFFKMGS CS VIQ 
ARVQRHDLSSLQPP/RVAGTTGTHYHTW 
LIFVFSRTDSRLPSSPRSPLTSVISLII 
PLSLIPTTYLTTPSRPSSTSPPSHPSFR 
APLTSDP 


12940 


26841 


A 


13077 


3 


477 


S TP ALTARGLSGGS RAS VAAM VLiLES EQ 
FLTELTRLFQKCRTSGSVYITLKKYDGR 
TKP I PKKGTVEGFEPADNKCLLRATDGK 
KKI S TWS S KE VNKFQMAYSNLLRANMD 
GLKKRDKKNKTKKTKAGSNSS /ATTAAP 
AAAATAATTAATTAATAAQ 


12941 


26842 


A 


13078 


1077 


1566 


RQVTSLDPLKTFPRTSLEFTSPTGQLDS 
GATAFTWWPSGAPTPRGRESAAHQGN/ 
YPVGQRS \RMAGLS PRSFCWWEVSFSR/ 
SHSC/I CWVQLGKSAVLPQQSRGPGAHE 
GSRRRGG S S P P CPP LQTQLVFS PQELS M 
SAPIHRLSVTLQGDCISTGGAPSRCRIiA 


12942 


26843 


A 


13079 


3 


396 


HERGRKMVSVT\RLIQRLRNWASGHDLQ 
GKLQLRYQEISKRTQPTTKLPEGPSHKLi 
FNNYY\CTRDGRRESVPPFIIMSSHKAL 
VSRMPAYS FAEAATDKKAVTPAP S I YRG 
EQSSDHPYLRHCTLTVTRLL 


12943 


26844 


A 


13080 


467 


109 


SLVPWSPHLKEPSCRYLIENVMLVPKE 
QEFWSKEAGSLiFSPPVPFYCWTPSIVFS 
SSFFFLFRCVCFFFFFFCFFFFFF\LLF 
FGTKKALYIiLRARGCKQYKNQKLIWYLT 
FLSLLVK 


12944 


26845 


A 


13081 


458 


51 


VSLFCTVAPLLLLPEGIFFSCSLLEVKA 
PMRDVSGVFLTLFLYSVLFFPLGVFFFL 
FFFLILPPPKIFFFFFVCVALFFFSPAF 
GAVFLF\SLWSSVIiPGVGFFTACSFFFF 
FFFFFFFFLRQSFALVAQARMQWR 


12945 


26846 


A 


13082 


23 


327 


RSDSEGGRRGHKYI FI PIYTHTHTYIFI 
YTQIYICLHIYACLS/LICTHIFISIYI 
HTYIYIS I YAHIYLYIHTHI YLHIYTHT 
YIYIYLHIHIYSSWVCKGS 


12946 


26847 


A 


13083 


352 


153 


KKLVE Y YS AI KKNSVL IH/VS T WMRLKN 
ITLSLKTQSQRHLYYIIPSVRNVQNRQV 
HKYRSRLVDSED 


12947 


26848 


A 


13084 


121 


417 


DYDYYCYYYPKIYLTKLLYFY \ VGNYI Y 
THIYVRHVYICETYMPVYTHIHTYICIY 
IYIWKETTWLFSCGISKTYRTHDIAYTY 
LVFCLTHVTFHVSST 


12948 


26849 


A 


13085 


242 


12 


LSSRLPMLLETKEITSKILPSRKL/LPY 
GWCFKS FK I TWVS Q I RNRL \ AQWLTTVI 
PQLWEA\SGLLKPRSRRPAWTTW 


12949 


26850 


A 


13086 


274 


340 


GGEKKTPGGFLEKKSFFGGGIFGPPP/P 
TKRGVFPPS PKKFFFPPKTKNFGRGGGP 
KIPPPKKDFFSKKPPGVFFSPP/YKKKK 
IIFPPPVKLGPPKDFLKSPPPFFFFFYF 
FFFFFFSSQEVNLKALSSTSE I I FFYDV 


12950 


26851 


A 


13087 


241 


405 


PPPSKIHIGNFRKMSDVKNATGWAWWLT 
P VI PAR WE AE AG \ VSLRPRNSRPAWAA 


12951 


26852 


A 


13088 


537 


375 | KHS LT LSLRLE CRGT I LAHCS / L / CTS L 
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beginning 

nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 
sequence ' 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyl»lanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LGSSDLSL.QPQQIAGTTGACHHTHLEKI 
F 


12952 


26853 


A 


13089 


448 


186 


GAIiVFFSSQAPPCPPKMEQSFFFFFFFIi 
ETGSRSVSKAGVQLHHGSLQPPPPGFER 
SSHFSVTTI/TYLYHHRLAIiHVFELHIN 
GIMHR 


12953 


26854 


A 


13090 


365 


191 


GQEREKEREREKRRQKKREVE / REEEKE 
RVRDLESESNRALERERSALVDRAPLSR 
PG 


12954 


26855 


A 


13091 


148 


316 


DKMESCS LAQTGVQWHD \ LG \ YLQPSPL 
GFKRFCLSLSSSWDYRWETSTEPFQQAV 
F 


12955 


26856 


A 


13092 


712 


229 


FVAWVKMAEYIiAS I FGTEKDKVNCSFYF 
KIGVCRHGDRCSRLHNKPTFSQEVFTEL 
QEKYGE I EEMNVCDNLGDHLVGNVYVKF 
RREEDGERAVAELSNRWFNGQAVH/GEC 
TRGGFCNFMHLRPISQNLQRQLYGRGPR 
RRSPPRFHTGHHPRERNHRVFP 


12956 


26857 


A 


13093 


835 


405 


ELIERLGPNQKPPILMWKPMPSPSNMKA 
S AALLCLLLTAAAFS PQGLAQP VGINTS 
TTCCYRFINKKI PKQRLES YRRTTSSHC 
PREAVIFKTKLDK\EICADPTQKWVQDF 
MKHLDKKTQ\TPKL.LNIHDWNLENQAMT 
LRKPN 


12957 


26858 


A 


13094 


1137 


424 


LCP SHFAPTTLTQ PGAHKNMCCI KSRFK 
RDLGLCRTCLVNKMFTSS ILGKSHRHSL 
VSINQGNNALWKAAG\PLSWKAGYC\QG 
FS P CDSLKYG \ S WDEKDLTVPQPDTHKG 
S VLRWI S KRGKPLAVE IEGRATGLPGLA 
PWGTE\CLGYKTPIV\HLFNSEMG\ENR 
PYGGEARHVCSNAALLFFTPLRCLGGEK 
HKSGLRAHPVIVLSLEUJYDIDSFAHMF 
FADLLLIITLLSCYIPFC 


12958 


26859 


A 


13095 


1 


324 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERERP/RPRVLFFVRGREGSHPEKCEN 
KKERAPFSRALRKI SIETRGGVSITTEA 
CVFVIGAHRVCRSVYIYI 


12959 


26860 


A 


13096 


508 


840 


DGVSLCRPGRTADCSGAISAHCKIiRFPG 
SRQSPGLSLPSSWDYRRLKPMRPANFFF 
CIFF\VKTGFHLVSQ/AMGIiDIiLNS/S I 
PPRLGLPKCW\DYRREATAPGQELLLNR 
TGM 


12960 


26861 


A 


13097 


53 


254 


WPQTASCLVAQAGVQWSDHSSLQRQTPG 
\SSHPPASASQVARITGMHHCAWLHLI I 
LLRAHGSSPLC 


12961 


26862 


A 


13098 


329 


67 


GPGAPIiLKGEKIiL P KB P PPR I S P I VSAF 
FAWGPPPGYCFLKKGGPFF\VFPPSFFG 
KKKNSPPPPHINGGAKTFPNPPliL,FSSF 
FFF 


12962 


26863 


A 


13099 


253 


444 


EGAFSLGTF YFI FFLVLSNTFLLS I KNQ 
QIWNKKTVW\PS PFLPTL IALTTLLLP I 
SPFILIIL 


12963 


26864 


A 


13100 


341 


590 


PMASSQICIGRIPPFLYLKWYFWPGKW 
HTCNP KTLGGRGAIi I TQGQKFENS IAN\ 
MAKPHLY 


12964 


26865 


A 


13101 


55 


184 


RL.REPTPL YSE / HAPTKYYRMAHHNYPH 
SPPI PQPRHHYRVFLL 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M— Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


12965 


26866 


A 


13102 


433 


762 


RNLLNCKRGLERKGCRASWPAVPLFLQR 
RVWPRPTAELNEACMLAQHQNCQNVKME 
YFQLSFTKRQEI /WLGWAHACNPSTLG 
G\QGGG IACSQEFKTIPSSKVRPCFYLK 


12966 


26867 


A 


13103 


349 


250 


DLFFSTKPPPPPPPPKNLGPP\GPFGPP 
FQEG 


12967 


26868 


A 


13104 


3 


236 


Y PQHYP S CPTFQ YTLKTKNHNFL I FFEV 
GSCSWQARVQWCHHGSLQPP /NSSDPP 
ASAFQVAGTPGVSHRAPPKNQNF 


12968 


26869 


A 


13106 


154 


4 


KKVKNTVHYNGKYFLKSQNQSTFLKE I L 
/WLGGVAHTCNPSTLGGRSGRIT 


12969 


26870 


A 


13107 


391 


150 


RPPKKPNKKYSQITSKGIKMVYTRKYLT 
QKKKAMSQALWY/V/HAVPATQEAEVGG 
SCEPGSSRPRCTMIMPMNTHCAPAWAT 


12970 


26871 


A 


13108 


90 


252 


KVGDGGRCL.VFCFYFKRQGFTVAQAGVQ 
W\ATIAHCSLKLLGSSDPQNYEDSWGT 


12971 


26872 


A 


13109 


197 


414 


I>YEF I KI TVLLGLGCPLGRYGCSNQRFF 
S LEME FHTVAWARVQWHDLDS IiQPLPPR 
PNRFSCLR/YPQSSWDYR 


12972 


26873 


A 


13110 


2 


317 


GRVGANVHKGHRQRTYGSVI PHILPLHV 
LKKTFSLRDFHFSVSLKK\NLVLTCLHL 
FL/GVRTPRNDPFVSMMLLFTAFLDRPS 
T I LGTGliIiYTEGLT VALiRLAYLR 


12973 


26874 


A 


13111 


413 


162 


It I LLPQPAECLQIiQASAATLDCLGLjPRC 
RDCRRS LVHS VLiNGAQAGVQWR DLGSLQ 
PPPPS \TCLGLPKYRDCSLCPAATPSGK 


12974 


26875 


A 


13112 


40 


296 


ESRLSALYI YH I CVCVYNREHLLYGIHM 
/S IHTYTHTHTHTHI YIHMRAFALEDKF 
I CSLLCSRQDNAFI LVSVKLQRKNYFIiT 
RR 


12975 


26876 


A 


13113 


2 


288 


FEPRCKNSARGKVPRGFSPNLPHVTVET 
PEGSKTGFELESR/RRHRQIHSSAQCPL 
TAVPGAGDA1 PEDASGWHTWLPLHAQNC 
FLLY I QAPEQP PA 


12976 


26877 


A 


13114 


685 


340 


LSPPRAGSARPTGPPTDAPGQRSTWTCG 
ALKPRRRALiRDAAENliFQEXiQEHFQAliT 
ATLNLRMEEMGKRIEDIiQK\NVNDIi\MV 
QAGIENS I KEQMLKTVTANMSVFGDGAS 
YRSH 


12977 


26878 


A 


13115 


65 


3021 


GVLTQMGDEKDS WKVKTLDE I LQEKKRR 
KEQEEKAE I KRLKNSDDRDS KRDSLEEG 
ELRDHCMEITIRNSPYRREDSMEDRGEE 
DDSLAI KP PQQMS WKEKVHHRKDEKRKE 
KCRHHSHSAEGGKHARVKEREHERRKRH 
REEQDKARREWERQKRREMAREHSRRER 
DRLEQLERKRERERKMREQQKEQREQKE 
RERRAEERRKEREARREVSAHHRTMRED 
YSDKVKASHWSRS PPRPPRERFELGDGR 
KPVKEEKMEERDLLSDLQDISDSERKTS 
SAESSSAESGSGSEEEEEEEEEEEEEGS 
TSEESEEEEEEEEEEEEETGSNSEEASE 
QSAEEVSEEEMSEDEERENENHLLVGKN 
L PGDRVPVPESRFDRDSGESEEAEEEVG 
EGTPQ\S S ALTEGDYVPDSLPLS PIELK 
QELPKYLPALQGCRSVDEFQCLNRIEEG 
TYGWYRAKDKKTDE I VALKRLKMEKEK 
EGFPITSLREINTILKAQHPNIVTVREI 
WGSNMDKIYIVMNYVEHDLKSLMETMK 
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NO: of 
nucleotide 
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M 
eth 
od 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Pred to- 
rt ucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^-Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














QPFLPGEVKTLMIQLLRGVKHLHDNWIL 
HRDLKTSNLLLSHAGILKVGDFGLAREY 
GS P LKAYTP WVTQWYRAPE LLLGAKE Y 
STAVDMWS VGCI FGELLTQKPLFPGNSE 
IDQINKVFKELGTPSEKIWPGYSELPW 
KKMTFSEHPYNNLRKRFGALLSDQGFDL 
MNKFLTYFPGRRI SAEDGLKHEYFRETP 
LPIDPSMFPTWPAKSEQQRV\KRGTSPR 
PP\EGGLGYSQLGDDDLKETG\FHLTTT 
NPGGLLPRGPGFSLRVAEVSELDPVIGR 
NSAGDHRRGYLPAWNCDGTRNSSSYFVL 
PCFVFVFLVCKLSRIKSFSH.WRKEI.CF 
LRDLPRAERVGI FRGAHVGQHKPHHHPL 
PLSTRTGLGWAVIWKGTGGSRVDCLIFG 
AGDLFLCWDERCPLAPTHSSRPALSTTS 
PRPPTTQPQLLPAGLEQVFYYKWGVLK 
CXKIFFEE 


12978 


26879 


A 


13116 


469 


202 


MI I I VGLCEYTKSAVLVQHSDPLPPAPG 
VFFLAVLDWDIKIWHPVFCLFVCLYE 
MESCPVAQAGVQWRDLGSLQPPP\PGSS 
TSCF 


12979 


26880 


A 


13117 


193 


383 


AARMASTFFVFWFWFSTF\SLYSICVCV 
CVCVCVCVCRMHTVQLYFFQVGCQCSSL 
LNPLFPMN 


12980 


26881 


A 


13118 


146 


350 


HYLVKFLLLAHPGS KAP PLNTLQPP PLA 
TREQPPLTVIFHYPPTSYKMAP\PYLPS 
LTLFGLS PPAPR 


12981 


26882 


A 


13119 


124 


254 


HALLQTLRDLS QAWWHALVI PATW \EAE 
AGGSLEPRSSRPAWAT 


12982 


26883 


A 


13120 


150 


45 


SLPRLECSFTVLAH/ c/nlhllgssds p 
ASASQ WGG I 


12983 


26884 


A 


13121 


336 


446 


IPLKI S S WAQWLTP V/ IMPAVWEAEAGG 
SLEVRSSGPA 


12984 


26885 


A 


13122 


181 


39 


QNLTAQDGVQWCDLGSLQPP \LPSSWDY 
RRESLCPSSFFIFWKRWGFT 


12985 


26886 


A 


13123 


463 


2 


GPAVPSGLYLKGKPMRRLCASHS PESHS 
HDKGQGTLPRIPQ/PLLGEGGSR/PLAW 
GAVAGLPSNRPRIVPLPAPTRSGTRVRP 
HTGHQPGMS PGCGCESKWPHS AAAWPEA 
YPHF FLL P FPTQGCE VLGPL YTADPWV I 
CVSLLPSCPNSTAVDVD 


12986 


26887 


A 


13124 


429 


263 


DHFSFLCVTGSQSVAQAGVQW\LIIAHC 
SLKLLASSDPPASVFQSTRITGPFLSFT 


12987 


26888 


A 


13125 


310 


486 


NKKTTRGES SKQRQPRVSHHPGWGTVAP 
SGFTAASAS \RARVI LLPRPPE 


12988 


26889 


A 


13126 


124 


461 


G S PLQLLPS PLAALTR DCS E APMGS CSV 
AQDGVQWRDLGSLQRLPPGFEPFSCLSP 
TPAP\FPSGWDYR 


12989 


26890 


A 


13127 


2 


337 


RGAAPAAMAVTALAARTWLGVWGVRTMQ 
ARGFGSDQS ENVDRGAGS I REAGGAFGK 
REQAEEERYFRAQSREQLAALKK\HHDE 
E I VHHKKE I ERLQKE X ERHKQKI KML KP 


12990 


26891 


A 


13128 


681 


422 


CRSDRWAKEHRGKRGQDSS KDVMARLME 
APKQTAQYFFIFYFFETKSYSVTQAGVQ 
WLDLGS LQRPP PG\ S SDSPAS AS CAWPQ 
TAH 


12991 


26892 


A 


13129 


299 


579 


LVMFCKVTKIERLNLWRPGTVAHVLWSQ 
HFGRPRWAVHVGWGVRDP/ RLTQHGETP 
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Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nu de- 
location 

ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^-Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptoplian, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SLIiE \NRNI S QAWW CMP WPATREAE AR 
ESLEPGGQRLQ 










268 




KVDKSMKMRKNRPKNAENS KNKKAS S P P 
NDCNS S PARVQ/ S WTENEFDKLTEVGFR 


12993 


26894 


A 


13132 


1108 


1529 


KIHSS FVLLNAKLVAVAAKTPKFYKKKT 

nT .'Dr»TVWc:iin r pa vow t oi/cbp e tcttj e tt 
Uljf\J 1 1 n&Ay 1 A V Jfo 1 r ViAn ^ ±r IKbf 

IHS FI FETES YS VAQAGIQWHDLPS \ PP 

GS\SDSP\ASVSRVAGTIGVCQQAWLFF 

V\FIjVET\GSFPHLSPLTMIRNKKQIjFK 

KGH 


12994 


26895 


A 


13133 


497 


370 


RPSTVSPRLECSGLIIiAHCS \LDPPASA 
S QTAE TT YGHHHAS 


12995 


26896 


A 


13134 


217 


373 


CNHTEITYVNKKKHRPGTVAH/SLYNPS 
TLGGQGGLITQAQEFDTSLGNMTIjS 


12996 


26897 


A 


13135 


257 


507 


PVSPGFPPL\CPVPHTPHLCPPCRYPES 
PGQPAAQHCGAPRQPQPNPRACSSRGLF 
RCPPACHHAPSCLLCVCPPLPQCCRVQG 














PSFPTHPQFCFLFCFLfcVFETE/ SSIVA 

qaevqwcdlgspdqgssasppsassi.pl 

SIW 


12998 


26899 


A 


13137 


297 


419 


GGSAVYYI WEIVQVLWLMPVI PT\WEAY 


12999 


26900 


A 


13138 


213 


492 


QVGINYQPPTWPVGDLAKVQRAVCMLS 
NTTAI TEAWGLPGAISSAKCAIjVHW \ YV 
GEGMAVGEFSEAREDLAALEDYETCGPW 
NPVETEAERR 


13000 


26901 


A 


13139 


246 


534 


DRVLPLLPRLECSSAILGHCP/APASQV 
TGTTGTHHHTQLI FLLYCWLCPPPLASE 
ASAES PPLL1PRGFL.CFSSTTSS ITAI PA 
TTRDYVDVSRSGS 














S LE WP FF FS FS ETG S H S FTQAI VQ WCNH 
SSL\SPTSASQIAGTTGVHNHAQL/VFV 
FCFAFMSPQLSNSC 


13002 


26903 


A 


13141 


164 


582 


GLVLVNLYLRSYLMVFIYINSRLLQVTD 
ASRERRENESSWFCRSVCRM/PPVGPGP 
S CVHP CS PRGS I S PCTPHCPLKTTQAPA 

PILVAAHSSWPAPPSPVPVPQEDTEHGA 
G 


13003 


26904 


A 


13142 


235 


367 


LALNTFCWLGWAHACNPSTLGGQGEWV 
TRGQE\QANMAKTCLY 














GALQPATAPWEPLSGIiAEAGAI SPCLQG 
G/SGGRGASENRGCTRPGRVPGGRRLNR 
PRAWSSWPAPPAPGSERPSAINCPRAEE 
CGRRVWDWQAALPAAPAWVPV 


13005 


26906 


A 


13144 


558 


147 


GHSFQNWLLGCPLQLQRHLLHPPDASEE 
CGSPPWTPLRPHIHPSAGPHGSTHK\HA 
GGCLFPLRPSPTHPCGPPVPLPWSQ\QA 
PLCPHPPIHSPHDWECDPGKSAIiIPPPA 
AQS PSRL I KHAAKE PECS PRNRRPG I P 


13006 


26907 


A 


13145 


321 


421 


GCAQWLTPV/ IPALWEAKAGESLLFRSL 
RPAWAI 


13007 


26908 


A 


13146 

• 


3 


398 


IETJJQSHDKTLTDKKLLLLDEQRKWFIjE 
MESTPGEDVNNVELTTKDLEYHINLDD\ 
KAGFER I DSNFERVSAVG/ KMLSNS IAC 
YREIFHERSVKVANFTVAILPQPHKLSV 
TINLI I QQPS TLRQDLLLAKR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalnnine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13008 


26909 


A 


13147 


228 


261 


SKTQRS SE YYSLS VCVCVCVRVC IYIYI 
FFFFETEFLS\APQAGVQRCDLSSLRLP 
PP\GSSDSPFSTSQSAWDYRREPQRPTS 
LHYFVQLKHNLHRVKRRDLRNTTR 


13009 


26910 


A 


13148 


3 


275 


KHLSDLQILRLGLSSKTRKTRYSHFAQG 
RS I FFLLRPS VDW I KPACMDGNIiliFSKF 
TNLNVNI \ T KKHPHRNVQNNVW PNVWAP 
WPKEVDT 


13010 


26911 


A 


13149 


218 


382 


AWDHI KliESFYTAKETT/RMKRQPTEWE 
KIFANYPSDEGLVTRMYKELKQLYRKKI 


13011 


26912 


A 


13150 


391 


29 


FCFLFLFMFLFFFEMQDGVQWCNLiS S PQ 
P P PPG \ S SHS PAS AS Q WEAENCLNPGG 
GGCSLGDRARLPQRKRERERETERKRKR 
ERASCAFCGAITFVNMWCX3KHHTCNHLP 
WLTVFETNKL 


13012 


26913 


A 


13151 


351 


157 


FGYSRFLCVYINFRTNLS IYNYI CI YIN ' 
LYL\NIHTYTLI YTYVHTYTHIHI FYIY 
IGILTGIT 


13013 


26914 


A 


13152 


402 


111 


TFFGGEKPPFTLRPGPPRREGGGGPPGP 
REGGFFRVPLGAFGPSRPK/RPEKSGPV 
WAPQ I FLWGGKSRAI KESGKGFFFSRI T 
LLVFATE KKKKTTLF 


13014 


26915 


A 


13153 


310 


141 


REHEKQKPLSQVRWCTPLVQAAQEAEAG 
GELEPRSSRliQCSMT\ TPVNSHRP PARV 
T 


13015 


26916 


A 


13154 


153 


21 


LNKCV I SWLGAVAHACN/ PSTLEGR WIT 
QGQEFKTSLGSIiAKPHLY 


13016 


26917 


A 


13155 


429 


275 


SVDIPLEKTLINKDVFkEAAFKCKVS /R 
QARVKFKKRYKPVRTKWFFQKLWF 


13017 


26918 


A 


13156 


303 


42 


YRERAPVYP I I INI AFGPHKQ \ TRAHTH 
THTHTRMHAHTHTHPAI I PAFAKNGLGT 
FHHKGRTYNDKVKPRLS CEAAI PSNLKS 
REFR 


13018 


26919 


A 


13157 


198 


13 


SHGVAILLFFHFLNKRAFTVYSPAMNSS 
LCEIQEPSLG\SGSGPLSGNRMSGKGHR 
KYFAKI 


13019 


26920 


A 


13158 


192 


2 


ISNSHRKDIKDPEVLLIEQKYILSGQMQ 
WLTPVIPAL/V/EAEAGRSLEFRSLRPA 
WATDRDSVSK 


13020 


26921 


A 


13159 


40 


190 


RVDPRVRESRSVAQAGMQWRDLGSLQAP 
P\PGSRHSPASASQVALFLNRK 


13021 


26922 


A 


13160 


1 


178 


PTMVLS PADKTNVKA / AWGKVGAHAGE Y 
GAE ALERM FLS FP TTKT YFPHFDLS HGV 
SSYL 


13022 


26923 


A 


13161 


411 


41 


E SLQ I C VGLHAKPS WDACRPWLGQAWFR 
YFLLT FIiGPL. VFLLVFW FSK I YFL YHLK 
NPSLSSCYNLD\FNTQYLKYCKKKKRIjG 
AVAYACNPS ILGSQGGRITRGQEFETRL 
GNMAKPDAWVDP 


13023 


26924 


A 


13162 


372 


218 


FWKRSWISKKGLFKRGVSLSLSI.S/CDT 
HTHTHTHTHTHAHSGYLCVLRKVK 


13024 


26925 


A 


13163 


247 


89 


QGLVLS P WLKCRGG I TAHCGLLGSSHPT 
TSA\QVAGTTGTQHHAQLKFFILI IL 


13025 


26926 


A 


13164 


122 


409 


FPSTAIANSH/ CSSPQGCSAYGVAIRLF 
LRS SNKLAL \ L YGLAL /NS FFCN FQEP S 
\LGSLSGPLSDNKSNNLCCFKSPSSWLF 
AIAVLGNYYSLNTVRS GRRDPGTPMACS 


13026 


26927 


A 


13165 


69 


313 


PSVSLFFPPNHNIVIHMGLLKINS I FGE 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GLALSSRMECSVTITVHCGLDLLGSGGP 
PSAAS/RVAVTAGTCHHSWLMFLSFV 


13027 


26928 


A 


13166 


310 


26 


IKSMTPLPALNVSLSYYLPPLRPCRERG ' 
MVQDRFFCFLFCCCCFIFSETGSHSAAQ 
AVSQNKKTTGWAWWLTPVI PATHEAEAG 
\ESLERGRQRLQ 


13028 


26929 


A 


13167 


296 


68 


KNG P LNLATKS LL I LTMKCLG WLGGKP 
NCSKLKKNKIKTKKWGWAPWLTPILPA/ 
LLEAEAGRSLEARNIiRPVWT 


13029 


26930 


A 


13168 


2 


235 


GATPCPREPSLFPIAGWAPLSCWAPPPH " 
CTQGTTPPGLMGTG\AP I AI IGGNARS P 
PMPFLPLSGTRRALSAQECPTF 


13030 


26931 


A 


13169 


248 


406 


TERCGFDLLSLGQVQWLTSVIPALE/EA " 
EAGELLES RS WR PAWATWQNL I SLKK 


13031 


26932 


A 


13170 


33 


308 


NSLQGAEMAAAQS S LFSAS EGYRGKRGR 
GRGRGR PGRG /RS EGDRGGRGRPGRGRG 
RKVLLLRI CVWRRVLRLWNRGRIGVGGR 
VTYLMESEI 


13032 


26933 


A 


13171 


294 


56 


ICKRNYLFFIYFLREGLTLLPRLECSGA 
ITAQCSL.\GSSDPPTSASTVSGTTGACQ 
CALSSRDALISLiCYPGWSQAPGLK 


13033 


26934 


A 


13172 


71 


476 


DSLNYDNHIYQMPLIYINMGIAVTISL1. 
G\ILVYRSHLISSLLCLEGIILSLFIIA 
TL ITLNTHSLLSN I VPIAI LVFAACEAA 
VGWEECGFRSRPTALILEGPLRSCQPRT 
CVRRPACTRRG PAFVGKNRHS IGG 


13034 


26935 


A 


13173 


86 


318 


NNCNVCVCVC VCVCVCVC IH I C I YV/ YV 
YICIYTYKYICIYTYKYICICVYIHVYM 
YMCVYVYICICMCVYICIDSKGR 


13035 


26936 


A 


13174 


466 


726 


NCMKFGAVTRIG\DLPWEINPLSSCSLL 
REKDPPTTSGPQTDQPKKHLTNFKSGKR 
PLFTLFSNIiPHYPSTSFSFQSWRHTSIS 
PFS 


13036 


26937 


A 


13175 


128 


325 


FLRVILICHKMYGCVCVCM/ YACICNGM 
CIYMCVCVKSWKLKPELKCLCISDEVGW 
VLTQWAGREEN 


13037 


26938 


A 


13176 


134 


311 


INSNSEKMGSHYVAQDGVE/WQGLFTGH 
GIAHCNLKLLGSSNFSASISQGAGTTGI 
VANG 


13038 


26939 


A 


13177 


577 


1052 


SPTSTRTGSGVAMWLSQPMGRT\CRSKV 
ASKSRIiHGPLIWAHPPIVRCPMLRHHY\ 
KAQAGRGLSLEE\LRVAGIYKKVAQTIG 
IS\EDARRRNQST\QALOAKVQRLK\ED 
RSSIiILFPRK\PLAPKKGDSSAEELELD 
TQLTGPEMPIGNVYKEKARVIAD 


13039 


26940 


A 


13178 


496 


678 


TLNFVWAQGbKITRPWKNAPVFPVIW\R 
LRRKNIiLTLKGE I CSDPKWP YCLPPWKT 
KKNLV 


13040 


26941 


A 


13179 


192 


421 


G I YTFFLS FFLFLRQSLTLS PRAGVQWC 
NLCIiPGSSDSPASASRVPG\ATGT\CHH 
GSANFLYFLVGDRVFGYVGPR 


13041 


26942 


A 


13180 


144 


5 


LQLAI KLLKMRLGWAHICNPS / TLGGW 
G \GRS QGQ E FETS LANI VKP 


13042 


26943 


A 


13181 


351 


51 


KKKKKI FFGTKKRRFFFLKGVWGPPPPK 
VSPPPPPKISPNPPKNNPLLKA/IKSPQ 
NWLFFFGPPSCKFFPPFFLNPKNSPWEN 
PPSGGE FS PKKKKKKSAI 


13043 


26944 


A 


13182 J 


2 


218 


IHFGCFVFLFFETEFCSCHPGCRAGVQW 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucinc, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














RD\LAHNSLLPPPL.G\SSDPPTSASQSA 
EITGMSHHACP I FKKHS 


13044 


26945 


A 


13183 


33 


304 


EKHGIFRAVKILCMILQRWTYATHLSRP 
TGYTAPRVNPKLWTLGDNDVSM/SGSLI 
VTNTALGWGLLIRRSSACMGQGDMGNLC 
TFLSILL 


13045 


26946 


A 


13184 


277 


13 


FSFFLIVEMGSCYIAQAGLELLGSSNPP 
SSASQSAGITGMSHHPAWLATliKRAFVS 
D\NSLSFPSSENAHLSILAGHFCLMGFF 
FFQ 


13046 


26947 


A 


13185 


140 


361 


ITCCCVCIYTHIHTYVYVCVYIYTHTYI 
RVCV/YYIHTHTYIHIYIHVFIYIYIYT 
FMHTYIYPYVCMVIiXiQIiTL 


13047 


26948 


A 


13186 


64 


759 


CJjSAESAPTSTMPMTLiGYWD I RGIAHAI 
RLLLEYTDSSYVEKKYTLGDAPDYDRSQ 
WLNEKFKLGLEFPNLPYIiIDGAHKITQS 
NAILRYIARKHNLCGETEEEKIRVDILE 
NQVMDNHMELVRLCYDPDFEKLKPKYLE 
ELPEKLKALTS\EFSGGKRPWF/SQGDK 
ITFVDFLAYDVLDMKRIFEPKCIiDAFLN 
LKDFI SRFEGLKKI SAYMKS SQFLRGLL 
FGKSATWNSK 


13048 


26949 


A 


13187 


41 


426 


LFFFFFESGPCWVTQAGGRR/WNDHGSL 
QPGFPGLKGSPCLTLRRSWNYRPGMCRH 
TQLVFAFFFFPREKKFLLGAPTGFKTRG 
LRGP S RFGFQKGGNKGQEP P P RAKGL I F 
LEPERGQAFFGVFFGPTQ 


13049 


26950 


A 


13188 


39 


197 


FFFLFKTRSCSFIQAGVQWCEHSSLQPQ 
TPG\SSAAPASASCKAGTTEAALKL 


13050 


26951 


A 


13189 


221 


396 


GIQLQGAEGLWDVTNSPFCWRliYDQKTV 
YHECRMWANSHCP \ PPKGLIiRDI TPRCW 
AP 


13051 


26952 


A 


13190 


2 


826 


PGST I S SRRRGACG SRGGHFPS PRGGS G 
VASLERAESWSTEPAKAIKPIDRKSVHQ 
I CSGQ WLSLS TAVKKI VENSLDAGATN 
IDLKLKDYGMDLIEVSGNGCGVEEENFK 
GLi \ TLS ALKHHTS K I QEFADLTRVETFG 
FRGEALS SLCALSDVTISTCHVSAKVGT 
RLVFDHDGKI I QKTP YPHPRGTTVS VKQ 
LFSTLPVRHKEFQRNIKKKRACFPFAFC 
RDCQFLEGSPAMLPVQP\AKLTPRSTPP 
HPCSLEDNVITVFSSVKNGPGSSR 


13052 


26953 


a" 


13191 


235 


1 


SPCARQCCPPNPAGQEPRRRLERGPGKW 
WPRS I KFPLPAV/ RARFPLLPSAPLRQL 
VSGRVGGRVGRPGKAVQVSGGLN 


13053 


26954 


A 


13192 


2 


386 


VIYLIiLFFETESCCVAQARDGVSPCWPG 
WFQTPG\SSDLPASAPK/VAGITSVSHR 
TRPDVRGFKKC/GLQL.EMSHLALLL/TT 
SHRFWYFVLLFS FVS KGFF I FFFFLFFF 
FFLGKGFFFFFPRGGGGGQI 


13054 


26955 


A 


13193 


749 


506 


GQTKAFVLSYCGAS PSIKQ/ PCPQAKDH 
PLEPSMHPEGTQLQSCSTMLGPRQLSSE 
KQPLLPPRSHLKSSPMLRACKGLTS 


13055 


26956 


A 


13194 


73 


297 


RMPGFGALGSPFSCQRIECPC\AGAGSC 
TYASFCKCKEYKCTSCKKSECGAFPGNQ 
GAGQSQRREPRAQQAGAGQ 


13056 


26957 


A 


13195 


670. 


390 


PRETYIKPFARSIGCQPPKTIHPTSRRP 
STRPPARPHARPPVHTP/APSVHMSDRP 
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SrCPPSIH\RPPSTDVSRPQYSLSPSTS 
LALPGQSCLGR 


13057 


26958 


A 


13196 


2 


438 


LSFTSIEFCHLLFLNQLICNYVIFTKSS 
TISCEALGRLLWYPWTQRFFESFGDLS 
T PDAVMGNPKVKAHG KKVLGAFS DGLAH 
L\PPIiKPTFATVSELHCAKLHVDPENFT 
LLGNVLVCV/MKHNFGK 


13058 


26959 


A 


13197 


57 


499 


SPSWPRSSARPCSGHCLLPGLWDHGKAP 
CLRAARHSQGHG FRKGRKEGCGLWAAGW 
TKNPVPTWRAG / S RRGPGVTS PPPVPS V 
QNPP\PPQRRHGGSQATRPVXiFTPPPKH 
MEG PNAP I LQDQR I TAPP PE I L YRPPRA 
ATCDALRRG 


13059 


26960 


A 


13198 


34 


465 


GILPYSPCVLRAPNSKRVRKCGCTTPRH 
L/CKATRKCGCTTPRAPLNETRKCGCTT 
PRHLSRQLGNAGALPRGHLNETRKCX3CT 
TPRAPLNATRKCGCSTPRHLSRQLGNAG 
ALPRGHPLKETRKCGGATPADAFYETKK 
IGGGL 


13060 


26961 


A 


13200 


177 


1075 


PTSSSMAFPKKKIiQGLVAATITPMTENG 
EINFSVIGQYVDYLVKEQGVKNIFVNGT 
TGEG LiS LS VS ERRQ VAEEWVTKGKDKLD 
QVIIHVGALSLKESQELAQHAAEIGADG 
IAVIAPFFLKPWTKDILINFLKEVAAAA 
PALP F YY YHI PALTG VKIRAE E LLDG I L 
DKIPTFQGLKFSDTDLLDFGQCVDQNRQ 
QQFAFLFGVDEQLLSADVMGATGAVGST 
YNYLGKKTNQI V\ EAFEQKDFSLAIjNYQ 
FCIQRF INFWKLGFG VSQTKAI MTLVS 
GDSNGPTPASTCRKPPGEFT ! 


13061 


26962 


A 


13201 


35 


464 


VQEFKTSLGNVAKPCLYKKKKKKSPLGG 
GVPPPPKKTWVGGTLSPQKIRPAGPIiIF 
PPPLPRGQQGEPLSPIKGGGGGGSS\YP 
PPPFPCLTGRA/PPPSPISAKSSPSPPP 
CQYVYHPPLLPLFFFLHALRRATAPLFL 
FLVRA 


13062 


26963 


A 


13202 


1153 


259 


AGGVLRLGWTGS RMAS DSGNQGTLCTL 
E FAVQMTCQS CVDAVRKSLQGVAG VQDV 
EVHLEDQMVLVHTTIjPSQEVQALL\EGT 
GRQPVLKGMG\TGQFSESGGQPVAI PGG 
GliGTVQGWRFLQLTPERCLI EGT I \ PG 
LE PGLHGIiHVHQYGDLTNNCNS CGNHFN 
PDGAS \HGGPQDSDRH\RGDLG\NVGAN 
AAGGAFFRMEDEQLKVWDVIGRSL\ I ID 
EG\EDDPGAREGQSLYPKITRELPGSRL 
SCGI\IAKSAGLFPEPQSKICSCDGLT\ 
I WEERGRP I AGKG \ RKES AQP PAHL 


13063 


26964 


A 


13203 


281 


420 


VDGSKKYWEMPVI PALWEAKAGGJjIj \ E P 
RKSRTAWATREDSVSTKN 














SDSPQTPRMRVMAPRTLiILiLLSGAIiAIjT 
ETWACSHSMRYFYTAVSRPGRGEPRFIA 
VGYVDDTQFVRFDSDAASPRGEPRAP\W 
VEQEG P E YWDRE TQKYKRQAQTDRVS LR 
NLRGYYNQSEAGSHTLQWMYGCDLGPDG 
RLLRGYDQS AYDGKDY I ALNEHLRS CTA 
ADTAAQ I TQRKWEAARAAEQWRAYLEGT 
CVEWLRRYLENGKETLQRAEHPKTHVTH 
HPVSDHEATLRCWALGFYPAE I TLTWQR 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DGEDQTQDTELVBTRPAGDGTFQKWAAV 
WPSGEEQRYTCHVQHEGLPEPLTLRWE 
PSSQPTIPIVGIVAGLAVLAVLAVLGAV 
MAWMCRRKSSGGKGGSCSQAASSNSAQ 
GSDESLIACKA 


13065 


26966 


A 


13205 


417 


13 


GHPRVMLQCHMGSASIjSIMTPPPIRSYQ 
P ALTRS S ATKPA\ TS APS ITPMT P VTAN 
IS/VWPTANTTTTKLSNPTSS/STLSTP 
IWHPS ILPAPPSTPAPIiTTVTNSATTST 
I CFHVQLFQAHHCHHCIAAALEYTS 


13066 


26967 


A 


13206 


1139 


559 


GLSELS IHPS VARVQEGHGATPERS I PW 
PGGKPRRGGRGTGLREWGGGSHRSIiGQT 
KKKGAKAKRDHCAHGPIAAFQGPFPRAT 
QLLSPCHGCSWKENQPP*GPPGHACTAW 
E KTVTHGS CRPVTKPS PDPG FVGQAPGT 
KGNPAPGMDGCAGPRISCHPVPITKPLS 
NTGICVLIiCYEFGFLIAIVGCWGFK 


13067 


26968 


A 


13207 


306 


332 


ENGEIRIYIYICLYLIKKHWEEAGCGSV 
HL* SQHLKRLKWEDYLRPGVPDQPGQHR 
ETPSQKLNK* INNDF 


13068 


26969 


A 


13208 


48 


233 


GKQNHLKCLWCRLLFLLSAVISRNVYIL 
T*L.SLSIiSQCVCVCVCVCVCVCVCVAIL 
ENYPR 


13069 


26970 


A 


13209 


250 


2 


YCEGDLGSSRPGVSKCREFSPRRDKRIM 
PWPGAVTYAYNPSTLGGRDGWIA*A*EF 
ETSLGNIVKPHLYTHTHTHTHTHTHT 


13070 


26971 


A 


13210 


615 


120 


SVFWPLLGGVSWSGYTEVRGPLEKAVCP 
LSELEYCAGRSAALFRAIKLFKDALSIjL 
KLCPQPPFPLGAPLS P* EALVC WPLHV 
STCLS*WCPTPCPPERSWCVLFPSMYPR 
VCPDGLLPPVPLRGPGVCWPLHVSMCL 
LSLFNSLLRLRTCGVWFSVPVLVC 


13071 


26972 


A 


13211 


239 


429 


FILLRNKLQPGTVACGCNPSTLGSQGKW 
IA'AQGFETSLDNMVKPCFYLPQLKKKK 
KIKNLGV 


13072 


26973 


A 


13212 


954 


503 


VQLLPLCGRDIiYLFAFNLPTYRCWQHFL 
SKPRAP I LAGGSDRWPCPCCPFWLP * WC 
HVSAHAPQPFPCAKPEAPLPVGCPCS PA 
P SLSAALL I IiRAGIiWIiRAPFCFSAHLL 
S CHSQCCQFPCSEKRS CLGGRDRHAASN 
PLSAVWPPGE 


13073 


26974 


A 


13213 


1844 


1147 


LHSQIYSTAKKASLSMKGSRDKTRAASS 
RPVPSVLGVPPWSTLLQHPQNMWPGPAQ 
QQGQPSGRQAWCTPGEAPGAEAAPQ* QP 
HPEEDHSGGPQASAAIiALPPSPPSQRDV 
QGETGMQGRSAPRS AS S SACCACRWSRL 
PCPQLLQRHPGLRI.VS PAHSRPPGPAPS 
SSSGSGLVPGYLPQKGLAGLSAGAVLCP 
PGLLRVAHGAGYGPSAQMLHARLSS SSL 
RGKRFURF 


13074 


26975 


A 


13214 


413 


60 


LCSKSCHGSFAMDCETLLTFPRLFYPIL 
HMPTVPGQALPVHTPFL * LKLPLLPGI P 
SCHSPLFLD*AQESPPPGGLPGHPPGSG 
VLCLWFHSLKHPVLSGSSVSTPGSASPS 
RSRAL 


13075 


26976 


A 


13215 


67 


196 


GLSLSSPKVTHNATLMGCIFQN*KAFVY 
HSMKQKKI I LliFNMA* P * YPLDFGEQWP 
LHGSHAYSTI L* LDLFCKKEEE * DE I PY 
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K=L»ysine, Lr=Leucine, M~Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














V*CFMLLWKSTTM*KKI ILLFNMA 


13076 


26977 


A 


13216 


161 


2 


SVFFLFIDRRLYLIKSVWAAYNSSTLGG 
RGRR I T * AQEFKTS LGNKARPRV YQ 


13077 


26978 


A 


13217 


57 


184 


CLTVCKDKVSLCYPGWSP W* SYLTAAS 
NSWSKAXLPPQPPE 


13078 


26979 


A 


13218 


292 


400 


LKVKQGRGHLDLNQGPLDL* SNSLPLSH 
TPSHRRTV 


13079 


26980 


A 


13219 


186 


394 


YLVLS FL.FYFLFI FLSRVSLP I FFILEI 
*LGIVAYTCNPSTLGSQGGRIA*AQEFD 
TSLGNIVRPPSLI 


13080 


26981 


A 


13220 


315 


448 


KTKDLGVVDHVYNPSTFGG*GGRIT*GQ 
KFETSLDNRARSCLYK 


13081 


26982 


A 


13221 


3 


13 


LR PLRS F VS VERLWASCCLGS E LWDKT 
KRKKRRELSEEHKQEIKDAFELFDTDKD 
E VI DYHELKVAM I ALGFDVKKAD VLN I L 
KDYDR*VIGKISFGHSIEIVTDFQL*R 














LRLVAPMWNGRIRGIHRLGAAVAPEGNQ 
KKKRTILQFLTNYFYDVEAI.RDYniL.QRE 
M YKVHE KNRS YT WLE KQHG P YGAGAF F I 
LKQGGAVKFRDKEWIRPDKYGHFSQEFW 
NFCEVPVEAVDAGDCDINYEGLDNLLRL 
KELQSLSLQRCCHVDDWCLSRLYPLADF 
VAGAFRWAGCPRI FRTGAKHGFHHLQ * D 
LSSGWATCPGTSPT 


13083 


26984 


A 


13223 


1019 


1238 


LTGAEYKKSDGLTESVTANLEPEAPKVF 
SRLDDEAPVTVLPARLPLPAP * HTARPD 
P PQRRRSMQLAPARP FL 


13084 


26985 


A 


13224 


1534 


1181 


RWNSEPDYSAGGPRAGRGARLLPVPKSP 
GGGGAVAPGPQTPLRAERTAVRLSISS * 
NSSTAPADPGAAPPLHPQPAEAGTWVSA 
FLRVIFCFPSLGFGCWEKKIAFVPFPPT 
DEKCL 


13085 


26986 


A 


13225 


328 


22 


CLESPRDRPFQGPAFPKHPPPPLAFLSQ 
PGKVQYFFESNCKS LSSQE I KNSRSVPT 
SPPAPSPPPPPFLYFSLTAEGGEV*KET 
LDSDRAPPSSREAPICQV 


13086 


26987 


A 


13226 


180 


5 


NS I D YKINVKNSNRPGAVAHTCNPSTLG 
SQGRRIT*AKEFETSLGNIVLLIYTIIY 
YI 


13087 


26988 


A 


13227 


237 


528 


VGLCQDPLLLTLTDSFCSVLWGGSHLAF 
HKNLYWHIDLVMYFSRISFYFTLSAAI 
CLSLYLGSHILSPFL*LFYCAYTTLYLC 
LITTLFFLIPKV 


13088 . 


26989 


A 


13228 


140 


366 ; 


NATCLWHPMS P YAPT Y * C* LS PGLLRWV 
LAITP I VLMIL I SS YNHVS I S YGGTFRS 
NFFLDQ 1 1 LTFLI YELHI V 


13089 


26990 


A 


13229 


257 


25 


L PAHS LGRDLSAQ P YSMPRPGGELRAEG 
CVNVFACVLLSTKVCLHPLPE 


13090 


26991 


A 


13230 


561 


52 


WPSCSSSGSPMLLPPVPPGSLG* *SFSP 
LPAAAALPAPCAAPRPAPLRPCGPAPTP 
AAAPAPAPPAAASLSAARAAAGSPPGSR 
PSARGARRRPSGPCAPPEPRLGPRAPSS 
APRTRTPS * GRARSGGSAGNAPSARRTP 
QGPPRAACSLARSWIWPAGRGGSCGRAL 
GA 


13091 


26992 


A 


13231 


335 1 


103 


CQKYSTSYCNKWLATLVIICSRKI *DGL 
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G=Glycine, H=Histidine, I=lso leucine, 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KGQAWWLVP VI PMLWKAEAVGLPEARSL " 
SS PCNMARAPALKKIFLIGQV 


13092 


26993 


A 


13232 


452 


712 


VGLEGED*PPFSPAQSQSLSCWPDCSCP 
PSWPPDQRPGERPQPPDPPESHGAEEEG 
LPERPSDA*ARPQPFSLHPGVFPPQNAS 
TQ 


13093 


. 26994 


A 


13233 


480 


2 


SCFSEDWNPLKFKLQLSTSLSYRKSHTL ~ 
HCAHHFRGQCHSD * AFLVFSLTCILQVG 
SLRFTAGDVSPSYPEKLRVL*EGRVADF 
HVMARKGQQAALAI LLQNARTGYWWLT 
PV I PALiWKAKASGS VKPRS LRQTWQNGE 
TRLLDRGESASQSAGMTGM 


13094 


26995 


A 


13234 


692 


824 


QIMALPFPSPPVPPLVM*CKGGKIPPPP 
SKNPRTCVGNTVI I KA 


13095 


26996 


A 


13235 


657 


872 


PRGPRLDVLYACEPQLITCLELGTPLKV 
RFWLEWDHTYNPSTLGDQGGRTA*GQE 
FKTSVGNS KTLS LKKK 


13096 


26997 


A 


13236 


168 


365 


QESLCILQTCTCAHQNLK* KFGGKSRLG 
AVAHACNPSTLGGQGGRI I *GQEFETSM 
VNMVKPYLY 


13097 


26998 


A 


13237 


31 


338 


WYIVYICVCVCVCI YIYEIYMI YI *YI * 
YIYIKHTVQ*H*HLAFSTFTLLCNCHHC 
PSLELFHYPTLTYHTH* 1 1 TPHCSS C* Y 
QPLFYFHLYVFDYSRYLV 


13098 


26999 


A 


13238 


714 


1001 


SSGVS CLNRAGWI TGVNHHIHS LVLLKR 
EK*TQNI ITEVCLMFLLLPHLAIEATVT 
WPGMVAHACNPSTLGSQGRRSV*AREFE 
FSLDNIARPCL 


13099 


27000 


A 


13239 


250 


406 


NLA* HGGVHM WS S AS QVAG I TGTRCHAQ 
LIFI FLVKMGFHHVGQDGIiKLLTL 


13100 


27001 


A 


13240 


256 


193 


WINFISIiPLSSLT* TLSLCVCACVYVCV 
CVCLRPCVHIVGGNLKRYIVFARSVTLG 
CKFTHLHISLI INNDT 


13101 


27002 


A 


13241 


1 


257 


GMDLWQMCHSLSLPLI FFF FFFS EKTRS 
NFVPQAGFQLRALRDPPA*ASREAGITG 
LTHHVRPGQI FKKIGTLKPT I LLPLLPR 
K 


13102 


27003 


A 


13242 


319 


406 


KRGWT* WLMP VI PTLWEAEAGGS PDVRS 
L 


13103 


27004 


A 


13243 


228 


56 


KGAPPVTiKPGYPKAAKNPTPFKPPGSKV 
GENPPIiF*NPCPGGGNPFPTFFFFFFFF 


13104 


27005 


A 


13244 


1687 


612 


ILGNQCCKFDAYNALANESTMIS IKLNE 
*VREREREKEKEKERVRERKEEREKKKL 
TSSKGTGSTATFHV 


13105 


27006 


A 


13245 


220 


I 


TKDPSSTMPPQPNPLLSFKSQFIFLIFL 
FIKKYVG WMQWLLPVI PAL * EAEAGGS L 
QPRSSRPAWATWRNP I F 


13106 


27007 


A 


13246 


204 


375 


AS WPPGLQYELR I I KCLLHCWTRAMI FR 
ERERRERERERERERERERE * VHLKRKG 
A 


13107 


27008 


A 


13247 


158 


2 


IWVFKETFFFQNFFFFFGKENWFNGFFF 
FFFFFFFFF*DRVSLTLHPSLGYR 


13108 


27009 


A 


13248 


221 


3 


NPS QGFPLLKI F F FFLRWS LALS PRLE * 
NS I * KN*KI *KISWAWWRTPWPGAWEA 
EPGESLEPGRQRLQNE 


13109 


270 10 


A 


13249 


81 


341 GELNDTIHVKYLG*YIKVL*RNRTNRVC 
j VCVCVCVCVCVYACTQKDLF*GTGSCNC 
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E=Glutamic Acid, F=Phenylalanine, 
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N— Asparagine, P=Proline, 
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X=Unknown, *=Stop codon, /^possible 
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GALVRPKSARQASRLKTQKTWIRSLKA 
VC 


13110 


27011 


A 


13250 


299 


1 


ATARLR FLHGAWWYAS WP PTQEAEARG 
FIiD PRS SRME*AM I LPRCTPAS VTDKI P 
SLASQ IHRPSRVACVILLSSHAPTEAPS 
RWPPPSPLCPAGSRA 


13111 


27012 


A 


13251 


106 


146 


LCVLKQVKQATWTKYSSSQI FAILDCMT 
TLSSHPAIiKYCNYVIjISNQSPKHLRIj* C 
SGMISAHCNLCLPG*SNSPASASRVAVC 
FKASQASYLD 


13112 


27013 


A 


13252 


217 


158 


LFE I FC I PFFFFF FNFFFETRSHS VSQA 
GVQWCKQGSLQP* IPINFFWGKEGVIDK 
LILISYGNAQGFKVAVTPVCTTALQPGR 
QSETLSQKKN 


13113 


27014 


A 


13253 


150 


56 


TIL ILLLLYETESQSVAQDGVQWCDLGS'" 
IQSPPRSSCHSPA*AGVNRLRQENGMS F 
EAEIVLSPDRTTALHPGLQIETLSHII I 
I ILILSTI SFHQLLH 


13114 


27015 


A 


13254 


174 


145 


PCLKMTTYDFKSALEIAFFCFTAILRYK 
LLIKVTHFKWLRVAAHDCNPNTLGSGGG 
RITWGHEFKTSLANMA* PTTRSYSQRVG 
IMG I TI PDEVWSRTQPSHIRSGP S KVY P 


13115 


27016 


A 


13255 


179 


273 


GQWLTPVISAIj*EVEVGGLLETRSLKQA " 
WAT 


13116 


27017 


A 


13256 


264 


382 


GLKIGKGCSWWFTPVI PTL* EVKARGLL 
EPRS LKSAWAK 


13117 


27018 


A 


13257 


1199 


1563 


YL WGP AVPPLDLE P GQ AGATCG P VSHY 
KIFAKLWAVGSILDLMPGPE*GSFLSVF 
TSLCFI YGVT YPFHS EQPHPAS PSASGL 
LLLKGRAAWLVSTLCIGSPAPMLTGIFP 
HKVYNQLFI 


13118 


27019 


A 


13258 


194 


242 


ILYVETGSHYVAQAGLEIjLGSGNPPAST 
SEIAGIEA*ADEFIYVGVSWQDECIKSM 
KQVAHAS IPAISEVEAGGLPEPRSSRPA 


13119 


2702O 


A 


13259 


322 


120 


GPTPVLDAFQKLCLSFTSGQQADGI I PA 
FPIRKQGGTQWPKPVI PTLWEAKAGRFL 
* PRSFRPAWAT 


13120 


27021 


A 


13260 


237 


1 


TS KKI S LGWWGKQG I PGTWEG * AKRPFE ' 
PGKPRVQWTQVPALDFSLGGKARLCLKK 
KKKKQKQKTLKLCTHSR I TYSRA 


13121 


27022 


A 


13261 


52 


318 


SAVGIHRCDDGSH* P * TPEHKQLS FLSL 
PS S WDYRG I TREL FQRF P W I FLQL I TAV 
I S SES TVLKNLELAAVRGSHVRVIMMAV 
PINPF 


13122 


27023 


A 


13262 


274 


1 


KKKKNSPVWWWTPVIPGSPG*AGELLEP 
WRQKVQIAQWPLAKWPQTKWQG CPL 
ALQPGEHQGIFVSQKKKNPKTKRKYWAL 
FCSLPSC 


13123 


27024 


A 


13263 


717 


877 


NSTKEMAHWPGWAHTCNPSTFGG*GGW 
IT*AHEFETSLPNMVKPHLYKKYKK 


13124 


27025 


A 


13264 


288 


131 


SLYIWHSKRLITITNNKISGVWWCLPW" " 
S S TWEVE AGGS LE P RR * RPAWATK 


13125 


27026 


A 


13265 


80 


287 


FMNGEAS *KTS ILSQAWWCAPIVPAAQ* 
NEAGELLE PRSE WI>VW ATRALRI S S RGL 
RFRFRLRRFTSTR 


13126 


27027 


A 


13266 


3 


241 


VGLFLFFFETESCSVTQAGVQWCGLS*L 
* P PPPG IRDS PASASQVAGTTGTHHHTW 
LIFLYF* +SSGFHYVGLGRSSNS 
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peptide 
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nucle- 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13127 


27028 


A 


13267 


103 


359 


I CVYVYVC I YVYEMRATS KDFbNQVWFS 
FLRTVAVLFEEYLIVI^MWIiSFHIHIYI 
HTHIHIFIYVYTHVLIYVYIHTYSYMCI 
HIYI 


13128 


27029 


A 


13268 


172 


12 


SARSSQAIQIQKS VTSGQVKPQW* QS * P 
TPFXPTLIALTTLLLPISPFILIIL 


13129 


27030 


A 


13269 


2 


203 


RIENIRSYKNLAAIjFMTAKKMGTIQMPH 
NFEW INKM WY I HTLEH* SDKKKKKKKVN 
RAEIPKLIIQK 


13130 


27031 


A 


13270 


236 


365 


KI YRQGAVAHTCNPNTLGGQDGWI T*GQ 
KFETRLANMVKPHIiY 


13131 


27032 


A 


13271 


371 


15 


QNQINFALILIINTLLALLLI I ITF*LP 
QLNG YI E KS TP YECG FDP I S PAR VPFS I 
KFFLVAITFLLFDLEIALLLPLP*ALQT 
TNLjPLiIVMSSLLLIIILiALSIiAYE*LQK 
GLD*AE 


13132 


27033 


A 


13272 


2 


16 


PRVRTSSRSRAALELIFFFFLGPPNLPV 
YNGPLGRTKPGTGELDTGGSPILCVGQG 
RHPYWKGGAKPLAPFGKGGGP* GTRPLA 
GPI VPPKAGLQSEMPGMAPFCGSFG* PT 
RPD 


13133 


27034 


A 


13273 


329 


290 


SSALVPRLECKGIFSAHCNLCLMGSSNS 
PTS ASRVAG I TGVAGQ ATDKTPQTLS * R 
SQPSHMDPVRVVSP 


13134 


27035 


A 


13274 


175 


13 


APCDHRPCP PENNPL * L* FS I TFPNP I K 
RPHPYLPLLTTjFSDSAHLHPGEIEQR 


13135 


27036 


A 


13275 


355 


861 


PL.TTTPAAPRAPCPPSRLSGQPLTGPTE 
GSRSRLSPNISEQGEPPLALTVGHPLST 
QPGPTVPSELEPIQGPRG*GDCPTPSQS 
A*GGVLSCTPESHTEFKPPPTGGGRRWA 
RLGLNGAT*GREEPLQTRIiPAEYPGPGP 
IDPLQPPPISTASMATAFSDFIililiGRDP 
A 


13136 


27037 


A 


13276 


865 


667 


KFLCILLDFLFFIF'EMRSQHRLECSCA 
I I AHCSLKLLASTDPPTSAS * IAGFTGM 
CHCAQLFLTF 


13137 


27038 ; 


A 


13277 


103 


375 


WSRRLPWRRGLGYI EL.FQGLE IRHHFL F 
GPHYLRRTQCQGPVT PSELDGQGWDYMS 
PGV*DQPGQHDETPSLQKI*KTSSAWWH 
APWPAT 


13138 


27039 


A 


13278 


177 


3 


QQTEGSRAHSNSHRRPGAWriASNPNTL 
GGRGG*NI * SQKFKTSLVNMVKPCLYGR 
V 


13139 


27040 


A 


13279 


379 


372 


SR*WVCMVAHACDPSILGGQGGRIT*AQ 
EFETSLRNMVRPCLCLGNTNIYIYIHTH 
TYIFIQN 


13140 


27041 


A 


13280 


83 


1184 


PPAHAARAS PPSYTWLCYEVKI KRGRSN 
LLWDTGVFRGPVLP KRQSNHRQE VS S WE 
CRKHISKMSGGGLSTVYFFHRRFQITWF 
VSWNPCLPCWKVTKFIiAEHPNVTLTIS 
AARLYYYRDRDWRWVLLRLHKAGARVKI 
MDYEGERCRGQGSMTGRNSIjRDGWI cna 
INYASLHRTLKEILR*GSPSGLIVSLLS 
PPAHPPEDS PGNESWLCFTMEVTKHHS A 
VFRKRGVFRNQVAPKSYIiHPK*El.SSWE 
RRKHNT*HTNYEVTWYTSWSPCPECAGE 
VAE FIiARHS NVNLT I FTARLCYF WDTDY 
QEGLCSLSQEGASVKIMGYKDFVSCWKN 
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Predict- 
ed end 

location 
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last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E— Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H— Histidine, I— Isoleucine 
K=Lysine, L=Leucine, M=Methionine, 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 














FVYS DDE PFKPW KGLQTNFRLLKRRLRE 


13141 


27042 


A 


13282 


1558 


1824 


SFPYLASFPFCLGIiPLETLFHPLGL*KD 
*SDTHTHTHTHTHTHSHTHAHFPSFPDP 
LFQSSPFSSGFIDEYKYPHLWPVMSVTC 
CRFCV 














PHFFLPQGFWGFFS PFPL*KSSSPIiKAIi 
IFLGGFSPFFPPPKKRFFSKIPRGVFFP 
PPKGKKFFFPPPG* IWPPQGFFLKGPPS 

ssssssssssssssssssssss 


13143 


27044 


A 


13284 


349 


159 


CPLE PKKNYP FCFFF* ETGS CYVA* AGL 
VLVASSNPPASASQIAGIQGTSRHAQPC 
RIFLSKT 


13144 


27045 


A 


13285 


748 


869 


WLGWAHACNPSSLGGRGGQI V* PQEFE 
TSRGNMARPQSL 


13145 


27046 


A 


13286 


315 


410 


VQWFTPVNSALWESEAGGLLE IRS *RTT 
WAT 


13146 


27047 


A 








AJR.CGGMHX» YS Q I FGRLiRWEH CXjR* KVIjS 
QVWWHAPVFAD I WEVEVGALLEARSLRL 
* CAMLL P VNSHGP PTWAMQ 














KSYFSSHLHFEGKKCVSSIliD * S ILVLM 
CSWLMNYTHTHTYI YIYTRTHI CVHVYV 
HNF 


13148 


27049 


A 


13289 


379 


434 


KRGPPPPKKRGCFFSEKFF*GPPKTPFF 
FFFPKKKKKKKKKKKKKKKKKKGGRSRS 
RIADAWADAW 


13149 


27050 


A 


13290 


253 


344 


GLF YNLQKI KI F YVKNLFFFFSNS ITEA 
G VQ WPNLGS LQ P P P SGSNDS PAS E * R I V 
V 


13150 












TYNAGTIiGGQGGWIT*GQEFETTLANMV 


13151 


27052 


"a""~ 


13292 


289 


2 


TLPQGEDFNKFVFGSIKQKLQINL*NTD 
FIGNVFQSWAQQYTPWPPSWVAEVEGS 
LAARS S RPLCTI I TPLNSHCSLAWATQQ 
DPAGRVGRPRV 


13152 


27053 


A 








LKNKNVNKEKLRQGWAHACNPS ILGGR 
GGWIT*GEM 


13153 


27054 


A 


13294 


276 


441 


GLFPKFLIQKINQNWPDAWHAYNPSTL 
GVQGRW I T* GQ * FKTS LANRGRKS ENK 


13154 


27055 


A 


13295 


258 


389 


NMVEKRLGQAHAYNIiS TLRGQGERIT * A 
WEFETSLGNWRPCI 














MFKRNIGGEERGGGG 














13156 


27057 


A 


13297 


196 


1 


IiMGKAPFLGGFIPPAFFFFLKKKKGGPG 
AVAYTCNLSTLGGRGRWIT*GQEFETSL 
ANIAKPCSC 




27058 




13298 


182 


436 


GVTILNVkHRHRKKVtIjVRDJjKKVRKCP 
klhgnlrkvfqaeg ias s ktlkghvwwl 
MPVIPAL * EAKMEGLLEARS LRSAWATQ 


13158 


27059 


A 


13299 


334 


440 


RHYWLNF*AFYHSSIAPAPQGGGHWPPP 
G I TPLNSLHVPLLNTSA*LASGV* LT* A 
HHRLVPNNRAL.ilQALRITRILGLYSTL 
L PA* KNFEAP FT I S DG VYGSAFF * ATGC 
HGIiNV 1 1 GSTFLTI CP I RQL I FHCTS KQ 
♦LWLEPPAWDW 


13159 


27060 


A 


13300 


359 


372 


KKKKICGGKKNPPNKKKVKPRGEKTPLK 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine,'L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argtnine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GV*PFFCLEGFFSPHKFFFFFIFSPNIC "" 
SPKKS FFLKFP PALFFFNLPFLKNI PS P 
PFLKKASF*EI FFCGPPPFFFFFQNTLF 
FFPPFFFFFFFFFFFFFFLDKFAVY 


13160 


27061 


A 


13301 


233 


406 


VTVEGMEEVLAGSSEGMGSRPTS* V* FL 
KYIiIiGAVAHAHKPSTLGGRGGRIT*AQE 
LKTSLCNKVRPRLYKINK 


13161 


27062 


A 


13302 


92 


429 


GSHFNSELIEMKFYIIKRNIFIILNCFY 
LIKHQHFHYTKFIiIiVSVINKF*LiATVAH 
AYNIiWTIiGGQGGRSA * THE FKTSLGNT V 
RSCLYKNLKMSWVWWCTAIWEAEVRGLP 


13162 


27063 


A 


13303 


302 


140 


GGAVNSVQSQAWWHMPWPVPWEAETGL 
IiKPRSIiSIi*CPMIVHVNSHCTPAWAT 


13163 


27064 


A 


13304 


115 


312 


IiRSPSFVLiSPRLKCGGTIMVHCSFDLIjG 
♦RNPPALPSKWETIGLHHHALLRLIiI F 
FF*KWWLFF 


13164 


27065 


A 


13305 


90 


246 


PGWAHACNPSTLGGRGGRIT*GQEFKS 
SLCKIAPPPERKEGRKEGGKKKERG 


13165 


27066 


A 


13306 


19 


310 


DSS FS PIMCFGMS VNKTGLWLGKKKKKK 
KKKKGGPP * KKPIiGGPNLPGGGKKKFFP 
♦RGAKKNPPGDFWKKTLFLGGEKLGPPP 
PQK* N P FFGGG KI G PTP P PKI KPFGEKK 
KF 


13166 


27067 


A 


13307 


179 


56 


WVLVYKKQSILG*AQWVWSVISALWEAE 
AGGSLVSTSLRLA 


13167 


27068 


A 


13308 


425 


290 


KKNIYIAPPGYFWPPQRFFLSPPPPPNV 
VIFFFFliFFFFFIjFFFFFFliIQYAEGIG 
VMRGRGEGKMGR*W*KFFGRRVLiNIRSI 
LLANFKHWPYW 


13168 


27069 


A 


13309 


214 


371 


IYF+ * RW I SQAWWLMPVI PKL WEAEAGG 
♦LEPMSLPGQYNETSPli 


13169 


27070 


A 


13310 


975 


572 


AWGMWG VGKHS LVS LGIEE CQASTAL.S 
LDKSGWWEEAARREDVLCEDRRCLLCHV 
PAGVRGSLKPELGSRKGKNGQQSGSKPS 
VPSLGPRPQGPGNPALSKGTRPN*AICF 
LCQTPADHS AKKQAPHTL I P I R 


13170 


27071 


A 


13311 


270 


10 


TPNKSIil *RLQFSNIKLANLGLGTMAHS 
CNLNTLGGQGARITSGEEFKTGLGNIDP 
I PIESKVI KTLAGMVSPDAWVDAIKRTG 
TSG 


13171 


27072 


A 


13312 


103 


290 


LTEWSGVIL*NCIYLLPQYTSTRVHAY 
IHT YIHTHTHTSGLS STSVGSTNHKSKI 
FGGKKG 


13172 


27073 


A 


13313 


136 


1 


SNTSSSLVTQAGGQWCELGSIi* PLPPKF 
KRFSCLSLPSSWDYRRL 


13173 


27074 


A 


13314 


88 


183 


RVSTLLKKSCFFFFFFFFFFFFFFFFFF 
FFFFSSQKKKNLKNKKRE*KKSCFFFFF 
FFFFFFFFFFFFFFFFF 


13174 


27075 


A 


13315 


481 


517 


IiLiSP*RPKGKPKGKKRKPNIiKEKKIGEA 
RGGKERQRKTPQRTGKQKKRRKYRRIGR 
RNYR 


13175 


27076 


A 


13316 


307 


196 


GMHLYliPRTW*WVRVIPGSREIEVHAPL 
EPGSQRLLVAEMTPMHSSLDNMNKSPFP 
QPPKTEKQTKNPPKPHI IA 


13176 


27077 


A 


13317 


2 


209 


PARALDLKGSPYMES CSDAQAG VQDS I Y 
GDHL*LRAPAV* *GQTVFVASPSKVGGI 
TGASHHPGLFFLF 


13177 


27078 


A 


13318 


3 


178 


SFCFSVLREIIGQSLIMKTLP*ITHTHT 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, 1=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HTHTHTHTHTQRE I ERERDRERGESQ VL 
GC 


13178 


27079 


A 


13319 


40 


382 


DSVPSLLLKLLMETGPHHWQASTTTPS 
*PFILEGFHHVAHAGLKLPSSSRPPHLA 
SQSAGIYGCEPLRSARLI INYIGSRLWG 
LRNI CLSQDHEE CLLCF I LQAI S MYRFC 
LG 


13179 


27080 


A 


13320 


107 


1 


PPLIMANSGWAQWLTPWPTLWEAKAGG 
*SEVRSS 


13180 


27081 


A 


13321 


111 


382 


ASSQLLELAIQVFPGIWIPPTTKQKGVP 
SGKTLVTSKIIAGLKMPKDSRSRPRTVA 
HTYNS STLRGQDRR IT * AQEFKTSLGKI 
GRPCLF 


13181 


27082 


A 


13322 


49 


108 


CGLY* FFF* ATRS WHVAQAGIiEFLiGIjSS 
FPASASSVAGTTGMCHHTQLHY* F* V 


13182 


27083 


A 


13323 


1 


257 • 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERARERE * E C V * EGETVS DRERDFERA 
PPRVLGAVAIYIEWCLASLiIQ 


13183 


27084 


A 


13324 


1 


248 


EFQYIIiCSALIFGRSKILK*ATKEVKKS 
KHIPRI SCQDTRKYNWSAKAKRRNTTGT 
GRMRHLKIVYRRFRHGLHEENLFLKH 


13184 


27085 


A 


13326 


375 


1 


PPKRRGKTHCS PPKVFPPPQNN* TTE PP 
QLIICYL*RGGVFFSLPLII*APPAFFF 
FFQFFFFFFFFFFFFFFLQGQYWQFRDT 
DAEGHIiASSRPSGAVFPSQGMHTTGSTR 
RWGQCVPFPPTRP 


13185 


27086 


A 


13327 


382 


15 


NWPKNCPLHFQNWFLRKTPQIFFCLKNF " 
FFFFPKKVFYPPKKPFLEKPWPPVIiN* K 

kppppfffgwgpggpqycpfkkappl.fi 
rgkerfplwgkgdffqipcppgplkkkk 
raaardlgts 


13186 


27087 


A 


13328 


2 


220 


GRVGSSRARAVAIiFFFFFFFFCFFFFFF 
QNPPLKRGEKKKNTPPLKKKRPLRGGFK 
KQKEC*EKKKKLSGPK 


13187 


27088 


A 


13329 


427 


289 


IQLWSSL*K*LHTHTHTHTHTHTNTCFL 
TVMKELS THPGNK 


13188 


27089 


A 


13330 


203 


269 


INFGPPREPGGFNQKPQFPSAPGFEPWN 
PPQGPKP * KKKKRKKPFKVWAPGGGSKV 
QNPGLRETGVFG 


13189 


27090 


A 


13331 


216 


418 


DVPVHYCRLYKPCTLRLHSIYKKQILLW 
PGAVAHACNPTTLGGRGLWIT*GQEFEA 
SLANWKPHIiY 


13190 


27091 


A 


13332 


160 


325 


WERQLFKIAQSGLARWLTPVI PALWEAE 
EGGLFEYTSLR*LWATQQDPISTKMFK 


13191 


27092 


A 


13333 


339 


62 


GMLPLFVPPQKRGSPPYPCYGVYNSPPL 
KKQRFFSSLGIVLPPIVFITPPPPAFFF 














FSCFLHCP 


13192 




A 


13334 






DPSVRINTLLALLLIRITF*LPQLNGYI 
EKSTPYECGFDPISPARVPFSIKFFIiGA 
ITFLLFDLEIALLLPLP*ALQTTNLPLI 
GMASLLLI IILALSLAYE+LQKGLH*AE 


13193 


27094 


A 


13335 


270 


660 


AGSRRPLRVPGFSLSMMSPSLCRPVCVA 
QCIRVSFLIJTVGAVLRPGFQCLDWPCL.V 
YARLSGWTGFPPCRYGKGCCRYDEGCCR 
FGEGCCRCDDRCCRCGEGCCRCDDGCCR 
YDEG* CRCDDGCCH YGE 
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sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leuctne, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unk.nown, *=Stop codon,/=possible 
nucleotide deletion, Impossible 
nucleotide insertion 


13194 


27095 


A 


13336 


298 


349 


KTTPFYGLiRGSKI GTL * CSGVI I AH * S L 
KLLGS SDPLSLVAQAARTTGACHHDQliT 
NAFFFFPLERVPTPLKNNPFLWVKGVKN 
WDLD 


13195 


27096 


A 


13337 


865 


518 


KSATMRMVEDTHKMPAWGGYSGSLQQER 
GELSSLPCP* PES PPPLLI FLKLPSNPL 
PCSPLPCTQGPGQPAPYVSIHTSLKFPN 
AKLS VCLSRAI EFS 


13196 


27097 


A 


13338 


154 


354 


FNI>Q VKPDG Y F VIiF I CH ILPTDSKSG KA 
PGAPCHSRNFHIRKTU3WVQWLTPVILA 
LWET+ FSHQENSGLGAVAHTCNPSTLGN 
QGGWIA 


13197 


27098 


A 


13339 


262 


11 


QFHFSRKLTP * KM * LSAHS SNTHEAYVA 
FPKAHKPGLNSKTAF* LGTE AHACNPN I 
LGGQGGQ I TRGQ AFKTSLANMAKPRLY 


13198 


27099 


A 


13340 


358 


164 


TKKGKPRFFLKNSKI SRGWWWAP VISGP 
WEGEGGAFV*TGRPKIWLTEVGPLPFNL 
GKKKENPF 


13199 


27100 


A 


13341 


147 


2 


FRPQKVS VRVGVLAHACNPT I LGAQGGH 
II *GQEF*ASMANMVKPSSC 


13200 


27101 


A 


13342 


367 


17 


TSRS WNLFHVLVRFPTADKDIRE I GS FT 
KKRSLI DLTIPYAWAS FTLMMEGKKEQV 
TSSINGSRQKVRSQSGELLSLTPSDFSW 
AHWLMPVFPALW*SETGSLFEVRPSRPA 
WPTW 


13201 


27102 


A 


13343 


166 


492 


EGTQETLCGCI ICLVRGDALNIiFHLKCS 
WVGWRGAICCMELRETAKQKLSVWKIjYP 
FEIVFS FSNNKPRPGAVAHI CNPNTLGG 
*GGRIARAQEFETNIjCNIVRPHLFR 


13202 


27103 


A 


13344 


804 


1124 


TFFFCFFIi*DRWnDVAPGWESSDMIMGP 
LQALTSWGSTDPPTLGLQSGLGDLQDTW 
PPYPTSFYYFFLQGRGLTMLPRLVIiNSC 
TQAILPPQPGQRSKTPISLKTN 


13203 


27104 


A 


13345 


280 


448 


GGGGKKKKTPKPPLEKNNFSPPP* FFPP 
KKQ I KP PPPF*GGGGKKKKTPKP PLEKN 
NFSPPPFKPRKGIFFLFPPLSWVKKKGD 
PPGGTRP1APL 


13204 


27105 


A 


13346 


232 


400 


AELLRYCS CGHFL* SMARYKTKPGVRWL 
TPVI PVFWEAEAR* LHEPRSSRPASATQ 


13205 


27106 


A 


13347 


138 


332 


RAWKPS FEI FYFFF * RAHTF* * S CVCL 
CVCVCVCVCV*THTHTHTHTHKHTHDY 


13206 


27107 


A 


13348 


314 


395 


WLiGIVAHAYNPSTliGT*GGWIT*GPEV 


13207 


27108 


A 


13349 


388 


371 


IMNGWQDKEIiVSRVIQTGIKK*KPRNRP 
EFRAPP*WHERNGRKERERMEGRKERER 
KKGKGKERKRKKEACI YKVKIE I SNNLA 
NLI INVER*DPJRRPDILSWVSLSSRVSS 
VFIALVTCLTSPQFFLSLHLNQ I LFPLG 
KNL 


13208 


27109 


A 


13350 


412 


40 


LVFRFWMCLFSRELFGCFSHLLTKSMSQ 
M*WAFGDIMYTFDLLFIKKTKNNCKIiW 
QGCKEEGSI.IHCWRECKLVQPL*RPI *R 
SLKKLQMGLPYDPAISLLETYPKERKSV 
Y*RNICTSGRVG 


13209 


27110 


A 


13351 


372 


148 


FFFFFFFFFFFFFFFFFFATGSCRVA*P 
GVKWLISGTVPLLISTGVLTCS ISDIjGQ 
PVHTSLGNLWPYSQELiPY 


13210 


27111 


A 


13352 


209 


206 


KKKKKNFFSFHGKNPGNGGPFGGPPPPP 
F*TFFKKGGGLPKGPPPKGFFWNPPQNG 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny)alanine, 
G=Glycine, H=Histidtne, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N— Asparagine, P = Proline f 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
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GGGP*PGGEKKTPKPSLGVLFFFFFKFF 
KKRPTAFFFLKKKI FF 


13211 


27112 


A 


13353 


43 


334 


KKKKKKKKKKKKKKKKKRGGGPFIKISG 
GAQFFRGEKKFFFFFLGGGKKTPRGFFE 
KKPFFWGGNFWPPPPQKNFPFGEKKKFL 
GGGGGKNRVFFWGEKIFSLGFFLKKFF* 
KKPEGKNFFPPKKNPVFSPPPP*KFFFF 
PKGEIFLGGGGPKIPPPKKRFFFKKPPG 
GFFSPP*KKKKKFFLPPEKLGPPRNFYK 
RPPPPFFFFFFFFFFFFFFFFF 


13212 


27113 


A 


13354 


264 


496 


kptilrvthflegcekyefllpvafsnf 
SEPLKRYYYWIiGAEAHVCNPSTIjEGRGG 
QIT*GQEFETSLVNMVKTCLY 


13213 


27114 


A 


13355 


183 


9 


IRSPKIPEIRVFFFSRWSYALVAQVGVQ 
WGNLS*MHPPRARFQPIiSSLSVPST*KY 


13214 


27115 . 


A 


13356 


208 


378 


EICLEFKI IDEMDKCGKLWLGTGAHT*N 
PITLGGQGGQTT*GRAFETSLiDIMEKRY 
L 


13215 


27116 


A 


13357 


256 


341 


KGQTQWLTPVIPTLWEA*AGGLLESRSS 


13216 


27117 


A 


13358 


78 


344 


IVGLFESESKKGQHIVSGWIYLRLLIYR 
FLFPLFSCLCKFFFFFFSPETEFCFVPQ 
AGVQ* HNLG * LKP P P PQLKQFS C LTL P S 


13217 


27118 


A 


13359 


259 


291 


AQ*LMPWPALYEAEAGGSSLEPRSLLS 
VWAT 


13218 


27119 


A 


13360 


448 


121 


RLFDLGNKKTSSFVQNSENAKYEDSLQF 
YCRIYIYVYIYVYICIHICVYM*TYMCM 
YI C I YMYT YTHVYTHVYMC I HTYM CI * A 
YICVYIYIF*KACLPLAISTLSSGH 


13219 


27120 


A 


13361 


125 


262 


FSFQAS VEFTSKTVWS WVQWLTP I ISTT 
* EAOAGGSLEARNSRPD 


13220 


27121 


A 


13362 


362 


46 


ARAKGPKKIGFSGKMGPP*GAPPPKMGK 
KI * ITPPPKFLFFFLGKTKIKNPPWGFW 
PFGFPKKKRGGGARR* KTPSPFREKPPP 
QKLKRFKTPFPPLFFFKNPRP 


13221 


27122 


A 


13363 


378 


48 


FKKAAREKVSTRKFRGFVCLFWDFFWRQ 
SHSVTQAGVQ*CDLNLiHPPGSSDSPAAA 
SQVAWTTGTHHHTQLI FIFI FCRNKIS P 
S LLKKYKKNLPGWAGACNPSHLGG 


13222 


27123 


A 


13364 


174 


396 


SLIFILEKRECYYPRSLFTVANFTVAKR 
* QTLNC P S TDKW I NKMCH I HT VE Y YS D I 
KRNE I PMRATCRQTLKA 


13223 


27124 


A 


13365 


76 


280 


PLLPFKAWQWVGCHYW*KLD* INCYFR 
ALLAFSEYMYIYI YIRTHTHT* I YI YI Y 
IYKLTYSHLKVG 


13224 


27125 


A 


13366 


133 


324 


YLL* ILVYCSHL I FSLLCLEG I I LWVFI 
IATLITLNTHSLLINIVPIAILDIiAAGQ 
AAVGLALLVSKKKKKGPPLKKPPLGAQI 
S PANQR KKFPPKRKP I KTRRGTF 


13225 


27126 


A 


13367 


488 


190 


FSYAACFLPQCVFVCTFIYLHADGFLLL 
I I FFENS VFILCHSCWVFCFLLQWFLLM 
NLFSLFLLMLMHPVS*MECSIYSFSSLI 
IFHFSLMLLFFCFMI 


13226 


27127 


A 


13368 


229 


372 


YYATKDFSRPRAVAHTCNP S TLGGRGRQ 
I T*GRQFE TSIiANWKPQLY 


13227 


27128 


A 


13369 


122 


2 


KEEC VGSGTWWVMP VI PAL * EAEGRGSL 
EPRSLRPAWTKK 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


13228 


27129 


A 


13370 


43 


412 


RPTRPKRQtTCGDSENISGCQGLGKKGMN 
GKSTEDFEGSETTLYDAITVGTCHNKFI 
RSH I VYNTKSGP *HKI>WTWGGHDVS V* V 
PQLRKTYHPGERCli* *GRLSMYRGREYI 
GNLC I FCSLLL 


13229 


27130 


A 


13371 


136 


1 


KSPTWPGAMAHTCNLSTLEGQGGWIT * G 
QGFETSLANMVKPSDAW 


13230 


27131 


A 


13372 


228 


385 


GliLIHKLEFKHFWliGTVAHTCNPSTIiGG 
QGRH I T * GQGFETS LANMAE P CLY 


13231 


27132 


A 


13373 


277 


429 


LETTTKAGLPTIIFTSSGQMSIWFFQNK 
PWDSKKLFNLLSISSP**SSKNY*PWQS 
TVAHTCNPSTLGGQGRWIT*GQEFEISI, 
ANMVKPARVGRHVIRGLQVS 


13232 


27133 


A 


13374 


194 


3 


NQENY I Y I YTHTHTYTHT YI Y I L* LLLQ 
WVYLAYFLCLSYPSLFAQGFTHTTYTHT 
HMHTLIL 


13233 


27134 


A 


13375 


377 


228 


DRVLLLL.PRLECRGI IMAHCRLPRLVSN 
S * APALS FQSAE S TGVPNVPS 


13234 


27135 


A 


13376 


255 


465 


NAWKCPFIYKITFVIFLIHVSCKKFRNY 
RQREWKLPTVRPLSPS*AI I FPVTCTYT 
SRWPEATKDPQKK 


13235 


27136 


A 


13377 


336 


38 


VWWCTPWPATQEAEVGGSLESGRLRLQ 
*AVITLVNEHRESALASRRGPEETSSVK 
PPLPT I LxAHTCFSLPRTGQD ITS RFLAQ 
RNTEENL.ELQMEARA 


13236 


27137 


A 


13378 


376 


293 


FFFFFFFFYYFFFFK*KFLTKKKIIjSSQ 
YI 


13237 


27138 


A 


13379 


215 


16 


HLTWS FTTATEGS KTVTQHS VYRKAKLG 
IiGAVTHTCNF S TLGGLGRW I T* GQE YKG 
IPPHGLEDVQ 


13238 


27139 


A 


13380 


184 


64 


VDESLEGWMMDEWMSGWRGGCINRYMHA 
WMDG * GDGW IGG 


13239 


27140 


A 


13381 


233 


417 


LKPITKGRKPRGFFLPFKPKQKKYFWGF 
PQKTP 














13240 


27141 


A 


13382 


34 


360 


RWNTTNADHDLKDNILSPPQINLYIRQN 
SSRLFCRH* QMDSKS PVKI PAGFVLYI * 
IYRYTHTHTHTHTHTDSELYMETHTRMA 
NT IVR KNNS LEHS Y YLMLRMT I KLP 


13241 


27142 


A 


13383 


214 


71 


QTLNTDSG PGWI HACNPSNLGGHGGK I 
I +GQGFETS LANMVKRCLY 


13242 


27143 


A 


13384 


352 


312 


DKQLTLHRTDSLYI F YTY* PPQSCEPI S 
YNNFPCLSQYLYliCPSJjHTHTHTHTHTH 
THTHTHTHTHTHVSVGRRSLFFSGPHTE 
APRSRVSVYT 


13243 


27144 


A 


13385 


393 


271 


I EGQ I QYTS TIGNKFKDFYL I LCKEG I M 
SREISPPSSCHLRQQ*RVRLRERDRERK 
RQRERQRERE*GRSVIiHPHVT 


13244 


27145 


A 


13386 


180 


440 


PVEERTLCEDILCFPFSVLLCIQFHIiLI 
QHACFKYPNPNSRFGS WPGAVAHI CNPS 
TLGGGGRWIT*NQEFEARLSNMVKPRLY 
KNI 


13245 


27146 


A 


13387 


190 


47 


EGEKGVPSTILKMETLLGTVAYPCHPST 
LGGQGGRIAEAQEF*DHLE 


13246 


27147 




13388 


188 


470 


ARPPCKGRDSSAEGPPGPPFPSWSSLGC 
WTRE P PGRGEP I Q VAVRREES AQDWARP 
ELI IKEWWPGLVAHTCNPSTLGGRGGW I 
A*A*EFENSQ ■ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenytalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methioninc, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


13247 


27148 


A 


13389 


454 


367 


NKKKPPPRPP*PPPPPLGKTPFFKKKNP 
NFPRGGGNSPGSPSFGGGKGKDPPPGGF 
FPPPPYPRGEGGNPFFKKKKKKKRASLD 
P * WSSQPC PAYE KELNFAP VUVTVTL P P 
GHSS 


13248 


27149 


A 


13390 


34 


354 


HTLIjAL LL 1 1 1 T F * L PQLNG Y I E KS T P Y 
ECGFDPISPARVPFSIKFFLVAITFLLF 
ELEI ALLLPLP * ALQTTNLPLIVMSSLL 
LI I I LALSLAYE * LQKGLD * TEY 


13249 


27150 


A 


13391 


1089 


792 


THGFGPQVGSPLDLKQPQKAMPRGPSLG 
NPPPGCQQHSHGAGGP*VPPGPRQPVMA 
SRNLCLPLALPRTHPPSPGQTRDKPRTQ 
DRSCWRDPRNGPGV 


13250 


27151 


A 


13392 


299 


428 


WIRCRGRGRSCLPVPHSAPALLSPLWD 
GTRRREAGGGTCQGGSGCMGAHWGLGGG 
EGSGCRSQALPREAAEARREFKHGGIPR 
H*LTESLVSWEFRHA 


13251 


27152 


A 


13393 


382 


413 


HGKTHLYKKFKNKKFKATMPA+WLTPVI 
PTLQEAEAEASLKPKSSRPAWVTWQDPS 
LQKI 


13252 


27153 


A 


13394 


64 


401 


GGIPKEDSQAGAFTGIGERQQGLVIPLL 
KNGQLS TDACVP P WG RVGVERAG P PNQ P 
AGEGGGQGQECSLGLLPEANRCHPCNPS 
TLGGRSGQIA*GQEFEISLGNTVRSCPY 


13253 


27154 


A 


13395 


146 


18 


FSPGWAHACNPSTLGGQGGQIT*GQAF 
KTSLANTVKPKRPQ 


13254 


27155 


A 


13396 


1056 


679 


DIQVPERPL*NVPEPEAKGEPPDRAVGE 
HIDRDCRSDPAQQKRKIFTNKCERAGCR 
QREMMKLTCERCSRNFCIKHRHPLDHDC 
SGEGHPTS RAGLAAI S RAQAVAS T S TVP 
SPSQTMPSCTSPSR 


13255 


27156 


A 


13397 


440 


423 


I HS PPTQRRV*QRERERERERERERERH 
AIiAERNRTREG I SGTTGERGNLMLVGHH 
LPGPLSSRRLGFCPDGFGGQHCLAQGDQ 
VRL 


13256 


27157 


A 


13398 


2 


256 


IETLGSAVEFIPYENTYQTYLKKKNRNS 
HMQSQPNRKGHIRLKYIAAWA*WLTPVI 
PALWEAEAGRSLESRSRNPVSTKKKIQK 


13257 


27158 


A 


13399 


102 


922 


LSFFLFSETGSCSVAQPEVQWCNHSSLQ 
P * TPGGPSMQFQLPQEQGQRTAVWHGGR 
A 


13258 


27159 


A 


13400 


198 


55 


KDYS AWHTCNPS TLGGGGG W I T * GQEL 
KTSQANMVKPHLYKNPKIEF 


13259 


27160 


A 


13401 


19 


447 


ESALNLPSAGIIGVSHRTQPIFCIliNAL 
ALGVL ILERS PLQ S PQVP PSHSHTKPGS 
LPSVTPGRGPGRPRAAEPTAQGRRYNSN 
PVLISPGSVHPASFALPAEPP*TGAPST 
PRPCVIiRPLLLGNAMDLLCTKLSSRIjPP 
YRT 


13260 


27161 


A 


13402 


93 


410 


DSNLNYSLFFHGEADLGTNQVLTHPSTT 
AMYFEHYCQPP*IVHGTINT*PPWHKN 
PIHIITPSPCLRASTAINLLLSHINCYS 
*ATPHPLGYQQTYLPLTVHST 


13261 


27162 


A 


13403 


55 


387 


SNSHTYSLKKSAGITKFQNWHIVNCICI 
CQWVPLEMVNRHTVILCQYPVKPRILY 
QHHTAILVTIL.TFTLRPGWSHACNPST 
LGGQGGRII*AQEFKISiiGNIVRPCIiY 


13262 


27163 


A | 13404 


39 


365 


SGDRRVRLLLKI ITF*LSQLiNGYIEKST 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 
Inst amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan T Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RYECGFDPISPARVPFSIKFFLAAITFL 
LFDLEIALLLPLP*ALQTTNLPLIGMSS 
I»LL>III3jALSIiAYE*LQKGLD*AE 


13263 


27164 


A 


13405 


32 


350 


HE FATS TS EMTKGTS * I VKRR I KTHTLC 
RRCGSKAYHLQKSTCGKCGYPAKRKRKY 
NWSATAKRRNTTGTGRMRHLKIVYRRFR 
HGFREGTTPKPKRAAVAASSSS 


13264 


27165 


A 


13406 


326 


668 


LQFKIFCMPG1FTISEVSSMSN*RTAYG 
SQS FHKI VL I * LATFGQALS C * IL* IHL 
CYLKNKTKHRALVAHACNPNTLGS PRGR 
IA+GQEFETSLGSIVRLLLSTKNLKKKK 
NM 


13265 


27166 


A 


13407 


170 


472 


KLISVWS EESSYCVEHGEPWPQEKEYS 
WLEIGKAEMERSTLESLLGATLEALSSN 
PTR KMQG LKLKLQRAS AFKE E P WLGWA 
HACNPSTLGG*GGRIA 


13266 


27167 


A 


13408 


178 


422 


FYNRYF* YTKTYQTENIYCKIMSFRLLV 
*KI * SLPGAMAHACNPSTLGGRGGRI I * 
GREFETRNS 


13267 


27168 


A 


13409 


274 


444 


LFSQYIVFVHYSSFVLILFV* *LFSQYI 
VFVHYSSFVLILFVILKNYTYIYIHTHT 
HTYTLIFLLIFSQISLGWMKLIL 


13268 


27169 


A 


13410 


205 


23 


KE FLKFHRKC I CQQVKK I W PRA VAKACN 
PSTLGGQGKRIT*GQELETSLANVEKPH 
FSNS 


13269 


27170 


A 


13411 


25 


426 


S VW WNS LETRSS RLRLLKK I * NLQNKKR 
KFEHRHAQRKYDLETQGEDAIYKLRRER 
RKEEE*EKEEKKRRRRKEKEKKKKKPTli 
LCLKKQRNQHLDLRLPASRTVKKLISW 
*ATHSWPCYGSPSKLIQQSIH 


13270 


27171 


A 


13412 


227 


52 


EIGSHSCHPGWSTVAQS*LTLTSNS*TQ 
LILPPQPPRELHYNCVPPCPAFCRQALA 
MF 


13271 


27172 


A 


13413 


345 


67 


SKCCYWAHDCI LLDL I CLYFLELVGCV 
DCFS SSLGHY * PLFV* I FFLS PSLLFWY 
SHYEHICVPHFSKAPFIFLHFFPSPEFP 
FLCIEMVYV 


13272 


27173 


A 


13414 


345 


33 


KVF FFLII CKMRMQNKQ KNKHG I CLLNS 
AMNTCIS FPLRQQI MTTWWLEVTQI WGV 
LFYLFHFIEMGSCPGWSAVA*SQPSATP 
VSWSQVTLLPQPPEDLGNS 


13273 


27174 


A 


13415 


187 


415 


FNQGWAFLFPFCVI FSYPWKDRNRTI FS 
IKITSFQESVLKI>FCIFEVS*ILiKTGLG 
QVRWLTPVIPALRRPRQADH 


13274 


27175 


A 


13416 


143 


50 


CMKLQSSI VMLGWAHACNPSTLGGQGG 
WIT*GQGGWIT 


13275 


27176 


A 


13417 


170 


17 


KQMREKEGNGLLT I S KVWKQPKS PSQS * 
* IKKM* YICTMECYSALRPEFPE 


13276 


27177 


A 


13418 


7 


163 


IK*IWYI*TMEYYSAMKRKKIMSSAAIW 
MKLEAI IL.GEVMQEWKTKNLMFSFISGS 


13277 


27178 


A 


13419 


901 


709 


SGRKTSLGNIARSCLYTKSFKIHCLPW 
PATQEAEAGGSLKPRSLRLQ*ARSHHCS 
PVWVTTRP 


13278 


27179 


A 


13420 


236 


4 


VIRAKTGSHNSLFPPHTFFFQGKNLCFF 
QKKKKKKIYIQSKPGAMAHACNPSTLGG 
Q I T * DQE FETSLATMVKPNLY 


13279 


27180 


A 


13421 


46 


141 


NSVISAHCNLCLPGSSDSPSSASRVAGI 
TGWC* HCNLCLPGS S DS PS S AS RVAG I T 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
nd 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanirie, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GWC 


13280 


27181 


A 


13422 


604 


1121 


L>GSGDL>P*EINPliSSCSL>LREKDPPTTS 
GPQTDQPKKHLTNFKS APYKT I TDAELR 
VTLTVEAHLHPGE INSHVAHTKPVWWSL 
HTDAHEIWCRDSDLKLVPTLPLIPLEAA 
LRNITHSLSIPPPKNFRRPNTSTLFCVI 
FLINIRRQEYQASEPKPSHRI PCDLHVY 
AQMA 


13281 


27182 


A 


13423 


214 


22 


IRESSSTPSSPTIiPESCNSIPFPF*KFI 
WSGWAHAYNLNTLGG*GRKIARAQEFE 
NSTRAGLW 


13282 


27183 


A 


13424 


397 


330 


SEYNSECS * VQWHTLWSATQEAEAEGS 
LEARSSRIiLCALI IPVNCHCTPAWAFPM 


13283 


27184 


A 


13425 


729 


885 


AKC Y P VRN I S L W PG AVAHACN P S TliGG Q 
GGRIT'GQEFRTSLANMVKPHLY 


13284 


27185 


A 


13426 


231 




ASVFFKATLVDLHW* IYIYTYI YTYIHI 
YTHTYTHIYIYIHTYIYLNVNQYICLYV 
HINANQQGYIYTGRTPICFF 


13285 


27186 


A 


13427 


184 


171 


LS * YI Y I YTHTHTHTHTHT Y I LY I YT YE 
QPSSQSS 


13286 


27187 


A 


13428 


372 


166 


CKFRPRKINTTFSHICGS* KVDI IEAA* 
HWLGTVAHACNPSTLGGQGRWTI * VWEF 
KASLANLLTPLLQ 


13287 


27188 


A 


13429 


391 


437 


MYSLGVGFFHSTI * LLKDI FVJSGWAHT 
CNPSTL * GQDGR IT* GRKLETRPGNRAK 
LYLYKKNSN 


13288 


27189 


A 


13430 


39 


254 


EFIPRAQDLETSQGNRVRPRLYRKFKNN 
* VWWCAS WPAMWEAEAGGLS E PGRLRL 
HSNMDNGERSCLKKK 


13289 


27190 


A 


13431 


336 


488 


DIWPGMVAHACNPSTLGGQGGRIA*A*E 
FETSLSNMARPWLKKKLARHSGA 


13290 


27191 


A 


13432 


154 


481 


FFFFFFFQRQKNGFF*KGIFFFFKLEGR 
GAIFV*PNLTLPG*GKSPASPL*KGGKK 
GGAP P P * L I FFFLKKKKL P LLGQGGLKL 
RALGKPPPFPSQRGGINGVKTPFGL 


13291 


27192 


A 


13433 


388 


280 


KSKLKNGCCVALCLLGL.I * SVLLA* FKT 
* LFCVSLDMYIDTTCSLS I S I YLSIYLS 
IYLSIYLSIV*SVS 


13292 


27193 


A 


13434 


187 


48 


S I SSTKGPGAVAHAYNPNTWGGGGGR IT 
*GQEFETSLANMVK?RNS 


13293 


27194 


A 


13435 


71 


309 


DSVSEEEEEEEVELAHPLAAERYHCERA 
ET*VKALLWLELCDERVSSRSHTEEDCT 
EELFD FLHARDHC VDHKLFSNIiK 


13294 


27195 


A 


13436 


18 


375 


RPAVPGRPTRPINTLLGLLLIIITF*LP 
QLSGYIEKSTPYECGFDPISPARVPFSI 
KFCLLAMTFLLFDLE I ALLLPLP * ALQT 
TNLPLIVMASLLLI I I LALSLAYE*LQK 
GLD*AE 


13295 


27196 


A 


13437 


373 


27 


THPYYSHQEYQSP* PLTGALSALLMTSG 
LAM*FHFHSITLLILGLLTNTLTIYQ*W 
RDVTRESTYQGHHTPPVQKGLRYGIILF 
I TSE VFFFAGFF * AFDHSS LAPT PQLGG 
HWPPTGITPLNPLEVPLLNTSVLLASGV 
SIT*AHDP 


13296 


27197 


A 


13438 


575 


699 


IjGTVAHTCNSSSLGGRGGWIT*GQEFET 
SIiANMVKTCPQKK 


13297 


27198 


A 


13439 


247 


168 


LENL I YTRVLERHREAKVH FPFSNISYS 
DKRTDTFVLTKTH*HTHTHTHTHTHTHT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartrc Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Clycine, H=Histidine, I=Iso leucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Aspnragine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HTRGSHFFY*HTHTHTHTHTHTHTHTRG 


13298 


27199 


A 


13440 


391 


168 


TKKKKHPSPRGAREVFF*QPPPPPFCVL 
*FFIVFFLSQGGGGGGGEFPPFFFPPPT 
SPPSPPPRGVWKKTPPFS 


13299 


27200 


A 


13441 


403 


246 


LFPTAIIGGPPGFPPPPF*NPAPEFFFW 
GPKKKKKLS PPPAI KMVFFKG PPP 


13300 


27201 


A 


13442 


305 


414 


KNNFHPSILGG*GGWIT*GQALETTSLG 
NMVKPHIY 


13301 


27202 


A 


13443 


269 


409 


PGTVAHAYNLSTLEG*GTMITQGQEF* I 
S LVNTVI PHLYQKKKKKGR 


13302 


27203 


A 


13444 


295 


37 


DPSYIiAWHMVTPHQVSTQHTLPRPRPCL 
PEMNFLVSL.KIWQCWPGTVAHACNPSTL 
RGRGGW I T *GQE FETS IiANMTKTRTRGS 
AR 


13303 


27204 


A 


13445 


256 


388 


FCGFWPGAEAHACCPCTLGGRGGWIT*G 
QKFRTSLAHMVKPRVC 


13304 


27205 


A 


13446 


277 


374 


SRLTATSSRGGWIT*GQEFVTTLANMVK 
PLLY 


13305 


27206 


A 


13447 


14 


387 


PQVIKSPWPPKVLGL.QA*TTSQAWWLTP 
VI PALWEAKANGSLEVRSS 


13306 


27207 


A 


13448 


185 


379 


HFGRPRPADHLRPGATNHPGPQGETPSL 
L * NTKKKKRGRPFKKKKLQSLVWQNKI F 
FLiAELKLLW 


13307 


27208 


A 


13449 


168 


346 


ISILRTNNMISIKINLRLFIDELKKRDP 
FIFPYFFFFFFFFFFGGNHSIj*SIjIVIF 
LIC 


13308 


27209 


A 


13450 


57 


176 


VAESRPGPGGTATELVPPSTRLLTRAPR 
DLLTGKRKPPPLGRMSRARVQWHNHF*T 
ES CS VAQ ARVQQRNLGS L * TLP PKI KQ F 
SCLSLLSSWEDCLSQRVRDCSEK 


13309 


27210 


A 


13451 


411 


241 


PPLFFFFFFKQKFPSVP*GGGQGWDFGS 
LQPPPPRVKQIFCPKIPPPWPPKKGGVP 
G 


13310 


27211 


A 


13452 


307 


298 


PTHSIVRNNKSL* IN*TSSMCSGLKVMR 
R * SQ KQED P * SGWAHACNLS TLGGRGG 
WIT*GQEFSTFLCPSA 


13311 


27212 


A 


13454 


191 


108 


GWSQTPGLGRSSLLSLPQWWDYRMQCRG 
VILAPHCGLDLLGSGDPPASASQVAGTA 
AFENCATSFWLP*FLNTQCFKIYKTKN 


13312 


27213 


A 


13455 


242 


400 


IATLLDYLQIPNTGPGAVAHACNPSTLG 
GQGRWI I *GQEFENSLANTVKSCL 


13313 


27214 


A 


13456 


1 


346 


YVTTARCSCWAGTESHMGAKDFYRQEKH 
PFHADSGIGWA*DRLRQRYGRFGRRM*F 
SWARGAGEKHSLA*AQWLMPWTTFWEV 
KEGRSLEPRS LRPAWAT WRNVS TK I KKK 
KKG 


13314 


27215 


A 


13457 


166 


289 


VRSSKLNTWVRKLVPI ILTLWEA*AGGF 
bEPRI srlvwatq 


13315 


27216 


A 


13458 


220 


76 


TASLKLSKSWLDWAHACNPSTLGGPGR 
WIT*GQEIKTSLANMVKPSP 


13316 


27217 


A 


13459 


219 


46 


SPPLKKKNNFFPPGVMGAPPRFFLKGPP 
QNFFFFFFFFFFFFFFFFFLKQ*EQTFL 
LIY 


13317 


27218 




13460 


126 


1 


WSLGAVARSCSPSTLGVQGG* IT*SQE 
FETSLANMVRTLTS 


13318 | 27219 


A | 13461 


330 | 420 


N I CS WAQ WLTPVTPAL * EAEAGGS P E VT 
SS 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13319 


27220 


A 


13462 


577 


766 


AQWHTS IVLAA* E AAAAGLLE PRS * S RL 
KCTMIIPINSHCTPAWQHMRPCLLKKNF 
SNSFISK 


13320 


27221 


A 


13463 


206 


387 


VTVGETEAEAGRGHGLLTCCVLKTQKWL 
GTVAHAYNPSTLGG+GGRIT*DQEFETS 
FLGS 


13321 


27222 


A 


13464 


167 


41 


DFWLGAVTHACNPNSVGG*GGQIT*GQE 
FETSLANMVKPWNS 


13322 


27223 


A 


13465 


480 


79 


SRVISPLLI liNS VI SAKS L.LPVGSQGKS 
KGQVWSTQEERLEPGVMGEFTPGPCPCL 
VSQGPSLCPS PPPYS FRWSRLSQAPGS I 
RVLI *RPAAATSAPPPAQLGLPDTAFPP 
PLLSRDGQWAAGRQGDRDKGLP 


13323 


27224 


A 


13466 


3 


373 


DAWVAHASAHASVCLI I DTLLALLLI I I 
TF*LPQLNGYIEESTPYECGLEPISPAR 
VPIiYIKFFLVAITFLLLDLERALLLPLP 
*ALLTTNLPLIVNSSLLLIIILALSLGY 
E * LRKGLD * AE 


13324 


27225 


A 


13467 


214 


55 


TRQKIFNIiFSS*KKFLIPPARAKDFLFF 
IFFFFFFFFFFFFFFFSCSRILLHR 


13325 


27226 


A 


13468 


111 


358 


VMKVFYI * IGELVTQAYIFVKSHQIAHI 
LFHSMYVL.P* IKWFWPGTVAYTYNPSTL 
GGHSRRIS *AQEFETSLGNIVSPHLY 


13326 


27227 


A 


13469 


199 


607 


RIRGTSQKLLLLKKLISLIPKSQPTRGT 
P * TTFPP PNTTNFP P P PVAATAPD P S PA 
HFVSSPYNPDLGSP*PECPSPGRLQREI 
EQCKKDIQNFPFPTTSRICSNDLSLKGS 
ASRRRGHLFCERPVNQFRSPKPKK 


13327 


27228 


A 


13470 


128 


256 


EDWMWWLTPVIPTPQEVEVRGSLEPRSS 
ELQ*AVIVPCTPSE 


13328 


27229 


A 


13471 


94 


388 


KGEGKTEE LWS WRSHRGTHLCLLAD I SA 
LPIiHYYYYYYYFWQRWSLPDQPGQHGET 
PCIiPKIQI INRAYWHVPWPTDQEADAR 
* LPEPNRQRSQRPE 


13329 


27230 


A 


13472 


359 


73 


ATE PGQL F YF * ES E SHNVTQAGVQWLDH 
GSLQPPTPGLKPSFPLIHTGITAASHYT 
WLNQLLPMNVSLLTFYESPQLKIQRSHI 
LCNSIKIKFLE 


13330 


27231 


A 


13473 


332 


10 


ILS KRG FPKPGRYCEVNP FLCVQGAK* V 
RP*EDTSLHCE I YHTHTHTHTHTHTHTN 
LKP+TSKELIPQKTESLFFSLCIKKSLL 
RFQTFPDAGSTGNSVGCACFISK 


13331 


27232 


A 


13474 


355 


385 


I MVC I KQS C VH* KNN * FWPGT VARVCNP 
RTLGGQDRQTA*AQELKTSLDNMAKPCL 
YKKFF 


13332 


27233 


A 


13475 


297 


425 


RKSAWWMPWSPL'EAEAGGFLEPGSL 
RSAWATWGDPHLYKK 


13333 


27234 


A 


13476 


160 


1 


KSTGRLGMVAHICIPSTLGSQGGWIA*A 
QEFETSLGKILSLPEIQKSASLITW 


13334 


27235 


A 


13477 


334 


79 


GMKEQINFLiYQKRGIKPNKXiQAENWLI 
HYN * KLLSNS FK * LQDTAS I * YFFGNYL 
NRPGMVAHACNPSTLGGRGGWIT*GQEF 
L 


13335 


27236 


A 


13478 


234 


385 


YAKFHVS INEKKPDVAHACNPGTLGGQG 
KWIT*GQEFKTSLANMTKPRIY 


13336 


27237 


A 


13479 


408 


129 


KRSINRKKGGELSSSHFLLLLLPPLDEE 
PSPPPFCPPLPPFPPPCPRLPQSG*GAC 
GRLHPC*MDIjGPCKCPARKVFSHLSCSIj 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VLQVCRSP 


13337 


27238 


A 


13480 


251 


384 


TI TP WRGAVAHVCN PSTLGGQS RQMT * G 
QEPKTSLSNMVKPRLY 


13338 


27239 


A 


13481 


164 


300 


AGDDSDNSMGLLGRIHEIKFFKCLGPCM 
TTKF I VPALWEAE VGGWL.E PTSLRLR * A 
LIALliQ CTPA * GT VYD YKVYRPSTLGS R 
GRRMA 


13339 


27240 


A 


13482 


2 


30 


TGCHQFRFUMNPFSPHPVIiQLVSPGPDC 
LIFRYFKF*LHQLM* INPNANFCGARSF 
LCKFRVWIS*LRYPSLVALSVLFVFK*S 
VQ I * VKKKKKKKKKKKKKKKKKKK* K 


13340 


27241 


A 


13483 


39 


199 


LDVGTCSPIVPASQEAEAGRLHEPTSLR 
LW*AMIVQLYYSLGDRARPCL,YLKK 


13341 


27242 


A 


13484 


18 


72 


TRTRSYPGSTGCQLHTFGLVSLYNCVSQ 
FLITNLYISIYRDIY*YIYIYLYREIYL 
YIYIEIYTYIERYVYIHI* ILGLQVANC 
IPLD 


13342 


27243 


A 


13485 


266 


496 


LDSASDITQAGVQWCNHSFL* PQTPGLK 
* AS TS AS * VAE I TGLSQHVWVKI IMSVS 
LSSYTFIQLPSILLiYGNTAP 


13343 


27244 


A 


13486 


286 


72 


HI LSHCTVLWFLTPHHGTTHYKNTTGQ P 
S IVAHTHNXiSTLGG * GRWTT * GQEFKTS 
LANTVKCSDAWVDRD 


13344 


27245 


A 


13487 


309 


6 


RFFFFFPPGKKGVFFQRFFFGFPRGFPP 
PRFFKTGPEIFFFGPIiKKKKNFPPPGGK 
IVFF*RGAPPFFFFFFFFLFFFFFFFIiK 
KNYLLLEKVQSPEKEG 


13345 


27246 


A 


13488 


183 


459 


RRVKKICKCIIWQNNYL*KGINICSMPQ 
KKKKKKKKKKKKKGAPFKKKKKKKIS *G 
GGGPFF*KTKIKPPGGRGFFFFFGEKIM 
LFPAVFI 


13346 


27247 


A 


13489 


348 


1 


CEGKDGLPIQFASWFKYAGFHSLESIFQ 
SFCQTKKLEAPKELSPCSQLYRYNWQLT 
CRRMKLDPHLLPYTKINSRPGAAAHA*N 
PSTLGG*GGWII*GQEFETSLTNMGETP 
PTRP 


13347 


27248 


A 


13490 


313 


320 


KQGGPPLAQARVQWYNLSSGETSLPLPG 
SSNPPPSATGVAGTTGSPPLA* LIFFFL 
* DRENGDQRKL 


13348 


27249 


A 


13491 


187 


350 


GNPVKK* IKNTTSSSGWVWWLMPIIPAL 
WDAEVGGLLEPRSLRPAWATERDSVS 


13349 


27250 


A 


13492 


211 


83 


EYAKKNGKLLSVTRKYL* SVCVCVCVTTC 
VCVCVCVTTCIKNI 


13350 


27251 


A 


13493 


286 


407 


LGVEAYACNLNTLGG* GGRI T* GQEFET 
GLGNTARPCLYF 


13351 


27252 


A 


13494 


897 


1033 


KFIFKQMQWDDHSSLQP*TPELKQSSCL 
RLPSTWDYRREPLHLAM 


13352 


27253 


A 


13495 


265 


3 


HLILLEEDLTWQRGKWLKGRVSLCSLGS 
* TQS LTVLKKGRGliGWAVAP S TLGGRG 
RWVALAQELETSLGNMVKLSLSKIQKLA 
GMV 


13353 


27254 


A 


13496 


319 


1 


CFKLWDTCAQCAGLLHRYTCVMWCYTH 
HPEFCYNIMGLPKYMWSYIiWPKCCHAVY 
H FRY I KNliLPG CVAHACNP STLGGRGGW 
IT*GKEFKRRMKTKV*IGIYFI 


13354 


27255 


A 


13497 


265 


11 


VLGEVSNQHIiLGAPSNRTPSEVPLAQAA 
QGRPSLVPPSAIVFPSYFTICYAFLSIS 
MPIIP*VWEAEVGELLEPRSLRQA*ATW 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I— Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 




13355 


27256 


A 


13498 


164 


22 


G I KR KL I K S G WAHARN S S S LAGRG G W L 
T*GQEFETSLAKMMEIPSV 




13356 


27257 


A 


13499 


277 


363 


TYLCYI FN I CLFVGE I EKQR * LI * FHRS 
TLNGH I LYVR P VS PNPSQ AVAS PVTCTY 
MPTWPEWEESQKK 




13357 


27258 


A 


13500 


1 


270 


YTCSVIjLiLVTVNIjIjLCIiI YKbNFI IGMLi 
RKKTVYIYIERIQYYLKFQASTGCLGVY 
PTV*PYMYTYTYMYMYMYI YI YIYIYIL 
CLCLDG 




13358 


27259 


A 


13501 


170 


383 


ILYSSKLTYPSPQIVIFFCGKST*GWVW 
WLM PVI PAI * EAE VGGSLESTSLGQAWA 
TGRKAFFVFYKEYKN 




13359 


27260 


A 


13502 


31 


402 


GDYLYLREGWQEPARREQHAQNLEPNRT 
FTPHLi I CLTNYRLFSLAWS ECEERRNET 
TDKNQTVKDLALDMKRVLPFS *GHQKVI 
KVG WA YACKPS TLGGQGQW I TRG EEFK 
TSLINMAKPYLY 




13360 


27261 


A 


13503 


232 


446 


RNRPGAVAHACNPSTFRGRGGRIMRLGD 
*RS 


; - ; ' 


13361 


27262 


A 


13504 


196 


390 


VNKAEKKHLYFVARKEILRPGWVHTCN 
PSTLGGRGGWII*GQEFETSLiPLPKTQK 
LAECGRGCL 


v 


13362 


27263 


A 


13505 


146 


7 


LKYVLYWLGAMAYTCNTS TLGGQGGW I S 
♦GQEFETSIiANMVKPCLF 




13363 


27264 


A 


13506 


177 


18 


SQNFGRPRQQDHFRLDGRHSETPSLQEK 
VKKLTRHGGACVQSQLIiERLRREDHLGP 
GGGGCSVP*SVNI I 




13364 


27265 


A 


13507 


117 


338 


NKI LKKKKGGGRFKES KFTS PGLQGNS F 
FMGPPKLNSRAGV*QRREGKNLGVPQLK 
PFEANPLFARGPNTKNP 




13365 


27266 


A 


13508 


306 


443 


LARYEPAVRTRAC* AQWLTAVI PVL* EA 
EAGGPLEARNLRPAWAT 




13366 


27267 


A 


13509 


146 


390 


KMFKGHEQAAHRKKKKRGGRFKGSKFTS 
ACLQRNIFFLGPPKLNSRAGV*QRGDWK 
NPGVTQFNRFEENPLFARGPNTQKP 




13367 


27268 


A 


13510 


119 


356 


NEDRNLRGGCPGR*LLRTEGVCSNPAGW 
SGIRWCEESGGLFWRWRGTESVLFSGL 
FVCCVFAQEKAKATGRAEVSL.CP 




13368 


27269 


A 


13511 


221 


21 


EDLQRDKPLGS C YS TCGWAEQ W YLQH PG 
GAGSKCRIWLGAVAHAYNPSTLGGQGRW 
IT*GQEFLSL 




13369 


27270 


A 


13512 


163 


418 


TFPDDQCLMLQDHACVKRS IQSA* YLTP 
AVLAH WE AE AGGLL EPTS LRPAWAT 




13370 


27271 


A 


13513 


239 


21 


FGNLGGPGGRTAC I QFS LGN I VRPYF YK 
TKPKPKPNQNKTKIS*AWWCMPIVPATR 
KAEVGESLEPRRSRLQ 




13371 


27272 


A 


13514 


135 


11 


KFFFWPGVLAHAYNPSSLGAQGGRIT*G 
QEYETSLANMVKS 




13372 


27273 


A 


13515 


162 


57 


EGTLRSRRPPLGGWVT*GQEFETSPANM 
VKRCLY 




13373 


27274 


A 


13516 


424 


54 


PKRGFFPTPFIWPPVFPLPPFFKPPPR 
IFFLGPPKKKFFSPPPGLKIFFF*KGPP 
PFFFFFFFFLFFFFKKKKKRLGGFFFFF 
FWARPFSFLSLFFFKETIPLYYNIWMHK 
DSCKAVIHHYH 




13374 


27275 


A 


13517 


310 


349 


GGGPLKKKLFFSRGGERFFFF*GAPPFF 
FFFFL*SSKIFIFFLiKSFFFFFFFFFFC 
FFFFFSQ I LFF I SS PCFVFFFLNYTSRT | 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: Of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Giycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CS YVCAL I LFKGNT YS RKARH I LW 


13375 


27276 


A 


13518 


273 


393 


I KCLS YRKKATTLGGQGR W I T * SQE FE T 
SLSNM* KPQLY 


13376 


27277 


A 


13519 


450 


139 


I PP I All VS P FAKHT WQQQMGL VRRS S KG 
NNMSKGWEIIHPSRIFCSNHTLLRIALW 
LGAWHTCNPSTLVGRGGWIT*GWEFKP 
SKTSMERFCI LSRHKRTRGS 


13377 


27278 


A 


13520 


47 


427 


KMKQLFEMLVFQNWT I FLS HS I FRTI R * 
LTLE PLMFR YYFKQ * RTLLP P KF * DRVW 
LCHAGGCL*CSGTVSAHCNLRLLGSSDP 
STSAS-VAGTVGAGHHIWLILNNRELPK 
QKFFINSSRNLFFT 


13378 


27279 


A 


13521 


199 


98 


SLPSVKLSICCEFPEDIVSKDPIPWLP 
WKRE S CS LCAL PNLLPK * E KTN I QVP * 
VFQSTI KAKAQLGMVAHACNPSTLGGRG 
GWIT*GLR 


13379 


27280 


A 


13522 


314 


18 


QTLNFYILKIYNIISLVHC+ IFEARFQI 
LV*V* *NYPF*SLICFTFSWHETQNT* * 
KNLI I WAGAVAHTCNCNTLGDQGRRIAL 
AQEIEPSLGNIAIQ 


13380 


27281 


A 


13523 


18 


231 


CPPAVFGTSIEQIQLKYDIKD*KIGQAW 
WCAP VI PGTQAEVGGS LEIRSSRL* CAM 
STSMNRHCVPAWAT 


13381 


27282 


A 


13524 


239 


465 


RVQVS ECLLHREKCLYFQLGLS * LI PQT 
GWLKQQKLPGTVAHTCNPGTLRGRGKRI 
T*GQEFETSLANIVKPRLY 


13382 


27283 


A 


13525 


388 


468 


SLRLGTVAHTCNPSTLGG*GGWITRSG 


13383 


27284 


A 


13526 


342 


369 


CTLNTLLLYSISDFGVSAFIATGGDITR 
NKVRKT* LRLGTVAHTYNPNTLGGQGRR 
LT*AQEFKNSLSNIGRPHLYKTKKTKKT 


13384 


27285 


A 


13527 


241 


231 


EYVCIWSHSSLYFSSSLYIHM*VCVCVY 
I YVHTNS H I YTHM Y I HTHTHTH I C I Y KL 
EEKYKEli 


13385 


27286 


A 


13528 


241 


231 


EYVCIWSHSSLYFSSSLYIHM*VCVCVY 
I YVHTNSHI YT HMY I HTHTHTH ICIYKL 
EEKYKEL 


13386 


27287 


A 


13529 


34 


299 


SAPASASPWGIGVLYRPIACPEDRFSPG 
PEANCGEIETTELRVTLSPRLECSGTIT 
APCSLDLQGSRDPPPSAS*VSETTGASH 
HAQL 


13387 


27288 


A 


13530 


158 


45 


SGHRSRARWL* PVI SALWEASAGGS PE I 
KSLRNRRPG 


13388 


27289 


A 


13531 


342 


1 


CFFLKGAPPFFFFYFFNFFFFFFLVGTD 
KLI L K VT W KWKG P K I D KA VP K KNNQ ARG 
LP* PDI KI FYKVWLGAVANAYNPSNLGG 
* GGWIA* E * EFETSLANMTKPQ YKKYKK 
LA 


13389 


27290 


A 


13532 


589 


742 


RIMKMLRIKICGTGPGMVAHAVNPSSLG 
DGWIT * DQEFKAS LANMI KPHLY 


13390 


27291 


A 


13533 


74 


446 


HVGIPSRLTSVSHPRHCKHPRTESEHVY 
CAIEARHYQCKVSRPSDAAGSEGRPRDM 
TQDTEVADTPPPNI S S I TATRTMAT I G V 
GWAHAYNPNTLGGQGRRIA*AQEFKTS 
LGNLAKPCLYKH 


13391 


27292 


A 


13534 


1941 


1040 


AFHLLPYLSSGFYCSLGPCFGLGSSHMA 
AWHKE PLHNACSDS RRSAP I RQGLGS PS 
ATHDTHTRTLAHICKE I FKERLHEI KER 
EIDS* RG* NTQR * VE REEAENKYKPTE I 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 

nucleotide insertion 














NTEIENHKQQEMLRHAGEERENSPREAE 
RKCRNEKCE I KSRNSRKEARLRKKVKFK 
TNEKGKG*QIITVHKEKREVGRERRGRQ 
NKKRTKSTQHVTQKGDMR*SVKEEPHLR 
RAKRNSNKMKNERYVLRQDTDHSCEQKE 
KMRQRHTRKKLARETWKWVRHTLNREQR 
AKPRKKETKEIHIHSAILKC 


13392 


27293 


A 


13535 


1192 


1476 


E I KRKWG P GAS CHACN P I TGR P RRVDH L 
RSGVQGQPGQRGETPSLLKNIQISWAWW 
CAPVI * E I KRKWG PGAS CHACN P I TGR P 
RRVDHLRSGVQGQPGQRGETPSLLKNIQ 
ISWAWWCAPVIPATWEAEAGGSLETGSG 
GCTEPRSHHCTQAWVTE 


13393 


27294 


A 


13536 






RAE S CS liAEDGVQ * GNLSS L.Q P P P P K 


13394 


27295 


A 


13537 


7 


325 


S PVE FRLGEPTKGTSS FGKRRNKTHTL C 
RRCGSKAYHI J QKSTCGK*GYPAKRKRKY 
NWSAKAKRRNTTGTGRMRHLKIVYRTFK 
HGFREGTTPKPKRAAVAASSSS 


13395 


27296 


A 


13538 


259 


274 


KFEAAVCYHCTTLSL* *CHTVCFLiSFFL 
L>E IG YHY I TQAQSG AM I AHCS LiKLLGS S 
NPPVPAF*VAETRSACYNAWLVFVKFLM 
GL*ELWKDMS 


13396 


27297 


A 


13539 






QASVLPQKQLMSWQKLSESTFVKFWNLV 
KTGRLGEEISCCLWREYYSISDCLATI 
KLPASHLSMRKPRHKDFKSTFLISTKNQ 
GNHGHGTKGNEVNNFQQRISV* KTYFKK 
YNMRPGAVAHTCKPSTLGG*DWWIT*GK 
NCQNQLS 


13397 


27298 


A 








YRRLISKKFFKHRWKSVSFFKFSFFWLG 
EVAHACNPSTLGGQGGKIA*AQEFLTSL 
GNIVGPCLYKK 


13398 


27299 


A 


13541 


308 


437 


FSWAFEIGSQAEVQWHDLHSL* PL.PPRF 
KRFSHLSLPSSWDYR 


13399 


27300 


A 


13542 


7 


244 


AKTAPLFF* FETVSCFVAQARGQWHDPG 

GATGMGHHAQKI * FLVETGSPHVSQGGL 
ALiLK 


13400 


27301 


A 


13543 


50 


175 


ALPAIHTAVGQCSVEGFCVLiSDPEGDGA 
ARMKLVRFLMKLSHETVTIELKNGTQVH 
GTITGVDVSMNTHLKAVKMTLKNREPVQ 
LETLSILGNNIRYFILPDSLPLDTLLAD 
AE PNV KS MNR EAVAG * R * G * ARE I FDE I 
ES 


13401 


27302 


A 


13544 


47 


362 


VAFQGLQVPATNFANFFFFWKGGFFFVP 
QGGRQGEKIGLRDP WPPGVRKI PGPTLS 
GTREKGAPPPPPI YF* FFWKKGGQKGGP 
GGF*TWDPKGPPSPTPPKGGD 


13402 




A 








M KKKKG P P P F F * KKRVG KKKKKG ARAGG 
PPPKPPPFGGPRGGGPRGQNS KTPRPKG 
GNPPFKKKKKKRNI FPAGPMGLKGGGHL 


13403 


27304 


A 


13546 


214 


32 


NYIKIS SVDGGAKI K * WT * NLKCL VLS V 
KNITRWVDCLSSGVQDQPGQHGETPSLL 
KTRKTSWA* WRLGIjIQRWRKMPWPSR 


13404 


27305 


A 


13547 


350 


159 


QTGTLSQKTKQKKQAQWLMTVILGL*EA 
KAGGLLEPKSLRPAWATWQDPIYKKKFF 
LERKTWR 


13405 


27306 


A 13548 


367 


149 j 


FFFFFFFF*TGSHYVAQNWTQIPGLNLP 
ASASQVTATTGMHHHTRHTPFPSLTSHT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 
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od 


SEQ ID 
NO: in 
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09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalnnine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QSVSTASWTYLQNSGRH 


13406 


27307 


A 


13549 


321 


2 


GLVKVTHHGYDRNLGYPDSQASVLSSFY 
TA+MVQ*LQLPDKVQIYPAHPLFINDMP 
P * WLLK I PS FG PGTVAHP YNPNTLGGRG 
RRTTRDREFETSLANWKPHLH 


13407 


27308 


A 


13550 


216 


13 


GYSGKETVFSMRKGQOSGPCLQGPS I PM 
GKAG INQIIP KTTLGGRGG WI A* GQEFK 
TSLANTVKPCLY 


13408 


27309 


A 


13551 


199 


391 


KDFS I FI ACFKRPFITGSQGQEQWL I FP 
HAYNPSILGGRGRQIT*GQEFLTPARLG 
NMVKPCLY 


13409 


27310 


A 


13552 


70 


239 


ATCVKN * RTSWAWWCVPWPATHEAAAG 
GLLE PRS SRLQCS VI AS VERHRTSVWTT 


13410 


27311 


A 


13553 


105 


257 


GQAQWLMPVI PALWEAEAGGSFAPRSLR 
LA*ATVPGPNNFLKRAFHSLLNLFLP 


13411 


27312 


A 


13554 


499 


165 


Y*MESHFVTQGGLQWCSLGSLQAPPPEV 
GGWLELGRQRLPMKPKIAPIWTPSPG 


13412 


27313 


A 


13555 


415 


719 


YTDKSIRSISLLPVKGSTHSMCPVKFIQ 
LQEAGGIHMAKLSGQRVDREWRLGTVAH 
TCNRSHLEGKGGQIT*AQVFKTSLGNMA 
KSCLCKKYQKLSWIWQH 


13413 


27314 


A 


13556 


224 


523 


DKVSLCHPGWECGWVQSQLTTASISQG* 
TILFPQSPE 


13414 


27315 


A 


13558 


355 


1 


QVSLQLPYCVLFQFHIYNKQAALLQRCY 
VSATNLLINAIWQYVNSLKMCEENQNVS 
LQNMPF*HRNYFGEAGHGGSCi.*SQLLG 
RLRH*NCLNLGGGGCSELRLRHCTPAWA 
TKASPS 


13415 


27316 


A 


13559 


73 


259 


KYVPHKGKISERRLiCSVAQAGVQWCDHG 
SRSLEVLiGSNDLLASAS * VAGITGCWGS 
SDPHTT 


13416 


27317 


A 


13560 


205 


21 


WKISEDFLLGDAKKWAYQEDIKRLRLG 
RGAVSHTCNPSTWRGRGGWIT*DQEFET 
SLPKC 


13417 


27318 


A 


13561 


254 


481 


LMALLPGSSDVDLSEYGWEHFNFVILIS 
KEIFILLFF*DGVSHLLPRLECSGMISA 
YCHLCLQGSS DRRD FLKKQ 


13418 


27319 


A 


13562 


174 


369 


TFFSPSMFVEPGPYYIAQAGVRWLFTGA 
I IVHCSLKLLALSDSPTSAS *.VTATTGM 
PPHLAFTVK 


13419 


27320 


A 


13563 


371 


109 


LFSKAGRCILQNLLCCMVSGKCWPMNGI 
DVTREYPTFIAHRVXjTYAATSGSDFIjGQ 
AQWLMPVI PAI * EANAGESLEPKSLRIA 
*ATVPGPRSLSQTLiLHRSEL>CGIj 


13420 


27321 


A 


13564 


206 


486 


RQLAAVHILVTPLPDHVTWANYVTSLSL 
LS S SENGVNNDNTHF I GLLLRDYK* GW V 
*WLTPVIPALWEAKVGRIARGQELENSL 
GSKARPRFV 


13421 


27322 


A 


13565 


212 


398 


SRVRGCFPFNLPKSSCI*ECGQGAVAHA 
CNPSTLGGRGGQIA*AQEFKTSQGNVAK 
PHLYKI 


13422 


27323 


A 


13566 


26 


395 


YSPVHTDKCSGVRKLGLFFFFFFEKEFH 
FFS PAWKARAKIGLNGTPLSQGKGNFRP 
KPPKKRGKRGRAPQPGKILNFKKKRGFS 
MGAKVHPNFGPKGNPPP* PPKGAGKKGG 
TPRPGPFFFFF 


13423 


27324 


A 


13567 


400 . 


195 


ATRVSLAPTKNKINWGWWPTPVIPPPQE 
GEAGEFL*PQKWRVRLNKIVPLPSRPRD 



1359 



MSDOC1D: <WO 016-W35A2_I_5 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KTRLCFKKKKKN 


13424 


27325 


A 


13568 


316 


154 


SQPFPGLRWEDRLSRGVQNQPKQ*LMPV " 

IPAFSGAEVGGSLEPRSSKPA*ATNNLK 

SKVIIGGQICDKASKVSIVESSWWGWGC 


13425 


27326 


A 


13569 


297 


475 


FDYFFIY*ETKSHSAAQAGVQ*CKHSSL 
QP*PPGLKQSLLPWPPKSAGLTGVSYGT 
WPI 


13426 


27327 


A 


13570 


116 


332 


NRVLLLPKGEYRGTISAHCSLKLLGSKR 
PPTSAA * GDDETTGVCHKTPV * FLKTFF 
IEIRPGYVPRVGHWRF 


13427 


27328 


A 


13571 


.233 


405 


TVPELRASCRLKKKKKKKKGEKNLKKIK 
GFPGPGPRV*GPEAKNPGGGGPGIPKGP 
G 


13428 


27329 


A 


13572 


408 


3 


TFCDYFKTPPPGKFFFINGPPSFFLNRG 
PSFFFFPGFPPPFLKNFFFFPLWFKGG 
GYKRGVFPPKKKGPPQPLFFFFFFFLI * 
KKTRPGTVAHPCNPSTLGGQGRWIT*GQ 
EFLSSLAKMAGRWRPGNCESV 


13429 


27330 


A 


13573 


273 


406 


I KE FKAERGGHS *NTSTFGGQGR WI A * S 
QEFETSLANMVKPHLY 


13430 


27331 


A 


13574 


385 


38 


KKNS P PP PGKNF F F F * GP P P S P FFFFFF - 

FFFFFFFLGGKKTFFFPPPSRFFFFLKP 

FPPKNQGPKNFFPPKKKIGD*ARPPPLT 

GGAKGAPLFKKKKKKKKLKVRPGAVAHT 

CNS 


13431 


27332 


A 


13575 


230 


53 


HWtCVLKENKQI FTKIS FKMNAI KDSVGQ 
AQWLMPVIPAL*EPDMGGPVEPKSLRPA 
WAT 


13432 


27333 


A 


13576 


406 


33 


VNS I VWVS S PFRVS TFIS FFELKSCS VT 
QAGV*WHDLSLLQLQTPGLKRSPAPSRG 
LSFYLSS IFYPHSSQTELFVGTLNAASF 
TNSRLLYLSLLLCGKYHPIjVPSLPLSTI 
WYFSLRVSHHFP 


13433 


27334 


A 


13577 


426 


294 


GGFPSPPVFKPPPQFFFFGPLKKKISPP 
PPGEKKIFF*KAPPP 


13434 


27335 


A 


13578 


268 


3 


LQAKGPAMGEAGARCSSEVWGWPRKGFD 
L* VSR* PS FDSHVR IGRVQRLMP WPVL 
WETEADISFEPCSSRPAWASWRKTYiqk 
PQRI 


13435 


27336 


A 


13579 


395 


49 


EKKKFFVCFPGFPGNPPNPSPQFF* KGL 
ISPSGSLRTRRGFFP*RVFFFFLKKQFP 
LS PRVECNGI I PDP * PPFGE VTSTPQ VA 
GAIGPPPPPRVNFYFFVKKGFLNVGPMF 
FFF 


13436 


27337 


A 


13580 


68 


466 


GASPAQGSTLHIjVILPRYFFKIPTVRTE 
SFSGSLVTSPPPLHFLPLNKERGGPGIP 
LPCPQIjLQVLGGTSLPPVPSLPDTSQDK 
WPLHGVPPGHVCS*PLAGDGAWPPSPHW 
IPLNPGTSKSLQPAPPWNSA 


13437 


27338 


A 


13581 


67 


249 


ATAPGLLCS YKI FSCQLQWCVPVTQiiAW 
ETEAGGSLEARSSRL* *TMITPVNRHCI 
LAWAT 


13438 


27339 


A 


13582 


359 


3 


KQDSQWVAAAPASVAVSTCWRGLPLPWS 
GGQNRGKGS L PLQ VHRGA * G TENKNQGG 
TPRPGGGPGPSAPRGSSLGAHRKLPVHH 
TTLSSSRSALPPTPRHPAPSSPPCTEEF 
HRTRPI 


13439 


27340 


A 


13583 


174 | 1 


FFFFFVETESHSVAQARVLEYSGAIPAQ 
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WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vatine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CNLDNLSSSELPASAS*AS * VAAXTGT* 
KR 


13440 


27341 


A 


13584 


192 


206 


LFFLEVGSCYVAQAGFEFPGSGHLPLSA 
S * WGS TGVPHHTQLV* S FS S LYNLERN 
C 


13441 


27342 


A 


13585 


244 


28 


KKPENQEINYIFYHLFNVHEQIRISLGP 
GAVAYNCNPSTLRD * GGRI T * AQEFKTN 
LGNMARPRTRGSTRTF 


13442 


27343 


A 


13586 


312 


701 


RGPEET*PRTRKSSPRPTRSCESVSKLA 
RPPRRCSAAPAQVPRLSLRSPKPDPPRV 
SSAEKCAPLPLLPECSETGALPRNSFLC 
• QNASSPLLSLGLPPSPTVQALQPRAL.HQ 
HIjGSTNKEDAHVAPAKKK 


13443 


27344 


A 


13587 


410 


292 


GYMIKQQTVYHECRM*ANSHIiHL<PPEGIj 
LRAVTLWRRAP 


13444 


27345 


A 


13588 


428 


36 


TPPPFFFFWAQKKKKIFPPPGGKIFFFL 
KGPPPLFFFFFFFFFFFFFFLKKFFGLK 
GPFF F VQD * QGLENP KEGS S LG WE KLGR 
ALLPQQRSFVPGLGPTRLFWGLVFWFFG 
FFFLPFGLCFFFALIFNC 


13445 


27346 


A 


13589 


3 


364 


TAPDDQGRP YQAG E P AAH VRADS THGLG 
A* RRVAGHL PPR VG A P HHPG AS S REG AP 
P LHP P PNL * HRRPNGS CRFLPG PAAP P Q 
GAEGGRGRDRRGTHSVAQTGGPGGFGSG 
VTPSWRSS 


13446 


27347 


A 


13590 


778 


910 


DRIRRSGAVAHTCNPSTLGGQGERTA*G 
QE FKTS LGDMVR FCLY 


13447 


27348 


A 


13591 


230 


3 


KLVNILVEKSLVAMKSTVSEGKMVIRSP 
ASLGMVAHTHSPSTLGGQGRQIA*AQEF 
ETILGNMAEPCLYKKHSVY 


13448 


27349 


A 


13592 


223 


340 


LTPIIPEPWESEWGSLE*LTPGAVTHA 
YNPRTLGIRSGRIT*GHEFQTTLSHTMK 
TRLYHELL 


13449 


27350 


A 


13593 


35 


220 


DRASLS PRLECSGMI VAHCS INLSGSSD 
PLTPAS * GDGTTDTCHHARLI FTGADFC 
IDASS 


13450 


27351 


A 


13594 


87 


369 


NVQKTE CE I SGKMQ I WQKGS VKTKCSNG 
LFNF P I FS KKI AS CKVKKLTRE * KYKSQ 
ARWCTPVIPATEAEVRGQLEPRSLRPAW 
ATYQPHFKS 


13451 


27352 


A 


13595 


261 


489 


SWVNERMWG**KHETSQAQWLMPIIPA 
LCEAEVGGSLEARSWETSLCNIVRLHLS 
KKKRGKGRKRGGNQIAPSRE 


13452 


27353 


A 


13596 


238 


2 


NDCLWWLFRI PANVSTFGLLHMS LKVNT 
PGNNRFKSELGTRCLIHTCNPS ILGGQG 
GWI T * GQE FVTSMQKG S AVKNQ 


13453 


27354 


A 


13597 


36 


687 


RDVHRSTYQAGSKQDWGPGEAERLSSSR 
RGAYS CPVPI TFAEGKTRMGRDKMRL IL 
GLSWGPSLTLLLPITNAPRRPGM*EPAL 
PGNSTS*SGAVRFPGGGRK*EDSAESWS 
CFCSHPSPSSGPNPPSPPSAVCMLPGLS 
GLQR*PKQLS PP PALS L PVS S LLVRLS P 
WPPTSSHLLPQALPQSLHPQGSHRAVND 
SFSM*GMVLGSRRNRGCMEA 


13454 


27355 


A 


13598 


367 


468 


KKQRRGREP WLTPAI PALWKAKAGGS P * 
VQSSRP 


13455 


27356 


A 


13599 


183 


380 


LTSMLAVDNSSVEKTCPQAMRISRPGAV 
AHTCNPS ILGGQGGLIA* VQEYETNLGS 
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MSDOCID: <WO 0164835A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenytalanine, 
G=Glycine, H~Histidine, I=Isoleucine 
K=Lysine, L=Leucine, M=Methionine 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valinc, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














MAKTHLYKN 


13456 


27357 


A 


13600 


161 


657 


liIQGCWCSLFFSTRVFLAAIGMKSQWCR 
P VAMDLGVYQLRHF SIS FLS S LLGTENA 
SVRLDNSSSGASWAIDNKIEQAMDLVK 
SHLMYAV+EEVEVLKEQIKELIEKNSQL 
EQENNLLKTLAS PEQLAQFQAQLQTGS P 
PATTQPQGTTQPPAQPASQGSGPTA 


13457 


2735S 


A 


13602 


121 


423 


MIKVNSFGLYLYSQVIYIYIMKYYTAIY 
LKDFFFKDGVLTRHPRWNTVTTQCSLEI 
LGSTDPPASAS*AAGTTNTSHHGQLYSY 
FKKRPTYAEMEQYKI KK 


13458 


27359 


A 


13603 


1889 


1717 


VPRIRPLSWTPPSS I *RPKPPSSTFSHP 
GKPSMGPPGLNRCPVQRALLPGWYQHCG 
R 


13459 


27360 


A 


13604 


81 


267 


ETCCIKNNTLK*I*SQMLWCAPIVPATW 
QAEVGGSLD PRS S S L * CAM I TS VNNHCT 
PAWAR 


13460 


27361 


A 


13605 


267 


418 


THASGMVTHTCNPGTLRVSAGRIT*GQE 
FKTSLGNIARPHLYKIIINKYI 


13461 


27362 


A 


13606 


428 


24 


CPESNPRKREPTLKSEPILTLPITGLIL 
DDGNSPTAAAGSVFAETLQPQCSSLCC* 
VKSSPWPTASLPQPFGSAPQTLPVRYAL 
I PGPSLVSTSLGPRLAPGLPGSPS PPS P 
QVTLDSERYPSPTPCIAPALED 


13462 


27363 


A 


13607 


267 


442 


RPPPTLKVPWLGVEVHACNPRTLGGQGG 
S I S * AQ KLENSLSNI LRPHLYQ KQQKQQ 
KK 


13463 


27364 


A 


13608 


3 


280 


YRLSVICEDPMREREQTCFPPPPSFPPN 

SGGGWRGCCSPAGAPGSPLCGSSQGGPV 
SPWGQKRS 


13464 


27365 


A 


13609 


291 


248 


EKRAHGSQPMKGRERNSMCKSSEFRWA 
WSLiAGVKGDGGR*GSGAGGSPKXjG*AV* 
LDPEGHAGQAVFCGEAGARTQIGGKG*R 
RDQAPLPLSPNLCSGSCLPTKHSLARMP 
LRVKSYS SAQLGRTTCTRPSPPTITFDS 
S 


13465 


27366 


A 


13610 


267 


3 


HTDLLYNTPTHPISPRCDPKHTAIPDKQ 
SLLFFFFEMESCSVARAGVRWHDbRSLR 
PLPPRF * KFFCCS I PSLYCGRSRGSSQT 
YTS 



.0164835A2_L> 



WO 01/64835 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, ' s, =Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13470 


27371 


A 


13615 


197 


428 


KTMVGLKTSASFHQDFIQIPLNHNLSVP 
*TMLGSQPRFLFP?GTVSHICNPSTLGG 
RGGW I T * GQE FETS LATMVKP 


13471 


27372 


A 


13616 


445 


550 


IS I TRGRARWLTP VIL VLWEAEAGGS P * 
GRSSRLP 


13472 


27373 


A 


13617 


3 


435 


TMVLSPADKTNVKAAWRKVGAHAGEYGA 
EALERMFLSFPTTKTYFPHFDLSHGSAQ 
VKGHG KKEADALTNAVAHVDDMPNALS V 
LSDLHAHKLRVDPVNFKLLSHCLLVTLD 
RHLPAE FTPAVHAS LDKFLAS VSTVLTS 
KYR 


13473 


27374 


A 


13618 


719 


918 


CEGRREKWKIGRERKGRGRRKGEQMGRE 
GKGREDGGRKRPCTSRPRSSSRDRSNII 
ISEHASAVEA 


13474 


27375 


A 


13619 


847 


938 


WLMPAVPALLEAEVGISLEAQSCRPPWA 
TQ 


13475 


27376 


A 


13620 


349 


409 


NLWSWPGAVAHVCNPSTLGG 


13476 


27377 


B 


13621 


25 


189 


MVLS PAD E TNVKAAWG KVG AHAGEYG AE 
ALERMFLCFPTTMTYFPDFDL IHGSAQ * 


13477 


27378 


A 


13622 


142 


356 


RIVENEKINAEKSSKQKVDLQSLPTRAY 
LDQTWP I LLQGLAVLAKERP PNPIEFL 
ASYLLKNKAQFEDRN 


13478 


27379 


A 


13623 


18 


1353 


AGAAQCE WS AGEAGARTMS EADGLRQR 
RPLRPQWTDDDGQAPEAKDGSSFSGRV 
FRVTFLMLAVSLTVPLLGAMMLLESPID 
PQPLRQISGIALFCSFKEPPLLLGVLHP 
NTKLRQAERLFENQLVGPES I AHIGDVM 
FTGTADGRWKLENGEIETIARFGSGPC 
KTRDDEPVCGRPLG IRAGPNGTLFVADA 
YKGLFEVNPWKREVKLLLSSETPIEGKN 
MSFVNDIiTVTQDGRKIYFTDSSSKWQRR 
DYLLLVMEGTDDGRLLEYDTVTREVKVL 
LDQLR FPNGVQL S PAEDFVLVAETTMAR 
IRRVYVSGLMKGGADLFVENMPGFPDNI 
RPS SSGGYWVGMS TI RPNPGFSMLDFLS 
ERPWIKRMIFKLFSQETVNKFVPRYSLV 
LELS DSG AFRRS LHDPDGLVATYIS E VH 
EHDGHLYLGSFRSPFLCRLSLQAV 


13479 


27380 


A 


13624 


18 


1353 


AGAAQCE WSAGEAGARTMS EADGLRQR 
RPLRPQWTDDDGQAPEAKDGSSFSGRV 
FRVTFLMLAVSLTVPLLGAMMLLESPID 
PQPLRQISGIALFCSFKEPPLLLGVLHP 
NTKLRQAERLFENQLVGPES I AHIGDVM 
FTGTADGRWKLENGEIETIARFGSGPC 
KTRDDEPVCGRPLGIRAGPNGTLFVADA 
YKGLFEVNPWKREVKLLLSSETPIEGKN 
MSFVNDLTVTQDGRKIYFTDSSSKWQRR 
DYLLLVMEGTDDGRLLEYDTVTREVKVL 
LDQLR FPNGVQLS PAEDFVLVAETTMAR 
IRRVYVSGLMKGGADLFVENMPGFPDNI 
RPSSSGGYWVGMSTIRPNPGFSMLDFLS 
ERPWI KRMI FKLFSQETVMKFVPRYSLV 
LELSDSGAFRRSLHDPDGLVATYISEVH 
EHDGHLYLGSFRS P FLCRLS LQAV 


13480 


27381 


A 


13625 


1 


384 


QSFRGTGRKRERERKRMSLSDWHLAAKL 
ADQPLTPKSILRLPETELGEYSLGGYSI 
SFLKQLIAGKLQESVPDPELIDLIYCGR 
KLLDDQTLDFYG I Q PGSTVHVLRKSW PE 
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ISDOCID: <WO 0164835A2_L> 



WO 01/64835 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
13SSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nude- 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence <A=AI:inine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso)eucine, 
K=Lysine, L=Leucine, M=Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PDQKPEPVDKEAAMRD 1 


13481 


27382 


A 


13626 


980 


1089 


KRIRIQI.TGGLYPVPPPHPPPQSPPIFP 
RPTSPTRT 


13482 


27383 


A 


13627 


116 


587 


VCGELRADSWPVPSQPEQASGPQKQAFI 
WPEAPSRSARLPITYTDYWDSRLQTQES 
QMLGSMARKKPRNTSRLPLALNPLKSKD 
VLAVLAERNEAIVPVGAWVEPASPGSSE 
I PAYTS AYL I E EE LKEQLRKKQE ALKH F 
QKQVKYRVNQQI TLRKQ 


13483 


27384 


A 


13628 


150 


484 


VAFPQASRGVRKTEVGEQQGQGTACGGC 
GHQCPPPTWHIQRAHCVPSTCGAGGWHL 
QGRASCRRPPSQSPQVYQTQVGRQDPHF 
GVGRDSRGELAWSSFIHPYLLSVCNPL 


13484 


27385 


A 


13629 


330 


397 


ARCEWLTPVI PALWEAEAGGSH 


13485 


27386 


A 


13630 


10 


137 


SFTGAVILIIAHGL.TSSLLFCLANSNYE 
RTHNLFFLNDFFFS 


13486 


27387 


A 


13631 


380 


98 


FQWLVSFTATLWFEERHRKNKIEREERR 
AKGERGDRKEERREEGGEERGRRGGERS 
DKREPKKKSKEESNHPKWKFSFICSFC 
FLTPFFPVFF 


13487 


27388 


A 


13632 


56 


827 


PLFEAFTACGFVHDCGLLIHPEETCGLQ 
PISSDYIEAILQSELKRCPSGDMKGQWI 
VPCLS CSDNRTCD WRE I TWQPHNCQYGV 
LTKPQLQQCLGGRKILFIGDSTNRGIMY 
YLIEPJuNETLQEWQKVHGTKFYHNVNGG 
KTLISYSYYPQFWISPSLRPTFENALEH 
LLQRSRPLENTGQTVLWGGVQWLNSNH 
LQIIHKVLKSPFTTLNQPVTKSCLQAIY 
FPRLSPTLHSNCLDLVYSFTKSFNIYFV 
VQFLN 


13488 


27389 


A 


13634 


3 


2718 


SGPCRTTVAPLLRAAPVEHCVAALRPTD 
STMLKKFDKKDEESGGGSNPFQHLEKSA 
VLQEARVFNETP INPRKCAH I L.TKILYL 
I NQGEHLGTTE ATE AFFAMTKL FQSND P 
TLRRMCYLTIKEMSCIAEDVI IVTSSLT 
KDMTGKEDNYRGPAVRALCQI TDSTMLQ 
AIERYMKQAIVDKVPSVSSSAL.VSSLHL 
LKCSFDWKRWVNEAQEAASSDNIMVQY 
HALGLI/YHVRKNDRLAVNKMI S KVTRHG 
LKSP FAYCMM I RVAS KQLEE EDGSRDSP 
LFDFI ES CLRNKHEMVVYEAASAI VNL P 
GCSAKELAPAVSVLQLFCSS PKAALR YA 
AVRTLNKVAMKHPSAVTACNLDLENLVT 
DSNRSIATLAITTLLKTGSESSIDRLMK 
QISSFMSEISDEFKWWQAISALOQKY 
PRKHAVLMNFL FTMLREEGG FE YKRA I V 
DCIISIIEENSESKETGIiSHLCEFIEDC 
EFTVIATRILHLLGQEGPKTTNPSFCYIR 
FI YNR WLEHEE VRAGAVS ALAKFGAQN 
EEML PS I L VLLKRCVMDDDNEVRDRAT F 
YLNVLEQKQKALNAG YILNGLTVS IPGL 
ERALQQYTLEPSEKPFDLKSVPLATAPM 
AEQRTESTPITAVKQPEKVAATRQEIFQ 
EQLAAVPEFRGLGPLFKSSPEPVALTES 
ETEYVIRCTKHTFTNHMVFQFDCTNTLN 
DQTIiENVTVQMEPTEAYEVLCYVPARSL 
PYNQPGTCYTLVALPKEDPTAVACTFSC 
MMKFTVKDCDPTTGETDDEGYEDEYVLE 
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NO: in 
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09/5 15,1 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DLEVTVADHIQKVMKLNFEAAWDEVGDE 
FEKEETFTLSTI KTLEEAVGNI VKFLGM 
HPCERSDKVPDNKNTHTLLLAGVFRGGH 
DILVRSRLLLLDTVTMQVTARSLEELPV 
DIIIASVG 


13489 


27390 


A 


13635 


118 


629 


LWALQLHPEPELPSRRGTGAAGVWTGVA 
MIRFILI QNRAGKTRIoAKWYMQ FDDDEK 
QKL I EEVHAWTVRDAKHTNFVEFRNFK 
I I YRRYAGLYFCI CVDVNDNNLAYLEAI 
HNFVEVXiNEYFHNVCEIjDIjVFNFYKVYT 
WDEMFLAGE I RETS QTKVLKQLLMLQ S 
LE 


13490 


27391 


A 


13636 


57 


221 


LHHCTPPWAEVEETLKRLQSQKGVQGI I 
WNTEGGWEPLGHCGDRSRPPAQGCP 


13491 


27392 


A 


13639 


344 


544 


LSGGHAGAUSSLWVHLYCLLSSQQLLGN 
VLVTVLAIHFGKEFTPEVQASWQKMVTG 
VCSALCFRYH 


13492 


27393 


A 


13640 


319 


623 


DMEEASEGGGNDRVRNLQSEVEGVKNIM 
TQNVERILARGENliEHLiRNKTEDLEATS 
EHFKTTSQKVARKFWWKNVKMIVLICVI 
VFI I ILFIVLFATGAFS 


13493 


27394 


A 


13641 


2099 


769 


TRLAGRVSVASRPCRGPAVGGLLVERSK 
ARRPLLESRVAMAAVPELLQQQEEDRSK 
LRSVSVDLNVDPSLQIDIPDALSERDKV 
KFTVHTKTTLPTFQSPEFSVTRQHEDFV 
WLHDTL I ETTDYAGL I I PPAPTKPDFDG 
PREKMQKLGEGEGSMTKEEFAKMKQELE 
AEYLAVFKKTVSSHEVFLQRLSSHPVLS 
KDRNFHVFIjEYDQDL.SVRRKNTKEMFGG 
FFKSWKSADEVLFTGVKEVDDFFEQEK 
NFLINYYNRIKDSCVKADKMTRSHKNVA 
DDYIHTAACLHS LALEEPTVI KKYL.LKV 
AELFEKLRKVEGRVSSDEDLKLTELLRY 
YMLN I EAAKDLLYRRTKAL IDYENSNKA 
LDKARLKS KDVKLAEAHQQECCQKFEQL 
SESAKEELINFKRKRVAAFRKNLIEMSE 
LE IKHARNNVSLLQS CIDLFICNN 


13494 


27395 


A 


13642 


210 


772 


SVKMVRYSLDPENPTKSCKSRGSNLRVH 
FKNTRETAQAIKGMHIRKATKYLKDVTL 
QKQCVP FRR YNGG VGRCAQAKQWGWTQG 
RWPKKSAEFLLHMLKNAESNAELKGLDV 
DSLVIEHIQVNKAPKMRRRTYRAHGRIK 
PYMSSPCHIEMILTEKEQIVPKPEEEVA 
QKKXISQKKLKKQKLMARE 


13495 


27396 


A 


13643 


168 


2172 


SPLCEVSVPSFCFRVFCKKHKKHKSDKH 
LYEEYVEKPLKLVLKVGGNEVTELSTGS 
SGHDSSLFEDKNDHDKHKDRKRKKRKKG 
EKQI PGEEKGRKRRRVKEDKKKRDRDRV 
ENEAEKDLQCHAPVRLDLPPEKPLTSSL 
AKQEEVEQTPLQEALNQLMRQLQRKDPS 
AFFSFPVTDFIAPGYSMIIKHPMDFSTM 
KEKIKNNDYQSIEELKDNFKLMCTNAMI 
YNKPETIYYKAAKKLLHSGMKILSQERI 

QSLKQS idfmadlqktrkqecdgtdtsqs 

GEDGGCWQREREDSGDAEAHAFKSPSKE 
NKKKDKDML.EDKFKSNNLEREQEQLDRI 
VKESGGKLTRRIjVNSQCEFERRKPDGTT 
TLGLLHPVDPIVGEPGYCPVRLGMTTGR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 
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NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
nucle- 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D^Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isr>leucine, 
K=Lysine, L=Leucihe, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutnmine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LQSGVNTLQGFKEDKRNKVTPVLYLNYG 
PYSSYAPHYDSTFANISKDDSDLIYSTY 
GEDSDLiPSDFSIHEFLATCQDYPYVMAD 
SLLDVLTKGGHSRTDQEMEMSLPEDEGH 
TRTLDTAKEMEQITEVEPPGRLDSSTQD 
RLIALKAVTNFGVPVEVFDSEEAEIFQK 
KLDETTRLLRELQEAQNERLSTRPPPNM 
I CLLG P S YREMHLAEQVTNWLKELAQQV 
TPGDI VSTYGVRKAMG I S I PS PVMENNF 
VDLTEDTEEPKKTDVAECGPGGS 


13496 


27397 


A 


13644 


1032 


207 


P AD VT P KPAT E AVQ SEH S DAS P MS I N E V 
ILSASGACKLIDSLHSYCFSSRQNKSQV 
CCLREQVEKKNGELKSLRQRVSRSDSQV 
RKLQEKLDELRRVSVPYPSSLLSPSREP 
PKMNPWEPLSWMLGTWLSDPPGAGTYP 
TLQP FQYLEEVH I SHVGQPMLNFS FNS F 
HPDTRKPMHRECGFIRLKPDTNKVAFVS 
AQNTGWEVEEGEVNGQELCIASHS IAR 
IS FAKE PHVEQ I TR KFRLNS EG KLEQTV 
SMATTTQPMTQHLHVTYKKVTP 


13497 


27398 


A 


13646 


148 


380 


RGS WREVPE S AS LPSRGAKG KWRGLCCC 
CCCCCC CCCCCC YHCHQEQ GDLICHQADL 
WRSGRTQNQAGIWQEHQTLEG 


13498 


27399 


A 


13647 


2099 


769 


TRLAGRVSVASRPCRGPAVGGLLVERSK 
ARRPLL.ESRVAMAAVPELLQQQEEDRSK 
LRSVSVDLNVDPSLQIDIPDALSERDKV 
KFTVHTKTTLPTFQSPEFSVTRQHEDFV 
WLHDTLIETTDYAGLI I PPAPTKPDFDG 
PREKMQKLGEGEGSMTKEEFAKMKQELE 
AEYLAVFKKTVSSHEVFLQRLSSHPVLS 
KDRNFHVFLEYDQDLSVRRKNTKEMFGG 
FFKSWKSADEVTiFTGVKEVDDFFEQEK 
NFLINYYNRIKDSCVKADKMTRSHKNVA 
DDY IHTAACLHS LALEEPTVI KKYLLKV 
AELFEKLRKVEGRVSSDEDLKLTELLRY 
YMLNIEAAKDLLYRRTKALIDYENSNKA 
LDKARLKSKDVKLAEAHQQECCQKFEQL 
SESAKEEL.INFKRKRVAAFRKNLIEMSE 
LE I KHARNNVS L.LQSC IDL FKNN 


13499 


27400 


A 


13648 


1 


1206 


MSTSQSPCES ICDYVTSHDKSNFTDMIK 
LNILRCEVILDYPAPGGGSLGAKHCCSC 
YTVSSGVTEGERNAGEKGVKLNADGARI 
RGT PGRGRRAEAE AS SPAPAAVAAAC W 
AAAAASRQLASGNRTRVSSGVPAPAFLG 
TMNPNCARCGKIVYPTEKVNCLDKFWHK 
ACFHCETCKMTIjNMKNYKGYEKKPYCNA 
HYPKQSFTMVADTPENLRLKQQSELQSQ 
VRYKEEFEKNKGKGFSWADTPELQRIK 
KTQDQISNIKYHEEFEKSRMGPSGGEGM 
EPERRDSQDGSSYRRPLEQQQPHHIPTS 
APVYQQPQQQPVAQS YGGYKEPAAPVS I 
QRSAPGGGGKRYRAVYDYS AADEDEVS F 
QDGDT I VNVQQ I DDGWM YGT VE R TGDTG 
MLPANYVEAI 


13500 


27401 


A 


13649 


3 


394 


GDGGGHLGSGRNGGGSMNAPPAFESFLL 
FEGEKITINKDTKVPNACLFTMNKEDHT 
LGNI I KSQbLKDPQVLFAGYKVPHPLEH 
KIIIRVQTTPDYSPQEAFTNAITDLISE 
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peptide 
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nucleotide 
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correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

nucle- 
otide 
location 
correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystetne, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methioninc, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, ' 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LSLLEERFRVRAGPGGAD 


13501 


27402 


A 


13650 


1 


471 


SRPTGLREAAGSGPREAPRRSGCKSPGL, 
GTVGMLRPKALTQVLSQANTGGVQSTLL 
LNNEGSLLAYSGYGDTDARVTAAIASNI 
WAAYDRNGNQ AFNE DNLKF I LMD CME GR 
VAITRVANLLLCMYAKETVGFGMIjKAKA 
QALVQYLEEPLTQVAAS 


13502 


27403 


A 


13651 


1334 


82 


CYTGGTQSLWPGSSCASSVARPSSLFRS 
AWSCEWSVRCARACTMSVPAFIDISEED 
QAAELRA YLKS KGAE I SEENS EGGLHVD 
LAQ 1 1 EACDVCLKEDDKDVES VMNS WS 
LLLILEPDKQEALIESLCEKLVKFREGE 
RPSLRLQLLSNLFHGMDKNTPVRYTVYC 
SLIKVAASCGAIQYIPTELDQVRKWISD 
WNLTTEKKHTLLRLLYEAIjVDCKKSDAA 
SKVMVELLGSYTEDNASQARVDAHRCIV 
RALKDPNAFLFDHLLTLKP VKFLEGEL I 
HDLLTI FVSAKLAS YVKF YQNNKDFIDS 
LGLLHEQNMAKMRLLTFMGMAVENKEIS 
FDTMQQELQIGADDVEAFVIDAVRTKMV 
YCKIDQTQRKVWSHSTHRTFGKQQWQQ 
LYDTLNAWKQNliNKVKNSLLSLSDT 


13503 


27404 


A 


13652 




377 


TTASGRSGVKGSTMSAEVPEAASAEEQK 
EMEDKVTSPEKAEEAKLKARYPHLGQKP 
GGSDFLRKRLQKGQKYFDSGDYNMAKAK 
MKNKQLPTAAPDKTEVTGDHIPTPQDIjP 
QRKPSLVASKLAG 


13504 


27405 


A 


13653 


424 


596 


SLKNIYGLSCRKKKKGAVKKI ILVQAWW 
LMPVITVXiWEAEVGGLLEARGliRPTRAT 
W 


13505 


27406 


A 


13654 


190 


448 


LRSYPAPHLGSPELRIRKGRGHSHCLAG 
AAGPQRTALCGLSAPLCPPSPTPPGAGA 
PRYCSGSDAPPCIjLRGAGPPIPGMGDPE 
TS 


13506 


27407 


A 


13657 


1612 


563 


SMPGWRLL TQ VG AQ VL>GRLGDGIjG AALG 
PGNRTHIWLFVRGLHGKSGTWWDEHLSE 
ENVPFI KQLVSDEDKAQLASKLCPL.KDE 
PWP IHPWE PGSFRVGLI ALRLGMMPLWT 
KDGQKHWTLLQVQDCHVLKYTSKENCN 
GKMATLSVGGKTVSRFRKATSILEFYRE 
LGLPPKQTVKI FNITDNAAI KPGTPLYA 
AHFRPGQYVDVTAKT IGKGFQGVMKR WG 
FKGQPATHGQTKTHRRPGAVATGDIGRV 
WPGTKMPGKMGNIYRTEYGLKVWRINTK 
HNI IYVNGSVPGHKNCLVKVKDSKLPAY 
KDLGKNLPFPTYFPDGDEEEL.PEDLYDE 
NVCQPGAPS ITFA 


13507 


27408 


A 


13658 


128 


2626 


NSHRWVYVRARRWRRRGKQREQPEDRGV 
PMKRAAMALHS PQ Y I FGDFS PDEFNQFF 
VTPRSSVELPPYSGTVLCGTQAVDKLPD 
GQEYQRIEFGVDEVIEPSDTLPRTPSYS 
ISSTLNPQAPEFILGCTASKITPDGITK 
EASYGSIDCQYPGSAIiALDGSSNVEAEV 
LENDGVSGGLGQRERKKKKKRPPGYYSY 
LKDGGDDS I STE ALVNGHANSAVPNS VS 
AEDAEFMGDMPPSVTPRTCNSPQWSTDS 
VSDIVPDSPFPGALGSDTRTAGQPEGGP 
GADFGQSCFPAEAGRDTLSRTAGAQPCV 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

correspoh 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GTDTTENLGVANGQILESSGEGTATNGV 
ELHTTESIDLDPTKPESASPPADGTGSA 
SGTLPVS QPKS WAS LFHDS KPS S S S PVA 
YVETKYSPPAISPLVSEKQVEVKEGLVP 
VSEDPVAIKIAELLENVTLIHKPVSLQP 
RGL I NKGNWCY INATLQALVACP PMYKL 
MKF I PLYS KVQRPCTSTPM I DSFVRLMN 
E FTNM P V P P KPRQALGDK I VRD I R PGAA 
FEPTY I YRLLTVNKS S LSEKGRQEDAEE 
YLGFI LNGLHEEMLNLKKLLS PSNEKLT 
I SNGPKNHSVNEEEQEEQGEGSEDEWEQ 
VGPRNKTSVTRQADFVQTPITGIFGGHI 
RSWYQQSSKESATLQPFFTLQLDIQSD 
KIRTVQDALESLVARESVQGYTTKTKQE 
VEISRRVTLEKLPPVLVLHLKRFVYEKT 

NKNFKCHRTYRLFAVVYHHGNSATGGHY 
TTDVFQIGLNGWLR IDDQTVKVINQYQV 
VKPTAERTAYLLYYRRVDLL 


13508 


27409 


A 


13659 


42 


382 


TLWLKTIQIYLTISLGLGSDYGLAGFSA 
LGCKQAAVKMLTAFILIQGLDLGRSHFQ 
AHSGCWQDSFPCDSRIHGGLLLHVQQES 
PWFQSAERVSCITKCNHRSDTHHLCSIL 
L 


13509 


27410 


A 


13660 


178 


349 


DMGPCYVAQTVLKLLGSSDPPTSASQEA 
GTTCHHAHLLSKPFLMIYRNFSFIQCLE 
T 


13510 


2741 1 


A 


13661 


268 




RQQKVILSSSGSLMSEMGRTVLPLiKSPG 
MNPSSPLLASGGCWKSLACGHIIPFSSF 
ILMWPSPLCVSVSSSLPMRKPLLLHLRS 
TLFQ 


13511 


27412 


A 


13662 


1)2 


483 


AGVGALRMVQRLTYRRRLSYNTASNKTR 
LSRTPGNRIVYLYTKKVGKAPKSACGVC 
PGRLRGVRAVRPKVLMRLSKTKKHVSRA 
YGGSMCAKCVRDRIKRAFLIEEQKIWK 
VLKAQAQSQKAK 


13512 


27413 


A 


13663 


2 


873 


S VE E FDRG CTGRG CG AD ARAGAAM VKI S 
FQPAVAGI KGDKADKASASAPAPASATE 
IIjLTPAREEQPPQHRSKRGSSVGGVCYXj 
SMGMWLLMGLVFASVYIYRYFF1AQLA 
RDNFFRCGVLiYEDSLSSQVRTQMELEED 
VKI YLDENYERINVPVPQFGGGDPADI I 
HDFQRGLTAYHD I S LDKC YVI ELNTT I V 
LPPRNFWELLMNVKRGTYLPQTYI IQEE 
MWTEHVSDKEALGSFIYHLCNGKDTYR 
LRRRATRRRINKRGAKNCNAI RHFENTF 
WETLICGW 


13513 


27414 


A 


13664 


118 


3 


AWSLIPVIPWREAKAGGLLEPRSLRPT 
WATWQDPVST 


13514 


27415 


A 


13665 


1 




IRQRINFSRKNKWDSRKLDQAGVSELAT 
NQKLILVCGRYEGIDERVIQTEIDEEWS 
IGDYVLSGGELPAMTLIDSVSRFIPGVL 
GHEASATEDSFAEGLLDCPHYTRPEVLE 
GMEVPPVLLSGNHAEIRRWRLKQSLiGRT 
WXjRRPELLENIALTEEQARIjLAEFKTEH 
AQQQHKHDGMAADEAGRTFLRADFNMI E 
EGDRIMVCLSGGKDSYTMLEILRNLQQS 
APINFSLVAVNLDQKQPGFPEHVLPEYL 
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amino acid 
residue of 
peptide 
sequence 


Predict- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G— Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possiblc 
nucleotide insertion 














EKLGVE YKI VEENTYG I VKEKI PEGKTT 
CSIiCSRLRRGILYRTATELGATKIALGH 
HRDDILQTLFLNMFYGGKMKGMPPKLMS 
DDGKHIVIRPLAYCREKDTQRFADAKAF 
PI IPCNLCGSQPNLQRQVIADMLRDWDK 
RYPGRI ETMFSAMQNWPSHLCDTNLFD 
FKGITHGS EWNGGDLiAFDREE I PLQPA 
CWQPEEDENQLDELRLNWEPDGPRHRH 
APDARRFCQYIjRHQARYLLSWPNLjDVA 
VTEPLANGDGLNVLIKREWGFRANTVE 
KTGENQYRVWPNEMPADLHKIRPHHPLN 
RNLDHFFPLDLTNSTFSGGYVHVLKGVL 
SDDLLLKS FQKMGYVRRDSHRLMVTALP 
PACQLVQVALiGC FALRLECE I LGEVLiAQ 
LGTSVIiPAEELLQARRASGDVASCQRLA 
QDEEPPPLPPRGSPAAYRAPLDLYRDLQ 
EDEGSEDAS LYGEPS PGPDS PPAELAYR 
PPLWEQSAKLWGTGGRAWEPPAEELPQA 
SSPPYGALEEGLEPEPSAFSFliSLRREL 
SRPGDLATPESSAAASPRRIRAEGVPAS 
AYRSVSEPPGYQGTQLPVPWRPAHPLLR 
HLSPAACCPLCSPARLPSRPLAACAAWR 
RPAALVATACTDGHSALQQPRGPALGQL 
GPRSRALGRQLPWCFSIjLGRGLGL.WLPC 
PWGSPGHSRGLESPGQPLLEVGLSVYGR 
LQGQQA 


13515 


27416 


A 


13667 


76 


289 


SGTP AS P CLEMDPNCS CSPVGS CACAGS 
CKCKECKCTSCKKSCCSCCPVGCAKCAQ 
GCICKGTSDKCSCCA 


13516 


27417 


A 


13669 


2 


667 


GRVDAEQSRLGATERAAAAAMNPEYDYL 
FKLLLIGDSGVGKSCLLLRFADDTYTES 
YISTIGVDFKIRTIELDGKTIKLQIWDT 
AGQERFRTITSSYYRGAHGI IWYDVTD 
QES YANVKQWLQE IDRYASENVNKLLVG 
NKSDLTTKKWDNTTAKEFADSLGI PFL 
ETSAKKATNVEQAFMTMAAE I KKRMGPG 
AASGGERPNLKIDSTPVKPAGGGCC 


13517 


27418 


A 


13670 


176 


398 


RILiKTQIiQENNQPTTTTKNRQKTRTDTL 
PKGI YRRHI S LWKLGTWLTP VI PALCEA 
NAGGRLESRS SRP VWATY 


13518 


27419 


A 


13671 


149 


247 


RKGLALSPRLDCSGAITVNCSLNLQGLjS 
NIPI 


13519 


27420 


A 


13672 


257 


359 


VASLHGWAQWLTPVI PTLWKTKVGEFFE 
PRCSRP 


13520 


27421 


A 


13673 


142 


1 


HLKGSGLAKHQWLMPVI PALWEAE VGGS 
LEPRSSRPVWATLQDPISS 


13521 


27422 


A 


13674 


619 


168 


MFIESCCTPKIYTSFVTjRQSHSFAQAGV 
QWLDLGSLQPLSLGSSNS PASASC I QVL 
FVIYTSVKLGVGGRLGNKKVMSSNKKFV 
CECRGWI IL.PFSVPSLRFQDGGWDHGVS 
SAE KALQAGRTAGHRVG VPPLGAP CGGA 
PCTTPPAWRS 


13522 


27423 


A 


13675 


3 


370 


ARALLHPRLHQWQCLCHLQGTGGCPQQS 
PSSEGRADLLSPALEEGYPPPASGCFIP 
HFLCGCSIFNCLPPIMRHAHKSPDALLE 
TLGPCRAWRLMPVI PTLWEAKVDRLLES 
RSSRPSWPTW ' 


13523 


27424 


A | 13676 


3 


281 


IGIiRPELSVSLGRQRLTAIRLQPLLPAR 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=lVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutnmine, R=Argininc, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TPCGFLRARVALALMADETDLPLLPRLE 
YSGMITAYCSLDFLGSSDPPTLGSGVAG 
TTENCCED 


13524 


27425 


A 


13677 


272 


389 


YCRQI EKS KKKRCWLGTVAHACNPSTWG 
GRGGR I TRSRD 


13525 


27426 


A 


13678 


261 


363 


NSSWGHARWLTSVI PALWEAEVGGSPEV 
RSLRPV 


13526 


27427 


A 


13679 


149 


44 


SCVMSVSVLKKMYWLGAVAHACNPSTLG 
GQGGRTA 


13527 


27428 


C 


13680 


1 18 


267 


MSHCTWLMCVCVCVCVCGVYAYMYTYSF 
LRWDLPMIjSRLECSGYSQAQ* * 


13528 


27429 


A 


13682 


16 


346 


NHHLTQPHPHLDINDFFFFFFFFFFFFF 
FFFFFFFFFFFLKKSPRPAGKKKKKKKK 
KKKPPRFFFLGGAPPPLFFFLINFFFFF 
LCKI ISLVYNRRGGPCLSKRFFLKERKS 
PL 


13529 


27430 


A 


13683 


244 


140 


GRVDVFHHVAQAGLKLLSSGDLPAWPPK 
ELRLQV 


13530 


27431 


A 


13684 


191 


777 


NSDEHVYRCLYGHKLSFLLS KYLKSGLA 
GVLL.PLDAAVDMEKI EEQFANLHIVKCS 
LGTKEPTYIiLGI DTSKTVQAGKENLVAV 
LCSNGS I RI YDKERLNVLRE FSGYPGLL 
NGVRFANS CDSVYS ACTDGTVKCWDARV 
AREKPVQLFKGYPSNIFISFDINCNDHI 
I CAGTEKVDDDALFGGFGDARME FLQN 


13531 


27432 


A 


13685 


150 


253 


VGWGLWLVPVI PAFWEAKMGESLDPRNL 
LPCWAT 


13532 


27433 


A 


13686 


500 


321 


IEMGSHYVAQAGLELbSSSNPPTSVSQS 
AGIVGMSHHALAAISKSSDASITSHHPM 
TTP 


13533 


27434 


A 


13687 


6 


424 


MSLLQCGGIRCFKMPEPAKSAPAPKKGS 
KKAVTKAQKKDGKKRKRSRKESYSVYVY 
KVLKQVHPDTGISSKAMGIMNSFVNDIF 
ERIAGEASRLAHYNKRSTITSREIQTAV 
R LLL PGE LAKHA VS EGTKAVTKYTS S K 


13534 


27435 


A 


13688 


178 


293 


TG Y S S Q AQ WLMT VLTATWE AE AGGS LQ P 
RSLRPAWATQ 


13535 


27436 


A 


13689 


663 


144 


KELSAVSAGIPHSCGSQGCGGGSVAACV 
PAAPAAAGLCSGRAQ KVP PP P S LAGWPP 
GVNAPPPPVCSSVRLHVCQSDRLWVRLA 
ARRG I LALLRSALKAATLAGCQS VRWS V 
RPSESLRPTSNAASLFRSSVPTVLSHSV 
PLAASLGKRRACGGREHAS.VAVYLSVCL 
SLPT 


13536 


27437 


A 


13690 


125 


3 


FAHQGHAPGQAWWLRPVIPVLREAELGG 
RPEPRSSRTARA 


13537 


27438 


A 


13691 


136 


24 


GGPPFFFFFFFFFFFFFFFFFFFFFFFF 
FFPGKLQQDFQYYVK 


13538 


27439 


A 


13692 


144 


1 


MLKIVQSGECLTLKFKNFCLLSLCTLFP 
TLIALTTLLLPISPFILIIL 


13539 


27440 


A 


13693 


1476 


380 


NSWSTLASELTLWAATSADPERKSQAAS 
AAMWATLPLLCAGAWLLGVPVCGAAELC 
WSLEKFHFKSWMSKHRKTYSTEEYHHR 
LQTFASNWRKINAHNNGNHTFKMALNQF 
S DMS FAE I KHKYLWS EPQNCS ATKSNYL 
RGTGPYP PSVDWRKKGNFVS PVKNQGAC 
GS CWT FS TTGALES AI AIATGKMLSLAE 
QQLVDCAQDFNNHGCQGGLPSQAFEYII, 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, l^Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YNKG I MGEDTYP YQ3KDGY CKFQPGKAI 
GFVKDVAN ITIYDE E AMVE AVALYNP VS 
FAFEVTQDFMMYRTGIYSSTSCHKTPDK 
VNHAVLAVGYGE KNG IPYWI VKNSWGPQ 
WGMNGYFLIERGKNMCGLAACASYPIPL 
V 


13540 


27441 


A 


13694 


87 


2 


PGWAWWFTPVILALWETEAGGSPEFKSL 


13541 


27442 


A 


13696 


130 


2 


CNILIII IQVCWAWWLMPVI PALWHAE P 
GGSPEVESSRPALV 


13542 


27443 


A 


13697 


282 


107 


GLTKCPGTVCRSLKWISYQLSIICLSM 
MEHSGTILAHCSLKLLGSSDPPTSVSRV 
AS 


13543 


27444 


A 


13698 


198 


3 


CSITVNKVKVMTLFRSSMSLEIVCDIDL 
AVLRRDLGSLQDPPPRFTPLSCLSLPSS 
WDYRRPLV 


13544 


27445 


A 


13699 


2 


120 


ARGDGVSLWLPRLKCNGAISAHCNLRPT 
GSSDKNFKTRF 


13545 


27446 


A 


13700 


60 


341 


PDMGLEDERKMLTESGDPEEEEEEQEEL 
VD PLTTVREQ CEQ LQKCVKARERLELCD 
ERVSSRSHTDDDCPEELFDFLHAKDHCV 
ALKLFNNLQ 


13546 


27447 


A 


13701 


6 


540 


KNSRTLGQCGGIRGSLCRPRKPGVGGTQ 
TRAVRPAVCSADSARPHLPLRRADMKDS 
LVLLGRVPAHPDSRCWFLAWNPAGTLLA 
SCGGDRRIRIWGTEGDSWICKSVLSEGH 
QRTVRKVAWS PCGNYLAS AS FDATTC I W 
KKLTL R I YN I LR KLEAC I KP LCCAL KYG 
CLEEKQLHS 


13547 


27448 


A 


13702 


239 


346 


AKRGWLMPVIPALWEAKVGKSLEPRISR 
PLWATWQN 


13548 


27449 


A 


13703 


210 


300 


LM P V I S ALW E AKARGLL E PRT LRPAWAT 
W 


13549 


27450 


A 


13704 


141 


440 


PSAFEHFEKINLYFLKFCI SQGFPERQN 
NRDRE IHRERYERDRERHRER YERGLRE 
LAHVTVEAEKPHHRPFITWETLGCWWSG 
SVQVQKPWEPGKLMV 


13550 


27451 


A 


13705 


204 


438 


LLNVLLTQLFFLFLSLRQSCSVAQARGQ 
WYNHGSLQFSTHGASNPPTSASQSVGTT 
GMSHDHGLFLHFSTLDIFSSL 


13551 


27452 


A 


13706 


1707 


1821 


AKAGGSQHLE I LANAVKPCLYWECKMAG 
PWWCAPVGG 


13552 


27453 


A 


13707 


167 


334 


IRRANFKILTEIGWTQWLTPVIPALWEA 
EGGGLLE PGS P S LVFLFF I LTT I LMRKK 


13553 


27454 


A 


13708 


224 


344 


KDTAMEE E I KDTEKTGKP WLGAVAHACN 
PSTLGDQGRQIA 


13554 


27455 


A 


13709 


1476 


380 


NSWSTLASELTLWAZVTSADPERKSQAAS 
AAMWATLPLLCAGAWLLGVPVCGAAELC 
VNSLEKFHFKSWMSKHRKTYSTEEYHHR 
LQTFASNWRKINAHNNGNHTFKMALNQF 
SDMSFAEIKHKYLWSEPQNCSATKSNYL 
RGTGPYPPSVDWRKKGNFVS PVKNQGAC 
GSCWTFSTTGALESAIAIATGKMLSLAE 
QQLVDCAQDFNNHGCQGGLPSQAFEYIL 
YNKG I MGEDT Y P YQGKDG YC KFQ PGKA I 
GFVKDVANITIYDEEAMVEAVALYNPVS 
FAFEVTQDFMMYRTGIYSSTSCHKTPDK 
VNHAVTiAVGYGEKNGIPYWIVKNSWGPQ 
WGMNGYFLIERGKNMCGLAACASYPI PL 
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nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 
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NO: in 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nu de- 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I-lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














V 


13555 


27456 


A 


13710 


763 


257 


YEKI LKLTADAKFESGDVKATVAVLS FI 
LSSAAKHSVDGESLSSELQQLGLPKEHA 
ASLCRCYEEKQS PLQKHLRVCSLRMNRL 
AGVGWRVDYTLSSSLLQIRGKSPWCHCG 
WKVATCPQGPQAPACLPCPFSSKTKFPR 
SSLARTESRPKTLMKLPWAEEKGVPRPV 


13556 


27457 


A 


13711 


133 


401 


VLS KSGNPYWGLSTNHEIKGQEGCAWQL 
TPVIPALWEAEAGGSLEPRSLRPAWATS 
WLPREIKLDLLKKGCGPSKITQGGPMVA 
GSLKL 


13557 


27458 


A 


13712 


212 


74 


RRINLAPPRVFLKGPPFFFFFFFFFFFF 
FFAWGNP LFLFNS T FGRT 


13558 


27459 


A 


13713 


324 


141 


DRVLPRLECSGVITARiTRLNLLGSRDPP 
TSASLIAGSTGTCHHAWLSFSYLFTYVF 
FIIL 


13559 


27460 


A 


13714 


299 


431 


IRLIWLSAVDHACNSSTLGGHGVQMTKG 
QDFETNLANTANPHPY 


13560 


27461 


A 


13715 


133 


1 


I LVWAW WDMPAI P VLWEAETEGLLEPRS 
LRPAWATWKNTISTKN 


13561 


27462 


A 


13716 


18 


165 


I PLHTHTHTHTHTHTHGYLLALSGTEFL 
SVLYMASERECRTYNLMLNK 


13562 


27463 


A 


13717 


1)9 


1 


KPNVSQAQWLAPVI PALWEARVGVLFKP 
RSLRPTWATQQ 


13563 


27464 


A 


13718 


271 


64 


VLRSI STLNEIIAKTRNPPLQ I LMKPRK 
RWGTVAHLRAPVGDQPDQHSETPSLLKI 
TKISWVWWRTPVI 


13564 


27465 


A 


13719 


21 


209 


KDGPKGTGGSETNESVEMTECCSVTQAG 
VQWHDLGSLQPPPPGFKRFSCLGLPSSW 
DYSRND 


13565 


27466 


A 


13720 


391 


151 


RGFFLKGEKKKNFFPIILGKKILGSPGK 
KGEKKKKKRGKI FFFLKNPLGFFPKKKV 
LGEKKNPYSGVWGGKKKRGPPPHP 


13566 


27467 


A 


13721 


109 


2 


RRLGWAHACNPSTLGGEVGGSQGPGNC 
G.QVTKVS 


13567 


27468 


A 


13722 


69 


369 


RDILQADGAHSLFPTTPSLNAYI IFSSP 
IGPHTHRPYHATPTYLPFYTNNLIKKKK 
KKKKKKKKKKKKKKKKKKKKKKKGGGPF 
KKTPGGAP I FGGGG K 


13568 


27469 


A 


13723 


58 


1208 


FWNENS PAS ELAPNGGGS VTS VPRLEDY 
LTRPQLYKLTGVEGPSRANSRDS FHDRK 
TYKTPS ANMMVLKVE ELVTGKKNGNGEA 
GEFLPEDFRDGEYEAAVTLEKQEDLKTL 
liAHPVTLGEQQWKSEKQREAELKKKKLE 
QRSKLENLEDLEI I IQLKKRKKYRKTKV 
PWKEPEPEIITEPVDVPTFLKAALENK 
LPWEKFLSDKNNPDVCDEYKRTALHRA 
CLEGHLAIVEKLMEAGAQIEFRDMLEST 
AIHWAS RGGNLiDVLKLLLNKGAKI SARD 
KLLSTALHVAVRTGHYECAEHLIACEAD 
LNAKDREGDTPLHDAVRLNRYKM IRLLI 
MYGADLNIKNCAGKTPMDLVLHWQNGTK 
AIFDSLRENSYKTSRIATF 


13569 


27470 


A 


13724 


400 


135 


KWNGKELNGVDG IGWDWNEMYWKGLE FH 
ETEWNELERNGLEWNGMS WDGI KCNG I K 
WNGIECSGMLWNGMECIGMECPLMEWIR 
VEWN 


13570 | 27471 


A 


13726. | 186 


344 


SASLGI.WRCRDCRRSLVHSVLNVAQAGV 
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NO: of 
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sequence 
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beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
corrcspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparaginc, P=Prolinc, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














QWRDLSSL.QPPPPSRLPWPPKVL.R 


13571 


27472 


A 


13727 


380 


168 


APPQKKVFSSKFPQGFFFAPPNRKKLFF 
SPPRKIVP PPKFFLWAPPLFFFFF'FFFF 
FFFFFFWYSRTDYK 


13572 


27473 


A 


13728 


324 


219 


RIMFFIFFYFFFFFFFFVFFIiFFFFFKR 
PPPFFY 


13573 


27474 


A 


13729 


2 


270 


RLQEASIjS PVPE I LGGALPSAKRTSYLL 
IiSTLCLLLSETASRGNSLTGLGHRSDHY 
NSVRSGGQSLYS ACP I VTT I HGTCYGGK 
AKCCK 


13574 


27475 


A 


13730 


134 . 


3 


AVHRCKKWGRGWGRWLTPVI PTLWEAE V 
GGPLEPRSLRPALAT 


13575 


27476 


A 


13731 


429 


641 


RKKAVC FMNDLI C FLDNTFKNNVLSQAW 
WCVHLVPTIWEAEAGGSLEPRSLKLQCP 
WAPVNNCTPAWAT 


13576 


27477 


A 


13732 


188 


32 


I FFFFLFFYFLFFFFFFFPRSRHCTLQP 
GQQSETRSQNKKQKTKNKQTKNLP 


13577 


27478 


A 


13733 


341 


3 


GTVTPEEPKKMGDQTPRGLPRPERTLGR 
LSTSESRVDPTPRVGGGSSGGRTAPWGP 
PRERGRVAGSSLHRACNFSTSNTI ISVG 
LWLTPVI PTLGEAEAGGSPEVRSSTHAS 


13578 


27479 


A 


13734 


208 


2 


KLYKNFFFRFTKNPPPLLKKGFYLYNFF 
FKKKKKIVKKKKKKKKKKKKQKKKKKKK 
GRPLAI KNYPRV 


13579 


27480 


A 


13735 


16 


212 


EG VSLLL PRLE CRGT I S AH CNLRHRGRR 
CTPASASQILRRLKQENHLNLGGGGYRD 
PIKLQRTAS 


13580 


27481 


A 


13736 


22 


214 


PDRPALPTRPGGWITWGREVRDQPGQCG 
ETPSLLKIEKLAGCDGTCLEQNCLQMNN 
SNNKVGIR 


13581 


27482 


A 


13737 


386 


51 


WGNSITQQAIiFEHLiCFSGHTSLPAVPCP 
HQACFPTLiPLICRGIIIAHCNLGLiLiGSS 
DS P I S AS P I DGNTGLiHPHARL FS FLL W Y 
RWGLTMMPKRSLS PGLKPSSRLADAWV 


13582 


27483 


A 


13738 


204 


431 


S QH FG I S RRADHL S SNVGDQ P E QHGKN S 
LiYKYKKINQGWAWWLTPVIPPLWEGQVG 
GS PEGKEBKTPLRNLGKPHL 


13583 


27484 


A 


13739 


69 


159 


I KKEALSWAWWLT PV I PALWKANAGGS P 
E 


13584 


27485 


A 


13740 


32 


285 


FFLSFFLFFFFFFGGGGKKKTKKFFSQD 
FLGPPKILGGGGFFSFFFGGPFFLKKGK 
KSPQNLFFLKKNPPPFFFPPPKKPLWGF 


13585 


27486 


A 


13741 


372 


213 


YRSCVLLQQNPPLFYQSWWNFSLCVCVC 
VCVCVCVCETGSLKKFIiLTQKKKR 


13586 


27487 


A 


13742 


29 


237 


•TKRKGYKTNKGYWAWAWWLTPVIPALFI 
AEGNL I LQMQKQAQ P EAGCGCVC VC VCA 
CMCVCVCVFKVFF 


13587 


27488 


A 


13743 


322 


183 


HTHTHTHTHTHTHTHTNPIHVFWL.LLKY 
PKIQSHQFHISAFQDPLL 


13588 


27489 


A 


13744 


305 


373 


VGWGWWLTPVIPALWEAKVGGS 


13589 


27490 


A 


13745 


172 


1 


KNKIEEQM KWEMRQEAVAHS CNP S TLGG 
QGEFETSLVNMMKPCLYKKMQKISQVCS 
C 


13590 


27491 


A 


13746 


285 


1045 


FQHPFGLSQSEMAAVKASTSKATRPWYS 
HPVYARYWQHYHQAMAWMQSHHNAYRKA 
VESCFNLPWYLPSALLPQSSYDNEAAYP 
QSFYDHHVAWQDYPCSSSHFRRSGQHPR 
YSSRIQASTKEDQALS KEEEMETESDAE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEO ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glurnmic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VECDLSNMEITEELRQYFAETERHREER 
RRQQQLDAERLDSYVNADHDLYCNTRRS 
VEAPTERPGERRQAEMKRLYGDSAAKIQ 
AMEAAVQLSFDKHCDRKQPKYWPVIPLK 
F 


13591 


27492 


A 


13747 


2 


305 


GRVGS FS VRDVELSDPARERGEMPVAVG 
PYGQSQPSCFDRVKMGFVMGCAVGMAAG 
ALFGTFSCLRIGMRGRELMGGIGKTMMQ 
SGGTFGTFMAIGMG IRC 


13592 


27493 


A 


13748 


2 


305 


GRVGS FS VRDVELSDPARERGEMPVAVG 
PYGQSQPSCFDRVKMGFVMGCAVGMAAG 
ALFGTFS CLRIGMRGRELMGG I GKTMMQ 
SGGTFGTFMAIGMG IRC 


13593 


27494 


A 


13750 


238 


423 


AVSWDQLTWGTGVQEKKIQAQAWGLMPV 
IPTLWEGEVC-GSPEVKSSRPACPHGKTP 
FLLKMQ 


13594 


27495 


A 


13751 


455 


248 


ISVGPGIFQLNFCFLEPYGTFLiSEKKFF 
IMETGSRFVIQAGVQWGSYSSPQPQFFG 
FESSLKPPHSGVK 


13595 


27496 


A 


13752 


2 


98 


IPTPTIHTHTHSHTHTHTHTHTSFLYMP 
PDLK 


13596 


27497 


A 


13753 


196 


2 


I WAPPKI FLKKPPLFFFFFFFFFFFFFF ' 

FFFFPFFIISFIPWPLTRKQKLERSWVF 

FKDSACSA 


13597 


2749S 


A 


13754 


917 


379 


ICYKKCSLQKNLLLVGCKKYSLCYRRHKL 
HLVTHGERKKPAAVNSFFLFFSFFFFLN 
LVRNTE ITKKRVNLHETKADAES CNQDC 
TTSSETASELEQIRSGKHNSGKWAGEGA 
AGGRREGERMEWTEMRRGARGRRGGRER 
EMERARVRGGEEREKEIDLYKKVTSKIE 
ETKLGNLLKLG 


13598 


27499 


A 


13755 


175 


21 


KKKNFFFPPRVILGPPKVFFKRAPLFFF 
FFFFFFFFFFFFFLKKSWRPLAI 


13599 


27500 


A 


13757 


234 


341 


EIGWVQWLMPVI PAVWEAEVGGLL.EPRS 
LRAARAI 


13600 


27501 


A 


13758 


1 


365 


PAPNRRGGHIQDRATNSTELGANQCFFF 
SPRPPSLEKKTPEINKEPRPAPQPSNPG 
NLGTREGGDSWAGTTRCLRRTDEGDTYR 
TEPPTALSWGQTRAFFFPPALPAGKKRH 
RNLLKTQFF 


13601 


27502 


A 


13759 


87 


181 


SHTHTHTHTHTHTHTFYLHWIHFDMEI 
LGL 


13602 


27503 


A 


13760 


1 


228 


ARGERERERERERERERERERERERERE 
RYREGGRLLLTMGEGETERATDL YHTP P 
PS I KAWRLCEPRPRAG ILCGRN I F 


13603 


27504 


A 


13761 


383 


197 


RCDP PAWVSQSAR I TGVSYRAQPATS VL 
MGEEDFLEDPVIAFVRLAPAVLLSKLSE 
VSVAMT 


13604 


27505 


A 


13762 


276 


57 


YAVLGGGGGRKNLFVFVFFLFFFFFFFF 
FKKQG FGLMPLGEDRG PNMG FFYFPMVG 
DKTKLPFKKKKKKRPR 


13605 


27506 


A 


13763 


377 


209 


PRPAYGPAFLFTEG FS WNP FFF FQMES P. 
SVAQAGVQWCYLGSLQPPPPWFPGTLLN 


13606 


27507 


A 


13764 


160 


202 


MEKYNVHPHSGILHSHEKEQAALFTIAK 
RWRQPSYPSIDEWKNIMSIHTVEYYTAM 
KRNKQLYSQ 


13607 


27508 


A 


13765 


390 


284 


ESGGHFLSLSLSIYIYIHIYIYVCVYIY 
THTSHTI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Aspnragine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W= Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13608 


27509 


A 


13766 


350 


41 


YWILQENRNTQNRKLLPKSTELCMLLQ 
MLLFILNTTNYPLPLWQCYHRFCSFFLK 
TGS FYVTQAKVRWLFTGM I TVHCSLKLL 
DSSHPPKWLVLQELATVAS 


13609 


27510 


A 


13767 


330 


156 


KIT QAW W CVHWPDN REAE VTG V I R LGR 
MRLPLEGACTNCVPSWEKEQDPVSHNEK 
K 


13610 


27511 


A 


13768 


25 


292 


KCFFLS WRGG S RLQS QHFGRPRRADKLR 
PGVPDQPGQHGFLLVKMSDKPDLSEVEK 
FDRSKLKKTNTEEKNTLPSKEIFFSLVG 
VNIQD 


13611 


27512 


A 


13769 


585 


680 


KCLGSRTRWLTPVIPTLWEAEAGGSLRP 
RSS 


13612 


27513 


A 


13770 


488 


359 


PSPRERKFLPFFKKKKGWGPPPPPKKNR 
GRGPQQKEGPQKPKP 


13613 


27514 


A 


13771 


794 


515 


PDMGLEDEQKMLTESGDPEEEEEEEEEL 
VDPLTTVREQCEQLEKCVKARERLELCD 
ERVS SRSHTEEDCTE ELFD FLHARDHCV 
AHKLFNNLK 


13614 


27515 


A 


13772 


130 


397 


WGLTPLCFCMRASLPRHS KPQARKTKK . 
KKKKKKKKKKKKKGGSLKKKI FGGAKKS 
GATKKKNFPKKRGQNKKNPGFFKNEI FF 
GGGAI 


13615 


27516 


A 


13773 


376 


145 


TRVGGGCSEQRLCHCTPSWATEPNPVSN 
NNKI PLELFPSNQTFFHFVRNWKKRGNN 
WLSNYS ISSTVTRHFPTLFQF 


13616 


27517 


A 


13774 


1 


247 


GLQSIADLLSGLLQKVCLPLSNGIMDLY 
LLYYLFTFETGSCSVAQARVQWCEHGSL 
QPHTSASSDPPTSATTTCHHARLIIW 


13617 


27518 


A 


13775 


376 


145 


TRVGGGCSEQRLCHCTPSWATEPNPVSN 
NNKI PLELFPSNQTFFHFVRNWKKRGNN 
WLSNYS I SSTVTRHFPTLFQF 


13618 


27519 


A 


13776 


107 


468 


EKARSPERARDQEGGTERDRNSEREKIL 
PKLREELPWVSGGWRCPWPHQGWSHWED 
NPGWGIPTGPSVGWGEKKGPGEGRSHKY 
GTGRTKCELGVS IGNSAFTFLLHFYFKH 
RKRERKQI 


13619 


27520 


A 


13777 


125 


2 


NIFFPLFELYYFKCHWARWLTPVIPALW 
EAKEGGSPEVRSS 


13620 


27521 


A 


13778 


166 


74 


GRICYSHSLSKHTHTHTHTHTHTSHISF 
IP 


13621 


27522 


A 


13779 


1 


341 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERERLS FSLGGGGALKKKRI FLCVTLE 
GKHLPPLTHTGFFGENTHTLSMKTREGG 
FICAGGDERRSASVMRAYIYRECEPHTH 
V 


13622 


27523 


A 


13780 


614 


337 


RRCSSALCYRRHGNHKVKIRSKQGASVS 
PHEHLR I LSLLE VRNVGDGHS IFPHSLR 
M YVCM YVCM YVC ISIYLSIYLSIYLSIY 
LSISISSR 


13623 


27524 


A 


13781 


390 


158 


VLIRLS WGRI CFQAHSGCWQNSGS CSHK 
TEGLCLLLAGSCSWLLGGALKGWASKEE 
FYTMQHSCQSDTLSPLPYSIH 


13624 


27525 


A 


13782 


308 


163 


TKFYLLFFFFFFFLFIFFFFFFFLKKKF 
LFFIQGGGKIKTPGTLTFLG 


13625 


27526 


A 


13783 


45 


277 


IASGRPFFFFLHLPPFQKAFVFGGGGGT 
PFEYQNFVAYIKGKQGNPFLFGCGDLFN 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














APQL I KQLSQLG QKY PKKPLL 


13626 


27527 


A 


13784 


413 


218 


PNFPPFLFFSGKFFFFAQNLPFGFFPPP 
GEGKAFPGQGCPVGGGGFAGPTFFFGPP 
AFPEILYFF 














I D WEND LTPMVLCHG P S I YNK Y S FS QN I 
FPISANFFFIAHINLTLLQLVSELSHKI 
ILRHFYIKPIKDILNLYLILPGVWLFDL 
CKSTAICRYLCVCIYIHLYIHTHXHIHI 
YTHIRL 


13628 


27529 


A 


13786 


161 


425 


RWGSCYCPGYPHLKIFKLLLLLVLVTjEM 
gvlpllprlecsgmitvhcnleflgtsy 
PPISVSQTKKKKRPFTLPPLPNLQIGLP 
LRGE 


13629 


27530 


A 


13787 


70 


311 


DLiIS ITALVI S PNKFYI YLS I YLS I YliS 
I YLS IIYLIFIS TNtiLWVLS LWETLNKT 
RLLNLFLTIQNNIWNPKLLVSALI 


13630 


27531 


A 


13788 


25 


219 


RNMAAATLTSKLYSLLFRRTSTFALTI I 
VGVMFFERAFDQGADAIYDHVNEGKLWK 
HIKHKYENK 


J 363 1 


27532 


A 




279 




RRS LALiS P R WD CG LQ WRN LG S IjQ ALL PG 
FTPFSCLSLPSSWDYRPF 


13632 


27533 


A 


13790 


138 


28 


GGWVRWLTPVI PALWEARVGGLLEPGSS 
KKERGKKEV 


13633 


27534 


A 


13791 


133 


7 


YSCQRCLTHGQAQWLTTVI SAPWEAKAG 
RLLE PRSVRSAWAT 


13634 


27535 


A 


13792 


86 


290 


EHVHRTLIEASVSGWGWWLTPVTSAAWE 
AEVGGSLEGQEFE I SLGKCSETPSLTTT 
TTESLGS FAATS 


13635 


27536 


A 


13793 


150 


37 


KI IYPSQARWLMPVIPALCEAEVGRLLE 
PSSSRLAWAV 


13636 


27537 


A 


13794 


216 


77 


PTQHPTEGYMIKQQTVYHECRMWANSHL 
CLLPEGLLRAVTPWCHAP 


13637 


27538 


A 


13795 


251 




VARAWGIAPVI PALWEAEVGGSPEGRSL 
KPAL 


13638 


27539 


A 


13797 


177 


274 


VQWHMSWPAIQEAEVGGSPEPRSLRPA 
WATY 


13639 


27540 


A 


13798 


307 


406 


KFKKGWAWWLMPVIPALWEAKAGGSPES 
RSLRP 


13640 


27541 


A 


13799 


194 


3 


FFFFFFFFFEMGSHYIVQARVQWLFTDA 
NIVHCSLQLLASSDPPVSTSQVGLQACA 
DDAQNPE 


13641 


27542 


A 


13800 


333 


198 


LPSVFFFFFQTESCSVTQAGVEWCDLGS 
LHS SLGNRARLCLKI K 


13642 


27543 


A 


13801 




335 


ETERGRERETERGRERETERGRERETER 
GRERERERERERERERDREEGRERQRGE 
GKKNRDREENRRRETERVGDWCVCPHP 
NLILNCSPYI SHVLREGLG KTYLNYGGS 
FPTLCSG 


13643 


27544 




13803 


14 


454 


RSHLISSLLCLEGI ILSLFI IATLITLN 
THS LLAN I VP I A I L VFAACE AAVGLALL 
VSKKKKKGAPVLKNPWGAQSLRGQARKY 
FFPYREPNKNLPGNLGKEPFLGGGDILG 
QPPYKN 


13644 


27545 


A 


13804 


1474 


1661 


TDFHNSHLLKCCKNCIFSTLNYYIRREH 
FSIVFIFCYICWKVHYIRRELFRCLYF 
LLYMCC 


13645 


27546 


A 


13805 


360 


3 


LWGKRGGPFSPPPFFFFLELFFFLKKMG 



1376 



_0164836A2_I_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
Inst amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, IW=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argininc, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, "=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GGGFFGGFFKGPPKGEIFFFVFFWAPQK 
KKKKKPPPFIFWGGGPFFFFFFPPPKQK 
KKKNKKKKKKKQKKKKKKKKKRAAHWPA 
RSRSSA 


13646 


27547 


A 


13806 


1 


425 


RLGGVALRSAADGAFVSGEFCGGKLLRW 
CLVTDFPPDSCTTCSYSRRSTPGCSPGG 
SRGLSEGEGSSVSLQRSRVLSAMKHVLN 
LYLLGWIiTLLS I FVRVMESLEGLLES P 
SPGTSWTTRSQliANTEPTKGLPDHPSRS 
M 


13647 


27548 


A 


13807 


5 


313 


EKPYLQGITFTSKTTHLIYEFTPYTIIV 
KEYSTNYVLLIGNTQITKIYHIVFRKFS 
MTP I YKFQDLAQWLTPVI PTLWEAKPGG 
LJjEPRSLQSSYSIjIILIL 


13648 


27549 


A 


13808 


182 


314 


VITQLDKTERAQWLMPVILVLYEAEVGG 
LLEAMSSRPAWATKTW 


13649 


27550 


A 


13809 


219 


1812 


LPPPESGAMSGFNFGGTGAPTGGFTFGT 
AKTATTTPATGFSFSTSGTGGFNFGAPF 
QPATSTPSTGLFSLATQTPATQTTGFTF 
GTAT LASGGTGFFLG IGAS KLNLSNTAA 
TPAMANPSGFGLGSSNLTNAISSTVTSS 
QGTAPTGFVFGPSTTSVAPATTSGGFS F 
TGGSTAQPSGFNIGSAGNSAQPTAPATL 
PLTPATPAATTAGATQPAAPTPTATITS 
TGPSLFASIATAPTSSATTGLSLCTPVT 
TAGAPTAGTQGFSLKAPGAASGTSTTTS 
TAATATATTTTSSSTTGFALNIjKPLAPA 
GIPSNTAAAVTAPPGPGAAAGAAASSAM 
TYAQLESLINKWSLELEDQERHFLQQAT 
QVNAWDRTLIENGEKITSLHREVEKVKL 
DQKRLDQELDFILSQQKELEDLLSPLEE 
LVKEQS GT I YLQHADEEREKTYKLAEN I 
DAQLKRMAQDLKDI IEHLNTSGAPADTS 
DPLQQ I CK ILNAHMDSLQW I DQNSALLQ 
RKVEEVTKVCEGRRKEQERS FRITFD 


13650 


27551 


A 


13810 


134 


263 


KNSLFKKKNNGRP WPLMPVI PALWEAEA 
GGLLEPRGL.RPTWVT 


13651 


27552 


A 


13811 


210 


51 


TLSHRKPISFEAYFARVRGFIL.EVSETR 
NPPIGWTRWLTPVIPALWEAEAGG 


13652 


27553 


A 


13812 


241 


139 


LHEAGLAPMIPALWEVKVGGLLEPRSPR 
PAWASW 


13653- 


27554 


A 


13813 


539 


256 


RTTQMSTAAGFTEAPNWKQPRCPWTEEQ 
TNKMWSLHAMECASAMKWNEVLIQPAVR 
MARENSRRKPGDMHDIWSVWSAGEWLPG 
LGRSTGKGSD 


13654 


27555 


A 


13815 


63 


369 


VRETPLKTHYLEEISSPASPTAIPQSLL 
FSFFISPPSSIATGSGHSGHPVHSLHHP 
PETEPSPSVCLWAGPKVPPGAAGKGSPD 
SNPLVIRSLLAPPPASL 


13655 


27556 


A 


13817 


258 


3 


AESAPPFSTNSLFFTHTHTHTHTHTHTH 
THSLRTTRWAPSQVCRPQTSQRGGECVG 
PTAFAP S PTLLKPHHPSS HVHLP SQPRR 


13656 


27557 


A 


13818 


516 


665 


WFKSGSFWLGiWAHrCNPSTLGGQGGWI 
TSGRSRTS VTTTTTS QT CAPA 


13657 


27558 


A 


13819 


295 


361 


WLTPVI PALLEAVTGGSLEPRS 


13658 


27559 


A 


13820 


166 


368 


GWWATQHSAQYWSSSS IGALCLGESCS 
VTQAGVQWCDLSSPQPSHPGFKRFLCLS 
LPSSWDHRGDL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26- 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

location 
correspon 
ding to 
Inst amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Metbionine' 
N=Asparaginc, P=Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W == Tryptophan, V = Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


13659 


27560 


A 


13821 


133 


530 


RKYCGQCLNMALNNVSLSSGDQRSRVAY 
RSSHGDLRPRASAIiAMVSGDGFLVSRPE 
AIHLGPRQAVRPSVRAESRRVDGGGRSP 
REPDGRGRSRQARFSPYP I PAVEPDLLR 
SVLQQRLIALGGVIAARISV 


13660 


27561 


A 


13822 


76 


1958 


RQE L I W PLCS P PQGDRFLQKS W I FFRPV 
MADKLTRIAIVWHDKCKPKKCRQECKKS 
CP WRMG KLC IEVTPQSKIAWIS ETLCI 
GCGICIKKCPFGALSIVNLPSNLEKETT 
HRYCANAFKLHRL.PIPRPGEVLGLVGTW 
GIGKSTALKILAGKQKPNLGKYDDPPDW 
QEILTYFRGSELQNYFTKILEDDLKAI I 
KPQYVDQIPKAAKGTVGS ILDRKDETKT 
QAIVCQQLDLTHLKERNVEDLSGGELQR 
FACAWCIQKADIFMFDEPSSYLDVKQR 
LKAAITIRSLINPDRYIIWEHDLSVLD 
YLSDFICCLYGVPSAYGWTMPFSVREG 
INI FLDGYVPTEKTLRFRDASLVFKVAET 
ANEEEVKKMCMYKYPGMKKKMGEFELAI 
VAGEFTDSEIMVMLGENGTGKTTFIRML 
AGRLKPDEGGEVPVLNVSYKPQKISPKS 
TGSVRQLLHEKIRDAYTKPQFVTDVMKP 
LQIENI IDQEVQTLSGGELQRVALALCL 
GKPADVYLiIDEPSAYIjDSEQRLiMAARVV 
KRFILHAKKTAFWEHDF I MATYLADRV 
IVFDGVPSKNTVANS PQTLLAGMNKFLS 
QLEITFRRDPNNYRPRINKLNS IKDVEQ 
KKSGNY FFLDD 


13661 




A 








KCWDYVHEPPRPAPMQTYQVDLRCCFVS 
T INLLVCGERAS L P S VES LVFSGLAE VW 
MKAVLAPSRESGKLiVIiQQVPEYAEAIWG 
AR P ACRGLV FSA 


13662 


27563 


A 


13824 


245 


91 


YKWFGPPRGSFFRPPPFFFFEVFHFFF 
1 1 FFFFFFFFFFFS FFLGCFMYV 


13663 


27564 


A 


13825 


276 




PGQKKKTLSPKKKKS I YLY IHTHI Y I YT 
HTHYIYKYIFVCVY-ICVSIYIHTHIVPR 
IFS 


13664 




A 








IYNTYIHTHTHTHTHTHNVYLYIQSGDC 
IYAVLLGELSFIYIFVFLSSLISILFHL 
TY 


13665 


27566 


A 


13827 


278 


34 


KPRG LS P AG VS AQ RAWG PHS C PRLPD P R 
TELQVRQIPALVGSSAFQGRRGRKGPVG 
GSGPSRPRAPPQPRPGCCQCTLL.QI 


13666 


27567 


A 


13828 


62 


307 


GGARAI FLNVPHLKRPMFFLELCLSTKC 
PVGTEFALDSLLRLYNSNTFCNVQWLMP 
TI PALCKAEMGGJjLiEARSLI PAWAT 


13667 


27568 


A 


13829 


386 


478 


NRIGMVSHACNPSTIiGGQGRRIMRSGDQ 


13668 


27569 


A 


13830 


364 


484 


NFK I WLG WAW WLMP VI PALWEAKAGGS L 
EVRGSGPAANMV 


13669 


27570 


A 


13831 


33 


486 


PARSAEFGTRRERERERERERERERERE 
RERERERERERERERERERERERERERG 
VICVCPCALLCVRESDPLSQKKYCVCVF 
WRYTESLALSPFCLGRCLALLIFLEGM 
WRKIWAPSFLSLCVGALFFSQHPPPFWC 
IiCVLSLVRPLPLSLSGGVFVSQRALI 


13670 


27571 


A 


13832 


6 


718 


YS AVE FAMAG VGAG P LRAMGRQALLLLA 
LCATGAQGLYFHIGErEKRCFIEEIPDE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TMVIGNYRTQMWDKQKEVFLPSTPGLGM 
HVEVKDPDGKMIiQVVLSRQYGSEGRFTF 
TSHTPGDHQICLHSNSTRMALFAGGKLR 
VHLD I Q VGEHANN Y PE I AAKDKLTELQL 
RARQLLDQVEQIQKEQDYQRYREERFRL 
TSESTNQRVLWWSIAQTVILILiTGIKQM 
RHLKSFFEAKKLV 


13671 


27572 


A 


13833 


17 


130 


RL.QEFGTRRERERERERERERERERERE 
RDTHSFLHG 


13672 


27573 


A 


13834 


283 


460 


LWRLAICKNYSLKRETEPGSVAHVYNP 
STLGGQGGRTARGQEFKTSLDNIARPPS 
LQK 


13673 


27574 


A 


13835 


3 


435 


WPRFCTALQEFGTRRERERERERERERE 
RERERERERERERERERATSLSRARPSF 
LFPSACVSHTPYVGERENLSLHTLKHTH 

TREKHSLS ytnas ardtrar fs egrap p 

IYSVYLTHGVFFFFFFSLCLSVRHAEGL 
AHV 


13674 


27575 


A 


13837 


351 


57 


RTLVHISKFTGNLSDLYYVQSEWQDLGT 
ADLSIMTSVXjHCLPSIiESIKNTDSWPSS 
VAHTCNPSTLGGRGRCRDQHGQHSKQPS 
bLKIQKIARRCGG 


13675 


27576 


A 


13838 


336 


150 


TFCYKKYLWGAHCFIFRTTGLPIKNFYF 
GQAWCLLPVISARWEAETGGSLEPRSLR 
VAWAS 


13676 


27577 


A 


13839 


310 


454 


HCS LGLPQVLL 1 1 LYKI LLG WARWLTP V 
I PALWEVEAGGLLEARSSRP 


13677 


27578 


A 


13840 


163 


297 


IIFFLLDLKLQGAMTIYVSIYLSIYLSI 
YLS I YLYTHTLWNTLQL 


13678 


27579 


A 


13841 


33 


316 


LDQHPTPRSPLLCHSLRKTSSSQGGKSE 
LVKQSLKKPKLPEGRFDAPEDSHLEKEP 
LEKFPDDVNPVTKEKGGPRGPEPTRYGD 
WERKGRCIDF 


13679 


27580 


A 


13842 


4474 


2586 


DGGSG CVKME FPGGNDNYLT I TGPSHP F 
LSGAETFHTPSLGDEEFEI PP I SL.DSDP 
SLAVSDWGHFDDLADPSSSQDGSFSAQ 
YGVQTLDMPVGMTHGLMEQGGGLLSGGL 
TMDLDHS IGTQYS ANP P VT ID VPMTDMT 
SGLMGHSQLTTIDQSEJjSSQLGLiSLGGG 
TILPPAQSPEDRLSTTPSPTSSLHEDGV 
EDFRRQLPSQKTWVEAGKKQKAPKKRK 
KKDPNEPQKPVSAYALFFRDTQAAIKGQ 
NPNATFGEVSKIVASMWDSLGEEQKQIY 
KRKTEAAKKEYLKALAAYKDNQECQATV 
ETVELDPAPPSQTPSPPPMATVDPASPA 
PAS IEPPALSPS IWNSTLSSYVANQAS 
SGAGGQPNITKLIITKQMLPSSITMSQG 
GMVTVI PAT WTSRGLQLGQTS TAT I QP 
SQQAQIVTRSVLQAAAAAAAAASMQLPP 
PRLQPPPLQQMPQPPTQQQVTILQQPPP 
LQAMQQPPPQKVRINLQQQPPPLQIKSV 
PLPTLKMQTTLVPPTVESS PERPMNNS P 
EAHTVEAPSPETICEMITDWPEVESPS 
QMDVELVSGSPVALSPQPRCVRSGCENP 
PIVS KDWDNEYCSNE C WKHCRDVFLAW 
VASRNSNTWFVK 


13680 


27581 


A 


13843 


54 


251 


E FYRMNS P S PLWLC VCVCVCACLFMCLC 
ALTCMCVWYKSLLHSNLTQVLSSKFYDS 



1379 



MSDOCID: <WO 0164835A2J_> 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














ISPAENPGK 


13681 


27582 


A 


13844 


154 


26 


DPHVPPTERSEGSQARWLPPVI PTLWEA 
EACGSPEVKSSNST 


13682 


27583 


A 


13845 


475 


225 


GHTELYGGALFQLLQAAI PEGGGTSCLF 
NPSSDGGSSAAGGWGGAGDLARSSRVP 
SPTSSRPVPSPDPSPPSLYLTPWKETH 


13683 


27584 


A 


13846 


479 


319 


RDCRJRS LTHSVLSGAQAGVQWRDLGSLQ 
PPPPSRLPWPPKAPRWQPLPGHHPI 


13684 


27585 


A 


13847 


182 


499 


LLCVKLCDS CNENKAIiREQTRAQVQGGH 
SRAQVQQVQGGHSAHTWQ IRS AASRRK 
AFSTCSSHLGMVLLFYGTGSSTYMRPTT 
RYSPLEGRLAAVFYS ILIPTLN 


13685 


27586 


A 


13848 


388 


494 


RDSWERWLEPLI PVLWEAQAGRLLEPRS 
LRLAWAT 


13686 


27587 


A 


13849 


198 


63 


HHATTWMNLEDIMFRKVSQTQKNKYYM I 
PLIWKTKKVLSHREQK 


13687 


27588 


A 


13850 


555 


353 


RCP PGAQAPLPAPS PAPPVH I ALS PLS C. 
GVSWPRELTAHPPEPPPPPFFKIHPHPQ 
LSVGKRGQMTP 


13688 


27589 


A 


13S51 


169 


66 


LLAGSTGQARWLTPVI PELWKAEVGGSL 
EIRSSR 


13689 


27590 


A 


13853 


12 


105 


IASGLHDFFKKKKIKKKKKKKKKKKKGG 
AL 


13690 


27591 


A 


13854 


256 


363 


ICLFIKDSDRGQTQWI/TPVILTLWKAEA 
GGSLEPKS 


13691 


27592 


A 


13856 


23 


436 


IDRLSDSHEERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERESSLLCVRHTCFAPPTYIFLWETLR 
VCPTPCVWGALSVFSHTERARIFFYTCG 
GSVARAPVCAHTRI FSSRARARALCARP 
PLSRERRAHTRVFFI 


13692 


27593 


A 


13857 


91 


428 


PDPDT S PLTTDRGETPLGATLP S CYCGG 
QQSFEPKTKKTTNQKKKKKGKKFTPQKK 
NPKIFPPKPFCPKGPRGENQPKKTQNQP 
KKKKKI LGGGPAPNS PbGGGKKNS WAGF 


13693 


27594 


A 


13859 


465 


43 


KRVTTENPQHICSAETAIPWSLSQNCNF 
HILWWGHPNHPIYITTSIYIYIYIPHIY 
GMVY I Y TR I CM CVC I YVHI C I HTHTHI Y 
IYIHTHTHIYMERERETVSCSVAQAGVQ 
WCDLGSLQPPNKRLKRFSCLSLPSSWDY 
W 


13694 


27595 


A 


13860 


290 


52 


WYKYLPGTRGFL I WLNYLKCI GRI ELLV 
GYFSFIYLSIYLSIYLSIYLSSRSIAQP 
GVKRCNHSSLQPRTPGLMHLSNS 


13695 


27596 


A 


13861 


108 


22 


IFFLFSAVKTRFSYAFPKEFPYRMNHVS 
I 


13696 


27597 


A 


13862 


167 


33 


VFTVPGTYMCVSVRDL.GQAWWLTPVIPA 
IjWEAE AGGS LEE FLRT 


13697 


27598 


A 


13863 


448 


355 


FFFFFFFFFFFIFFFFFYFFFFFFFAKN 
PLN 


13698 


27599 


A 


13864 


321 


445 


SQSTQAWCLMPVIPALWETKSSESFEPR 
SLRPAWATGHNPI 


13699 


27600 


A 


13865 


380 


80 


MILSPSPKFPSFLFSLFSALLLWHCLEL 
PFL.SNVGKGMQNDPVGL.SPTSTPSDLSR 
IYLFFCFFVCLFVCFETGFPSVAQAGVQ 
WCDPLGLKWFLPPQSP 


13700 


27601 


A 


13866 


372 | 264 


KMENPEDKNFFIFFMFFIVFLLFFFFFL 
FCLTQFF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Prolinc, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13701 


27602 


A 


13867 


161 


304 


LFIAEMTSHYIAQAGLELLASSNPPTSA 
SERTG ITDGATE I LPQLRL 


13702 


27603 


A 


13868 


230 


348 


VSFIQNLVFCVERVYRVPDFGVWERGSK 
YNNGSTELHSK 


13703 


27604 


A 


13869 


168 


260 


EVIAECYNGCFQTLSLCVCVCVCVCVCV 
CY 


13704 


27605 


A 


13870 


140 


16 


LKYGQWRDLGSLQPPPPRFKQFSCLSLL 
SSRNCTTMTRDHE 


13705 


27606 


A 


13871 


143 


30 


KILKVWSQAWWLIPVIPALWEAEVGGS 
LEPRISRLA 


13706 


27607 


A 


13872 


158 


59 


HLHG C I HLL KLRLD AVSHACN S STLGGQ 
GGRIA 


13707 


27608 


A 


13873 


232 


115 


APFFFKGAPIFFFFFFFFFFFIFFFLIR 
GKPGNDVGVEG 


13708 


27609 


A 


13874 


1619 


1375 


KTTWRRHLEKLQDMSLLRHRVHVDSSQG 
LFPFFQRQGLLPRLKCNGDI I PYCNLKL 
LGW SNS PAS AFR VARTTS LCRHTWL 


13709 


27610 


A 


13875 


219 


84 


NPLQSTSTICNRAWDRVTCLWSQLLRRL 
RWEDHLSPGVPSCSAL 


13710 


2761 1 


A 


13876 


166 


496 


KK KKKKKKKGGG P LKKT PG GAKI NRGR K 
KKIFFQKGGQKKTPGGILKKXPFLGGGK 
KGPNPPKKIKGLREKKKFKRGKGAKPAQ 
NPWGKKISPPGFFLKKFFPRGRGFFI 


13711 


27612 


A 


13877 


132 


348 


PSKKKKRGG P FKRTPGGPKFNGGGKGNS 
FPL MGGG I K PNLG I LG KNP Y LGGGTNGN 
NPPTEIKGFGEKKKF 


13712 


27613 


A 


13878 


287 


134 


QQGPPISPGPKEENRMGPPSPGWGNPGI 
RS P PGAGPNPGAFQDGMFPS KKQ 


13713 


27614 


A 


13879 


333 


480 


VHPLRSAEGPRPESNMLHLTLKSKQQWL 
TPVI PALWE AKAGGSQGQE I E 


13714 


27615 


A 


13880 




227 


PKIHCKIQHIWLVSFFPSFFIjSFFQGE 
SCSAAQAGVQWHDLGSLQAPPPGFKWFR 
FIELLGLGFHSRRMRGKDS 


13715 


27616 


A 


13881 


276 


378 


GWPWWLTP VI PALLEAEVGGLLEPRSIjR 
LALAT 


13716 


27617 


A 


13882 


126 


389 


GSVLGTGCCGSVLGTVCRGSVLGTVLIA 
SATPDRYLRLSSGSSLTRITNSDLVPWL 
TPVI PALWE AEAGVS LEPRIWR PARETW 
EDPH 


13717 


27618 


A 


13883 


414 


150 


ARQAPKWGNPTGSPPPGFLNPPPKNFFL 
GPQKKKIFFPTPPPPHFFFFKGPPPFFF 
FFFFFFFSPHASGILLCHPGRIAVAQSW 
LTE 


13718 


27619 


A 


13884 


197 


363 


DLGWWNTFMLKFKCLNVSRGHVQWLI 
IPVIPILWKAEAGRSLKPRSLRPAWAT 


13719 


27620 


A 


13885 


101 


236 


NIHYANFPYFGLACPEYIVRALQKKYTQ 
THTHTHTHTHTRTTHS L 


13720 


27621 


A 


13887 


17 


140 


TVHLKM VKIGLHMV AHACN P S TLGGRGG 
PIMRSGALGNFCL 


13721 


27622 


A 


13888 


135 


1 


VLHAGLKLLGSSTSPTSASKGAGTVGIH 
YHTQLAFQFFLFCFLIR 


13722 


27623 


A 


13889 


411 


276 


TLYFKHGEMGFHHVGQAGLELLASSDPP 
ALEFETSLHNKVRPRL 


13723 


27624 


A 


13890 


216 


54 


EFKLCYTLFDHSYFPLEELVSGQVWRLR 
PA I PS F WEAKEGGS LEPRS S KHE I V 


13724 


27625 


A 


13891 


3 


149 


SSDPRPSSSWLAWVGLWSMVWRVPPFLL 
PILFLASHVGKSPLSPSPHF 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

correspond 

amino acid 

peptide 


Predict- 

location 
ding to 

residue of 

peptide 

sequence 


Amino acid sequence (A=AI:tnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=lVlethionine, 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 


13725 


27626 


A 


13892 


166 


1 


LFSKIFWWLLPVIPALREAKVGELLEPG 
SSRPAWATWKNPVYTKNTKISQAWQCT . 


13726 


27627 


A 


13893 


12S 


2 


FSTTYPNP I KLPHPYLPLLTLFLDSAHL'" 
HPS E I NSL VAQKKK 


13727 


27628 


A 


13894 


225 


391 


HLS IHI LTFPFSLiPEQWWLVPI I PVLWE 
VKMGGLLEPRSSRPAWATQQDPQVSRK 


13728 


27629 


A 


13895 


192 


1 


TLVCVSSSVPKNYNNICLACWFMPI IPA 
LWEVESGSLLELKSSPPAWPTWWVPLKS 
KNTKISLA 


13729 


27630 


A 


13896 


152 


. 245 


DQS IADVLMYYTHTHTHTHTHTHTLQNQ 
VPG 


13730 


27631 


A 


13897 


270 


406 


ISTRFFLYRDRASLCHPGWSAGVQSWLP 
GSSNSQAQAILl.PQPPE 


13731 


27632 


A 


13898 


232 


73 


EAEGPRDVTSSTFHWRLYDQTTNYHECR 
MWANS RLRLTPEGLiLKVI TPWCHAH 


13732 


27633 


A 


13899 


107 


7 


LGQAQWLMP 1 1 PALWEAKAGGSLE PRTS 
RPAWA 


13733 


27634 


A 


13900 


62 


224 


PTR P I liAHCNLCVLGS DD P P AS AS QS AG 
VTGMSHCAWSNL I IHLPSRSTAICTL 


13734 


27635 


A 


13901 


450 


194 


KRQVKSHSEVKTTHVQWPG I DPGS PAWE 
AR I L PLNHQRLS G TT P WR I TE KSNH KDL 
ETVLSGFFKCRLKANKDIQTKCVFIGNF 
Y 


13735 


27636 


A 


13902 






AAPGRAPQKKKKKK.KKKKKKKKKKKKAP 
FIKRGR 


13736 


27637 


A 


13903 


131 


29 


ALGSLQPLSPRFKRFSCLSLPSSWNSTA 
LTVTTM 


13737 


27638 


A 


13904 


267 


74 


VWVLVP PFTS YGVLDKP I L I VNN I YVHV 
CVCLCMCVCVC VC VCCTV I RRHC WHNM I 
IIISQHDQ 


13738 


27639 


A 


13905 


193 


64 


IRSHQIHKLHFKKFRILGWTWWLTPVIP 
APWEAEEGRS PEPSS 


13739 


27640 


A 


13906 


309 




QSTDFLENRRVGWAWWLiTPVI PALWEAE 
VGGLPEIKNS 


13740 


27641 


A 


13907 


404 


251 


SCLISL.RSWDYRHVPPQPANFVFFFFEM 
ESCSVAQSGVQWQEENSISKINK 


13741 


27642 


A 


13908 


471 


59 


GPPQAKKGKGFPTPGPPQKKGGGQPKSG" 
VLGFSPPPLGPCPGLGPPKGWGFWGGSF 
FFFFFFRKWEHKNELFPFSKTEKPRRAG 
EARKGSQSTKYAGLSRGERQASPTLRPG 
HLAKAS AGG WLQYTRWQG KLPEPGN 


13742 


27643 


A 


13909 


479 


231 


QEYCMLIFCEHFGDFDLNIVFCLEKKMV 
GVIADFYLLS S FTGRPQWLMPVI PTLWE 
AEVGGSLEAKSFTPAWATWQDPISMF 


13743 


27644 


A 


13910 


167 


19 


CIVLGIMLDSLWEPRQWVPS WVRW FTP V 
IPALWEAEAGGSPEVRSSNSS 


13744 


27645 


A 


13911 


328 


446 


GS FSKI TDLSQTWW YVP VVLALQES EVG 
RSLEPRSSRLQ 


13745 


27646 


A 


13912 


145 


460 


NAFMSTGCEVLSYSELRQRNLVWWFLPV 
IPATPQAEAGRLLGPRSSRPPWATKGVR 
ALSHKQTNTKTNKIjSHSTGVSFVFTHHI 
HS PG I LWRLSQWIiLAMNIiATN 


13746 


27647 


A 


13913 


319 


210 


KLGFFKGPPLFFFFFFFFFYFFFFFGFL 
TVSCNFTL 


13747 


27648 


A 


13914 


372 


10 


PNTTCTPITYGKKCCPREISYYLCHEFI 
RIRILTFFDGESCYIAQAGLKLLGSRSP 
PTSASGVAGNTDVCHHTQP I LFYILETG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
Od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
Inst amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutnmic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LiALLPRLILNSWSQAILLPPQPLRSYFN 
FWLFFCLYN 


13748 


27649 


A 


13915 


594 


471 


PS Y I AYM CICMCVCV YVY IN I CVCI YTH 
VHTPYNPVISKN 


13749 


27650 


A 


13916 


235 


3 


EANNLIRKTYTSGWFQRTSKRPPAVRRM 
KVLFFETESCSVAPVGVEWHDHSLLQPR 
PPGLNRSSHLS FPS SSDYRWA 


13750 


27651 


A 


13917 


272 


184 


KTTAWAQWLTPVISTL.WEAKVGGSLEAG 
5 


13751 


27652 


A 


13918 


46 


321 


SRVQGTGGHRGRGCIQAGVGEQEAKRET 
EGPPWRWLTPVISKLWEAKAGGSLEPVS 
SRRATWNLLKRI YYTDEREYLI PLALNT 
SQGSTFSF 


13752 


27653 


A 


13919 


387 


484 


F I E 1 1 GWARWLM P VI PTLWEAEAGGS PE 
VRSS 


13753 


27654 


A 


13920 


142 


321 


FLFSIHFHLSCWGLFSNVWQSLSIFSYL 
KVRLYGQAWWFTP I ILALWEDKAGGSLD 
PRSL 


13754 


27655 


A 


13921 


270 


412 


I EDKKI YS ILHTETLERGQARWLTPVI P 
AFWEVKAGGSFEPRSLRPV 


13755 


27656 


A 


13922 


304 


396 


NFGWAQWLTPVIPALWESEAGGSLEPRG 
LRS 


13756 


27657 


A 


13923 


305 


387 


VWWL.TPVIPALWEAEAGGSPEPRSSRP 


13757 


27658 


A 


13924 


103 


3 


NKSPELGWAWWLAPVIPAIiWEAEAGGSP 
EVRSS 


13758 


27659 


A 


13925 


195 


400 


CLLILSGLVASCI IERHGKGQAWWLMPV 
I PTLWEAKVGGS LELRNLRSAWGTWRNS 
ISIKYTEQLAQG 


13759 


27660 


A 


13926 


294 


489 


KWAKMQTLHKGR YACVC I CVCVCVC VYT 
YTDIYIFYIHIWPMNTKCSMLLVIGEMK 
ILSRMWSNQ 


13760 


27661 


A 


13927 


363 


2 


NWGPPGFPPPPFLKTGPVFFIFGAPKKK 
FFLSTPRALKFVLLKGGPLFFFFFFLWV 
FLVKTGSHCVAQPDLELLSSSNPVLVSQ 
GAGITGMSPQVQPIjDLFFFFSFFETGSR 
SYAWADAW 


13761 


27662 


A 


13928 


1 


218 


LRFKKRKTNKPKMWSVHTMEYHSALKRK 
E IRTQTTRWINLED I VLSE I RVAVTKGR 
ILYDSTQMRNLEESNL 


13762 


27663 


A 


13929 


219 


408 


TFCHFIAFLHVYKMFWFFFSFIMFFFP 
LNCNKAFERKKKKNKKNNKKKKKRKKIS 
LSPPAPP 


13763 


27664 


A 


13930 


161 


22 


WHPPASASHVAGTTGACHHERLRQEDHF 
SLRVQGCSEPRLCHHTPP 


13764 


27665 


A 


13931 


116 


I 


INTGWARWLTPVI PAFWEAAVGGSLEPR 
STS Q AWATW K 


13765 


27666 


A 


13932 


181 


38 


PWPQRATQPKISPLPPAISALWEAEAGG 
ALEPRSSRPAWAKSDAWVU 


13766 


27667 


A 


13933 


267 


397 


TVFDRTLFF I KK 1 1 HSWAPWLT PWPAL 
HEAEVCGSPEVRSSG 


13767 


27668 


A 


13934 


129 


12 


NIGVGWAIjWLTS VI PALWEAEAGGPPKV 
RSSRTARPMS 


13768 


27669 


A 


13935 


179 


1 


KTYYGG 1 PPPGPPKKGEKKPPFFFKKFF 
LFFISFFIFFFFFFLRQSLAKMQSASVW 
KVS 


13769 


27670 


A 


13936 


245 


616 


DDKKKKEAAQKKATEQKIKVPEQIKPSV 
SQPQPANSNNGTSTATSTNNNAKRATAN 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 

0(1 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 
ed end 
nucle- 

location 
correspon 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I— Isoleucine, 
K=Lysine, L=Leucine, M=Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R = Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NQQPQQQQQQQQPQQQQPQQQPQPQPQQ 
QQPQQQPQALPRYPREVPPRFRHQEHKQ 
LLKRGQHFPVEF 


13770 


27671 


A 


13937 


121 


292 


NTDLDLALALIVRVLTSDDNFKNWGWVQ 
WLMPVFPAPWEAEVGGSSEATNLRPAWA 


13771 


27672 


A 


13938 


421 


202 


GPREAPLFPSTQGISRPLFFFFFFETES 
SQSVCMPEAAILVSQIT 


13772 


27673 


A 


13939 


205 


543 


ILESHRRLGWSEALPENQLLCRASSWES 
LGPPDPPCLKPPSPAFWGAAGGWRAKST 
S VS EP VP VG TLL VAL FLEMA VGLLL GL P 
FCFHS LGAPHLLCE PDGPS VLPAEARQS 
R 


13773 


27674 


A 


13940 






LSQHSEEYTLPPPLYLEKESSRPPPPQH 
TPHPPPPWPWHPDIADIQ 


13774 


27675 


A 


13941 


47 


303 


GLIFLIIWTCIVKTSTDFPRMEDCSQCI 
HQVTEESNKRMGFLSYYANPHHGSSRLL 
WPQHAAPWDDGRRGKPVFSLGFVSFPFP 
Q 


13775 


27676 


A 


13942 


1488 


1719 


PLVSFSKNTYPCLGNVKQWQKTTFCSRW 
KLHVLSSNLSSPAEVTVVASINLVVSEQ 
SFQNSQLPAMLCLIVHLQMS 


13776 


27677 


A 


13943 


127 


402 


QPVTHPPNRTMCSSMIFLFLYLCFLSLS 
P F PSHLS LYAP P KKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKAGGGGPFK 
KKIHFPPGGGRI IFFY 


13777 


27678 


A 


13944 


852 


209 


EQTVYCTRIDLQQKLVFLRLEAFCLLSQ 
GDPGSPMMCQLQQFDLWVLRGVLNFGGE 

LSS LHHHEKL I S FSHHGPNATMTQKT YS 
DSE LGHVGS YLQGQRRT ITHS RLGNS SR 
DSLDVREKDVKESGRSPEASVQPLYYDY 
YGGEVGEGRI FAGQNRLYQPEE I ILGSL 
RACFLFAAVSSPGATPPN 


13778 


27679 


A 


13945 


68 


260 


INIFWHCVLLYVILYAISQFLLFPCVKN 
IASWLGSVAHACNPSTLGGQAGESLEPR 
NLRPAWAT 


13779 


27680 


A 


13946 


420 


156 


FLPPGVKYGSFKRAPPFFFFFFFFLVET 
LSCYIAHTGLELPDSSNYPTSASQSAVI 
TGMS HHTW PLEGACLAI PQALR I LLVLV 
PLH 


13780 


27681 


A 


13947 


317 


17 


AGHGGS TCNPSQP I VTAVP INL WGRDLL 
QQWGAQAFI PEQLYSPQSQHTMQEMGYV 
PGMGLEKNLKGLKKPLQAEGQNSHQGLG 
YNFSWQPI.LSLQNL.YIi 


13781 


27682 


A 


13948 


55 


224 


NFRLTSSFSYLKMSFWLGMWLTPVIPAL 
WEAKTGGLFEARS LRLVWWQS KKAGSHL 


13782 


27683 


A 


13949 


1464 


805 


RATSVRGAGRERS CGAAWS PAS I GP FLR 
RSVLPNAHFRSSREGGMAASTDMAGLEE 
S FRKFAI HGDPKASGQEMNGKNWAKLCK 
DCfCVADGKSVTGTDVDIVFSKVKGKSAR 
VINYEEFKKALEELATKRFKGKSKEEAF 
DAI CQLVAGKEPANVGVTKAKTGGAVDR 
LTDTSRYTGSHKERFDESGKGKGIAGRQ 
D I LDDSGYVSAYKNAGT YDAKVKK 


13783 


27684 


A 


13950 


356 


98 


NAGPGIYFWGPIKKILPCPPAGVKLGSL 
KRAPLFFF F VEMGFHHVAQAGLKLLS SR 
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peptide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, i=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














DSPIPVSQSIRITSTCEPPCLAKIKSSL 
RL 


13784 


276S5 


A 


13951 


194 


48 


NFLEGWPGQRLTPVIPALWKAEAGGSLE 
PKTFRPVWAMWRDRLRKPQH 


13785 


27686 


A 


13952 


341 


418 


DQPGQHGETPSILKIQKLAGRGSTCL 


13786 


27687 


A 


13953 


158 


323 


TWLKKY FQSTNQL P PAHLLNVQ PLLPFG 
LPLLPLKKKKKKKKKKKKKKKKKKIKK 


13787 


2768S 


A 


13954 


231 


355 


VFLPQKQKQNQCPLTGKWVNNSWNIHTL 
EYYSAVKGMKQLI 


13788 


27689 


A 


13955 


493 


39 


PAAFSSWGWAPGSSL.LKKTTPASWPPGL 
LPPAPVSSMAAPDLDSFPLPLCWTFLRV 
LCRVFSSSTIAFLSCVSTSFGALTPAAS 
SNIFSISLPHWASPPLPTARLTSAVSPE 
PLPPLPSRLQRQPRGAPSHPGMSMGP I L 
DATSNSTTRSR 


13789 


27690 


A 


13956 


497 


372 


NTKISWASWYAPVIPATWEVSNSRPQAI 
HSHQPPKRIGLQA 


13790 


27691 


A 


13957 


438 


301 


GTTYL.DIDMTCDYVSVCLCIYICTRQQS 
YIYTYVYTHTHTHTHMYI 


13791 


27692 


A 


13958 


84 


466 


QPLGRSGKVPQLHLWEMQRLPAAFFRS F 
AAQGLGESVHINTPTSLSLRGRPFISSS 
GPGARARVTCAPCLHAARI LCCFWRGLH 
RLKCSKVISAHCNLiRFSGSSDSPASASQ 
VAGITGRSSLEQLLE 


13792 


27693 


A 


13959 


216 


28 


KYVKNYVLiSS 1 1 FWMQE I FHNFKKSSRT 
QWLTPVI PALWEAEAGGSPEPRSSRPAS 
SIQLENS 


13793 


27694 


A 


13960 


320 


943 


VLSFSLSDRAQRCNCRMGKQNSKIiAPEV 
MEDLVKSTEFNEHELKQWYKGFLiKDCPS 
GRLNL.EEFQQLYVKFFPYGDAS KFAQHA 
FRTFDKNGDGTIDFREFICALSITSRGS 
FEQKLNWAFNMYDLDGDGKI TRVEMLE I 
I EAI Y KMVGTVI MMKMNEDGLTP EQRVD 
KI FS PCMDKNKDDQI TLDEFKEAAKSD P S 
IVLLLQCDIQK 


13794 


27695 


A 


13961 


191 


8 


GLFRFRRLSEEVKHCCCCCCCCCCCCCC 
CCCCCCCCCCCYLRLAPQSPAAPAPEIiH 
TPLPGSRRNR 


13795 


27696 


A 


13962 


203 


28 


VWGNQHFCFVPVFRKKVFCFFPI.NKiLR 
GGFFFFFFFFFLVFIDNSWVFLGEGDLA 
GS 


13796 


27697 


A 


13963 


18 


171 


GIRHEERERERERERERERERERERERH 
PARE IDMCVSKRDTRGAL CALF 


13797 


27698 


A 


13964 


25 


299 


HDFCTRRERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERERERERERERERETHTLSLYIYMCV 
FLYIPLYFSFLYIHTQREAPPPLSATIiS 
LSIL 


13798 


27699 


A 


13965 


246 


410 


AYNG QG S RGVT P V I P S LiWE AKTGG SLE P 
RS WRSAWATWHTP PVPGPS KEI SLY I 


13799 


27700 


A 


13966 


91 


26 


FCCCGCCCCCCFYCKKEKTLV 


13800 


27701 


A 


13967 


158 


24 


P FCH I Y I YFLES CF VAMLE CSGK I LAHC 
YLRLLGSSNSLVPSLA 


13801 


27702 


A 


13968 


129 


7 


SWAQWLMPIIPTLWEHEAGRSLEAMSSR 
PAWATQQDPVSKK 


13802 


27703 


A 


13969 


445 


268 


KIQLTKLKNASESLPSRINQAEEKISGW 
AWCLMPVI PVLWEAKAGGSLEPRSLRRG 
QVT 
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NO: in 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utaniic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13803 


27704 


A 


13970 


1628 


1309 


HRRKKCAGYSSLSSALLRERERERERER 
ERERESVCVCVLGTITCFRRQYCVIQGI 
SGVRQNCVQI PALSDPDLFLFLSLSVLH 
SNTWWRPTLRVTERKKGAMCV 


13804 


27705 


A 


13971 


314 


140 


QCHNCYILTQLVKIKGDQAWWLMPIIPA 
LGAKAGGSLEAGSLRSAWATWRAPMSTK 
IF 


13805 


27706 


A 


13972 


292 


482 


YKSVRTQDS CAHGVDRLVGKI R YVNLFV 
Y I CQH S DGRT W WLTP V I P PL W E TKGGG S 
SEVGRSG 


13806 


27707 


A 


13973 


502 


303 


STAIGPSFSPFLCTHGSFHGFLFFFFCF 
CFFLVEMGSHYVAPAGLKLLGSSDPPAS 
ASPSHLGLQV 


13807 


27708 


A 


13974 


493 


289 


IPPTALGTSFSPFLCTHGSFHGFLFFFF 
CFCFFLVEMGSHYVAPAGLKLLGSSDPP 
AS ASPSHLGLQV 


13808 


27709 


A 


13975 


405 


121 


IREAAQSESLFVRAAPHTGLXSGPCITE 
IYAVKDTRCMQRRILVRELECAAHGLTE 
CLS IDTHTHTHTHTHTHTHTHTHTHTHT 
HGTRWEAGQSLKGES 


13809 


27710 


A 


13976 


135 


254 


TVTCTNSSWSLTFTSFLFLMPGWIQKGR 
RVCVCVCVCVC 


13810 


27711 


A 


13977 


179 


58 


VPLS INSPQRQCI FCCCCCCCCCFLVLQ 
KIISERSWAEIG 


13811 


27712 


A 


13978 


97 


186 


KI IFWLGWAHACNPSTLGGQGGRIMRG 
RD 


13812 


27713 


A 


13979 


289 


420 


ELGLKIHGFPSQAPWFMPI IPALWEAEA 
GGLLESRSSRIAWAMW 


13813 


27714 


A 


13980 


421 


106 


SGGRNFFFFLGGGFYKKKFPPPPFLRGK 
KKKPPLPKKKNPTPIFFPPGGGPPFSPP 
PKKGGGKIFFFFGGKNFIMQNFGGFPPL 
KGKKKKFLPKKKNPLFFFFFF 


13814 


27715 


A 


13981 


309 


135 


KHIYIYLSIYLSIYLSIYLSIYLPAYLP 
T YLHAHPL I HANAYKN I H I D Y P FKKG FE 
Y 


13815 


27716 


A 


13982 


231 


21 


QHMYCLKSIWHKKLSRKPYCYIISEYSS 
MVKLRVS ISQVQWLMPVI PALWEAKAGG 
LLKARSSRFMWAI 


13816 


27717 


A 


13983 


209 


359 


HLHTHTHTHTPNG VKQ IMTQI PILPLTG 
PLILEKFLNFSKPLSHPGKRDP 


13817 


27718 


A 


13984 


391 


249 


IWPILPPQKIFFFFFETESCSVTQARVQ 
WCNLGSLQPPNPVFPSLAK 


13818 


27719 


A 


13985 


181 


403 


YLVLDSPESTLGILSQPKDYITKDQTLT 
FR P VWAHACNPS TLGSQG KW I TSVQEV 
ETSLCNTARPCLYQKYKN 


13819 


27720 


A 


13986 


441 


358 


ETGSRCVILAGAQWRDLSSVQPLPPRFK 


13820 


27721 


A 


13987 


7 


233 


AS I PCI YQKTKI PNTKQTKNKNTK I TPA 
WWCTPIAPATQKAEIPPLHSSLGDFLRL 
SPKKKKGGPNWAPTPSKFF 


13821 


27722 


A 


13988 


152 


44 


PKAPLFFFFFFFFFFYETRSHSVAQAKV 
QRHDHGSL 


13822 


27723 


A 


13989 


188 


379 


NVSCHAAETRVWWEKERPLPVPFGAYLL 
DGMPLVWLLSAS SILSIWLSI YLS I YLS 
IYLSIYF 


13823 


27724 


A 


13991 


296 


53 


SSILFHASKYHFSGRLFSLLSCKRRNIL 
QLVPHMYSPYLYMYIHTHKHI YEYMYVH 
I Y I YMNLRNCR I YLHS CTP KENQAR 


13824 


27725 


A 


13992 


37 


290 


RLGS PASRHQQ IQCL IRACLKDGTLGWV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 

nucle- 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














WWLI PVI PALWEAKVGGSLEPRSSTPAW 
ATQGDL I LKKDGTLLQCPHVAKGHS SHS 


13S25 


27726 


A 


13993 


189 


370 


SRNAI FKKRWESSCQSSFNSLGVFPQAA 
TKGWVRWLTLV I PAL WEAKAGGS LEPRS 
SRPA 


13826 


27727 


A 


13994 


179 


25 


SKHNKKNIQICIiLCSKRSMFVHVCIHIH 
ICVYVYRYAYIYIGMCIHIYIHS 


13827 


27728 


A 


13995 


159 


2 


EDPLS PGVQDQHGHHRKSLSGRGGKI SL 
GQEFKSSMAAMYFGRSGGCKLERR 


13828 


27729 


A 


13996 


307 


99 


FIWEHFVNYTMFFTYKKPTSQAWWLMPV 
VPALWETEAGGSPEVRSPRPPGQHDDVR 
PLGPSLHVYLQMD 


13829 


27730 


A 


13997 


218 


21 


FQHFGRPRRVDHLSLGVQDQPGQHGETP 
SPbASSILDLIPPPFPIDEPVDLAFQSV 
TLGVQDTTS 


13830 


27731 


A 


13998 


227 


404 


GYSVTLLLGLTQLTQRIAHGTRKGERER 
ERERERERERVCS CLMHELASHLLFFFL 
FRE 


13831 


27732 


A 


13999 


400 


668 


PVGRRWELQGMGLGVRTWQSSLPCQGPT 
SSSVLSLGRGTPGISNTCLPPRSATELG 
liPAQVPFPNVQSQDQDLSFSLGCFPRTY 
NRQTL 


13832 


27733 


A 


14000 


225 


414 


NRVLLCHTGWSTVAI LAYCSFKLLGSRD 
S PTLAS RAARTI VKCRNVWL I KKKKKKK 
KGGRRL 


13833 


27734 


A 


14001 


397 


118 


LFSGPFFFDKSECFQEPTVPFCLIjLYNQ 
MQPYLLRSAFVDSYYMYDSFLRDRVSFC 
QSGWSTVAPSRLTAAliNSLAQVI L.LPQP 
LNVLCGISR 


13834 


27735 


A 


14002 


3 


173 


YVFYRQKQMVCFYK I R I KNMFNRNRLNL 
GQWCVP I VPATQ VAEAGGS PEPTSLRP I 


13835 


27736 


A 


14003 


440 


290 


YTHTLYIYIYIHTHTHTHTYIYliCTYWG 
QFDLLVLDNKLYKE I YYLSVFI 


13836 


27737 


A 


14004 


2 


116 


ARLVAMP FKYEE I KD FLLTARRKDAKS V 
KIKKNKSAAV 


13837 


27738 


A 


14005 


279 


422 


SAKAPLSCLRTNS LLLKNGLRGWAWWIT 
PVI PTLWEAE VGGS PDVTS P 


13838 


27739 


A 


14006 


3 


497 


GG I GDSRCGS TRAS S S PQLAGRS SSVLiP 
AAAQPCTPTMDVFKKGFSIAKEGWGAV 
E KTKQGVTEAAE KTKEGVMYVGAKTKEN 
WQSVTSVAE KTKEQANAVS EAWSSVN 
TVATKTVEEAENIAVTSGWRKEDLRPS 
AP QQEGE AS KEKE E VAE E AQ S GGD 


13839 


27740 


A 


14007 


1 


292 


SLGGGGCNDPRSYHCTPAWATEQDTVSL 
SLS I YMS VRVCI YM YMC VCVC I R I YTY I 
HTHIHIRVYIYTIPYIWGIYIYIYVDLV 
IYMGWCGCPPPSI 


13840 


27741 


A 


14008 


57 


250 


KIFFFLMNHTWGGVPWLATWEAEVGGS 
LEPRSLGLQETMMTSLLS RLPRQQGE IL 
SLILKRNY 


13841 


27742 


A 


14009 


184 


2 


AVPGRWPCQQVPGLLPS IQDTLKQGRTG 
QWLTPVTPMPWEAEAGGSLDARSLRPAW 
ATGK 


13842 


27743 


A 


14010 


123 


3 


KEPSRPGRWLTSI I PACWEAEAGGPPEA 
RSSTREFRTAK 


13843 


27744 


A 


14011 


403 


168 

■ 


GTGLRASRCISQPVMSGLLSPSEFPFPE 
I ESHLHVS QRETAARLRDRERERARERE 
RERERGKNASSGKPPGSASCVP 
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SEQ ID 
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od 
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NO: in 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystcine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W-Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13844 










422 


E I RRGS LAG PLS LN F ILS VMGSHS KYLR 
KVIGRAQWLMPWPALWEAEVGKLHELR 
RLGHTWAAW 


13845 


27746 


A 


14013 


200 


405 


WLCVLGWSLLALSPRLECYGPISAHCNL ' 
HLLRSSDSPASASQALQEAWYQHLLLMM 
PQAASTQVGRKK 


13846 


27747 


A 


14014 


319 


176 


KPSTITFLFLHLFISSRESRPVTQTEVK 
WL.NHSSPQPPTPRFSLPSS 


13847 


27748 


A 


14015 


63 


401 


EQENKNTLNS PGWDGVHI HVLRFLMTRM 
PLGHLSFSEGVSGSGSSGRRCQAGAGGR 
SATAAARPCQCLGLKGLPPAVPSCGLEG 
KAPERAGYS ASHP PHGCETGNGWVLYVN 










416 


63 


VSKVYPKINHTQRTVRHETFKIAMPKYY 
SNLCS II.RFNKDRLAFMRHYTKQCS RTS 
ITEYIAKFFFVCLFVLKIGAHCHPGWNE 
WLVCSQVTAASTSRAQAILPPQPPKIL 
GMQQA 


13849 










425 


SLMFRPPLFSFCCFCIRYPFIFVFFSLL - " 
C FAFS WCS LLLFFFHYLKL FLiLFLLF F 
LPVLL.CLDS FLVHDFFLLSS 


13850 


27751 


A 


14018 


3 


233 


SSRMGRGTKTIAFDMRWSCEVDRARHRD 
REREREREREREREKGAEPQEFPVISPW 
VDRPCLAGILHLKCGCGHRL 


13851 


27752 


A 


14019 


163 


1 


AERWHDQI CVERE I WG WVQWLMPVI PVL 
WEAKAKNLLEPKNLKLQRNRRPGNSR 


13852 








203 


83 


LKKKKNKSQAQWLTSWLASQEAEAGGS 
LEPRNSRPAWAT 


13853 
13854 


27755 


A 


14022 


358 


3 

431 


VTWQRELRLQMELRLLICRAWWLTPVIP ' 
VPWEAKAGGSPEPRSSRPAWETRNRRPG 


13855 


27756 


A 


14023 


98 


3 


WHKKKNQNWLGTGAHACNPSTLGG 

GRPAGWLTPVAFTLWEAEMGGSLEPRSL 

GHE 


13856 










385 


RENWAEPSSHSLLNI FLQLGTVARACKP 
STLGGRGGWITRSGD 


13857 


27758 


A 


14025 


140 


1 


KGVLDGPRGLCVFCFCFFFLCVCVCVCV 
CVCVWRRSLTLSPRLQCI 


1385S 


27759 


A 


14026 


579 


852 


QREWVGWAGKEGEGWVSHVPASAQLPAN 
GQRGQPHPS PLGGTGSTWAQEGAYCCLS 
SCSHCCCCSSCCCCCPPGFCLFPSLGAI 
CHLYHL 








14027 


30 


663 


LRIRALRELPASHIPGSLTICCVPRPPL""' 

PCSSTKPDAKGYKPLAQQKLGSQRTGPC 

LGHAPCYSPLWELRGHGCSPYGPSPSGS 

VSLWQEAMRLPKNTPEEKDRRTAALQEG 

LRRAVS VPLTLAETVASLWPALQELAR C 

GNLACRSDLQVAAKALEMGVFGAYFNVL 

INLRDITDEAFKDQIHHRVSSLLQEAKT 

QAALVLDCLETRQE 


13860 


27761 


A 


14028 


1 


267 


AVGVHHAFHLPHCFFASLLES PVS PRLA 
^DPNCSCAAGVSCTCAGSCKCKECKCTS 
CKKS CCSCCPVG CS KCAQGCVCKGAS EK 
CSCCD 


13861 


27762 




14029 


46 


1746 


PAAGAATMEFRQEE FRKLAGRALGKLHR 
CiLEKRQEGAETLELSADGRPVTTQTRDP 
PWDCTCFGLPRRYI XAIMSGLGFCISF 
3 1 RCNLGVAI VSMVWNSTTHRGGHVWQ 
<AQFS WD PETVGL IHGSFFWGYI VTQI P 
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nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
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ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, iyi = Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibie 
nucleotide insertion 














GGFI CQKFAANRVFGFAI VATSTLNMLI 
PSAARVHYGCVI FVRILQGLVEGVT YPA 
CHG I WSKWAP PLERSRLATTAFCGS YAG 
AWAMPLAGVLVQYSGWSSVFYVYGS fg 
IFWYL.FWLLVS YES PALHPS I SEEERKY 
IEDAIGESAKLMNPLTKFSTPWRRFFTS 
MPVYAI IVANFCRSWTFYLLLISQPAYF 
EEVFGFEISKVGLVSALPHLVMTIIVPI 
GGQIADFLRSRRIMSTTNVRKLMNCGGF 
GME ATLLLWG YSHSKGVAI S FLVLAVG 
FSGFAISGFNVNHIjDIAPRYASILMGIS 
NGVGTLSGMVCP I I VGAMTKHKTREE WQ 
YVFLIASLVHYGGVIFYGVFASGEKQPW 
AEPEEMSEEKCGFVGHDQLAGSDDSEME 
DEAEPPGAPPAPPPSYGATHSTFQPPRP 
PPPVRDY 


13862 


27763 


A 


14030 


240 


63 


VFTAQSNGGVELNRFNEGPLSAVLKGRV 
AWAQWLTPVIPTLWEVKAGGLLEARSLS 
PA 


13863 


27764 


A 


14031 


289 


414 


HAEMGRVQWL.TPAIPALCKAEAGGPPEP 
RSLRPAWATY 


13864 


27765 


A 


14032 


386 


1765 


LGDARAPEKMSAIQAAWPSGTECIAKYN 
FHGTAEQDLPFCKGDVLTIVAVTKDPNW 
YKAKNKVGREG I I PANYVQ KREG VKAGT 
KLSLMPWFHGKITREQAERLLYPPETGL 
FLVRESTNYPGDYTLCVSCDGKVEHYRI 
MYHASKLSIDEEVYFENLMQLVEHYTSD 
ADGLCTRLIKPKVMEGTVAAQDEFYRSG 
WALNMKELKLLQTIGKGEFGDVMLGDYR 
GNKVAVKCI KNDATAQAFIAE ASVMTQL 
RHSNLVQLLG VI VEE KGGLY I VTEYMAK 
GSLVDYLRSRGRSVLGGDCLLKFSLDVC 
EAMEYLEGNNFVHRDLAARNVLVSEDNV 
AKVS DFGLTKEAS S TQDTGKLPVKWTAP 
EALREKKFSTKSDVWS FGILLWE I YS FG 
RVP YPR I PLKDWPRVEKGYKMDAPDGC 
PPAVYEVMKNCWHLDAAMRPS FLQLREQ 
LEHIKTHELHL 


13865 


27766 


A 


14033 


619 


414 


EQARCLEQHACTS PRQPRPQLQCSELKG 
HSLGMEHQGHVPCLHPCS PCPP VKAS PA 
CWSSNPAPLCPH 


13866 


27767 


A 


14034 


2 


614 


LESRPDGRPSSTHPASPSAFSAPGKPHP 
P EAKMS S KRAKAKTTKKRPQRATSNVFA 
MFDQSQIQEFKEAFNMIDQNRDGFIDKE 
DliHDMLASLGKNPTDEYLEGMMSEAPGP 
INFTMFLTMFGEKLNGTDPEDVIRNAFA 
CFDEEASGFIKEDHLRELLTTMGDRFTD 
EE VDEMYREAP I DKKGNFNYVEFTRILK 
HGAKDKDD 


13867 


27768 


A 


14035 


1 


254 


AVEFGPAGPGSLGRAAAMIIPVRCFTCG 
KIVGNKWEAYLGLLQAEYTEGDALDALG 
LKRYCCRRMLLAHVDLIEKLLNYAPLEK 


13868 


27769 


A 


14036 


494 


148 


FRFLSDCGVFAEGHIELQVESGVPLGFS 
TMAEDMETKIKNYKTAPFDSRFPNQNQT 
RNC WQNYLDFHRCQKAMTAKGGD ISVCE 
WYQRVYQSLCPTSWVTDWDEQRAEGTFP 
GKI 


13869 


27770 


A 


14037 


130 


3969 


IMGDVKNFLYAWCGKRKMTPSYEIRAVG 
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SEQID 
NO: of 
nucleotide 


SEQID 
NO: of 
peptide 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 

26 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NKNRQKFMCEVQVEGYNYTGMGNSTNKK 
DAQSNAARDFVNYLVRINEIKSEEVPAF 
GVASPPPLTDTPDTTANAEGDLPTTMGG 
PL.P PH1ALKAENNS E VGASGYGVPGPTW 
DRGANLKDYYSRKEEQEVQATLESEEVD 
LNAGLHGNWTLENAKARLNQYFQKEKIQ 
GEYKYTQVGPDHNRS FI AEMT IYIKQLG 
RRIFAREHGSNKKLAAQSCALSLVRQLY 
HLGWEAYSGLTKKKEGETVEPYKVNLS 
QDLEHQLQNIIQELNL.EILPPPEDPSVP 
VALNIGKLAQFEPSQRQNQVGWPWSPP 
QSNWNPWTSSNIDEGPLAFATPEQISMD 
LKNELMYQLEQDHDliQAILQERELLPVK 
KFESEILEAISQNSWIIRGATGCGKTT 
QVPQFI LDD FI QNDRAAECN I WTQPRR 
ISAVSVAERVAFERGEEPGKSCGYSVRF 
ESVLPRPHASIMFCTVGVLLRKLSAGIR 
GI SHVI VDE IHERD I NTS FLL VVLRDW 
QAYPEVRIVFMSATIDTSMFCEYFFNCP 
SLKLWRTYPVQEYFLEDCIQMTHFVPPP 
KDKKKKDKDDDGGEDDDANCNLICGDEY 
GPETRLSMSQLNEKETPFELIEALiLKYI 
ETLNVPGAVLVFLPGWNLIYTMQKHLEM 
NPHFGSHRYQIliPLHSQIPREEQRECVFD 
PVPVGVTKVILSTNIAETSITINDWYV 
IDSCKQKVKLFTAHNNMTNYSTVWASKT 
NLEQR KG RAGRS TAG FCFHLCS RAR FE R 
LETHMTPEMFRTPLHE I ALS I KLLRLGG 
IGQ FLAKAI EP P PLDAVI EAEHTLRELD 
ALDANDELTPLGRILAKLPIEPRFGKMM 
I MGC I FYVGDA I CT I AAATCFPE P FVNE 
GKQLGYIHRNFAGNRFSDHVALLSVFQA 
WDDARMGGEEAE I RFCEHKRLNMATLRM 
TWEAKVQLjKEILINSGFPEDCLLTQVFT 
NTGPDNNLDWISLLAFGVYPNVCYHKE 
KRKILTTEGRNALIHKSSVNCPFSSQDM 
NYPSPFFVFGEKIRTRAISAKGMTLVPP 
LQLLLFAS KKVQS DGQ I VLVDD W I KLQ I 
SHEAAAC I TGLRAAMEALWE VTKQPAI 
ISQLDPVNERMLNMIRQISEPSAAGINL 
MIGSTRYGDGPRPPKMARYDNGSGYRRG 
GSSYSGGGYGGGYSSGGYGSGGYGGSAT 
PSGRI CAGVGGGYRGVSRGGFRGNSGGD 
YRGPSGGYRGSGGFQRGGGRGAYGTGYL 
D I EE E VAA I KLG YVS S VCRQ 


13870 


27771 


A 


14038 


431 


542 


EGITPGWARWLTPVI PGLWEEEAGGSPG 
REIDIGLAN 


13871 


27772 


A 


14040 


238 


362 


RRCTFLSFRMLAPRVYSWGKRAFSPSV 
CVRAHGKCDYSYS 














LLARPHGKGSFQVGRQQHQAVRVGSAHA 
S S AG YVC VS AFP FVFLR FCVGGGPG I SR 
VYALFYGECNPTREWAVSSELS PS FQEQ 
NKMNKVEQKSQESVSFKDVTVGFTQEEW 
QHLDPSQRALYRDVMLENYSNLVSVGYC 
VHKPEVIFRLQQGEEPWKQEEEFPSQSF 
PEVWTADHLKERSQENQS KHLWEWF IN 
NEMLTKEQGDVIGIPFNVDVSSFPSRKM | 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first • 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 

ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Metliionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Impossible 
nucleotide insertion 














FCQCDS CGMSFNTVSELVI S KIN YLGKK 
SDE FNACG KLLLN I KHDETHTRE KNEVL 
KNRNTLSHRENTLQHEKIQTLDHNFEYS 
ICQETLLEKAVFNTRKRENAEENNCDYN 
EFGRT FCDSSS LL FHQ ISPSRDNHYE FS 
DCEKFLCVKSTLS KPHGVSMKHYDCGES 
GNNFRRKLCLSHLQKGDKGEKHFECNEC 
GKAFWEKSHLTRHQRVHTGQKPFQCNEC 
EKAFWDKSNLTKHQRSHTGEKPFECNEC 
GKAFSHKSALTLHQRTHTGEKPYQCNAC 
GETFYQKSDLTKHQRTHTGQKPYECYEC 
GKS FCMNSHLTVHQRTHTGEKPFECLEC 
GKSFCQKSHLTQHQRTHIGDKPYECNAC 
GKTFYHKSVLTRHQIIHTGLKPYECYEC 
G KT FCLKS DLT I HQRTHTGE KP FACP E C 
GKFFSHKSTLSQHYRTHTGEKPYECHEC 
GKIFYNKSYLTKHNRTHTGEKPYECNEC 
GKTFCQKSQLTQHQRIHIGEKPYECNEC 
GKAFCHKSALIVHQRTHTQEKPYKCNEC 
GKSFCVKSGLI FHERKHTGEKPYECNEC 
GKFFRHKS SLTVHHRAHTGEKS CQCNEC 
GKIFYRKSELAQHQRSHTGEKPYECNTC 
RKTFSQKSNLIVHQRRHIGENLMNEMDI 
RNFQPQVS LHNASEYSHCGBSPDDILNV 
Q 


13873 


27774 


A 


14043 


187 


422 


ltipklhsaqepagpvspppppsptfqi 
ggrsraeprnqycpqvapvpalrgcl.pl 
spgepphdtssprhltcggivf 


13874 


27775 


A 


14044 


340 


442 


NKFLSWVRLLMPVI PTFWEAKVGGWLEA 
RSLRMQ 


13875 


27776 


A 


14045 


3 


1240 


LVEGAAGQGVSDGARLRKCGTRS FPGSE 
EVLSSMARGSAFPAAALWLWSILLCLLA 
LRAEAGPPQEESLYLWIDAHQARVLIGF 
EED I L I VS EGKMAP FTHDFRKAQQRM PA 
ISVNIHSMNFTWQAAGQAEYFYEFLSLR 
SLDKG I MAD PTVNVPLLGTVPHKAS WQ 
VGFP CLGKQDGVAAFE VD VI VMNS EGNT 
ILQTPQNAI FFKTCQQAECPGGCRNGGF 
CNERRICECPDGFHGPHCEKALCTPRCM 
NGGLCVTPGFCICPPGFYGVNCDKANCS 
TTCFNGGTCFYPGKCICPPGLEGEQCEI 
SKCPQPCRNGGKCIGKSKCKCSKGYQGD 
LCSKP VCE PGCGAHGTCHEPNKCQ CQEG 
WHGRHCNKRYEASL IHALRPAGAQLRQH 
TPSLKKAEERREPPESNYIW 


13876 


27777 


A 


14046 


18 


274 


YSQEVLCE MNRS G WS N I KRG W AW WQLT 
TTIPALWEAEVGGLLEPRSLRLASPQPP 
QHEDYSMSYCAQPRFSLCIRGFIFYSLW 
F 


13877 


27778 


A 


14047 


177 


441 


GGSHYAWPNITFIFTKLIAQFFCFFEMQ 
SCSVARLECRGAISAHCNLHLPGSSNSP 
VMAQI LKNQRKSNFDEDATLLGLS YTLL 
GEI 


13878 


27779 


A 


14048 


136 


319 


DVFLLLFHSNYFLFFETGSHSVSQAGVK 
WCDLS S LQP P PLGS SDP PTSAS RVPALF 
FFLEF 


13879 


27780 


A 


14049 


388 


128 


ELKLPSYKGQSPQLSLRRYFADLIAIVS 
NRFTLCPSARHLAVYLLDL.FMDRYDISI 
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SEQ1D 


SEQ ID 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 




C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucte- 


E=Glutamic Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 


location 


otide 


G=GIycine, H=Histidine, 1— Isoleucine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutainine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QQLHLVALSCLLIoASKYESDLHDWKFJuM 
FI 


33880 


27781 


A 


14050 


262 


3 


VIYNYISVRPFPNIYHTFYTLSLFLYLS ' 
THTHTHTHTHTHTQRETGHVCVCIYTYM 
YMCMYAKWWYKFTFLISGKARIHTHLHP 
CI 


13881 


27782 


A 


14051 


149 


423 


YPHLCAFLLLPSISSFRAGVFFFSYLSL 
TAYPLTW KPPE P PATPQPTPNP PPGHPA 
TRG PKS AL.LDHS YACAS APP LGTEGTAP 
SPSFALI 


13882 


27783 


A 


14052 


2 


1440 


FVAVTAMAAPCLLRQGRAGALKTMLQEA 
QVFRGLiASTVSLSAESGKSEKGQPQNSK 
KQSPPKNWEPKERGKLLATQTAAELSK 
NLSSPSSYPPAVNKGRKVASPSPSGSVL 
FTDEGVPKFLSRKTLVEFPQKVLSPFRK 
QGSDSEARQVGRKVTSPSSSSSSSSSDS 
ES DDEADVS EVTPR WSKGRGGLRKPEA 
SHSFENRAFRVTVSAKEKTLLQKPHVDI 
TDPEKPHQP KKKG S P AK P S E GRENAR P K 
TTMPRSQVDEE FLKQSLKE KQLQKTFRL 
NEIDKESQKPFEVKGPLPVHTKSGLSAP 
PKGS PAPAVLAEEARAEGQLQAS PPGAA 
EGHLEKPVPEPQRKAAPPLPRKETSGTQ 
GIEGHLKGGQAIVEDQIPPSNLETVPVE 
NNHGFHEKTAALKLEAEGEAMEDAAAPG 
NDRGGTQE P AP VPAE P FDNTTYKNLQHH 
D YSTYT FLDLNLELS KFRMPQ PS SGRES 
PRH 


13883 


27784 


A 


14053 


177 


1253 


EKKRTLPVRSVTGELQGKSLSDLAAGTM 
DSEKKRFTEEATKYFRERVSPVHLQILL 
TNNEAWKRFVTAAELPRDEADALYEALK 
KLRTYAAIEDEYVQQKDEQFREWFLKEF 
PQVKR KI QE S I E KLRALANG I E E VHRG C 
TISNWSSSTGAASGIMSLAGLVLAPFT 
AGTSLALTAAGVGLGAASAVTG I TTS I V 
EHSYTSSAEAEASRLTATS IDRLKVFKE 
VMRDITPNLLSLLNNYYEATQTIGSEIR 
AI RQARARARLPVTTWRISAGSGGQAER 
T I AGTTRAVS RGAR I LSATTS G I FLALD 
WNLVYESKHLHEGAKSASAEELRRQAQ 
ELEENLMELTQIYQRLNPCHTH 


13884 


27785 


A 


14054 


1585 


2099 


I CVKTFPPUVLQVRMAAAEHHHSSGLPY 
WPYLTAETLKNRMGHQPPPPTQQHSITD 
NSLSLKTPSERLLYPLPPSAPPSADDNL 
KTPPECLLTPLPPSALPSADDNLKTPAE 
CLLYPLPPSADDNLKTPPECLFTPLPPS 
APPSVDDNLKTPPECVCSLPFHPQRMI I 
SRN 


13885 


27786 


A 


14055 


2 


2865 


ALPDGGASVASDRAEGRPAKPSKTAARE 
KTEGAVAAVGGGPSS FRCCYGCCHEARL 
GRTSLPRGVIMLTEASLSIWGWGSLGIV 
LFLITFGPFVI FYLTFYILCFVGGGLW 
TLLFGKTWSEKYLEQCEHSFLPPTSPGV 
PKCLEEMKREARTIKIDRRLTGANI IDE 
PLQQVIQFSLRDYVQYWYYTLSDDESFL 
LE I RQTLQNALIQFATRSKE IDWQPYFT 
TR I VDD FG THLR VFR KAQQ K I TE KDDQ V 
KGTAEDLVDTFFEVEVEMEKEVCRDLVC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TS P KDEEGFLRDLCE VLLYLLLPPGDFQ 
NKIMRYFVREILARGILLPLINQLSDPD 
yiNQYVIWMIRDSNCNYEAFMNIIKLSD 
NIGELEAVRDKAAEELQYLRSLDTAGDD 
INTIKNQINSLLFVKKVCDSRIQRLQSG 
KE I NTVKLAANFGKLCTV PLDS I LVDNV 
ALQFFMDYMQQTGGQAHLFFWMTVEGYR 
VTAQQQLEVLLSRQRDGKHQTNQTKGLL 
RAAAVG I YEQYLS E KAS PR VTVDDYLVA 
KLADTLNHEDPTPEIFDDIQRKVYELML 
RDERFYPSFRQNALYVRMLAELDMLKDP 
SFRGSDDGDGESFNGSPTGSINLSLDDL 
SNVSSDDSVQLHAYISDTVYADYDPYAV 
AGVCNDHGKTYALYAITVHRRNLNSEEM 
WKT YRR YSDFHDFHMR I TEQ FESLS S I L 
KLPEKKTFNKMDRDFLEKRKKDLNAYLQ 
LLLAPEMMKAS PAuAHYVYDFLENKAYS 
KGKGDFARKMDTFVNPLRNSMRNVSNAV 
KSLPDSLAEGMTKMSDNMGKMSERLGQD 
IKQSFFKVPPLIPKTDSDPEHRRVSAQL 
DDNVDDNI PLRVMLLLMDEVFDLKERNQ 
WLRRNI KNLLQQL I RATYGDTINRKI VD 
HVDWMTS PEQVADS VKR FRDAFWPNG I L 
AEAVPCRDKS IRMRTRVAGKTKLLAIMP 
GE 


13886 


27787 


A 


14057 


311 


150 


FLCFKYRRGFATLHSGLKLLGSSDPPSS 
TSQ I PG I TGTSHCVQPTHLFFLALS 


13887 


27788 


A 


14058 


53 


211 


RQHITCLDIFFIHMNYKYKYINIYLSIY 
LSIYLSI YLIYLSISHSSYITERA 


13888 


27789 


A 


14059 


1 


1642 


RDGRKMATATIALQVNGQQGGGSEPAAA 
AAWAAGDKWKPPQGTDS I KMENGQSTA 
AKL.GL P PL.TPEQQE ALQKAKKYAMEQS I 
KSVLVKQTIAHQQQQLTNLQMAAQRQRA 
LAIMCRVYVGSI YYELGEDTIRQAFAPF 
GPIKSIDMSWDSVTMKHKGFAFVEYEVP 
EAAQIiALEQMNSVMLGGRNI KVGRPSNI 
GQAQP I I DQLAEE ARAFNR I YVASVHQD 
LSDDDI KSVFEAFGKI KSCTliARDPTTG 
KHKGYGFIEYEKAQSSQDAVSSMNLFDL 
GGQYLRVGKAVTPPMPLLTPATPGGLPP 
AAAVAAAAATAK I TAQEAVAG AAVLGTL 
GTPGLVS PALTLAQPLGTLPQAVMAAQA 
PGV I TGVTPAR P P I P VTI PSVGWNP I L 
ASPPTLGLLEPKKEKEEEELFPESERPE 
MLSEQEHMSISGSSARHMVMQKLLRKQE 
STVMVLRNMVDPKDIDDDLEGEVTEECG 
KFGAVNRVI I YQEKQGEEEDAEI IVKI F 
VEFS I AS ETHKAI QALNGRWFAGRKWA 
EVYDQERFDNSDLSA 


13889 


27790 


A 


14060 


1238 


1474 


VLAliQEGRP WRRE PAS IDACRLNFQRLR 
RGKFSNVLFPGLAQEALYSGGYHLKFAD 
ELiMGGNLKKSTADASGSRGHQL 


13890 


27791 


A 


34061 


266 


400 


GQWARPVIPALWEAKAGGSLEPSMLRPA 
GQHIKTPSVX.ITSKKT 


13891 


27792 


A 


14062 


2178 


1881 


VIjQAPS I LLDAPRTDGGDMGRAMVARLG 
LWLLLLALLLPTQI YS S ETTTGTS SNS S 
QSTSNTGLAPNPTNATTKAAGGALQSTA 
SLFWSLSIiLHLYS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 

nucle- 

locntion 
correspon 

Inst amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
C=Clycine, H=Histidine, l^Isoleucine 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Scrine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13892 


27793 


A 


14063 


261 


1 


NSVLTLALLFSLHVHLPPPPSGVTDTAG 
ALSTGGPPSPTSVTGRGPAHSHASQLPP 
APGE FAPLNES CRGWAGE AFLERFPDAW 
VDP 


13893 


27794 


A 


14064 


2384 


1115 


QHFSRRGLCWEQRSSVTSSWTSGAWSP " 

PCP PSNAS CNTLHTRDWAS PDPGGQGS L 

GESPGPAPPGQLHTL.DTDLHSLAQIGGK 

SPVAGVGNGGSLWPRESPGTANGHSPEH 

TPPGPGPPGPCPTKRRLLPAGEAPDVSS 

EEEGPAPRRRRGSLGHPTAANSSDAKAT 

PFWSHLLPGPKEPVL.DPTDCGPMGRRLK 

GARRLKIjSPLRSIiRKGPGLLSPPRASPV 

PTPAVSRTLLGNFEESLLRGRFAPSGHI 

EGFTAEIGASGSYCPQHVTLPVTVTFFD 

VSEQNAPAPFLGI VDLNPLGRKGYSVPK 

VGTVQVTLFNPNQTWKMFLVTFDFSDM 

PAAHMTFLRHRLFLVPVGEEGNANPTHR 

LLCVLLHLRFRSSRSGRLSIiHGDIRLIjF 

SRRSLELDTGLPYELQAVTEAPHNPRYS 

PLP 


13894 


27795 


A 


14065 


232 


416 


GWAVQLGIGTEGKEGTSSSERQREGRGD 
GNRKTGRRREGRRSTKTDAPSPYPAHRP 
R3KLI 


13895 


27796 


A 


14067 


238 


45 


IQTCGSHSFCIPSIWEAKAGGLLEPRSL 
SPACPTQKDPISHLGSELPQSFRLLIRL 
VPLFLSAC 


13896 


27797 


A 


14068 


3 


1705 


SCESKATPWRAVSASQELQHPQGGQRSP 
IiPGDLTPARPNPAYPLTVECQRSCSRPV 
PAPLPPHTHPPGSSCFYSSFSFITKATA 
PGAQRRAVTQAERGRMGFLGTGTWILVL 
VLPIQAFPKPGGSQDKSLHNRELSAERP 
LNEQIAEAEEDKIKKTYPPENKPGQSNY 
SFVDNLNLLKAITEKEKIEKERQSIRSS 
PLDNKLNVEDVDSTKNRKLIDDYDSTKS 
GLDHKFQDDPDGLHQLDGTPLTAEDIVK 
KIAARIYEENDRAVFDKIVSKLL.NLGLI 
TESQAHTLEDEVAEVX.QKLISKEANNYE 
EDPNKPTSWTENQAGKIPEKVTPMAAIQ 
DGLAKGENDETVSNTLTLTNGLERRTKT 
YSEDNFEELQYFPNFYALiLKSIDSEKEA 
KE KETL I T I M KTL I D F VKMMVK YGT I S P 
EEGVSYLENLDEMIALQTKNKLEKNATD 
N I S KL F P APS E KS HE E TDS TKE EAAKME 
KEYGSLKDSTKDDNSNPGGKTDEPKGKT 
EAYLEAIRKNIEWLKKHDKKGNKEDYDL 
SKMRDFINKQADAYVEKGILDKEEAEAI 
KRIYSSL 


13897 


27798 


A 


14069 


1 




ntsvaiqtieyysafkrketlthvplwm 
NTjKDTMLREISQSQKDTV 


13898 


27799 


A 


14071 


212 


104 


HTNHQCQTVIjPAGQALATPOGLAPSPVF 
PQSCLRMV 


13899 


27800 


A 


14072 


358 


112 


fpl.tvmlnvflwwqnlkhfcw fi qeqlm 
fffffflrtgshywqaglellgstnlp 
asvsrvagttgihnctqfnplhtlhl. 


13900 


27801 


A 


14073 


3 


151 


YMGFHHVGYAGLELLTSSDLPPWPPKCW 
DYRHEPSCLAMFFYFALIAPE 


13901 


27802 | A 


14074 


8 


1493 


VTIHHLFVGQAVRADTLKKKSPSCLIiCI 
VFIPVPRIDLTNIDQVAVIFKHHFPVGR 



1394 



ISDOCID: <WO 0164835A2_I I _> 



WO 01/64835 



PCT/USO 1/0492 7 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/5 J 5,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GDAVLKTWAPAQCLCSRMGPAWLWLLGT 
GILASVHCQPLLAHGDKSLQGPQPPRHQ 
LS EP APAYHR I TPT I TNFALRLYKELAA 
DAPGNI FPS PVS I S TTLALLS LGAQANT 
SALILEGLGFNLTETPEADIHQGFRSLL 
HTLALPSPKLELKVGNSLFLDKRLKPRQ 
HYbDS I KELYGAFAFS ANFTDS VTTGRQ 
INDYIiRRQTYGQWDCLPEFSQDTFMVXi 
ANYIFFKAKWKHPFSRYQTQKQESFFVD 
ERTSLQVPMMHQKEMHRFLYDQDLACTV 
LQIEYRGNALALLVLPDPGKMKQVEAAL 
QPQTLRKWGQLLLPSLLDLHLPRFSISG 
TYNLEDILPQIGLTNILNLEADFSGVTG 
QLNKTISKVSHKAMVDMSEKGTEAGAAS 
GLLSQ P PS LNTMS DPHAHFNR PFLLLL W 
E VTTQS LL FLGKWNP VAG 
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WHAT IS CLAIMED IS: 

1 . An isolated polynucleotide comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 1-13901 , a mature protein coding portion of SEQ ID NO: 1-13901 , 
an active domain of SEQ ID NO: 1-13901 , and complementary sequences thereof. 

2. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said 
polynucleotide hybridizes to the polynucleotide of claim 1 under stringent hybridization 
conditions. 

3 . An isolated polynucleotide encoding a polypeptide with biological activity, wherein said 
polynucleotide has greater than about 90% sequence identity with the polynucleotide of claim 1 . 

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 

5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the 
complementary sequences. 

6. A vector comprising the polynucleotide of claim 1 . 

7. An expression vector comprising the polynucleotide of claim 1 . 

8. A host cell genetically engineered to comprise the polynucleotide of claim 1 . 

9. A host cell genetically engineered to comprise the polynucleotide of claim 1 operatively 
associated with a regulator)' sequence that modulates expression of the polynucleotide in the host 
cell. 

1 0. An isolated polypeptide, wherein the polypeptide is selected from the group consisting of: 

(a) a polypeptide encoded by any one of the polynucleotides of claim 1 ; and 

(b) a polypeptide encoded by a polynucleotide hybridizing under stringent 
conditions with any one of SEQ ID NOS: 1-13901. 

11. A composition comprising the polypeptide of claim 1 0 and a carrier. 

12. An antibody directed against the polypeptide of claim 10. 
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13. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample with a compound diat binds to and forms a complex 
with the polynucleotide of claim 1 for a period sufficient to form the complex; and 
5 b) detecting the complex, so that if a complex is detected, the polynucleotide 

of claim 1 is detected. 

14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions with 
1 0 nucleic acid primers that anneal to the polynucleotide of claim 1 under such conditions; 

b) amplifying a product comprising at least a portion of the polynucleotide of 

claim 1 ; and 

c) detecting said product and thereby the polynucleotide of claim 1 in the 

1 5 sample. 

15. The method of claim 14, wherein the polynucleotide is an RNA molecule and the method 
further comprises reverse transcribing an annealed RNA molecule into a cDNA polynucleotide. 

20 16. A method for detecting the polypeptide of claim 10 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a complex 
with the polypeptide under conditions and for a period sufficient to form the complex; and 

b) detecting formation of the complex, so that if a complex formation is 
detected, the polypeptide of claim 1 0 is detected. 

25 

1 7. A method for identifying a compound that binds to the polypeptide of claim 1 0, 
comprising: 

a) contacting the compound with the polypeptide of claim 1 0 under 
conditions sufficient to form a polypeptide/compound complex; and 
30 b) detecting the complex, so that if the polypeptide/compound complex is 

detected, a compound that binds to the polypeptide of claim 10 is identified. 

18. A method for identifying a compound that binds to the polypeptide of claim 1 0, 
comprising: 
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a) contacting the compound with the polypeptide of claim 10, in a cell, under 
conditions sufficient to form a polypeptide/compound complex, wherein the complex drives 
expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence expression, so 
5 that if the polypeptide/compound complex is detected, a compound that binds to the polypeptide 

of claim 1 0 is identified. 

19. A method of producing the polypeptide of claim 10, comprising, 

a) culturing a host cell comprising a polynucleotide sequence selected from 
1 0 the group consisting of a polynucleotide sequence of SEQ ID NO: 1-13901 , a mature protein 
coding portion of SEQ ID NO: 1-13901 , an active domain of SEQ ID NO: 1-13901 , 
complementary sequences thereof and a polynucleotide sequence hybridizing under stringent 
conditions to SEQ ID NO: 1-13901 , under conditions sufficient to express the polypeptide in 
said cell; and 

15 b) isolating the polypeptide from the cell culture or cells of step (a). 

20. An isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NOS: 13902-27802, the mature protein portion 
thereof, or the active domain thereof. 

20 

21 . The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide 
array. 

22. A collection of polynucleotides, wherein the collection comprises the 
25 sequence information of at least one of SEQ ID NOS: 1 -1 3901 . 

23. The collection of claim 22, wherein the collection is provided on a nucleic 
acid array. 

30 24. The collection of claim 23, wherein the array detects full-matches to any 

one of the polynucleotides in the collection. 

25. The collection of claim 23, wherein the array detects mismatches to any 

one of the polynucleotides in the collection. 
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26. The collection of claim 22, wherein the collection is provided in a 
computer-readable format. 

27. A method of treatment comprising administering to a mammalian subject in need 

thereof a therapeutic amount of a composition comprising a polypeptide of claim 
10 or 20 and a pharmaceutically acceptable carrier. 

28. A method of treatment comprising administering to a mammalian subject in need 

thereof a therapeutic amount of a composition comprising an antibody that 
specifically binds to a polypeptide of claim 10 or 20 and a pharmaceutically 
acceptable carrier. 
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